JP3046221B2

JP3046221B2 - Information retrieval device

Info

Publication number: JP3046221B2
Application number: JP7146898A
Authority: JP
Inventors: 隆正小山; 哲也木下; 裕文篠木; 忠一菊池
Original assignee: Panasonic Corp; Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Corp; Panasonic Holdings Corp
Priority date: 1995-05-23
Filing date: 1995-05-23
Publication date: 2000-05-29
Anticipated expiration: 2015-05-29
Also published as: JPH08314948A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、データベースに登録さ
れたデータの中から、入力されたキーワードを含むデー
タを検索する情報検索装置に関し、特に、テーブルなど
のデータベースから該当するデータを的確に検索できる
ようにしたものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an information retrieval apparatus for retrieving data including an input keyword from data registered in a database, and more particularly, to accurately retrieve data from a database such as a table. It is made possible.

【０００２】[0002]

【従来の技術】近年、情報検索では、利用者が思いつい
た言葉をキーワードとして入力すると、データベースに
登録されたデータを全文検索して、そのキーワードの文
字列を含むデータを自動検索するシステムが注目を集め
ている。特に、リレーショナルデータベース管理システ
ム（ＲＤＢＭＳ）などではその製品化が図られており、
多方面での応用が広まっている。2. Description of the Related Art In recent years, in information retrieval, when a user enters a keyword as a keyword, a full-text search is performed on data registered in a database, and a system for automatically retrieving data including a character string of the keyword has attracted attention. Are gathering. In particular, products such as relational database management systems (RDBMS) are being commercialized.
Applications in various fields are spreading.

【０００３】リレーショナルデータベースでは、例え
ば、図２２に示すように、レコード番号１のレコードが
神奈川県の横浜に関する記録であり、レコード番号２の
レコードが神奈川県の鎌倉に関する記録である場合に、
レコードの属性（県名、都市名、‥）を表す複数の項目
をデータ構造として設け、レコード番号に対応させて各
属性のデータをそれぞれの項目に格納して、テーブルを
構成する。In the relational database, for example, as shown in FIG. 22, when the record of record number 1 is a record relating to Yokohama in Kanagawa prefecture and the record of record number 2 is a record relating to Kamakura in Kanagawa prefecture,
A plurality of items representing record attributes (prefecture name, city name, ‥) are provided as a data structure, and data of each attribute is stored in each item in correspondence with the record number to form a table.

【０００４】このテーブルのデータベースから、例えば
「鎌倉」のレコードを検索する場合には、図２３に示す
ように、先ず、操作者は、情報検索装置に検索対象とな
るテーブルの項目（項目２）と検索対象となるキーワー
ド「鎌倉」とを入力する。これを受けて、情報検索装置
は、図２２のテーブルの全てのレコードに関するデータ
を読み込み、検索対象に指定された項目２を照合の対象
に設定して、キーワードと項目２の各データとを照合す
る。この照合によって一致したデータを検出したとき
は、そのデータに対応するレコード番号（番号２）を出
力する。When a record of, for example, "Kamakura" is searched from the database of this table, first, as shown in FIG. 23, the operator inputs an item (item 2) of the table to be searched into the information search device. And the keyword "Kamakura" to be searched. In response to this, the information search device reads data relating to all records in the table of FIG. 22, sets item 2 designated as a search target as a target to be checked, and checks a keyword against each data of item 2. I do. When matching data is detected by this collation, a record number (number 2) corresponding to the data is output.

【０００５】こうして従来の情報検索装置では、操作者
の入力した項目とキーワードとを手掛かりに、目的のレ
コードを検索する。[0005] As described above, the conventional information retrieval apparatus retrieves a target record by using the items and keywords input by the operator as clues.

【０００６】[0006]

【発明が解決しようとする課題】しかし、従来の情報検
索では、例えば、一方のレコードが「横浜」の「中華
街」に関する記録、他方が「横浜」の「山下公園」に関
する記録である場合に、これらのレコードを区別するた
め、新たに「名所」に関する項目を増設して、「中華
街」及び「山下公園」のデータを各レコードに対応させ
て格納する必要があり、そのため項目数が非常に多くな
るという問題点を有している。また、この場合、横浜の
中華街に関するレコードを検索するときは、「横浜」を
検索する項目と「中華街」を検索する項目との複数項目
を指定する必要があり、操作者はどの項目に必要とする
データが格納されているかを予め知っていなければなら
ず、検索の操作が煩雑になり、検索処理に時間が掛かる
などの問題点がある。However, in the conventional information retrieval, for example, if one record is a record relating to "Chinatown" of "Yokohama" and the other is a record relating to "Yamashita Park" of "Yokohama" In order to distinguish these records, it is necessary to add a new item related to “Famous Places” and store data of “Chinatown” and “Yamashita Park” in correspondence with each record. There is a problem that it increases. Also, in this case, when searching for a record related to Chinatown in Yokohama, it is necessary to specify a plurality of items, one for searching for "Yokohama" and the other for searching for "Chinatown". Since it is necessary to know in advance whether necessary data is stored, there is a problem that a search operation becomes complicated and a search process takes time.

【０００７】また、こうした場合、１項目に複数のデー
タを列記する方法もある（先の例では「横浜中華街」
「横浜山下公園」とする）が、このときには、「横浜」
「中華街」「山下公園」はそれぞれ独立のデータとして
存在する訳ではないから、例えば、「横浜」という単独
のデータを有するレコードを検索したいときに、これら
のレコードは検索漏れとなってしまう。このとき、「横
浜」という語を含むデータを全て拾い上げる検索を行な
うならば、これらのレコードも検索漏れを免れることが
できるが、しかし、こうした検索では、検索者の意図に
反したデータ（検索ゴミ）を大量に検出してしまう可能
性がある。In such a case, there is also a method of listing a plurality of data in one item (in the above example, "Yokohama Chinatown").
"Yokohama Yamashita Park"), but at this time, "Yokohama"
Since "Chinatown" and "Yamashita Park" do not exist as independent data, for example, when it is desired to search for a record having a single data of "Yokohama", these records are omitted from the search. At this time, if a search is performed that picks up all data that includes the word “Yokohama”, these records can be prevented from being omitted. However, in such a search, data contrary to the searcher's intention (search garbage) ) May be detected in large quantities.

【０００８】また、このように１項目に複数のデータを
記述した場合には、データの文字列が長くなるので、入
力されたキーワードをデータの先頭から順次照合するの
に時間が掛かる。[0008] When a plurality of data are described in one item as described above, a character string of the data becomes long, and it takes time to sequentially collate input keywords from the beginning of the data.

【０００９】また、このキーワードによる検索では、例
えば「横○×」や「○×横」のように、前方に「横」の
付くデータを検索（以下「前方一致検索」と呼ぶ）した
り、後方に「横」の付くデータを検索（以下「後方一致
検索」）したり、前方と後方の両者に指定された文字が
付くデータを検索（以下「完全一致検索」と呼ぶ）する
ことができるが、これらの検索時間も長くなる。[0009] In the search using this keyword, for example, data having a "horizontal" in front of it, such as "horizontal XX" or "XX horizontal", is searched (hereinafter, referred to as "front match search"). It is possible to search for data with "horizontal" at the back (hereinafter "rearrangement search") or to search for data with characters specified both forward and rearward (hereinafter referred to as "perfect match search"). However, these search times also become longer.

【００１０】本発明は、こうした従来の問題点を解決す
るものであり、テーブルの構成要素として設けた一つの
項目に複数のデータをそれぞれ独立に登録することがで
き、また、項目内のこれらのデータと入力されたキーワ
ードとの照合を高速で行なうことができる情報検索装置
を提供することを目的としている。The present invention solves such a conventional problem. A plurality of data can be independently registered in one item provided as a constituent element of a table. It is an object of the present invention to provide an information retrieval apparatus capable of performing high-speed collation between data and input keywords.

【００１１】[0011]

【課題を解決するための手段】そこで、本発明では、複
数の項目を持つレコードデータの項目内におけるデータ
から、入力されたキーワードを含むデータを検索する情
報検索装置において、この項目内のデータが複数のデー
タから成るとき、これらのデータを繋げた列の先頭、終
端及びデータの間に区切りコードを付与する区切り付与
手段と、この列の隣接する２文字の組から成る文字連鎖
を作成する文字連鎖パターン生成手段と、文字連鎖を構
成する各文字のデータにおける出現回数、レコード番
号、データの番号から成る文字連鎖情報を作成し、文字
連鎖と対応させてメモリに格納して検索ファイルを作成
する検索ファイル作成手段と、入力されたキーワードに
区切りコードを付与するキーワード区切り付与手段と、
区切りコードが付与されたキーワードからキーワードの
文字連鎖を作成するキーワード文字連鎖生成手段と、検
索ファイルからキーワードの文字連鎖に対応する文字連
鎖情報を読出し、それらを照合してキーワードを含むデ
ータを検索する照合手段とを設けている。Therefore, according to the present invention, there is provided an information retrieval apparatus for retrieving data including an input keyword from data in an item of record data having a plurality of items. When composed of a plurality of data, a delimiter providing means for providing a delimiter code between the beginning and end of the column connecting these data and the data, and a character for creating a character chain consisting of a set of two adjacent characters in this column A chain pattern generating means and character chain information including the number of appearances in each character data constituting a character chain, a record number, and a data number are created, stored in a memory in association with the character chain, and a search file is created. Search file creating means, keyword delimiter assigning means for assigning a delimiter code to the input keyword,
Keyword character chain generation means for generating a character chain of keywords from keywords to which a delimiter code has been assigned; and character chain information corresponding to the character chains of the keywords are read from a search file and collated to search for data including the keyword. Collation means is provided.

【００１２】また、照合手段に対して、区切りコードを
含むキーワードの文字連鎖に対応する文字連鎖情報の照
合を優先させる優先照合手段を設けている。Further, the matching means is provided with priority matching means for giving priority to matching of character chain information corresponding to a character chain of a keyword including a delimiter code.

【００１３】また、文字連鎖パターン生成手段は、区切
りコードを文字連鎖の中に含めて文字連鎖を作成する。[0013] The character chain pattern generating means creates a character chain by including the delimiter code in the character chain.

【００１４】また、このとき検索ファイル作成手段は、
文字連鎖情報における区切りコードの出現回数として、
特定の番号を設定する。At this time, the search file creating means includes:
As the number of occurrences of the delimiter code in the character chain information,
Set a specific number.

【００１５】また、文字連鎖パターン生成手段が、区切
りコードを含めずに文字連鎖を作成し、検索ファイル作
成手段が、文字連鎖の内、区切りコードと隣接する文字
連鎖の文字連鎖情報をそれ以外の文字連鎖情報と区別し
て作成する。Further, the character chain pattern generating means creates a character chain without including the delimiter code, and the search file creating means converts the character chain information of the character chain adjacent to the delimiter code out of the character chains. Created separately from character chain information.

【００１６】また、検索ファイル作成手段が、区切りコ
ードと隣接する文字連鎖の文字連鎖情報をメモリに群別
に格納する。[0016] Further, the search file creating means stores the character chain information of the character chain adjacent to the delimiter code in the memory for each group.

【００１７】また、区切り付与手段が、先の列に階層を
表す複数種類の区切りコードを付与し、キーワード区切
り付与手段が、入力されたキーワードに階層を表す区切
りコードを付与する。Further, the delimiter assigning means assigns a plurality of types of delimiter codes indicating the hierarchy to the preceding column, and the keyword delimiter assigning means assigns the input keyword with the delimiter code indicating the hierarchy.

【００１８】さらに、区切り付与手段が、先の列に順序
を表す複数種類の区切りコードを付与し、キーワード区
切り付与手段が、入力されたキーワードにこの順序を表
す区切りコードを付与する。Further, the delimiter assigning means assigns a plurality of types of delimiter codes indicating the order to the preceding column, and the keyword delimiter assigning means assigns a delimiter code indicating the order to the input keyword.

【００１９】[0019]

【作用】そのため、キーワードの文字列に相当する連続
する文字が同一データ内に存在するかどうかを、キーワ
ードの文字連鎖に対応させて読出した文字連鎖情報を照
合することにより、調べることができる。このデータが
１項目内に複数存在する場合でも、各データを区切りコ
ードで区切っているため、各データはそれぞれ独立のデ
ータとして作用する。従って、レコードデータの構成要
素である１項目に対して、複数のデータを繰り返し登録
することができ、入力されたキーワードを含む文字列や
パターンなどのデータを項目内で繰り返し検索すること
が可能である。Therefore, it can be checked whether or not continuous characters corresponding to the character string of the keyword exist in the same data by comparing the character chain information read out in correspondence with the character chain of the keyword. Even when a plurality of such data exist in one item, each data acts as an independent data because each data is separated by a delimiter code. Therefore, a plurality of data can be repeatedly registered for one item which is a component of the record data, and data such as a character string and a pattern including the input keyword can be repeatedly searched in the item. is there.

【００２０】また、キーワードの前または後ろに区切り
コードを付与し、この区切りコードを含む文字連鎖に対
応する文字連鎖情報を優先させて照合することにより、
前方一致検索や後方一致検索を高速で行なうことができ
る。Further, a delimiter code is added before or after the keyword, and character chain information corresponding to a character chain including the delimiter code is preferentially collated.
It is possible to perform a prefix search and a tail match search at high speed.

【００２１】また、区切りコードを文字連鎖の中に含め
ずに、この区切りコードに隣接する文字連鎖をデータの
先端または末端として扱う場合には、文字連鎖情報の数
を減らすことができ、照合回数を削減することができ
る。また、この区切りコードに隣接する文字連鎖の文字
連鎖情報をメモリに群別に格納することにより、メモリ
からの読出しが容易になる。また、前方一致検索、後方
一致検索、完全一致検索に応じて、文字連鎖情報の読出
し順序や照合順序を合わせることができ、これらの検索
を高速で行なうことができる。When a character chain adjacent to the delimiter code is treated as the head or end of data without including the delimiter code in the character chain, the number of character chain information can be reduced, and the number of times of collation can be reduced. Can be reduced. Also, by storing the character chain information of the character chain adjacent to the delimiter code in the memory for each group, reading from the memory becomes easy. In addition, the reading order and collation order of the character chain information can be matched in accordance with the forward match search, the backward match search, and the perfect match search, and these searches can be performed at high speed.

【００２２】また、階層構造または順序構造を持つデー
タに対しては、階層または順序を示す区切りコードを付
与して検索ファイルを構成し、入力するキーワードにも
階層または順序を指定する区切りコードを付与すること
により、階層関係や順序関係を持った検索を実現するこ
とができる。Further, for data having a hierarchical structure or an order structure, a delimiter code indicating the hierarchy or order is assigned to form a search file, and a keyword to be input is also assigned a separator code for designating the hierarchy or order. By doing so, a search having a hierarchical relationship or an order relationship can be realized.

【００２３】[0023]

【実施例】（第１実施例）実施例の情報検索装置では、図６に示す用に、レコード
の項目内に「あいうあえ、あいあえ、あとあし」の３個
のデータがある場合に、このデータ中の区切り「、」に
代わって、区切りコード（∧Ｗ）を付加し、「∧Ｗあい
うあえ∧Ｗあいあえ∧Ｗあとあし∧Ｗ」というデータに
変換する。次に、各データごとの文字出現回数を求め
る。この文字出現回数は、それぞれの文字が、同一種別
の文字の何回目に出現した文字であるかを表すもので、
図６（３）に示すようになる。次に、「∧Ｗあいうあえ
∧Ｗあいあえ∧Ｗあとあし∧Ｗ」の先頭より１文字ずつ
移動しながら隣接する２文字の組から成る文字連鎖パタ
ーン、即ち、（∧Ｗ，あ）（あ，い）（い，う）（う，
あ）（あ，え）（え，∧Ｗ）（∧Ｗ，あ）（あ，い）
（い，あ）（あ，え）（え，∧Ｗ）（∧Ｗ，あ）（あ，
と）（と，あ）（あ，し）（し，∧Ｗ）を作成し、この
文字連鎖パターンにおける各文字の文字出現回数と、デ
ータ番号（３個のデータの順番を表す番号）と、レコー
ド番号とを組合せて文字連鎖情報を作成する。Embodiment (First Embodiment) In the information retrieval apparatus of the embodiment, as shown in FIG. 6, when there are three data items "Ah, Ah, Ah," in a record item, In place of the delimiter "," in the data, a delimiter code (@W) is added, and the data is converted into data "@W," AW, AW, AW, W ". Next, the number of character appearances for each data is obtained. This character appearance count indicates the number of times each character appears in a character of the same type.
The result is as shown in FIG. Next, a character chain pattern consisting of a set of two adjacent characters while moving one character at a time from the beginning of "@W aa ae @ W aa ai @ W aa aa @ W", that is, (@W, a) (a , I) (i, u) (u,
A) (A, E) (E, @W) (@W, A) (A, I)
(A, A) (A, E) (E, @W) (@W, A) (A,
(), (A, a) (a, s) (s, ∧W) are created, the number of appearances of each character in the character chain pattern, a data number (a number indicating the order of three data), Create character chain information by combining with the record number.

【００２４】例えば、２番目の文字連鎖パターンである
（あ，い）の文字連鎖情報は、「あ」の文字出現回数が
１、「い」の文字出現回数が１、データ番号が１、そし
てレコード番号が１であるから（１，１，１，１）とな
る。また、最後の文字連鎖パターン（し，∧Ｗ）の文字
連鎖情報は（１，ｍ，３，１）となる。ここでは∧Ｗの
文字出現回数をｍで表している。こうして求めた各文字
連鎖パターンとその文字連鎖情報とを纏めて検索ファイ
ル（図６（４））を作成する。For example, the character chain information of (a, i) which is the second character chain pattern is such that the number of appearances of the character "a" is 1, the number of occurrences of the character "i" is 1, the data number is 1, and Since the record number is 1, it becomes (1,1,1,1). Further, the character chain information of the last character chain pattern (（, ∧W) is (1, m, 3, 1). Here, the number of appearances of the character ∧W is represented by m. A search file (FIG. 6 (4)) is created by combining the character chain patterns thus obtained and the character chain information.

【００２５】一方、検索時に、図１０に示すように「あ
とあし」というキーワードが入力されたとすると、この
キーワードに区切りコードを付加して「∧Ｗあとあし∧
Ｗ」を作成し、その文字連鎖パターン（∧Ｗ，あ）
（あ，と）（と，あ）（あ，し）（し，∧Ｗ）に対応す
る文字連鎖情報を検索ファイルから求める（図１０
（４））。On the other hand, if a keyword "rear toe" is inputted as shown in FIG. 10 at the time of retrieval, a delimiter code is added to the keyword and "@W rear toe"
W ”, and its character chain pattern ($ W, a)
Character chain information corresponding to (A, A) (A, A) (A, S) (S, $ W) is obtained from the search file (FIG. 10).
(4)).

【００２６】次いで、この文字連鎖情報から、連続する
「あとあし」という文字列がデータ１〜３の中に存在す
るかどうかを次のように調べる。もしも、その文字列が
データ１〜３の中に存在するならば、前記の各文字連鎖
パターンにおける文字連鎖情報のデータ番号及びレコー
ド番号は一致している筈であるし、また、各文字連鎖パ
ターンにおける文字連鎖情報の２番目の要素の値は、そ
の次の文字連鎖パターンにおける文字連鎖情報の１番目
の要素の値と一致している筈である。検索ファイルから
求めた文字連鎖情報の中に、こうした条件を満たす文字
連鎖情報を検出することができたときには、そのレコー
ド番号を検出結果として出力する。Next, it is checked from the character chain information as to whether or not a continuous character string "rear foot" exists in the data 1 to 3 as follows. If the character string exists in the data 1 to 3, the data number and the record number of the character chain information in each of the character chain patterns should match, and each character chain pattern , The value of the second element of the character chain information in the next character chain pattern should match the value of the first element of the character chain information in the next character chain pattern. When character chain information satisfying such conditions can be detected in the character chain information obtained from the search file, the record number is output as a detection result.

【００２７】こうした動作を行なう情報検索装置は、図
１に示すように、複数の項目から構成されるレコードデ
ータが格納されたレコードデータ記憶部２と、格納され
たレコードデータを読み取るレコード読取部１と、読み
取ったレコードデータから各項目毎のデータを切り出
し、データの順番をカウントし、さらにデータの先頭、
間、終端に区切りコードを付加する区切り付加部３と、
区切り付加部３からデータの順番を取得するとともに、
レコードデータ記憶部２からレコード番号を取得する番
号付加部９と、区切りコードの付加された各項目のデー
タにおける文字出現回数を計算するとともに文字連鎖パ
ターンを作成する文字連鎖パターン生成部７と、区切り
コードを含む文字連鎖パターンを作成する区切り判定部
８と、文字連鎖パターン生成部７及び区切り判定部８か
ら得た文字連鎖パターンと前記番号付加部９から得たデ
ータ番号及びレコード番号とから文字連鎖情報を作成す
る検索ファイル生成部10と、文字連鎖パターン、文字連
鎖情報、区切りコード用の文字種別の各データを記憶す
る検索ファイル記憶部11と、入力されたキーワード及び
部分一致、前方一致、後方一致、完全一致の各検索種別
を読み取るキーワード読取部12と、検索種別に応じてキ
ーワードの前後に区切りコードを付加するキーワード区
切り付加部13と、区切りコードの付加されたキーワード
から文字連鎖パターンを生成するキーワード文字連鎖生
成部17と、キーワードから生成された文字連鎖パターン
に対応する文字連鎖情報を検索ファイル11から引き出し
て照合処理を行ない、キーワードと一致するレコードデ
ータを抽出する照合部18と、検索結果を表示する検索結
果表示部19とを備えるとともに、さらに、区切りコード
を第２要素に含む文字連鎖の文字連鎖情報を検索ファイ
ル記憶部11の優先的に引き出し得る位置に格納するよう
に指示する優先指示部６と、区切りコードを第２要素に
含む文字連鎖の文字連鎖情報を優先的に照合するように
指示する優先照合部16と、１項目に含まれる各データの
末端２文字の文字種組合せに対して区切りコードを付加
する群別区切り付加部４と、入力されたキーワードの末
端２文字の文字種組合せに対して区切りコードを付加す
るキーワード群別区切り付加部14と、レコードデータの
項目に含まれる複数のデータが階層構造を持つときに各
データの末端に階層構造に応じた区切りコードを付加す
る階層・順序別区切り付加部５と、入力されたキーワー
ドに階層を示す区切りコードを付加するキーワード階層
別区切り付加部15とを備えている。As shown in FIG. 1, the information retrieval apparatus which performs such an operation includes a record data storage section 2 storing record data composed of a plurality of items, and a record reading section 1 for reading the stored record data. And the data of each item is cut out from the read record data, the order of the data is counted,
A delimiter adding unit 3 for adding a delimiter code at the end of the
While acquiring the order of the data from the delimiter addition unit 3,
A number adding unit 9 for acquiring a record number from the record data storage unit 2; a character chain pattern generating unit 7 for calculating the number of character appearances in the data of each item to which a delimiter code is added and creating a character chain pattern; A segment determining unit 8 for creating a character chain pattern including a code; a character chain based on the character chain pattern obtained from the character chain pattern generating unit 7 and the segment determining unit 8 and the data number and record number obtained from the number adding unit 9 A search file generation unit 10 for creating information; a search file storage unit 11 for storing character chain patterns, character chain information, and data of character types for delimiter codes; input keywords and partial matches, forward matches, backward matches A keyword reading unit 12 that reads each of the search types, that is, a match and an exact match, is separated before and after the keyword according to the search type. Keyword addition unit 13 for adding a code, a keyword character chain generation unit 17 for generating a character chain pattern from keywords to which a delimiter code has been added, and searching for character chain information corresponding to the character chain pattern generated from the keyword It includes a collating unit 18 for extracting collating data from the file 11 to perform collation processing and extracting record data matching the keyword, and a search result display unit 19 for displaying search results, and further includes a character including a delimiter code in a second element. The priority instructing unit 6 for instructing the retrieval file storage unit 11 to store the character chain information in the retrieval file storage unit 11 at a position where it can be preferentially extracted, and the character chain information of the character chain including the delimiter code in the second element are preferentially compared. The priority collating unit 16 instructs the user to perform a delimiter command for the last two characters of each data included in one item. Group addition unit 4 for adding a keyword, a keyword group separation addition unit 14 for adding a separation code to the character type combination of the last two characters of the input keyword, and a plurality of items included in the record data item. When the data has a hierarchical structure, a hierarchical / sequential delimiter 5 for adding a delimiter code according to the hierarchical structure to the end of each data, and a keyword hierarchical delimiter for adding a delimiter code indicating the hierarchy to the input keyword. An additional unit 15 is provided.

【００２８】第１実施例では、この情報検索装置におけ
る検索ファイル生成の動作及びキーワード検索の動作に
ついて詳しく説明する。この動作では、図１の構成の
内、優先指示部６、優先照合部16、群別区切り付加部
４、キーワード群別区切り付加部14、階層・順序別区切
り付加部５及びキーワード階層別区切り付加部15は使用
しない。In the first embodiment, the operation of generating a search file and the operation of searching a keyword in this information search apparatus will be described in detail. In this operation, the priority instructing unit 6, the priority collating unit 16, the group-specific delimiter adding unit 4, the keyword-group-specific delimiter adding unit 14, the hierarchy / order-specific delimiter adding unit 5, and the keyword-level delimiter adding in the configuration of FIG. Part 15 is not used.

【００２９】コードデータ記憶部２の構造は、図３に示
すように、レコード番号、項目などから構成されてお
り、各項目の中には複数のデータが格納されている（da
ta1，data2，‥）。本実施例では項目２にデータが存在
するものとする。このコードデータ記憶部２に格納され
たレコードデータから検索ファイルを生成するまでの手
順は図２の処理フローに従って行なわれる。As shown in FIG. 3, the structure of the code data storage unit 2 is composed of record numbers, items, and the like, and each item stores a plurality of data (da).
ta1, data2, ‥). In this embodiment, it is assumed that data exists in item 2. The procedure up to generating the search file from the record data stored in the code data storage unit 2 is performed according to the processing flow of FIG.

【００３０】ステップ１：レコード読取部１は、レコー
ドデータ記憶部２からレコードデータを読取り、その最
初のレコードデータを処理すべきレコードデータとして
設定（Ｌ＝１）し、ステップ２：このレコードデータの処理すべき項目（こ
こではｋ番目の項目）を指定する。この指定された項目
には、図６（１）で示すように、「あいうあえ、あいあ
え、あとあし」の３個のデータが区切り「、」で分けら
れているものとする。Step 1: The record reading section 1 reads the record data from the record data storage section 2 and sets the first record data as record data to be processed (L = 1). The item to be processed (the k-th item here) is specified. As shown in FIG. 6 (1), it is assumed that three data items "Ah, Ah, Ah, and Ahashi" are separated by "," in the designated item.

【００３１】ステップ３：区切り付加部３は、このデー
タの先頭及び末尾に区切りコード（∧Ｗ）を追加し、ま
た、データ間の区切り「、」を区切りコードに変える。
図６（２）に区切りコードが付加されたデータを示す。Step 3: The delimiter adding unit 3 adds a delimiter code (@W) to the beginning and end of the data, and changes a delimiter "," between data into a delimiter code.
FIG. 6B shows data to which a delimiter code is added.

【００３２】ステップ４：文字連鎖パターン生成部７
は、項目ｋの処理すべきデータのデータ番号ｈを１に設
定し、ステップ５：このデータ番号１のデータにおける区切り
コードの文字種別に対する出現回数を特定の番号（ここ
ではｍ）にセットするとともに、ステップ６：このデータの各文字について、文字種別ご
との出現回数を算出する。図６（３）には、第１〜第３
データの全てについて文字種別ごとの出現回数を算出し
たときに得られる、データ別の文字出現回数と区切りコ
ード（∧Ｗ）との関係を示している。Step 4: Character chain pattern generator 7
Sets the data number h of the data to be processed of the item k to 1; Step 5: sets the number of appearances for the character type of the delimiter code in the data of the data number 1 to a specific number (here, m) Step 6: For each character of this data, the number of appearances for each character type is calculated. FIG. 6 (3) shows first to third
Obtained when calculating the number of occurrences of each character type for all data, shows the relationship between data different appearance count and delimiter code (∧W).

【００３３】ステップ７：文字連鎖パターン生成部７
は、次いで、データ番号１のデータにおける文字連鎖
（Ｆ（ａ），Ｆ（ｂ））を作成する。このとき区切り判
定部８は、区切りコードを含む文字連鎖パターンを作成
する。Step 7: Character chain pattern generator 7
Creates a character chain (F (a), F (b)) in the data of data number 1. At this time, the delimiter determination unit 8 creates a character chain pattern including the delimiter code.

【００３４】ステップ８：検索ファイル生成部10は、文
字連鎖パターン生成部７及び区切り判定部８から文字連
鎖パターンとその出現回数とを得るとともに、番号付加
部９が取得しているデータ番号（ｈ）とレコード番号
（Ｌ）とを得て、文字連鎖情報を作成し、検索ファイル
記憶部11に格納する。この文字連鎖情報は、図４に示す
ように、連続する２文字の文字出現回数、データ番号、
レコード番号から構成される。Step 8: The search file generator 10 obtains the character chain pattern and the number of appearances thereof from the character chain pattern generator 7 and the delimiter judging unit 8, and obtains the data number (h ) And the record number (L) to create character chain information and store it in the search file storage unit 11. This character chain information includes, as shown in FIG. 4, the number of appearances of two consecutive characters, a data number,
Consists of a record number.

【００３５】ステップ９：こうしてデータ番号ｈのデー
タの全ての文字連鎖パターンにおける文字連鎖情報を作
成する。Step 9: In this way, character chain information in all character chain patterns of the data with the data number h is created.

【００３６】検索ファイル記憶部11は、図５に示すよう
に、文字連鎖情報の種別、文字連鎖パターン、文字連鎖
情報のリストから構成され、文字連鎖情報の種別は、文
字連鎖パターンに区切りコードが含まれているかどう
か、含まれている場合に、それが連続する２文字の前方
であるか後方であるかを区別しており、ステップ10：Ｆ（ａ）＝ｍであるとき、この文字連鎖の
文字連鎖情報を前方区切りコード付き文字連鎖情報とし
て格納し、ステップ11：Ｆ（ｂ）＝ｍであるとき、この文字連鎖の
文字連鎖情報を後方区切りコード付き文字連鎖情報とし
て格納する。As shown in FIG. 5, the search file storage unit 11 is composed of a type of character chain information, a character chain pattern, and a list of character chain information. It distinguishes whether it is included and if it is, whether it is the front or back of two consecutive characters. Step 10: When F (a) = m, this character chain The character chain information is stored as character chain information with forward delimiter code. Step 11: When F (b) = m, the character chain information of this character chain is stored as character chain information with rear delimiter code.

【００３７】ステップ12：次いで、処理の対象を項目ｋ
における次のデータに移すため、データ番号ｈをインク
リメントし、ステップ13：そのｈの値が項目ｋ内の最大のデータ番号
に達するまでステップ５以降の手順を繰り返す。Step 12: Next, the processing target is item k
In order to move to the next data in, the data number h is incremented. Step 13: The procedure from step 5 is repeated until the value of h reaches the maximum data number in the item k.

【００３８】図６（４）には、図６（１）のデータから
生成され、検索ファイル記憶部11に格納された検索ファ
イルを示している。FIG. 6 (4) shows a search file generated from the data of FIG. 6 (1) and stored in the search file storage unit 11.

【００３９】ステップ13において、ｈの値が項目ｋ内の
最大のデータ番号を超えたときは、ステップ14：処理の対象を次のレコードデータに移すた
め、レコード番号Ｌをインクリメントし、ステップ15：全てのレコードデータについての処理が終
了するまで、ステップ２〜ステップ15の手順を繰り返
す。In step 13, when the value of h exceeds the maximum data number in the item k, step 14: increment the record number L to move the processing target to the next record data, and step 15: Steps 2 to 15 are repeated until the processing for all record data is completed.

【００４０】次に、図７の処理フローにより、検索ファ
イルを用いて入力されたキーワードと一致するデータを
検索する処理手順について説明する。このときのキーワ
ードの読込から検索までのデータの流れを図８に示して
いる。キーワード読取部12で読取るデータとして図８
（１）のパターン１「あと？」とパターン２「あいあ
え」とを例として挙げる。「あと？」は、「あとあし」
等のように冒頭に「あと」を含む文字列を調べることを
意味し、「あいあえ」の様に「？」がない場合はキーワ
ードと完全に一致する文字列を調べることを意味する。Next, a description will be given of a processing procedure for searching for data that matches an input keyword using a search file, with reference to the processing flow of FIG. FIG. 8 shows the flow of data from the reading of the keyword to the search at this time. FIG. 8 shows data read by the keyword reading unit 12.
The pattern 1 “After?” And the pattern 2 “Aie” in (1) will be described as examples. "After?"
This means that a character string that includes "after" at the beginning is checked, and if there is no "?" Like "Oh," it means that a character string that completely matches the keyword is checked.

【００４１】ステップ１：キーワード読取部12は、入力
されたパターン１やパターン２のキーワードを読み込
み、ステップ２：操作者から入力された項目ｋを指定項目と
して設定する。Step 1: The keyword reading section 12 reads the input keywords of pattern 1 and pattern 2, and sets the item k input by the operator as a designated item.

【００４２】ステップ３：キーワード区切り付加部13
は、キーワードの文字列における「？」の有無に応じて
検索種別（部分一致、前方一致、後方一致、完全一致）
を判定し、ステップ４：検索種別に応じてキーワードに区切りコー
ド（∧Ｗ）を付与する。図８（２）に示すように、キー
ワードの検索種別が完全一致の場合は、文字列の両端に
区切りコードを付加し、キーワード文字列に「？」が有
るときは、「？」の無い方の端にだけ区切りコードを付
加する。Step 3: Keyword delimiter 13
Is the search type (partial match, start match, end match, perfect match) according to the presence or absence of "?" In the keyword string
Step 4: Delimiter code (@W) is assigned to the keyword according to the search type. As shown in FIG. 8 (2), when the search type of the keyword is an exact match, a delimiter code is added to both ends of the character string, and when the keyword character string has "?" Add a delimiter code only to the end of.

【００４３】ステップ５：キーワード文字連鎖生成部17
は、区切りコードが付与されたキーワードに対して文字
連鎖パターン（ａ，ｂ）を作成する。図８（３）にパタ
ーン１及びパターン２に対応する文字連鎖パターンを示
している。Step 5: Keyword character chain generation unit 17
Creates a character chain pattern (a, b) for a keyword to which a delimiter code has been assigned. FIG. 8 (3) shows a character chain pattern corresponding to pattern 1 and pattern 2.

【００４４】ステップ６：次に照合部１８は、この文字
連鎖パターンに対応する文字連鎖情報を検索ファイル記
憶部11から引き出して照合を行なう。文字種ａが区切り
コードであるときは、ステップ７：検索ファイルの前方区切りコード付き文字
連鎖情報の中から該当する文字連鎖パターンの文字連鎖
情報を全て引き出し、ステップ８：文字種ｂが区切りコードであるときは、ステップ９：検索ファイルの後方区切りコード付き文字
連鎖情報の中から該当する文字連鎖パターンの文字連鎖
情報を全て引き出し、ステップ10：それ以外のときは、検索ファイルの区切り
コードなし文字連鎖情報の中から該当する文字連鎖パタ
ーンの文字連鎖情報を全て引き出す。Step 6: Next, the collating unit 18 extracts character chain information corresponding to the character chain pattern from the search file storage unit 11 and performs collation. If the character type a is a delimiter code, Step 7: extract all the character chain information of the corresponding character chain pattern from the character chain information with the forward delimiter code of the search file. Step 8: If the character type b is the delimiter code Step 9: Extract all the character chain information of the corresponding character chain pattern from the character chain information with the rear delimiter code of the search file. Step 10: Otherwise, extract the character chain information without the delimiter code of the search file. All the character chain information of the corresponding character chain pattern is extracted from the inside.

【００４５】ステップ11：次いで、キーワードの文字連
鎖の組の数を調べて、ステップ12：１組であるときは、ステップ13：引き出した文字連鎖情報を全て該当する文
字連鎖情報とする。Step 11: Next, the number of character chain sets of the keyword is checked, and if the number of sets is one, Step 12: all the extracted character chain information is used as the corresponding character chain information.

【００４６】ステップ14：２組以上であるときは、ステップ15：連続する２組の文字連鎖における文字連鎖
情報の間で、１番目の文字連鎖の第２文字の出現回数と
２番目の文字連鎖の第１文字の出現回数が一致し、且つ
データ番号が一致するものが在るかどうかを照合し、そ
れらが一致するまで次々と文字連鎖情報を取り替えて照
合を繰り返す。Step 14: If there are two or more sets, Step 15: The number of appearances of the second character of the first character chain and the second character chain occurrences of the first character matches the, and verifies whether or not the data number matches exists, repeat the collation replaced one after another character chain information until they match.

【００４７】ステップ16：これらの照合で該当する文字
連鎖情報が存在するときは、ステップ17：照合部18は、該当するレコード番号と件数
とをレコードデータ記憶部２から抽出し、検索結果表示
部19に表示する。Step 16: When the corresponding character chain information exists in the collation, Step 17: the collation unit 18 extracts the corresponding record number and the number of records from the record data storage unit 2, and displays the retrieval result display unit. Display at 19.

【００４８】このときの照合時のデータの流れを図８
（４）に示している。図８（４）の＜パターン１＞の場
合には、照合１回目で第１文字連鎖と第２文字連鎖との
照合（(A)と(B)とに対する照合）が一致しないので、第
２文字連鎖情報はそのままにして次の第１文字連鎖情報
との照合を行なう（照合２回目）。以下繰り返し、照合
３回目で一致するので、照合部18は、一致した文字連鎖
情報から、項目番号２、データ番号３、レコード番号１
を取得し、その内容をレコードデータ記憶部２から求め
る。もし、全ての文字連鎖パターンに対して一致しない
場合には照合を終了する。FIG. 8 shows the data flow at the time of collation at this time.
This is shown in (4). In the case of <pattern 1> in FIG. 8D, since the collation between the first character chain and the second character chain (collation with (A) and (B)) does not match in the first collation, the second collation is performed. The collation with the next first character chain information is performed without changing the character chain information (second collation). Since the matching is repeated the third time, the matching unit 18 determines the item number 2, the data number 3, the record number 1 from the matched character chain information.
Is obtained from the record data storage unit 2. If all the character chain patterns do not match, the collation ends.

【００４９】また、図８（４）の＜パターン２＞は、５
個の文字連鎖パターンに対する照合の例であり、順番に
(A)と(B)、(C)と(B)、(D)と(B)、(E)と(B)で照合を行な
う過程を示している。また、図８（５）に最終的な検索
結果を示している。<Pattern 2> in FIG.
This is an example of matching against a character chain pattern.
(A) and (B), (C) and (B), (D) and (B), and (E) and (B) are shown the process of collating. FIG. 8 (5) shows the final search result.

【００５０】このように実施例の情報検索装置では、デ
ータの先頭、間、終端に区切りコードを付与し、区切り
コードを含めて文字連鎖パターンを形成し、各文字連鎖
パターンの文字連鎖情報を集めて検索ファイルを形成し
ている。そして、入力されたキーワードを区切りコード
を含む文字連鎖に分解し、その文字連鎖に対応する文字
連鎖情報を照合することによって、キーワードの文字列
を含むデータを検出している。As described above, in the information retrieval apparatus of the embodiment, a delimiter code is added to the beginning, interval, and end of data, a character chain pattern is formed including the delimiter code, and character chain information of each character chain pattern is collected. To form a search file. Then, the input keyword is decomposed into a character chain including a delimiter code, and character chain information corresponding to the character chain is collated to detect data including a character string of the keyword.

【００５１】このように項目内のデータに区切りコード
を付与しているため、１項目内に複数のデータを持つ場
合でも、それらのデータを、キーワード検索においてそ
れぞれ独立したデータとして作用させることができる。
従って、検索における支障を生じることなく、１項目内
に複数のデータを追加することが可能である。As described above, since the delimiter code is assigned to the data in the item, even when a plurality of data are included in one item, the data can be made to act as independent data in the keyword search. .
Therefore, it is possible to add a plurality of data in one item without causing any trouble in the search.

【００５２】また、検索時には、キーワードの文字連鎖
に対応する文字連鎖情報だけを検索ファイルから読み出
して照合を行なうので、キーワードの文字列をデータ文
字列の全てと照合する場合に比べて、検索時間を短くす
ることができる。Also, at the time of retrieval, only the character chain information corresponding to the character chain of the keyword is read from the search file and collation is performed. Therefore, compared with the case where the keyword character string is collated with all the data character strings, the retrieval time is longer. Can be shortened.

【００５３】なお、実施例では、データが文字パターン
である場合について説明したが、○△など予めパターン
化されたデータであれば、何に対しても適用可能であ
る。また、区切りコードには、パターン化されたデータ
以外のものであれば、どのようなものでも用いることが
できる。Although the embodiment has been described with reference to the case where the data is a character pattern, the present invention can be applied to any pre-patterned data such as △. As the delimiter code, any code other than the patterned data can be used.

【００５４】（第２実施例）第２実施例では、後方一致
検索などの便を図るため、入力されたキーワードとの照
合を、キーワード文字列の後ろから行なう場合について
説明する。(Second Embodiment) In the second embodiment, a case will be described in which collation with an input keyword is performed from the end of a keyword character string in order to facilitate a search such as backward matching.

【００５５】この動作は、図１の情報検索装置におい
て、第１実施例で使用した各部の他に、区切りコードを
第２文字に含む文字連鎖の文字連鎖情報を優先的な読出
し位置に格納するよう指示する優先指示部６と、区切り
コードを第２文字に含む文字連鎖の文字連鎖情報を優先
的に照合するように指示する優先照合部16とを使って実
行される。In this operation, in the information retrieval apparatus of FIG. 1, in addition to the parts used in the first embodiment, character chain information of a character chain including a delimiter code in the second character is stored in a preferential reading position. This is performed using a priority instructing unit 6 for instructing to perform collation and a priority collating unit 16 for instructing to preferentially collate the character chain information of the character chain including the delimiter code in the second character.

【００５６】この装置において、検索ファイルの生成は
第１実施例と同じように行なわれる。但し、区切りコー
ドを第２文字に含む文字連鎖の文字連鎖情報について
は、優先指示部６の指示を受けた検索ファイル生成部10
が、検索ファイル記憶部11の優先的に読出しが可能な位
置に格納する。In this apparatus, the generation of the search file is performed in the same manner as in the first embodiment. However, regarding the character chain information of the character chain including the delimiter code in the second character, the search file generation unit 10 receiving the instruction of the priority instruction unit 6
Is stored in the search file storage unit 11 at a position where reading is possible preferentially.

【００５７】一方、入力されたキーワードとの一致検索
は、図９のフロー図に従って行なわれる。このときのキ
ーワードの読込みから検索までのデータの流れを図１０
に示している。入力キーワードは、図１０（１）の「あ
とあし」であるとする。On the other hand, a search for a match with the input keyword is performed according to the flowchart of FIG. FIG. 10 shows the flow of data from reading of a keyword to searching at this time.
Is shown in It is assumed that the input keyword is “rear toe” in FIG.

【００５８】ステップ１〜ステップ５：キーワード読取
部12は、入力キーワードを読込み、操作者から入力され
た項目ｋを指定項目として設定する。キーワード区切り
付加部13は、キーワードの文字列に応じて検索種別を判
定し、検索種別に応じてキーワードに区切りコード（∧
Ｗ）を付与する。キーワード文字連鎖生成部17は、区切
りコードが付与されたキーワードの文字連鎖パターン
（ａ，ｂ）を作成する。ここまでの手順は第１実施例と
全く同じである。区切りコードが付加されたキーワード
を図１０（２）に、また、作成された文字連鎖パターン
を図１０（３）に示している。Steps 1 to 5: The keyword reading section 12 reads an input keyword and sets an item k input by the operator as a designated item. The keyword delimiter 13 determines a search type according to the character string of the keyword, and separates the keyword with a delimiter code (∧
W). The keyword character chain generation unit 17 generates a character chain pattern (a, b) of a keyword to which a delimiter code has been added. The procedure up to this point is exactly the same as in the first embodiment. FIG. 10 (2) shows the keyword to which the delimiter code is added, and FIG. 10 (3) shows the created character chain pattern.

【００５９】ステップ６：照合部18は、この中の文字種
ｂが区切りコードである文字連鎖パターン（し，∧Ｗ）
の文字連鎖情報を検索ファイル記憶部11から優先して引
き出し、次いで、文字種ａが区切りコードである文字連
鎖パターン（∧Ｗ，あ）の文字連鎖情報を引き出し、ステップ７：次に、その他の文字連鎖パターンの文字連
鎖情報を検索ファイルから引き出す。Step 6: The collating unit 18 determines whether the character type b is a delimiter code in a character chain pattern (し, ∧W)
The character chain information of the character chain pattern (@W, a) in which the character type a is the delimiter code is extracted from the search file storage unit 11 with priority. Step 7: Next, the other character Extract the character chain information of the chain pattern from the search file.

【００６０】ステップ８：次いで、キーワードの文字連
鎖の組の数を調べて、ステップ９：１組であるときは、ステップ10：引き出した文字連鎖情報を全て該当する文
字連鎖情報とする。Step 8: Next, the number of sets of character chains of the keyword is checked. Step 9: If the number of sets is one, Step 10: All the extracted character chain information is set as the corresponding character chain information.

【００６１】ステップ11：２組以上であるときは、ステップ12：連続する２組の文字連鎖における文字連鎖
情報の間で出現回数とデータ番号とが一致するかどうか
を照合し、それらが一致するまで次々と文字連鎖情報を
取り替えて照合を繰り返す。[0061] Step 11: the two or more sets at a time, step 12: collates whether the number of occurrences and data number between the character chain information in the two sets of character chain consecutive matches, they match The character chain information is replaced one after another until the collation is repeated.

【００６２】ステップ13：これらの照合で一致する文字
連鎖情報が存在するときは、ステップ14：照合部18は、該当するレコード番号と件数
とをレコードデータ記憶部２から抽出し、検索結果表示
部19に表示する。Step 13: When there is character chain information that matches in the collation, Step 14: The collation unit 18 extracts the corresponding record number and the number of records from the record data storage unit 2, and displays the retrieval result display unit Display at 19.

【００６３】このステップ８〜ステップ14の手順は、第
１実施例におけるステップ11〜ステップ17（図７）の手
順と同じである。但し、ステップ12における照合では、
図１０（４）に示すように、第２文字が区切りコードで
ある文字連鎖パターン（し，∧Ｗ）を優先し、先ず、こ
の（し，∧Ｗ）の文字連鎖情報と、その前の（あ，し）
の文字連鎖情報とを照合して、（し，∧Ｗ）の第１文字
の出現回数が（あ，し）の第２文字の出現回数と一致
し、且つデータ番号が一致するかどうかを調べる。これ
らが一致したときは、次に（あ，し）と（と，あ）との
文字連鎖情報の照合を行ない、順次、キーワードの後ろ
から照合を進めて行く。The procedure of steps 8 to 14 is the same as the procedure of steps 11 to 17 (FIG. 7) in the first embodiment. However, in the collation in step 12,
As shown in FIG. 10 (4), the character chain pattern (し, ∧W) in which the second character is a delimiter code is prioritized. First, the character chain information of (（, ∧W) and the preceding ( Ah)
To check whether the number of occurrences of the first character of (し, ∧W) matches the number of occurrences of the second character of (あ, ∧), and whether the data numbers match. . When they match, the character chain information of (A, S) and (A, A) is collated, and the collation is sequentially advanced after the keyword.

【００６４】このように第２実施例の情報検索装置で
は、照合に際して、第２文字に区切りコードを持つ文字
連鎖を優先させることができるため、後方一致検索を高
速で行なうことが可能となる。As described above, in the information retrieving apparatus of the second embodiment, the character chain having the delimiter code as the second character can be prioritized in the collation, so that the backward matching retrieval can be performed at high speed.

【００６５】なお、照合に際して、第１文字に区切りコ
ードを持つ文字連鎖を優先させることも可能であり、こ
の場合には、前方一致検索を高速で行なうことができ
る。In the collation, it is possible to give priority to a character chain having a delimiter code as the first character, and in this case, a forward match search can be performed at high speed.

【００６６】（第３実施例）第３実施例では、区切りコ
ードが付加されたデータの区切りコードの前後に続く２
文字を区切りコード付き文字連鎖として扱うことによ
り、高速での検索を実現する。(Third Embodiment) In the third embodiment, the data following the delimiter code of the data to which the delimiter code is added,
A high-speed search is realized by treating characters as a character chain with a delimiter code.

【００６７】この動作は、図１の情報検索装置におい
て、第１実施例で使用した各部の他に、１項目に含まれ
る各データの両端における２文字の文字種組合せに対し
て区切りコードを付加する群別区切り付加部４と、入力
されたキーワードの両端における２文字の文字種組合せ
に対して区切りコードを付加するキーワード群別区切り
付加部14とを使って実行される。In this operation, in the information retrieval apparatus shown in FIG. 1, in addition to the parts used in the first embodiment, a delimiter code is added to a combination of two character types at both ends of each data included in one item. This is executed using the group-specific delimiter adding unit 4 and the keyword-group-specific delimiter adding unit 14 that adds a delimiter code to a combination of two character types at both ends of the input keyword.

【００６８】この場合の検索ファイルの作成は、図１１
の処理フローに従って行なわれる。In this case, the search file is created as shown in FIG.
Is performed according to the processing flow of

【００６９】ステップ１〜ステップ７：レコードデータ
の処理すべき項目（第ｋ番目）が指定されると、レコー
ド読取部１は、レコードデータ記憶部２からレコードデ
ータを読取り、区切り付加部３は、このデータの先頭及
び末尾に区切りコード（∧Ｗ）を追加し、また、データ
間の区切り「、」を区切りコードに変える。文字連鎖パ
ターン生成部７は、項目ｋの処理すべきデータのデータ
番号ｈを１に設定し、文字種別が区切りコードのときに
その出現回数を特定番号ｍに設定し、このデータのその
他の文字について、文字種別ごとの出現回数を算出し、
次いで、文字連鎖（Ｆ（ａ），Ｆ（ｂ））を作成する。Steps 1 to 7: When the item (k-th) to be processed of the record data is specified, the record reading section 1 reads the record data from the record data storage section 2, and the delimiter section 3 A delimiter code (@W) is added to the beginning and end of this data, and the delimiter "," between data is changed to a delimiter code. The character chain pattern generation unit 7 sets the data number h of the data to be processed of the item k to 1, sets the number of occurrences to the specific number m when the character type is the delimiter code, and sets the other characters of this data to For, calculate the number of appearances for each character type,
Next, a character chain (F (a), F (b)) is created.

【００７０】ここまでの動作は、実質的に第１実施例の
場合と同じであるが、但し、文字連鎖パターン生成部７
は、文字連鎖（Ｆ（ａ），Ｆ（ｂ））の作成に際して、
群別区切り付加部４の指示に基づいて、区切りコードを
群別の区切りコードとして扱い、この区切りコードを文
字種に含めずに文字連鎖（Ｆ（ａ），Ｆ（ｂ））を作成
する。そして、ステップ８：区切り判定部８は、この文字連鎖の第１文
字または第２文字が区切りコードと隣接しているかどう
かを調べ、ステップ９：区切りコードと接していないものと、ステップ10：第１文字が区切りコードと隣接する前方区
切りコード付き文字連鎖と、第２文字が区切りコードと
隣接する後方区切りコード付き文字連鎖とに分別する。The operation so far is substantially the same as that of the first embodiment, except that the character chain pattern generation unit 7
Is used to create character chains (F (a), F (b))
Based on the instruction of the group-specific delimiter adding unit 4, the delimiter code is treated as a group-specific delimiter code, and a character chain (F (a), F (b)) is created without including the delimiter code in the character type. Step 8: The delimiter determination unit 8 checks whether the first character or the second character of the character chain is adjacent to the delimiter code. One character is separated into a character chain with a forward delimiter code adjacent to the delimiter code, and the second character is separated into a character chain with a rear delimiter code adjacent to the delimiter code.

【００７１】ステップ11：検索ファイル生成部10は、こ
れらの分別された文字連鎖パターンの文字連鎖情報を検
索ファイル記憶部11に分別して格納する。この検索ファ
イル記憶部11の構成は図１２に示している。Step 11: The search file generation unit 10 separates and stores the character chain information of these separated character chain patterns in the search file storage unit 11. The configuration of the search file storage unit 11 is shown in FIG.

【００７２】これらの動作を、データ番号が項目ｋ内で
最大に達するまで行ない、また、全てのレコードデータ
を対象にして繰り返す。These operations are performed until the data number reaches the maximum in the item k, and is repeated for all record data.

【００７３】図１３（１）に、レコードデータの文字種
別に応じた出現回数と、区切りコードに接続する群別文
字連鎖パターンとを示しており、また、図１３（２）
に、図１３（１）から作成された検索ファイルを示して
いる。この検索ファイルでは、（あ，い）、（あ，と）
は前方区切りコード付き文字連鎖パターンとして格納さ
れるとともに、その第１文字により「あ」の群として区
画される。同様に（あ，え）（あ，し）は後方区切りコ
ード付き文字連鎖パターンとして格納され、「あ」の群
として区画される。FIG. 13 (1) shows the number of appearances according to the character type of the record data and the group-based character chain pattern connected to the delimiter code.
FIG. 13 shows a search file created from FIG. In this search file, (a, i), (a, and)
Are stored as a character chain pattern with a forward delimiter code, and are partitioned as a group of "A" by the first character. Similarly, (A, E) (A, S) is stored as a character chain pattern with a backward delimiter code, and is partitioned as a group of "A".

【００７４】ここで注目すべきは、（あ，い）、（あ，
と）、（あ，え）（あ，し）が区切りコードなし文字連
鎖情報としては格納されないということである。つま
り、それだけ照合すべき文字連鎖情報が減り、照合を高
速で行なうことが可能になる。It should be noted that (a, i), (a,
), (A, E) (A, S) are not stored as character chain information without a delimiter code. That is, the character chain information to be collated is reduced accordingly, and the collation can be performed at high speed.

【００７５】次に、検索の処理の流れを図１４に示す。
また、検索時のデータの流れを図１５に示す。ここで
は、入力キーワードが「あとあし」であるとする。Next, FIG. 14 shows the flow of the retrieval process.
FIG. 15 shows the flow of data at the time of retrieval. Here, it is assumed that the input keyword is “rear toe”.

【００７６】ステップ１〜３：第１実施例と同様に、キ
ーワード読取部12は、キーワードを読み込み、操作者の
入力操作を受けて、指定項目にセットし、検索種別を指
定する。Steps 1 to 3: As in the first embodiment, the keyword reading section 12 reads a keyword, receives an input operation by the operator, sets the keyword in a specified item, and specifies a search type.

【００７７】ステップ４：キーワード群別区切り付加部
14は、キーワードに群別の区切りを付加し（図１５
（２））、キーワード文字連鎖生成部17は、キーワード
を文字パターンに分割して文字連鎖パターンを作成す
る。このとき、図１５（３）に示すように、区切りコー
ドに接続する文字連鎖（あ，と）（あ，し）と、区切り
コードに接続しない文字連鎖（と，あ）とが作成され
る。Step 4: Keyword Group Delimiter Addition Unit
14 adds a group-based delimiter to the keyword (FIG. 15)
(2)), the keyword character chain generation unit 17 divides the keyword into character patterns to create a character chain pattern. At this time, as shown in FIG. 15C, a character chain (A, A) connected to the delimiter code and a character chain (A, A) not connected to the delimiter code are created.

【００７８】ステップ５：照合部18は、検索種別が前方
一致検索であるときは、ステップ６：検索ファイルから、前方区切りコード付き
の文字連鎖（あ，と）に該当する前方区切りコード付き
文字連鎖情報を優先して読込み、次いで、後方区切りコ
ード付きの文字連鎖（あ，し）に該当する後方区切りコ
ード付き文字連鎖情報を読込む。Step 5: When the search type is a forward match search, Step 6: From the search file, a character chain with a forward delimiter code corresponding to a character sequence with a forward delimiter code (a, a) The information is preferentially read, and then the character chain information with the backward delimiter code corresponding to the character chain with the backward delimiter code is read.

【００７９】ステップ７：また、検索種別が後方一致検
索であるときは、ステップ８：検索ファイルから、後方区切りコード付き
の文字連鎖（あ，し）に該当する後方区切りコード付き
文字連鎖情報を優先して読込み、次いで、前方区切りコ
ード付きの文字連鎖（あ，と）に該当する前方区切りコ
ード付き文字連鎖情報を読込み、ステップ９：次いで、区切りコードなしの文字連鎖
（と，あ）に該当する区切りコードなし文字連鎖情報を
読込み、ステップ10〜ステップ16：第１実施例と同様の手順で照
合を行なう。Step 7: If the search type is a backward match search, Step 8: Priority is given to the character chain information with the backward delimiter code corresponding to the character chain with the backward delimiter code (a, ashi) from the search file. Then, the character chain information with the forward delimiter code corresponding to the character chain with the forward delimiter code (a, a) is read. Step 9: Next, the character chain information without the delimiter code (a, a) is applicable. The character chain information without a delimiter code is read, and Step 10 to Step 16: collation is performed in the same procedure as in the first embodiment.

【００８０】この検索ファイルから、前方区切りコード
付き文字連鎖（あ，と）に該当する文字連鎖情報を読込
む場合には、検索ファイル記憶部11の前方区切りコード
付き文字連鎖情報が格納された領域から、文字種別
「あ」を含む群の中の（あ，と）に該当する情報（１，
１，３，１）を容易に取得することができ、また、後方
区切りコード付き文字連鎖（あ，し）の文字連鎖情報を
読込み場合にも、文字種別「あ」を含む群の中の（あ，
し）に該当する文字連鎖情報を容易に取得することがで
きる。When reading the character chain information corresponding to the character chain with the forward delimiter code (A, and) from this search file, the area of the search file storage unit 11 where the character chain information with the forward delimiter code is stored. From (a, and) in the group containing the character type "a" (1,
1, 3, 1) can be easily obtained, and even when reading the character chain information of the character chain (a, shi) with a backward delimiter code, the character group (a) in the group including the character type "a" is read. Ah,
The character chain information corresponding to (c) can be easily obtained.

【００８１】このように第３実施例の情報検索装置で
は、区切りコードを付加したデータから区切りコードを
含めない状態で文字連鎖パターンを作成し、区切りコー
ドに接続する文字連鎖パターンの文字連鎖情報を、その
区切りコードの接続位置に応じて、前方または後方区切
りコード付き文字連鎖情報として扱っている。また、こ
れらの文字連鎖情報は、文字連鎖パターンの第１文字別
に区分するなどして、群別に検索ファイルに格納してい
る。そのため、検索ファイルにおける文字連鎖情報の総
数が減り、検索時の照合回数を削減することが可能にな
る。また、群別に整理されているため、これらの情報を
検索ファイルから読み出すことが容易になる。また、前
方一致検索、後方一致検索、完全一致検索などの検索種
別に応じて、該当する文字連鎖情報の照合を優先させる
ことができるため、各種検索を高速で行なうことができ
る。As described above, in the information retrieval apparatus of the third embodiment, a character chain pattern is created from the data to which the delimiter code is added without including the delimiter code, and the character chain information of the character chain pattern connected to the delimiter code is obtained. Depending on the connection position of the delimiter code, it is handled as character chain information with forward or backward delimiter code. The character chain information is stored in the search file for each group, for example, by classifying the character chain pattern into first characters. Therefore, the total number of character chain information in the search file is reduced, and the number of times of collation at the time of search can be reduced. In addition, since the information is arranged in groups, it is easy to read such information from the search file. In addition, since matching of the corresponding character chain information can be prioritized in accordance with a search type such as a forward match search, a backward match search, and a perfect match search, various searches can be performed at high speed.

【００８２】（第４実施例）第４実施例では、データの
階層または順序構造を意識した検索を行なう。この動作
は、図１の情報検索装置において、第１実施例で使用し
た各部の他に、各データの末端に階層構造に応じた区切
りコードを付加する階層・順序別区切り付加部５と、入
力されたキーワードに階層を示す区切りコードを付加す
るキーワード階層別区切り付加部15とを使って実行され
る。(Fourth Embodiment) In the fourth embodiment, a search is performed in consideration of the hierarchy or order structure of data. This operation is performed in the information retrieval apparatus of FIG. 1 in addition to the components used in the first embodiment, a hierarchy / order-based delimiter adding unit 5 for adding a delimiter code according to a hierarchical structure to the end of each data, This is performed using the keyword-level-specific delimiter adding unit 15 that adds a delimiter code indicating the level to the keyword thus set.

【００８３】この場合の検索ファイルの作成は、図１６
の処理フローに従って行なわれる。In this case, the search file is created as shown in FIG.
Is performed according to the processing flow of

【００８４】ステップ１〜ステップ２：レコードデータ
の処理すべき項目（第ｋ番目）が指定されると、レコー
ド読取部１は、レコードデータ記憶部２からレコードデ
ータを読取り、レコード番号をセットする。Steps 1 and 2: When the item (k-th) to be processed of the record data is designated, the record reading section 1 reads the record data from the record data storage section 2 and sets the record number.

【００８５】ステップ３：階層・順序別区切り付加部５
は、各データの先頭、間、末尾に階層別・順序別区切り
コードを付加する。Step 3: Hierarchical / order-based delimiter adding unit 5
Adds a hierarchical / sequential delimiter code to the beginning, between, and end of each data.

【００８６】図１８（１）に、１つの項目内に記録され
た複数のデータの例を示している。この各データにはレ
ベルが付けられており、レベルに応じた区切りコードが
階層・順序別区切り付加部５により付けられる。図１８
（２）では、レベル１、２、３に対して区切りコードを
∧Ｓ、∧Ｔ、∧Ｕに対応させている。FIG. 18A shows an example of a plurality of data recorded in one item. Each data is assigned a level, and a delimiter code according to the level is added by the layer / order-based delimiter adding unit 5. FIG.
In (2), delimiter codes corresponding to levels 1, 2, and 3 correspond to $ S, $ T, and $ U.

【００８７】ステップ４〜ステップ16の手順は、実質的
に第１実施例の手順（図２、ステップ４〜ステップ15）
と同じであり、文字連鎖パターン生成部７が、階層・順
序別の区切りコードが付加された文字列における文字種
別ごとの出現回数を計算し、この文字列から文字連鎖パ
ターンを作成する。このとき区切りコードだけから成る
文字連鎖パターンは削除される（ステップ８）。検索フ
ァイル生成部10は、各種の区切りコードを含んだ文字連
鎖パターンに対応する文字連鎖情報を作成して検索ファ
イル記憶部11に格納し、検索ファイルを作成する。検索
ファイル記憶部11は、図１７に示すデータ構造を有し、
ここに、図１８（３）に示すように、文字連鎖情報が格
納される。なお、文字連鎖情報は、（第１文字出現回
数，第２文字出現回数，デ―タ番号，レコード番号）で
構成され、文字出現回数は階層区切りコード用番号の場
合もある。ここでは、階層に応じて区切りコードがレベ
ル付けされ、文字連鎖情報がこのレベルで分類されてい
る。図１８（３）は図１８（１）に対する検索ファイル
の例である。The procedure of steps 4 to 16 is substantially the same as that of the first embodiment (FIG. 2, steps 4 to 15).
The character chain pattern generation unit 7 calculates the number of appearances for each character type in a character string to which a delimiter code for each layer and order is added, and creates a character chain pattern from this character string. At this time, the character chain pattern consisting only of the delimiter code is deleted (step 8). The search file generation unit 10 creates character chain information corresponding to a character chain pattern including various delimiter codes, stores it in the search file storage unit 11, and creates a search file. The search file storage unit 11 has a data structure shown in FIG.
Here, as shown in FIG. 18C, character chain information is stored. It should be noted that the character chain information is (first character occurrence times)
Number, second character appearance count, de - data number, is composed of a record number), character appearance number is sometimes hierarchy delimiter code for numbers. Here, delimiter codes are assigned levels according to the hierarchy, and character chain information is classified at this level. FIG. 18 (3) is an example of a search file for FIG. 18 (1).

【００８８】次に、検索の処理の流れを図１９により説
明する。図２０には検索時のデータの流れを示してい
る。Next, the flow of the retrieval process will be described with reference to FIG. FIG. 20 shows the flow of data at the time of retrieval.

【００８９】ステップ１〜ステップ３：第１実施例と同
様に、キーワード読取部12は、入力キーワードを読込
み、指定項目をセットし、検索種別を指定する。ここで
は、図２０（１）に示すように、入力キーワードとして
「？渋谷」が入力され、その階層レベルとして「２」が
指定されたとする。Steps 1 to 3: As in the first embodiment, the keyword reading section 12 reads an input keyword, sets specified items, and specifies a search type. Here, as shown in FIG. 20A, it is assumed that “? Shibuya” is input as an input keyword and “2” is specified as its hierarchical level.

【００９０】ステップ４：キーワード階層別区切り付加
部15は、指定された階層に対する区切りコードを選択
し、ステップ５：キーワード区切り付加部13は、選択された
区切りコードを検索種別に応じてキーワードに付加す
る。この時、区切りを付加されたキーワードは「？渋谷
∧Ｔ」である（図２０（２））。Step 4: The keyword layer-specific delimiter 15 selects a delimiter code for the specified layer. Step 5: The keyword delimiter 13 adds the selected delimiter code to the keyword according to the search type. I do. At this time, the keyword added with the delimiter is “? Shibuya @ T” (FIG. 20 (2)).

【００９１】ステップ６：キーワード文字連鎖生成部17
は、この区切りを付加されたキーワードを文字パターン
に分割し文字連鎖パターンを作成する（図２０
（３））。Step 6: Keyword character chain generation unit 17
Creates a character chain pattern by dividing the delimited keyword into character patterns (FIG. 20).
(3)).

【００９２】以下のステップ７〜ステップ18の動作は、
分割された文字連鎖パターンに対して該当する文字連鎖
情報を取得する処理であり、第１実施例または第３実施
例の処理と同様である。The operations of the following steps 7 to 18 are as follows:
This is a process of acquiring character chain information corresponding to the divided character chain pattern, and is similar to the process of the first embodiment or the third embodiment.

【００９３】なお、図２１には、データを階層化する代
わりに、データに順序構造を取り入れた場合の例を示し
ている。図２１（１）は、そのデータの例、図２１
（２）は検索ファイルを作成した結果を示している。検
索ファイル作成及び検索処理の手順は階層化の場合と同
様である。FIG. 21 shows an example in which an order structure is adopted in data instead of hierarchizing the data. FIG. 21A shows an example of the data.
(2) shows the result of creating a search file. The procedure of search file creation and search processing is the same as in the case of hierarchization.

【００９４】このように第４実施例の情報検索装置で
は、項目に含まれる複数のデータが階層または順序構造
を持つ場合に、それらのデータに階層または順序構造に
応じた区切りコードを付与して検索ファイルを作成し、
検索時には、階層または順序を指定して検索を実行す
る。こうすることにより、検索ファイルに格納された文
字連鎖情報の中から指定された階層または順序の情報を
優先して照合することができ、１項目内の複数のデータ
に階層関係や順序関係を意識した検索ができる。As described above, in the information retrieval apparatus of the fourth embodiment, when a plurality of data included in an item has a hierarchical or sequential structure, a delimiter code according to the hierarchical or sequential structure is added to the data. Create a search file,
At the time of search, the search is executed by specifying a hierarchy or an order. By doing so, it is possible to preferentially collate the information of the specified hierarchy or order from the character chain information stored in the search file, and to be aware of the hierarchical relation and the order relation in a plurality of data in one item. You can search.

【００９５】[0095]

【発明の効果】以上の実施例の説明から明らかなよう
に、本発明の情報検索装置は、レコードデータの構成要
素である１項目に対して、複数のデータを繰り返し登録
することができ、入力されたキーワードを含む文字列や
パターンなどのデータを項目内で繰り返し高速で検索す
ることができる。As is apparent from the above description of the embodiment, the information retrieval apparatus of the present invention can repeatedly register a plurality of data for one item which is a constituent element of the record data. Data such as a character string and a pattern including the specified keyword can be repeatedly searched at high speed within the item.

【００９６】また、複数のデータが共通の属性を持つ場
合でも、新たに項目数を増やすことなく、１項目内に複
数データを追加することができるので登録が容易であ
る。また、検索時には、１項目のみを検索指定すればよ
いので、検索の操作が簡素化され、かつ検索処理時間を
短縮することができる。Even when a plurality of data have a common attribute, a plurality of data can be added in one item without newly increasing the number of items, so that registration is easy. Further, at the time of search, only one item needs to be specified, so that the search operation can be simplified and the search processing time can be shortened.

【００９７】また、１項目内のデータは全て区別されて
登録されているので、検索漏れが発生しない。Further, since all the data in one item are registered in a distinguished manner, no search omission occurs.

【００９８】また、１項目に追加されるデータの個数に
関係なく項目内の検索が実現できるので高速に検索がで
きる。Further, since a search within an item can be realized irrespective of the number of data added to one item, a high-speed search can be performed.

【００９９】また、キーワードを含む文字の照合で、区
切りコードを使用した文字連鎖情報（検索ファイル）を
検索するので、前方一致検索、後方一致検索、完全一致
検索、部分一致検索の各種検索が高速にできる。Also, since character chain information (search file) using a delimiter code is searched for by collating characters including a keyword, various searches such as a forward match search, a backward match search, a complete match search, and a partial match search are performed at high speed. Can be.

【０１００】また、区切りコードを含む文字連鎖情報を
優先して検索することが可能なので、特に、完全一致検
索など従来時間が掛かっていた検索処理を高速で実施す
ることができる。Further, since the character chain information including the delimiter code can be preferentially searched, it is possible to perform high-speed search processing, such as a perfect match search, which has conventionally taken a long time.

【０１０１】また、１項目内の複数のデータに階層関係
や順序関係を区切りコードとして持たせることが可能な
ので、１レコード内または１項目内で階層関係や順序関
係の持つデータを登録することが可能であり、階層構造
や順序関係を意識した検索ができる。Further, since a plurality of data in one item can have a hierarchical relation or an order relation as a delimiter code, it is possible to register data having a hierarchical relation or an order relation in one record or one item. It is possible, and it is possible to perform a search in consideration of the hierarchical structure and the order relation.

[Brief description of the drawings]

【図１】本発明の実施例における情報検索装置の構成を
示すブロック図、FIG. 1 is a block diagram showing a configuration of an information search device according to an embodiment of the present invention;

【図２】第１実施例の情報検索装置における検索ファイ
ル生成の処理フロー、FIG. 2 is a processing flow for generating a search file in the information search apparatus according to the first embodiment;

【図３】第１実施例の情報検索装置におけるレコードデ
ータ記憶部のデータ構造、FIG. 3 is a data structure of a record data storage unit in the information search device according to the first embodiment;

【図４】第１実施例の情報検索装置における項目ｊの文
字連鎖情報を示す図、FIG. 4 is a diagram showing character chain information of item j in the information search device of the first embodiment;

【図５】第１実施例の情報検索装置における検索ファイ
ル記憶部のデータ構造、FIG. 5 is a data structure of a search file storage unit in the information search device according to the first embodiment;

【図６】第１実施例の情報検索装置における検索ファイ
ル作成までのデータフロー、FIG. 6 is a data flow until a search file is created in the information search device of the first embodiment;

【図７】第１実施例の情報検索装置における検索処理フ
ロー、FIG. 7 is a search processing flow in the information search device of the first embodiment,

【図８】第１実施例の情報検索装置における検索のデー
タフロー、FIG. 8 shows a search data flow in the information search device of the first embodiment;

【図９】第２実施例の情報検索装置における検索処理フ
ロー、FIG. 9 is a search processing flow in the information search apparatus of the second embodiment,

【図１０】第２実施例の情報検索装置における検索のデ
ータフロー、FIG. 10 shows a search data flow in the information search device of the second embodiment;

【図１１】第３実施例の情報検索装置における検索ファ
イル生成の処理フロー、FIG. 11 is a processing flow for generating a search file in the information search apparatus according to the third embodiment;

【図１２】第３実施例の情報検索装置における検索ファ
イル記憶部のデータ構造、FIG. 12 illustrates a data structure of a search file storage unit in the information search device according to the third embodiment;

【図１３】第３実施例の情報検索装置における検索ファ
イル作成までのデータフロー、FIG. 13 shows a data flow up to creation of a search file in the information search device of the third embodiment;

【図１４】第３実施例の情報検索装置における検索処理
フロー、FIG. 14 is a flowchart of a search process in the information search device according to the third embodiment;

【図１５】第３実施例の情報検索装置における検索時の
データフロー、FIG. 15 shows a data flow at the time of search in the information search device of the third embodiment;

【図１６】第４実施例の情報検索装置における検索ファ
イル生成の処理フロー、FIG. 16 is a processing flow for generating a search file in the information search apparatus according to the fourth embodiment;

【図１７】第４実施例の情報検索装置における検索ファ
イル記憶部のデータ構造、FIG. 17 shows a data structure of a search file storage unit in the information search device of the fourth embodiment;

【図１８】第４実施例の情報検索装置における検索ファ
イル作成までのデータフロー、FIG. 18 is a data flow until a search file is created in the information search device of the fourth embodiment;

【図１９】第４実施例の情報検索装置における検索処理
フロー、FIG. 19 is a search processing flow in an information search device according to a fourth embodiment;

【図２０】第４実施例の情報検索装置における検索時の
データフロー、FIG. 20 is a data flow at the time of search in the information search device of the fourth embodiment;

【図２１】第４実施例の情報検索装置における検索ファ
イル作成までのデータフロー、FIG. 21 is a data flow up to creation of a search file in the information search device of the fourth embodiment;

【図２２】従来の情報検索装置におけるレコードデータ
の構成、FIG. 22 shows the structure of record data in a conventional information search device;

【図２３】従来の情報検索装置でのデータ検索手順を示
すフロー図である。FIG. 23 is a flowchart showing a data search procedure in a conventional information search device.

[Explanation of symbols]

１レコード読取部２レコードデータ記憶部３区切り付加部４群別区切り付加部５階層・順序別区切り付加部６優先指示部７文字連鎖パターン生成部８区切り判定部９番号付加部 10 検索ファイル生成部 11 検索ファイル記憶部 12 キーワード読取部 13 キーワード区切り付加部 14 キーワード群別区切り付加部 15 キーワード階層・順序別区切り付加部 16 優先照合部 17 キーワード文字連鎖生成部 18 照合部 19 検索結果表示部 REFERENCE SIGNS LIST 1 record reading section 2 record data storage section 3 delimiter addition section 4 group delimiter addition section 5 layer / order delimiter addition section 6 priority indication section 7 character chain pattern generation section 8 delimitation determination section 9 number addition section 10 search file generation section 11 Search file storage unit 12 Keyword reading unit 13 Keyword delimiter addition unit 14 Keyword group delimiter addition unit 15 Keyword hierarchy / order delimiter addition unit 16 Priority collation unit 17 Keyword character chain generation unit 18 Collation unit 19 Search result display unit

───────────────────────────────────────────────────── フロントページの続き (72)発明者菊池忠一大阪府門真市大字門真1006番地松下電器産業株式会社内 (56)参考文献特開昭51−57259（ＪＰ，Ａ) 特開平６−52226（ＪＰ，Ａ) 特開平６−301721（ＪＰ，Ａ) 特開昭64−35627（ＪＰ，Ａ) 特開平６−215041（ＪＰ，Ａ) 増永良文，「リレーショナルデータベース入門 −データモデル・ＳＱＬ・管理システム−」，株式会社サイエンス社，1993年３月10日初版第５刷発行, ｐ．28−30 (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06F 17/30 ＪＩＣＳＴファイル（ＪＯＩＳ)──────────────────────────────────────────────────続き Continuation of the front page (72) Inventor Tadakazu Kikuchi 1006 Kazuma Kadoma, Osaka Prefecture Matsushita Electric Industrial Co., Ltd. (56) References JP-A-51-57259 (JP, A) 52226 (JP, A) JP-A-6-301721 (JP, A) JP-A-64-35627 (JP, A) JP-A-6-215041 (JP, A) Yoshifumi Masunaga, "Introduction to Relational Database-Data Model / SQL / Management System- ", Science Co., Ltd., March 10, 1993, first edition, 5th printing, p. 28-30 (58) Fields surveyed (Int. Cl. ⁷ , DB name) G06F 17/30 JICST file (JOIS)

Claims

(57) [Claims]

1. An information retrieval apparatus for retrieving data including an input keyword from data in an item of record data having a plurality of items, wherein when the data in the item comprises a plurality of data, Delimiter assigning means for assigning a delimiter code between the beginning and end of the column connecting the data and the data; character chain pattern generating means for creating a character chain consisting of a set of two adjacent characters in the column; out in the data of each character constituting the
Search file creating means for creating character chain information including a current number , a record number, and the data number, storing the character chain information in a memory in association with the character chain, and creating a search file; Keyword-separating means for giving a character string; keyword-character-chain generating means for creating a character chain of keywords from keywords to which a delimiter code has been added; reading the character-chain information corresponding to the character chain of the keyword from the search file; An information retrieval apparatus, comprising: a collation unit configured to collate them and retrieve the data including a keyword.

2. The information according to claim 1, wherein said matching means is provided with priority matching means for prioritizing matching of character chain information corresponding to a character chain of a keyword including said delimiter code. Search device.

3. The information retrieval apparatus according to claim 1, wherein said character chain pattern generating means includes said delimiter code in said character chain.

4. The search file creation means according to claim 1, wherein the number of occurrences of the delimiter code in the character chain information is
4. The information search device according to claim 3, wherein a specific number is set.

5. The character chain pattern generating means creates the character chain without including the delimiter code, and the search file creating means determines a character string of a character chain adjacent to the delimiter code in the character chain. 2. The information retrieval apparatus according to claim 1, wherein the chain information is created separately from other character chain information.

6. The information search apparatus according to claim 5, wherein the search file creating means stores character chain information of a character chain adjacent to the delimiter code in the memory for each group.

7. The method according to claim 1, wherein the delimiter assigning unit assigns a plurality of types of delimiter codes indicating a hierarchy to the column, and the keyword delimiter assigning unit assigns a delimiter code indicating the hierarchy to an input keyword. Claim 1
An information retrieval device according to item 1.

8. The method according to claim 1, wherein the delimiter assigning unit assigns a plurality of types of delimiter codes indicating the order to the column, and the keyword delimiter assigning unit assigns the input keyword with the delimiter code indicating the order. Claim 1
An information retrieval device according to item 1.