JPH04153880A

JPH04153880A - String substring extraction processing method

Info

Publication number: JPH04153880A
Application number: JP2280322A
Authority: JP
Inventors: Tadahiro Murotani; 室谷　忠宏
Original assignee: PFU Ltd
Current assignee: PFU Ltd
Priority date: 1990-10-18
Filing date: 1990-10-18
Publication date: 1992-05-27

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】［概要〕文字プリミティブ列から所定の条件に適合した部分列を
抽出する文字列の部分列抽出処理方式に関し。DETAILED DESCRIPTION OF THE INVENTION [Summary] This invention relates to a character string substring extraction processing method for extracting a substring that satisfies a predetermined condition from a character primitive string.

入力文字列を所定の部分列に分割して所定の条件に適合
したものを抽出することを目的とし入力文字列を文字プ
リミティブ列に分割する分割部と、前記文字プリミティ
ブ列の属性を判別する属性判別部と５抽出条件から抽出
条件判別列を生成する抽出条件生成部と、前記抽出条件
に適合した前記文字プリミティブ列を抽出する抽出部と
前記属性を所定の順に格納する配列型格納領域とを備え
、前記属性判別部が、前記属性を前記配列型格納領域に
前記所定の順に格納し、前記抽出部が、前記配列型格納
領域に格納された内容と前記抽出条件判別列とを比較す
ることにより、前記抽出を行なうように構成する。A dividing unit that divides an input character string into character primitive strings for the purpose of dividing an input character string into predetermined substrings and extracting substrings that meet predetermined conditions; and an attribute that determines the attributes of the character primitive strings. a discriminator, an extraction condition generation section that generates an extraction condition discriminant string from the extraction conditions, an extraction section that extracts the character primitive string that matches the extraction condition, and an array type storage area that stores the attributes in a predetermined order. The attribute discriminator stores the attributes in the array type storage area in the predetermined order, and the extraction unit compares the contents stored in the array type storage area with the extraction condition discrimination string. Accordingly, the above extraction is performed.

[Industrial application field]

本発明は文字列の部分列抽出処理方式に関し。 The present invention relates to a substring extraction processing method for character strings.

更に詳しくは１文字プリミティブ列から所定の条件に適
合した部分列を抽出する文字列の部分列抽出処理方式に
関する。More specifically, the present invention relates to a character string substring extraction processing method for extracting a substring that satisfies a predetermined condition from a single character primitive string.

ある規則に従って作成された文字列の中から。From among strings created according to certain rules.

所定の条件に適合した部分列を自動的に抽出することが
できれば、当該部分列の内容が正しいか否か等のチエツ
クを行うのに有効である。If it is possible to automatically extract a subsequence that meets a predetermined condition, it is effective for checking whether the contents of the subsequence are correct or not.

[Conventional technology]

例えば、複数の部分を組立てである１つの製品を製造す
る場合５通常１組立て図面が作成される。For example, if a product is manufactured by assembling multiple parts, one assembly drawing is usually created.

組立て図面には、ｍ立での仕様の他ムこ、どの部品を何
個使用するかという情報〔部品情報〕も記載される。部
品情報は１例えば、数量（何個用いるか）１品名（部品
の一般的な名称）２図番（部品の詳細を特定する番号）
等からなる。The assembly drawings include information on which parts and how many are to be used (parts information) in addition to the specifications in m-stand. Part information includes: 1. Quantity (how many pieces to use) 1. Product name (general name of the part) 2. Drawing number (number that specifies the details of the part)
Consists of etc.

一方、ｍ立てに用いる部品の購入、管理等を目的として
３通常２部品リスト（ＳＫ）も作成される。部品リスト
は５部品情報からなる。On the other hand, a 3-ordinary-2 parts list (SK) is also created for the purpose of purchasing, managing, etc. parts used for m-setting. The parts list consists of five parts information.

ここで１組立て図と部品リストとの関係を考える９本来
１部品リストは１Ｍｉ立て図の完成後、これに基づいて
作成されるものである。しかし、これでは製造工程完了
の短縮が図れないので、ある程度まで組立て図が作成進
行した段階で部品リストが作成され１発注等がなされる
。そして１組立て図の完成後に部品リストを修正する。Here, let us consider the relationship between the 1 assembly drawing and the parts list.9 Originally, the 1 parts list is created based on the 1Mi vertical drawing after completion. However, since this cannot shorten the time required to complete the manufacturing process, a parts list is created and an order is placed after the assembly drawings have been created to a certain extent. Then, after completing one assembly drawing, the parts list is revised.

この修正のために１Ｍｉ立て図と部品７ノストを対比し
て１部品情報をチエツクする必要がある。In order to make this correction, it is necessary to check the 1 part information by comparing the 1Mi elevation and the part 7 nost.

[Problem to be solved by the invention]

前述の従来技術において１部品情報の対比によるチエツ
クは、設計者等が人手によって行っている。In the prior art described above, checking by comparing information on one part is performed manually by a designer or the like.

このため、以下の問題があった。即ち１組立て図におけ
る部品情報は、当該部品の近傍（の所定位置）に記入さ
れるため２図面全域に散在しており、その位置が判りに
くい。また１図と文字とが混在するため、この点からも
判りにくい。更に組立て図中には１部品情報の他にも種
々の情報が存在し、これらは共に「文字列」という形態
をとるため、どの文字列が部品情報か判りにくい。また
５部品情報の文字列のフォーマットには、幾つかの種類
があり、一方、フォーマントが変更されることがある。This caused the following problems. That is, the component information in one assembly drawing is written near (at a predetermined position) the component, so it is scattered throughout the two drawings, and its position is difficult to understand. Also, since there is a mixture of figures and text, it is difficult to understand from this point of view as well. Furthermore, in addition to single part information, there are various pieces of information in the assembly diagram, and since both of these pieces of information take the form of "character strings," it is difficult to determine which character string is part information. Furthermore, there are several types of formats for character strings of the 5-component information, and the formant may be changed.

このような場合１人手に依るチエツクでは正確に変更等
に対応できず、エラーを生しる可能性もある。In such a case, checking by one person cannot accurately respond to changes, etc., and there is a possibility that errors may occur.

本発明は２人力文字列を所定の部分列に分割して所定の
条件に適合したものを抽出することを可能とした文字列
の部分列抽出処理方式を提供することを目的とする。SUMMARY OF THE INVENTION An object of the present invention is to provide a character string substring extraction processing method that makes it possible to divide a character string manually into predetermined substrings and extract those that meet predetermined conditions.

[Means to solve the problem]

第１図は本発明の原理構成図であり６本発明によるデー
タ処理装置を示す。FIG. 1 is a basic configuration diagram of the present invention, and shows a data processing apparatus according to the present invention.

第１図において、１は処理装置、２は分割部３は属性判
別部、４は抽出条件生成部、５は抽出部、６は配列型格
納領域、７は読取り装置、８は出力装置である。In FIG. 1, 1 is a processing device, 2 is a division unit 3 is an attribute discrimination unit, 4 is an extraction condition generation unit, 5 is an extraction unit, 6 is an array type storage area, 7 is a reading device, and 8 is an output device. .

処理装置ｌは、中央処理装置（ｃｐｕ）及びメモリから
なる。The processing device l consists of a central processing unit (CPU) and a memory.

分割部２は、入力文字列を文字プリミティブ列（即ち部
分列）に分割する。The dividing unit 2 divides the input character string into character primitive strings (ie, substrings).

属性判別部３は２分割された文字プリミティブ列毎にそ
の属性を判別する。The attribute determination unit 3 determines the attribute of each character primitive string divided into two.

抽出条件生成部４は、抽出条件から抽出条件判別列を生
成する。The extraction condition generation unit 4 generates an extraction condition discriminant string from the extraction conditions.

抽出部５は７分割された文字プリミティブ列のうちから
、抽出条件に適合したものを抽出する。The extracting unit 5 extracts a string of character primitives that meets the extraction conditions from among the seven divided character primitive strings.

配列型格納領域６は、処理装置ｌのメモリ上に設けられ
２文字プリミティブ列の属性を所定の順に格納する。即
ち、このために、予め定められた大きさの配列型とされ
る。The array type storage area 6 is provided on the memory of the processing device 1 and stores attributes of a two-character primitive string in a predetermined order. That is, for this purpose, an array type having a predetermined size is used.

読取り装置７は１図面上の入力文字列、抽出条件を読取
る。The reading device 7 reads input character strings and extraction conditions on one drawing.

出力装置８は、抽出された文字プリミティブ列を出力す
る。The output device 8 outputs the extracted character primitive string.

[Effect]

第２図は本発明の作用説明図であり１本発明の処理フロ
ーを示す。FIG. 2 is an explanatory diagram of the operation of the present invention, and shows a processing flow of the present invention.

■　分割部２が、読取り装置７の読取った入力文字列を
、予め定められた条件に従って１文字プリミティブ列（
部分列）に分割する。■ The dividing unit 2 divides the input character string read by the reading device 7 into a one-character primitive string (
subsequence).

■　属性判別部３が１分割部２の分割した文字プリミテ
ィブ列の各々について予め定められた条件を用いてその
属性を判別し、その結果を、配列型格納領域６に所定の
順に格納（配列）する。■ The attribute determination unit 3 determines the attribute of each of the character primitive strings divided by the first division unit 2 using predetermined conditions, and stores the results in the array type storage area 6 in a predetermined order (array). do.

これにより、属性を記述した属性列が得られる。This provides an attribute string that describes the attributes.

■　抽出条件生成部４が、読取り装置７の読取った抽出
条件から、抽出条件判別列を生成する。(2) The extraction condition generation unit 4 generates an extraction condition discrimination string from the extraction conditions read by the reading device 7.

■　抽出部５が、配列型格納領域６に格納された内容即
ち属性列の各々と、抽出条件判別列とを比較し、一致し
た属性列に対応する文字プリミティブ列を、抽出条件に
適合したものとして抽出する。そして、出力装置ＦＢが
、当該文字プリミティブ列を例えば、印刷等により出力
する。■ The extraction unit 5 compares the contents stored in the array type storage area 6, that is, each of the attribute strings, with the extraction condition discrimination string, and selects a character primitive string corresponding to the matching attribute string as one that meets the extraction conditions. Extract as. Then, the output device FB outputs the character primitive string by, for example, printing.

以上によれば１図面上の文字列を区別なく入力し、この
中から所定の抽出条件に適合した文字列（部分列）１例
えば１部品情報のみを自動的に抽出することができる。According to the above, character strings on one drawing can be input without distinction, and only a character string (partial string) 1, for example, 1 part information that meets a predetermined extraction condition can be automatically extracted from among the character strings.

また、抽出条件を入力して判別列を生成することにより
１例えば部品情報のフォーマットの変更等に容易に対応
することができる。Furthermore, by inputting extraction conditions and generating a discriminant string, it is possible to easily respond to changes in the format of component information, for example.

〔Example〕

第３図は実施例構成図であり、第１図図示のデータ処理
装置の詳細を示す。FIG. 3 is a block diagram of an embodiment, showing details of the data processing apparatus shown in FIG.

入力文字列読取り装置１０及び抽出条件読取り装置１４
は、読取り装置７を構成する。Input character string reading device 10 and extraction condition reading device 14
constitutes the reading device 7.

文字プリミティブ分解部１１は２分割部２に相当し１分
割した文字プリミティブを文字プリミティブ格納領域１
２に格納する。文字プリミティブ格納領域１２は、処理
装置１のメモリの所定の位置に設けられる。The character primitive decomposition unit 11 corresponds to the two-part dividing unit 2, and stores the character primitives divided into one into the character primitive storage area 1.
Store in 2. The character primitive storage area 12 is provided at a predetermined location in the memory of the processing device 1.

判別値決定部１３は２属性判別部３に相当し文字プリミ
ティブ格納領域１２から文字プリミティブを取出し、そ
の属性、即ち２判別値を決定しこれを配列型格納ＩＪＩ
域６に格納する。The discriminant value determining unit 13 corresponds to the two-attribute discriminating unit 3, takes out a character primitive from the character primitive storage area 12, determines its attributes, that is, two discriminant values, and stores it in an array type IJI.
Store in area 6.

抽出条件判別列生成部１５は、抽出条件生成部４に相当
し、生成した抽出条件判別列を抽出条件判別列格納領域
１６に格納する。この領域１６は。The extraction condition discriminant string generation section 15 corresponds to the extraction condition generation section 4, and stores the generated extraction condition discriminant string in the extraction condition discriminant string storage area 16. This area 16 is.

処理装置１のメモリの所定の位置に設けられる。It is provided at a predetermined location in the memory of the processing device 1.

配列型記憶領域抽出条件判別列比較部１７は。The array type storage area extraction condition discriminant string comparison unit 17 is.

文字プリミティブ抜出し部１８と共に抽出部５を構成し
、配列型記憶領域６の内容の各々と抽出条件判別列の各
々とを比較する。この比較の結果に基づいて２文字プリ
ミティブ抜出し部１８は１文字プリミティブ格納領域１
２から適合する文字プリミティブを抜出す。The extraction section 5 is configured together with the character primitive extraction section 18, and compares each of the contents of the array type storage area 6 with each of the extraction condition discrimination strings. Based on the result of this comparison, the two-character primitive extraction unit 18 extracts the one-character primitive storage area 1.
Extract matching character primitives from 2.

文字プリミティブ出力装置１９は、出力装置８に相当し
、抜出された文字プリミティブを出力すなお、各部１１
．１３，１５．１７及び１８は。The character primitive output device 19 corresponds to the output device 8 and outputs the extracted character primitives.
．． 13, 15, 17 and 18.

処理袋Ｎ１のｃｐｕとそのメモリ上に存在する処理プロ
グラムとによって実現される。This is realized by the CPU of the processing bag N1 and the processing program existing on its memory.

第４図は文字プリミティブ説明図である。FIG. 4 is an explanatory diagram of character primitives.

第４図（ａ）において、■乃至［相］の各々が文字プリ
ミティブであり、「！」が文字プリミティブ列である。In FIG. 4(a), each of ■ to [phase] is a character primitive, and "!" is a character primitive string.

文字プリミティブ列！は、入力文字列に等しく所定の記
号（区切り文字）１例えば、「口（ブランクを表すもの
とする）」、　　ｒ、　　（ピリオド）」。Character primitive string! is a predetermined symbol (delimiter) 1 that is equal to the input string, for example, "mouth (shall represent a blank)", r, (period).

「、（カンマ）」等によって区切られる。この区切られ
た文字列の各々■乃至［相］が文字プリミティブである
。従って１文字プリミティブ■乃至０は。Separated by ", (comma)" etc. Each of the separated character strings from ■ to [phase] are character primitives. Therefore, the one-character primitive ■ to 0.

複数の文字を含むことがあり、かつ、これ以上分割して
は意味をなさなくなるものである。例えば。It may contain multiple characters, and it would be meaningless to divide it further. for example.

■の「コネジ」を「コＪ、ｒ＄ジ」に分解しては意味を
失う。If you break down ``Koneji'' in ■ into ``koJ, r$ji'', it loses its meaning.

なお、この文字プリミティブ列ｌは、主に部品情報（数
量２品名１図番）からなる。Note that this character primitive string 1 mainly consists of parts information (2 quantities, 1 product name, 1 drawing number).

また、このような分割処理は１分割部２が、第８図図示
の分割処理フローに従って、実行するものである。Further, such division processing is executed by the 1 division unit 2 according to the division processing flow shown in FIG.

即ち、読取り装置７で読取られた入力文字列は一旦所定
の配憶領域（図示せず）に格納される。That is, the input character string read by the reading device 7 is temporarily stored in a predetermined storage area (not shown).

分割部２は、入力文字列から一文字づつ取出した文字に
ついて１区切り文字か否かを判定し、これを利用して文
字プリミティブに分解する。The dividing unit 2 determines whether or not each character extracted from the input character string is a single delimiter, and uses this to decompose the character into character primitives.

なお、第８図において、Ｎｅは入力文字数。In addition, in FIG. 8, Ne is the number of input characters.

Ｎｐは文字プリミティブの数である。Np is the number of character primitives.

第４図（ｂ）は２文字プリミティブ■乃至［相］の各々
について　その属性を判別値「１」又は「０」を用いて
示している。この場合１文字プリミティブ列！から部品
情報を抽出しようとしているので属性が部品情報の数量
２品名２図番のうちのいずれを表しているのかを示す。FIG. 4(b) shows the attributes of each of the two-character primitives ``■'' to ``phase'' using the discriminant value ``1'' or ``0''. In this case, a one-character primitive string! Since we are trying to extract component information from , the attribute indicates which of the quantity, product name, and drawing number of the component information.

例えば、■の「ネジ」は、「品名」であるから２品名に
ついての判別値を「１」とじ５数量及び図番についての
判別値は「ＯＪとされる。他の文字プリミティブ■乃至
［相］についても同様に判別値が決定される。例えば２
文字プリミティブが片仮名からなれば「品名」、数字か
らなれば「数量」、アルファニューメリ、りからなれば
「図番」であると判別される。For example, "Screw" in ■ is a "product name", so the discriminant values for 2 product names are set to "1", and the discriminant values for 5 quantities and drawing numbers are set to "OJ". ] The discriminant value is determined in the same way. For example, 2
If the character primitive consists of katakana, it is determined that it is a "product name," if it consists of numbers, it is determined that it is a "quantity," and if it consists of an alphanumeric character, it is determined that it is a "picture number."

なお　判別の方法は、これに限定されるものではない。Note that the determination method is not limited to this.

また、一般に１文字プリミティブの属性（判別値）の数
をＮａとすると、当該属性は（ａｌ＋ａ！＋　　・・・ａ　Ｎ１１−１＋　　ａ　ｍ
ａｌなる集合で表される。ある文字プリミティブがｎ番
目の属性である（ｎ番目の判別値が「１」）であるとき
５ａｋ　＝１　（ｋ＝ｎ）＝Ｏ（ｋｉｎ）となる。In addition, in general, if the number of attributes (discriminant values) of a one-character primitive is Na, then the attribute is (al+a!+...a N11-1+ a m
It is represented by the set al. When a certain character primitive has the nth attribute (the nth discriminant value is "1"), 5 ak =1 (k=n) =O(kin).

第５図は抽出条件判別列説明図である。FIG. 5 is an explanatory diagram of the extraction condition discrimination sequence.

今２部品情報を抽出しようとしているものとし。Assume that you are currently trying to extract information on two parts.

かつ２部品情報は、数量１品名１図番の３つのみをこの
順序に記載することにより構成されているとする。It is also assumed that the two-part information is composed of only three items, one quantity, one item name, and one drawing number, written in this order.

この場合１部品情報は２本来的に、前記３つの項目の全
てが記入されたものからなるから、抽出条件としては「
数量１品名１図番」と指定され。In this case, 1 parts information 2 essentially consists of all of the above three items filled in, so the extraction condition is ``
Quantity: 1 item name: 1 drawing number.

入力される。これにより５抽出条件判別列は。is input. This results in 5 extraction condition discrimination columns.

ｒｌｌ、ｌ」となる。即ち、数量１品名５図番の３つに
ついての文字列が存在しく記入があり）。rll, l". In other words, three character strings exist and have been entered: quantity 1 product name 5 drawing number).

この結果各々の判別値が「ｌ」であるものが抽出され、
抽出条件に適合する文字列２即ち２部品情報とされる。As a result, those whose discriminant value is "l" are extracted,
The character string 2, that is, the 2-part information, that meets the extraction conditions is set.

また３部品情報は、その数量が「１」である場合にこれ
を省略して記入されることがある。そこで　このような
ものまで部品情報として抽出する場合、抽出条件として
「〔省略〕十品名十図番」と指定される。これにより、
抽出条件判別列はｒｏｌ、ｌ」となる。Furthermore, when the quantity of three parts information is "1", this may be omitted and entered. Therefore, when extracting such information as part information, the extraction condition is specified as ``[omitted] 10 product names and 10 drawing numbers.'' This results in
The extraction condition discriminant string is "rol,l".

更に２部品情報は１図番のみで品名を必要としないこと
があるので、これを省略して記入されることがある。そ
こで、この場合には抽出条件は「数量＋（省略〕十図番
ｊとされ、抽出条件判別列はｒｏ、０．ＩＪ又はｒｌ、
０．０」となる。Furthermore, since two-part information may require only one drawing number and no product name, this may be omitted and entered. Therefore, in this case, the extraction condition is "Quantity + (omitted) 10 drawing number j," and the extraction condition discrimination column is ro, 0.IJ or rl,
0.0".

本発明によれば、このように種々の抽出条件を所望によ
り設定できる。即ち、抽出条件読取り装置１４を設ける
ことにより、抽出対象に応して。According to the present invention, various extraction conditions can be set as desired in this way. That is, by providing the extraction condition reading device 14, depending on the extraction target.

柔軟な抽出条件が設定できる。そして、このように種々
の抽出条件を設定することにより、抽出対象の抽出率を
極めて高いものにできる。Flexible extraction conditions can be set. By setting various extraction conditions in this manner, the extraction rate of the extraction target can be made extremely high.

なお、抽出条件は４つ以上の項目で構成されていてもよ
い０例えば、８つの項目からなり、その３番目、５番目
及び６番目の項目が省略されているものを抽出するとす
る。この場合、抽出条件判別列は。Note that the extraction condition may consist of four or more items. For example, suppose that the extraction condition consists of eight items and the third, fifth, and sixth items are omitted. In this case, the extraction condition determination column is.

＊＋に＋ｋＯＯ０１１＄＄００００１＊＊０Ｏ１００＊＊。＊＋＋kOO011 $$00001* *0O100**.

１１０００＊＊＊。11000***.

の４つとなる。There are four.

また、このような生成処理は、抽出条件生成部４が、第
９図図示の生成処理フローに従って実行するものである
。Further, such a generation process is executed by the extraction condition generation unit 4 according to the generation process flow shown in FIG.

第９図において、要素ｄ　ｉｊに「０」又は「１」。In FIG. 9, element dij is "0" or "1".

省略の場合には「＊」が格納されることによって。If omitted, "*" is stored.

抽出条件判別列が求まる。ここで、ｊは「１」乃至ｒＮ
ａ」の値をとり、第５図中のＸ方向に相当し、ｉは「１
」乃至ｒＮｄＪ（抽出条件判別列の番号）の値をとり、
第５図中のＸ方向に相当する。Determine the extraction condition discriminant column. Here, j is "1" to rN
a", which corresponds to the X direction in FIG. 5, and i is "1".
” to rNdJ (number of extraction condition discriminant column),
This corresponds to the X direction in FIG.

Ｘ方向のｉ列目の全要素をＤｉで表すと２　このＤｉが
１つの抽出条件判別列を示す。If all the elements of the i-th column in the X direction are represented by Di, then 2 This Di indicates one extraction condition discriminant string.

第６図は格納状態説明図である。FIG. 6 is an explanatory diagram of the stored state.

今２文字プリミティブ■に着目すると、これが部品情報
の一部である可能性は１例えば。Now, if we focus on the two-character primitive ■, there is a possibility that this is part of the component information, for example.

（数量十品名十図番）−（省略＋■＋■）（数量十品名
十図番）−（■＋■＋■）（数量十品名十図番）＝（■
十■＋■）の３つの組合せがある。従って、この各々に
ついて、抽出条件判別列ｒ１．１．ＩＪ又はｒＯ，１゜
１」等と一致するか否かを調べる必要がある。(Quantity: 10 product names, 10 drawing numbers) - (omitted + ■ + ■) (Quantity: 10 product names, 10 drawing numbers) - (■ + ■ + ■) (Quantity: 0.
There are three combinations: 10■+■). Therefore, for each of these extraction condition discriminant columns r1.1. It is necessary to check whether it matches with IJ or rO, 1°1.

そこで、まず、配列型格納領域６が用意される。Therefore, first, an array type storage area 6 is prepared.

この配列の横の要素数は部品情報の項目の数Ｎａとされ
、縦の要素数は（文字プリミティブ数Ｎｐ＋（前記項目
の数−１））とされる。そして、この配列に、所定の順
で１文字プリミティブ■乃至［株］の各々の属性の判別
値が格納される。文字プリミティブ■（ｎ）について見
ると、その図番１品名及び数量は、各々１組合せ順“２
（ｎ）”“’３　（ｎ＋１）”及び“４　（ｎ＋２）”
の該当する項目の領域に格納される。他についても同様
である０以上により１文字プリミティブ■乃至［相］の
各々について１部品情報を構成する可能性のある組合せ
の全ての属性列を作成できる。The number of horizontal elements in this array is the number Na of parts information items, and the number of vertical elements is (number of character primitives Np+(number of items - 1)). Then, the discrimination values of the attributes of each of the one-character primitives ``■'' to ``stock'' are stored in this array in a predetermined order. Looking at the character primitive ■(n), its drawing number, product name, and quantity are each in the order of combination "2".
(n)” “’3 (n+1)” and “4 (n+2)”
is stored in the area of the corresponding item. Similarly for the others, it is possible to create all attribute strings of combinations that may constitute one part information for each of the one-character primitives ① to [phase] using 0 or more.

このような判別及び格納処理は、属性判別部３が、第１
０図図示の判別格納処理フローに従って実行するもので
ある。Such discrimination and storage processing is carried out by the attribute discriminator 3 in the first
This is executed according to the discrimination storage processing flow shown in FIG.

第１０図において、要素ｔｘｙに属性ａＶが格納される
ことによって、属性列が生成される。ここで、Ｘはｒｌ
、乃至「Ｎａ」の値をとり、第６図中のＸ方向に相当し
、ｙは「１」乃至ｒＮｐ　十Ｎａ−１」の値をとり５第
６図中のＸ方向に相当する。Ｘ方向のｉ列目の全要素を
Ｔｉで表すと。In FIG. 10, an attribute string is generated by storing attribute aV in element txy. Here, X is rl
, to "Na", which corresponds to the X direction in FIG. 6, and y takes values from "1" to "rNp +Na-1", and corresponds to the X direction in FIG. Let Ti represent all the elements in the i-th column in the X direction.

このＴｉが１つの属性列を示す。This Ti indicates one attribute string.

第７図は抽出処理説明図であり５第３図乃至第６図まで
をまとめて示したものである。FIG. 7 is an explanatory diagram of the extraction process, and shows all of FIG. 3 to FIG. 6 together.

前述の如く、入力文字列を文字プリミティブ■乃至［株
］に分解し、その属性の判別を行い１判別値を配列型格
納領域６に所定の順で格納する。一方抽出条件から抽出
条件判別列ｒｌ、１．ＩＪ及びｒｏ、１．ＩＪを得る。As described above, the input character string is decomposed into the character primitives ``■'' to ``stock'', their attributes are determined, and 1 discriminant values are stored in the array type storage area 6 in a predetermined order. On the other hand, from the extraction conditions, the extraction condition discriminant string rl, 1. IJ and ro, 1. Get IJ.

そして、配列型記憶領域６の各組合せ順と抽出条件判別
列とを比較する。この結果１組合せ順“６″及び１３″
が一致する。前者は、その数量２品名及び図番が、各々
２文字プリミティブ■■及び■により構成される部品情
報である。後者は、その数量１品名及び図番が、各々、
省略１文字プリミティブ＠及び■により構成される部品
情報である。Then, each combination order in the array type storage area 6 is compared with the extraction condition discrimination string. As a result, 1 combination order “6” and 13”
matches. The former is part information whose quantity, two product name, and drawing number are each composed of two-character primitives ■■ and ■. For the latter, the quantity 1 product name and drawing number are respectively
This is part information composed of the omitted one-character primitive @ and ■.

このように１文字プリミティブ列中から２部品情報が２
つ抽出される。即ち７「３．ネジ四Ｍ３×３０」は２図
番ｒＭ３Ｘ３．で特定される「ネジ」が「３」個である
ことを意味し、「ネジ四Ｍ７×７０」は１図番「Ｍ７×
７」で特定される「ネジ」が「１」個であることを意味
する。この場合の部品情報の抽出率は１００％　である
。従って、この後の作業を橿めて効率よく行いうる。In this way, two parts information from one character primitive string is
One is extracted. That is, 7 "3. Four screws M3 x 30" is 2 drawing number rM3X3. This means that the number of "screws" specified by is "3", and "four screws M7 x 70" is one drawing number "M7 x
7" means that the number of "screws" specified is "1". The parts information extraction rate in this case is 100%. Therefore, subsequent work can be done more efficiently.

以上の抽出処理は、抽出部５が、第１１図図示の抽出処
理フローに従って実行するものである。The extraction process described above is executed by the extraction unit 5 according to the extraction process flow shown in FIG.

第１１図において、抽出条件判別列Ｄｊ　と属性列Ｔ＋
　とを比較し１両者が一致し、がっ、その属性が指定さ
れている場合に、当該属性列Ｔｉのその属性に対応する
文字プリミティブを、当該属性として出力する。ここで
、ｑｈ＝０なら、に番目の属性が指定されており、Ｑｋ
＝１なら、当該属性が省略されていることを意味する。In FIG. 11, the extraction condition discrimination column Dj and the attribute column T+
If they match and the attribute is specified, the character primitive corresponding to the attribute in the attribute string Ti is output as the attribute. Here, if qh=0, the th attribute is specified and Qk
=1 means that the attribute is omitted.

〔Effect of the invention〕

以上説明したように１本発明によれば、所定の条件に適
合した部分列を抽出する文字列の部分列抽出処理におい
て１文字列を分割した文字プリミティブの属性を所定の
順に配列したものと抽出条件判別列とを比較することに
より、所望の抽出条件に適合した文字プリミティブを抽
出することができるので２部分列を用いた作業を行うた
めの当該部分列の抽出処理を自動化でき１作業の効率を
大幅に向上できる。As explained above, according to the present invention, attributes of character primitives obtained by dividing one character string are arranged in a predetermined order and extracted in a substring extraction process of a character string that extracts substrings that meet predetermined conditions. By comparing the condition discriminant string, character primitives that meet the desired extraction conditions can be extracted, which can automate the process of extracting the subsequence when performing work using two subsequences, increasing the efficiency of one task. can be significantly improved.

[Brief explanation of drawings]

第１図は本発明の原理構成図。第２図は本発明の作用説明図第３図は実施例構成図。第４図は文字プリミティブ説明図第５図は抽出条件判別列説明図。第６図は格納状態説明図９第７図は抽出処理説明図。第８図は分割処理フロー第９図は生成処理フロー第１０図は判別格納処理フロー第１１図は抽出処理フロー１は処理装置、２は分剖部５３は属性判別部４は抽出条
件生成部、５は抽出部、６は配列型格納領域、７は読取
り装置、８は出力装置である。特許出願人　　株式会社ビーエフニー代理人　弁理士　長谷用　交情（外２名）本発明の作用説明図第図実施例橋成第３図図生成処理フロ第　９　図（その１）住成処理フロ第図（その２）生成処理フロ第　９　区（その３）生成処理フロ第　９　図（その４）１別格納処理フロー第１０図抽　　出　　処第１１理　　フ　　ロ図（その１）抽出処第１１１！　　フ　　ロ図（その２）FIG. 1 is a diagram showing the principle configuration of the present invention. FIG. 2 is an explanatory diagram of the operation of the present invention. FIG. 3 is a diagram showing the configuration of an embodiment. FIG. 4 is an explanatory diagram of character primitives. FIG. 5 is an explanatory diagram of extraction condition discrimination strings. FIG. 6 is a diagram illustrating the storage state; FIG. 7 is a diagram illustrating the extraction process. FIG. 8 is a division processing flow. FIG. 9 is a generation processing flow. FIG. 10 is a discrimination storage processing flow. FIG. 11 is an extraction processing flow. , 5 is an extraction unit, 6 is an array type storage area, 7 is a reading device, and 8 is an output device. Patent Applicant: BFN Co., Ltd. Agent Patent Attorney: Yo Hase Kojo (2 others) Action explanatory diagram of the present invention Figure 3 Embodiment Figure 3 Generation process flow Figure 9 (Part 1) Housing process flow diagram (Part 2) Generation Processing Flow Section 9 (Part 3) Generation Processing Flow Fig. 9 (Part 4) Separate Storage Processing Flow Fig. 10 Extraction Process 11 Process Flow Diagram (Part 1) Extraction Process 11 1 ! Flow diagram (Part 2)

Claims

[Claims]

(1) A dividing unit (2) that divides an input character string into character primitive strings; and an attribute determination unit (2) that determines the attributes of the character primitive strings.
3), an extraction condition generation unit (4) that generates an extraction condition discriminant string from the extraction conditions, an extraction unit (5) that extracts the character primitive string that matches the extraction condition, and stores the attributes in a predetermined order. an array type storage area (6) in which the attribute discriminator (3) stores the attributes in the array type storage area (6) in the predetermined order, and the extraction unit (5) stores the attributes in the array type storage area (6); A substring extraction processing method for a character string, characterized in that the extraction is performed by comparing the content stored in a type storage area (6) with the extraction condition discriminant string.

(2) The input character string and extraction conditions are sent to the reading device (7
2. The method for extracting a substring of a character string according to claim 1, wherein the character string is read as follows.