JP3037776B2

JP3037776B2 - Term decomposition device

Info

Publication number: JP3037776B2
Application number: JP3115017A
Authority: JP
Inventors: 康子佐野
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1991-05-20
Filing date: 1991-05-20
Publication date: 2000-05-08
Anticipated expiration: 2015-05-08
Also published as: JPH04343171A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、システム仕様として入
力された入力語を、その仕様内容を認識可能な単位に分
解する用語分解装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a term decomposing device for decomposing an input word input as a system specification into units capable of recognizing the specification.

【０００２】[0002]

【従来の技術】最近、対象システム例えばプロセス制御
システムの要求に合致するプログラムを生成するため
に、システム要求をシステム仕様として入力して、その
入力された仕様に表現されている内容をもとにしてプロ
グラムを自動生成するプログラム自動生成装置が開発さ
れている。このプログラム自動生成装置では、上記シス
テム仕様を解析するために、システム仕様として入力さ
れた入力語を、仕様として理解し易い単位に分解する用
語分解装置が用いられている。かかる用語分解装置の構
成例を図６に示す。2. Description of the Related Art Recently, in order to generate a program that meets the requirements of a target system, for example, a process control system, system requirements are input as system specifications and based on the contents expressed in the input specifications. A program automatic generation device that automatically generates a program by using the program has been developed. In order to analyze the system specifications, the automatic program generation device uses a term decomposition device that decomposes input words input as system specifications into units that can be easily understood as specifications. FIG. 6 shows a configuration example of such a term decomposition apparatus.

【０００３】この用語分解装置１は、分解したい用語が
予め登録された用語辞書２を備え、システム仕様を記述
した入力語３を上位の処理系から受け取ると、用語辞書
２に登録されている用語に従って分解処理を行う。この
分解処理は、入力語３を先頭の文字から順に読込んで、
その読込んだ部分文字列に一致した語を用語辞書２から
検索し、そのうちの最も長い文字列毎に分解する。分解
した用語はその用語の属性と共に出力語４として上位の
処理系へ返す。この用語分解装置１の具体的な動作につ
いて第７図を参照して説明する。The term decomposition apparatus 1 includes a term dictionary 2 in which terms to be decomposed are registered in advance. When an input word 3 describing system specifications is received from a higher-level processing system, a term registered in the term dictionary 2 is received. According to the following. This decomposition process reads the input word 3 in order from the first character,
A word that matches the read partial character string is searched from the term dictionary 2 and decomposed for each longest character string. The decomposed term is returned to an upper processing system as an output word 4 together with the attribute of the term. The specific operation of the term decomposition device 1 will be described with reference to FIG.

【０００４】用語分解装置１では、上位の処理系から
「前進中間位置」なる入力語３−１を受け取ると、この
先頭からの部分文字列に一致する語を用語辞書２から検
索する。この検索された文字列（用語）の中から最も長
い用語を選択して入力語３−１から切り出す。同図の場
合、「前進」が選択されて「前進中間位置」から切り出
され、残りの文字列である「中間位置」に対し、さらに
用語辞書２からの文字列の検索Ｔが行われる。その結
果、「前進中間位置」は、「前進」＋「中間位置」に分
解され、各々の属性である｛動作｝、｛位置｝と組み合
わされて、出力語４−１，４−２として上位の処理系に
返される。[0004] When the term decomposition device 1 receives the input word 3-1 of “advance intermediate position” from a higher-level processing system, it searches the term dictionary 2 for a word that matches the partial character string from the beginning. The longest term is selected from the searched character strings (terms) and cut out from the input word 3-1. In the case shown in the figure, "forward" is selected and cut out from "forward intermediate position", and a character string search T from the term dictionary 2 is further performed on the remaining character string "intermediate position". As a result, the "advance intermediate position" is decomposed into "advance" + "intermediate position", and is combined with the respective attributes {operation} and {position} to form upper words as output words 4-1 and 4-2. Is returned to the processing system.

【０００５】ところで、上述した用語分解装置１は、入
力語３を用語辞書２に登録されている用語に従って忠実
に分解をする機能しか有していなかったので、上位の処
理系から渡される可能性のある用語は、すべて用語辞書
２に登録しておく必要があった。The term decomposition apparatus 1 has only a function of faithfully decomposing the input words 3 in accordance with the terms registered in the term dictionary 2. It is necessary to register all the words with the term in the word dictionary 2.

【０００６】しかしながら、システム仕様に用いられる
用語は膨大な数あり、それら全ての用語を用語辞書２に
登録し、かつ管理しておくのには極めて多くの労力が必
要となる。例えば、基本となる用語「前進」に対して
は、「前進中」、「前進位置」、「前進検出」、「前進
指令」等の組み合わせがあり、同様に、「後退」に対し
ても、「後退中」、「後退位置」、「後退検出」、「後
退指令」等の組み合わせが存在する。従来の用語分解装
置１では、これら全ての組み合わせを用語辞書２に登録
しなければならないわけである。However, there are an enormous number of terms used in the system specifications, and an extremely large amount of labor is required to register and manage all the terms in the term dictionary 2. For example, for the basic term "forward", there are combinations such as "forward", "forward position", "forward detection", "forward command", and likewise, for "reverse", There are combinations such as “during retreat”, “retreat position”, “retreat detection”, and “retreat command”. In the conventional term decomposition apparatus 1, all of these combinations must be registered in the term dictionary 2.

【０００７】また、ある分解すべき用語と、その用語を
一部に含む他の用語とがそれぞれ用語辞書２に登録され
ている場合、上記他の用語に分解すべきなのにも拘ら
ず、読込んだ部分文字列が偶然に短いほうの用語に一致
してしまったときには、正確に分解できなくなるという
問題がある。例えば、入力語３が「前進中間位置」で、
用語辞書２に、「前進」、「前進中」、「中間位置」が
登録されていた場合、分解すべき組み合わせは「前進」
＋「中間位置」であるにも拘らず、実際には、「前進
中」＋「間位置」とに分解されてしまう。[0007] Further, when a certain term to be decomposed and another term partially including the term are registered in the term dictionary 2, they are read in spite of the fact that they should be decomposed into the other terms. However, if a substring accidentally matches the shorter term, there is a problem that it cannot be correctly decomposed. For example, if the input word 3 is “forward intermediate position”,
If “forward”, “in progress”, and “intermediate position” are registered in the term dictionary 2, the combination to be decomposed is “forward”
In fact, despite being at the “intermediate position”, it is actually decomposed into “forward” + “intermediate position”.

【０００８】なお、このような、不正確な用語分解を行
わないようにするために、翻訳機械等の為に開発されて
いる構文解析装置などを用いることも考えられるが、構
文解析装置は、一般に機構が複雑かつ大規模なため、用
語分解装置のようなプログラム自動生成装置の中に組み
込むためのデータを用意するには、大変な労力を要し現
実的でない。In order to prevent such inaccurate term decomposition, it is conceivable to use a parsing apparatus developed for a translation machine or the like. In general, the mechanism is complicated and large-scale, and preparing data for incorporation into an automatic program generation device such as a term decomposition device requires a great deal of labor and is not realistic.

【０００９】[0009]

【発明が解決しようとする課題】このように、従来の用
語分解装置は、膨大な数の用語を用語辞書に登録しなけ
ればならず多大な労力を必要とし、しかも入力語を正確
に分解できない可能性があった。As described above, the conventional term decomposer has to register an enormous number of terms in a term dictionary, requires a great deal of labor, and cannot decompose input words accurately. There was a possibility.

【００１０】本発明は以上のような実情に鑑みてなされ
たもので、用語辞書に登録すべき用語の数を大幅に減少
できて労力の軽減を図ることができ、かつ入力語を正確
に分解できて装置の信頼性を向上し得る用語分解装置を
提供することを目的とする。[0010] The present invention has been made in view of the above circumstances, and can significantly reduce the number of terms to be registered in the term dictionary, reduce labor, and accurately decompose input words. It is an object of the present invention to provide a term decomposition apparatus which can improve the reliability of the apparatus.

【００１１】[0011]

【課題を解決するための手段】本発明は上記目的を達成
するために、対象システムの要求に応じたシステム仕様
を表現するために使用される用語及びその属性が格納さ
れた用語辞書を備え、前記用語を組合わせて表現された
システム仕様が入力語として与えられ、その入力語を前
記用語辞書に基づいて所定の単位に分解する用語分解装
置において、分解してはいけない用語の配列規則を定め
た分解禁止パタンが記憶された分解禁止パタン記憶手段
と、前記分解禁止パタンで定められた用語配列を含んで
いるが所定の用語と組合わされることにより分解可能と
なる用語の配列規則を定めた分解可能パタンが記憶され
た分解可能パタン記憶手段と、前記分解禁止パタン及び
前記分解可能パタンとを参照しながら前記入力語を分解
する分解手段とを具備した構成とした。In order to achieve the above object, the present invention comprises a term dictionary in which terms used for expressing a system specification according to a request of a target system and attributes thereof are stored, A system specification expressed by combining the terms is given as an input word, and a term decomposing device that decomposes the input word into predetermined units based on the term dictionary defines an arrangement rule of terms that must not be decomposed. A decomposition prohibition pattern storing means in which the decomposition prohibition pattern is stored, and an arrangement rule of a term which includes the term array defined by the decomposition prohibition pattern but can be decomposed by being combined with a predetermined term is defined. Decomposable pattern storage means storing a decomposable pattern, and decomposing means for decomposing the input word with reference to the decomposition prohibition pattern and the decomposable pattern. It was Bei configuration.

【００１２】[0012]

【作用】本発明は、以上のような手段を講じたことによ
り、入力された入力語に対し、用語辞書に従って分解で
きる用語が取り出され、次に分解禁止パタン、分解可能
パタンに基づいて、用語辞書から取り出した用語が分解
可能であるかどうか判定されて分解可能である場合にの
み分解される。よって、用語辞書には基本的な用語のみ
を登録し、基本用語を修飾するような基本用語を一部に
含む文字列は、分解禁止パタンおよび分解可能パタンと
して配列規則だけを分解禁止パタン記憶手段及び分解可
能パタン記憶手段に登録しておけば、あらゆる入力語に
対応できると共に正確に分解でき、しかも登録用語数も
大幅に減少されるものとなる。According to the present invention, by taking the above measures, terms that can be decomposed according to the term dictionary are extracted from the input words that have been input, and then the terms are determined based on the decomposition prohibition pattern and the decomposable pattern. It is determined whether the term extracted from the dictionary is decomposable, and is decomposed only if the term is decomposable. Therefore, only a basic term is registered in the term dictionary, and a character string partially including a basic term that modifies the basic term is used as a decomposition prohibition pattern and a decomposable pattern. By registering in the decomposable pattern storage means, it is possible to deal with any input word and to decompose accurately, and the number of registered terms is greatly reduced.

【００１３】[0013]

【実施例】以下、本発明の一実施例に係る用語分解装置
について説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, a term decomposition apparatus according to one embodiment of the present invention will be described.

【００１４】図１には、一実施例に係る用語分解装置１
０の機能ブロックが示されている。なお、本実施例は、
用語分解装置１０をプログラム自動生成装置に用いた例
である。FIG. 1 shows a term decomposition apparatus 1 according to one embodiment.
0 functional blocks are shown. In this embodiment,
This is an example in which the term decomposition device 10 is used for an automatic program generation device.

【００１５】この用語分解装置１０は、後述する用語分
解処理を実行する装置本体１１、及びこの装置本体１１
にそれぞれ接続された用語辞書１１、分解禁止パタン記
憶手段１２、分解可能パタン記憶手段１３を備えて構成
されている。The term decomposition apparatus 10 includes an apparatus main body 11 for executing a term decomposition process described later,
, A term dictionary 11, a disintegration prohibition pattern storage unit 12, and a decomposable pattern storage unit 13 which are respectively connected to the system.

【００１６】装置本体１１は、上位処理系１５から分解
すべき入力語が与えられ、図３及び図４に示すフローチ
ャートに基づいて分解した用語を出力語として上位処理
系１５へ返すように動作する。用語辞書１２は、システ
ム仕様に用いられる基本的な用語がその属性と共に登録
されている。The apparatus main body 11 is supplied with an input word to be decomposed from the high-level processing system 15 and operates to return the decomposed term as an output word to the high-level processing system 15 based on the flowcharts shown in FIGS. . In the term dictionary 12, basic terms used in the system specifications are registered together with their attributes.

【００１７】分解禁止パタン記憶手段１３は、図２
（ａ）に示すように、第１要素、第２要素、第３要素に
よって分解してはいけない配列規則、即ち分解禁止パタ
ンが定められ、各分解禁止パタン毎にその属性が登録さ
れている。例えば、第１要素，第２要素に＜STRING＞，
「中」が登録されている場合には、＜STRING＞と「中」
とは分解してはいけないこととする。ここで、＜STRING
＞は具体的な語までは指定しない単なる文字列を指定し
ているだけなので、この一つの分解禁止パタンで「前進
中」、「行進中」等の複数の用語の分解禁止を指定して
いることになる。従って、分解禁止パタンに含まれる用
語は用語辞書１２への登録から削除される。このような
分解禁止パタンを、誤って分解される恐れのある特定の
用語について定義しておき、図２（ａ）に示すようにパ
ターン化して記憶しておく。The disintegration prohibition pattern storage means 13 is provided in FIG.
As shown in (a), an arrangement rule that cannot be decomposed by the first element, the second element, and the third element, that is, a decomposition prohibition pattern is determined, and the attribute is registered for each decomposition prohibition pattern. For example, <STRING>,
If "Medium" is registered, <STRING> and "Medium"
Shall not be disassembled. Where <STRING
> Indicates only a simple character string that does not specify a specific word, so this one decomposition prohibition pattern specifies the prohibition of decomposing multiple terms such as "move forward" and "march" Will be. Therefore, the terms included in the decomposition prohibition pattern are deleted from the registration in the term dictionary 12. Such a decomposition prohibition pattern is defined for a specific term that may be erroneously decomposed, and is stored in a pattern as shown in FIG.

【００１８】分解可能パタン記憶手段１４は、図２
（ｂ）に示すように、前要素と後要素とから構成され、
文字列の一部に分解禁止パタンが含まれるが分解できる
配列規則、即ち分解可能パタンが定められている。例え
ば、＜STRING＞の後に「中間」が続く場合、＜STRING＞
＋「中」は、上記分解禁止パタンに該当するが、「中」
の後にすぐ「間」が来る場合には、＜STRING＞と「中」
の間を分解できることとする。この様な分解可能パタン
も分解禁止パタンに対応して定めておく。The decomposable pattern storage means 14 is provided in FIG.
As shown in (b), it is composed of a front element and a rear element,
An arrangement rule that can be decomposed, that is, a decomposition prohibition pattern is included in a part of the character string, but a decomposable pattern is defined. For example, if <intermediate> follows <STRING>, then <STRING>
+ “Medium” corresponds to the above decomposition prohibition pattern, but “Medium”
If "interval" comes immediately after "<STRING>" and "medium"
Can be disassembled. Such decomposable patterns are also determined corresponding to the decomposition prohibition patterns.

【００１９】すなわち、＜STRING＞＋「中」や＜STRING
＞＋「中間」のように用語辞書１２に登録されていない
用語であっても、上記分解禁止パタンおよび分解可能パ
タンを用いることにより分解できる。よって、用語辞書
１２への登録用語数が削減される。That is, <STRING> + “medium” or <STRING
Even a term that is not registered in the term dictionary 12 such as> + “intermediate” can be decomposed by using the above-mentioned decomposition prohibition pattern and decomposable pattern. Therefore, the number of terms registered in the term dictionary 12 is reduced.

【００２０】なお、上位処理系１５は、システム仕様入
力手段１６からシステム仕様が入力され、プログラムを
作成する上でその意味内容を解析する必要があると判断
された用語を、入力語として用語分解装置１０に与え
る。また、用語分解装置１０でプログラム可能に分解さ
れた用語とその属性を受けとりプログラム１７を出力す
る。次に、以上のように構成された本実施例の動作につ
いて説明する。The high-level processing system 15 receives the system specifications from the system specification input means 16 and decomposes the terms determined to need to be analyzed in terms of their meaning in creating a program as terminology. Give to device 10. Further, it receives the terms and their attributes that have been programmatically decomposed by the term decomposer 10 and outputs a program 17. Next, the operation of the present embodiment configured as described above will be described.

【００２１】用語分解装置１０では、入力語を上位処理
系１５から受け取ると、用語辞書１２に登録されている
用語に従って、入力語を、特定の意味のある用語の単位
毎に分解する。それら分解された用語は、その用語の属
性と共に出力語として上位処理系１５へ返される。この
とき、装置本体１１は、分解禁止パタンと、分解可能パ
タンとを参照して、入力語の分解を行い、用語辞書１２
には登録されていない用語も登録されている用語と同様
に分解する。When the input word is received from the high-level processing system 15, the term decomposition device 10 decomposes the input word into units of specific meaningful terms in accordance with the terms registered in the term dictionary 12. These decomposed terms are returned to the high-level processing system 15 as output words together with the attributes of the terms. At this time, the device main body 11 decomposes the input word with reference to the decomposition prohibition pattern and the decomposable pattern, and
The terms that are not registered are also decomposed in the same way as the registered terms.

【００２２】図３は、装置本体１１における用語分解処
理を示したフローチャートである。装置本体１１は、上
位処理系１５から、又は用語分解装置１０内で変更され
た入力語が与えられると、文字列長さを判定して（ステ
ップＣ１）、文字列長さが０より大きい間は入力語の分
解処理を行う。FIG. 3 is a flowchart showing the term decomposition processing in the apparatus main body 11. When the input word changed from the higher-level processing system 15 or in the term decomposition device 10 is provided, the device main body 11 determines the character string length (step C1) and determines whether the character string length is greater than 0. Performs input word decomposition processing.

【００２３】入力語の分解処理は、先ず、入力語に完全
に一致する文字列が、用語辞書１２に定義されているか
どうかを検索し（ステップＣ２）、定義されていれば、
入力語と、その語に付加されている属性を出力語の一つ
とし、分解処理を終了する（ステップＰ３）。In the decomposition process of the input word, first, it is searched whether or not a character string completely matching the input word is defined in the term dictionary 12 (step C2).
The input word and the attribute added to the word are set as one of the output words, and the decomposition process ends (step P3).

【００２４】上記ステップＣ２の検索において、定義さ
れていなかったときには、入力語の先頭から一文字づつ
読込み、その部分文字列に一致した語を用語辞書１２か
ら検索し、該当する用語の中から最も長い文字列を、出
力語の候補とする（ステップＰ１）。In the search in step C2, if not defined, the input word is read one character at a time from the beginning, a word matching the partial character string is searched from the term dictionary 12, and the longest word among the corresponding words is searched. The character string is set as an output word candidate (step P1).

【００２５】上記ステップＰ１において、出力語の候補
が決定できなければ、入力語を、属性未定のまま出力語
の一つとし、分解処理を終了する（ステップＰ３）。出
力語の候補が決定できれば、その出力語の候補に対し、
分解禁止パタン、分解可能パタンに基づいて用語の分解
を行い、出力語を決定して入力語からその文字列を切り
出し、同時にその文字列に付加された属性を確保してお
き、残りの文字列を新しい入力語とする（ステップＰ
２）。If the output word candidate cannot be determined in step P1, the input word is set as one of the output words with the attribute undecided, and the disassembly process is terminated (step P3). If the output word candidate can be determined,
Decompose terms based on decomposition prohibition patterns and decomposable patterns, determine output words, cut out the character strings from input words, and at the same time secure the attributes added to the character strings, and leave the remaining character strings As a new input word (step P
2).

【００２６】このようにして分解処理が終了したら、そ
れまでに決定された出力語と、それらの語に付加されて
いる属性をまとめて、用語分解の処理を呼び出した上位
処理系１５へ返す（ステップＰ４）。上記ステップＰ２
での処理の詳細を図４に示す。When the decomposition process is completed in this way, the output words determined so far and the attributes added to those words are put together and returned to the higher-level processing system 15 that called up the term decomposition process ( Step P4). Step P2 above
FIG. 4 shows the details of the processing in.

【００２７】ステップＰ２での処理が開始されると、先
ず、ステップＰ１で決定したか、または本処理ステップ
Ｐ２において変更された出力語の候補となる文字列の長
さと、入力語の文字列の長さを比較し（ステップＣ
４）、等しい場合は、出力語の候補を出力語の一つとし
て入力語から切り出し、付加されている属性を確保した
上で、残りの文字列を新しい入力語として処理を終了す
る（ステップＰ６）。When the process in step P2 is started, first, the length of the character string which is the output word candidate determined in step P1 or changed in the present process step P2, and the character string of the input word Compare the lengths (Step C
4) If they are equal, the output word candidate is cut out from the input word as one of the output words, the added attribute is secured, and the process is terminated with the remaining character strings as new input words (step P6). ).

【００２８】上記ステップＣ４で出力語の候補の文字列
のほうが短い場合は、出力語の候補となる文字列が分解
禁止パタン記憶手段１３に記憶されている分解禁止パタ
ンで定義してある文字列のパタンに該当するか否かを判
定し（ステップＣ５）、該当しないならば、出力語の候
補を出力語の一つとして入力語から切り出し、付加され
ている属性を確保して、残りの文字列を新しい入力語と
して処理を終了する（ステップＰ６）。If the character string of the output word candidate is shorter in step C4, the character string which is the output word candidate is the character string defined by the decomposition prohibition pattern stored in the decomposition prohibition pattern storage means 13. Is determined (step C5). If not, an output word candidate is cut out from the input word as one of the output words, the added attribute is secured, and the remaining characters are obtained. The process ends with the column as a new input word (step P6).

【００２９】上記ステップＣ５で分解禁止パタンに該当
したならば、さらに、出力語の候補となる文字列が分解
可能パタン記憶手段１４に記憶されている分解可能パタ
ンで定義してある文字列のパタンに該当するか否かを判
定し（ステップＣ６）、分解可能パタンに該当しなけれ
ば、分解禁止パタンに該当する部分の文字列を新しい出
力語の候補とし、その属性を分解禁止パタンに登録され
ている属性であるとみなして、分解処理を禁止する（ス
テップＰ５）。If the pattern corresponds to the decomposition prohibition pattern in step C5, the character string which is a candidate for an output word is further defined by the character string pattern defined by the decomposable pattern stored in the decomposable pattern storage means 14. Is determined (step C6). If the pattern does not correspond to a decomposable pattern, the character string of the portion corresponding to the decomposition prohibition pattern is set as a new output word candidate, and its attribute is registered in the decomposition prohibition pattern. The disassembly process is prohibited because the attribute is regarded as the attribute (step P5).

【００３０】上記ステップＣ６で分解可能パタンに該当
したならば、分解禁止パタンに該当する文字列の長さと
分解可能パタンに該当する文字列の長さを比較し（ステ
ップＣ７）、分解禁止パタンに該当する文字列の長さの
ほうが長ければ、分解禁止パタンに該当する部分の文字
列を新しい出力語の候補とし、その属性を分解禁止パタ
ンに登録されている属性であるとみなして、分解処理を
継続する（ステップＰ５）。また、分解可能パタンに該
当する文字列の長さのほうが長ければ、出力語の候補を
出力語の一つとして入力語から切り出し、残りの文字列
を新しい入力語として、分解処理を終了する（ステップ
Ｐ６）。例えば、入力語として「前進中間位置」を分解
する場合にはついて図５を参照しながら説明する。If the pattern corresponds to the decomposable pattern in step C6, the length of the character string corresponding to the decomposable pattern is compared with the length of the character string corresponding to the decomposable pattern (step C7). If the length of the corresponding character string is longer, the character string of the part corresponding to the decomposition prohibition pattern is set as a candidate for a new output word, and the attribute is regarded as an attribute registered in the decomposition prohibition pattern. (Step P5). If the length of the character string corresponding to the decomposable pattern is longer, the output word candidate is cut out from the input word as one of the output words, and the remaining character strings are used as new input words, and the decomposition process ends. Step P6). For example, a case where the “forward intermediate position” is decomposed as an input word will be described with reference to FIG.

【００３１】用語分解装置１０は、上位処理系１５から
入力語２０として「前進中間位置」という語を受け取る
と、先ず、用語辞書１２に「前進中間位置」という語が
登録されているかどうか検索する。同図の例では登録さ
れていないので、ステップＰ１に移行し、「前進中間位
置」の先頭からの部分文字列に一致している語が用語辞
書１２に登録されているかどうか検索する。本例では、
用語辞書１２から「前進」という語が検索される。When the term decomposition device 10 receives the word “forward intermediate position” as the input word 20 from the host processing system 15, it first searches whether or not the word “forward intermediate position” is registered in the term dictionary 12. . In the example of FIG. 7, since the word is not registered, the process proceeds to step P1, and a search is made as to whether a word that matches the partial character string from the head of the “forward intermediate position” is registered in the term dictionary 12. In this example,
The term “forward” is searched from the term dictionary 12.

【００３２】この「前進」を出力語の候補として、分解
禁止パタンと比較する。このとき、「前進」の後ろに
「中」が続いているので、分解禁止パタンとして定義さ
れている＜STRING＞＋「中」に該当する。さらに、分解可能パタンとの比較を行う。
「前進」の後ろに続く文字列が「中間」であることか
ら、分解可能パタンに定義されている＜STRING＞＋「中間」に該当する。This “advance” is compared with a decomposition prohibition pattern as an output word candidate. At this time, since “middle” follows “forward”, it corresponds to <STRING> + “medium” defined as a decomposition prohibition pattern. Furthermore, comparison with a decomposable pattern is performed.
Since the character string following “forward” is “middle”, it corresponds to <STRING> + “middle” defined in the decomposable pattern.

【００３３】そこで、分解禁止パタンに該当する文字列
の長さと分解可能パタンに該当する文字列の長さを比較
すると、明らかに分解可能パタンに該当する文字列のほ
うが長いことがわかるので、「前進」と「中間」の間を
分解することに決定し、出力語の候補「前進」と、用語
辞書１２に登録されている属性｛動作｝の組を出力語２
１−１の一つとして、文字列「前進」を入力語２０「前
進中間位置」から切り出し、残りの文字列「中間位置」
を新しい入力語２０とする。When the length of the character string corresponding to the decomposable pattern is compared with the length of the character string corresponding to the decomposable pattern, it is clear that the character string corresponding to the decomposable pattern is longer. It is decided to decompose between "forward" and "intermediate", and a set of the output word candidate "forward" and the attribute {action} registered in the term dictionary 12 is output word 2
As one of 1-1, the character string “forward” is cut out from the input word 20 “forward intermediate position”, and the remaining character string “intermediate position” is extracted.
Is a new input word 20.

【００３４】続いて、新しい入力語２０「中間位置」に
対して用語の分解を行い、用語辞書１２の検索を行う
と、部分文字列「中間」が出力語の候補にできることが
わかる。次に、分解禁止パタンと分解可能パタンとの比
較を行うと、分解禁止パタンに定義されている＜STRING＞＋「位置」に該当することがわかるので、文字列「中間位置」と、
分解禁止パタンに登録されている属性｛位置｝の組を出
力語の一つ（２１−２）とする。Subsequently, when the new input word 20 "intermediate position" is decomposed into terms and the term dictionary 12 is searched, it is understood that the partial character string "intermediate" can be a candidate for an output word. Next, a comparison between the decomposition prohibition pattern and the decomposable pattern reveals that the pattern corresponds to <STRING> + “position” defined in the decomposition prohibition pattern.
A set of the attribute {position} registered in the decomposition prohibition pattern is defined as one of the output words (21-2).

【００３５】以上のようにして「前進中間位置」という
入力語２０が、「前進」＋「中間位置」という２つの出
力語２１−，２１−２に分解され、最終的に、属性と合
わせて上位処理系１５へ返される。As described above, the input word 20 of "forward intermediate position" is decomposed into two output words 21- and 21-2 of "forward" + "intermediate position". It is returned to the host processing system 15.

【００３６】この様に本実施例によれば、用語の分解禁
止パタンと分解可能パタンを参照して入力語の分解を行
うようにしたので、用語辞書１２には基本となる用語の
みを記憶しておくだけで用語辞書１２には直接記憶され
ていない入力語まで分解することができ、記憶させる用
語の数を少なくすることができ、この結果、用語辞書１
２への用語の登録および管理が行い易くなり、労力の軽
減が図られる。As described above, according to the present embodiment, the input words are decomposed with reference to the term prohibition pattern and the decomposable pattern, so that only the basic terms are stored in the term dictionary 12. By simply storing the input words, the input words not directly stored in the term dictionary 12 can be decomposed, and the number of terms to be stored can be reduced. As a result, the term dictionary 1
2 can be easily registered and managed, and labor can be reduced.

【００３７】また、実際に分解すべき語を含んでいる長
い文字列が用語辞書１２に登録されることが少なくなる
ので、辞書登録されている長い文字列と入力語の部分文
字列とが一致することはほとんど無くなり、しかもその
様な一致があったとしても分解禁止パタンによって分解
が確実に禁止され、正確に分解できる。Further, since a long character string containing a word to be actually decomposed is less likely to be registered in the term dictionary 12, the long character string registered in the dictionary matches the partial character string of the input word. This is almost impossible, and even if there is such a coincidence, the decomposition is strictly prohibited by the decomposition prohibition pattern, and accurate decomposition can be performed.

【００３８】なお、本発明は上記一実施例に限定される
ものではなく、システム仕様の用語分解を必要とする用
途であればプログラム自動作成装置以外にも適用するこ
とができる。It should be noted that the present invention is not limited to the above-described embodiment, but can be applied to other than an automatic program creating device as long as the application requires term decomposition of system specifications.

【００３９】[0039]

【発明の効果】以上詳記したように本発明によれば、用
語辞書に登録すべき用語の数を大幅に減少できて労力の
軽減を図ることができ、かつ入力語を正確に分解できて
装置の信頼性を向上し得る用語分解装置を提供できる。As described in detail above, according to the present invention, the number of terms to be registered in the term dictionary can be greatly reduced, labor can be reduced, and input words can be accurately decomposed. A term decomposition device capable of improving the reliability of the device can be provided.

[Brief description of the drawings]

【図１】本発明の一実施例に係る用語分解装置の概略構
成を示すブロック図。FIG. 1 is a block diagram showing a schematic configuration of a term decomposition apparatus according to an embodiment of the present invention.

【図２】一実施例に係る用語分解装置に備えられた分解
禁止パタン記憶手段及び分解可能パタン記憶手段の構成
図。FIG. 2 is a configuration diagram of a decomposition prohibition pattern storage unit and a decomposable pattern storage unit provided in the term decomposition device according to one embodiment.

【図３】一実施例に係る用語分解装置の用語分解処理に
関する動作説明図。FIG. 3 is an operation explanatory diagram relating to term decomposition processing of the term decomposition device according to one embodiment;

【図４】一実施例に係る用語分解装置のさらに詳細な動
作説明図。FIG. 4 is a more detailed operation explanatory diagram of the term decomposition apparatus according to the embodiment;

【図５】一実施例に係る用語分解装置の用語分解処理を
具体的なデータを用いて説明するための図。FIG. 5 is a view for explaining term decomposition processing of the term decomposition apparatus according to one embodiment using specific data.

【図６】従来より在る用語分解装置の機能ブロック図。FIG. 6 is a functional block diagram of a conventional term decomposition apparatus.

【図７】図６に示す用語分解装置の動作説明図。FIG. 7 is an explanatory diagram of the operation of the term decomposition device shown in FIG. 6;

[Explanation of symbols]

１０…用語分解装置、１１…用語分解装置本体、１２…
用語辞書、１３…分解禁止パタン、１４…分解可能パタ
ン、１５…上位処理系、１６…システム仕様入力手段、
１７…プログラム。10: Term decomposition device, 11: Term decomposition device body, 12:
Term dictionary, 13: disassembly prohibition pattern, 14: disassembly pattern, 15: host processing system, 16: system specification input means,
17 Program.

Claims

(57) [Claims]

1. A system according to claim 1, further comprising: a term dictionary storing terms used for expressing a system specification corresponding to a request of a target system and attributes thereof, wherein a system specification expressed by combining said terms is used as an input word. Given, in a term decomposer that decomposes the input word into predetermined units based on the term dictionary, a decomposition prohibition pattern storage means storing a decomposition prohibition pattern that defines an arrangement rule of terms that should not be decomposed, Decomposable pattern storage means that includes a term array defined by the decomposition prohibition pattern, but stores a decomposable pattern that defines an arrangement rule of terms that can be decomposed by being combined with a predetermined term, A term decomposer for decomposing the input word with reference to the decomposable pattern and the decomposable pattern.