JPH01189767A

JPH01189767A - Parallel sentence recognition system

Info

Publication number: JPH01189767A
Application number: JP63015124A
Authority: JP
Inventors: Hiroshi Suematsu; 末松　博
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1988-01-25
Filing date: 1988-01-25
Publication date: 1989-07-28
Anticipated expiration: 2014-12-13
Also published as: JP2989823B2

Abstract

PURPOSE:To reduce comparison circuits and to efficiently recognize a sentence form by setting the finite number of positions, analyzing a syntax and checking only one position to a candidate throughout plural sentence forms in parallel. CONSTITUTION:An input sentence is initially converted to a slot table represented by the finite number of the positions through a sentence form extracting means and a slot table forming means or the like in the analysis of a morphological element. Then, in the analysis of the syntax, a candidate obtained through a sentence form constituting element candidate detecting means and a slot table operating means or the like is inserted into one position in the slot table and compared with the slot condition of all the probable sentence forms. Consequently, they coincide, it is recognized as the sentence form constituting element and the coinciding slot is buried. Subsequently, these processes are repeated to produce no sentence form constituting element, and then, the sentence from having no idle slot but having the largest number of buried slots is recognized as the sentence form of the declinable parts of speech of the input sentence. Thereby, the number of times comparison is comparatively reduced to efficiently recognize the sentence form.

Description

【発明の詳細な説明】（産業上の利用分野）本発明は文型認識方式に関するものである。[Detailed description of the invention] (Industrial application field) The present invention relates to a sentence pattern recognition method.

（従来の技術）従来の文型認識方式としては以下に述べる２つのものが
知られている。１つの方式では、入力文における用言が
取り得る文型の中から一つをその用言の文型であると仮
定し、その仮定のもとで、該入力文の解析を進めている
。そして現れるべき文型構成要素が出現しないこと等に
より解析失敗が検出された段階で、バックトラックして
他の文型を一つ選び、失敗が検出されない文型が得られ
るまでこの処理を繰り返し、失敗が検出されなかった文
型を該入力文の用言の文型と認定している。(Prior Art) The following two methods are known as conventional sentence pattern recognition methods. In one method, one of the possible sentence patterns of a predicate in an input sentence is assumed to be the sentence pattern of the predicate, and the input sentence is analyzed based on that assumption. When a parsing failure is detected due to the failure of a sentence structure component that should appear, backtrack and select one of the other sentence patterns, and repeat this process until a sentence pattern in which no failure is detected is detected. The sentence pattern that is not recognized is recognized as the sentence pattern of the predicate of the input sentence.

また別の方式では、入力文における用言の取り得る全て
の文型を行として並列表現し、かつ、その構成文法格を
スロワ１へ条件として表現したテーブルを用意する。そ
して該入力文中における文型構成要素の候補の素性を、
ある文型が持つ全スロワ１〜条件に対し順に合致するま
で比較してゆき、合致したスロットを埋め、この操作を
最初から最後の文型まで順番に繰り返し、空きスロワＩ
〜の無い最多一致した文型を該入力文の文型と認定して
いる。In another method, a table is prepared in which all possible sentence patterns of the predicate in the input sentence are expressed in parallel as rows, and their constituent grammatical cases are expressed as conditions for thrower 1. Then, the features of the candidate sentence pattern components in the input sentence are
Compare all slots 1 to conditions of a certain sentence pattern in order until they match, fill in the matching slots, repeat this operation in order from the first to the last sentence pattern, and fill in the empty slots I.
The sentence pattern that matches the most without ... is recognized as the sentence pattern of the input sentence.

（発明が解決しようとする問題点）上に述べた文型認識方式の前者の方式は、可能な文型の
中から認定される文型か、仮定された順序に依存するた
め、最多要素の文型を優先させようとしてそれから解析
を試ると、最多要素のものが必ずしも頻度の高いもので
はないのでバックトラックの回数が多くなる。また、頻
度順に解析を試ると、本来文型構成要素と認識されるべ
きものが、認識されずに解析がそのまま成功してしまう
という問題があった。(Problem to be solved by the invention) The former sentence pattern recognition method described above depends on the sentence pattern recognized from among possible sentence patterns or on the assumed order, so it gives priority to the sentence pattern with the largest number of elements. If you try to analyze it after that, you will end up backtracking a lot because the elements with the most number of elements are not necessarily the most frequent ones. Furthermore, if analysis is attempted in order of frequency, there is a problem in that elements that should originally be recognized as sentence pattern constituents are not recognized and the analysis is successful.

また後者の方式においては、前者における問題は存在し
ないが、候補の素性とスロワ１へ条件との比較回数が極
めて多いという問題がある。−回のテーブル操作におい
て、その比較回数は、概算すると文型数の数倍になり、
処理工数、時間が極めてかかるという問題があった。Although the latter method does not have the same problems as the former, there is a problem in that the candidate's background and the conditions for thrower 1 are compared extremely many times. - In table operations, the number of comparisons is roughly several times the number of sentence patterns,
There was a problem that the processing time and man-hours were extremely long.

本発明の目的は、このような欠点を除去せしめて、比較
回数を減らし、効率的に文型を認識する方式を提供する
ことにある。An object of the present invention is to provide a method for eliminating such drawbacks, reducing the number of comparisons, and efficiently recognizing sentence patterns.

（問題点を解決するための手段）本発明の文型認識方式においては、文型を構成する文法
格を用言との位置関係や表層文法格により整理した有限
個のポジションにおけるスロワ１へ条件として表現し、
候補に対し、一つのポジションのみを複数の文型に渡っ
て並列にチエツクすることを特徴としている。(Means for Solving the Problems) In the sentence pattern recognition method of the present invention, grammatical cases constituting a sentence pattern are expressed as conditions for thrower 1 in a finite number of positions organized by positional relationships with predicates and surface grammatical cases. death,
It is characterized by checking only one position of candidates in parallel across multiple sentence patterns.

（作用）本発明においては、有限個のポジションを設定すること
により、候補の素性との比較が、特定のポジション以外
に対しては不要となり、比較回数が従来技術の数分の１
となり、処理工数の削減、処理速度の向上につながる。(Operation) In the present invention, by setting a finite number of positions, comparison with the candidate's background is unnecessary for positions other than specific positions, and the number of comparisons is reduced to a fraction of that of the prior art.
This leads to a reduction in processing man-hours and an improvement in processing speed.

（実施例）次に第１図から第３図を参照して本発明の実施例につい
て説明する。(Example) Next, an example of the present invention will be described with reference to FIGS. 1 to 3.

第１図は、本発明を利用した自然言語解析手法のフロー
チャー１へである。入力文は、形態素解析により、辞書
を用いながら、意味の最小単位である単語に分割され、
各単語に可能な品詞が付与される。用言の可能性のある
単語に対しては、その用言が取り得る文型が全て抽出さ
れる。抽出された文型情報は、スロットテーブルのテン
プレートを使用して、有限個のポジションで表現された
スロットテーブルに変換される。形態素解析結果は、構
文解析において、文型の構成要素に成り得る大きさにま
でまとめ上げられ、その素性が、スロットテーブル操作
関数を用いて、スロットテーブルにおける一つのポジシ
ョンに挿入され、全ての可能な文型のスロット条件と比
較される。合致するものが在れば、候補は文型構成要素
と認定され、合致したスロットを持つ文型のみが適格な
ものとして残され、合致したスロットは埋められる。文
型構成要素候補が現れ得る間は、このプロセスが繰り返
されるか、現れ得ない状態になった場合、空きスロット
の無い文型で、埋まったスロット数の最も多いものが入
力文の用言の文型と認定される。FIG. 1 is a flowchart 1 of a natural language analysis method using the present invention. The input sentence is divided into words, which are the smallest units of meaning, using a dictionary through morphological analysis.
Possible parts of speech are assigned to each word. For a word that may be a predicate, all possible sentence patterns of the predicate are extracted. The extracted sentence pattern information is converted into a slot table expressed by a finite number of positions using a slot table template. The morphological analysis results are summarized in the syntactic analysis to a size that can be used as the constituent elements of a sentence pattern, and the features are inserted into one position in the slot table using the slot table manipulation function, and all possible It is compared with the sentence pattern slot condition. If there is a match, the candidate is recognized as a sentence pattern component, and only sentence patterns with matching slots are left as eligible, and the matching slots are filled. This process is repeated as long as sentence structure component candidates can appear, or if they cannot appear, the sentence pattern with the largest number of filled slots without empty slots is determined to be the sentence pattern of the predicate in the input sentence. Certified.

第２図は用言の文型情報からスロットテーブルへの変換
例を示している。同図（ａ）は英語の例であり、同図（
ｂ）には日本語の例が示されている。英語の例において
は、ポジションを用言との相対位置により設定している
。主語（ＳＵＢ）はポジション１に設定され、用言の直
後の補語（ＣＯＭＰ）、直接目的語（ＤＯ’Ｂ）、間接
目的語（’　Ｊ　ＯＢ　）はポジション２に設定される
。さらに後ろの要素は順にポジション３、ポジション４
に設定される。FIG. 2 shows an example of conversion from sentence pattern information of predicates to a slot table. Figure (a) is an example of English, and figure (a) is an example of English.
b) shows an example in Japanese. In the English example, the position is set by the relative position to the predicate. The subject (SUB) is set in position 1, and the complement (COMP), direct object (DO'B), and indirect object ('JOB) immediately after the predicate are set in position 2. The elements further back are in position 3 and position 4 in order.
is set to

日本語の例では、ポジションを表層文法格により設定し
ている。文型がある種のシフトを受けた後に助詞が「が
」となる文法格はポジションＧＡに設定され、助詞が１
を」になるものはポジションｗｏに設定される。In the Japanese example, positions are determined by surface grammatical cases. Grammatical cases in which the particle becomes ``ga'' after the sentence pattern undergoes a certain shift are set in position GA, and the particle becomes 1.
” is set to position wo.

ポジションの選択は、第３図＜　ａ−＞に示されるよう
に、英語においてはポジション１から順にポジション２
、ポジショジ３というように選択される。日本語におい
ては、第３図（ｂ）に示すとおり文型構成要素の候補の
助詞の綴りを見てボジションが選択される。As shown in Figure 3 <a->, in English the positions are selected from position 1 to position 2.
, position 3, and so on. In Japanese, as shown in FIG. 3(b), a position is selected by looking at the spelling of the particle of a candidate sentence structure element.

[Brief explanation of the drawing]

第１図は、本発明の文並列認識方式を用いた文解析方式
の例を示すフローチャート、第２図（ａ＞、（ｂ）は、
用言の文型情報からポジションを利用したスロットテー
ブルの変換例、第３図（ａ）、（ｂ）は、スロットテー
ブル上のポジションの選択法の例である。FIG. 1 is a flowchart showing an example of a sentence analysis method using the sentence parallel recognition method of the present invention, and FIG. 2 (a>, (b))
FIGS. 3(a) and 3(b), an example of converting a slot table using positions from sentence pattern information of predicates, are examples of how to select positions on the slot table.

Claims

[Claims]

In a natural language processing system that recognizes sentence patterns, for a predicate in an input sentence, all possible sentence patterns of the predicate are expressed in parallel as lines, and the constituent grammatical cases are expressed in terms of their positional relationship with the predicate and surface layer. Prepare a table expressed as slot conditions in a finite number of positions organized by grammatical case,
For candidates of sentence pattern constituents in the input sentence, select one position in the table and compare the features of the candidate with the slot conditions at the position of all possible sentence patterns, and if there is a slot that matches the condition. A sentence pattern recognition method characterized in that a plurality of sentence patterns are processed in parallel by making only sentence patterns having the slot eligible.