JPH01189767A - Parallel sentence recognition system - Google Patents

Parallel sentence recognition system

Info

Publication number
JPH01189767A
JPH01189767A JP63015124A JP1512488A JPH01189767A JP H01189767 A JPH01189767 A JP H01189767A JP 63015124 A JP63015124 A JP 63015124A JP 1512488 A JP1512488 A JP 1512488A JP H01189767 A JPH01189767 A JP H01189767A
Authority
JP
Japan
Prior art keywords
sentence
slot
candidate
sentence form
slot table
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP63015124A
Other languages
Japanese (ja)
Other versions
JP2989823B2 (en
Inventor
Hiroshi Suematsu
末松 博
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to JP63015124A priority Critical patent/JP2989823B2/en
Publication of JPH01189767A publication Critical patent/JPH01189767A/en
Application granted granted Critical
Publication of JP2989823B2 publication Critical patent/JP2989823B2/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Landscapes

  • Machine Translation (AREA)

Abstract

PURPOSE:To reduce comparison circuits and to efficiently recognize a sentence form by setting the finite number of positions, analyzing a syntax and checking only one position to a candidate throughout plural sentence forms in parallel. CONSTITUTION:An input sentence is initially converted to a slot table represented by the finite number of the positions through a sentence form extracting means and a slot table forming means or the like in the analysis of a morphological element. Then, in the analysis of the syntax, a candidate obtained through a sentence form constituting element candidate detecting means and a slot table operating means or the like is inserted into one position in the slot table and compared with the slot condition of all the probable sentence forms. Consequently, they coincide, it is recognized as the sentence form constituting element and the coinciding slot is buried. Subsequently, these processes are repeated to produce no sentence form constituting element, and then, the sentence from having no idle slot but having the largest number of buried slots is recognized as the sentence form of the declinable parts of speech of the input sentence. Thereby, the number of times comparison is comparatively reduced to efficiently recognize the sentence form.

Description

【発明の詳細な説明】 (産業上の利用分野) 本発明は文型認識方式に関するものである。[Detailed description of the invention] (Industrial application field) The present invention relates to a sentence pattern recognition method.

(従来の技術) 従来の文型認識方式としては以下に述べる2つのものが
知られている。1つの方式では、入力文における用言が
取り得る文型の中から一つをその用言の文型であると仮
定し、その仮定のもとで、該入力文の解析を進めている
。そして現れるべき文型構成要素が出現しないこと等に
より解析失敗が検出された段階で、バックトラックして
他の文型を一つ選び、失敗が検出されない文型が得られ
るまでこの処理を繰り返し、失敗が検出されなかった文
型を該入力文の用言の文型と認定している。
(Prior Art) The following two methods are known as conventional sentence pattern recognition methods. In one method, one of the possible sentence patterns of a predicate in an input sentence is assumed to be the sentence pattern of the predicate, and the input sentence is analyzed based on that assumption. When a parsing failure is detected due to the failure of a sentence structure component that should appear, backtrack and select one of the other sentence patterns, and repeat this process until a sentence pattern in which no failure is detected is detected. The sentence pattern that is not recognized is recognized as the sentence pattern of the predicate of the input sentence.

また別の方式では、入力文における用言の取り得る全て
の文型を行として並列表現し、かつ、その構成文法格を
スロワ1へ条件として表現したテーブルを用意する。そ
して該入力文中における文型構成要素の候補の素性を、
ある文型が持つ全スロワ1〜条件に対し順に合致するま
で比較してゆき、合致したスロットを埋め、この操作を
最初から最後の文型まで順番に繰り返し、空きスロワI
〜の無い最多一致した文型を該入力文の文型と認定して
いる。
In another method, a table is prepared in which all possible sentence patterns of the predicate in the input sentence are expressed in parallel as rows, and their constituent grammatical cases are expressed as conditions for thrower 1. Then, the features of the candidate sentence pattern components in the input sentence are
Compare all slots 1 to conditions of a certain sentence pattern in order until they match, fill in the matching slots, repeat this operation in order from the first to the last sentence pattern, and fill in the empty slots I.
The sentence pattern that matches the most without ... is recognized as the sentence pattern of the input sentence.

(発明が解決しようとする問題点) 上に述べた文型認識方式の前者の方式は、可能な文型の
中から認定される文型か、仮定された順序に依存するた
め、最多要素の文型を優先させようとしてそれから解析
を試ると、最多要素のものが必ずしも頻度の高いもので
はないのでバックトラックの回数が多くなる。また、頻
度順に解析を試ると、本来文型構成要素と認識されるべ
きものが、認識されずに解析がそのまま成功してしまう
という問題があった。
(Problem to be solved by the invention) The former sentence pattern recognition method described above depends on the sentence pattern recognized from among possible sentence patterns or on the assumed order, so it gives priority to the sentence pattern with the largest number of elements. If you try to analyze it after that, you will end up backtracking a lot because the elements with the most number of elements are not necessarily the most frequent ones. Furthermore, if analysis is attempted in order of frequency, there is a problem in that elements that should originally be recognized as sentence pattern constituents are not recognized and the analysis is successful.

また後者の方式においては、前者における問題は存在し
ないが、候補の素性とスロワ1へ条件との比較回数が極
めて多いという問題がある。−回のテーブル操作におい
て、その比較回数は、概算すると文型数の数倍になり、
処理工数、時間が極めてかかるという問題があった。
Although the latter method does not have the same problems as the former, there is a problem in that the candidate's background and the conditions for thrower 1 are compared extremely many times. - In table operations, the number of comparisons is roughly several times the number of sentence patterns,
There was a problem that the processing time and man-hours were extremely long.

本発明の目的は、このような欠点を除去せしめて、比較
回数を減らし、効率的に文型を認識する方式を提供する
ことにある。
An object of the present invention is to provide a method for eliminating such drawbacks, reducing the number of comparisons, and efficiently recognizing sentence patterns.

(問題点を解決するための手段) 本発明の文型認識方式においては、文型を構成する文法
格を用言との位置関係や表層文法格により整理した有限
個のポジションにおけるスロワ1へ条件として表現し、
候補に対し、一つのポジションのみを複数の文型に渡っ
て並列にチエツクすることを特徴としている。
(Means for Solving the Problems) In the sentence pattern recognition method of the present invention, grammatical cases constituting a sentence pattern are expressed as conditions for thrower 1 in a finite number of positions organized by positional relationships with predicates and surface grammatical cases. death,
It is characterized by checking only one position of candidates in parallel across multiple sentence patterns.

(作用) 本発明においては、有限個のポジションを設定すること
により、候補の素性との比較が、特定のポジション以外
に対しては不要となり、比較回数が従来技術の数分の1
となり、処理工数の削減、処理速度の向上につながる。
(Operation) In the present invention, by setting a finite number of positions, comparison with the candidate's background is unnecessary for positions other than specific positions, and the number of comparisons is reduced to a fraction of that of the prior art.
This leads to a reduction in processing man-hours and an improvement in processing speed.

(実施例) 次に第1図から第3図を参照して本発明の実施例につい
て説明する。
(Example) Next, an example of the present invention will be described with reference to FIGS. 1 to 3.

第1図は、本発明を利用した自然言語解析手法のフロー
チャー1へである。入力文は、形態素解析により、辞書
を用いながら、意味の最小単位である単語に分割され、
各単語に可能な品詞が付与される。用言の可能性のある
単語に対しては、その用言が取り得る文型が全て抽出さ
れる。抽出された文型情報は、スロットテーブルのテン
プレートを使用して、有限個のポジションで表現された
スロットテーブルに変換される。形態素解析結果は、構
文解析において、文型の構成要素に成り得る大きさにま
でまとめ上げられ、その素性が、スロットテーブル操作
関数を用いて、スロットテーブルにおける一つのポジシ
ョンに挿入され、全ての可能な文型のスロット条件と比
較される。合致するものが在れば、候補は文型構成要素
と認定され、合致したスロットを持つ文型のみが適格な
ものとして残され、合致したスロットは埋められる。文
型構成要素候補が現れ得る間は、このプロセスが繰り返
されるか、現れ得ない状態になった場合、空きスロット
の無い文型で、埋まったスロット数の最も多いものが入
力文の用言の文型と認定される。
FIG. 1 is a flowchart 1 of a natural language analysis method using the present invention. The input sentence is divided into words, which are the smallest units of meaning, using a dictionary through morphological analysis.
Possible parts of speech are assigned to each word. For a word that may be a predicate, all possible sentence patterns of the predicate are extracted. The extracted sentence pattern information is converted into a slot table expressed by a finite number of positions using a slot table template. The morphological analysis results are summarized in the syntactic analysis to a size that can be used as the constituent elements of a sentence pattern, and the features are inserted into one position in the slot table using the slot table manipulation function, and all possible It is compared with the sentence pattern slot condition. If there is a match, the candidate is recognized as a sentence pattern component, and only sentence patterns with matching slots are left as eligible, and the matching slots are filled. This process is repeated as long as sentence structure component candidates can appear, or if they cannot appear, the sentence pattern with the largest number of filled slots without empty slots is determined to be the sentence pattern of the predicate in the input sentence. Certified.

第2図は用言の文型情報からスロットテーブルへの変換
例を示している。同図(a)は英語の例であり、同図(
b)には日本語の例が示されている。英語の例において
は、ポジションを用言との相対位置により設定している
。主語(SUB)はポジション1に設定され、用言の直
後の補語(COMP)、直接目的語(DO’B)、間接
目的語(’ J OB )はポジション2に設定される
。さらに後ろの要素は順にポジション3、ポジション4
に設定される。
FIG. 2 shows an example of conversion from sentence pattern information of predicates to a slot table. Figure (a) is an example of English, and figure (a) is an example of English.
b) shows an example in Japanese. In the English example, the position is set by the relative position to the predicate. The subject (SUB) is set in position 1, and the complement (COMP), direct object (DO'B), and indirect object ('JOB) immediately after the predicate are set in position 2. The elements further back are in position 3 and position 4 in order.
is set to

日本語の例では、ポジションを表層文法格により設定し
ている。文型がある種のシフトを受けた後に助詞が「が
」となる文法格はポジションGAに設定され、助詞が1
を」になるものはポジションwoに設定される。
In the Japanese example, positions are determined by surface grammatical cases. Grammatical cases in which the particle becomes ``ga'' after the sentence pattern undergoes a certain shift are set in position GA, and the particle becomes 1.
” is set to position wo.

ポジションの選択は、第3図< a−>に示されるよう
に、英語においてはポジション1から順にポジション2
、ポジショジ3というように選択される。日本語におい
ては、第3図(b)に示すとおり文型構成要素の候補の
助詞の綴りを見てボジションが選択される。
As shown in Figure 3 <a->, in English the positions are selected from position 1 to position 2.
, position 3, and so on. In Japanese, as shown in FIG. 3(b), a position is selected by looking at the spelling of the particle of a candidate sentence structure element.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は、本発明の文並列認識方式を用いた文解析方式
の例を示すフローチャート、第2図(a>、(b)は、
用言の文型情報からポジションを利用したスロットテー
ブルの変換例、第3図(a)、(b)は、スロットテー
ブル上のポジションの選択法の例である。
FIG. 1 is a flowchart showing an example of a sentence analysis method using the sentence parallel recognition method of the present invention, and FIG. 2 (a>, (b))
FIGS. 3(a) and 3(b), an example of converting a slot table using positions from sentence pattern information of predicates, are examples of how to select positions on the slot table.

Claims (1)

【特許請求の範囲】[Claims] 文型を認識する自然言語処理システムにおいて、入力文
中の用言に対し、該用言が取り得る全ての文型を行とし
て並列表現し、かつ、その構成文法格を、用言との位置
関係や表層文法格により整理した有限個のポジションに
おけるスロット条件として表現したテーブルを用意し、
該入力文中における文型構成要素の候補に対し、該候補
の素性を、該テーブルのポジションを一つ選択して、全
ての可能な文型の該ポジションにおけるスロット条件と
比較し、条件の合うスロットが存在する場合に、該スロ
ットを有する文型のみを適格とすることにより、複数の
文型を並列に処理することを特徴とする文型認識方式。
In a natural language processing system that recognizes sentence patterns, for a predicate in an input sentence, all possible sentence patterns of the predicate are expressed in parallel as lines, and the constituent grammatical cases are expressed in terms of their positional relationship with the predicate and surface layer. Prepare a table expressed as slot conditions in a finite number of positions organized by grammatical case,
For candidates of sentence pattern constituents in the input sentence, select one position in the table and compare the features of the candidate with the slot conditions at the position of all possible sentence patterns, and if there is a slot that matches the condition. A sentence pattern recognition method characterized in that a plurality of sentence patterns are processed in parallel by making only sentence patterns having the slot eligible.
JP63015124A 1988-01-25 1988-01-25 Sentence pattern recognition method Expired - Lifetime JP2989823B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP63015124A JP2989823B2 (en) 1988-01-25 1988-01-25 Sentence pattern recognition method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP63015124A JP2989823B2 (en) 1988-01-25 1988-01-25 Sentence pattern recognition method

Publications (2)

Publication Number Publication Date
JPH01189767A true JPH01189767A (en) 1989-07-28
JP2989823B2 JP2989823B2 (en) 1999-12-13

Family

ID=11880073

Family Applications (1)

Application Number Title Priority Date Filing Date
JP63015124A Expired - Lifetime JP2989823B2 (en) 1988-01-25 1988-01-25 Sentence pattern recognition method

Country Status (1)

Country Link
JP (1) JP2989823B2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108241609A (en) * 2016-12-23 2018-07-03 科大讯飞股份有限公司 The recognition methods of parallelism sentence and system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5840684A (en) * 1981-09-04 1983-03-09 Hitachi Ltd Automatic translating system between natural languages
JPS62267872A (en) * 1986-05-16 1987-11-20 Ricoh Co Ltd Language analyzing device

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5840684A (en) * 1981-09-04 1983-03-09 Hitachi Ltd Automatic translating system between natural languages
JPS62267872A (en) * 1986-05-16 1987-11-20 Ricoh Co Ltd Language analyzing device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108241609A (en) * 2016-12-23 2018-07-03 科大讯飞股份有限公司 The recognition methods of parallelism sentence and system
CN108241609B (en) * 2016-12-23 2022-02-01 科大讯飞股份有限公司 Ranking sentence identification method and system

Also Published As

Publication number Publication date
JP2989823B2 (en) 1999-12-13

Similar Documents

Publication Publication Date Title
US6169999B1 (en) Dictionary and index creating system and document retrieval system
EP0971294A2 (en) Method and apparatus for automated search and retrieval processing
CN102915299A (en) Word segmentation method and device
CN111079408B (en) Language identification method, device, equipment and storage medium
US6999917B1 (en) Left-corner chart parsing system
US5283737A (en) Mechanism for generating linguistic expressions based on synonyms and rules derived from examples
JPS58114225A (en) &#34;kana&#34; (japanese syllabary) and chinese character convert-inputting system
CN109800430B (en) Semantic understanding method and system
JPS63244259A (en) Keyword extractor
JPH01189767A (en) Parallel sentence recognition system
Hollingsworth et al. Retrieving hierarchical text structure from typeset scientific articles–a prerequisite for e-science text mining
Sagot Building a morphosyntactic lexicon and a pre-syntactic processing chain for Polish
JPH0877196A (en) Extracting device for document information
JPH0619962A (en) Text dividing device
JP2989824B2 (en) Sentence pattern / grammar recognition method
JP6665029B2 (en) Language analysis device, language analysis method, and program
JPS63228326A (en) Automatic key word extracting system
JPS6368972A (en) Unregistered word processing system
JP2713353B2 (en) Knowledge extraction device
CN114328970A (en) Triple extraction method, equipment and computer storage medium
JPH0756919A (en) Japanese analysis method
JPH03125264A (en) Key word extracting device
JPH05143351A (en) Source program comparing system
JPH08263493A (en) Knowledge acquiring method
JPH0785041A (en) Relational conception extracting device

Legal Events

Date Code Title Description
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20071008

Year of fee payment: 8

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20081008

Year of fee payment: 9

EXPY Cancellation because of completion of term
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20081008

Year of fee payment: 9