JPH01189767A - Parallel sentence recognition system - Google Patents
Parallel sentence recognition systemInfo
- Publication number
- JPH01189767A JPH01189767A JP63015124A JP1512488A JPH01189767A JP H01189767 A JPH01189767 A JP H01189767A JP 63015124 A JP63015124 A JP 63015124A JP 1512488 A JP1512488 A JP 1512488A JP H01189767 A JPH01189767 A JP H01189767A
- Authority
- JP
- Japan
- Prior art keywords
- sentence
- slot
- candidate
- sentence form
- slot table
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 239000000470 constituent Substances 0.000 claims description 5
- 238000012567 pattern recognition method Methods 0.000 claims description 5
- 238000003058 natural language processing Methods 0.000 claims 1
- 239000002344 surface layer Substances 0.000 claims 1
- 238000004458 analytical method Methods 0.000 abstract description 9
- 238000000034 method Methods 0.000 abstract description 9
- 230000000877 morphologic effect Effects 0.000 abstract description 3
- 239000002245 particle Substances 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
Landscapes
- Machine Translation (AREA)
Abstract
Description
【発明の詳細な説明】 (産業上の利用分野) 本発明は文型認識方式に関するものである。[Detailed description of the invention] (Industrial application field) The present invention relates to a sentence pattern recognition method.
(従来の技術)
従来の文型認識方式としては以下に述べる2つのものが
知られている。1つの方式では、入力文における用言が
取り得る文型の中から一つをその用言の文型であると仮
定し、その仮定のもとで、該入力文の解析を進めている
。そして現れるべき文型構成要素が出現しないこと等に
より解析失敗が検出された段階で、バックトラックして
他の文型を一つ選び、失敗が検出されない文型が得られ
るまでこの処理を繰り返し、失敗が検出されなかった文
型を該入力文の用言の文型と認定している。(Prior Art) The following two methods are known as conventional sentence pattern recognition methods. In one method, one of the possible sentence patterns of a predicate in an input sentence is assumed to be the sentence pattern of the predicate, and the input sentence is analyzed based on that assumption. When a parsing failure is detected due to the failure of a sentence structure component that should appear, backtrack and select one of the other sentence patterns, and repeat this process until a sentence pattern in which no failure is detected is detected. The sentence pattern that is not recognized is recognized as the sentence pattern of the predicate of the input sentence.
また別の方式では、入力文における用言の取り得る全て
の文型を行として並列表現し、かつ、その構成文法格を
スロワ1へ条件として表現したテーブルを用意する。そ
して該入力文中における文型構成要素の候補の素性を、
ある文型が持つ全スロワ1〜条件に対し順に合致するま
で比較してゆき、合致したスロットを埋め、この操作を
最初から最後の文型まで順番に繰り返し、空きスロワI
〜の無い最多一致した文型を該入力文の文型と認定して
いる。In another method, a table is prepared in which all possible sentence patterns of the predicate in the input sentence are expressed in parallel as rows, and their constituent grammatical cases are expressed as conditions for thrower 1. Then, the features of the candidate sentence pattern components in the input sentence are
Compare all slots 1 to conditions of a certain sentence pattern in order until they match, fill in the matching slots, repeat this operation in order from the first to the last sentence pattern, and fill in the empty slots I.
The sentence pattern that matches the most without ... is recognized as the sentence pattern of the input sentence.
(発明が解決しようとする問題点)
上に述べた文型認識方式の前者の方式は、可能な文型の
中から認定される文型か、仮定された順序に依存するた
め、最多要素の文型を優先させようとしてそれから解析
を試ると、最多要素のものが必ずしも頻度の高いもので
はないのでバックトラックの回数が多くなる。また、頻
度順に解析を試ると、本来文型構成要素と認識されるべ
きものが、認識されずに解析がそのまま成功してしまう
という問題があった。(Problem to be solved by the invention) The former sentence pattern recognition method described above depends on the sentence pattern recognized from among possible sentence patterns or on the assumed order, so it gives priority to the sentence pattern with the largest number of elements. If you try to analyze it after that, you will end up backtracking a lot because the elements with the most number of elements are not necessarily the most frequent ones. Furthermore, if analysis is attempted in order of frequency, there is a problem in that elements that should originally be recognized as sentence pattern constituents are not recognized and the analysis is successful.
また後者の方式においては、前者における問題は存在し
ないが、候補の素性とスロワ1へ条件との比較回数が極
めて多いという問題がある。−回のテーブル操作におい
て、その比較回数は、概算すると文型数の数倍になり、
処理工数、時間が極めてかかるという問題があった。Although the latter method does not have the same problems as the former, there is a problem in that the candidate's background and the conditions for thrower 1 are compared extremely many times. - In table operations, the number of comparisons is roughly several times the number of sentence patterns,
There was a problem that the processing time and man-hours were extremely long.
本発明の目的は、このような欠点を除去せしめて、比較
回数を減らし、効率的に文型を認識する方式を提供する
ことにある。An object of the present invention is to provide a method for eliminating such drawbacks, reducing the number of comparisons, and efficiently recognizing sentence patterns.
(問題点を解決するための手段)
本発明の文型認識方式においては、文型を構成する文法
格を用言との位置関係や表層文法格により整理した有限
個のポジションにおけるスロワ1へ条件として表現し、
候補に対し、一つのポジションのみを複数の文型に渡っ
て並列にチエツクすることを特徴としている。(Means for Solving the Problems) In the sentence pattern recognition method of the present invention, grammatical cases constituting a sentence pattern are expressed as conditions for thrower 1 in a finite number of positions organized by positional relationships with predicates and surface grammatical cases. death,
It is characterized by checking only one position of candidates in parallel across multiple sentence patterns.
(作用)
本発明においては、有限個のポジションを設定すること
により、候補の素性との比較が、特定のポジション以外
に対しては不要となり、比較回数が従来技術の数分の1
となり、処理工数の削減、処理速度の向上につながる。(Operation) In the present invention, by setting a finite number of positions, comparison with the candidate's background is unnecessary for positions other than specific positions, and the number of comparisons is reduced to a fraction of that of the prior art.
This leads to a reduction in processing man-hours and an improvement in processing speed.
(実施例)
次に第1図から第3図を参照して本発明の実施例につい
て説明する。(Example) Next, an example of the present invention will be described with reference to FIGS. 1 to 3.
第1図は、本発明を利用した自然言語解析手法のフロー
チャー1へである。入力文は、形態素解析により、辞書
を用いながら、意味の最小単位である単語に分割され、
各単語に可能な品詞が付与される。用言の可能性のある
単語に対しては、その用言が取り得る文型が全て抽出さ
れる。抽出された文型情報は、スロットテーブルのテン
プレートを使用して、有限個のポジションで表現された
スロットテーブルに変換される。形態素解析結果は、構
文解析において、文型の構成要素に成り得る大きさにま
でまとめ上げられ、その素性が、スロットテーブル操作
関数を用いて、スロットテーブルにおける一つのポジシ
ョンに挿入され、全ての可能な文型のスロット条件と比
較される。合致するものが在れば、候補は文型構成要素
と認定され、合致したスロットを持つ文型のみが適格な
ものとして残され、合致したスロットは埋められる。文
型構成要素候補が現れ得る間は、このプロセスが繰り返
されるか、現れ得ない状態になった場合、空きスロット
の無い文型で、埋まったスロット数の最も多いものが入
力文の用言の文型と認定される。FIG. 1 is a flowchart 1 of a natural language analysis method using the present invention. The input sentence is divided into words, which are the smallest units of meaning, using a dictionary through morphological analysis.
Possible parts of speech are assigned to each word. For a word that may be a predicate, all possible sentence patterns of the predicate are extracted. The extracted sentence pattern information is converted into a slot table expressed by a finite number of positions using a slot table template. The morphological analysis results are summarized in the syntactic analysis to a size that can be used as the constituent elements of a sentence pattern, and the features are inserted into one position in the slot table using the slot table manipulation function, and all possible It is compared with the sentence pattern slot condition. If there is a match, the candidate is recognized as a sentence pattern component, and only sentence patterns with matching slots are left as eligible, and the matching slots are filled. This process is repeated as long as sentence structure component candidates can appear, or if they cannot appear, the sentence pattern with the largest number of filled slots without empty slots is determined to be the sentence pattern of the predicate in the input sentence. Certified.
第2図は用言の文型情報からスロットテーブルへの変換
例を示している。同図(a)は英語の例であり、同図(
b)には日本語の例が示されている。英語の例において
は、ポジションを用言との相対位置により設定している
。主語(SUB)はポジション1に設定され、用言の直
後の補語(COMP)、直接目的語(DO’B)、間接
目的語(’ J OB )はポジション2に設定される
。さらに後ろの要素は順にポジション3、ポジション4
に設定される。FIG. 2 shows an example of conversion from sentence pattern information of predicates to a slot table. Figure (a) is an example of English, and figure (a) is an example of English.
b) shows an example in Japanese. In the English example, the position is set by the relative position to the predicate. The subject (SUB) is set in position 1, and the complement (COMP), direct object (DO'B), and indirect object ('JOB) immediately after the predicate are set in position 2. The elements further back are in position 3 and position 4 in order.
is set to
日本語の例では、ポジションを表層文法格により設定し
ている。文型がある種のシフトを受けた後に助詞が「が
」となる文法格はポジションGAに設定され、助詞が1
を」になるものはポジションwoに設定される。In the Japanese example, positions are determined by surface grammatical cases. Grammatical cases in which the particle becomes ``ga'' after the sentence pattern undergoes a certain shift are set in position GA, and the particle becomes 1.
” is set to position wo.
ポジションの選択は、第3図< a−>に示されるよう
に、英語においてはポジション1から順にポジション2
、ポジショジ3というように選択される。日本語におい
ては、第3図(b)に示すとおり文型構成要素の候補の
助詞の綴りを見てボジションが選択される。As shown in Figure 3 <a->, in English the positions are selected from position 1 to position 2.
, position 3, and so on. In Japanese, as shown in FIG. 3(b), a position is selected by looking at the spelling of the particle of a candidate sentence structure element.
第1図は、本発明の文並列認識方式を用いた文解析方式
の例を示すフローチャート、第2図(a>、(b)は、
用言の文型情報からポジションを利用したスロットテー
ブルの変換例、第3図(a)、(b)は、スロットテー
ブル上のポジションの選択法の例である。FIG. 1 is a flowchart showing an example of a sentence analysis method using the sentence parallel recognition method of the present invention, and FIG. 2 (a>, (b))
FIGS. 3(a) and 3(b), an example of converting a slot table using positions from sentence pattern information of predicates, are examples of how to select positions on the slot table.
Claims (1)
中の用言に対し、該用言が取り得る全ての文型を行とし
て並列表現し、かつ、その構成文法格を、用言との位置
関係や表層文法格により整理した有限個のポジションに
おけるスロット条件として表現したテーブルを用意し、
該入力文中における文型構成要素の候補に対し、該候補
の素性を、該テーブルのポジションを一つ選択して、全
ての可能な文型の該ポジションにおけるスロット条件と
比較し、条件の合うスロットが存在する場合に、該スロ
ットを有する文型のみを適格とすることにより、複数の
文型を並列に処理することを特徴とする文型認識方式。In a natural language processing system that recognizes sentence patterns, for a predicate in an input sentence, all possible sentence patterns of the predicate are expressed in parallel as lines, and the constituent grammatical cases are expressed in terms of their positional relationship with the predicate and surface layer. Prepare a table expressed as slot conditions in a finite number of positions organized by grammatical case,
For candidates of sentence pattern constituents in the input sentence, select one position in the table and compare the features of the candidate with the slot conditions at the position of all possible sentence patterns, and if there is a slot that matches the condition. A sentence pattern recognition method characterized in that a plurality of sentence patterns are processed in parallel by making only sentence patterns having the slot eligible.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP63015124A JP2989823B2 (en) | 1988-01-25 | 1988-01-25 | Sentence pattern recognition method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP63015124A JP2989823B2 (en) | 1988-01-25 | 1988-01-25 | Sentence pattern recognition method |
Publications (2)
Publication Number | Publication Date |
---|---|
JPH01189767A true JPH01189767A (en) | 1989-07-28 |
JP2989823B2 JP2989823B2 (en) | 1999-12-13 |
Family
ID=11880073
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP63015124A Expired - Lifetime JP2989823B2 (en) | 1988-01-25 | 1988-01-25 | Sentence pattern recognition method |
Country Status (1)
Country | Link |
---|---|
JP (1) | JP2989823B2 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108241609A (en) * | 2016-12-23 | 2018-07-03 | 科大讯飞股份有限公司 | The recognition methods of parallelism sentence and system |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5840684A (en) * | 1981-09-04 | 1983-03-09 | Hitachi Ltd | Automatic translating system between natural languages |
JPS62267872A (en) * | 1986-05-16 | 1987-11-20 | Ricoh Co Ltd | Language analyzing device |
-
1988
- 1988-01-25 JP JP63015124A patent/JP2989823B2/en not_active Expired - Lifetime
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5840684A (en) * | 1981-09-04 | 1983-03-09 | Hitachi Ltd | Automatic translating system between natural languages |
JPS62267872A (en) * | 1986-05-16 | 1987-11-20 | Ricoh Co Ltd | Language analyzing device |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108241609A (en) * | 2016-12-23 | 2018-07-03 | 科大讯飞股份有限公司 | The recognition methods of parallelism sentence and system |
CN108241609B (en) * | 2016-12-23 | 2022-02-01 | 科大讯飞股份有限公司 | Ranking sentence identification method and system |
Also Published As
Publication number | Publication date |
---|---|
JP2989823B2 (en) | 1999-12-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6169999B1 (en) | Dictionary and index creating system and document retrieval system | |
EP0971294A2 (en) | Method and apparatus for automated search and retrieval processing | |
CN102915299A (en) | Word segmentation method and device | |
CN111079408B (en) | Language identification method, device, equipment and storage medium | |
US6999917B1 (en) | Left-corner chart parsing system | |
US5283737A (en) | Mechanism for generating linguistic expressions based on synonyms and rules derived from examples | |
JPS58114225A (en) | "kana" (japanese syllabary) and chinese character convert-inputting system | |
CN109800430B (en) | Semantic understanding method and system | |
JPS63244259A (en) | Keyword extractor | |
JPH01189767A (en) | Parallel sentence recognition system | |
Hollingsworth et al. | Retrieving hierarchical text structure from typeset scientific articles–a prerequisite for e-science text mining | |
Sagot | Building a morphosyntactic lexicon and a pre-syntactic processing chain for Polish | |
JPH0877196A (en) | Extracting device for document information | |
JPH0619962A (en) | Text dividing device | |
JP2989824B2 (en) | Sentence pattern / grammar recognition method | |
JP6665029B2 (en) | Language analysis device, language analysis method, and program | |
JPS63228326A (en) | Automatic key word extracting system | |
JPS6368972A (en) | Unregistered word processing system | |
JP2713353B2 (en) | Knowledge extraction device | |
CN114328970A (en) | Triple extraction method, equipment and computer storage medium | |
JPH0756919A (en) | Japanese analysis method | |
JPH03125264A (en) | Key word extracting device | |
JPH05143351A (en) | Source program comparing system | |
JPH08263493A (en) | Knowledge acquiring method | |
JPH0785041A (en) | Relational conception extracting device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20071008 Year of fee payment: 8 |
|
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20081008 Year of fee payment: 9 |
|
EXPY | Cancellation because of completion of term | ||
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20081008 Year of fee payment: 9 |