JP2575125B2 - Language processor

Info

Publication number: JP2575125B2
Application number: JP62045337A
Authority: JP
Inventors: 稔藤田; 隆也上田
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 1987-03-02
Filing date: 1987-03-02
Publication date: 1997-01-22
Anticipated expiration: 2012-01-22
Also published as: JPS63213069A

Description

DETAILED DESCRIPTION OF THE INVENTION

[Industrial Application Field] The present invention relates to a language processing apparatus that processes information relating to languages such as Japanese and English.

[Prior Art] In natural language processing apparatuses generally, when a sentence, phrase, or the like contains a parallel expression, the conventional way of analyzing it has been to reduce the load of the analysis processing by a method such as having the range of the parallel expression marked explicitly.

[Problems to Be Solved by the Invention] However, a method such as explicitly marking the parallel range requires the person entering a sentence to input it with a clear awareness of its structure. Consequently, when many sentences are to be entered, handling the input sentences becomes inconvenient, and a skilled, dedicated operator is therefore required.

An object of the present invention is to eliminate the above drawback and to provide a language processing apparatus that can easily analyze a sentence containing a parallel expression that was input without the person entering it being conscious of the range of the parallelism.

[Means for Solving the Problems] To achieve this object, the present invention comprises: parsing means for parsing an input sentence; comparison means for comparing, in each interpretation candidate, the number of constituent elements of each of the paralleled parts when the result of the analysis by the parsing means contains a plurality of interpretation candidates that differ in the parts placed in parallel; and selection means for selecting the interpretation candidate for which the comparison by the comparison means shows a small difference in the number of constituent elements.

[Operation] According to the present invention, when the result of parsing an input sentence contains a plurality of interpretation candidates that differ in the parts placed in parallel, the number of constituent elements of each of the paralleled parts is compared in each interpretation candidate, and the interpretation candidate for which the comparison shows a small difference in the number of constituent elements is made the priority candidate.
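The comparison and selection described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the apparatus itself: the names `Candidate`, `imbalance`, and `select_candidate` are hypothetical, each candidate is assumed to be a pair of strings (the two paralleled parts), and the rule that the first candidate wins a tie is an assumption, since the text does not say how ties are handled.

```python
from typing import Callable, Sequence, Tuple

# One interpretation candidate: the two parts placed in parallel (assumed form).
Candidate = Tuple[str, str]


def imbalance(candidate: Candidate, count: Callable[[str], int]) -> int:
    """Absolute difference in the number of constituent elements of the two parts."""
    first, second = candidate
    return abs(count(first) - count(second))


def select_candidate(candidates: Sequence[Candidate],
                     count: Callable[[str], int] = len) -> Candidate:
    """Return the candidate with the smallest difference.

    Assumption: on a tie, the earliest candidate is kept (behavior of min()).
    """
    return min(candidates, key=lambda c: imbalance(c, count))
```

The `count` parameter is left pluggable because the description later allows the constituent elements to be characters, kanji, or words.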

[Embodiment] The present invention will now be described in detail with reference to the drawings.

FIG. 1 is a block diagram showing an embodiment of a language processing apparatus to which the present invention is applied. Reference numeral 1 denotes an input sentence holding unit that holds an input sentence. Reference numeral 2 denotes a syntax analysis unit that parses the sentence held in the input sentence holding unit 1, and 3 denotes a syntax analysis result holding unit that holds the result of the parsing by the syntax analysis unit 2. Reference numeral 4 denotes a parallel expression evaluation unit that evaluates the parallel expression portion of the result held in the syntax analysis result holding unit 3, and 5 denotes an evaluation result holding unit that holds the evaluation result of the parallel expression evaluation unit 4.

FIG. 2 illustrates the present invention more concretely. Reference numeral 21 denotes a control memory (hereinafter, PMEM) that stores a control procedure such as that shown in FIG. 3 (described later). The PMEM 21 may be a ROM (read-only memory) or a RAM (random access memory). Reference numeral 22 denotes a central processing unit (hereinafter, CPU) that performs logic, decision, and arithmetic operations in accordance with the control procedure stored in the PMEM 21.

Reference numeral 23 denotes a disk device that stores a dictionary and the like. Reference numeral 24 denotes a keyboard used as the device for entering input sentences, and 25 denotes a display serving as the output device for processed input sentences. Reference numeral 26 denotes a memory (hereinafter, MEM), which contains an input sentence holding unit IB, a syntax analysis result holding unit KB, and an evaluation result holding unit HB.

Next, the operation of the apparatus shown in FIG. 1 will be described with reference to the operation flowchart shown in FIG. 3. First, step S1 is repeated until an input sentence is held in the input sentence holding unit 1. Once an input sentence is held in the input sentence holding unit 1, the process moves to step S2, where parsing is performed. In step S2, the input sentence held in the input sentence holding unit 1 is parsed by the syntax analysis unit 2, and the result of the analysis is held in the syntax analysis result holding unit 3.

FIG. 4 shows an example in which a parallel expression is interpreted in two ways. When the input sentence contains a parallel expression such as 赤い鉛筆と安い定規 ("red pencil and cheap ruler"), as shown in the figure, the parallel expression portion is interpreted in the two ways shown, and both analysis results are held in the syntax analysis result holding unit 3. After the parsing results have been held in the syntax analysis result holding unit 3, the process moves to step S3, where the parallel expression evaluation is performed.
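The flow from step S1 to the hand-off to step S3 can be sketched as follows. This is a rough Python model under stated assumptions, not the control procedure stored in the PMEM: `LanguageProcessor`, `stub_parse`, and the attribute names are hypothetical, the parser is a hard-coded stand-in that only knows the FIG. 4 example, and the evaluation of step S3 is left as a placeholder callback.

```python
from collections import deque
from typing import Callable, Deque, List, Optional, Tuple

# One interpretation: the noun phrases before and after the particle と (assumed form).
Candidate = Tuple[str, str]


def stub_parse(sentence: str) -> List[Candidate]:
    """Hypothetical stand-in for syntax analysis unit 2 (not a real parser)."""
    if sentence == "赤い鉛筆と安い定規":
        return [("赤い鉛筆", "安い定規"), ("鉛筆", "安い定規")]
    return []


class LanguageProcessor:
    def __init__(self,
                 parse: Callable[[str], List[Candidate]],
                 evaluate: Callable[[List[Candidate]], Optional[Candidate]]):
        self.input_buffer: Deque[str] = deque()              # input sentence holding unit 1
        self.parse = parse                                   # syntax analysis unit 2
        self.parse_results: List[Candidate] = []             # syntax analysis result holding unit 3
        self.evaluate = evaluate                             # parallel expression evaluation unit 4
        self.evaluation_result: Optional[Candidate] = None   # evaluation result holding unit 5

    def step(self) -> bool:
        """Run S1 to S3 once; return False if no sentence was waiting at S1."""
        if not self.input_buffer:                            # S1: is an input sentence held?
            return False
        sentence = self.input_buffer.popleft()
        self.parse_results = self.parse(sentence)            # S2: parse and hold the results
        self.evaluation_result = self.evaluate(self.parse_results)  # S3: evaluate
        return True


# Placeholder evaluator: step S3 is not modeled in this sketch.
processor = LanguageProcessor(stub_parse, evaluate=lambda results: None)
processor.input_buffer.append("赤い鉛筆と安い定規")
processor.step()
print(processor.parse_results)
```

Where the flowchart repeats S1 until a sentence arrives, the sketch simply returns `False` so that the caller can poll again.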

In step S3, attention is directed to the parallel expression portion of the parsing results held in the syntax analysis result holding unit 3. That is, in the example shown in FIG. 3, attention is directed to the noun phrases before and after the particle と ("and"). For the two parallel readings (赤い鉛筆)と(安い定規), "(red pencil) and (cheap ruler)", and (鉛筆)と(安い定規), "(pencil) and (cheap ruler)", the numbers of characters making up the noun phrases before and after the particle と are compared. The reading with the smaller difference in character count is evaluated as sitting better, that is, as being better balanced between the two sides. The parsing result containing the parallel expression with the better evaluation is then held in the evaluation result holding unit 5. In the example shown in FIG. 4, the parallel expression (赤い鉛筆)と(安い定規), "(red pencil) and (cheap ruler)", is held. When the processing of step S3 ends, the process returns to the determination of step S1.
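The character-count comparison of step S3 can be worked through on the FIG. 4 example. The sketch below is illustrative only: the function name `char_difference` is hypothetical, and it assumes that the parser has already delivered each reading as a pair of noun phrases and that one Unicode code point corresponds to one character.

```python
from typing import List, Tuple

# Each reading: (noun phrase before と, noun phrase after と), as assumed parser output.
Candidate = Tuple[str, str]

candidates: List[Candidate] = [
    ("赤い鉛筆", "安い定規"),  # (red pencil) and (cheap ruler)
    ("鉛筆", "安い定規"),      # (pencil) and (cheap ruler)
]


def char_difference(candidate: Candidate) -> int:
    """Difference in character count between the two noun phrases."""
    before, after = candidate
    return abs(len(before) - len(after))


differences = [char_difference(c) for c in candidates]
best = min(candidates, key=char_difference)

print(differences)  # [0, 2]
print(best)
```

The first reading has four characters on each side, so its difference is 0, against 2 for the second reading; this agrees with the reading that the description says is held.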

In the embodiment described above, the parallel expression was evaluated by comparing the numbers of characters in the noun phrases before and after the particle, but a method that compares the numbers of kanji contained in the noun phrases may be used instead.
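A kanji-count variant might look like the sketch below. It rests on an assumption that is not in the description: a character is treated as kanji only if it falls in the CJK Unified Ideographs block (U+4E00 to U+9FFF), which covers the characters in the FIG. 4 example but not every ideograph in Unicode. The names `KANJI`, `kanji_count`, and `kanji_difference` are hypothetical.

```python
import re
from typing import Tuple

# Assumption: only the CJK Unified Ideographs block counts as kanji here.
KANJI = re.compile(r"[\u4e00-\u9fff]")


def kanji_count(text: str) -> int:
    """Number of kanji in the text; hiragana such as い are ignored."""
    return len(KANJI.findall(text))


def kanji_difference(candidate: Tuple[str, str]) -> int:
    """Difference in kanji count between the two noun phrases."""
    before, after = candidate
    return abs(kanji_count(before) - kanji_count(after))


candidates = [("赤い鉛筆", "安い定規"), ("鉛筆", "安い定規")]
best = min(candidates, key=kanji_difference)

print([kanji_difference(c) for c in candidates])  # [0, 1]
print(best)
```

On this example the kanji counts are 3 and 3 for the first reading and 2 and 3 for the second, so the same reading is preferred as with the character count.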

Furthermore, although the embodiment described above evaluates the number of characters contained in the noun phrases, the same effect can be obtained by comparing the numbers of words contained in the noun phrases.
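The word-count variant can be sketched in the same way. Japanese text is not delimited by spaces, so the sketch assumes that the noun phrases arrive already segmented into words, for instance from the parsing step; the segmentation shown and the name `word_difference` are illustrative assumptions.

```python
from typing import List, Tuple

# Each noun phrase is a list of words; the segmentation is assumed, not computed.
SegmentedCandidate = Tuple[List[str], List[str]]

candidates: List[SegmentedCandidate] = [
    (["赤い", "鉛筆"], ["安い", "定規"]),  # (red pencil) and (cheap ruler)
    (["鉛筆"], ["安い", "定規"]),          # (pencil) and (cheap ruler)
]


def word_difference(candidate: SegmentedCandidate) -> int:
    """Difference in word count between the two noun phrases."""
    before, after = candidate
    return abs(len(before) - len(after))


best = min(candidates, key=word_difference)

print([word_difference(c) for c in candidates])  # [0, 1]
print(best)
```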

[Effects of the Invention] As described above, according to the language processing apparatus of the present invention, when the result of parsing an input sentence contains a plurality of interpretation candidates that differ in the parts placed in parallel, the number of constituent elements of each of the paralleled parts is compared in each interpretation candidate, and the interpretation candidate for which the comparison shows a small difference in the number of constituent elements is made the priority candidate. An input sentence containing a parallel expression can therefore be analyzed simply and efficiently.

[Brief description of the drawings]

FIG. 1 is a block diagram showing an embodiment of a language processing apparatus to which the present invention is applied, FIG. 2 is a block diagram showing the system configuration in an embodiment of the present invention, FIG. 3 is a flowchart for explaining the operation of the apparatus shown in FIG. 1, and FIG. 4 is an explanatory diagram of the syntax analysis result holding unit.

1 ... input sentence holding unit, 2 ... syntax analysis unit, 3 ... syntax analysis result holding unit, 4 ... parallel expression evaluation unit, 5 ... evaluation result holding unit.

Claims

(57) [Claims]

1. A language processing apparatus comprising: parsing means for parsing an input sentence; comparing means for comparing, in each interpretation candidate, the number of constituent elements of each of the paralleled parts when the result analyzed by the parsing means contains a plurality of interpretation candidates that differ in the parts placed in parallel; and selecting means for selecting an interpretation candidate for which the comparison result of the comparing means shows a small difference in the number of constituent elements.

2. The language processing apparatus according to claim 1, wherein said number of constituent elements is the number of characters.

3. The language processing apparatus according to claim 1, wherein said number of constituent elements is the number of Chinese characters.

4. The language processing apparatus according to claim 1, wherein said number of constituent elements is the number of words.