JPH10269215A

JPH10269215A - Natural language analyzing device

Info

Publication number: JPH10269215A
Application number: JP9071256A
Authority: JP
Inventors: Hiromi Nakaiwa; 浩巳中岩
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1997-03-25
Filing date: 1997-03-25
Publication date: 1998-10-09
Anticipated expiration: 2017-03-25
Also published as: JP3508904B2

Abstract

PROBLEM TO BE SOLVED: To provide a natural language analyzing device that automatically generates, and adds and corrects an omitted element complementing rule for complementing omitted elements, by extracting omitted expressions of a source language and complementary elements of the omitted elements in a translated sentence of a target language. SOLUTION: An input sentence and an ideal translated sentence of the target language are read in through a translated sentence read-in part 1, an input sentence omitted element extraction part 52 is extracted omitted elements omitted from the input sentence, and a target language complementing element extraction part 53 extracts target language complementary elements complementing the omitted elements from the ideal translated sentence. On the basis of the extracted target language complementary elements, source language complementary element extraction part 54 extracts source language complementary elements and on the basis of omitted and complementary element pairs of the target language complementary elements and source language complementary elements, an omitted element complementing rule generation and alteration part 6 adds and alters the omitted element complementing rule.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、自然言語において
省略されている単語や表現などの省略要素を補完するた
めの省略要素補完規則を自動的に学習して生成する自然
言語解析装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a natural language analyzer for automatically learning and generating an abbreviation element complementing rule for complementing an abbreviation element such as a word or an expression abbreviated in a natural language.

【０００２】[0002]

【従来の技術】従来、自然言語解析装置において省略要
素補完規則を構築する場合、解析対象となる入力文を解
読して、省略要素補完規則を人手で構築するのが一般的
である。また、このようにして構築された省略要素補完
規則は自然言語解析装置の解析結果を基に、省略要素に
対して不適切な補完が行われている場合には、その省略
要素に対する補完の原因となる規則を、自然言語解析装
置の内部処理を良く理解したエキスパートが人手で追加
修正している。2. Description of the Related Art Conventionally, when constructing an abbreviation element complementing rule in a natural language analyzer, it is common to decode an input sentence to be analyzed and construct an abbreviated element complementing rule manually. In addition, the abbreviated element completion rule constructed in this way is based on the analysis result of the natural language analyzer, and when an inappropriate abbreviated element is complemented, the cause of the abbreviated element is complemented. Experts who understand the internal processing of the natural language analyzer well add and modify the rules manually

【０００３】[0003]

【発明が解決しようとする課題】上述したように、従来
の自然言語解析装置における省略要素補完規則の構築方
法において、規則の新規作成や追加修正を行うには、自
然言語解析装置の内部処理を良く理解したエキスパート
により規則を作成したり解析結果を検証する必要があ
る。従って、エキスパートでないと規則の新規作成や追
加修正を行うことができないという問題がある。また、
本作業は人手によるものであるので、規則の構築のため
の時間的経済的コストが高くなってしまうという問題が
ある。As described above, in the conventional method of constructing an abbreviation element complementing rule in a natural language analysis device, in order to newly create or add a new rule, the internal processing of the natural language analysis device must be performed. It is necessary to create rules and verify the analysis results by well-understood experts. Therefore, there is a problem that a rule cannot be newly created or added or modified unless an expert is used. Also,
Since this work is performed manually, there is a problem that the time and economic cost for constructing the rules are increased.

【０００４】本発明は、上記に鑑みてなされたもので、
その目的とするところは、原言語における省略表現と目
的言語の対訳文における省略要素の補完要素を抽出し、
省略要素を補完する省略要素補完規則の自動的作成およ
び追加修正を行う自然言語解析装置を提供することにあ
る。[0004] The present invention has been made in view of the above,
Its purpose is to extract the abbreviation in the source language and the complement of the abbreviation in the target language bilingual sentence,
An object of the present invention is to provide a natural language analysis device that automatically creates and additionally corrects an omitted element complementing rule for complementing an omitted element.

【０００５】[0005]

【課題を解決するための手段】上記目的を達成するた
め、請求項１記載の本発明は、原言語である第１の自然
言語の入力文を解析し、該入力文の解析構造である原言
語解析構造を生成し、省略要素を補完するための省略要
素補完規則を生成変更する自然言語解析装置であって、
前記入力文と対訳関係にある第２の自然言語である目的
言語の理想訳文を入力する入力手段と、前記入力文から
省略された省略要素を抽出する入力文省略要素抽出手段
と、前記理想訳文から前記省略要素に補完される目的言
語補完要素を抽出する目的言語補完要素抽出手段と、該
目的言語補完要素抽出手段で抽出された目的言語補完要
素を基に原言語補完要素を抽出する原言語補完要素抽出
手段と、前記目的言語補完要素および原言語補完要素の
対である省略補完要素対を基に前記省略要素補完規則を
追加変更する省略要素補完規則生成変更手段とを有する
ことを要旨とする。In order to achieve the above object, according to the present invention, an input sentence of a first natural language which is an original language is analyzed, and an original sentence which is an analysis structure of the input sentence is analyzed. A natural language analyzer that generates a linguistic analysis structure and generates and changes an abbreviation element completion rule for complementing an abbreviation element,
An input unit for inputting an ideal translation of a target language, which is a second natural language having a bilingual relationship with the input sentence; an input sentence omitting element extracting unit for extracting an abbreviated element from the input sentence; Target language supplementary element extracting means for extracting a target language complementary element complemented from the omission element, and a source language for extracting a source language complementary element based on the target language complementary element extracted by the target language complementary element extracting means A gist comprising: a supplementary element extracting unit; and an abbreviated element complementing rule generation changing unit that additionally changes the abbreviated element complementing rule based on an abbreviated complementing element pair that is a pair of the target language complementing element and the source language complementing element. I do.

【０００６】請求項１記載の本発明にあっては、入力文
と目的言語の理想訳文を入力し、入力文から省略要素を
抽出し、理想訳文から目的言語補完要素を抽出し、該目
的言語補完要素を基に原言語補完要素を抽出し、省略補
完要素対を基に省略要素補完規則を追加変更する。According to the first aspect of the present invention, an input sentence and an ideal translation of a target language are input, an omission element is extracted from the input sentence, a target language complement element is extracted from the ideal translation, and the target language is extracted. The source language completion element is extracted based on the completion element, and the omitted element completion rule is additionally changed based on the omitted complementary element pair.

【０００７】また、請求項２記載の本発明は、原言語で
ある第１の自然言語の入力文を解析し、該入力文の解析
構造である原言語解析構造を生成し、省略要素を補完す
るための省略要素補完規則を生成変更する自然言語解析
装置であって、互いに対訳関係にある前記入力文と第２
の自然言語の理想訳文を読み込む読み込み手段と、前記
入力文および前記理想訳文から前記入力文中から省略さ
れた省略要素を抽出し、前記理想訳文中から前記省略要
素に補完される目的言語補完要素を抽出し、前記省略要
素と前記目的言語補完要素を基に前記省略要素と該省略
要素に補完すべき原言語補完要素の対である省略補完要
素対を抽出する省略補完要素対抽出手段と、前記省略補
完要素対抽出手段で抽出された前記省略補完要素対を基
に前記省略要素補完規則を生成変更する省略要素補完規
則生成変更手段とを有することを要旨とする。According to a second aspect of the present invention, an input sentence of a first natural language as a source language is analyzed, a source language analysis structure as an analysis structure of the input sentence is generated, and omitted elements are complemented. A natural language analysis device for generating and changing an abbreviation element complementing rule for performing the
Reading means for reading an ideal translation of a natural language, extracting an omitted element omitted from the input sentence from the input sentence and the ideal translated sentence, and extracting a target language complement element complemented by the omitted element from the ideal translated sentence. An abbreviation complement element pair extracting means for extracting and extracting an abbreviation complement element pair that is a pair of the abbreviation element and a source language complement element to be complemented with the abbreviation element based on the omission element and the target language complement element; The gist of the invention is to include an omitted element complementing rule generation changing unit that generates and changes the omitted element complementing rule based on the omitted complementary element pair extracted by the omitted complementary element pair extracting unit.

【０００８】請求項２記載の本発明にあっては、入力文
と理想訳文を読み込み、入力文および理想訳文から入力
文中の省略要素を抽出し、理想訳文中から省略要素に補
完される目的言語補完要素を抽出し、省略要素と目的言
語補完要素を基に省略要素と該省略要素に補完すべき原
言語補完要素の対である省略補完要素対を抽出し、この
抽出された省略補完要素対を基に省略要素補完規則を生
成変更する。According to the second aspect of the present invention, an input sentence and an ideal translation are read, an omitted element in the input sentence is extracted from the input sentence and the ideal translated sentence, and a target language complemented by the omitted element from the ideal translated sentence. A complementary element is extracted, and an abbreviated element and an abbreviated complement element pair, which is a pair of a source language complement element to be complemented with the abbreviated element, are extracted based on the abbreviated element and the target language complement element. Generate and change the omission element completion rule based on.

【０００９】更に、請求項３記載の本発明は、請求項２
記載の発明において、前記省略補完要素対抽出手段が、
前記入力文から省略された省略要素を抽出する入力文省
略要素抽出手段と、前記理想訳文から前記省略要素に補
完される目的言語補完要素を抽出する目的言語補完要素
抽出手段とを有することを要旨とする。Further, the present invention according to claim 3 provides the invention according to claim 2
In the described invention, the omitted complementary element pair extracting means includes:
The gist includes an input sentence omitted element extracting unit that extracts an omitted omitted element from the input sentence, and a target language complementary element extracting unit that extracts a target language complementary element that is complemented by the omitted element from the ideal translation. And

【００１０】請求項３記載の本発明にあっては、省略補
完要素対抽出手段において入力文から省略された省略要
素を入力文省略要素抽出手段で抽出し、理想訳文から省
略要素に補完される目的言語補完要素を目的言語補完要
素抽出手段で抽出する。According to the third aspect of the present invention, the omitted elements omitted from the input sentence in the omitted complementary element pair extracting means are extracted by the input sentence omitted element extracting means, and the omitted elements are complemented from the ideal translation. The target language complement element is extracted by the target language complement element extracting means.

【００１１】請求項４記載の本発明は、請求項２または
３記載の発明において、前記省略補完要素対抽出手段
が、前記入力文と前記理想訳文から前記入力文中の入力
単語表現と該入力単語表現と対訳関係にある該理想訳文
中の理想訳単語表現の対である入力文理想訳単語表現対
を抽出する対訳表現対抽出手段を有することを要旨とす
る。According to a fourth aspect of the present invention, in the second or third aspect of the present invention, the omission complementary element pair extracting means includes an input word expression in the input sentence from the input sentence and the ideal translation sentence, and the input word The gist of the present invention is to provide a bilingual expression pair extracting means for extracting an input sentence ideal translation word expression pair which is a pair of an ideal translation word expression in the ideal translation sentence having a bilingual relation with the expression.

【００１２】請求項４記載の本発明にあっては、省略補
完要素対抽出手段において入力文と理想訳文から入力文
中の入力単語表現と該入力単語表現と対訳関係にある該
理想訳文中の理想訳単語表現の対である入力文理想訳単
語表現対を対訳表現対抽出手段で抽出する。According to the present invention, in the abbreviation complementary element pair extraction means, the input word expression in the input sentence and the ideal word in the ideal translation having a bilingual relationship with the input word expression from the input sentence and the ideal translation. An input sentence ideal translation word expression pair, which is a pair of translation word expressions, is extracted by a translation expression pair extraction unit.

【００１３】また、請求項５記載の本発明は、請求項３
または４記載の発明において、前記省略補完要素対抽出
手段が、前記目的言語補完要素抽出手段で抽出された前
記目的言語補完要素を基に前記原言語補完要素を抽出す
る原言語補完要素抽出手段を有することを要旨とする。The present invention described in claim 5 provides the present invention according to claim 3.
In the invention according to the fourth aspect, the omitted complementary element pair extracting means includes a source language complementary element extracting means for extracting the source language complementary element based on the target language complementary element extracted by the target language complementary element extracting means. It is the gist to have.

【００１４】請求項５記載の本発明にあっては、省略補
完要素対抽出手段において目的言語補完要素抽出手段で
抽出された目的言語補完要素を基に原言語補完要素を原
言語補完要素抽出手段で抽出する。According to the fifth aspect of the present invention, the source language supplementary element extracting means extracts the source language supplementary element based on the target language supplementary element extracted by the target language supplementary element extracting means in the omitted complementing element pair extracting means. Extract with

【００１５】更に、請求項６記載の本発明は、請求項２
ないし５のいずれかに記載の発明において、前記対訳文
読み込み手段で読み込まれた前記入力文を解析して入力
文解析構造を生成する入力文解析手段を有し、前記入力
文解析構造を利用して前記省略要素、前記目的言語補完
要素および／または前記原言語補完要素を抽出すること
を要旨とする。[0015] Further, the present invention according to claim 6 provides the present invention as claimed in claim 2.
6. The invention according to any one of claims 1 to 5, further comprising input sentence analyzing means for analyzing the input sentence read by the bilingual sentence reading means to generate an input sentence analysis structure, and utilizing the input sentence analysis structure. The gist is to extract the omitted element, the target language complement element, and / or the source language complement element.

【００１６】請求項６記載の本発明にあっては、対訳文
読み込み手段で読み込まれた入力文を解析して入力文解
析構造を入力文解析手段で生成し、該入力文解析構造を
利用して省略要素、目的言語補完要素および／または原
言語補完要素を抽出する。According to the present invention, the input sentence read by the bilingual sentence reading means is analyzed to generate an input sentence analysis structure by the input sentence analysis means, and the input sentence analysis structure is used. To extract the omitted element, the target language complement element, and / or the source language complement element.

【００１７】請求項７記載の本発明は、請求項６記載の
発明において、前記入力文解析手段が前記入力文中の省
略要素を補完する入力文省略要素補完手段を有すること
を要旨とする。According to a seventh aspect of the present invention, in the invention of the sixth aspect, the input sentence analyzing means has an input sentence omitted element complementing means for complementing an omitted element in the input sentence.

【００１８】請求項７記載の本発明にあっては、入力文
解析手段において入力文中の省略要素を入力文省略要素
補完手段で補完する。According to the present invention, in the input sentence analyzing means, the omitted elements in the input sentence are complemented by the input sentence omitted element complementing means.

【００１９】また、請求項８記載の本発明は、請求項２
ないし７のいずれかに記載の発明において、前記対訳読
み込み手段で読み込まれた前記理想訳文を解析して理想
訳文解析構造を生成する理想訳文解析手段を有し、前記
理想訳文解析構造を利用して前記省略要素、前記目的言
語補完要素および／または前記原言語補完要素を抽出す
ることを要旨とする。The present invention described in claim 8 provides the present invention according to claim 2.
8. The invention according to any one of claims 7 to 7, further comprising an ideal translation analysis unit that analyzes the ideal translation read by the bilingual reading unit to generate an ideal translation analysis structure, and uses the ideal translation analysis structure. The gist is to extract the omitted element, the target language complement element, and / or the source language complement element.

【００２０】請求項８記載の本発明にあっては、対訳読
み込み手段で読み込まれた理想訳文を解析して理想訳文
解析構造を理想訳文解析手段で生成し、理想訳文解析構
造を利用して省略要素、目的言語補完要素および／また
は原言語補完要素を抽出する。According to the present invention, the ideal translation sentence read by the bilingual reading means is analyzed to generate an ideal translation analysis structure by the ideal translation analysis means, and the ideal translation analysis structure is omitted using the ideal translation analysis structure. Extract elements, target language complement elements and / or source language complement elements.

【００２１】更に、請求項９記載の本発明は、請求項８
記載の発明において、前記理想訳文解析手段が前記理想
訳文中の省略要素を補完する理想訳文省略要素補完手段
を有することを要旨とする。Further, the present invention described in claim 9 is based on claim 8.
In the invention described above, the gist of the invention is that the ideal translation analyzing unit has an ideal translation omitting element complementing unit that complements an omitting element in the ideal translation.

【００２２】請求項９記載の本発明にあっては、理想訳
文解析手段において理想訳文中の省略要素を理想訳文省
略要素補完手段で補完する。In the ninth aspect of the present invention, the omitted elements in the ideal translation are complemented by the ideal translation omitted element complementing means in the ideal translation analyzing means.

【００２３】請求項１０記載の本発明は、請求項８また
は９記載の発明において、前記理想訳文解析手段が前記
理想訳文中の代名詞等の照応表現の指示対象を決定する
理想訳文照応解析手段を有することを要旨とする。According to a tenth aspect of the present invention, in the invention of the eighth or ninth aspect, the ideal translation analyzing means determines the target of an anaphoric expression such as a pronoun in the ideal translation. It is the gist to have.

【００２４】請求項１０記載の本発明にあっては、理想
訳文解析手段において理想訳文中の代名詞等の照応表現
の指示対象を理想訳文照応解析手段で決定する。According to the tenth aspect of the present invention, the target of the anaphor expression such as a pronoun in the ideal translation is determined by the ideal translation anaphor analysis means.

【００２５】また、請求項１１記載の本発明は、請求項
６ないし１０のいずれかに記載の発明において、前記省
略要素補完規則生成変更手段が前記入力文解析手段で得
られる前記入力文解析構造を利用して前記省略要素補完
規則を生成変更することを要旨とする。According to the present invention as set forth in claim 11, in the invention as set forth in any of claims 6 to 10, the input sentence analyzing structure wherein the omitted element complementing rule generation changing means is obtained by the input sentence analyzing means. The gist of the invention is to generate and change the omitted element complementing rule by using the above.

【００２６】請求項１１記載の本発明にあっては、省略
要素補完規則生成変更手段において入力文解析手段で得
られる入力文解析構造を利用して省略要素補完規則を生
成変更する。According to the eleventh aspect of the present invention, the omitted element complementing rule is generated and changed by using the input sentence analysis structure obtained by the input sentence analyzing means.

【００２７】更に、請求項１２記載の本発明は、請求項
２ないし１１のいずれかに記載の発明において、前記省
略要素補完規則生成変更手段が前記省略補完要素対抽出
手段で抽出された前記省略補完要素対を記録する省略補
完要素対格納手段を有し、学習対象となる対訳関係にあ
る前記入力文と前記理想訳文の対の集合を基に蓄積され
た前記省略補完要素対格納手段中の前記省略補完要素対
の集合を利用して前記省略要素補完規則を生成変更する
ことを要旨とする。Further, according to a twelfth aspect of the present invention, in the invention according to any one of the second to eleventh aspects, the omitted element complementing rule generation / change means is the omitted element extracted by the omitted element pair extracting means. An omission complement element pair storage means for recording a complement element pair, wherein the omission complement element pair storage means stored based on a set of pairs of the input sentence and the ideal translation sentence having a bilingual relationship to be learned. The gist is to generate and change the omitted element complementing rule using the set of the omitted complementary element pairs.

【００２８】請求項１２記載の本発明にあっては、省略
要素補完規則生成変更手段において省略補完要素対抽出
手段で抽出された省略補完要素対を省略補完要素対格納
手段で記録し、学習対象となる対訳関係にある入力文と
理想訳文の対の集合を基に蓄積された省略補完要素対格
納手段中の省略補完要素対の集合を利用して省略要素補
完規則を生成変更する。According to the twelfth aspect of the present invention, the omitted complementing element pair extracted by the omitted complementing element pair extracting means by the omitted element complementing rule generation changing means is recorded by the omitted complementing element pair storage means, and the learning target The abbreviation element complementing rule is generated and changed using the set of the abbreviation complement element pairs stored in the abbreviation complement element pair storage means accumulated based on the set of the input sentence and the ideal translation sentence having the bilingual relation.

【００２９】請求項１３記載の本発明は、請求項７ない
し１２のいずれかに記載の発明において、前記入力文省
略要素補完手段が前記省略要素補完規則を用いて前記入
力文中の省略要素を補完し、省略要素補完規則学習方法
を繰り返し行うことを要旨とする。According to a thirteenth aspect of the present invention, in the invention according to any one of the seventh to twelfth aspects, the input sentence omitted element complementing means complements an omitted element in the input sentence using the omitted element complementing rule. The gist is that the omitted element complementing rule learning method is repeatedly performed.

【００３０】請求項１３記載の本発明にあっては、入力
文省略要素補完手段において省略要素補完規則を用いて
入力文中の省略要素を補完し、省略要素補完規則学習方
法を繰り返し行う。According to the thirteenth aspect of the present invention, the input sentence omitting element complementing means complements the omitting element in the input sentence using the omitting element complementing rule, and repeatedly executes the omitting element complementing rule learning method.

【００３１】また、請求項１４記載の本発明は、請求項
１３記載の発明において、前記省略要素補完規則生成変
更手段が学習の終了を判定する学習終了判定手段を有す
ることを要旨とする。According to a fourteenth aspect of the present invention, the gist of the thirteenth aspect is that the omitted element complementing rule generation changing means has a learning end determining means for determining the end of learning.

【００３２】請求項１４記載の本発明にあっては、省略
要素補完規則生成変更手段において学習の終了を学習終
了判定手段で判定する。According to the fourteenth aspect of the present invention, the end of learning is determined by the omission element complementing rule generation changing means by the learning end determining means.

【００３３】更に、請求項１５記載の本発明は、請求項
１４記載の発明において、前記省略要素補完規則を用い
て前記入力文を解析した結果である前記原言語解析構造
の履歴である原言語解析構造履歴を記録する原言語解析
構造履歴格納手段を有し、前記学習終了判定手段は前記
原言語解析構造履歴格納手段の記録内容が定常状態にな
ったときに学習を終了することを要旨とする。Further, according to a fifteenth aspect of the present invention, in the fourteenth aspect of the present invention, the source language is a history of the source language analysis structure which is a result of analyzing the input sentence using the omitted element completion rule. It has a source language analysis structure history storage unit that records the analysis structure history, and the learning end determination unit ends the learning when the recorded content of the source language analysis structure history storage unit reaches a steady state. I do.

【００３４】請求項１５記載の本発明にあっては、省略
要素補完規則を用いて入力文を解析した結果である原言
語解析構造の履歴である原言語解析構造履歴を原言語解
析構造履歴格納手段に記録し、原言語解析構造履歴格納
手段の記録内容が定常状態になったときに学習を終了す
る。According to the fifteenth aspect of the present invention, the source language analysis structure history, which is the history of the source language analysis structure obtained as a result of analyzing the input sentence using the omitted element completion rule, is stored in the source language analysis structure history. The learning is terminated when the recorded content of the source language analysis structure history storage means is in a steady state.

【００３５】[0035]

【発明の実施の形態】以下、図面を用いて本発明の実施
の形態について説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００３６】図１は、本発明の一実施形態に係る自然言
語解析装置の構成を示すブロック図である。同図に示す
自然言語解析装置は、自然言語において省略されている
単語や表現などの省略要素を補完するための省略要素補
完規則を自動的に学習して生成する自然言語解析システ
ム用省略要素補完規則学習方法を実施するものであり、
第１の自然言語が日本語、第２の自然言語が英語である
としている。FIG. 1 is a block diagram showing a configuration of a natural language analyzer according to one embodiment of the present invention. The natural language analysis device shown in the figure is a natural language analysis system that automatically learns and generates an abbreviated element completion rule for complementing an abbreviated element such as a word or expression omitted in a natural language. To implement the rule learning method,
It is assumed that the first natural language is Japanese and the second natural language is English.

【００３７】図１に示す自然言語解析装置は、対訳文読
み込み部１、自然言語解析システム部２、規則３、理想
訳文解析部４、省略補完要素対抽出部５、省略要素補完
規則生成変更部６より構成される。The natural language analyzing apparatus shown in FIG. 1 includes a bilingual sentence reading unit 1, a natural language analyzing system unit 2, a rule 3, an ideal translated sentence analyzing unit 4, an omitted complementary element pair extracting unit 5, an omitted element complementary rule generation changing unit. 6.

【００３８】対訳文読み込み部１は、対訳関係にある第
１の自然言語の文（入力文）と第２の自然言語の文（理
想訳文）の対（対訳文）を入力する。The bilingual sentence reading unit 1 inputs a pair (a bilingual sentence) of a first natural language sentence (input sentence) and a second natural language sentence (ideal translated sentence) having a bilingual relationship.

【００３９】自然言語解析システム部２は、対訳文読み
込み部１で読み込まれた入力文を規則３を用いて解析し
て入力文解析構造を生成する。自然言語解析システム２
は、入力文構文意味解析部２１、入力文省略要素補完部
２２から構成される。規則３は、構文意味解析規則３
１、省略要素補完規則３２から構成される。The natural language analysis system unit 2 analyzes the input sentence read by the bilingual sentence reading unit 1 using rule 3 to generate an input sentence analysis structure. Natural language analysis system 2
Is composed of an input sentence syntactic and semantic analyzing unit 21 and an input sentence omitting element complementing unit 22. Rule 3 is syntactic and semantic analysis rule 3.
1. It is composed of an omitted element complementing rule 32.

【００４０】自然言語解析システム部２の入力文構文意
味解析部２１は、対訳文読み込み部１で読み込まれた入
力文を構文意味解析規則３１を用いて構文意味解析を行
い、入力文構文意味構造を生成する。入力文省略要素補
完部２２は、省略要素補完規則３２を用いて入力文構文
意味解析部２１で生成された入力文構文意味構造を基に
入力文中の省略要素を補完する。The input sentence syntactic and semantic analysis unit 21 of the natural language analysis system unit 2 performs a syntactic and semantic analysis of the input sentence read by the bilingual sentence reading unit 1 by using the syntactic and semantic analysis rule 31, and obtains the input sentence syntactic and semantic structure. Generate The input sentence abbreviation element complementing unit 22 complements an omitted element in the input sentence based on the input sentence syntax / semantic structure generated by the input sentence syntax / semantic analysis unit 21 using the omitted element complementing rule 32.

【００４１】理想訳文解析部４は、対訳文読み込み部１
で読み込まれた理想訳文を解析して理想訳文解析構造を
生成する。理想訳文解析部４は、理想訳文構文意味解析
部４１、理想訳文省略要素補完部４２、理想訳文照応解
析部４３から構成される。The ideal translated sentence analyzing unit 4 is a bilingual sentence reading unit 1
Analyzes the ideal translated sentence read in step (1) to generate an ideal translated sentence analysis structure. The ideal translated sentence analyzing unit 4 includes an ideal translated sentence syntactic and semantic analyzing unit 41, an ideal translated sentence omitting element complementing unit 42, and an ideal translated sentence analytic analyzing unit 43.

【００４２】理想訳文解析部４の理想訳文構文意味解析
部４１は、対訳文読み込み部１で読み込まれた理想訳文
の構文意味解析を行い、理想訳文構文意味構造を生成す
る。理想訳文省略要素補完部４２は、理想訳文構文意味
解析部４１で生成された入力文構文意味構造を基に入力
文中の省略要素を補完する。理想訳文照応解析部４３
は、理想訳文構文意味解析部４１で生成された理想訳文
構文意味構造を基に理想訳文中の代名詞等の照応表現の
指示対象を決定する。The ideal translated sentence syntactic and semantic analyzing unit 41 of the ideal translated sentence analyzing unit 4 performs a syntactic and semantic analysis of the ideal translated sentence read by the bilingual sentence reading unit 1 and generates an ideal translated sentence syntactic and semantic structure. The ideal translated sentence omitted element complementing section 42 complements the omitted element in the input sentence based on the input sentence syntax / semantic structure generated by the ideal translated sentence syntax / semantic analyzing section 41. Ideal translation sentence analysis unit 43
Determines the target of an anaphoric expression such as a pronoun in an ideal translation based on the ideal translation syntax / semantic structure generated by the ideal translation syntax / semantic analysis unit 41.

【００４３】省略補完要素対抽出部５は、対訳文読み込
み部１で読み込まれた対訳関係にある入力文および理想
訳文を対象に、入力文中から省略要素を抽出し、理想訳
文中から省略要素に補完する目的言語補完要素を抽出
し、抽出された省略要素と目的言語補完要素を基に、目
的言語補完要素と対訳関係にある原言語補完要素を抽出
し、抽出された省略要素と目的言語補完要素と原言語補
完要素の対である省略補完要素対を抽出する。省略補完
要素対抽出部５は、対訳表現対抽出部５１、入力文省略
要素抽出部５２、目的言語補完要素抽出部５３、原言語
補完要素抽出部５４から構成される。The omitted complementary element pair extraction unit 5 extracts an omitted element from the input sentence and extracts the omitted element from the ideal translated sentence, for the input sentence and the ideal translated sentence read by the bilingual sentence reading unit 1. Extract the target language complement elements to be complemented, extract the source language complement elements that are in parallel with the target language complement elements based on the extracted omitted elements and target language complement elements, and extract the extracted omitted language elements and target language complement. Extract an abbreviated complement element pair that is a pair of an element and a source language complement element. The omitted complementary element pair extraction unit 5 includes a bilingual expression pair extraction unit 51, an input sentence omitted element extraction unit 52, a target language complement element extraction unit 53, and a source language complement element extraction unit 54.

【００４４】省略補完要素対抽出部５の対訳表現対抽出
部５１は、対訳文読み込み部１で読み込まれた対訳関係
にある入力文と理想訳文から入力文中の入力単語表現と
この入力単語実現と対訳関係にある理想訳文中の理想訳
単語表現の対である入力文理想訳単語表現対を抽出す
る。入力文省略要素抽出部５２は、入力文中から自然言
語解析システム部２で生成された入力文解析構造および
理想訳文解析部４で生成された理想訳文解析構造および
対訳表現対抽出部５１で抽出された入力文理想訳単語表
現対を用いて省略された要素である省略要素を抽出す
る。目的言語補完要素抽出部５３は、理想訳文から自然
言語解析システム２で生成された入力文解析構造および
理想訳文解析部４で生成された理想訳文解析構造および
対訳表現対抽出部５１で抽出された入力文理想訳単語表
現対を用いて省略要素に補完される目的言語補完要素を
抽出する。原言語補完要素抽出部５４は、目的言語補完
要素抽出部５３で抽出された目的言語補完要素を基に原
言語補完要素を抽出する。The bilingual expression pair extraction unit 51 of the abbreviation complementary element pair extraction unit 5 converts the input sentence and ideal translation sentence read by the bilingual sentence reading unit 1 into an input word expression in the input sentence and realizes the input word. An input sentence ideal translation word expression pair, which is an ideal translation word expression pair in an ideal translation having a bilingual relationship, is extracted. The input sentence omitting element extraction unit 52 is extracted from the input sentence by the input sentence analysis structure generated by the natural language analysis system unit 2 and the ideal translated sentence analysis structure generated by the ideal translated sentence analysis unit 4 and the bilingual expression pair extraction unit 51. The abbreviated element which is the abbreviated element is extracted using the input sentence ideal translation word expression pair. The target language supplementary element extraction unit 53 is extracted by the natural language analysis system 2 from the ideal translation and the ideal translation analysis structure generated by the ideal translation analysis unit 4 and extracted by the bilingual expression pair extraction unit 51. A target language complement element to be complemented with the omitted element is extracted using the input sentence ideal translation word expression pair. The source language supplementary element extracting unit 54 extracts a source language supplementary element based on the target language supplementary element extracted by the target language supplementary element extracting unit 53.

【００４５】省略要素補完規則生成変更部６は、省略補
完要素対抽出部５で抽出される省略補完要素対を基に規
則３中の省略要素補完規則３２を生成変更する。The omitted element complementing rule generation changing unit 6 generates and changes the omitted element complementing rule 32 in the rule 3 based on the omitted complementary element pair extracted by the omitted complementary element pair extracting unit 5.

【００４６】図２は、本発明で追加変更される省略要素
補完規則を用いた自然言語解析装置の基本ブロック構成
の一例である。同図に示す自然言語解析装置は、解析す
る第１の自然言語の文を入力する第１自然言語入力部
７、自然言語解析システム部２、自然言語システム部で
用いられる規則３、自然言語解析システム部２により生
成された第１自然言語解析構造を出力する第１自然言語
解析構造出力部８より構成される。FIG. 2 shows an example of a basic block configuration of a natural language analysis apparatus using an abbreviation element complementing rule which is additionally changed in the present invention. The natural language analysis device shown in FIG. 1 includes a first natural language input unit 7 for inputting a sentence of a first natural language to be analyzed, a natural language analysis system unit 2, a rule 3 used in the natural language system unit, a natural language analysis. The first natural language analysis structure output unit 8 outputs the first natural language analysis structure generated by the system unit 2.

【００４７】自然言語解析システム部２は、入力文構文
意味解析部２１、入力文省略要素補完部２２から構成さ
れる。規則３は、構文意味解析規則３１、省略要素補完
規則３２から構成される。構文意味解析規則３１は、構
文意味解析辞書３１１、構文意味解析ルール３１２から
構成される。自然言語解析システム部２の入力文構文意
味解析部２１は、第１自然言語入力部７で読み込まれた
第１の自然言語の文を構文意味解析規則３１を用いて構
文意味解析を行い、入力文構文意味構造を生成する。入
力文省略要素補完部２２は、省略要素補完規則３２を用
いて入力文構文意味解析部２１で生成された入力文構文
意味構造を基に第１の自然言語の文の省略要素を補完す
る。The natural language analysis system unit 2 comprises an input sentence syntax / meaning analysis unit 21 and an input sentence omitting element complementing unit 22. Rule 3 includes a syntactic and semantic analysis rule 31 and an omission element complementing rule 32. The syntactic and semantic analysis rule 31 includes a syntactic and semantic analysis dictionary 311, and a syntactic and semantic analysis rule 312. The input sentence syntactic and semantic analysis unit 21 of the natural language analysis system unit 2 performs a syntactic and semantic analysis of the first natural language sentence read by the first natural language input unit 7 using the syntactic and semantic analysis rule 31 and inputs the sentence. Generate a sentence syntax semantic structure. The input sentence abbreviation element complementing unit 22 complements the omitted element of the first natural language sentence based on the input sentence syntax / semantic structure generated by the input sentence syntax / semantic analysis unit 21 using the omitted element complementing rule 32.

【００４８】図３は、本発明の一実施形態の構文意味解
析辞書３１１の例を示す。構文意味解析辞書３１１は、
例えば、単語構文意味辞書があり、その単語構文意味辞
書には、第１の自然言語である日本語の各単語に対する
標準形、品詞、意味属性がそれぞれ記入されている。例
えば、「車」に対してはその標準形も「車」であり、品
詞は“名詞”であり、意味属性は“具体物”であること
が記入されている。また、「を」に対してはその標準形
も「を」であり、品詞は“助詞”であり、意味属性は
“目的格”であることが記入されている。また、「買い
たい」に対してはその標準形は「買う」と「たい」の２
種類に分けられ、「買う」に対しては品詞は“動詞”で
あり、意味属性は“購入”であることが記入され、「た
い」に対しては品詞は“様相”であり、意味属性は“希
望”であることが記入されている。FIG. 3 shows an example of a syntactic and semantic analysis dictionary 311 according to an embodiment of the present invention. The syntactic and semantic analysis dictionary 311 is
For example, there is a word syntax and semantic dictionary, and the standard form, part of speech, and semantic attributes for each word of Japanese as the first natural language are entered in the word syntax and semantic dictionary. For example, it is described that the standard form of "car" is also "car", the part of speech is "noun", and the semantic attribute is "concrete". In addition, the standard form of "wo" is "wo", the part of speech is "particle", and the semantic attribute is "purpose". The standard forms of “want to buy” are “buy” and “tai”.
For "buy", the part of speech is "verb" and the semantic attribute is "purchase". For "tai", the part of speech is "modal" and the semantic attribute is Is described as "hope".

【００４９】図４は、本発明の一実施形態の構文意味解
析ルール３１２の例を示す。構文意味解析ルール３１２
は、例えば、動詞構文意味解析ルールがあり、その動詞
構文意味解析ルールには日本語の各動詞だけではなく、
動詞の助詞がパターンとして付けられており、つまり、
例えば動詞「買いたい」については“「Ｓ」（主体、名
詞）「が」（助詞）「Ｏ」（目的語、名詞）「を」（助
詞）”という助詞パターンが記入されており、動詞をそ
の動詞を含めてキーとして動詞を中心とした構文意味解
析構造を検索するように記入されている。さらに動詞の
格要素の意味を示す選択条件が記されている。例えば、
動詞「買いたい」についてはその助詞パターン（ＳがＯ
を）に示されている格要素の意味について、動詞である
「買う」が様相“希望”を伴っており、Ｓが主格の関係
を、Ｏと目的格の関係を持つことを示す構文意味解析構
造を生成する場合には、Ｓが主体の“意味”を持ちＯが
“具体物”の意味を持つ時にその構文意味解析構造を適
用する制約があることを示している。FIG. 4 shows an example of the syntactic and semantic analysis rule 312 according to the embodiment of the present invention. Syntax and semantic analysis rule 312
For example, there are verb syntactic and semantic analysis rules, and the verb syntactic and semantic analysis rules include not only Japanese verbs but also
The verb particles are attached as patterns, that is,
For example, for the verb “buy”, the particle pattern “S” (subject, noun) “ga” (particle) “O” (object, noun) “wo” (particle) ”is entered. A search is made to search for a syntactic and semantic analysis structure centered on the verb as a key including the verb, and further, a selection condition indicating the meaning of the case element of the verb is described.
For the verb "buy", the particle pattern (S is O
)), The syntactic and semantic analysis indicates that the verb "buy" has a modality "hope" and that S has a nominative relation and O has a relation with the object case. In the case of generating a structure, it indicates that there is a restriction to apply the syntactic and semantic analysis structure when S has the main “meaning” and O has the meaning of “concrete”.

【００５０】図５は、本発明の一実施形態の省略要素補
完規則３２の例を示す。省略要素補完規則３２は、例え
ば、構文意味解析構造条件とそれに対応する補完要素が
記入されており、構文意味解析構造がその構文意味解析
構造条件と一致した場合に、補完要素を補完する。例え
ば、任意（＊）の動詞に様相“禁止”を伴っており、主
格の関係にある要素が省略されており、目的格の関係に
ある要素は省略されていない場合には、省略されている
主格に「あなた」を補完することを示している。FIG. 5 shows an example of the omitted element complementing rule 32 according to an embodiment of the present invention. In the omitted element complementing rule 32, for example, a syntactic and semantic analysis structure condition and a complementary element corresponding to the syntactic and semantic analysis structure condition are described. When the syntactic and semantic analysis structure matches the syntactic and semantic analysis structure condition, the complement element is complemented. For example, if an arbitrary (*) verb is accompanied by modal "prohibition", elements having a nominative relation are omitted, and elements having a target case are omitted if not omitted. It indicates that the nominative complements you.

【００５１】次に、図１の基本ブロック構成の自然言語
解析システム用省略要素補完規則学習システムの処理手
順について説明する。ここでは、図６に示す対訳文を例
にとって、本処理の手順も説明する。Next, the processing procedure of the omission element complementing rule learning system for a natural language analysis system having the basic block configuration shown in FIG. 1 will be described. Here, the procedure of this processing will also be described by taking the bilingual sentence shown in FIG. 6 as an example.

【００５２】まず、対訳文読み込み部１において、対訳
関係にある第１の自然言語の文と第２の自然言語の文の
対（対訳文）を入力する。この対訳文は、自然言語解析
システム用省略要素補完規則の自動学習のために用いる
ものである。この省略要素補完規則の学習用に対訳文の
集合（対訳文章）を選択することによって、学習させた
対訳文章と類似の省略現象が頻出する同様の分野文章向
きに、自然言語解析システムの省略要素補完処理をチュ
ーンすることが可能となる。図６の例では、第１の自然
言語である日本語の文（入力日本文）「車を買いたい」
に対して対訳関係にある第２の自然言語である英語の文
（理想英訳文）”I want to buy my car．”が対訳文と
して入力される。First, a bilingual sentence reading unit 1 inputs a pair (a bilingual sentence) of a first natural language sentence and a second natural language sentence having a bilingual relationship. The bilingual sentence is used for automatic learning of the omitted element complementing rule for the natural language analysis system. By selecting a set of bilingual sentences (bilingual sentences) for learning of this abbreviation element complementing rule, a natural language analysis system omits the natural language analysis system in the same field sentences where similar omission phenomena frequently occur as the learned bilingual sentences. It is possible to tune the complementary processing. In the example of FIG. 6, the first natural language sentence in Japanese (input Japanese sentence) “I want to buy a car”
, An English sentence (ideal English translated sentence), which is a second natural language having a bilingual relationship, is input as a translated sentence.

【００５３】入力された対訳文のうち、第１の自然言語
の文である入力日本語文は、規則３を用いた自然言語解
析システム部２によって第１の自然言語の解析構造であ
る入力日本文解析構造を生成する。この自然言語解析シ
ステムの処理の手順を図２の基本ブロック構成の自然言
語解析システムを用いて説明する。The input Japanese sentence, which is the first natural language sentence, of the input bilingual sentence is input Japanese sentence which is the first natural language analysis structure by the natural language analysis system unit 2 using rule 3. Generate an analysis structure. The processing procedure of this natural language analysis system will be described using a natural language analysis system having a basic block configuration shown in FIG.

【００５４】まず、第１の自然言語の文である入力日本
語文は例えば図３に示す単語構文意味解析辞書のような
構文意味解析辞書３１１や例えば図４に示す動詞構文意
味解析ルールのような構文意味解析ルール３１２を含む
構文意味解析辞書３１を用いた入力文構文意味解析部２
１によって解析され、日本語の構文意味解析構造を生成
する。生成された日本語の構文意味解析構造は、例えば
図５に示す主格省略要素補完規則のような省略要素補完
規則３２を用いた入力文省略要素補完部２２によって省
略要素が補完された日本文の解析構造を生成する。生成
された日本文の解析構造は、第１自然言語解析構造出力
部８によって出力される。First, an input Japanese sentence as a first natural language sentence is, for example, a syntactic and semantic analysis dictionary 311 such as a word syntactic and semantic analysis dictionary shown in FIG. 3 or a verb syntactic and semantic analysis rule such as shown in FIG. Input sentence syntactic and semantic analysis unit 2 using syntactic and semantic analysis dictionary 31 including syntactic and semantic analysis rule 312
1 to generate a Japanese syntactic and semantic analysis structure. The generated Japanese syntactic and semantic analysis structure is, for example, a Japanese sentence whose abbreviation elements are complemented by the input sentence abbreviation element complementing unit 22 using the abbreviation element complementation rules 32 such as the nominative abbreviation element complementation rules shown in FIG. Generate an analysis structure. The generated Japanese sentence analysis structure is output by the first natural language analysis structure output unit 8.

【００５５】なお、この自然言語解析システムの各段階
で参照・生成された解析構造や規則は、省略要素補完規
則自動学習処理で参照可能となるように情報を保存して
おく。例えば、図４の構文意味解析ルール３１２を検索
した際に得られる個々の構文意味解析構造や選択条件も
保存しておく。The analysis structures and rules referred to and generated at each stage of the natural language analysis system are stored with information so that they can be referred to in the omitted element complementing rule automatic learning process. For example, individual syntactic and semantic analysis structures and selection conditions obtained when the syntactic and semantic analysis rule 312 in FIG. 4 is searched are also stored.

【００５６】上記のような自然言語解析システムで、例
として図３で示した構文意味解析辞書３１１、図４で示
した構文意味解析ルール３１２、図５で示した省略要素
補完規則３２を用いることによって、図６の入力日本文
に対しては、図７に示す入力文解析構造が生成される。
これら入力文解析構造は動詞と各要素の関係と機能別に
入力文中の単語が記されている。なお、図６の入力日本
文では主格の要素が省略されているが、図５の省略要素
補完規則３２にはこの入力日本文の主格に補完する省略
要素補完規則がないため、入力文省略要素補完部２２で
は補完は行われない。In the natural language analysis system as described above, the syntactic and semantic analysis dictionary 311 shown in FIG. 3, the syntactic and semantic analysis rule 312 shown in FIG. 4, and the omitted element complementing rule 32 shown in FIG. Thus, the input sentence analysis structure shown in FIG. 7 is generated for the input Japanese sentence shown in FIG.
In these input sentence analysis structures, words in the input sentence are described according to the relation between verbs and each element and the function. Although the nominative element in the input Japanese sentence of FIG. 6 is omitted, the abbreviation element complementing rule 32 of FIG. 5 does not include an abbreviation element complementing rule for complementing the nominative of the input Japanese sentence. The complementing unit 22 does not perform complementation.

【００５７】ここで、「主格」の関係とは名詞句が動詞
の主語として機能していることを示し、「目的格」の関
係とは名詞句が動詞の目的語として機能していることを
示す。Here, the relation of "subject" indicates that the noun phrase functions as the subject of the verb, and the relation of "object" indicates that the noun phrase functions as the object of the verb. Show.

【００５８】次に、図１の理想訳文解析部４について説
明する。この理想訳文解析部４は対訳文読み込み部１で
読み込まれた第２の自然言語の文（理想英訳文）を解析
することによって自動的に理想訳解析構造を生成する。
この理想訳文解析部４の処理の手順を説明する。まず第
２の自然言語の文である理想英訳文は理想訳文構文意味
解析部４１によって解析され、英語の構文意味解析構造
を生成する。生成された英語の構文意味解析構造は、理
想訳文省略要素補完部４２によって省略要素が補完され
た英語の解析構造を生成する。省略要素が補完された英
語の解析構造は、理想訳文照応解析部４３によって照応
表現の指示対象の情報が付加された英語の解析構造を生
成する。Next, the ideal translation analyzing section 4 shown in FIG. 1 will be described. The ideal translation analysis unit 4 automatically generates an ideal translation analysis structure by analyzing the sentence of the second natural language (ideal English translation) read by the bilingual sentence reading unit 1.
The procedure of the process of the ideal translation analysis unit 4 will be described. First, the ideal English translation sentence, which is a sentence of the second natural language, is analyzed by the ideal translation sentence syntactic and semantic analysis unit 41 to generate an English syntactic and semantic analysis structure. The generated English syntactic and semantic analysis structure generates an English analysis structure in which omitted elements are complemented by the ideal translation sentence omitted element complementing unit 42. The English analysis structure in which the omitted elements are supplemented generates an English analysis structure to which the information on the target of the anaphor expression is added by the ideal translation anaphor analysis unit 43.

【００５９】図６の理想英訳文に対しては図７に示す理
想訳文解析構造が理想訳文解析部４によって生成され。
これら理想訳文解析構造は入力文解析構造と同様に動詞
と各要素の関係と機能別に理想訳文中の単語が記されて
いる。なお、図６の理想英訳文中には省略がないため、
理想訳文省略要素補完部４２で、補完が行われないが、
理想訳文照応解析部４３では”my”が照応表現であるこ
とから、この指示対象である”Ｉ”の情報が”my”の解
析構造中に付加される。For the ideal English translation shown in FIG. 6, an ideal translation analysis structure shown in FIG.
As in the input sentence analysis structure, the words in the ideal translation are described according to the relationship between verbs and each element and the functions in these ideal sentence analysis structures. Since there is no omission in the ideal English translation of FIG. 6,
Although the ideal translation omitted element complementing unit 42 does not perform complementation,
Since “my” is an anaphoric expression in the ideal translation sentence anaphor analysis unit 43, the information of “I” to be designated is added to the analysis structure of “my”.

【００６０】上記のようにして得られた入力文解析構造
と理想訳文解析構造は、対訳表現対抽出部５１によって
対訳関係にある入力文解析構造中の単語表現と理想訳文
解析構造中の単語表現の対である入力文理想訳単語表現
対が抽出される。このような入力文理想訳文単語表現対
は、例えば、第１の自然言語である日本語と第２の自然
言語である英語の対訳要素対を保持した日英対訳辞書を
参照することで抽出される。図７に示す入力文解析構造
および理想訳文解析構造から抽出されれる日英の対訳表
現対は、対訳表現対抽出部５１によって抽出され、図８
に示すような入力文単語表現対が得られる。The input sentence analysis structure and the ideal translated sentence analysis structure obtained as described above are converted by the bilingual expression pair extraction unit 51 into a word expression in the input sentence analysis structure and a word expression in the ideal translated sentence analysis structure which are in a bilingual relationship. , An input sentence ideal translation word expression pair is extracted. Such an input sentence ideal translation sentence word expression pair is extracted by referring to, for example, a Japanese-English bilingual dictionary holding translation element pairs of Japanese as the first natural language and English as the second natural language. You. A Japanese-English bilingual expression pair extracted from the input sentence analysis structure and the ideal translation analysis structure shown in FIG. 7 is extracted by the bilingual expression pair extraction unit 51, and FIG.
An input sentence word expression pair as shown in FIG.

【００６１】図８では、入力文単語表現「買いたい」に
相当する入力文解析構造から得られた解析構造と、理想
訳文単語表現”want to buy ”に相当する理想訳文解析
構造から得られた解析構造の単語表現対と、入力文単語
表現「車」に相当する入力文解析構造から得られた解析
構造と、理想訳文単語表現”car ”に相当する理想訳文
解析構造から得られた解析構造の単語表現対の２種類が
対訳表現対抽出部５１によって抽出されている。なお、
本実施形態では、入力文を解析する自然言語解析システ
ム部２と理想訳文を解析する理想訳文解析部４を具備す
るため、入力文理想訳文単語表現対は、入力文および理
想訳文の解析構造が抽出されているが、それぞれの単語
表現そのものの対を保持する方法も考えられる。FIG. 8 shows an analysis structure obtained from the input sentence analysis structure corresponding to the input sentence word expression "buy" and an ideal translation sentence analysis structure corresponding to the ideal translated word expression "want to buy". Analysis structure obtained from the input sentence analysis structure corresponding to the word expression pair of the analysis structure, the input sentence word expression "car", and the analysis structure obtained from the ideal translation sentence analysis structure corresponding to the ideal translation word expression "car" Are extracted by the bilingual expression pair extraction unit 51. In addition,
In the present embodiment, since the system includes the natural language analysis system unit 2 for analyzing the input sentence and the ideal translated sentence analyzing unit 4 for analyzing the ideal translated sentence, the input sentence ideal translated word expression pair has an analysis structure of the input sentence and the ideal translated sentence. Although extracted, a method of holding a pair of each word expression itself is also conceivable.

【００６２】上記のようにして得られた入力文理想訳文
単語表現対および入力文解析構造および理想訳文解析構
造を用いて、入力文省略要素抽出部５２によって入力文
中の省略要素が抽出される。Using the input sentence ideal translation sentence word expression pair, the input sentence analysis structure, and the ideal translation sentence analysis structure obtained as described above, the input sentence omission element extraction unit 52 extracts the omitted element in the input sentence.

【００６３】図９は、本発明の一実施形態の入力文省略
要素抽出に関する処理を示すフローチャートである。FIG. 9 is a flowchart showing processing relating to extraction of input sentence omitting elements according to an embodiment of the present invention.

【００６４】ステップＳ５２０１において、まず省略補
完要素対抽出部５の対訳表現対抽出部５２は、対訳表現
対抽出部５１の結果から入力文理想訳文単語表現対を取
り出す。図８の例では、「買いたい」と”want to buy
”に相当する解析構造の対と、「車」と”car ”に相
当する解析構造の対が取り出される。In step S5201, the bilingual expression pair extracting unit 52 of the omitted complementary element pair extracting unit 5 extracts an input sentence ideal translation word expression pair from the result of the bilingual expression pair extracting unit 51. In the example of FIG. 8, “want to buy” and “want to buy”
A pair of an analysis structure corresponding to "" and a pair of an analysis structure corresponding to "car" and "car" are extracted.

【００６５】次のステップＳ５２０２では、自然言語解
析システム部２の結果から入力文解析構造を取り出す。
図６の入力文に対しては図７に示す入力文解析構造が取
り出される。In the next step S5202, the input sentence analysis structure is extracted from the result of the natural language analysis system unit 2.
The input sentence analysis structure shown in FIG. 7 is extracted from the input sentence shown in FIG.

【００６６】ステップＳ５２０３では、次に理想訳文解
析部４の結果から理想訳文解析構造を取り出す。図６の
理想訳文に対しては図７に示す理想訳文解析構造が取り
出される。In step S5203, an ideal translation analysis structure is extracted from the result of the ideal translation analysis unit 4. The ideal translation analysis structure shown in FIG. 7 is extracted from the ideal translation shown in FIG.

【００６７】ステップＳ５２０４においては、ステップ
Ｓ５２０２で取り出した入力文解析構造から入力文部分
構造を順に取り出す。図７の入力文解析構造の例では、
「買いたい」に相当する部分構造、主格の省略フラグの
立っている要素に相当する部分構造、目的格の「車」に
相当する部分構造が取り出される。In step S5204, input sentence substructures are sequentially extracted from the input sentence analysis structure extracted in step S5202. In the example of the input sentence analysis structure in FIG.
The partial structure corresponding to "want to buy", the partial structure corresponding to the element with the nominative omission flag set, and the partial structure corresponding to the target case "car" are extracted.

【００６８】ステップＳ５２０５では、ステップＳ５２
０４で取り出した入力文部分構造に省略フラグがあるか
どうかが判定される。省略フラグ有りの判定があれば
（ＹＥＳ）、ステップＳ５２０６に移行し、なければ
（ＮＯ）、ステップＳ５２０７に移行する。上記の例で
は、主格の要素に相当する入力文部分構造は、省略フラ
グが立っているので、判定はＹＥＳとなり、ステップＳ
５２０６に移行する。また、「買いたい」に相当する入
力文部分構造および「車」に相当する部分構造は、とも
に省略フラグが立っていないので、判定はＮＯとなり、
ステップＳ５２０７に移行する。In step S5205, step S52
It is determined whether the input sentence substructure extracted in step 04 has an omission flag. If it is determined that the omission flag is present (YES), the process proceeds to step S5206; otherwise (NO), the process proceeds to step S5207. In the above example, since the omission flag is set for the input sentence substructure corresponding to the nominative element, the determination is YES, and step S
The process moves to 5206. In addition, since the omission flag is not set for both the input sentence partial structure corresponding to “want to buy” and the partial structure corresponding to “car”, the determination is NO.
It moves to step S5207.

【００６９】ステップＳ５２０６では、省略フラグの立
っている入力文部分構造について、省略要素として抽出
する。上記の例では、主格の省略フラグの立っている要
素に相当する部分構造が省略要素として抽出される。In step S5206, the input sentence partial structure with the omission flag set is extracted as an omission element. In the above example, the partial structure corresponding to the element with the nominative omission flag is extracted as the omission element.

【００７０】ステップＳ５２０７では、入力文部分構造
について、入力文理想訳文単語表現対を参照して、その
入力部分構造と対訳関係にある理想訳文部分構造が存在
するかどうか判定する。存在すれば（ＹＥＳ）、ステッ
プＳ５２０８に移行し、なければ（ＮＯ）、ステップＳ
５２１１に移行する。上記の例では、「買いたい」に相
当する入力文部分構造および「車」に相当する入力文部
分構造は、それぞれ”want to buy ”に相当する理想訳
文部分構造および”car ”に相当する理想訳文部分構造
が存在するのでともにＹＥＳとなりステップＳ５２０８
に移行する。また、主格の省略フラグの立っている要素
に相当する入力文部分構造は対訳関係にある理想訳文部
分構造が存在しないのでＮＯとなり、ステップＳ５２１
１に移行する。In step S5207, for the input sentence substructure, it is determined whether or not there is an ideal translated sentence substructure having a bilingual relationship with the input partial structure by referring to the input sentence ideal translated word expression pair. If it exists (YES), the process shifts to step S5208; otherwise (NO), step S5208
It moves to 5211. In the above example, the input sentence substructure corresponding to “want to buy” and the input sentence substructure corresponding to “car” are the ideal translated sentence substructure corresponding to “want to buy” and the ideal corresponding to “car”, respectively. Since both the translated sentence substructures exist, the answer is YES in step S5208.
Move to The input sentence partial structure corresponding to the element with the nominative omission flag is set to NO because there is no ideal translation sentence partial structure in a bilingual relationship.
Move to 1.

【００７１】ステップＳ５２０８では、入力文部分構造
について、入力文部分構造と隣接する入力文部分構造で
ある隣接入力文部分構造を抽出し、その入力文部分構造
と対訳関係にある理想訳文部分構造と隣接する理想訳文
部分構造である隣接理想訳文部分構造を抽出する。ここ
で、ある部分構造と隣接する部分構造とは、例えば、部
分構造と直接関係にある直下の部分構造のことを示す。
上記の例では、「買いたい」に相当する入力文部分構造
の隣接入力文部分構造は主格の省略フラグの立っている
要素に相当する入力文部分構造と、目的格の「車」に相
当する入力文部分構造であり、「車」に相当する入力文
部分構造の隣接入力文部分構造は存在しない。また、
「買いたい」と対訳関係にある”want to buy ”に相当
する理想訳文部分構造の隣接理想訳文部分構造は主格
の”Ｉ”に相当する理想訳文部分構造と目的格の”car
”に相当する理想訳文部分構造であり、「車」と対訳
関係にある”car ”に相当する理想訳文部分構造の隣接
理想訳文部分構造は所有格の”my”に相当する理想訳文
部分構造である。なお、ここでは、部分構造と直接関係
にある直下の部分構造のことを隣接する部分構造とした
が、この定義以外の部分構造も隣接部分構造とする方法
も考えられる。In step S5208, for the input sentence substructure, an adjacent input sentence substructure which is an input sentence substructure adjacent to the input sentence substructure is extracted, and an ideal translation sentence substructure having a bilingual relation with the input sentence substructure is extracted. An adjacent ideal translation partial structure that is an adjacent ideal translation partial structure is extracted. Here, the partial structure adjacent to a certain partial structure indicates, for example, a partial structure immediately below the partial structure that is directly related to the partial structure.
In the above example, the adjacent input sentence substructure of the input sentence substructure corresponding to “want to buy” corresponds to the input sentence substructure corresponding to the element with the nominative omission flag set and the target case “car”. This is an input sentence substructure, and there is no adjacent input sentence substructure of the input sentence substructure corresponding to “car”. Also,
The adjacent ideal translation partial structure corresponding to "want to buy" which is in parallel with "want to buy" is an ideal translation partial structure equivalent to the nominative "I" and a target car "car"
Is the ideal translation substructure corresponding to "car" and the ideal translation substructure adjacent to the ideal translation substructure corresponding to "car" in parallel with "car" is the ideal translation substructure corresponding to the possessive case "my" is there. Here, the partial structure immediately below the partial structure that is directly related to the partial structure is defined as an adjacent partial structure. However, a method of defining a partial structure other than this definition as an adjacent partial structure is also conceivable.

【００７２】ステップＳ５２０９では、ステップＳ５２
０８で取り出した入力文部分構造、この入力文部分構造
と対訳関係にある理想訳文部分構造およびその隣接理想
訳文部分構造について、その隣接入力文部分構造が存在
せず、その隣接理想訳文部分構造は１箇所存在し、その
隣接部分構造と対訳関係にある入力文部分構造が入力文
理想訳文単語表現対中に存在しないかどうか判断する。
この条件を満たせば（ＹＥＳ）、ステップＳ５２１０に
移行し、満たさなければ（ＮＯ）、ステップＳ５２１１
に移行する。上記の例では、「車」に相当する入力文部
分構造には隣接入力文部分構造は存在せず、「車」と対
訳関係にある”car ”に相当する理想訳文部分構造の隣
接理想訳文部分構造は所有格の”my”に相当する理想訳
文部分構造１箇所が存在し、図８の入力文理想訳文単語
表現対によると、”my”と対訳関係にある入力文部分構
造は存在しないためＹＥＳとなり、ステップＳ５２１０
に移行する。また、「買いたい」に相当する入力文部分
構造には２箇所の隣接入力文部分構造が存在するのでＮ
Ｏとなり、ステップＳ５２１１に移行する。In step S5209, step S52
In the input sentence substructure extracted in step 08, the ideal translated sentence substructure having a bilingual relationship with the input sentence substructure, and its adjacent ideal translated sentence substructure, the adjacent input sentence substructure does not exist. It is determined whether an input sentence substructure that exists at one location and has a bilingual relationship with its adjacent substructure does not exist in the input sentence ideal translation word expression pair.
If this condition is satisfied (YES), the flow shifts to step S5210, and if not (NO), step S5211.
Move to In the above example, there is no adjacent input sentence substructure in the input sentence substructure corresponding to “car”, and the adjacent ideal translation portion of the ideal translation substructure corresponding to “car” in parallel with “car” Since there is one ideal translated sentence partial structure corresponding to the possessive case “my”, and according to the input sentence ideal translated word expression pair shown in FIG. 8, there is no input sentence partial structure having a parallel translation relationship with “my”. YES, and step S5210
Move to Further, since there are two adjacent input sentence substructures in the input sentence substructure corresponding to “want to buy”, N
It becomes O, and it transfers to step S5211.

【００７３】ステップＳ５２１０では、ステップＳ５２
０９の条件を満たした隣接入力文部分構造を省略要素と
して抽出する。上記の例では、「車」の入力文部分構造
の隣接入力文部分構造を省略要素として抽出する。な
お、この場合は、「車」の入力文部分構造とその隣接入
力文部分構造との関係は、未定となる。In step S5210, step S52
The adjacent input sentence partial structure that satisfies the condition 09 is extracted as an omitted element. In the above example, the adjacent input sentence partial structure of the input sentence partial structure of “car” is extracted as an omitted element. In this case, the relationship between the input sentence partial structure of “car” and its adjacent input sentence partial structure is undecided.

【００７４】ステップＳ５２１１では、まだ検査してい
ない入力文部分構造が残っていないかどうか判断する。
最終的に、自然言語解析システム部２で生成された入力
文解析構造中の入力文部分構造すべてについて検査が終
了すると、この入力文省略要素抽出部５２の処理は終了
する。In step S5211, it is determined whether there is any input sentence substructure that has not been checked yet.
Finally, when the inspection is completed for all the input sentence substructures in the input sentence analysis structure generated by the natural language analysis system unit 2, the processing of the input sentence omitting element extraction unit 52 ends.

【００７５】図７の入力文解析構造に関しては、最終的
に入力文省略要素抽出部５２によって図１０に示す入力
文省略要素が抽出される。なお、図１０の例では、入力
文省略要素の入力文部分構造とそれを直接関係する１つ
上の入力文部分構造の対として入力文省略要素が抽出さ
れている。As for the input sentence analysis structure of FIG. 7, the input sentence omitted element shown in FIG. In the example of FIG. 10, the input sentence omitting element is extracted as a pair of the input sentence partial structure of the input sentence omitting element and the immediately preceding input sentence partial structure.

【００７６】次に、目的言語補完要素抽出部５３の処理
を説明する。図１１は、本発明の一実施形態の目的言語
補完要素抽出処理を示すフローチャートである。Next, the processing of the target language supplementary element extraction unit 53 will be described. FIG. 11 is a flowchart showing the target language supplementary element extraction processing according to one embodiment of the present invention.

【００７７】図１１の処理では、まずステップＳ５３０
１において、省略補完要素対抽出部５の目的言語補完要
素抽出部５３は、対訳表現対抽出部５１の結果から入力
文理想訳文単語表現対を取り出す。図８の例では、「買
いたい」と”want to buy ”に相当する解析構造の対
と、「車」と”car ”に相当する解析構造の対が取り出
される。In the process of FIG. 11, first, in step S530
In 1, the target language supplementary element extraction unit 53 of the omitted supplementary element pair extraction unit 5 extracts the input sentence ideal translation sentence word expression pair from the result of the bilingual expression pair extraction unit 51. In the example of FIG. 8, a pair of analysis structures corresponding to “want to buy” and “want to buy” and a pair of analysis structures corresponding to “car” and “car” are extracted.

【００７８】ステップＳ５３０２では、自然言語解析シ
ステム部２の結果から入力文解析構造を取り出す。図６
の入力文に対しては図７に示す入力文解析構造が取り出
される。ステップＳ５３０３では、理想訳文解析部４の
結果から理想訳文解析構造を取り出す。図６の理想訳文
に対しては図７に示す理想訳文解析構造が取り出され
る。ステップＳ５３０４では、入力文省略要素抽出部５
２の結果から入力文省略要素を取り出す。図６の入力文
に対しては図１０の入力文省略要素が取り出される。In step S5302, the input sentence analysis structure is extracted from the result of the natural language analysis system unit 2. FIG.
The input sentence analysis structure shown in FIG. In step S5303, an ideal translation analysis structure is extracted from the result of the ideal translation analysis unit 4. The ideal translation analysis structure shown in FIG. 7 is extracted from the ideal translation shown in FIG. In step S5304, the input sentence omitting element extraction unit 5
The input sentence omitting element is extracted from the result of 2. For the input sentence of FIG. 6, the input sentence omitting element of FIG. 10 is extracted.

【００７９】ステップＳ５３０５においては、ステップ
Ｓ５３０２で取り出した入力文解析構造から、ステップ
Ｓ５３０１で取り出された入力文理想訳文単語表現対中
に、対訳関係にある入力文部分構造が存在しない理想訳
文部分構造を順に取り出す。図７の理想訳文解析構造の
例では、”Ｉ”に相当する部分構造、”my”に相当する
部分構造が取り出される。In step S5305, based on the input sentence analysis structure extracted in step S5302, the ideal translated sentence partial structure in which no bilingual input sentence partial structure exists in the input sentence ideal translated word expression pair extracted in step S5301. In order. In the example of the ideal translation analysis structure in FIG. 7, a partial structure corresponding to "I" and a partial structure corresponding to "my" are extracted.

【００８０】ステップＳ５３０６においては、ステップ
Ｓ５３０５で取り出した理想訳文部分構造が代名詞表現
か定冠詞表現であるかどうか判定する。どちらかの表現
であれば（ＹＥＳ）、ステップＳ５３０７に移行し、そ
うでなければ（ＮＯ）、ステップＳ５３１２に移行す
る。上記の例では、”Ｉ”，”my”ともに代名詞表現で
あるので、判定はＹＥＳとなり、ステップＳ５３０７に
移行する。なお、この一実施形態では、代名詞表現と定
冠詞表現を条件としたが、それ以外を条件とする場合も
考えられる。In step S5306, it is determined whether the ideal translation partial structure extracted in step S5305 is a pronoun expression or a definite article expression. If it is either expression (YES), the process proceeds to step S5307; otherwise (NO), the process proceeds to step S5312. In the above example, since both “I” and “my” are pronoun expressions, the determination is YES, and the flow shifts to step S5307. In this embodiment, a pronoun expression and a definite article expression are used as conditions, but other conditions may be used.

【００８１】ステップＳ５３０７では、代名詞表現か定
冠詞表現である理想訳文部分構造と直接関係する１つ上
の理想訳文部分構造を抽出する。上記の例では、”Ｉ”
に対しては”want to buy ”に相当する理想訳文部分構
造が抽出され、”my”に対しては”car ”に相当する理
想訳文部分構造が抽出される。In step S5307, the next higher ideal translation partial structure that is directly related to the ideal translation partial structure that is a pronoun expression or a definite article expression is extracted. In the above example, "I"
For, an ideal translation partial structure corresponding to “want to buy” is extracted, and for “my”, an ideal translation partial structure corresponding to “car” is extracted.

【００８２】ステップＳ５３０８では、ステップＳ５３
０７で抽出された１つ上の理想訳文部分構造と対訳関係
にある入力文部分構造を入力文理想訳文単語表現対中か
ら抽出する。上記の例では、図８の入力文理想訳文単語
表現対を用いて、”Ｉ”の１つ上の理想訳文部分構造で
ある”want to buy ”に相当する部分構造からは、「買
いたい」に相当する入力文部分構造が抽出され、”my”
の１つ上の理想訳文部分構造である”car ”の部分構造
からは、「車」に相当する入力文部分構造が抽出され
る。At step S5308, at step S53
In step 07, the input sentence substructure that has a bilingual relationship with the immediately above ideal translation partial structure extracted in step 07 is extracted from the input sentence ideal translation word expression pair. In the above example, using the input sentence ideal translated word expression pair shown in FIG. 8, “I want to buy” from the partial structure corresponding to “want to buy” which is the ideal translated sentence partial structure immediately above “I”. The input sentence substructure corresponding to is extracted, and “my”
An input sentence partial structure corresponding to "car" is extracted from the partial structure of "car" which is the ideal translated sentence partial structure immediately above.

【００８３】ステップＳ５３０９では、ステップＳ５３
０８で抽出された入力文部分構造が抽出され、その入力
文部分構造がステップＳ５３０４で取り出した入力文省
略表現中の省略表現を含むものがあるかどうか判定す
る。その入力文部分構造を含む省略表現があれば（ＹＥ
Ｓ）、ステップＳ５３１０に移行し、そうでなければ
（ＮＯ）、ステップＳ５３１２に移行する。上記の例で
は、「買いたい」に相当する入力文部分構造を含む省略
表現、「車」に相当する入力文部分構造を含む省略表現
が存在するので、判定はＹＥＳとなり、ステップＳ５３
１０に移行する。In step S5309, step S53
In step S5304, it is determined whether the input sentence partial structure extracted in step 08 includes an abbreviation in the input sentence abbreviation extracted in step S5304. If there is an abbreviation that includes the input sentence substructure (YE
S), and proceeds to step S5310, otherwise (NO), proceeds to step S5312. In the above example, since there is an abbreviated expression including an input sentence substructure corresponding to “want to buy” and an abbreviated expression including an input sentence substructure corresponding to “car”, the determination is YES, and the determination is YES.
Move to 10.

【００８４】ステップＳ５３１０においては、ステップ
Ｓ５３０９で判定されたステップＳ５３０８で抽出され
た入力文部分構造を含む省略要素の中の構造が、ステッ
プＳ５３０５で取り出された代名詞表現か定冠詞表現で
ある理想訳文部分構造と矛盾するかどうか判定する。矛
盾する場合（ＹＥＳ）、ステップＳ５３１２に移行し、
矛盾しなければ（ＮＯ）、ステップＳ５３１１に移行す
る。In step S5310, the structure of the omitted element including the input sentence partial structure extracted in step S5308 determined in step S5309 is the ideal translated sentence portion which is a pronoun expression or a definite article expression extracted in step S5305. Determine if it conflicts with the structure. In the case of contradiction (YES), the flow shifts to step S5312,
If there is no contradiction (NO), the flow shifts to step S5311.

【００８５】上記の例では、”Ｉ”の理想訳文部分構造
は、”主体”の意味を持ち、”wantto buy ”と主格の
関係があるのに対し、”want to buy ”と対訳関係にあ
る「買いたい」の部分構造を含む省略要素の中の構造は
「買いたい」と主格の関係にある”主体”の意味を持つ
省略要素が存在することから両者は矛盾しないので、Ｎ
Ｏと判断され、ステップＳ５３１１に移行する。ま
た、”my”の理想訳文部分構造は、”主体”の意味を持
ち、その指示対象は”Ｉ”であり、”car ”と所有格の
関係があるのに対し、”car ”と対訳関係にある「車」
の部分構造を含む省略要素の中の構造は「車」と未定の
関係にある未定の意味を持つ省略要素が存在することか
ら両者は矛盾しないので、ＮＯと判断されステップＳ５
３１１に移行する。In the above example, the ideal translated sentence partial structure of “I” has the meaning of “subject” and has a nominative relationship with “wantto buy”, while a parallel translation with “want to buy” Since the structure of the omitted element including the partial structure of "want to buy" has an omitted element having the meaning of "subject" which has a nominative relationship with "want to buy", the two are not inconsistent.
It is determined to be O, and the flow shifts to step S5311. In addition, the ideal translation partial structure of “my” has the meaning of “subject” and its referent is “I”, which has a relation of possession with “car”, whereas it has a bilingual relation with “car”. "Cars"
In the structure of the omitted element including the partial structure of, there is an omitted element having an undetermined meaning that has an undetermined relationship with “car”, and therefore the two are not inconsistent.
It moves to 311.

【００８６】ステップＳ５３１１においては、ステップ
Ｓ５３１０で省略要素の構造と矛盾しないと判定された
理想訳部分構造を、その省略要素の目的言語補完要素と
して抽出する。上記の例では、”Ｉ”，”my" の両方の
理想訳部分構造とも、それぞれと矛盾しない省略要素の
目的言語補完要素として抽出する。In step S5311, the ideal translation partial structure determined as not inconsistent with the structure of the omitted element in step S5310 is extracted as the target language complement element of the omitted element. In the above example, both the ideal translation substructures of “I” and “my” are extracted as the target language complement elements of the omitted elements that do not contradict each other.

【００８７】ステップＳ５３１２では、まだ検査してい
ない理想訳文部分構造が残っていないかどうか判断す
る。最終的に、理想訳文解析部４で生成された理想訳文
解析構造中の理想訳文部分構造すべてについて検査が終
了すると、この目的言語補完要素抽出部５３の処理は終
了する。In step S5312, it is determined whether or not an ideal translated sentence substructure that has not been checked remains. Finally, when the inspection is completed for all the ideal translated sentence substructures in the ideal translated sentence analysis structure generated by the ideal translated sentence analysis unit 4, the processing of the target language supplementary element extraction unit 53 is completed.

【００８８】図７の入力文解析構造に関しては、最終的
に目的言語補完要素抽出部５３によって図１２に示す目
的言語補完要素が抽出される。なお、図１０の”my”を
補完する入力文省略要素には未定であった省略要素と
「車」との関係が、”my”と”car ”の関係である所有
格と認定されている。With respect to the input sentence analysis structure shown in FIG. 7, finally, the target language supplementary element shown in FIG. It should be noted that the relationship between the undetermined omitted element and the "car" in the input sentence omitted element that complements "my" in FIG. 10 is recognized as possessive, which is the relationship between "my" and "car". .

【００８９】次に、原言語補完要素抽出部５４の処理を
説明する。原言語補完要素抽出部５４は、目的言語補完
要素抽出部５３で抽出された目的言語補完要素を変換し
原言語補完要素を生成する。この目的言語補完要素から
原言語補完要素への変換には、原言語と目的言語の対訳
辞書を用いる手法等が考えられる。Next, the processing of the source language supplementary element extraction unit 54 will be described. The source language supplementary element extracting unit 54 converts the target language supplementary element extracted by the target language supplementary element extracting unit 53 to generate a source language supplementary element. For conversion from the target language complement element to the source language complement element, a method using a bilingual dictionary of the source language and the target language can be considered.

【００９０】図１２の個々の目的言語補完要素に対して
は図１３に示す原言語補完要素が原言語補完要素抽出部
５４によって生成される。For each target language complementary element in FIG. 12, a source language complementary element shown in FIG.

【００９１】上記のようにして省略補完要素対抽出部５
で得られた入力文省略要素、目的言語補完要素、原言語
補完要素の対である省略補完要素対は、図１の省略要素
補完規則生成変更部６によって、省略要素補完規則が生
成され、省略要素補完規則３２に新規追加されたり、省
略要素補完規則３２に既に存在する省略要素補完規則を
変更したりする。図１３の省略補完要素対につては、最
終的に図１４に示すようにルールエントリが図５の省略
要素補完規則に追加される。図１４の例では日本語が動
詞「買う」が“希望”の様相を伴っており、「買う」の
主格が省略されており、「買う」の目的格が「車」で、
「車」の所有格が省略されている場合に、「買う」の主
格に「私」を補完し、「車」の所有格に「私の」を補完
するというルールが追加されている。As described above, the omitted complementary element pair extraction unit 5
The omitted element complement rule, which is a pair of the input sentence omitted element, the target language complement element, and the source language complement element obtained in the above, is generated by the omitted element complement rule generation changing unit 6 in FIG. The new rule is added to the element complementing rule 32, or the omitted element complementing rule already existing in the omitted element complementing rule 32 is changed. For the omitted complementary element pair of FIG. 13, a rule entry is finally added to the omitted element complementary rule of FIG. 5, as shown in FIG. In the example of FIG. 14, the Japanese verb "buy" has the appearance of "hope", the nominative case of "buy" is omitted, the purpose case of "buy" is "car",
When the ownership of "car" is omitted, a rule has been added that complements "me" to the nomination of "buy" and complements "my" to the ownership of "car".

【００９２】なお、省略要素補完規則生成変更部６に、
省略補完要素対抽出部５で抽出された入力文省略要素、
目的言語補完要素、原言語補完要素の対である省略補完
要素対を格納し、省略要素補完規則を学習する対象であ
る対訳関係にある入力文および理想訳文の対の集合から
抽出された省略補完要素対の集合を用いて、最も整合性
がよく効果的な省略要素補完規則を抽出する方法も考え
られる。Note that the omitted element complementing rule generation changing unit 6
Input sentence omitted element extracted by the omitted complementing element pair extraction unit 5,
The abbreviation completion extracted from the set of pairs of the input sentence and the ideal translation sentence that are the target of learning the abbreviation element completion rule, which stores the abbreviation completion element pair that is the pair of the target language completion element and the source language completion element A method of extracting the most consistent and effective omitted element complementing rule using a set of element pairs is also conceivable.

【００９３】上記のようにして図１の基本ブロック構成
の自然言語解析システム用省略要素補完規則学習システ
ムの１回の処理は終了となる。しかし、同じ文の複数の
問題が混在している場合、１回の走行ではそのすべてを
抽出できるとは限らない上に、追加変更した辞書やルー
ルが追加変更する前と比べて悪くなる可能性もある。そ
こで、学習終了条件を設定して終了条件が満たされるま
で、図１の省略要素対抽出部５および省略要素補完規則
生成変更部６を繰り返す必要がある。学習終了判定の１
案として自然言語解析システム部が生成する原言語解析
構造の履歴を保持する原言語解析構造履歴格納部を設
け、その原言語解析構造履歴格納部が定常状態かどうか
を検査することが挙げられる。As described above, one process of the omission element complementing rule learning system for the natural language analysis system having the basic block configuration of FIG. 1 is completed. However, when multiple problems with the same sentence are mixed, not all of them can be extracted in one run, and the dictionary or rule that has been changed may be worse than before the addition was changed. There is also. Therefore, it is necessary to set the learning end condition and repeat the omitted element pair extraction unit 5 and the omitted element complementing rule generation changing unit 6 in FIG. 1 until the end condition is satisfied. 1 of learning end judgment
As a proposal, a source language analysis structure history storage unit that holds the history of the source language analysis structure generated by the natural language analysis system unit is provided, and it is checked whether the source language analysis structure history storage unit is in a steady state.

【００９４】繰り返し実行する学習システムの手順を図
１５のフローチャートを用いて説明する。１回目は上記
に示したと同様の方法で学習をする。その時点で図１５
の対訳文を読み込むステップＳ９１と図１で示した省略
補完要素対抽出部５および省略要素補完規則生成部６に
より省略要素補完規則３２を追加変更するステップＳ９
２は終了している。次に、ステップＳ９３において、ス
テップＳ９２で追加変更された省略要素補完規則３２を
用いて自然言語解析システム部２で再度解析する。次
に、学習終了条件としてステップＳ９４で保存されてい
る解析構造の記録内容が定常状態かどうかを判定する。
最初は、１回目の走行しかしていないので定常状態とは
言えない。従って、判定はＮＯとなり、ステップＳ９２
へと進む。２回目以降、ステップＳ９２、ステップＳ９
３がステップＳ９４により解析構造の保持内容が定常状
態に達するまで繰り返し実行される省略要素補完規則３
２の追加修正が行われる。このように、省略補完要素対
抽出部５および省略要素補完規則生成変更部６を繰り返
し行うと上記対訳文から抽出すべき省略要素補完規則は
なくなり、解析構造が定常状態になるので学習はその時
点で終了し、最終的な省略要素補完規則が抽出される。The procedure of the repeatedly executed learning system will be described with reference to the flowchart of FIG. At the first time, learning is performed in the same manner as described above. At that point, FIG.
Step S9 of reading the bilingual sentence of step S9, and step S9 of additionally changing the omitted element complementing rule 32 by the omitted complementary element pair extracting unit 5 and the omitted element complementing rule generating unit 6 shown in FIG.
2 is over. Next, in step S93, the natural language analysis system unit 2 analyzes again using the omitted element complementing rule 32 that has been added and changed in step S92. Next, it is determined whether or not the recorded contents of the analysis structure stored in step S94 as a learning end condition are in a steady state.
At first, it can not be said that it is in a steady state because only the first run has been made. Therefore, the determination is NO, and step S92
Proceed to. Steps S92 and S9 after the second time
3 is omitted until the content of the analysis structure reaches a steady state in step S94.
Two additional modifications are made. As described above, if the omitted complementary element pair extracting unit 5 and the omitted element complementary rule generation changing unit 6 are repeatedly performed, there is no omitted element complementary rule to be extracted from the bilingual sentence, and the analysis structure is in a steady state. And a final omitted element complementing rule is extracted.

【００９５】本発明は、上記の実施形態に限定されるこ
となく、特許請求の範囲内で種々変更または応用可能で
ある。The present invention is not limited to the above embodiments, but can be variously modified or applied within the scope of the claims.

【００９６】以上を要約すると、本実施形態において、
対訳文読み込み部１によって互いに対訳関係にある第１
の自然言語の文（入力文）と第２の自然言語の文（理想
訳文）を読み込み、以下の各処理を行うことにより省略
要素補完規則を自動的に新規作成し、追加修正すること
ができる。To summarize the above, in this embodiment,
First translations that are bilingual with each other by the bilingual sentence reading unit 1
By reading the natural language sentence (input sentence) and the second natural language sentence (ideal translation sentence), and performing the following processing, a new omitted element complementing rule can be automatically created and additionally corrected. .

【００９７】すなわち、省略補完要素対抽出部５におい
ては、入力文中から省略された単語および／または表現
である省略要素を抽出し、理想訳文中から省略要素に補
完される単語および／または表現である目的言語補完要
素を抽出し、省略要素と目的言語補完要素を基に省略要
素と省略要素に補完すべき第１の自然言語の単語および
／または表現である原言語補完要素の対である省略補完
要素対を抽出する。省略要素補完規則生成変更部６は、
省略補完要素対抽出部５で抽出される省略補完要素対を
基に省略要素補完規則を生成変更する。これにより、対
訳文中の入力文中の省略表現を自然言語解析システムで
解析するための省略要素補完規則を自動的に作成変更す
ることができる。That is, the omitted complementing element pair extracting unit 5 extracts the omitted word and / or the omitted element which is the expression from the input sentence, and uses the word and / or expression complemented to the omitted element from the ideal translation. An abbreviation, which is a pair of a source language complement element that is a first natural language word and / or expression to be complemented to an abbreviation element and an abbreviation element based on the abbreviation element and the target language complement element, by extracting a certain target language complement element Extract complementary element pairs. The omitted element complementing rule generation changing unit 6
Based on the omitted complementary element pairs extracted by the omitted complementary element pair extraction unit 5, an omitted element complementing rule is generated and changed. This makes it possible to automatically create and change an abbreviation element complementing rule for analyzing an abbreviation in an input sentence in a bilingual sentence with a natural language analysis system.

【００９８】また、省略補完要素対抽出部５に、入力文
から省略された要素である省略要素を抽出する入力文省
略要素抽出部５２と理想訳文から省略要素に補完される
単語および／または表現である目的言語補完要素を抽出
する目的言語補完要素抽出部５３とを設けることができ
る。これにより、原言語の文中から省略要素を抽出する
技術や目的言語の文中から代名詞表現等の定表現等、原
言語の文中での省略要素を目的言語の文に翻訳される傾
向にある要素を抽出する技術を、省略補完要素対抽出部
５で流用することができる。The omitted supplementary element pair extracting section 5 includes an input sentence omitted element extracting section 52 for extracting an omitted element, which is an element omitted from the input sentence, and a word and / or expression to be complemented from the ideal translation sentence to the omitted element. And a target language complementary element extraction unit 53 that extracts the target language complementary element. As a result, elements that tend to translate abbreviated elements in the source language sentence into sentences in the target language, such as techniques for extracting abbreviated elements from the source language sentence and fixed expressions such as pronoun expressions from the target language sentence, can be used. The extraction technique can be used in the omitted complementary element pair extraction unit 5.

【００９９】また、省略補完要素対抽出部５に、入力文
と理想訳文から入力文中の単語および／または表現であ
る入力単語表現と入力単語表現と対訳関係にある理想訳
文中の単語および／または表現である理想訳単語表現の
対である入力文理想訳単語表現対を抽出する対訳表現対
抽出部５１を設けることができる。これにより、入力文
と対訳関係にある理想訳文中の単語および表現を、対訳
関係にある表現が入力文中にあるものと、ないものに分
類することができるので、入力文中の省略要素に補完す
べき要素か否かの判断がより正確に行うことができる。
また、理想訳文中の省略要素が補完すべき要素と対訳関
係にある入力文中の要素が存在する場合には、同一文内
に補完要素がある省略現象を抽出することができる。Further, the abbreviation complementary element pair extraction unit 5 inputs words and / or expressions in the input sentence from the input sentence and the ideal translated sentence, and words and / or words in the ideal translated sentence having a bilingual relationship with the input word expression. A translation expression pair extraction unit 51 that extracts an input sentence ideal translation word expression pair that is an ideal translation word expression pair can be provided. This makes it possible to classify words and expressions in the ideal translation having a bilingual relationship with the input sentence into those having a bilingual relationship in the input sentence and those having no bilingual expression in the input sentence. The determination as to whether or not the element should be performed can be made more accurately.
Further, when there is an element in the input sentence that has a bilingual relationship with the element to be complemented by the omitted element in the ideal translation, it is possible to extract an omission phenomenon in which the complementary element is present in the same sentence.

【０１００】また、省略補完要素対抽出部５に、目的言
語補完要素抽出部５３で抽出される目的言語補完要素を
基に原言語での補完要素を抽出する原言語補完要素抽出
部５４を設けることができる。これにより、原言語であ
る第１の自然言語のみで省略要素の位置とその省略要素
に補うべき要素を抽出することができる。Further, the omitted language supplementary element pair extraction unit 5 is provided with a source language complement element extraction unit 54 for extracting a complement element in the source language based on the target language complement element extracted by the target language complement element extraction unit 53. be able to. This makes it possible to extract the position of the omitted element and the element to be supplemented by the omitted element only in the first natural language that is the source language.

【０１０１】入力文解析部は、入力文を解析して入力文
解析構造を生成し、入力文解析構造を利用して省略要素
および／または目的言語補完要素および／または原言語
補完要素を抽出することができる。これにより、入力文
を解析することで初めて判断できる省略要素が抽出でき
るとともに、解析の結果判明したこの省略要素に補完で
きる要素への制約条件を用いてより正確に目的言語補完
要素や原言語補完要素を抽出することができる。The input sentence analysis unit analyzes the input sentence to generate an input sentence analysis structure, and extracts an omitted element and / or a target language complement element and / or a source language complement element using the input sentence analysis structure. be able to. This makes it possible to extract omitted elements that can be determined for the first time by analyzing the input sentence, and to more accurately use target language complement elements and source language complement elements by using the constraints on elements that can be complemented by these omitted elements that were found as a result of the analysis. Elements can be extracted.

【０１０２】また、入力文解析部において、入力文中の
省略要素を補完する入力文省略要素補完部２２を含むこ
とができる。これにより、個々の入力文中の省略要素
を、まだ補完されておらず新規に補完規則が必要な省略
要素であるか、既に補完されているが誤った要素が補完
されている省略要素であるか、既に補完されており正し
い要素が補完されている省略要素であるかを区別するこ
とができ、きめ細かな省略要素補完規則を抽出すること
ができる。Further, the input sentence analyzing section may include an input sentence omitting element complementing section 22 for complementing an omitted element in the input sentence. As a result, whether the abbreviation element in each input sentence is an abbreviation element that has not been completed yet and requires a new completion rule, or is an abbreviation element that has already been completed but an erroneous element has been completed Thus, it is possible to distinguish whether or not the omitted element that has already been complemented and the correct element has been complemented, and a detailed omitted element complementing rule can be extracted.

【０１０３】また、入力文解析部において、入力文の解
析に本学習方法で抽出される省略要素補完規則を使う自
然言語解析システムを利用することができる。これによ
り、本自然言語解析システムの解析傾向を直接、省略要
素および／または目的言語補完要素および／または原言
語補完要素の抽出に活かすことができる。In the input sentence analysis unit, a natural language analysis system that uses the omitted element complementing rules extracted by the present learning method for analyzing the input sentence can be used. As a result, the analysis tendency of the natural language analysis system can be directly used for extracting the omitted element and / or the target language complementary element and / or the source language complementary element.

【０１０４】理想訳文解析部４は、理想訳文を解析して
理想訳文解析構造を生成し、理想訳文解析構造を利用し
て省略要素および／または目的言語補完要素および／ま
たは原言語補完要素を抽出することができる。これによ
り、理想訳文中から入力文中の省略要素と対訳関係にあ
る要素となりやすい要素である目的言語補完要素の候補
およびそれと対訳関係にある原言語補完要素の候補をよ
り正確に抽出できるとともに、それらの候補をもとに入
力文中の省略要素をより正確に抽出することができる。The ideal translated sentence analyzing unit 4 analyzes the ideal translated sentence to generate an ideal translated sentence analyzing structure, and extracts the omitted element and / or the target language complementary element and / or the source language complementary element using the ideal translated sentence analyzing structure. can do. This makes it possible to more accurately extract, from the ideal translation, the candidate for the target language complement element, which is an element that is likely to be a parallel translation element with the omitted element in the input sentence, and the candidate for the source language complement element that has a parallel translation with it. , The omitted element in the input sentence can be extracted more accurately.

【０１０５】また、理想訳文解析部４において、理想訳
文中の省略要素を補完する理想訳文省略要素補完部４２
を含むことができる。これにより、入力文中の省略要素
に補完すべき要素が理想訳文中でも省略されていた場合
でも、理想訳文省略要素補完部４２によって補完された
場合には、その補完された要素を入力文の省略要素に補
完すべき要素として抽出することができる。In the ideal translated sentence analyzing section 4, an ideal translated sentence omitting element complementing section 42 for complementing an omitted element in the ideal translated sentence is provided.
Can be included. Thereby, even if the element to be complemented to the omitted element in the input sentence is also omitted in the ideal translation, if the ideal translation sentence omitting element complementing section 42 complements the element, the complemented element is omitted from the input sentence. Can be extracted as an element to be complemented.

【０１０６】また、理想訳文解析部４において、理想訳
文中の代名詞等の照応表現の指示対象を決定する理想訳
文照応解析部４３を含むことができる。これにより、入
力文中の省略要素に補完すべき要素が理想訳文中に存在
し、それが照応表現である場合には、理想訳文解析部４
で決定された照応表現の指示対象の情報も抽出すること
ができ、よりきめ細かな省略要素補完規則を抽出するこ
とができる。また、入力文中の省略要素に補完すべき要
素が理想訳文中に存在し、それが照応表現であり、理想
訳文照応解析部４３で決定されたその理想訳文中の指示
対象が同一文内または文章中の他の文に存在し、この理
想訳文中の指示対象と対訳関係にある入力文中の要素が
決定された場合には、この指示対象と対訳関係にある入
力文内の要素が省略要素に補完すべき要素であることが
決定でき、省略要素への文内補完規則や文間補完規則を
抽出することができる。The ideal translated sentence analyzing unit 4 can include an ideal translated sentence anaphor analyzing unit 43 for determining a target of an anaphor expression such as a pronoun in an ideal translated sentence. Thus, if an element to be supplemented to the omitted element in the input sentence exists in the ideal translation and is an anaphoric expression, the ideal translation analyzer 4
The information of the pointing target of the anaphor expression determined in the above can also be extracted, and a more detailed omitted element complementing rule can be extracted. Also, an element to be supplemented to the omitted element in the input sentence exists in the ideal translation, which is an anaphor expression, and the target in the ideal translation determined by the ideal translation sentence analytic analysis unit 43 is in the same sentence or a sentence. If an element in the input sentence that is present in another sentence and is in parallel with the referent in this ideal translation is determined, the element in the input sentence that is in parallel with this referent is omitted. It can be determined that the element should be complemented, and the in-sentence complement rules and inter-sentence complement rules for the omitted elements can be extracted.

【０１０７】また、省略要素補完規則生成変更部６に、
入力文解析部で得られる入力文解析構造を利用して省略
要素補完規則を生成修正することができる。これによ
り、入力文の解析構造に応じたきめ細かな省略要素補完
規則を生成変更することができる。更に、この入力文解
析部での解析に本学習方法で抽出される省略要素補完規
則を使う自然言語解析システムを利用することにより、
本自然言語解析システムの出力する解析構造を直接活用
して省略要素補完規則を効率的に生成することができ
る。Further, the omitted element complementing rule generation changing unit 6
By using the input sentence analysis structure obtained by the input sentence analysis unit, the omitted element complementing rule can be generated and corrected. As a result, it is possible to generate and change a detailed omitted element complementing rule according to the analysis structure of the input sentence. Furthermore, by using a natural language analysis system that uses the abbreviation element completion rules extracted by this learning method for analysis in this input sentence analysis unit,
By directly utilizing the analysis structure output by the natural language analysis system, it is possible to efficiently generate the omitted element completion rules.

【０１０８】また、省略要素補完規則生成変更部６にお
いて、省略補完要素対抽出部５で抽出される省略補完要
素対を省略補完要素対格納部へ記録する省略要素補完対
記録処理を含み、学習対象となる対訳関係にある前記入
力文と前記理想訳文の対の集合を基に蓄積された省略補
完要素対格納部中の省略補完要素対の集合を利用して省
略要素補完規則を生成変更することができる。これによ
り、学習対象となる対訳関係にある入力文と理想訳文の
対の集合から抽出される様々な省略補完要素対の結果を
考慮にいれて、その学習対象に対して、最も効果的で整
合性の良い省略要素補完規則を抽出することができる。Further, the omission element complementing rule generation / change unit 6 includes an omission element complementation pair recording process of recording the omission complement element pair extracted by the omission complement element pair extraction unit 5 into the omission complement element pair storage unit. Generating and changing an abbreviation element complementing rule using a set of abbreviated complementation element pairs stored in an abbreviated complementation element pair storage unit based on a set of pairs of the input sentence and the ideal translation sentence having a target bilingual relationship. be able to. As a result, taking into account the results of various omitted complementary element pairs extracted from the set of pairs of the input sentence and the ideal translated sentence that are in the bilingual relationship to be learned, the most effective and matching It is possible to extract a good omitted element complement rule.

【０１０９】本省略要素補完規則学習方法によって得ら
れる前記省略要素補完規則を用いて入力文を解析し、省
略要素補完規則学習方法を繰り返すことによって、生成
修正された省略要素補完規則の検証ができ、更に１回の
学習過程では得られなかった省略要素補完規則を抽出す
ることも可能となる。The input sentence is analyzed using the omitted element completion rule obtained by the omitted element completion rule learning method, and the generated and modified omitted element completion rule can be verified by repeating the omitted element completion rule learning method. Further, it is possible to extract the omitted element complementing rules that cannot be obtained in one learning process.

【０１１０】終了条件判定処理は、学習終了条件を自動
的に判定する。これによって、学習結果を人が見ずに自
動的に学習が終了する。In the termination condition determination process, the learning termination condition is automatically determined. As a result, the learning is automatically terminated without the person viewing the learning result.

【０１１１】原言語解析構造記録処理は、上記省略要素
補完規則を用いて入力文を解析した結果である原言語解
析構造の履歴である原言語解析構造履歴を原言語解析構
造履歴格納部へ記録する。これによって、原言語解析構
造記録処理より得られた原言語解析構造履歴格納部への
記録内容が定常状態になったとき学習が自動的に終了
し、最適な辞書やルールが登録される。In the source language analysis structure recording processing, the source language analysis structure history, which is the history of the source language analysis structure obtained as a result of analyzing the input sentence using the above omitted element completion rule, is recorded in the source language analysis structure history storage unit. I do. Thus, when the contents of the source language analysis structure history storage unit obtained by the source language analysis structure recording process have reached a steady state, the learning is automatically terminated, and the optimal dictionary and rules are registered.

【０１１２】[0112]

【発明の効果】以上説明したように、本発明によれば、
入力文と理想訳文を読み込み、入力文および理想訳文か
ら入力文中の省略要素を抽出し、理想訳文中から省略要
素に補完される目的言語補完要素を抽出し、省略要素と
目的言語補完要素を基に省略要素と該省略要素に補完す
べき原言語補完要素の対である省略補完要素対を抽出
し、この抽出された省略補完要素対を基に省略要素補完
規則を生成変更するので、従来のように人手によること
なく、入力文中の省略要素を補完するのに適した省略要
素補完規則を自動的に蓄積することができる。As described above, according to the present invention,
The input sentence and the ideal translation are read, the omitted elements in the input sentence are extracted from the input sentence and the ideal translation, the target language complement element complemented by the omitted element is extracted from the ideal translated sentence, and the target language complement element is extracted based on the omitted element and the target language complement element. The abbreviation element pair, which is a pair of the abbreviation element and the source language complement element to be complemented with the abbreviation element, is extracted, and the abbreviation element completion rule is generated and changed based on the extracted abbreviation complement element pair. As described above, it is possible to automatically accumulate the omitted element complementing rules suitable for complementing the omitted element in the input sentence without manual operation.

【０１１３】また、省略要素補完規則生成変更処理によ
り得られた省略要素補完規則を用いて省略要素補完規則
学習方法を繰り返すことにより、登録された省略要素補
完規則を検証でき、更に１回の学習過程では得られなか
った省略要素補完規則を抽出することもできる。Further, by repeating the omitted element complementing rule learning method using the omitted element complementing rule obtained by the omitted element complementary rule generation changing process, the registered omitted element complementing rule can be verified, and one more learning can be performed. It is also possible to extract omitted element complement rules that were not obtained in the process.

【０１１４】更に、入力文解析処理で得られた原言語解
析構造を原言語解析構造格納手段に記録することによ
り、原言語解析構造記録処理により得られた原言語解析
構造格納手段への記録内容が定常状態になったときに学
習を自動的に終了し、最適な省略要素補完規則を登録す
ることができる。Further, the source language analysis structure obtained by the input sentence analysis processing is recorded in the source language analysis structure storage means, so that the contents of the source language analysis structure storage means obtained by the source language analysis structure storage processing are recorded. Is automatically terminated when the steady state is reached, and an optimal omitted element complementing rule can be registered.

[Brief description of the drawings]

【図１】本発明の一実施形態に係る自然言語解析装置の
構成を示すブロック図である。FIG. 1 is a block diagram illustrating a configuration of a natural language analysis device according to an embodiment of the present invention.

【図２】本発明の一実施形態の追加変更される省略要素
補完規則を用いた自然言語解析装置の基本構成を示すブ
ロック図である。FIG. 2 is a block diagram showing a basic configuration of a natural language analysis apparatus using an omitted and omitted element complementing rule according to an embodiment of the present invention.

【図３】図２の自然言語解析装置に使用されている構文
意味解析辞書の構成を示す図である。FIG. 3 is a diagram showing a configuration of a syntactic and semantic analysis dictionary used in the natural language analysis device of FIG. 2;

【図４】図２の自然言語解析装置に使用されている構文
意味解析ルールの構成を示す図である。FIG. 4 is a diagram showing a configuration of a syntactic and semantic analysis rule used in the natural language analysis device of FIG. 2;

【図５】図１の自然言語解析装置に使用されている省略
要素補完規則の構成を示す図である。FIG. 5 is a diagram showing a configuration of an omitted element complementing rule used in the natural language analysis device of FIG. 1;

【図６】図１の自然言語解析装置に入力される対訳文の
一例を示す図である。FIG. 6 is a diagram illustrating an example of a bilingual sentence input to the natural language analysis device of FIG. 1;

【図７】図１の自然言語解析装置における入力文と理想
訳文の概念構造を示す図である。FIG. 7 is a diagram showing a conceptual structure of an input sentence and an ideal translated sentence in the natural language analysis device of FIG. 1;

【図８】図１の自然言語解析装置における入力文理想訳
文単語表現対を示す図である。8 is a diagram showing an input sentence ideal translation sentence word expression pair in the natural language analysis device of FIG. 1;

【図９】図１の自然言語解析装置における入力文省略要
素抽出部の処理を示すフローチャートである。FIG. 9 is a flowchart showing processing of an input sentence omitting element extraction unit in the natural language analysis device of FIG. 1;

【図１０】図１の自然言語解析装置における入力文省略
要素を示す図である。FIG. 10 is a diagram showing input sentence omitting elements in the natural language analysis device of FIG. 1;

【図１１】図１の自然言語解析装置における目的言語補
完要素抽出部の処理を示すフローチャートである。FIG. 11 is a flowchart illustrating a process of a target language supplementary element extraction unit in the natural language analysis device of FIG. 1;

【図１２】図１の自然言語解析装置における入力文省略
要素と目的言語補完要素を示す図である。12 is a diagram showing an input sentence omitting element and a target language complementing element in the natural language analysis device of FIG. 1;

【図１３】図１の自然言語解析装置における入力文省略
要素と目的言語補完要素と原言語補完要素を示す図であ
る。13 is a diagram showing an input sentence omitting element, a target language complementing element, and a source language complementing element in the natural language analyzer of FIG. 1;

【図１４】図１の自然言語解析装置における省略要素補
完規則の追加変更処理を行った結果得られる省略要素補
完規則の例を示す図である。FIG. 14 is a diagram illustrating an example of an omitted element complementing rule obtained as a result of performing a process of adding and changing an omitted element complementing rule in the natural language analysis device of FIG. 1;

【図１５】本発明の他の実施形態の学習処理を示すフロ
ーチャートである。FIG. 15 is a flowchart illustrating a learning process according to another embodiment of the present invention.

[Explanation of symbols]

１対訳文読み込み部２自然言語解析システム部３規則４理想訳文解析部５省略補完要素対抽出部６省略要素補完規則生成変更部７第１自然言語入力部８第１自然言語解析構造出力部２１入力文構文意味解析部２２入力文省略要素補完部３１構文意味解析規則３２省略要素補完規則４１理想訳文構文意味解析部４２理想訳文省略要素補完部４３理想訳文照応解析部５１対訳表現対抽出部５２入力文省略要素抽出部５３目的言語補完要素抽出部５４原言語補完要素抽出部３１１構文意味解析辞書３１２構文意味解析ルール DESCRIPTION OF SYMBOLS 1 Bilingual sentence reading part 2 Natural language analysis system part 3 Rule 4 Ideal translation sentence analysis part 5 Omission complement element pair extraction part 6 Omission element complement rule generation change part 7 First natural language input part 8 First natural language analysis structure output part 21 Input sentence syntactic and semantic analysis unit 22 Input sentence elimination element complement unit 31 Syntax and semantic analysis rule 32 Omission element complement rule 41 Ideal translation sentence syntactic and semantic analysis unit 42 Ideal translation sentence elimination element complement unit 43 Ideal translation sentence analytic analysis unit 51 Bilingual expression pair extraction unit 52 Input sentence elimination element extraction unit 53 Target language complement element extraction unit 54 Source language complement element extraction unit 311 Syntax / meaning analysis dictionary 312 Syntax / meaning analysis rule

Claims

[Claims]

1. An input sentence of a first natural language that is a source language is analyzed, a source language analysis structure that is an analysis structure of the input sentence is generated, and an omitted element complementing rule for complementing an omitted element is generated. An input means for inputting an ideal translation of a target language which is a second natural language having a bilingual relationship with the input sentence, and an input for extracting an omitted element omitted from the input sentence. Sentence omitted element extracting means, target language complementary element extracting means for extracting a target language complementary element to be complemented to the omitted element from the ideal translation, and a target language complementary element extracted by the target language complementary element extracting means. Source-language complementing element extraction means for extracting a source-language complementing element, and an omission-element complementing rule for additionally changing the abbreviation-element complementing rule based on the pair of abbreviation-complementary elements, which is a pair of the target language complement and the source language complement Generate A natural language analysis device comprising: a change unit.

2. An input sentence of a first natural language which is a source language is analyzed, a source language analysis structure which is an analysis structure of the input sentence is generated, and an omitted element complementing rule for complementing an omitted element is generated. A natural language analysis apparatus for changing, wherein reading means for reading the input sentence and an ideal translation of a second natural language which are bilingually related to each other, and an abbreviation element omitted from the input sentence from the input sentence and the ideal translation Is extracted from the ideal translation, and a target language complement element to be complemented by the omitted element is extracted. Based on the omitted element and the target language complement element, the omitted element and the source language complement to be complemented by the omitted element are extracted. An omitted complementing element pair extracting means for extracting an omitted complementary element pair which is a pair of elements, and an omitted element for generating and changing the omitted element complementing rule based on the omitted complementary element pair extracted by the omitted complementary element pair extracting means. Supplement A natural language analysis device comprising: perfect rule generation changing means.

3. An abbreviation complementary element pair extracting means, comprising: an input sentence abbreviation element extracting means for extracting an abbreviated element omitted from the input sentence; and a target language complement element complemented from the ideal translation by the abbreviation element. 3. The natural language analysis device according to claim 2, further comprising a target language complementary element extraction unit for extracting.

4. An abbreviation complementary element pair extracting means, comprising: a pair of an input word expression in the input sentence and an ideal translation word expression in the ideal translation having a bilingual relationship with the input word expression from the input sentence and the ideal translation. 3. A bilingual expression pair extracting means for extracting an input sentence ideal translation word expression pair as follows.
Or the natural language analyzer according to 3.

5. The source language complement element extraction unit for extracting the source language complement element based on the target language complement element extracted by the target language complement element extraction unit, wherein the omitted complement element pair extraction unit comprises: 5. The method according to claim 3, wherein
Natural language analyzer as described.

6. An input sentence analyzing means for analyzing the input sentence read by the bilingual sentence reading means to generate an input sentence analysis structure, and utilizing the input sentence analysis structure, The natural language analysis device according to claim 2, wherein a target language complement element and / or the source language complement element are extracted.

7. The natural language analysis device according to claim 6, wherein the input sentence analyzing means includes an input sentence omitting element complementing means for complementing an omitting element in the input sentence.

8. An ideal translated sentence analyzing means for analyzing the ideal translated sentence read by the bilingual reading means to generate an ideal translated sentence analyzing structure, and utilizing the ideal translated sentence analyzing structure, the omitted element and the object. 3. A language completion element and / or the source language completion element is extracted.
8. The natural language analyzer according to any one of claims 7 to 7.

9. The natural language analysis device according to claim 8, wherein said ideal translation analyzing means includes ideal translation omitting element complementing means for complementing omitting elements in said ideal translation.

10. The natural language analysis device according to claim 8, wherein the ideal translation analysis unit has an ideal translation anaphor analysis unit that determines a target of an anaphor expression such as a pronoun in the ideal translation. .

11. The omission element complementing rule generation and change means generates and changes the omitted element complementation rule using the input sentence analysis structure obtained by the input sentence analysis means. The natural language analysis device according to any one of the above.

12. The omission element complementing rule generation / change unit includes an omission complement element pair storage unit that records the omission complement element pair extracted by the omission complement element pair extraction unit, and stores the omission complement element pair storage unit. The omitted element complementing rule is generated and changed using a set of the omitted complementary element pairs stored in the omitted complementary element pair storage means based on a set of a pair of the input sentence and the ideal translation sentence. The natural language analysis device according to any one of claims 2 to 11, wherein

13. The input sentence omitting element complementing means complements an omitting element in the input sentence by using the omitting element complementing rule, and repeatedly executes an omitting element complementing rule learning method. The natural language analysis device according to any one of the above.

14. The natural language analysis device according to claim 13, wherein said omitted element complementing rule generation changing means has a learning end determining means for determining the end of learning.

15. A source language analysis structure history storage unit that records a source language analysis structure history that is a history of the source language analysis structure that is a result of analyzing the input sentence using the omitted element completion rule, 15. The natural language analysis device according to claim 14, wherein the learning end determination unit ends the learning when the recorded content of the source language analysis structure history storage unit is in a steady state.