JPH06203068A

JPH06203068A - Machine translation system

Info

Publication number: JPH06203068A
Application number: JP4347561A
Authority: JP
Inventors: Hidezo Kugimiya; 秀造釘宮
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1992-12-28
Filing date: 1992-12-28
Publication date: 1994-07-22

Abstract

PURPOSE:To provide a machine translation system capable of performing than heretofore. translation more naturally CONSTITUTION:This system is provided with modules 11, 12 which generate a structural analysis tree by analyzing a text of source language, memory 18 which stores a rule to convert the structural analysis tree to that of target language, a module 13 which converts the structural analysis tree of the source language to that of the target language by referring to the memory 18, a module 21 to detect a word which satisfies a condition decided in advance in the structural analysis tree of the target language, and a module 22 which prepares the text of the target language by eliminating a detected word based on the structural analysis tree of the target language.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は機械翻訳装置に関し、
特に、言語の構造の相違に基づく不自然な翻訳文の生成
を防止することができる機械翻訳装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a machine translation device,
In particular, the present invention relates to a machine translation device that can prevent unnatural translations from being generated due to differences in language structure.

【０００２】[0002]

【従来の技術】従来の機械翻訳装置では、キーボードか
ら入力されたソース言語のテキストを、メインＣＰＵ
（中央処理装置）の制御により翻訳モジュールに入力す
る。入力されたソース言語のテキストは、機械翻訳装置
に備えられたメモリに記憶されているメイン辞書やユー
ザ辞書からなる辞書と、文法規則と、木構造変換規則と
を利用してターゲット言語のテキストに翻訳される。2. Description of the Related Art In a conventional machine translation device, a source language text input from a keyboard is transferred to a main CPU.
Input to the translation module under the control of the central processing unit. The input source language text is converted into a target language text by using a dictionary including a main dictionary and a user dictionary stored in a memory provided in the machine translation device, a grammar rule, and a tree structure conversion rule. Will be translated.

【０００３】[0003]

【発明が解決しようとする課題】しかしこのような従来
の機械翻訳装置では、ソース言語の単語とターゲット言
語の単語との間に１対１の対応がつけられているため
に、生成されたターゲット言語のテキストが不自然なも
のになるという問題点があった。図１９は、そのような
問題点を示す一例である。図１９に示される例は英文を
日本語に翻訳した場合の例である。However, in such a conventional machine translation device, since there is a one-to-one correspondence between a word in the source language and a word in the target language, a generated target is generated. There was a problem that the language text became unnatural. FIG. 19 is an example showing such a problem. The example shown in FIG. 19 is an example in which an English sentence is translated into Japanese.

【０００４】図１９を参照して、英文では“ｙｏｕ”が
２箇所、“ｙｏｕｒ”が１箇所出現する。翻訳して得ら
れた日本語の文でも、これらに対応して「あなた」とい
う語が全部で３回出現する。Referring to FIG. 19, in English, "you" appears in two places and "your" appears in one place. In the translated Japanese sentence, the word "you" appears three times in total.

【０００５】英語の文中の“ｙｏｕ”のような代名詞な
どの単語は、英語の構造上不可欠なものである。しか
し、英語以外の言語では必ずしもそれが当てはまらな
い。たとえば日本語では、話し手の間で主語が明確であ
る場合には、主語を省略するのが普通である。そのため
に日本語のような言語では、英語の代名詞などをそのま
ま訳語に置換えた場合には、訳文としては冗長で不自然
になることがあるという問題点がある。したがってこの
ような各言語に特有の問題をも考慮にいれて、原文と翻
訳文との間に１対１の関係をつけずにすむような、より
柔軟な翻訳を行なうことが望まれている。できればその
ような翻訳は容易に行なえることが望ましい。Words such as pronouns such as "you" in English sentences are indispensable in the structure of English. However, this is not always the case in languages other than English. For example, in Japanese, it is common to omit the subject if the subject is clear among the speakers. Therefore, in a language such as Japanese, if an English pronoun or the like is directly replaced by a translated word, the translated sentence may be redundant and unnatural. Therefore, in consideration of such a problem peculiar to each language, it is desired to perform more flexible translation so as not to make a one-to-one relationship between the original sentence and the translated sentence. . If possible, such translation should be easy.

【０００６】それゆえに請求項１に記載の発明の目的
は、従来よりも柔軟な翻訳を行なうことができる機械翻
訳装置を提供することである。Therefore, an object of the invention described in claim 1 is to provide a machine translation device capable of more flexible translation than ever.

【０００７】請求項２に記載の発明の目的は、従来より
も柔軟な翻訳を容易に行なうことができる機械翻訳装置
を提供することである。An object of the invention described in claim 2 is to provide a machine translation device capable of easily performing a more flexible translation than the conventional one.

【０００８】請求項３に記載の発明の目的は、冗長な表
現が少ない、より自然な翻訳を容易に行なえる機械翻訳
装置を提供することである。An object of the invention described in claim 3 is to provide a machine translation device with few redundant expressions and capable of easily performing a more natural translation.

【０００９】請求項４に記載の発明の目的は、英語か
ら、自然な表現の日本語への翻訳を容易に行なうことが
できる機械翻訳装置を提供することである。An object of the invention described in claim 4 is to provide a machine translation device capable of easily translating English into a Japanese with a natural expression.

【００１０】請求項５に記載の発明の目的は、英語では
表現されても日本語では表現不要な代名詞などの単語が
日本語のテキスト上に出現することを防止でき、英語か
ら、自然な表現の日本語への翻訳を容易に行なうことが
できる機械翻訳装置を提供することである。The object of the invention described in claim 5 is to prevent words, such as pronouns, which are expressed in English but are unnecessary in Japanese, from appearing in Japanese text, and the natural expression from English can be obtained. It is an object of the present invention to provide a machine translation device capable of easily translating Japanese into Japanese.

【００１１】請求項６に記載の発明の目的は、英語では
表現されても日本語では表現不要だと使用者が判断した
代名詞などの単語が日本語のテキスト中に出現すること
を防止でき、英語から、自然な表現の日本語への翻訳を
容易に行なうことができる機械翻訳装置を提供すること
である。The object of the invention as set forth in claim 6 is to prevent words, such as pronouns, judged by the user that they are expressed in English but need not be expressed in Japanese, from appearing in Japanese text. It is an object of the present invention to provide a machine translation device that can easily translate English into a natural expression in Japanese.

【００１２】[0012]

【課題を解決するための手段】請求項１に記載の機械翻
訳装置は、入力されるソース言語のテキストを解析し、
ソース言語の構造解析木を生成するための手段と、生成
された構造解析木を、予め定めるターゲット言語の構造
解析木に変換するための変換規則を格納するための第１
の格納手段と、ソース言語の構造解析木を、変換規則を
参照してターゲット言語の構造解析木に変換するための
手段と、ターゲット言語の構造解析木中の、予め定める
条件を満たす単語を検出するための検出手段と、ターゲ
ット言語の構造解析木に基づいて、検出された単語を除
外してターゲット言語のテキストを生成するための生成
手段とを含む。A machine translation device according to claim 1 analyzes an input source language text,
First means for generating a source language structural analysis tree and a conversion rule for converting the generated structural analysis tree into a predetermined target language structural analysis tree
Storage means, a means for converting a source language structure analysis tree into a target language structure analysis tree by referring to conversion rules, and detecting a word satisfying a predetermined condition in the target language structure analysis tree. And a generating unit for generating a target language text by excluding the detected words based on the target language structure analysis tree.

【００１３】請求項２に記載の機械翻訳装置は、請求項
１に記載の機械翻訳装置であって、検出手段は、予め定
める条件を規定する除外規則を格納するための第２の格
納手段と、ターゲット言語の構造解析木中の各単語に対
して、第２の格納手段に格納されている除外規則のいず
れかを適用可能かどうかを判断するための判断手段と、
判断手段により除外規則のいずれかが適用可能と判断さ
れた単語に関連して、所定の標識を準備するための手段
とを含み、生成手段は、標識が準備された単語を除外し
てターゲット言語のテキストを生成する。A machine translation device according to a second aspect is the machine translation device according to the first aspect, wherein the detecting means includes a second storing means for storing an exclusion rule defining a predetermined condition. Determining means for determining whether any of the exclusion rules stored in the second storage means can be applied to each word in the structure analysis tree of the target language,
Means for preparing a predetermined marker in relation to a word for which any of the exclusion rules is determined to be applicable by the judging means, and the generating means excludes the word for which the marker is prepared and excludes the target language. Produces the text of.

【００１４】請求項３に記載の機械翻訳装置は、請求項
２に記載の機械翻訳装置であって、除外規則は、ターゲ
ット言語の構造解析木中に含まれる、意味的に重複する
複数個の単語の少なくとも１つを除外対象として指定す
る除外規則を含む。A machine translation apparatus according to a third aspect is the machine translation apparatus according to the second aspect, wherein the exclusion rules are a plurality of semantically overlapping words included in the structural analysis tree of the target language. It includes an exclusion rule that specifies at least one of the words as an exclusion target.

【００１５】請求項４に記載の機械翻訳装置は、請求項
２に記載の装置であって、ソース言語は英語であり、タ
ーゲット言語は日本語である。A machine translation device according to a fourth aspect is the device according to the second aspect, wherein the source language is English and the target language is Japanese.

【００１６】請求項５に記載の機械翻訳装置は、請求項
４に記載の装置であって、その除外規則は、ターゲット
言語の構造解析木中に含まれる、特定の代名詞を除外対
象として指定する除外規則を含む。A machine translation device according to a fifth aspect is the device according to the fourth aspect, wherein the exclusion rule specifies a specific pronoun included in the structural analysis tree of the target language as an exclusion target. Includes exclusion rules.

【００１７】請求項６に記載の機械翻訳装置は、請求項
５に記載の装置であって、特定の代名詞を入力するため
の手段をさらに含む。The machine translation device according to claim 6 is the device according to claim 5, further comprising means for inputting a specific pronoun.

【００１８】[0018]

【作用】請求項１に記載の機械翻訳装置では、入力され
るソース言語のテキストからソース言語の構造解析木が
生成され、生成された構造解析木は、第１の格納手段に
格納されている変換規則によってターゲット言語の構造
解析木に変換される。ターゲット言語の構造解析木中
の、予め定める条件を満たす単語が検出されると、この
検出された単語を除外してターゲット言語のテキストが
生成される。In the machine translation device according to the first aspect, the source language structure analysis tree is generated from the input source language text, and the generated structure analysis tree is stored in the first storage means. It is converted into a structure analysis tree of the target language according to the conversion rule. When a word satisfying a predetermined condition is detected in the target language structure analysis tree, the target language text is generated by excluding the detected word.

【００１９】請求項２に記載の機械翻訳装置では、入力
されるソース言語のテキストからソース言語の構造解析
木が生成され、生成された構造解析木は、第１の格納手
段に格納されている変換規則によってターゲット言語の
構造解析木に変換される。変換後の構造解析木中の各単
語に対して、第２の格納手段に格納されている除外規則
のいずれかが適用可能かどうかが判断される。適用可能
な除外規則がある場合にはその単語に関連して所定の標
識が準備され、その標識が準備された単語を除外してタ
ーゲット言語のテキストが生成される。In the machine translation device according to the second aspect, a source language structure analysis tree is generated from the input source language text, and the generated structure analysis tree is stored in the first storage means. It is converted into a structure analysis tree of the target language according to the conversion rule. It is determined whether or not any of the exclusion rules stored in the second storage means is applicable to each word in the converted structure analysis tree. If there is an applicable exclusion rule, a predetermined indicator is prepared in relation to the word, and the word in which the indicator is prepared is excluded to generate the text in the target language.

【００２０】請求項３に記載の機械翻訳装置では、入力
されるソース言語のテキストからソース言語の構造解析
木が生成され、生成された構造解析木は、第１の格納手
段に格納されている変換規則によってターゲット言語の
構造解析木に変換される。変換後の構造解析木中の各単
語に対して、第２の格納手段に格納されている除外規則
のいずれかが適用可能かどうかが判断される。除外規則
は、ターゲット言語の構造解析木中に含まれる、意味的
に重複する複数個の単語の少なくとも１つを除外対象と
して指定する除外規則を含む。適用可能な除外規則があ
る場合にはその単語に関連して所定の標識が準備され、
その標識が準備された単語を除外してターゲット言語の
テキストが生成される。In the machine translation device according to the third aspect, the source language structure analysis tree is generated from the input source language text, and the generated structure analysis tree is stored in the first storage means. It is converted into a structure analysis tree of the target language according to the conversion rule. It is determined whether or not any of the exclusion rules stored in the second storage means is applicable to each word in the converted structure analysis tree. The exclusion rule includes an exclusion rule that specifies at least one of a plurality of semantically overlapping words included in the structure analysis tree of the target language as an exclusion target. If there is an applicable exclusion rule, a certain sign will be prepared in relation to that word,
The target language text is generated excluding the words for which the indicator was prepared.

【００２１】請求項４に記載の機械翻訳装置では、入力
される英語のテキストから英語の構造解析木が生成さ
れ、生成された構造解析木は、第１の格納手段に格納さ
れている変換規則によって日本語の構造解析木に変換さ
れる。変換語の構造解析木中の各単語に対して、第２の
格納手段に格納されている除外規則のいずれかが適用可
能かどうかが判断される。適用可能な除外規則がある場
合にはその単語に関連して所定の標識が準備され、その
標識が準備された単語を除外して日本語のテキストが生
成される。In the machine translation device according to the fourth aspect, an English structural analysis tree is generated from the input English text, and the generated structural analysis tree is the conversion rule stored in the first storage means. Is converted into a Japanese structural analysis tree. It is determined whether any of the exclusion rules stored in the second storage means is applicable to each word in the structure analysis tree of the conversion word. If there is an applicable exclusion rule, a predetermined marker is prepared in association with the word, and the word for which the marker is prepared is excluded to generate Japanese text.

【００２２】請求項５に記載の機械翻訳装置では、入力
される英語のテキストから英語の構造解析木が生成さ
れ、生成された構造解析木は、第１の格納手段に格納さ
れている変換規則によって日本語の構造解析木に変換さ
れる。変換後の構造解析木中の各単語に対して、第２の
格納手段に格納されている除外規則のいずれかが適用可
能かどうかが判断される。適用可能な除外規則がある場
合、たとえば特定の代名詞が発見された場合には、その
特定の代名詞に関連して所定の標識が準備され、その特
定の代名詞を除外して日本語のテキストが生成される。In the machine translation device according to the fifth aspect, an English structural analysis tree is generated from the input English text, and the generated structural analysis tree is the conversion rule stored in the first storage means. Is converted into a Japanese structural analysis tree. It is determined whether or not any of the exclusion rules stored in the second storage means is applicable to each word in the converted structure analysis tree. When there is an applicable exclusion rule, for example, when a specific pronoun is found, a certain sign is prepared in relation to that specific pronoun and the specific pronoun is excluded to generate Japanese text. To be done.

【００２３】請求項６に記載の機械翻訳装置では、入力
される英語のテキストから英語の構造解析木が生成さ
れ、生成された構造解析木は、第１の格納手段に格納さ
れている変換規則によって日本語の構造解析木に変換さ
れる。変換後の構造解析木中の各単語に対して、第２の
格納手段に格納されている除外規則のいずれかが適用可
能かどうかが判断される。適用可能な除外規則がある場
合、たとえば予め使用者によって入力された特定の代名
詞が発見された場合には、その特定の代名詞に関連して
所定の標識が準備され、その特定の代名詞を除外して日
本語のテキストが生成される。In the machine translation device according to the sixth aspect, an English structural analysis tree is generated from the input English text, and the generated structural analysis tree is the conversion rule stored in the first storage means. Is converted into a Japanese structural analysis tree. It is determined whether or not any of the exclusion rules stored in the second storage means is applicable to each word in the converted structure analysis tree. If there is an applicable exclusion rule, for example, if a specific pronoun previously entered by the user is found, a predetermined sign is prepared in association with that specific pronoun to exclude that specific pronoun. Japanese text is generated.

【００２４】[0024]

【実施例】以下、本発明の一実施例の機械翻訳装置を説
明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A machine translation device according to an embodiment of the present invention will be described below.

【００２５】実施例の説明の前に、機械翻訳の概念につ
いて簡単に説明する。第１図を参照して、機械翻訳にお
いて行なわれる解析処理には、様々な解析レベルがあ
る。機械翻訳は、図１の左上に示されるソース言語が入
力されると、各レベルの処理を順に行なって最終的に図
１の右側に示されるターゲット言語を得るための処理で
ある。たとえばソース言語が入力されると、まずレベル
Ｌ１の辞書引き処理、レベルＬ２の形態素解析処理、レ
ベルＬ３の構文解析処理、…と処理が進められ、最終的
にレベルＬ１０の形態素生成処理が行なわれてターゲッ
ト言語が生成される。Before explaining the embodiments, the concept of machine translation will be briefly described. Referring to FIG. 1, the analysis processing performed in machine translation has various analysis levels. The machine translation is a process for inputting the source language shown in the upper left of FIG. 1 to sequentially perform the processing of each level to finally obtain the target language shown on the right side of FIG. For example, when a source language is input, first, a level L1 dictionary lookup process, a level L2 morphological analysis process, a level L3 syntactic analysis process, and so on are performed, and finally a level L10 morpheme generation process is performed. The target language is generated.

【００２６】機械翻訳は、どのレベルの解析処理まで行
なうかによって、大きく次の２つに分けられる。第１
は、レベルＬ６に示される中間言語まで解析し、そこか
らターゲット言語を生成していくピボット方式である。
第２は、上述のレベルＬ２〜レベルＬ５のいずれかまで
解析を行なってソース言語の内部構造を得、次に、得ら
れたソース言語の内部構造と同じレベルのターゲット言
語の内部構造に変換した後、ターゲット言語を生成する
トランスファ方式である。Machine translation is roughly divided into the following two types, depending on which level of analysis processing is performed. First
Is a pivot method in which an intermediate language shown at level L6 is analyzed and a target language is generated therefrom.
Secondly, the internal structure of the source language is obtained by performing analysis to any of the above-mentioned levels L2 to L5, and then converted to the internal structure of the target language at the same level as the obtained internal structure of the source language. Later, it is a transfer method that generates a target language.

【００２７】ピボット方式において用いられる中間言語
とは、ソース言語およびターゲット言語のどちらにも依
存しない概念である。したがってソース言語の解析処理
を一度行なって中間言語を得れば、この中間言語から複
数の言語を生成することができるため、多言語間翻訳に
有利であるとされる。しかし、このようなピボット方式
では、その基本的概念である中間言語を本当に設定でき
るかどうかという点が未解決である。The intermediate language used in the pivot system is a concept that does not depend on either the source language or the target language. Therefore, if the source language is analyzed once to obtain an intermediate language, a plurality of languages can be generated from this intermediate language, which is advantageous for multilingual translation. However, in such a pivot method, it is still unsolved whether the intermediate language which is the basic concept can be really set.

【００２８】トランスファ方式はピボット方式のこのよ
うな問題に対する妥協案であって、現在の多くのシステ
ムはこのトランスファ方式を採っている。以下の説明は
トランスファ方式についてのものであり、後述する実施
例の機械翻訳装置もトランスファ方式を用いたものであ
る。The transfer method is a compromise solution to such a problem of the pivot method, and many current systems adopt this transfer method. The following description is for the transfer method, and the machine translation device of the embodiment described later also uses the transfer method.

【００２９】以下、図１に示される各解析処理の内容に
ついて説明する。（１）辞書引き、形態素解析この処理においては、入力された文章に対して、形態素
が格納された辞書を参照しながらたとえば最長一致法な
どを用いて形態素列（単語列）に分割する処理がまず行
なわれる。そして、得られた各単語に対する品詞などの
文法情報および訳語を得、さらに時制・人称・数などを
解析する処理が行なわれる。The contents of each analysis process shown in FIG. 1 will be described below. (1) Dictionary lookup, morpheme analysis In this process, the process of dividing the input sentence into morpheme strings (word strings) using the longest matching method or the like while referring to the dictionary in which the morphemes are stored First done. Then, a process of obtaining grammatical information such as a part of speech and a translated word for each of the obtained words and further analyzing tense, personal name, and number are performed.

【００３０】（２）構文解析この処理においては、品詞・変化形などに基づき、単語
間のかかり受けなどの文章の構造（構造解析木）を組立
て、決定する処理が行なわれる。この処理においては、
得られた文章の構造が正しい意味を表わすかどうかにつ
いての判断は行なわれない。(2) Syntactic Analysis In this process, a process of assembling and deciding a sentence structure (structural analysis tree) such as inter-word crossing based on the part-of-speech / variant, etc. is performed. In this process,
No judgment is made as to whether the structure of the obtained sentence represents the correct meaning.

【００３１】（３）意味解析構文解析処理の結果得られた複数の構造解析木から、意
味的に正しいものとそうでないものとを判別し、正しい
もののみを採用する処理が行なわれる。(3) Semantic Analysis From a plurality of structure analysis trees obtained as a result of the syntactic analysis processing, semantically correct and improper ones are discriminated and only the correct ones are adopted.

【００３２】（４）文脈解析文脈解析処理では、入力された文章の話題を理解し、入
力文章中に含まれる省略部分や曖昧さなどを取去る処理
が行なわれる。(4) Context Analysis In the context analysis process, a process of understanding the topic of the input sentence and removing abbreviated parts and ambiguities contained in the input sentence is performed.

【００３３】以下で説明する本発明の一実施例の機械翻
訳装置の翻訳モジュールにおいては、レベルＬ３までの
解析処理を行なうものとする。すなわち、本実施例の機
械翻訳装置の翻訳モジュールの機能ブロックは、図２に
示されるような構造を有する。この翻訳モジュールは、
図２を参照して、入力モジュール１０から与えられるソ
ース言語のテキストに対して辞書引き・形態素解析処理
を行なうための辞書引き・形態素解析モジュール１１
と、形態素解析されたテキストに対して構文解析を行な
うための構文解析モジュール１２と、構文解析の結果を
変換してターゲット言語の構造解析木を生成するための
構文変換モジュール１３と、構文変換モジュール１３に
よって生成されたターゲット言語の構造解析木に基づ
き、ターゲット言語の翻訳文を生成するための翻訳文生
成モジュール１４とを含む。各モジュール１１〜１４で
行なわれる処理では、メモリ６の格納内容が参照される
が、各処理の内容については、実施例において詳しく説
明する。In the translation module of the machine translation apparatus according to the embodiment of the present invention described below, it is assumed that analysis processing up to level L3 is performed. That is, the functional block of the translation module of the machine translation device of this embodiment has a structure as shown in FIG. This translation module
Referring to FIG. 2, dictionary lookup / morpheme analysis module 11 for performing dictionary lookup / morpheme analysis processing on the source language text provided from input module 10.
A syntactic analysis module 12 for performing syntactic analysis on the morphologically analyzed text; a syntactic conversion module 13 for converting the syntactic analysis result to generate a structural analysis tree of the target language; and a syntactic conversion module. And a translation sentence generation module 14 for generating a translation sentence of the target language based on the structure analysis tree of the target language generated by 13. The contents stored in the memory 6 are referred to in the processes performed by the respective modules 11 to 14, and the contents of the respective processes will be described in detail in the embodiments.

【００３４】翻訳文生成モジュール１４は、構文変換モ
ジュール１３によって生成されたターゲット言語の構造
解析木に対し、訳語変換ルールを適用し、省略可能な単
語に対して省略フラグに所定の値を設定するための訳語
変換ルール適用モジュール２１と、訳語変換ルール適用
モジュール２１によって設定された省略フラグを参照し
て、ターゲット言語の構造解析木に基づいて、省略可能
な語を省略しながらターゲット言語の翻訳文を生成する
ための訳語生成モジュール２２とを含む。訳語生成モジ
ュール２２によって生成されたターゲット言語の翻訳文
は、出力モジュール１７を介してプリンタや外部ファイ
ルなどに出力される。The translated sentence generation module 14 applies the translated word conversion rule to the structure analysis tree of the target language generated by the syntax conversion module 13, and sets a predetermined value in the omission flag for the omissible word. The translation word conversion rule application module 21 and the abbreviation flag set by the translation word conversion rule application module 21 are referred to, and the translation text of the target language is omitted while omitting the optional words based on the structure analysis tree of the target language. And a translation generation module 22 for generating. The translated sentence in the target language generated by the translated word generation module 22 is output to a printer, an external file, or the like via the output module 17.

【００３５】メモリ６は、辞書メモリ１８と、バッファ
メモリ１９と、訳語変換ルールメモリ２０とを含む。メ
モリ６には、図示されてはいないが文法規則や木構造変
換規則を格納するメモリも含まれる。The memory 6 includes a dictionary memory 18, a buffer memory 19 and a translation conversion rule memory 20. The memory 6 also includes a memory (not shown) for storing grammar rules and tree structure conversion rules.

【００３６】図３は、この発明の言語処理装置の一実施
例の機械翻訳装置のブロック図である。図３を参照し
て、この機械翻訳装置は、メインＣＰＵ（中央処理装
置）１と、メインＣＰＵ１が接続されたバス７と、バス
７に接続されたメインメモリ２と、バス７に接続された
ＣＲＴ（陰極線管）やＬＣＤ（液晶表示装置）などから
なる表示装置３と、キーボード４と、バス７に接続され
た前述の翻訳モジュール５と、翻訳モジュール５に接続
された翻訳用の辞書・文法規則および木構造変換規則な
どの知識ベースを格納している前述のメモリ６とを含
む。FIG. 3 is a block diagram of a machine translation apparatus which is an embodiment of the language processing apparatus of the present invention. With reference to FIG. 3, this machine translation device is connected to a main CPU (central processing unit) 1, a bus 7 to which the main CPU 1 is connected, a main memory 2 connected to the bus 7, and a bus 7. A display device 3 including a CRT (cathode ray tube) or LCD (liquid crystal display device), a keyboard 4, the above-mentioned translation module 5 connected to a bus 7, and a translation dictionary / grammar connected to the translation module 5. And the above-mentioned memory 6 storing a knowledge base such as rules and tree structure conversion rules.

【００３７】翻訳モジュール５は、ソース言語の文章が
入力されると、それを所定の手順で翻訳してターゲット
言語を出力するためのものである。図４を参照して、ハ
ードウェアとして、翻訳モジュール５は、バス７に接続
され、バス７を介して入力されるソース言語（本実施例
の場合には英語）を、所定の翻訳プログラムに従って翻
訳してターゲット言語（本実施例の場合には日本語）と
してバス７に出力するための翻訳ＣＰＵ１５と、バス７
に接続され、翻訳ＣＰＵ１５で実行される翻訳プログラ
ムを格納するための翻訳プログラムメモリ１６と、入力
されたソース言語の原文を各単語ごとに格納するための
バッファＡと、バッファＡに格納された各単語につき、
メモリ６に含まれる辞書を参照して得た各単語の品詞、
訳語などの情報を格納するためのバッファＢと、ソース
言語の構造解析木に関する情報を格納するためのバッフ
ァＣと、バッファＣに格納されたソース言語の構造解析
木が変換されたターゲット言語の構造解析木を格納する
ためのバッファＤと、バッファＤに格納された日本語の
構造解析木に適切な助詞や助動詞などを補充して、日本
語の形として整えられた文章を格納するためのバッファ
Ｅとを含む。When the source language sentence is input, the translation module 5 translates the sentence in a predetermined procedure and outputs the target language. Referring to FIG. 4, as a hardware, translation module 5 is connected to bus 7 and translates a source language (English in the case of this embodiment) input via bus 7 according to a predetermined translation program. Then, the translation CPU 15 for outputting to the bus 7 as the target language (Japanese in this embodiment) and the bus 7
Connected to the translation CPU 15, a translation program memory 16 for storing a translation program executed by the translation CPU 15, a buffer A for storing an input source language original sentence for each word, and a buffer A for storing each word. Per word,
Part of speech of each word obtained by referring to the dictionary included in the memory 6,
A buffer B for storing information such as a translated word, a buffer C for storing information about a source language structure analysis tree, and a structure of a target language obtained by converting the source language structure analysis tree stored in the buffer C. A buffer D for storing a parse tree, and a buffer for storing a sentence prepared as a Japanese form by supplementing the Japanese structural parse tree stored in the buffer D with appropriate particles and auxiliary verbs. Including E and.

【００３８】以下、図３〜図９を参照して、本実施例の
機械翻訳装置による英日翻訳の動作を説明する。なお、
翻訳ＣＰＵ１５で実行される翻訳プログラムの詳細につ
いては後に詳述する。The operation of the English-Japanese translation by the machine translation apparatus of this embodiment will be described below with reference to FIGS. In addition,
The details of the translation program executed by the translation CPU 15 will be described later.

【００３９】まず、読込まれた原文は形態素解析によっ
て形態素に分解され、図５に示されるようにバッファＡ
（図４参照）に格納される。First, the read original sentence is decomposed into morphemes by morphological analysis, and as shown in FIG.
(See FIG. 4).

【００４０】続いて翻訳プログラム１６に基づく翻訳Ｃ
ＰＵ１５の制御の下に、図２に示される辞書引き・形態
素解析部１１によって、バッファＡに格納された原文の
各単語につき、メモリ６に格納されている辞書を参照す
ることにより各単語の訳語や品詞情報などの情報が得ら
れる。この情報は図４に示されるバッファＢに格納され
る。これらの情報の一部として、各単語の品詞情報が含
まれるが、これら品詞情報は図６に示されるように格納
される。すなわち“ｔｈｉｓ”は多品詞語であって代名
詞、指示形容詞の２つの品詞を持つ。また“ｉｓ”の品
詞は動詞である。同様に“ａ”、“ｐｅｎ”についても
それぞれの品詞がバッファＢに格納される。“ｔｈｉ
ｓ”は多品詞語であるが、文中の品詞が何であるかにつ
いては、図２に示される構文解析部１２に相当する処理
によって一意に決定される。Subsequently, a translation C based on the translation program 16
Under the control of the PU 15, the dictionary lookup / morpheme analysis unit 11 shown in FIG. 2 refers to the dictionary stored in the memory 6 for each word of the original sentence stored in the buffer A, and translates each word. And information such as part-of-speech information can be obtained. This information is stored in buffer B shown in FIG. Part of this information includes the part-of-speech information of each word, and the part-of-speech information is stored as shown in FIG. That is, "this" is a multi-part-of-speech word and has two parts of speech, a pronoun and a referential adjective. The part of speech of "is" is a verb. Similarly, the parts of speech for "a" and "pen" are also stored in the buffer B. "Thi
Although s "is a multi-part-of-speech word, what is the part-of-speech in the sentence is uniquely determined by the process corresponding to the parsing unit 12 shown in FIG.

【００４１】翻訳プログラムのうち図２に示される構文
解析モジュール１２に相当する処理においては、メモリ
６に格納された辞書および文法規則に従って、各単語間
のかかり受け関係を示す構造解析木がたとえば図７に示
されるように決定される。この構文解析結果は図４のバ
ッファＣに格納される。In the process corresponding to the syntactic analysis module 12 shown in FIG. 2 of the translation program, a structural analysis tree showing the inter-word relations between words according to the dictionary and grammatical rules stored in the memory 6 is shown in FIG. 7 is determined. The result of this parsing is stored in the buffer C in FIG.

【００４２】構造解析木の決定は次のようにして行なわ
れる。メモリ６に格納された文法規則のうち、英語に関
する文法規則は次のようなものから成り立っている。The structure analysis tree is determined as follows. Among the grammatical rules stored in the memory 6, the grammatical rules for English are made up of the following.

【００４３】文→主部、述部主部→名詞句述部→動詞、名詞句名詞句→代名詞名詞句→冠詞、名詞この規則のうちたとえば１つ目の規則は、「文は主部と
述部からできている。」ということを表わす。他の規則
についても同様である。これらの規則に従って構造解析
木が決定されるのである。なお、このような文法規則は
同じように日本語についても用意されており、英語の文
法規則と日本語の文法規則との間で対応づけがなされて
いる。Sentence → main part, predicate main part → noun phrase predicate → verb, noun phrase noun phrase → pronoun noun phrase → article, noun For example, the first rule of these rules is that "a sentence is a main part It is made up of predicates. " The same applies to other rules. The structural analysis tree is determined according to these rules. It should be noted that such grammatical rules are similarly prepared for Japanese, and correspondence is made between English grammatical rules and Japanese grammatical rules.

【００４４】翻訳プログラムのうち、図２に示される構
文変換モジュール１３に相当する処理においては、入力
された英文の構造解析木（図７参照）の構造が、図８に
示される日本文に対する構文解析木の構造に変換され
る。この変換においては、上述の構文解析部１２が利用
したのと同様に、メモリ６に格納されている「木構造変
換規則」が用いられる。この変換は、図１でいえばレベ
ルＬ３からターゲット言語のレベルＬ９への変換に相当
する。得られた結果は図４に示されるバッファＤに格納
される。この説明において用いられている例文“Ｔｈｉ
ｓｉｓａｐｅｎ”は、この変換によって日本語文
字列「これペンである」に変換されたことになる。In the process corresponding to the syntax conversion module 13 shown in FIG. 2 of the translation program, the structure of the input structural analysis tree (see FIG. 7) of the English sentence is the syntax for the Japanese sentence shown in FIG. Converted to a parse tree structure. In this conversion, the “tree structure conversion rule” stored in the memory 6 is used in the same manner as that used by the syntax analysis unit 12 described above. This conversion corresponds to the conversion from the level L3 to the target language level L9 in FIG. The obtained result is stored in the buffer D shown in FIG. The example sentence "Thi used in this description
This means that "is is a pen" has been converted into the Japanese character string "This is a pen" by this conversion.

【００４５】翻訳プログラムのうち図２の翻訳文生成モ
ジュール１４に相当する処理を行なう部分は、得られた
日本語文字列「これペンである」に適切な助詞
「は」や助動詞をつけることにより、図９に示されるよ
うな文法的な日本語の形にし、図４に示されるバッファ
Ｅに格納する。この処理は、図１に示されるレベルＬ９
からレベルＬ１０への変換に相当する。得られた日本文
「これはペンである。」は、図４に示される翻訳モジュ
ール５から出力され、メインメモリ２に格納されるとと
もに、表示装置３に表示される。The portion of the translation program that performs the processing corresponding to the translated sentence generation module 14 of FIG. 2 is obtained by adding an appropriate particle "ha" or auxiliary verb to the obtained Japanese character string "this is a pen". , And stores it in the buffer E shown in FIG. 4 in the grammatical Japanese form as shown in FIG. This process corresponds to the level L9 shown in FIG.
Corresponds to the conversion from level L10. The obtained Japanese sentence “This is a pen.” Is output from the translation module 5 shown in FIG. 4, stored in the main memory 2, and displayed on the display device 3.

【００４６】本発明の特徴は、構文変換モジュール１３
によって生成された日本語の構造解析木から翻訳文を生
成する際に、英語では多用されるが日本語では訳出する
必要のない単語、たとえば従属節の主語であって主節の
主語と同じ単語のようなものを、訳出しないように翻訳
文を生成するところにある。以下翻訳文生成モジュール
１４で行なわれる処理の詳細について説明する。A feature of the present invention is that the syntax conversion module 13
When a translated sentence is generated from the Japanese structural analysis tree generated by, a word that is frequently used in English but does not need to be translated in Japanese, for example, a word that is the subject of a subordinate clause and is the same as the subject of the main clause. It is a place to generate a translated sentence so that it does not translate something like. The details of the processing performed by the translated sentence generation module 14 will be described below.

【００４７】図１０は、図２の翻訳文生成モジュール１
４で行なわれる処理のフローチャートである。特にステ
ップＳ００１は、図２の訳語変換ルール適用モジュール
２１の行なう処理に対応する。FIG. 10 shows the translated sentence generation module 1 of FIG.
4 is a flowchart of the process performed in 4. In particular, step S001 corresponds to the process performed by the translation conversion rule application module 21 of FIG.

【００４８】図１０を参照して、まずステップＳ００１
で、構文変換モジュール１３（図２参照）の処理の結果
得られた構文解析木の変換結果を格納するバッファＤ内
をスキャンし、省略可能な単語に対して省略フラグを立
てるという処理を行なう。この処理においては、入力テ
キストの各単語に対して、図２の訳語変換ルールメモリ
２０に格納されている訳語変換ルールのいずれかが適用
できるかどうかを調べることにより行なわれる。各訳語
変換ルールについて或る単語に対して行なわれる処理
は、それぞれ一定の条件がこの単語に対して当てはまる
かどうかを確認するためのものである。この実施例で
は、訳語変換ルールとして３つのルールを用いる場合に
ついて説明する。Referring to FIG. 10, first, step S001.
Then, the buffer D that stores the conversion result of the syntax analysis tree obtained as a result of the processing of the syntax conversion module 13 (see FIG. 2) is scanned, and a processing of setting an omission flag for the omissible word is performed. This process is performed by checking whether or not any of the translation word conversion rules stored in the translation word conversion rule memory 20 of FIG. 2 can be applied to each word of the input text. The process performed for a certain word in each translation conversion rule is to confirm whether or not a certain condition applies to this word. In this embodiment, a case where three rules are used as the translation conversion rules will be described.

【００４９】第１の訳語変換ルールは、「従属節の主語
が主節の主語と同じものには省略フラグを設定する。」
というものである。第２の訳語変換ルールは、「主語と
同じ代名詞には省略フラグを立てる」というものであ
る。第３の訳語変換ルールは、「ユーザが訳さないよう
に指定した代名詞に対してはフラグを設定する。」とい
うものである。第３のルールについて、ユーザが設定す
る単語は図３に示されるキーボード４を利用して訳語変
換ルールメモリ２０内に格納される。これら３つのルー
ルが適用可能かどうかについて調べる処理は、ステップ
Ｓ００１のルール適用処理の中で順次行なわれる。以下
各ルールについての処理をフローチャートを参照しなが
ら説明する。The first translation word conversion rule is that an abbreviation flag is set when the subject of the subordinate clause is the same as the subject of the main clause.
That is. The second translation conversion rule is to "set an abbreviation flag for the same pronoun as the subject". The third translation conversion rule is that "a flag is set for pronouns designated by the user not to be translated." Regarding the third rule, the word set by the user is stored in the translated word conversion rule memory 20 by using the keyboard 4 shown in FIG. The process of checking whether or not these three rules are applicable is sequentially performed in the rule application process of step S001. The processing for each rule will be described below with reference to the flowchart.

【００５０】図１１は、第１の訳語変換ルールについて
行なわれる処理のフローチャートである。図１１を参照
して、まずステップＳ１０１で、ポインタを構文解析木
変換結果バッファＤの先頭に位置づける。FIG. 11 is a flowchart of the processing performed for the first translation conversion rule. Referring to FIG. 11, first, in step S101, the pointer is positioned at the beginning of the parse tree conversion result buffer D.

【００５１】続いてステップＳ１０２で、ポインタが変
換結果の最後まで到達したかどうかについての判断を行
なう。到達していればこのルール１についての処理は終
了する。未だ最後まで到達していない場合、ポインタの
指す位置の単語が主語かどうかについて、ステップＳ１
０３での判断が行なわれる。主語でなければ処理はステ
ップＳ１０８に進み、ポインタに１が加算される。ステ
ップＳ１０８の後制御はステップＳ１０２に戻る。Then, in step S102, it is determined whether or not the pointer reaches the end of the conversion result. If it has arrived, the process for rule 1 ends. If the word at the position pointed to by the pointer is not the subject, the step S1
Judgment at 03 is made. If it is not the subject, the process proceeds to step S108, and 1 is added to the pointer. After step S108, the control returns to step S102.

【００５２】ポインタの指す位置の単語が主語であれば
制御はステップＳ１０４に進む。ステップＳ１０４で
は、現在ポインタが指す位置の単語が属する節が主節で
あるかどうかについての判断が行なわれる。主節であれ
ば処理はステップＳ１０５に、従属節であれば処理はス
テップＳ１０６に進む。すなわち、ポインタの指す位置
の単語が従属節に属するものであれば制御はステップＳ
１０６に進むことになる。If the word pointed by the pointer is the subject, control proceeds to step S104. In step S104, it is determined whether or not the section to which the word at the position currently pointed to belongs is the main section. If it is the main clause, the processing proceeds to step S105, and if it is the dependent clause, the processing proceeds to step S106. That is, if the word at the position pointed to by the pointer belongs to the subordinate clause, control is performed in step S.
It will proceed to 106.

【００５３】ステップＳ１０５では、ポインタの指す位
置の単語が主節の主語であると判断される。したがっ
て、ポインタの指す位置の単語を、主語を覚えておくた
めのバッファ（図２のバッファメモリ１９内に確保され
る）に格納する。ステップＳ１０５の後制御はステップ
Ｓ１０８に進む。In step S105, the word at the position pointed to by the pointer is determined to be the subject of the main clause. Therefore, the word at the position pointed to by the pointer is stored in a buffer (reserved in the buffer memory 19 in FIG. 2) for remembering the subject. After step S105, the control proceeds to step S108.

【００５４】ステップＳ１０６に制御が進んだ場合、ポ
インタの指す位置にある単語が、ステップＳ１０５でバ
ッファに格納された単語（主語）と同一であるかどうか
についての判断が行なわれる。同一であれば制御はステ
ップＳ１０７に、同一でなければ制御はステップＳ１０
８にそれぞれ進む。When the control proceeds to step S106, it is determined whether or not the word at the position pointed by the pointer is the same as the word (subject) stored in the buffer at step S105. If they are the same, the control proceeds to step S107, and if they are not the same, the control proceeds to step S10.
Go to 8 respectively.

【００５５】ステップＳ１０７では、主節の主語と従属
節の主語とが同一の単語であると判断されるために、ル
ール１を適用することが可能であると判断される。した
がって、第１の訳語変換ルールが適用可能であることを
示す省略フラグがセットされる。ステップＳ１０７の後
処理はステップＳ１０８に進む。In step S107, since the subject of the main clause and the subject of the subordinate clause are determined to be the same word, it is determined that rule 1 can be applied. Therefore, the omission flag indicating that the first translation conversion rule is applicable is set. The post-processing of step S107 proceeds to step S108.

【００５６】このようにして図１１に示される処理が終
了することにより、バッファ内の各単語について、第１
の訳語変換ルールを適用可能かどうかが判断され、もし
そのようなものが存在すればそれに対する省略フラグが
設定される。By thus completing the processing shown in FIG. 11, the first word of each word in the buffer is
It is determined whether or not the translation conversion rule of is applicable, and if such a rule is present, an abbreviation flag is set for it.

【００５７】この第１の訳語変換ルールを用いると、次
のような結果が得られる。図１９に示される英文を構文
解析し、日本語の木構造に変換した結果が図１５に示さ
れている。この変換結果は、図８に示されるようなバッ
ファＤに格納されている。上述の第１の訳語変換ルール
をこのバッファＤに対して適用すると、（＊１）の“ｙ
ｏｕ”に対して省略フラグが設定されることになる。な
お、上述の判断を行なうためには、主部がバッファＤ内
に格納されている木構造のうちのどの部分かを示す情報
が必要となるが、そうした情報もこのバッファＤには一
緒に格納されている。図１５の場合には左から３つめの
“Ｙｏｕ”が主節の主語であるという情報が保持されて
おり、右から４つめの“ｙｏｕ”がこれと一致するた
め、図１１のステップＳ１０７でこの“ｙｏｕ”に省略
フラグが設定されることになる。Using the first translation conversion rule, the following result is obtained. FIG. 15 shows a result of parsing the English sentence shown in FIG. 19 and converting it into a Japanese tree structure. The conversion result is stored in the buffer D as shown in FIG. Applying the above-mentioned first translation conversion rule to this buffer D, “y” in (* 1)
The omission flag is set for ou ". In order to make the above determination, information indicating which part of the tree structure the main part is stored in the buffer D is required. However, such information is also stored together in this buffer D. In the case of Fig. 15, the information that the third "You" is the subject of the main clause is held, and from the right. Since the fourth "you" matches this, the omission flag is set for this "you" in step S107 of FIG.

【００５８】図１２は、第２の訳語変換ルールを適用可
能かどうか調べるための処理のフローチャートである。
まずステップＳ２０１で、ポインタを変換結果バッファ
Ｄの先頭位置に位置づける。FIG. 12 is a flowchart of a process for checking whether or not the second translation conversion rule can be applied.
First, in step S201, the pointer is positioned at the head position of the conversion result buffer D.

【００５９】続いてステップＳ２０２で、ポインタが、
変換結果の最後の位置にきているかどうかについての判
断が行なわれる。変換結果の最後までポインタがきてい
る場合には制御はステップＳ２０３に、また最後まで到
達していない場合には制御はステップＳ２０９に、それ
ぞれ進む。Then, in step S202, the pointer is
A determination is made as to whether the last position of the conversion result has been reached. If the pointer reaches the end of the conversion result, the control proceeds to step S203, and if it does not reach the end, the control proceeds to step S209.

【００６０】ステップＳ２０９では、現在ポインタが指
す位置の単語が主語であるかどうかについての判断が行
なわれる。この判断は、図１５に示されているような木
変換結果のバッファ内の文法的な情報に基づいて行なわ
れる。もしも現在の単語が主語であれば制御はステップ
Ｓ２１０に、主語でなければ制御はステップＳ２１１に
それぞれ進む。In step S209, it is determined whether or not the word at the position currently pointed to by the pointer is the subject. This judgment is made based on the grammatical information in the tree conversion result buffer as shown in FIG. If the current word is the subject, control proceeds to step S210, and if it is not the subject, control proceeds to step S211.

【００６１】ステップＳ２１１では、ポインタに１が加
算される。ステップＳ２１１の後制御はステップＳ２０
２に戻る。In step S211, 1 is added to the pointer. The control after step S211 is step S20.
Return to 2.

【００６２】ステップＳ２１０では、現在ポインタによ
って指し示される位置の単語を、主語を格納するために
予め準備されていたバッファに格納される。このバッフ
ァは図２のバッファメモリ１９内に確保される。ステッ
プＳ２１０の後制御はステップＳ２１１に進む。In step S210, the word at the position currently pointed to by the pointer is stored in a buffer prepared in advance for storing the subject. This buffer is secured in the buffer memory 19 of FIG. After step S210, control proceeds to step S211.

【００６３】ステップＳ２０２、ステップＳ２０９ない
しＳ２１１の処理が繰返し行なわれることにより、バッ
ファに主語となる単語が格納され、バッファの最後まで
ポインタが到達した時点で制御はステップＳ２０３に進
む。By repeating the processing of steps S202 and S209 to S211, the word serving as the subject is stored in the buffer, and when the pointer reaches the end of the buffer, control proceeds to step S203.

【００６４】ステップＳ２０３では、ポインタが再び木
変換結果バッファＤの先頭位置に戻される。In step S203, the pointer is returned to the head position of the tree conversion result buffer D again.

【００６５】続いてステップＳ２０４で、ポインタが変
換結果の最後の位置を指し示しているかどうかについて
の判断が行なわれる。最後まで到達している場合にはこ
の処理は終了する。最後まで到達していない場合には制
御はステップＳ２０５に進む。Then, in step S204, it is determined whether or not the pointer points to the last position of the conversion result. If the end has been reached, this process ends. If the end has not been reached, control proceeds to step S205.

【００６６】ステップＳ２０５では、現在ポインタが指
し示す位置の単語が代名詞かどうかについての判断が行
なわれる。代名詞でない場合には制御はステップＳ２０
８に、代名詞である場合には制御はステップＳ２０６に
それぞれ進む。In step S205, it is determined whether the word at the position currently pointed to by the pointer is a pronoun. If it is not a pronoun, the control is step S20.
If it is a pronoun, the control proceeds to step S206.

【００６７】ステップＳ２０８ではポインタに１が加算
される。ステップＳ２０８の後制御はステップＳ２０４
に戻る。In step S208, 1 is added to the pointer. After step S208, the control is step S204.
Return to.

【００６８】ステップＳ２０６では、現在ポインタが指
し示す位置の単語が、バッファに格納されている単語と
同一かどうか、すなわち主語と同一かどうかについての
判断が行なわれる。同一である場合には制御はステップ
Ｓ２０７に、そうでない場合には制御はステップＳ２０
８に進む。In step S206, it is determined whether the word at the position currently pointed to by the pointer is the same as the word stored in the buffer, that is, the same as the subject. If they are the same, the control proceeds to step S207, and if not, the control proceeds to step S20.
Go to 8.

【００６９】ステップＳ２０７では、現在ポインタが指
し示している位置にある単語に対し、第２の訳語変換ル
ールを適用することが可能であるため、省略フラグを設
定する。ステップＳ２０７の後制御はステップＳ２０８
に進む。In step S207, since the second translation conversion rule can be applied to the word currently at the position pointed by the pointer, the omission flag is set. After step S207, the control is step S208.
Proceed to.

【００７０】ステップＳ２０４〜Ｓ２０８の処理が繰返
し行なわれることにより、文の主語と同一である代名詞
については、第２の訳語変換ルールを適用した結果に基
づく省略フラグが設定されることになる。By repeating the processing of steps S204 to S208, the omission flag based on the result of applying the second translation conversion rule is set for the pronoun that is the same as the subject of the sentence.

【００７１】図１５に示される例では、（＊２）が付さ
れた単語“ｙｏｕｒ”にフラグが設定される。なお、こ
の場合には所有格の“ｙｏｕｒ”を、“ｙｏｕ”と同一
のものとして取り扱っている。In the example shown in FIG. 15, a flag is set for the word "your" with (* 2). In this case, the possessive "your" is treated as the same as "you".

【００７２】この第２の訳語変換ルールを適用した場
合、図１９に示される例文では次のような結果が得られ
る。図１５を参照して、左から４つめの単語“ｙｏｕ
ｒ”に省略フラグが設定される結果、訳出される日本語
の文ではこの単語に該当する部分が削除される。すなわ
ち、図１９に示される例文をこの第２の訳語変換ルール
に従って日本語に訳すと、図１７に示されるものが得ら
れる。図１７においては、第１の訳語変換ルールも適用
されている。この図１７に示される結果は、図１６に示
されるものよりもさらに自然な日本語に近い。When the second translation conversion rule is applied, the following result is obtained in the example sentence shown in FIG. Referring to FIG. 15, the fourth word “you” from the left.
As a result of setting the abbreviation flag in r ”, the part corresponding to this word is deleted in the translated Japanese sentence. That is, the example sentence shown in FIG. 19 is converted into Japanese according to the second translation conversion rule. The translation results in what is shown in Fig. 17. In Fig. 17, the first translation conversion rule is also applied, and the result shown in Fig. 17 is more natural than that shown in Fig. 16. It is close to Japanese.

【００７３】第３の訳語変換ルールを適用可能な単語が
存在するかどうかを調べるための処理のフローチャート
が図１３に示される。図１３を参照して、まずステップ
Ｓ３０１で、ポインタを木変換結果バッファＤの先頭位
置に設定する処理が行なわれる。FIG. 13 shows a flowchart of the processing for checking whether or not there is a word to which the third translation conversion rule can be applied. Referring to FIG. 13, first, in step S301, a process of setting the pointer to the head position of tree conversion result buffer D is performed.

【００７４】続いてステップＳ３０２で、ポインタの指
す位置が木変換結果バッファＤの格納内容の終了位置に
到達したかどうかについての判断が行なわれる。到達し
ている場合にはこの処理は終了する。到達していない場
合には制御はステップＳ３０３に進む。Subsequently, in step S302, it is determined whether or not the position pointed by the pointer has reached the end position of the contents stored in the tree conversion result buffer D. If it has, this process ends. If not, the control proceeds to step S303.

【００７５】ステップＳ３０３では、ポインタが指し示
す位置の単語が、予めユーザが指定した単語かどうかに
ついての判断が行なわれる。一致しない場合には制御は
ステップＳ３０５に、一致する場合には制御はステップ
Ｓ３０４に、それぞれ進む。In step S303, it is determined whether the word at the position pointed to by the pointer is the word designated by the user in advance. If they do not match, the control proceeds to step S305, and if they match, the control proceeds to step S304.

【００７６】ステップＳ３０５では、ポインタに１加算
する処理が行なわれ、制御はステップＳ３０２に戻る。In step S305, the pointer is incremented by 1, and control is returned to step S302.

【００７７】ステップＳ３０４では、木変換結果の中
に、第３の訳語変換ルールが適用可能な単語を発見した
ということであるから、この単語に対して省略可能であ
ることを示す省略フラグを設定する処理が行なわれる。
ステップＳ３０４の後制御はステップＳ３０５に進む。In step S304, since the word to which the third translation conversion rule is applicable is found in the tree conversion result, an omission flag indicating that this word can be omitted is set. Processing is performed.
After step S304, the control proceeds to step S305.

【００７８】第３の訳語変換ルールを適用するために
は、予めユーザが特定の代名詞を指定することが必要で
ある。これについては、たとえば翻訳処理を始める前の
初期設定の段階で、翻訳システムからユーザに対して問
い合わせを行ない、「訳さない代名詞」をユーザが設定
できるようにしておくことが考えられる。In order to apply the third translation conversion rule, the user needs to specify a specific pronoun in advance. Regarding this, for example, it is conceivable that the translation system may make an inquiry to the user at the stage of initial setting before starting the translation processing so that the user can set a "pronoun not translated".

【００７９】たとえばユーザが“ｙｏｕ”を訳さない代
名詞として指定したものとする。“ｙｏｕ”の変化形も
すべてこの“ｙｏｕ”の指定により「訳されない代名
詞」と指定されたものとして取り扱う。すると、図１５
に示される木変換結果バッファの内容の場合には、（＊
１）（＊２）（＊３）の３つの単語に対して省略フラグ
が設定されることになる。すなわち、この第３の訳語変
換ルールを適用した結果、図１８に示されるように「あ
なた」という語が全く含まれない翻訳文が得られること
になる。この翻訳文は、図１６、図１７に示されるもの
よりもさらに自然な日本語に近いものとなっている。For example, it is assumed that the user has designated "you" as a pronoun that does not translate. All variants of "you" are treated as being designated as "untranslated pronouns" by the designation of "you". Then, as shown in FIG.
In the case of the contents of the tree conversion result buffer shown in, (*
1) The omission flag is set for the three words (* 2) (* 3). That is, as a result of applying the third translation word conversion rule, a translation sentence that does not include the word "you" is obtained as shown in FIG. This translated sentence is closer to natural Japanese than that shown in FIGS. 16 and 17.

【００８０】再び図１０に戻り、ステップＳ００１のル
ール適用処理は、上述したような各ルールについての処
理をすべて行なうことにより終了する。Returning to FIG. 10 again, the rule application process of step S001 ends by performing all the processes for each rule as described above.

【００８１】ステップＳ００１の後、ステップＳ００２
で、木変換結果バッファＤの格納内容のうち、未だ日本
語を生成していない単語が存在するかどうかについての
判断が行なわれる。既に最後まで到達し、これ以上生成
する対象の単語が存在しない場合にはこの処理は終了す
る。日本語が生成されていない単語が存在する場合に
は、その単語の品詞に応じてステップＳ００３、Ｓ００
４、Ｓ００５などに処理が進み、代名詞、名詞、動詞な
どに応じた日本語の生成が行なわれる。この後たとえば
ステップＳ００３の後処理はステップＳ００６に進む。After step S001, step S002
Then, a determination is made as to whether or not there is a word for which Japanese has not yet been generated among the stored contents of the tree conversion result buffer D. If the word has reached the end and there are no more target words to be generated, this processing ends. If there is a word for which Japanese has not been generated, steps S003 and S00 are performed according to the part of speech of the word.
4, the processing proceeds to S005 and the like, and Japanese is generated according to the pronoun, the noun, the verb, and the like. After this, for example, the post-processing of step S003 proceeds to step S006.

【００８２】ステップＳ００６では、ステップＳ００１
で行なわれたルール適用処理の結果、処理対象の単語に
対する省略フラグが設定されているかどうかについての
判断を行なう。すなわち、処理対象の単語を翻訳文内に
訳出するかどうかについての判断が行なわれる。省略フ
ラグが設定されていなければ制御はステップＳ００７に
進む。省略フラグが設定されている場合には制御はステ
ップＳ００２に戻る。In step S006, step S001
As a result of the rule application processing performed in step 1, it is determined whether or not the omission flag is set for the word to be processed. That is, it is determined whether or not the word to be processed is translated into the translated sentence. If the omission flag has not been set, control proceeds to step S007. If the omission flag is set, control returns to step S002.

【００８３】ステップＳ００７では、処理対象の単語に
対して、辞書バッファに格納されている訳語を割当てて
出力文バッファに格納する処理が行なわれる。ステップ
Ｓ００７の後制御はステップＳ００２に戻る。In step S007, a translated word stored in the dictionary buffer is assigned to the word to be processed and stored in the output sentence buffer. After step S007, the control returns to step S002.

【００８４】或る単語に対して省略フラグが設定されて
いる場合には、このステップＳ００７の処理が行なわれ
ない。そのため、翻訳文にはこの単語が訳出されること
はない。When the omission flag is set for a certain word, the process of step S007 is not performed. Therefore, this word will not be translated into the translated text.

【００８５】以上、この発明につき英日翻訳の機械翻訳
装置を例として説明したが、この発明は英日機械翻訳装
置のみに限定されているものではない。本発明は言語の
構造が異なるために、単語と単語とを機械的に対応した
場合に不自然な訳文が得られる可能性のある言語間の翻
訳全般に対して適用することが可能である。Although the present invention has been described by taking the machine translation device for English-Japanese translation as an example, the present invention is not limited to the machine translation device for English-Japanese translation. INDUSTRIAL APPLICABILITY The present invention can be applied to general translation between languages in which an unnatural translated sentence may be obtained when words are mechanically corresponded to each other due to different language structures.

【００８６】[0086]

【発明の効果】以上のように請求項１に記載の機械翻訳
装置では、ターゲット言語の構造解析木中の、予め定め
る条件を満たす単語が検出されると、この検出された単
語を除外してターゲット言語のテキストが生成される。
ソース言語のテキスト中に含まれる単語であっても、タ
ーゲット言語の訳文には訳出されることがなくなる場合
があり、従来よりも柔軟な翻訳を行なうことが可能にな
る。As described above, in the machine translation device according to the first aspect, when a word satisfying a predetermined condition is detected in the structure analysis tree of the target language, the detected word is excluded. The target language text is generated.
Even words included in the text of the source language may not be translated into the translated text of the target language, and thus it is possible to perform more flexible translation than before.

【００８７】請求項２に記載の機械翻訳装置では、変換
後の構造解析木中の単語のうち、第２の格納手段に格納
されている除外規則のいずれかが適用可能な単語に関連
して所定の標識が準備され、その単語を除外してターゲ
ット言語のテキストが生成される。訳文から除外する単
語を除外規則の形で予め複数個定義しておくことがで
き、従来よりも柔軟な翻訳を容易に行なうことができ
る。According to another aspect of the machine translation apparatus of the present invention, among the words in the converted structure analysis tree, any of the exclusion rules stored in the second storage means is associated with the applicable word. A predetermined indicator is prepared and the word is excluded to produce the target language text. A plurality of words to be excluded from the translated text can be defined in advance in the form of an exclusion rule, and a more flexible translation than before can be easily performed.

【００８８】請求項３に記載の機械翻訳装置では、変換
後の構造解析木中の各単語のうち、意味的に重複する複
数個の単語について、そのうちの少なくとも１つが除外
対象として指定される。そして、この除外対象に指定さ
れた単語を除外してターゲット言語のテキストが生成さ
れるので、冗長な表現の少ない、より自然な翻訳が容易
に行なえる。In the machine translation device according to the third aspect, among the words in the structural analysis tree after conversion, at least one of a plurality of words that are semantically overlapping is designated as an exclusion target. Then, since the target language text is generated by excluding the words designated as the exclusion target, a more natural translation with less redundant expressions can be easily performed.

【００８９】請求項４に記載の機械翻訳装置では、入力
される英語のテキストから英語の構造解析木が生成さ
れ、さらに日本語の構造解析木に変換される。変換後の
構造解析木中の各単語のうち除外規則が適用可能なもの
は、それを除外して日本語のテキストが生成される。英
語では表現されても日本語では表現不要な単語などが日
本語のテキスト中に出現することを防止でき、英語か
ら、自然な表現の日本語への翻訳を容易に行なうことが
できる。In the machine translation device according to the fourth aspect, an English structural analysis tree is generated from the input English text and further converted into a Japanese structural analysis tree. If each of the words in the converted structure analysis tree to which the exclusion rule is applicable is excluded, the Japanese text is generated. It is possible to prevent words that are expressed in English but that are unnecessary in Japanese from appearing in Japanese text, and it is possible to easily translate English into a natural expression in Japanese.

【００９０】請求項５に記載の機械翻訳装置では、入力
される英語のテキストから英語の構造解析木が生成さ
れ、さらに日本語の構造解析木に変換される。変換後の
構造解析木中の各単語のうち除外規則が適用可能なも
の、たとえば特定の代名詞は、それを除外して日本語の
テキストが生成される。英語では表現されても日本語で
は表現不要な代名詞などの単語が日本語のテキスト中に
出現することを防止でき、英語から、自然な表現の日本
語への翻訳を容易に行なうことができる。In the machine translation device according to the fifth aspect, an English structural analysis tree is generated from the input English text and further converted into a Japanese structural analysis tree. Among the words in the converted structure analysis tree, those for which the exclusion rule is applicable, for example, specific pronouns, are excluded to generate Japanese text. It is possible to prevent words such as pronouns that are expressed in English but are unnecessary in Japanese from appearing in Japanese text, and it is possible to easily translate English into a natural expression in Japanese.

【００９１】請求項６に記載の機械翻訳装置では、入力
される英語のテキストから英語の構造解析木が生成さ
れ、さらに日本語の構造解析木に変換される。変換後の
構造解析木中の各単語のうち除外規則が適用可能なも
の、たとえば予め使用者が入力した特定の代名詞は、そ
れを除外して日本語のテキストが生成される。英語では
表現されても日本語では表現不要だと使用者が判断した
代名詞などの単語が日本語のテキスト中に出現すること
を防止でき、英語から、自然な表現の日本語への翻訳を
容易に行なうことができる。In the machine translation device according to the sixth aspect, an English structural analysis tree is generated from the input English text and further converted into a Japanese structural analysis tree. Among the words in the converted structure analysis tree, those for which the exclusion rule is applicable, for example, specific pronouns input by the user in advance, are excluded to generate Japanese text. It is possible to prevent words, such as pronouns, that the user has judged to be unnecessary in Japanese from being expressed in English, from appearing in Japanese text, making it easy to translate English into a natural expression in Japanese. Can be done

[Brief description of drawings]

【図１】機械翻訳の概念を模式的に示す図である。FIG. 1 is a diagram schematically showing the concept of machine translation.

【図２】本発明の言語処理装置の一例である機械翻訳装
置のブロック図である。FIG. 2 is a block diagram of a machine translation device which is an example of a language processing device of the present invention.

【図３】図２に示される翻訳モジュール５の詳細な機能
ブロック図である。FIG. 3 is a detailed functional block diagram of a translation module 5 shown in FIG.

【図４】翻訳モジュールのハードウェア的なブロック図
である。FIG. 4 is a hardware block diagram of a translation module.

【図５】バッファＡの格納内容を模式的に示す図であ
る。5 is a diagram schematically showing the stored contents of a buffer A. FIG.

【図６】バッファＢの格納内容を模式的に示す図であ
る。FIG. 6 is a diagram schematically showing stored contents of a buffer B.

【図７】バッファＣの格納内容を模式的に示す図であ
る。FIG. 7 is a diagram schematically showing stored contents of a buffer C.

【図８】バッファＤの格納内容を模式的に示す図であ
る。FIG. 8 is a diagram schematically showing stored contents of a buffer D.

【図９】バッファＥの格納内容を模式的に示す図であ
る。FIG. 9 is a diagram schematically showing stored contents of a buffer E.

【図１０】翻訳文生成処理の概略フローチャートであ
る。FIG. 10 is a schematic flowchart of a translated sentence generation process.

【図１１】第１の訳語変換ルールの適用処理の概略フロ
ーチャートである。FIG. 11 is a schematic flowchart of an application process of a first translation conversion rule.

【図１２】第２の訳語変換ルールの適用処理の概略フロ
ーチャートである。FIG. 12 is a schematic flowchart of application processing of a second translation conversion rule.

【図１３】第３の訳語変換ルールの適用処理の概略フロ
ーチャートである。FIG. 13 is a schematic flowchart of application processing of a third translation conversion rule.

【図１４】訳語変換ルールを模式的に示す図である。FIG. 14 is a diagram schematically showing translated word conversion rules.

【図１５】訳語変換ルールを適用した後の木変換結果の
バッファＤの格納内容を示す模式図である。FIG. 15 is a schematic diagram showing stored contents of a tree conversion result buffer D after applying a translation conversion rule.

【図１６】第１の訳語変換ルールを適用した結果の翻訳
文を模式的に示す図である。FIG. 16 is a diagram schematically showing a translated sentence as a result of applying the first translated word conversion rule.

【図１７】第１および第２の訳語変換ルールを適用した
結果の翻訳文を模式的に示す図である。FIG. 17 is a diagram schematically showing a translated sentence as a result of applying first and second translated word conversion rules.

【図１８】第３の訳語変換ルールを適用した結果の翻訳
文を示す模式図である。FIG. 18 is a schematic diagram showing a translated sentence as a result of applying a third translated word conversion rule.

【図１９】従来の機械翻訳装置による翻訳結果を示す模
式図である。FIG. 19 is a schematic diagram showing a translation result by a conventional machine translation device.

[Explanation of symbols]

１メインＣＰＵ２メインメモリ３ＣＲＴ４キーボード５翻訳モジュール６メモリ１１辞書引き・形態素解析モジュール１２構文解析モジュール１３構文変換モジュール１４翻訳文生成モジュール１５翻訳ＣＰＵ１６翻訳プログラムメモリ DESCRIPTION OF SYMBOLS 1 main CPU 2 main memory 3 CRT 4 keyboard 5 translation module 6 memory 11 dictionary lookup / morphological analysis module 12 syntactic analysis module 13 syntactic conversion module 14 translation sentence generation module 15 translation CPU 16 translation program memory

Claims

[Claims]

1. A means for analyzing an input source language text to generate a structure analysis tree of the source language, and a means for converting the structure analysis tree into a structure analysis tree of a predetermined target language. A first storage unit for storing a conversion rule; a unit for converting the structure analysis tree of the source language into a structure analysis tree of the target language by referring to the conversion rule; and a structure of the target language In the parse tree, detection means for detecting a word satisfying a predetermined condition, based on the structure analysis tree of the target language, for generating the text of the target language by excluding the detected word A machine translation device including a generation unit.

2. The detecting means includes a second storing means for storing an exclusion rule defining the predetermined condition, and each word in the structure analysis tree of the target language,
Determination means for determining whether any of the exclusion rules stored in the second storage means is applicable; and a word for which any of the exclusion rules is determined to be applicable by the determination means. Associated with a means for preparing a predetermined marker, the generating means generating the text in the target language by excluding words for which the marker has been prepared. apparatus.

3. The exclusion rule includes an exclusion rule that specifies, as an exclusion target, at least one of a plurality of semantically overlapping words included in the structure analysis tree of the target language. The machine translation device described.

4. The machine translation device according to claim 2, wherein the source language is English and the target language is Japanese.

5. The machine translation device according to claim 4, wherein the exclusion rule includes an exclusion rule for designating a specific pronoun included in the structure analysis tree of the target language as an exclusion target.

6. The machine translation device according to claim 5, further comprising means for inputting the specific pronoun.