JPH09185626A

JPH09185626A - Machine translation device

Info

Publication number: JPH09185626A
Application number: JP7353550A
Authority: JP
Inventors: Takeshi Kutsumi; 毅九津見
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1995-12-29
Filing date: 1995-12-29
Publication date: 1997-07-15

Abstract

PROBLEM TO BE SOLVED: To perform phrase translation to let the effect of auxiliary verb appear for each clause consisting of parallel clauses including the auxiliary verb. SOLUTION: A target language generating part 2d of a translation module 2A is provided with a parallel clause detecting means, auxiliary verb detecting means and auxiliary verb translation generating means. A main memory 3 is provided with an auxiliary verb buffer 3k for storing the auxiliary verb code of auxiliary verb detected for each clause by the auxiliary verb detecting means. Then, concerning a source sentence, in which the parallel clauses including the auxiliary verb exist, inputted to an input part 1, the target language generating part 2d generates a target language for which the ending of each translated clause consisting of the parallel clauses is deformed into ending corresponding to the auxiliary verb code. As a result, the effect of this auxiliary verb is let appear in both the parallel clauses and unnaturality is canceled.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】この発明は、機械翻訳装置に
関し、詳しくは、ソース言語の文を句や節に分割し、こ
の句や節毎に翻訳して得られたターゲット言語の句や節
を出力することが可能な機械翻訳装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a machine translation device, and more specifically, it divides a source language sentence into phrases and clauses, and translates each phrase or clause into target language phrases or clauses. The present invention relates to a machine translation device capable of outputting.

【０００２】[0002]

【従来の技術】現存する多くの機械翻訳装置において
は、１文単位で翻訳処理を実行するようにしている。し
たがって、１文の長さが長くなる程構文を誤って解析す
る可能性が高まり、長い文を１文単位で翻訳すると意味
を無さない翻訳文を出力する可能性が高くなる。また、
構文を正しく解析できても、論理的には正しいが非常に
読みにくい翻訳文が出力される場合も多い。そこで、現
在の機械翻訳装置は、その出力結果を利用者が後編集を
して利用に供するという使用方法が主流となっている。2. Description of the Related Art In many existing machine translation apparatuses, translation processing is executed in units of one sentence. Therefore, the longer the length of one sentence is, the more likely it is that the syntax will be erroneously parsed, and if a long sentence is translated on a sentence-by-sentence basis, the possibility that a translated sentence that does not make sense will be increased. Also,
Even if the syntax can be parsed correctly, a translated sentence that is logically correct but extremely difficult to read is often output. Therefore, in the current machine translation apparatus, a main usage method is that a user post-edits the output result and provides it for use.

【０００３】また、構文解析が誤っている可能性が大き
い状態で１文単位での翻訳結果を得るよりも、解析誤り
が少ない状態で１文よりも小さな単位での翻訳結果が得
られる方が良いとして、翻訳結果を１文単位で出力せず
に、単語,句あるいは節等の１文よりも小さな単位で翻
訳結果を出力する機械翻訳装置が提案されている。この
ような機械翻訳装置として、例えば次のようなものがあ
る。Further, it is better to obtain a translation result in a unit smaller than one sentence in a state where there are few parsing errors, than to obtain a translation result in a sentence unit in a state where there is a high possibility that the syntactic analysis is wrong. As a good point, a machine translation device has been proposed which outputs a translation result in units smaller than one sentence such as a word, a phrase or a clause without outputting the translation result in a sentence unit. Examples of such a machine translation device include the following.

【０００４】すなわち、通常の１文単位での翻訳装置と
同様の辞書引き,形態素解析,訳語選択を行って、原文の
構文構造と原文の各単語に対する訳語とを得る。そうし
た後に、翻訳文生成の際には、構文構造に基づいて原文
を句や節に分割し、分割された原文の句や節を構成する
原語での単語(以下、原語単語と言う)に対応する訳語を
原文の句や節毎にまとめ、夫々の句や節の中で訳語を適
切な順序(１文単位で翻訳する場合の訳語の順序から当
該句や節に属する訳語だけが抜き出されて得られる順
序)に並べ替えて、訳文フレーズを組み立てるものであ
る。That is, dictionary construction, morphological analysis, and translation word selection are performed in the same manner as in a normal sentence-by-sentence translation apparatus to obtain the syntactic structure of the original sentence and the translated word for each word of the original sentence. After that, when generating a translated sentence, the original sentence is divided into phrases and clauses based on the syntactic structure, and the words in the original language (hereinafter referred to as the original word) that compose the divided original sentence clauses and clauses are supported. The translated words are grouped into each phrase or section of the original sentence, and the translated words are arranged in an appropriate order in each phrase or section (only the translated words belonging to the phrase or section are extracted from the order of the translated words when translated in one sentence unit). The translated phrase is assembled into the translated phrase.

【０００５】ここで、上記訳文フレーズの「フレーズ」と
は、英文法で言うところの「phrase」を意味するものでは
なく、上記「句」及び「節」を統括する際に用いる文言であ
る。尚、以下において使用される文言「フレーズ」につい
ても同様である。Here, the "phrase" of the translated phrase does not mean "phrase" in the English grammar, but is a word used to control the "phrase" and "section". The same applies to the word "phrase" used below.

【０００６】[0006]

【発明が解決しようとする課題】しかしながら、上記従
来の１文よりも小さな単位で翻訳結果を出力する機械翻
訳装置には、例１,例２および例３に示すような問題が
ある。［例１］原文「It would push them to the end of the credito
rs' lineand make it unlikely.」原文のフレーズへの分割結果（１）It would push them to the end of the credito
rs' line （２）and （３）make it unlikely. フレーズ翻訳の結果（ａ）それがそれらを債権者のラインのエンドに入れ（ｂ）そして（ｃ）それをありそうもない状態にするであろうHowever, the conventional machine translation device that outputs a translation result in a unit smaller than one sentence has the problems as shown in Examples 1, 2 and 3. [Example 1] Original text "It would push them to the end of the credito
rs' lineand make it unlikely. "(1) It would push them to the end of the credito
rs' line (2) and (3) make it unlikely. The result of phrase translation (a) It puts them at the end of the creditor's line (b) and (c) so that it's unlikely Let's go

【０００７】この例では、原文と翻訳結果とをフレーズ
単位で比較すると、(１)の原文フレーズにおいては原文
に助動詞「would」が存在するが、このことが(ａ)の訳文
フレーズに反映されてはいない。助動詞「would」の効果
は、(ｃ)の訳語フレーズのみに「〜であろう」という形で
現れている。この原因は、原文に対して１文単位での通
常の翻訳処理を行った結果は、「それがそれらを債権者
のラインのエンドに入れ、そして、それをありそうもな
い状態にするであろう。」であり、助動詞の効果が文末
にのみ現れていることと、原文のフレーズ分割位置を忠
実に翻訳文に反映させたためである。In this example, when the original sentence and the translation result are compared in phrase units, the auxiliary verb "would" exists in the original sentence in the original sentence phrase (1), but this is reflected in the translated sentence phrase in (a). Not. The effect of the auxiliary verb “would” appears only in the translated phrase of (c) in the form “to be probably”. The reason for this is that the result of normal translation processing of the original sentence, one sentence at a time, is that "it puts them at the end of the creditor's line and makes it unlikely. This is because the effect of the auxiliary verb appears only at the end of the sentence and that the phrase division position of the original sentence is faithfully reflected in the translated sentence.

【０００８】ところが、(１)の原文フレーズと(ａ)の訳
語フレーズとを比較してみると、原文フレーズには助動
詞「would」が存在するのに訳語フレーズにはその効果が
現れていないのは不自然である。意味的にも、原文フレ
ーズの助動詞「would」は、主語を共有して並列になって
いる原文の２つの節の両方に対して効いているのである
から、通常の１文単位での翻訳の場合には実際に訳の形
を変えるのは文末のみで良いとしても、夫々の節が分離
しているフレーズ翻訳の場合には夫々の節に対して明示
的に訳の形を変えなければ正確な翻訳とは言えないので
ある。However, comparing the original phrase of (1) and the translated phrase of (a), the auxiliary verb “would” exists in the original phrase, but the effect does not appear in the translated phrase. Is unnatural. In terms of meaning, the auxiliary verb "would" of the original sentence phrase is effective for both two clauses of the original sentence that share the subject and are in parallel. In some cases, it is only necessary to change the translation form at the end of the sentence, but in the case of phrase translation where each clause is separated, it is necessary to explicitly change the translation form for each clause. It cannot be said to be a translation.

【０００９】［例２］通常、機械翻訳装置によってソー
ス言語の原文からターゲット言語の翻訳文を生成する際
には、ソース言語の単語の訳語に加えて、ソース言語に
よる何れの単語にも対応しない訳語が幾つか生成されて
用いられるのであるが、このようなソース言語の単語に
対応しない訳語の生成位置は非常に単純な規則によって
決定される。例えば、ターゲット言語が日本語である場
合には、１文単位で翻訳した場合の訳語生成順序を基準
として、ターゲット言語の単語に対応しない訳語はその
１つ前の訳語と同一のフレーズに生成されるとしてい
る。[Example 2] Normally, when a translated sentence of a target language is generated from an original sentence of a source language by a machine translation device, in addition to a translated word of a source language word, it does not correspond to any word in the source language. Although some translated words are generated and used, the generation position of the translated word that does not correspond to the word in the source language is determined by a very simple rule. For example, when the target language is Japanese, a translated word that does not correspond to a word in the target language is generated in the same phrase as the previous translated word, based on the translated word generation order when translated in sentence units. I'm supposed to.

【００１０】このような規則に従って、英語から日本語
への翻訳処理を行った場合には、次のような結果とな
る。原文「keating and other Lincoln exectives are the
subject ofa class-action lawsuit alleging they mis
led investors aboutthe safety of junk bonds issued
by American Continental.」原文のフレーズへの分割結果（１）keating and other Lincoln exectives are the
subject ofa class-action lawsuit （２）alleging they misled investors about the saf
ety of junk bondsissued by American Continental フレーズ翻訳の結果（ａ）キーティング及び他のリンカーン幹部は主張する
集団訴訟の主体である（ｂ）それらがアメリカン・コンチネンタルによって出
されたジャック・ボンドの安全性について投資家を誤ら
せたとWhen the translation process from English to Japanese is performed according to such rules, the following results are obtained. Original text `` keating and other Lincoln exectives are the
subject ofa class-action lawsuit alleging they mis
led investors about the safety of junk bonds issued
by American Continental. ”The result of dividing the original sentence into phrases (1) keating and other Lincoln exectives are the
subject ofa class-action lawsuit (2) alleging they misled investors about the saf
ety of junk bondsissued by American Continental Results of phrase translation (a) Keating and other Lincoln executives are the subject of the class action claims (b) They invested in the security of Jack Bonds issued by American Continental I misled my house

【００１１】この場合は、(ａ)の訳文フレーズ中の「主
張する」が目的語を取り、その目的語が(ｂ)の訳文フレ
ーズ全体であると言うことが分かりにくい。このことを
改善するためには、(ｂ)の訳文フレーズの末尾にあって
(ｂ)の訳文フレーズ全体を受けている格助詞「と」を、
(ａ)の訳文フレーズの「主張する」の直前に生成すれば分
かりやすくなる。ところが、上述したように、この格助
詞「と」は原文フレーズの単語とは対応しない単語である
から、１文単位で翻訳した場合に「と」の直前に生成され
る訳語「誤らせた」と同一フレーズに生成するようになっ
ているために、(ｂ)の訳文フレーズに生成されてしまう
のである。In this case, it is difficult to understand that "insist" in the translated phrase of (a) takes an object and that the object is the entire translated phrase of (b). To improve this, put it at the end of the translated phrase in (b)
The case particle "to" which received the whole translated phrase of (b),
It will be easier to understand if it is generated immediately before "arguing" the translated phrase of (a). However, as described above, since the case particle "to" does not correspond to the word of the original sentence phrase, the translated word "misleading" generated immediately before "to" when translated in sentence units. Since it is designed to be generated in the same phrase as, the translated phrase in (b) is generated.

【００１２】［例３］原文「The plan was opposed by Keating and some of
his former topexecutives, who invested in the comp
any's securities.」原文のフレーズへの分割結果（１）The plan was opposed by Keating and some of
his former topexecutives （２）who invested in the company's securities フレーズ翻訳の結果（ａ）計画はキーティング及びいくらかの彼の前のトッ
プ幹部によって反対された（ｂ）会社の有価証券に投資した[Example 3] The original sentence "The plan was opposed by Keating and some of
his former topexecutives, who invested in the comp
any's securities. "Result of splitting into original phrases (1) The plan was opposed by Keating and some of
his former topexecutives (2) the result of who invested in the company's securities (a) the plan was opposed by Keating and some of his former top executives (b) invested in the company's securities

【００１３】この例では、(２)の原文フレーズは関係節
であるが、(ａ)の訳文フレーズだけを見た場合には格
(この場合には動作主格)が欠けた節となっているので、
これだけではフレーズとして意味がとりにくい。In this example, the original phrase in (2) is a relative clause, but if only the translated phrase in (a) is viewed,
(In this case, the nominative case) is a missing section, so
It is difficult to take meaning as a phrase by this alone.

【００１４】そこで、この発明の目的は、原文と翻訳結
果とをフレーズ単位で比較したり訳文フレーズのみを読
み進めて行ったりする場合に妨げとなる上述の諸問題を
無くし、利用者にとって訳文の内容把握をより容易なら
しめる機械翻訳装置を提供することにある。Therefore, an object of the present invention is to eliminate the above-mentioned problems that hinder the comparison of the original sentence and the translation result in phrase units or to read and proceed only the translated sentence phrase. It is to provide a machine translation device that makes it easier to understand the contents.

【００１５】[0015]

【課題を解決するための手段】上記目的を達成するた
め、請求項１に係る発明は、入力部から入力されたソー
ス言語による文の構文解析を行う構文解析部と,得られ
た構文構造をターゲット言語による構文構造に変換する
構文変換部と,この変換結果に基づいて入力文のターゲ
ット言語による訳語列を生成するターゲット言語生成部
と,分割規則に従って入力文を句または節に分割する際
の分割位置を決定する分割位置決定部と,上記分割位置
決定部による分割結果と上記ターゲット言語生成部によ
って生成されたターゲット言語とに基づいて各句または
節毎に上記ターゲット言語による訳語列を求める訳文フ
レーズ構築部を有して,入力文を句又は節毎に翻訳する
機械翻訳装置において、上記ターゲット言語生成部は、
入力文中に特定の格のみを共有する並列節が在る場合に
は、当該並列節を成す夫々の節から助動詞を検出し、当
該並列節を構成する各節に関して末尾の単語が上記検出
された助動詞に応じた終止形に変換されたターゲット言
語による訳語列を生成して、上記特定の格のみを共有す
る助動詞を含む並列節を成す節の何れにも当該助動詞に
基づく様相を反映させて節毎の翻訳を行うことを特徴と
している。In order to achieve the above object, the invention according to claim 1 provides a syntactic analysis unit for performing syntactic analysis of a sentence in a source language input from an input unit, and an obtained syntactic structure. A syntax conversion unit that converts the syntactic structure in the target language, a target language generation unit that generates a target language translation string of the input sentence based on the conversion result, and a division unit that divides the input sentence into phrases or clauses according to the division rules. A division position determining unit for determining a division position, and a translated sentence for obtaining a translation string in the target language for each phrase or clause based on the division result by the division position determining unit and the target language generated by the target language generating unit In a machine translation device that has a phrase construction unit and translates an input sentence into phrases or clauses, the target language generation unit is
When there is a parallel clause that shares only a specific case in the input sentence, the auxiliary verb is detected from each of the clauses that make up the parallel clause, and the last word is detected for each of the clauses that make up the parallel clause. Generate a translated word string in the target language converted to the final form according to the auxiliary verb, and reflect the aspect based on the auxiliary verb in any of the clauses that form a parallel clause including the auxiliary verb that shares only the above specific case. The feature is that each translation is performed.

【００１６】上記構成によれば、特定の格のみを共有す
る助動詞を含む並列節を有する入力文に対して当該並列
節を成す夫々の節毎に翻訳を行う場合には、当該並列節
を成す各節の何れにも当該助動詞に基づく様相を反映さ
せて訳語列が生成される。According to the above configuration, when an input sentence having a parallel clause including an auxiliary verb sharing only a specific case is translated for each clause forming the parallel clause, the parallel clause is formed. A translated word string is generated by reflecting the aspect based on the auxiliary verb in each of the clauses.

【００１７】また、請求項２に係る発明は、請求項１に
係る発明の機械翻訳装置において、上記ターゲット言語
生成部は、上記構文変換部によって得られたターゲット
言語による構文構造に基づいて特定の格のみを共有する
並列節を検出する並列節検出手段と、上記並列節検出手
段によって検出された夫々の節から助動詞を検出する助
動詞検出手段と、上記助動詞検出手段によって検出され
た助動詞を表す情報を格納する助動詞情報格納手段と、
上記並列節検出手段によって検出された夫々の節に関し
て,末尾の単語が上記助動詞情報格納手段に格納されて
いる情報に応じた終止形に変換されたターゲット言語に
よる訳語列を生成する助動詞訳生成手段を備えているこ
とを特徴としている。According to a second aspect of the present invention, in the machine translation device of the first aspect of the invention, the target language generation unit is specified based on the syntactic structure in the target language obtained by the syntactic conversion unit. Parallel clause detecting means for detecting parallel clauses sharing only the case, auxiliary verb detecting means for detecting auxiliary verbs from the respective clauses detected by the parallel clause detecting means, and information representing the auxiliary verb detected by the auxiliary verb detecting means. Auxiliary verb information storage means for storing
With respect to each clause detected by the parallel clause detecting means, an auxiliary verb translation generating means for generating a translated word string in the target language in which the last word is converted to an end form according to the information stored in the auxiliary verb information storage means It is characterized by having.

【００１８】上記構成において、並列節検出手段によっ
てターゲット言語による構文構造中に特定の格のみを共
有する並列節が検出されると、助動詞検出手段によっ
て、当該並列節を成す夫々の節から助動詞が検出されて
この助動詞を表す情報が助動詞情報格納手段に格納され
る。そして、当該並列節を成す夫々の節の訳語列を生成
するに際して、助動詞訳生成手段によって、末尾の単語
が上記助動詞情報格納手段に格納されている情報に応じ
た終止形に変換された訳語列が生成される。こうして、
入力文中に特定の格のみを共有する助動詞を含む並列節
が存在する場合には、当該並列節を成す各節の何れにも
当該助動詞に基づく様相を反映させて節毎の翻訳が行わ
れる。In the above configuration, when the parallel clause detecting means detects a parallel clause sharing only a specific case in the syntactic structure of the target language, the auxiliary verb detecting means detects the auxiliary verb from each clause forming the parallel clause. Information that is detected and represents this auxiliary verb is stored in the auxiliary verb information storage means. Then, when generating the translated word string of each of the clauses forming the parallel clause, the translated word string in which the last word is converted to the final form according to the information stored in the auxiliary verb information storage means by the auxiliary verb translation generation means Is generated. Thus,
When an input sentence includes a parallel clause including an auxiliary verb that shares only a specific case, each clause forming the parallel clause is translated by reflecting the aspect based on the auxiliary verb.

【００１９】また、請求項３に係る発明は、入力部から
入力されたソース言語による文の構文解析を行う構文解
析部と,得られた構文構造をターゲット言語による構文
構造に変換する構文変換部と,この変換結果に基づいて
入力文のターゲット言語による訳語列を生成するターゲ
ット言語生成部と,分割規則に従って入力文を句または
節に分割する際の分割位置を決定する分割位置決定部
と,上記分割位置決定部による分割結果と上記ターゲッ
ト言語生成部によって生成されたターゲット言語とに基
づいて各句または節毎に上記ターゲット言語による訳語
列を求める訳文フレーズ構築部を有して,入力文を句又
は節毎に翻訳する機械翻訳装置において、上記訳文フレ
ーズ構築部は、上記分割位置決定部での分割によって係
り受け関係の係りの語と受けの語とが異なる節に分割さ
れた場合には、上記受けの語を含む節の訳語列における
当該受けの語の直前に、上記係りの語を含む節が本来存
在することを表す補助記号を生成することを特徴として
いる。Further, the invention according to claim 3 is a syntax analysis unit for performing syntax analysis of a sentence in a source language input from an input unit, and a syntax conversion unit for converting the obtained syntax structure into a syntax structure in a target language. A target language generation unit that generates a translation string in the target language of the input sentence based on this conversion result; a division position determination unit that determines the division position when dividing the input sentence into phrases or clauses according to the division rules; Based on the result of the division by the division position determination unit and the target language generated by the target language generation unit, a target phrase construction unit that obtains a target word string in the target language for each phrase or clause In a machine translation device that translates phrases or clauses, the translated phrase construction unit receives the words related to the dependency relationship as a result of the division by the division position determination unit. When the word of is divided into different clauses, a supplementary symbol indicating that the clause containing the related word originally exists immediately before the corresponding word in the translated word sequence of the clause containing the above received word is added. It is characterized by generating.

【００２０】上記構成によれば、分割位置決定部によっ
て入力文における係りの語と受けの語とが異なる節に分
割された場合には、上記受けの語を含む節の訳語列中に
上記係りの語を含む節が本来存在することを明示した訳
語列が生成される。According to the above configuration, when the division position determining unit divides the related word and the receiving word in the input sentence into different clauses, the related word is included in the translated word string of the clause including the receiving word. A translated word string that clearly indicates that the clause including the word is originally present is generated.

【００２１】また、請求項４に係る発明は、請求項３に
係る発明の機械翻訳装置において、上記訳文フレーズ構
築部は、各節毎に求められた上記ターゲット言語による
訳語列に基づいて,入力文における係り受け関係の係り
の語と受けの語とが異なる節に分割されて上記係りの語
を含む節が受けの語を含む節から丸ごと抜けた構造を有
する中抜け節を検出する中抜け節検出手段と、上記中抜
け節検出手段によって中抜け節が検出されると,上記受
けの語を含む節の訳語列における当該受けの語の直前に
上記係りの語を含む節が抜けたことを表す補助記号を生
成する補助記号生成手段と、上記補助記号生成手段によ
って生成された補助記号の直後に,当該係り受け関係を
表す付属語を生成する付属語生成手段を備えていること
を特徴としている。Further, the invention according to claim 4 is the machine translation device according to claim 3, wherein the translated phrase construction section inputs based on the translated string in the target language obtained for each clause. A void in which a dependent word and a dependent word in a sentence are divided into different clauses and a clause including the dependent word has a structure in which the clause including the dependent word is completely omitted from the clause including the dependent word. When the hollow clause is detected by the clause detecting means and the hollow clause detecting means, the clause including the related word is omitted immediately before the word of the receiving word in the translated word sequence of the clause including the receiving word. A supplementary symbol generating means for generating a supplementary symbol expressing the auxiliary symbol and an adjunct word generating means for generating an adjunct word indicating the dependency relation immediately after the supplementary symbol generated by the supplementary symbol generating means. I am trying.

【００２２】上記構成において、入力文における係り受
け関係の係りの語と受けの語とが異なる節に分割されて
いることが中抜け節検出手段によって検出されると、補
助記号生成手段によって、上記受けの語を含む節の訳語
列における当該受けの語の直前に上記係りの語を含む節
が抜けたことを表す補助記号が生成され、付属語生成手
段によって上記補助記号の直後に付属語が生成される。
こうして、入力文中における係りの語と受けの語とが異
なる節に分割された場合には、上記受けの語を含む側の
節の訳語列に上記係りの語を含む節が抜けたことを表す
補助記号が挿入されて各節毎の翻訳が行われる。In the above structure, when the hollow clause detecting means detects that the dependent word and the dependent word of the dependency relation in the input sentence are divided into different clauses, the supplementary symbol generating means causes An auxiliary symbol indicating that the clause including the related word is omitted is generated immediately before the corresponding word in the translated word sequence of the clause including the receiving word, and the auxiliary word is generated immediately after the auxiliary symbol by the auxiliary word generating means. Is generated.
In this way, when the input word and the related word are divided into different clauses, it means that the section containing the related word has been omitted from the translated word string of the side clause containing the received word. Supplementary symbols are inserted and translation for each section is performed.

【００２３】また、請求項５に係る発明は、入力部から
入力されたソース言語による文の構文解析を行う構文解
析部と,得られた構文構造をターゲット言語による構文
構造に変換する構文変換部と,この変換結果に基づいて
入力文のターゲット言語による訳語列を生成するターゲ
ット言語生成部と,分割規則を適用して入力文を句また
は節に分割する際の分割位置を決定する分割位置決定部
と,上記分割位置決定部による分割結果と上記ターゲッ
ト言語生成部によって生成されたターゲット言語とに基
づいて各句または節毎に上記ターゲット言語による訳語
列を求める訳文フレーズ構築部を有して,入力文を句ま
たは節毎に翻訳する機械翻訳装置において、上記訳文フ
レーズ構築部は、上記分割位置決定部によって分離され
て格要素が欠落した関係節の訳語列中における上記欠落
した格要素の位置に、上記ターゲット言語生成部で生成
された当該関係節における関係詞の先行詞の訳語を生成
することを特徴としている。Further, the invention according to claim 5 is a syntax analysis unit for performing syntax analysis of a sentence in a source language input from an input unit, and a syntax conversion unit for converting the obtained syntax structure into a syntax structure in a target language. And a target language generator that generates a translation string in the target language of the input sentence based on this conversion result, and a division position determination that applies a division rule to determine the division position when dividing the input sentence into phrases or clauses. Section, a translation phrase construction unit that obtains a translation string in the target language for each phrase or clause based on the division result by the division position determination unit and the target language generated by the target language generation unit, In a machine translation device that translates an input sentence into phrases or clauses, the translated sentence phrase construction unit is separated by the division position determination unit and the case element is missing. The position of the missing case element in the translation column of, is characterized in that generating the translation of the antecedent relationship lyrics in the relative clause generated by the target language generator.

【００２４】上記構成によれば、分割位置決定部によっ
て分離されて格要素が欠落した関係節の訳語列中に、当
該関係詞の訳語として先行詞の訳語が挿入された訳語列
が生成される。According to the above configuration, a translated word string in which the translated word of the antecedent is inserted as the translated word of the relative word is generated in the translated word string of the relative clause which is separated by the division position determination unit and lacks the case element. .

【００２５】また、請求項６に係る発明は、請求項５に
係る発明の機械翻訳装置において、上記訳文フレーズ構
築部は、上記構文解析部によって得られた構文構造に基
づいて,上記分割位置検出部によって分離された各要素
が欠落した関係節を検出する関係節検出手段と、上記関
係節検出手段によって上記関係節が検出されると,上記
ターゲット言語生成部によって生成された入力文の訳語
列に基づいて,当該関係節における関係詞の先行詞の訳
語を得る先行詞取得手段と、上記先行詞取得手段によっ
て得られた当該先行詞の訳語を当該関係節の訳語列中に
おける上記欠落した格要素の位置に当該関係詞の訳語と
して生成する関係詞訳生成手段と、上記関係詞訳生成手
段によって生成された当該関係詞の訳語の直後に当該関
係詞の訳語の格を表す後置語を生成する後置語生成手段
を備えていることを特徴としている。Further, the invention according to claim 6 is the machine translation device according to the invention according to claim 5, wherein the translated phrase construction unit detects the division position based on the syntactic structure obtained by the syntactic analysis unit. When a relative clause detecting means detects a relative clause in which each element separated by a part is missing, and the relative clause is detected by the relative clause detecting means, a translated word string of the input sentence generated by the target language generating portion Based on, the antecedent acquisition means for obtaining the translated word of the antecedent of the relative in the relative clause, and the translation of the antecedent obtained by the antecedent acquisition means, the missing case in the translation string of the relative clause. The relative word translation generating means for generating the relative word at the position of the element, and the case of the relative word translated immediately after the relative word translated by the relative word translation generating means. It is characterized in that it comprises a 置語 generating means after generating the 置語.

【００２６】上記構成において、分割位置決定部によっ
て分離されて格要素が欠落した関係節が関係節検出手段
によって検出されると、先行詞取得手段によって当該関
係節の先行詞の訳語が得られ、関係詞訳生成手段によっ
て当該関係節の訳語列中における上記欠落した格要素の
位置に当該関係詞の訳語として生成される。さらに、後
置語生成手段によって、当該関係詞の訳語の直後に後置
語が生成される。こうして、上記分割位置決定部によっ
て分離されて格要素が欠落した関係節の訳語列中に当該
関係詞の訳語として先行詞の訳語が挿入されて各節毎の
翻訳が行われる。In the above structure, when the relative clause detecting means detects a relative clause which is separated by the division position deciding unit and lacks case elements, the antecedent obtaining means obtains a translated word of the antecedent of the relative clause. It is generated by the relative word translation generating means as a translation word of the relative word at the position of the missing case element in the translation word string of the relative clause. Further, the postposition word generation means generates the postposition word immediately after the translation word of the relative word. In this way, the translated word of the antecedent is inserted as the translated word of the relative clause into the translated word string of the relative clause that is separated by the division position determination unit and lacks the case element, and translation for each clause is performed.

【００２７】また、請求項７に係る発明は、請求項６に
係る発明の機械翻訳装置において、上記訳文フレーズ構
築部は、上記関係詞訳生成手段によって生成された当該
関係詞の訳語に係る修飾語を生成して、当該関係詞の訳
語が生成された節の可読性を向上させる補助修飾語生成
手段を備えていることを特徴としている。The invention according to claim 7 is the machine translation apparatus according to the invention according to claim 6, wherein the translated sentence phrase constructing unit modifies the translated word of the relative word generated by the relative word translation generating means. It is characterized by including an auxiliary modifier generation means for generating a word and improving the readability of the clause in which the translation of the relative word is generated.

【００２８】[0028]

【発明の実施の形態】以下、この発明を図示の実施の形
態により詳細に説明する。図１は本実施の形態の機械翻
訳装置における一実施の形態を示すブロック図である。
尚、本実施の形態においては、ソース言語が英語であ
り、ターゲット言語が日本語である場合を例に、本機械
翻訳装置を説明する。BEST MODE FOR CARRYING OUT THE INVENTION The present invention will be described in detail below with reference to the embodiments shown in the drawings. FIG. 1 is a block diagram showing an embodiment of a machine translation apparatus of this embodiment.
In the present embodiment, the machine translation apparatus will be described by taking the case where the source language is English and the target language is Japanese as an example.

【００２９】図１において、本機械翻訳装置は、入力部
１,翻訳制御部２,メインメモリ３,記憶部４,出力部５お
よび制御部６をバスライン７を介して接続して構成され
ている。上記入力部１は、キーボード,マウス,電子ペ
ン,タブレット,スキャナ,文字認識装置等の入力装置や
通信回線と接続された通信装置によって構成されて、各
種命令やソース言語による翻訳対象の文字列が入力され
る。上記翻訳制御部２は、翻訳モジュール２Ａ及び出力
形式整形部２Ｂから成り、入力部１から入力されたソー
ス言語による原文に対してフレーズ単位での翻訳処理を
行ってターゲット言語による翻訳文を生成する。上記メ
インメモリ３には、翻訳制御部２が翻訳文を生成する過
程において生じたデータを保持する各種のバッファが設
定されている。上記記憶部４には、翻訳制御部２が翻訳
文を生成する際に使用する翻訳用の辞書,文法規則,その
他翻訳に必要な規則等のデータが格納されている。上記
出力部５は、ＣＲＴ(陰極線管),ＬＣＤ(液晶表示装置)
等の表示装置やプリンタ等の印字装置あるいは通信回線
と接続されている通信装置によって構成されて、生成さ
れた翻訳文等を出力する。上記制御部６は、主としてＣ
ＰＵ(中央演算処理装置)によって構成されて、上記入力
部１,翻訳制御部２および出力部５を制御して翻訳処理
等の各種処理を実行する。In FIG. 1, this machine translation device is configured by connecting an input unit 1, a translation control unit 2, a main memory 3, a storage unit 4, an output unit 5 and a control unit 6 via a bus line 7. There is. The input unit 1 is composed of an input device such as a keyboard, a mouse, an electronic pen, a tablet, a scanner, and a character recognition device, and a communication device connected to a communication line. Is entered. The translation control unit 2 includes a translation module 2A and an output format shaping unit 2B, and performs translation processing in phrase units on an original sentence in the source language input from the input unit 1 to generate a translated sentence in the target language. . The main memory 3 is provided with various buffers for holding data generated in the process of the translation control unit 2 generating a translated sentence. The storage unit 4 stores data such as a dictionary for translation used when the translation control unit 2 generates a translated sentence, grammatical rules, and other rules necessary for translation. The output unit 5 is a CRT (cathode ray tube), LCD (liquid crystal display device).
It is constituted by a display device such as a printer, a printing device such as a printer, or a communication device connected to a communication line, and outputs a generated translated sentence or the like. The control unit 6 is mainly C
It is configured by a PU (Central Processing Unit) and controls the input unit 1, the translation control unit 2 and the output unit 5 to execute various processes such as a translation process.

【００３０】一般に、機械翻訳には、図６に示すような
解析レベルがある。図中左上においてソース言語による
原文が入力されると、レベルＬ1の辞書引き、レベルＬ2
の形態素解析、レベルＬ3の構文解析、…と解析が進め
られる。機械翻訳は、どの解析レベルまで解析を行うか
によって大きく次の２つに分けられる。１つは、レベル
Ｌ6のソース言語およびターゲット言語の何れにも依存
しない概念である中間言語まで解析し、そこから、レベ
ルＬ7のターゲット言語による文脈生成、レベルＬ8の意
味生成、レベルＬ9の構文生成、レベルＬ10の形態素生
成と生成を進めて、ターゲット言語による翻訳文を生成
していくピボット方式である。もう１つは、上記レベル
Ｌ2の形態素解析,レベルＬ3の構文解析,レベルＬ4の意
味解析およびレベルＬ5の文脈解析の何れかのレベルま
で解析を行ってソース言語の内部構造を得、次に、この
得られたソース言語の内部構造と同レベルのターゲット
言語の内部構造に変換した後、ターゲット言語の翻訳文
を生成するトランスファー方式である。In general, machine translation has analysis levels as shown in FIG. When the original text in the source language is entered in the upper left of the figure, the dictionary lookup for level L1 and level L2
Morphological analysis, level L3 syntax analysis, and so on. Machine translation can be roughly divided into the following two, depending on which analysis level is used. One is to analyze an intermediate language, which is a concept that does not depend on any of the source language and the target language of level L6, and from that, generate context in the target language of level L7, generate meaning of level L8, generate syntax of level L9. , Is a pivot method for generating a translated sentence in a target language by proceeding with morpheme generation and generation at level L10. The other is to analyze the level L2 morphological analysis, level L3 syntactic analysis, level L4 semantic analysis, and level L5 context analysis to obtain the internal structure of the source language. This is a transfer method in which after converting the obtained internal structure of the source language into the internal structure of the target language at the same level, a translated sentence of the target language is generated.

【００３１】以下、上記各解析について順次説明する。（Ａ）辞書引き・形態素解析上記記憶部４の辞書メモリ４aに格納された辞書を引い
て、入力された文章を各形態素列(単語列)に分割する。
そして、分割された各単語の品詞等の文法情報および訳
語を得、さらに、時制・人称・数等を解析する。（Ｂ）構文解析上記形態素解析の結果得られた各単語間の係り受け等の
入力文の構造(構造解析木)を決定する。（Ｃ）意味解析上記構文解析の結果得られた複数の構文候補の中から意
味的に正しいものとそうでないものとを判別する。（Ｄ）文脈解析話題を理解し、省略や曖昧さを取り去る。本実施例にお
ける翻訳モジュール２Ａは、図６における少なくともレ
ベルＬ3の構文解析までの解析を行うトランスファー方
式によって翻訳処理を実行するものとする。Hereinafter, each of the above analyzes will be sequentially described. (A) Dictionary Lookup / Morpheme Analysis Look up the dictionary stored in the dictionary memory 4a of the storage unit 4 to divide the input sentence into each morpheme string (word string).
Then, the grammatical information such as the part of speech of each divided word and the translated word are obtained, and further the tense, the personal name, the number, etc. are analyzed. (B) Syntactic analysis The structure (structure analysis tree) of the input sentence such as the dependency between each word obtained as a result of the above morphological analysis is determined. (C) Semantic analysis From a plurality of syntax candidates obtained as a result of the above-mentioned syntax analysis, semantically correct ones and non-semantic ones are discriminated. (D) Context analysis Understand topics and remove omissions and ambiguities. It is assumed that the translation module 2A in this embodiment executes the translation process by the transfer method for performing the analysis up to the syntax analysis of at least the level L3 in FIG.

【００３２】上述のトランスファー方式による翻訳処理
を実行するために、翻訳モジュール２Ａは、図１に示す
ように、辞書引き・形態素解析部２a,構文解析部２b,構
文変換部２c,ターゲット言語生成部２dを有している。
さらに、翻訳中のデータを参照して文をフレーズに分割
する際の分割位置を決定する分割位置決定部２eと、タ
ーゲット言語生成部２dによって生成されたターゲット
言語の単語から各訳文フレーズを構築する訳文フレーズ
構築部２fを有している。In order to execute the translation processing by the transfer method described above, the translation module 2A, as shown in FIG. 1, has a dictionary lookup / morphological analysis unit 2a, a syntax analysis unit 2b, a syntax conversion unit 2c, and a target language generation unit. Has 2d.
Further, each translated phrase is constructed from a target language word generated by the target language generation unit 2d and a division position determination unit 2e that determines a division position when the sentence is divided into phrases by referring to the data being translated. It has a translated phrase construction unit 2f.

【００３３】また、上記メインメモリ３には、原文バッ
ファ３a,辞書引きバッファ３b,変換前の構文バッファ３
c,変換後の構文バッファ３d,訳語バッファ３e,原語・訳
語対応情報バッファ３f,分割位置バッファ３g,フレーズ
毎の訳語単語位置バッファ３h,訳フレーズバッファ３i,
出力形式バッファ３j,助動詞バッファ３k,フレーズ訳フ
ラグバッファ３l,木構造ポインタバッファ３mが設定さ
れている。また、上記記憶部４には、辞書メモリ４aの
他に、構文解析部２bが構文解析時に使用する文法規則
が格納される文法規則メモリ４bや後置語テーブルメモ
リ４c,補助訳語テーブルメモリ４d,その他の翻訳規則メ
モリ４e,フレーズ分割規則メモリ４f,助動詞コード表メ
モリ４gが設定されている。In the main memory 3, the original text buffer 3a, the dictionary lookup buffer 3b, the syntax buffer 3 before conversion are stored.
c, post-conversion syntax buffer 3d, translation word buffer 3e, original language / translation word correspondence information buffer 3f, division position buffer 3g, translation word position buffer 3h for each phrase, translation phrase buffer 3i,
The output format buffer 3j, auxiliary verb buffer 3k, phrase translation flag buffer 3l, and tree structure pointer buffer 3m are set. In addition to the dictionary memory 4a, the storage unit 4 stores a grammar rule memory 4b, a postfix word table memory 4c, an auxiliary translation table memory 4d, which stores grammatical rules used by the syntax analysis unit 2b during syntax analysis. The other translation rule memory 4e, phrase division rule memory 4f, and auxiliary verb code table memory 4g are set.

【００３４】さらに、本実施の形態における翻訳モジュ
ール２Ａは、フレーズ単位で翻訳するに際して内容が正
確で分かり易い訳文フレーズを生成するために、図２に
示すように、ターゲット言語生成部２dには、木構造ト
レース手段１１,節点情報取得手段１２,助動詞文訳出法
決定手段１３,助動詞検出手段１４,並列節検出手段１
５,辞書記述訳語生成手段１６,付属語生成手段１７およ
び助動詞訳生成手段１８を有している。また、図３に示
すように、上記訳文フレーズ構築部２fには、中抜けフ
レーズ検出手段２１,関係節検出手段２２,生成位置修正
手段２３,補助記号生成手段２４,補助修飾語生成手段２
５,先行詞取得手段２６および後置語生成手段２７を有
している。Further, the translation module 2A according to the present embodiment, as shown in FIG. 2, has a target language generation unit 2d, in order to generate a translated phrase whose content is accurate and easy to understand when translating in phrase units. Tree structure tracing means 11, node information acquisition means 12, auxiliary verb sentence translation method determination means 13, auxiliary verb detection means 14, parallel node detection means 1
5, it has a dictionary description translation word generation means 16, an adjunct word generation means 17, and an auxiliary verb translation generation means 18. Further, as shown in FIG. 3, the translated phrase construction unit 2f includes a blank phrase detection unit 21, a relative clause detection unit 22, a generation position correction unit 23, an auxiliary symbol generation unit 24, and an auxiliary modifier generation unit 2.
5, it has an antecedent acquisition means 26 and a postpositional word generation means 27.

【００３５】図４は、上記制御部６の制御の下に、入力
部１,翻訳制御部２および出力部３によって実行される
フレーズ単位翻訳処理動作のフローチャートである。以
下、図４に従って、本実施の形態におけるフレーズ単位
翻訳処理動作について説明する。FIG. 4 is a flow chart of the phrase unit translation processing operation executed by the input unit 1, the translation control unit 2 and the output unit 3 under the control of the control unit 6. The phrase-based translation processing operation in this embodiment will be described below with reference to FIG.

【００３６】ステップＳ1で、上記入力部１から原文が
入力される。そうすると、入力部１によって当該原文を
構成する文字列が認識され、空白文字(スペース)に基づ
いて入力文字列が部分文字列に分割される。そして、分
割された各部分文字列(形態素候補)は、図７に示すよう
に、位置情報が付加されて原文バッファ３aに格納され
る。さらに、上記入力部１からフレーズ単位による翻訳
処理(以下、フレーズ訳と言う)が指令されて入力部１に
よって認識される。そうすると、上記認識結果「フレー
ズ訳」に従って、メインメモリ３のフレーズ訳フラグバ
ッファ３lにフレーズ訳フラグがセットされる。ステッ
プＳ2で、上記翻訳モジュール２Ａの辞書引き・形態素解
析部２aによって、上述のようにして原文バッファ３aに
格納された原文に対して、上記辞書引き・形態素解析処
理が実行され、得られた解析結果がメインメモリ３の辞
書引きバッファ３bに格納される。ステップＳ3で、上記
翻訳モジュール２Ａの構文解析部２bによって、上記構
文解析処理が実行されて原文の構造解析木が決定され
る。そして、得られた構造解析木がメインメモリ３にお
ける変換前の構文バッファ３cに格納される。ステップ
Ｓ4で、上記翻訳モジュール２Ａの構文変換部２cによっ
て、変換前の構文バッファ３cに格納されている原文の
構造解析木が日本語の構造解析木に変換されて、メイン
メモリ３における変換後の構文バッファ３dに格納され
る。In step S1, the original text is input from the input unit 1. Then, the input unit 1 recognizes the character string forming the original sentence and divides the input character string into partial character strings based on the blank character (space). Then, each divided partial character string (morpheme candidate) is stored in the original text buffer 3a with position information added thereto, as shown in FIG. Further, a translation process in phrase units (hereinafter referred to as phrase translation) is instructed from the input unit 1 and recognized by the input unit 1. Then, the phrase translation flag is set in the phrase translation flag buffer 3l of the main memory 3 according to the recognition result "phrase translation". In step S2, the dictionary lookup / morpheme analysis unit 2a of the translation module 2A executes the dictionary lookup / morpheme analysis process on the original sentence stored in the original sentence buffer 3a as described above, and the obtained analysis is performed. The result is stored in the dictionary lookup buffer 3b of the main memory 3. In step S3, the syntactic analysis unit 2b of the translation module 2A executes the syntactic analysis process to determine the structure analysis tree of the original sentence. Then, the obtained structure analysis tree is stored in the pre-conversion syntax buffer 3c in the main memory 3. In step S4, the structure conversion tree 2c of the translation module 2A converts the original structure analysis tree stored in the pre-conversion syntax buffer 3c into a Japanese structure analysis tree, and the converted structure analysis tree in the main memory 3 is obtained. It is stored in the syntax buffer 3d.

【００３７】ステップＳ5で、上記翻訳モジュール２Ａ
におけるターゲット言語生成部２dによって、後に詳述
するようなターゲット言語生成サブルーチンが実行され
て当該原文の各単語の訳語が決定される。そして、決定
された各訳語がメインメモリ３の訳語バッファ３eに格
納される。また、メインメモリ３の原語・訳語対応情報
バッファ３fに原語・訳語対応情報が格納される。ステッ
プＳ6で、上記翻訳モジュール２Ａの分割位置決定部２e
によって、当該原文のフレーズへの分割位置が決定され
て、メインメモリ３の分割位置バッファ３gに格納され
る。尚、このフレーズへの分割位置の決定は、変換前の
構文バッファ３cに格納されている構造解析木に対し
て、記憶部４に格納されている図８に示すようなフレー
ズ分割規則を適用することによって行われる。その際に
おける分割位置決定の実際の動作は、上記構造解析木の
節点を所定の順序で探索し、各節点で夫々のフレーズ分
割規則が適用可能か否かを調べ、適用可能であればその
節点をフレーズの分割位置であると決定することによっ
て行われる。In step S5, the translation module 2A
The target language generation unit 2d executes the target language generation subroutine, which will be described in detail later, to determine the translation of each word of the original sentence. Then, each determined translation word is stored in the translation word buffer 3e of the main memory 3. Further, the source language / translation word correspondence information is stored in the source language / translation word correspondence information buffer 3f of the main memory 3. In step S6, the division position determination unit 2e of the translation module 2A
Thus, the division position of the original sentence into phrases is determined and stored in the division position buffer 3g of the main memory 3. Note that the phrase division position is determined by applying the phrase division rule stored in the storage unit 4 as shown in FIG. 8 to the structure analysis tree stored in the syntax buffer 3c before conversion. Done by. The actual operation of the division position determination at that time is to search the nodes of the above structural analysis tree in a predetermined order, check whether or not each phrase division rule is applicable at each node, and if applicable, Is determined as the phrase division position.

【００３８】ステップＳ7で、上記翻訳モジュール２Ａ
の訳文フレーズ構築部２fによって、上記ステップＳ5に
おいて作成された原語・訳語対応情報(図１５参照)と上
記ステップＳ6において作成された分割位置情報(図１６
参照)に基づいて、各フレーズを構成する訳語の単語(以
下、訳語単語と言う)を得る。そして、上記訳語バッフ
ァ３eを参照して各訳語単語の訳文上での単語位置(訳語
単語位置)を求めて、メインメモリ３のフレーズ毎の訳
語単語位置バッファ３hにセットする。尚、このフレー
ズ毎の訳語単語位置バッファ３hの形式を図１７に示
す。ステップＳ8で、上記訳文フレーズ構築部２fによっ
て、上記ステップＳ5におけるターゲット言語生成サブ
ルーチンの実行の結果決定された訳語単語のうち訳語単
語位置がフレーズ毎の訳語単語位置バッファ３hに未だ
セットされていない（すなわち、属するフレーズが決定
していない)訳語単語に関して、後に詳述するようにし
て、当該訳語単語が属するフレーズが定められてフレー
ズ毎の訳語単語位置バッファ３hにおける該当するフレ
ーズに当該訳語単語の訳語単語位置がセットされる。In step S7, the translation module 2A
By the translated phrase construction unit 2f of the original phrase / translated word correspondence information (see FIG. 15) created in step S5 and the division position information created in step S6 (see FIG. 16).
Based on the reference), the words of the translated words (hereinafter referred to as translated words) that form each phrase are obtained. Then, the word position (translated word position) of each translated word in the translated sentence is obtained by referring to the translated word buffer 3e and set in the translated word position buffer 3h for each phrase in the main memory 3. The format of the translated word position buffer 3h for each phrase is shown in FIG. In step S8, the translated word position of the translated word determined as a result of the execution of the target language generation subroutine in step S5 by the translated phrase construction unit 2f is not yet set in the translated word position buffer 3h for each phrase ( (For example, the phrase to which the phrase belongs is not determined.) For the translated word, the phrase to which the translated word belongs is determined and the translated word of the translated word is added to the corresponding phrase in the translated word position buffer 3h for each phrase, as described later in detail. The word position is set.

【００３９】ステップＳ9で、上記訳文フレーズ構築部
２fによって、フレーズ毎の訳語単語位置バッファ３hに
セットされた各訳語単語位置に対して修正が必要である
か否かが判定され、修正が必要な訳語単語位置が存在す
れば修正操作が行われる。尚、上記修正操作は、記憶部
４のその他の翻訳規則メモリ４eに記憶されている修正
規則に従って、後に詳述するようにして行われる。In step S9, the translated phrase construction unit 2f determines whether or not each translated word position set in the translated word position buffer 3h for each phrase needs to be corrected. If the translated word position exists, a correction operation is performed. The above-mentioned correction operation is performed as described in detail later according to the correction rule stored in the other translation rule memory 4e of the storage unit 4.

【００４０】ステップＳ10で、上記訳文フレーズ構築部
２fによって、上記ステップＳ7〜ステップＳ9において
作成されたフレーズ毎の訳語単語位置バッファ３hに格
納された各フレーズ毎の訳語単語位置に基づいて、訳語
バッファ３eを参照してフレーズ毎の訳語列を求め、得
られたフレーズ毎の訳語列を訳フレーズバッファ３iに
格納する。ステップＳ11で、上記出力形式整形部５Ｂに
よって、訳フレーズバッファ３iに格納されている各フ
レーズ毎の訳語列の形式が出力する際の形式に整えられ
て出力形式バッファ３jに格納される。尚、上記出力す
る際の形式とは、例えば図１９に示すように、夫々のフ
レーズの原文と訳文との先頭位置が揃えられて、原文フ
レーズとその訳文である訳文フレーズとの対応関係が明
確にされた形式である。ステップＳ12で、上記出力部５
によって、図１９に示すように、出力形式バッファ３j
に格納されている原文フレーズと訳文フレーズとが出力
される。In step S10, the translated word phrase construction unit 2f uses the translated word buffer for each phrase stored in the translated word position buffer 3h for each phrase created in steps S7 to S9, based on the translated word position for each phrase. The translated word string for each phrase is obtained by referring to 3e, and the obtained translated word string for each phrase is stored in the translated phrase buffer 3i. In step S11, the output format shaping unit 5B adjusts the format of the translated word string for each phrase stored in the translated phrase buffer 3i to the format for output and stores it in the output format buffer 3j. The output format is, for example, as shown in FIG. 19, that the start positions of the original sentence and the translated sentence of each phrase are aligned and the correspondence between the original sentence phrase and the translated sentence that is the translated sentence is clear. This is the format used. In step S12, the output unit 5
As shown in FIG. 19, the output format buffer 3j
The original phrase and the translated phrase stored in are output.

【００４１】図５は、図４に示すフレーズ単位翻訳処理
動作のフローチャートにおける上記ステップＳ5におい
てターゲット言語生成部２dによって実行されるターゲ
ット言語生成サブルーチンのフローチャートである。以
下、図５に従って、ターゲット言語生成サブルーチンに
ついて説明する。FIG. 5 is a flowchart of a target language generation subroutine executed by the target language generation unit 2d in step S5 in the flowchart of the phrase unit translation processing operation shown in FIG. The target language generation subroutine will be described below with reference to FIG.

【００４２】図４に示すフレーズ単位翻訳処理動作のフ
ローチャートにおける上記ステップＳ4において原文の
構造解析木が日本語の構造解析木に変換されると、ター
ゲット言語生成サブルーチンがスタートする。ステップ
Ｓ21で、上記ターゲット言語生成部２dの並列節検出手
段１５によって、上記構文変換部２cで得られた日本語
の構造解析木の中に並列の句や節が存在するか否かが判
別される。その結果、存在する場合には当該構造解析木
における並列句や並列節に相当する節点がポイントされ
てステップＳ23に進む。一方、存在しない場合にはステ
ップＳ22に進む。尚、上述の「節点をポイントする」と
は、上記メインメモリ３の木構造ポインタバッファ３m
に構造解析木における節点の番号を格納することであ
る。このようにして、現在着目すべき節点の情報を保持
しておくのである。ステップＳ22で、上記辞書記述訳語
生成手段１６および付属語生成手段１７によって、当該
原文の各単語の訳語が辞書引きバッファ３bに格納され
た訳語に基づいて決定されて訳語バッファ３eに格納さ
れる。また、原語・訳語対応情報バッファ３fに原語・訳
語対応情報が格納される。そうした後、図４に示すフレ
ーズ単位翻訳処理動作のフローチャートにおける上記ス
テップＳ6にリターンする。When the structure analysis tree of the original sentence is converted into the Japanese structure analysis tree in step S4 in the flowchart of the phrase-based translation processing operation shown in FIG. 4, the target language generation subroutine starts. In step S21, the parallel clause detecting means 15 of the target language generator 2d determines whether or not there are parallel phrases or clauses in the Japanese structural analysis tree obtained by the syntax converter 2c. It As a result, if they exist, the node corresponding to the parallel phrase or parallel node in the structural analysis tree is pointed, and the process proceeds to step S23. On the other hand, if it does not exist, the process proceeds to step S22. The above-mentioned "pointing to a node" means that the tree structure pointer buffer 3m of the main memory 3 mentioned above.
Is to store the node number in the structural analysis tree. In this way, the information on the node to be focused on at present is held. In step S22, the dictionary description translation word generation means 16 and the adjunct word generation means 17 determine the translation word of each word of the original sentence based on the translation word stored in the dictionary lookup buffer 3b and store it in the translation word buffer 3e. Further, the source language / translation word correspondence information is stored in the source language / translation word correspondence information buffer 3f. After that, the process returns to step S6 in the flowchart of the phrase-based translation processing operation shown in FIG.

【００４３】ステップＳ23で、上記メインメモリ３の助
動詞バッファ３kの内容がクリアされる。ステップＳ24
で、上記木構造トレース手段１１によって、節点情報取
得手段１２で取得された下位に在る各節点の品詞や文法
要素等を参照して上記各下位節点間の位置関係が判定さ
れることによって、現在ポイントされている節点の長子
に相当する節点がポイントされる。In step S23, the contents of the auxiliary verb buffer 3k in the main memory 3 are cleared. Step S24
Then, the tree structure tracing means 11 determines the positional relationship between the respective lower nodes by referring to the parts of speech and grammatical elements of the respective lower nodes acquired by the node information acquiring means 12, The node corresponding to the longest child of the currently pointed node is pointed.

【００４４】ステップＳ25で、上記助動詞検出手段１４
によって、現在ポイントされている節点に助動詞が在る
か否かが判別される。その結果、助動詞が在ればステッ
プＳ26に進み、なければステップＳ27に進む。尚、上記
助動詞の検出は次のようにして行われる。すなわち、こ
こでいう助動詞とは英語の構造解析木上における助動詞
のことであるから、変換後(日本語)の構造解析木上には
存在しない。そこで、木構造トレース手段１１と節点情
報取得手段１２とが制御されて、変換後(日本語)の構造
解析木上におけるポイントされている節点に対応する変
換前(英語)の構造解析木上における節点をトレースする
ことによって行われるのである。ステップＳ26で、上記
助動詞検出手段１４によって、上述の如く検出された助
動詞の助動詞コードが、図１１に示すような形式で記憶
部４の助動詞コード表メモリ４gに格納されている助動
詞コード表を参照して得られる。そして、得られた助動
詞コードが助動詞バッファ３kに格納される。In step S25, the auxiliary verb detecting means 14 is used.
Determines whether the currently pointed node has an auxiliary verb. As a result, if there is an auxiliary verb, the process proceeds to step S26, and if not, the process proceeds to step S27. The auxiliary verb is detected as follows. That is, since the auxiliary verb here is an auxiliary verb on the English structural analysis tree, it does not exist on the converted (Japanese) structural analysis tree. Therefore, the tree structure tracing means 11 and the node information acquisition means 12 are controlled so that on the structure analysis tree before conversion (English) corresponding to the pointed point on the structure analysis tree after conversion (Japanese). This is done by tracing the nodes. In step S26, the auxiliary verb code of the auxiliary verb detected as described above by the auxiliary verb detecting means 14 is referred to the auxiliary verb code table stored in the auxiliary verb code table memory 4g of the storage unit 4 in a format as shown in FIG. Obtained. Then, the obtained auxiliary verb code is stored in the auxiliary verb buffer 3k.

【００４５】ステップＳ27で、上記助動詞文訳出法決定
手段１３によって、フレーズ訳フラグバッファ３lの内
容が参照されてフレーズ訳が指令されているか否かが判
別される。その結果、フレーズ訳が指令されている場合
にはステップＳ28に進み、指令されていない場合にはス
テップＳ34に進む。ステップＳ28で、上記辞書記述訳語
生成手段１６および付属語生成手段１７によって、辞書
引きバッファ３bに格納されている訳語に基づいて、現
在ポイントされている節点に属する原語単語列に相応し
い訳語単語列が決定されて訳語バッファ３eに格納され
る。その際に、助動詞訳生成手段１８によって、末尾が
助動詞バッファ３kに格納された助動詞コードに応じた
終止形に変形される。但し、助動詞バッファ３kに助動
詞コードが格納されていない場合には、単に終止形に変
形される。In step S27, the auxiliary verb sentence translation method determining means 13 refers to the contents of the phrase translation flag buffer 3l to determine whether or not the phrase translation is instructed. As a result, if the phrase translation is instructed, the process proceeds to step S28, and if not, the process proceeds to step S34. In step S28, the dictionary description translation word generation means 16 and the adjunct word generation means 17 generate a translated word string suitable for the original word string belonging to the currently pointed node based on the translated words stored in the dictionary lookup buffer 3b. It is determined and stored in the translated word buffer 3e. At this time, the auxiliary verb translation generating means 18 transforms the end into a final form corresponding to the auxiliary verb code stored in the auxiliary verb buffer 3k. However, when the auxiliary verb code is not stored in the auxiliary verb buffer 3k, the auxiliary verb code is simply transformed into the final form.

【００４６】ここで、上記付属語生成手段１７は、原語
単語に対応しない訳語単語を生成する。その生成方法の
一例として、格構造に関する情報を利用する方法があ
る。この方法は、原語単語に対応する訳語単語を生成す
る際に、辞書メモリ４aに格納されている当該原語単語
の格情報が参照されて、ターゲット言語(日本語)で言う
ところの格助詞等の付属語を決定するものである。Here, the adjunct word generating means 17 generates a translated word that does not correspond to the original word. As an example of the generation method, there is a method of using information on case structure. In this method, when generating a translated word corresponding to an original word, the case information of the original word stored in the dictionary memory 4a is referred to, such as a case particle in the target language (Japanese). It determines the adjunct.

【００４７】ステップＳ29で、上記木構造トレース手段
１１によって、節点情報取得手段１２で取得された品詞
や文法要素等を参照して、現在ポイントされている節点
に弟節点が在るか否かが判別される。その結果、弟節点
が在る場合にはステップＳ31に進み、ない場合にはステ
ップＳ30に進む。ステップＳ30で、上記メインメモリ３
の助動詞バッファ３kの内容がクリアされる。そうした
後、図４に示すフレーズ単位翻訳処理動作のフローチャ
ートにおける上記ステップＳ6にリターンする。In step S29, the tree structure tracing means 11 refers to the part-of-speech, the grammatical element and the like acquired by the node information acquiring means 12 to determine whether or not the currently pointed node has a younger brother node. To be determined. As a result, if there is a younger brother node, the process proceeds to step S31, and if not, the process proceeds to step S30. In step S30, the main memory 3
The contents of the auxiliary verb buffer 3k of are cleared. After that, the process returns to step S6 in the flowchart of the phrase-based translation processing operation shown in FIG.

【００４８】ステップＳ31で、上記木構造トレース手段
１１によって、上記ステップＳ29において在ると判別さ
れた弟節点がポイントされる。ステップＳ32で、上記辞
書記述訳語生成手段１６によって、当該弟節点に在る接
続詞の訳語が生成されて訳語バッファ３eに格納され
る。ステップＳ33で、上記木構造トレース手段１１によ
って、次の弟節点(すなわち、上記接続詞で上記長子節
点に並列されている節点)がポイントされる。そうした
後、上記ステップＳ25に戻って当該弟節点に対する処理
に移行する。In step S31, the tree structure tracing means 11 points the younger brother node determined to exist in step S29. In step S32, the dictionary description translation word generation means 16 generates the translation word of the conjunction at the younger brother node and stores it in the translation word buffer 3e. In step S33, the tree structure tracing means 11 points the next younger brother node (that is, the node parallel to the firstborn node in the conjunction). After that, the process returns to the step S25 and shifts to the process for the younger brother node.

【００４９】ステップＳ34で、上記ステップＳ27におい
てフレーズ訳が指令されていないと判別された場合に
は、木構造トレース手段１１および節点情報取得手段１
２によって、現在ポイントされている節点に弟節点があ
るか否かが判別される。その結果、弟節点が在る場合に
はステップＳ35に進み、ない場合には上記ステップＳ28
に戻って現在の節点の訳が生成される。ステップＳ35
で、上記辞書記述訳語生成手段１６および付属語生成手
段１７によって、現在ポイントされている節点に属する
原語単語列に相応しい訳語単語列が決定されて訳語バッ
ファ３eに格納される。但し、この場合は、助動詞バッ
ファ３kに助動詞コードが存在していても存在していな
くても、節の末尾は助動詞の効力が及んでいないような
連用中止形に変形される。そうした後、上記ステップＳ
31に戻って、上記ステップＳ34において存在すると判別
された弟節点にポイントがセットされて、当該弟節点に
対する処理に移行する。そして、上記ステップＳ29にお
いて最早弟節点がないと判別されるとステップＳ30を経
由して、図４に示すフレーズ単位翻訳処理動作のフロー
チャートにおける上記ステップＳ6にリターンする。If it is determined in step S34 that the phrase translation is not instructed in step S27, the tree structure tracing means 11 and the node information acquisition means 1
By 2, it is determined whether or not the currently pointed node has a younger brother node. As a result, if there is a younger brother node, the process proceeds to step S35, and if not, the above step S28.
Returning to, the translation of the current node is generated. Step S35
Then, the dictionary description translation word generation means 16 and the adjunct word generation means 17 determine a translation word word string suitable for the original language word string belonging to the currently pointed node and store it in the translation word buffer 3e. However, in this case, whether or not the auxiliary verb code exists in the auxiliary verb buffer 3k, the end of the section is transformed into a continuous discontinuation form in which the effect of the auxiliary verb does not reach. After that, the above step S
Returning to step 31, the point is set to the younger brother node determined to exist in step S34, and the processing shifts to the younger brother node. If it is determined in step S29 that there is no younger brother node anymore, the process returns to step S6 in the flowchart of the phrase-based translation processing operation shown in FIG. 4 via step S30.

【００５０】以下、上記入力部１に、助動詞を含む並列
節が存在する原文「It would push them to the end of
the creditors' lineand make it unlikely.」が入力
された場合を例に、図４および図５に示すフレーズ単位
翻訳処理動作について具体的に説明する。Hereinafter, in the input section 1, the original sentence "It would push them to the end of" in which a parallel clause including an auxiliary verb exists.
The phrase-based translation processing operation shown in FIG. 4 and FIG. 5 will be specifically described by taking the case where "the creditors' lineand make it unlikely." is input as an example.

【００５１】先ず、入力部１から読み込まれた原文は、
図７に示すように原文バッファ３aに格納される。そし
て、辞書引き・形態素解析処理および構文解析処理が行
われて図９に示すような原文の構造解析木が得られ、変
換前の構文バッファ３cに格納される。さらに、構文変
換処理が行われて図１０に示すような日本語の構造解析
木が得られ、変換後の構文バッファ３dに格納される。
…ステップＳ1〜ステップＳ4First, the original sentence read from the input section 1 is
It is stored in the original text buffer 3a as shown in FIG. Then, a dictionary lookup / morphological analysis process and a syntactic analysis process are performed to obtain a structure analysis tree of the original sentence as shown in FIG. 9 and stored in the syntax buffer 3c before conversion. Further, a syntax conversion process is performed to obtain a Japanese structural parse tree as shown in FIG. 10, and it is stored in the converted syntax buffer 3d.
... Steps S1 to S4

【００５２】さらに、当該日本語の構造解析木中に並列
節が存在するので、「並列節」の節点番号“１"(図１０参
照)が木構造ポインタバッファ３mに格納される。そし
て、節点「並列節」に属する節点「主節(節点番号＝２)」と
節点「接続詞(節点番号＝１８)」と節点「主語欠け主節(節
点番号＝１９)」との３個の子節点のうちの節点「主節」が
長子であると判定されて、節点「主節」の節点番号“２"
が木構造ポインタバッファ３mに格納される。…ステッ
プＳ21,Ｓ23,Ｓ24Further, since a parallel clause exists in the Japanese structural analysis tree, the node number "1" (see FIG. 10) of the "parallel clause" is stored in the tree structure pointer buffer 3m. Then, three nodes of the node “main node (node number = 2)”, the node “connective node (node number = 18)”, and the node “subject lacking main node (node number = 19)” belonging to the node “parallel node” Of the child nodes, the node “main node” is determined to be the first child, and the node number “2” of the node “main node” is determined.
Is stored in the tree structure pointer buffer 3m. ... Steps S21, S23, S24

【００５３】ここで、上記長子である節点「主節(節点番
号＝２)」に対応する変換前の構造解析木(図９)における
節点は節点「主節(節点番号＝２)」であり、その下位には
節点「助動詞(節点番号＝６)」が存在する。そこで、その
節点「助動詞(６)」に該当する助動詞「would」の助動詞コ
ード「９」が図１１に示す助動詞コード表を参照して得ら
れて、図１２に示すように助動詞バッファ３kにセット
される。…ステップＳ25,Ｓ26Here, the node in the structural analysis tree (FIG. 9) before conversion corresponding to the node "main node (node number = 2)" which is the above-mentioned child is the node "main node (node number = 2)". , And the node “auxiliary verb (node number = 6)” exists below it. Then, the auxiliary verb code "9" of the auxiliary verb "would" corresponding to the node "auxiliary verb (6)" is obtained by referring to the auxiliary verb code table shown in FIG. 11, and is set in the auxiliary verb buffer 3k as shown in FIG. To be done. ... Steps S25 and S26

【００５４】そして、「フレーズ訳」が指令されているか
ら、図１０に示す変換後の構造解析木における現在ポイ
ントされている節点「主節(２)」に属する原語単語列「 It them creditors' line of end to push」から、辞書引きバッファ３bに格納された訳語に基づい
て当該節点「主節(２)」の訳語単語列が決定される。そし
て、この場合には、当該節の末尾は、当該助動詞コード
「９」に従って終止形「〜だろう」に変形される。その結
果、当該原文における節点「主節(２)」に属する各原語単
語に対する訳語は「それ」「それら」「債権者」「の」「ライン」「の」「エンド」「入れるだろう」と決定される。加えて、原語単語に対応しない訳語「、」「は」「を」が生成されて、図１３のように訳語バッファ３eにセッ
トされる。ここで、上述のように、上記原語単語に対応
しない訳語は、付属語生成手段１７によって記憶部４の
辞書メモリ４aに格納された格情報が参照されて生成さ
れるのであるが、具体的には、図３４に示すように辞書
メモリ４aに格納されている「格名」の内容に着目し、「格
名」の内容に対応する付属語を図３５に示すような記憶
部４の後置語テーブル４cから生成するのである。…ス
テップＳ27,Ｓ28Since the "phrase translation" is commanded, the original word string "It them creditors'" belonging to the node "main clause (2)" currently pointed in the converted structural analysis tree shown in FIG. From "line of end to push", the translated word string of the node "main clause (2)" is determined based on the translated word stored in the dictionary lookup buffer 3b. Then, in this case, the end of the clause is transformed into the final form “to darou” according to the auxiliary verb code “9”. As a result, the translated words for each original word belonging to the node “main clause (2)” in the original sentence are “that”, “these”, “creditors”, “no”, “line”, “no”, “end”, and “insert”. It is determined. In addition, translated words “,” “wa” “wo” that do not correspond to the original words are generated and set in the translated word buffer 3e as shown in FIG. Here, as described above, the translated word that does not correspond to the original word is generated by referring to the case information stored in the dictionary memory 4a of the storage unit 4 by the attached word generation means 17, 34 pays attention to the contents of the "case name" stored in the dictionary memory 4a as shown in FIG. 34, and attaches the attached word corresponding to the contents of the "case name" to the storage unit 4 as shown in FIG. It is generated from the word table 4c. … Steps S27 and S28

【００５５】尚、上記「フレーズ訳」が指令されていない
場合には、現在ポイントされている節点「主節(２)」には
節点「接続詞(１８)」と節点「主語欠け主節(１９)」の弟節
点が存在するので、助動詞「would」の効力が及んでいな
いような連用中止形に末尾が変形された節点「主節(２)」
に属する原語単語列の訳語が決定されて訳語バッファ３
eに格納されることになる。したがって、図１４に示す
ように、当該原文中における原語単語「push」の訳語は
「入れ」となる。…ステップＳ27,Ｓ34,Ｓ35When the above "phrase translation" is not commanded, the node "connective (18)" and the node "subject lacking main node (19)" are added to the node "main node (2)" currently pointed to. ) ”Exists as a younger brother node, so the node“ main clause (2) ”whose end is transformed into a continuous discontinuation form that does not have the effect of the auxiliary verb“ would ”
The translation word of the source word string belonging to is determined and the translation word buffer 3
It will be stored in e. Therefore, as shown in FIG. 14, the translated word of the original word “push” in the original sentence is “put”. ... Steps S27, S34, S35

【００５６】そうした後、現在ポイントされている節点
「主節(２)」には、弟節点「接続詞(１８)」が存在する。そ
こで、弟節点「接続詞(１８)」がポイントされて、当該接
続詞「and」の訳語「そして」が決定されて、図１３に示す
ように訳語バッファ３eにセットされる。そして更に、
節点「主節(２)」に関する次の弟節点である節点「主語欠
け主節(１９)」がポイントされる。…ステップＳ29,Ｓ31
〜Ｓ33After that, the younger brother node "conjunction (18)" exists at the node "main node (2)" currently pointed. Therefore, the younger brother node "conjunction (18)" is pointed to, and the translation "and" of the conjunction "and" is determined and set in the translation buffer 3e as shown in FIG. And furthermore
The node “main subject missing main clause (19)”, which is the next younger brother node regarding the node “main clause (2)”, is pointed. ... Steps S29 and S31
~ S33

【００５７】以下、節点「主語欠け主節(１９)」に対応す
る変換前の構造解析木における節点は節点「主語欠け主
節(２３)」であり、その下位には節点「助動詞」は存在し
ない。したがって、節点「主語欠け主節(１９)」に属する
原語単語列「 it unlikely make」から当該節点「主語欠け主節(１９)」の訳語単語列が決定
される。但し、この場合には、下位節点に節点「助動詞」
が存在しないので助動詞バッファ３kの内容は更新され
てはいない。したがって、当該節の末尾は、「主節(２)」
の末尾と同様に、助動詞バッファ３kに格納されている
助動詞コード「９」に従って終止形「〜であろう」に変形さ
れる。その結果、当該原文における節点「主語欠け主節
(１９)」に属する各原語単語に対する訳語は「それ」「ありそうもない」「するであろう」「。」と決定される。加えて、原語単語に対応しない訳語「を」「状態に」が生成されて、図１３に示すように訳語バッファ３eに
セットされる。ここで、当該節点「主語欠け主節(１９)」
には弟節点が存在しないので、ターゲット言語生成サブ
ルーチンを終了する。…ステップＳ25,Ｓ27〜Ｓ30Hereinafter, the node in the structural analysis tree before conversion corresponding to the node "subject missing main clause (19)" is the node "subject missing main clause (23)", and the node "auxiliary verb" exists below it. do not do. Therefore, the translated word string of the node "subject missing main clause (19)" is determined from the original word string "it unlikely make" belonging to the node "subject missing main clause (19)". However, in this case, the node "auxiliary verb" is added to the lower node.
, The contents of the auxiliary verb buffer 3k have not been updated. Therefore, the end of the section is "Main Section (2)"
Similar to the end of, the verb is transformed into the final form "to be" according to the auxiliary verb code "9" stored in the auxiliary verb buffer 3k. As a result, the node “subject missing main clause” in the original text is
The translated word for each original word belonging to (19) ”is determined to be“ that ”,“ impossible ”,“ will do ”,“. ”. In addition, a translated word "a" and "in a state" that do not correspond to the original word is generated and set in the translated word buffer 3e as shown in FIG. Here, the relevant node “subject lacking subject (19)”
Since there is no younger brother node in, the target language generation subroutine ends. ... Steps S25, S27 to S30

【００５８】尚、上述のように、各節毎に得られた訳語
単語列が訳語バッファ３eにセットされるのであるが、
その際には、各訳語単語に、訳文上での単語位置情報が
付加される。この場合の単語位置情報は、フレーズ単位
ではなく１つの原文全体を単位とする単語位置情報であ
る。こうする理由は、従来の、１文単位で翻訳文を出力
するような機械翻訳装置と構成の多くを共通化すること
を可能ならしめるためである。また、同時に、原文上に
おける各原語単語の位置と対応する訳語単語の訳文上に
おける位置との対応を示す原語・訳語対応情報が、図１
５に示すように原語・訳語対応情報バッファ３fにセット
される。その際に必要な原語単語位置情報は、図７に示
すような形式で原文バッファ３aに格納されている。As described above, the translated word string obtained for each clause is set in the translated word buffer 3e.
At that time, word position information in the translated sentence is added to each translated word. The word position information in this case is not the phrase unit but the word position information in the unit of one whole original sentence. The reason for doing so is that it is possible to make many of the configurations common to the conventional machine translation device that outputs a translated sentence in units of one sentence. At the same time, the original word / translated word correspondence information indicating the correspondence between the position of each original word in the original sentence and the position of the corresponding translated word in the translated sentence is shown in FIG.
As shown in 5, it is set in the source language / translation word correspondence information buffer 3f. The original word position information necessary at that time is stored in the original buffer 3a in a format as shown in FIG.

【００５９】次に、上記変換前の構文バッファ３cに格
納されている図９に示すような英語の構造解析木に対し
て、図８に示すフレーズ分割規則が次のように適用され
てフレーズ分割位置が決定される。先ず、節点番号
“２"の節点「主節」に、・フレーズ分割規則２の「主節」の節点でまとめられてい
る部分は分離するが適用される。ここで、節点番号“２"の節点「主節」で
まとめられている部分とは「 It would push them to the end of the creditors' l
ine」であるから、当該原文が単語「line」と単語「and」との間
で分割される。次に、節点番号“２３"の節点「主語欠け
主節」に、・フレーズ分割規則２の「格要素が欠けた主節」の節点で
まとめられている部分は分離するが適用される。ここで、節点番号“２３"の節点「主語欠
け主節」でまとめられている部分とは「 make it nulikely」であるから、単語「and」と単語「make」との間で分割され
る。Next, the phrase division rule shown in FIG. 8 is applied to the English structural analysis tree shown in FIG. 9 stored in the pre-conversion syntax buffer 3c as follows. The position is determined. First, the following applies to the node "main node" of node number "2":-Separate the parts grouped by the "main node" of phrase division rule 2. Here, the part that is summarized by the node "main clause" of the node number "2" is "It would push them to the end of the creditors' l
Since it is "ine", the original sentence is divided between the word "line" and the word "and". Next, to the node "subject lacking main clause" of the node number "23":-Separate the parts gathered at the nodes of the "main clause lacking case elements" of the phrase division rule 2 are applied. Here, since the part of the node number “23” that is summarized by the node “subject main missing clause” is “make it nulikely”, it is divided between the word “and” and the word “make”.

【００６０】以上の如くフレーズ分割規則の適用を受け
た結果、当該原文は最終的に、単語「line」と単語「and」
との間、および、単語「and」と単語「the」との間との２箇
所で「 It would push them to the end of the creators' li
ne」「and」「make it unlikely」の３個のフレーズに分割されるのである。こうして分割
された各フレーズの先頭単語「It」,「and」および「make」の
当該原文上における単語位置“１",“１２"および“１
３"が、図７の原文バッファ３aを参照して得られる。そ
して、得られた各単語位置が、図１６に示すようにメイ
ンメモリ３の分割位置バッファ３gに格納される。…ス
テップＳ6As a result of applying the phrase division rule as described above, the original sentence finally has the word "line" and the word "and".
"It would push them to the end of the creators'li" in two places, between the word "and" and the word "the".
It is divided into three phrases, "ne", "and", "make it unlikely". The word positions “1”, “12” and “1” in the original sentence of the first words “It”, “and” and “make” of each phrase thus divided
3 "is obtained by referring to the original text buffer 3a of FIG. 7. Then, each obtained word position is stored in the division position buffer 3g of the main memory 3 as shown in FIG.

【００６１】そうすると、得られた各フレーズを構成す
る訳語単語の単語位置が、次のようにして求められる。
例えば、１番目のフレーズ「 It would push them to the end of the creditors' l
ine」の場合は、図１６に示すような分割位置バッファ３gに
おける「連番１」の「単語位置」の欄を参照することによっ
て、当該原文における１番目のフレーズの開始単語位置
“１"が得られる。また、「連番２」の「単語位置」の欄を
参照することによって、２番目のフレーズの開始単語位
置“１２"と、１番目のフレーズの終了単語位置“１１"
が得られる。以上のことから、当該原文の１番目のフレ
ーズは、各原語単語位置が「１,２,３,４,５,６,７,８,
９,１０,１１」である各原語単語から構成されているこ
とがわかる。次に、図１５に示すような原語・訳語対応
情報バッファ３fを参照して、１番目のフレーズを構成
する上記原語単語に対応する訳語単語の訳語単語位置
「１,１２,４,１１,１０,７,６,８」が得られる。さら
に、重複しているものを除いて昇順に並び替えると、当
該原文における１番目のフレーズの訳を構成する訳語の
訳語単語位置「１,４,６,７,８,１０,１１,１２」が得ら
れる。こうして得られた１番目のフレーズの訳を構成す
る訳語の訳語単語位置が、フレーズ毎の訳語単語位置バ
ッファ３hにおけるフレーズ番号１の「訳語単語位置」の
欄に格納される。Then, the word position of the translated word forming each obtained phrase is obtained as follows.
For example, the first phrase "It would push them to the end of the creditors' l
In the case of "ine", by referring to the "word position" column of "serial number 1" in the division position buffer 3g as shown in FIG. 16, the starting word position "1" of the first phrase in the original sentence is can get. Also, by referring to the "word position" column of "serial number 2", the start word position "12" of the second phrase and the end word position "11" of the first phrase
Is obtained. From the above, in the first phrase of the original sentence, each original word word position is "1, 2, 3, 4, 5, 6, 7, 8,
It can be seen that each of the original words is "9, 10, 11". Next, referring to the source word / translated word correspondence information buffer 3f as shown in FIG. 15, the translated word position "1, 12, 4, 11, 10 of the translated word corresponding to the original word forming the first phrase. , 7, 6, 8 ”is obtained. Furthermore, if the words are sorted in ascending order except for duplicates, the translated word position “1,4,6,7,8,10,11,12” of the translated word that constitutes the translation of the first phrase in the original sentence. Is obtained. The translation word position of the translation word that constitutes the translation of the first phrase thus obtained is stored in the "translation word position" column of phrase number 1 in the translation word position buffer 3h for each phrase.

【００６２】同様にして、当該原文の２番目のフレーズ
の訳を構成する訳語の訳語単語位置「１４」と３番目のフ
レーズの訳を構成する訳語の訳語単語位置「１６,１８,
２０」とが得られ、該当するフレーズ番号の「訳語単語位
置」の欄に格納される。…ステップＳ7Similarly, the translated word position "14" of the translated word that constitutes the translation of the second phrase of the original sentence and the translated word position of the translated word that constitutes the translation of the third phrase "16, 18,
20 "is obtained and stored in the" translated word position "column of the corresponding phrase number. … Step S7

【００６３】次に、上記訳語単語のうち、訳語単語位置
がフレーズ毎の訳語単語位置バッファ３hに格納されて
いない訳語単語、すなわち、原語単語に対応しない訳語
単語「は」,「を」,「を」および「状態に」が属するフレーズ
が定められて夫々の訳語単語位置「２」,「５」,「１７」およ
び「１９」が得られ、フレーズ毎の訳語単語位置バッファ
３hにおける該当するフレーズの「訳語単語位置」の欄に
格納される。…ステップＳ8 この結果、各フレーズ毎の訳語単語位置は、図１７に示
すように、１番目のフレーズ１,２,４,５,６,７,８,９,１０,１１,１２２番目のフレーズ１４３番目のフレーズ１６,１７,１８,１９,２０となる。Next, among the translated words, translated word positions whose translated word positions are not stored in the translated word position buffer 3h for each phrase, that is, translated word "ha", "wo", " The phrases to which "to" and "to the state" belong are determined to obtain respective translated word positions "2", "5", "17" and "19", and the corresponding phrases in the translated word position buffer 3h for each phrase are obtained. It is stored in the column of “translated word position”. As a result, the translated word position for each phrase is, as shown in FIG. 17, the first phrase 1,2,4,5,6,7,8,9,10,11,12 Phrase 14 The third phrase is 16,17,18,19,20.

【００６４】こうして得られた各フレーズ毎の訳語単語
に修正が必要な訳語単語位置を呈する訳語単語は存在し
ないので、以下のようにして、各フレーズ毎の訳が構築
される。先ず、１番目のフレーズを構成する訳語の訳語
単語位置は、図１７に示すフレーズ毎の訳語単語位置バ
ッファ３hにおけるフレーズ番号１の「訳語単語位置」の
欄によって、１,２,４,５,６,７,８,９,１０,１１,１２と得られる。次に、図１３に示す訳語バッファ３eか
ら、訳語単語位置１の訳語単語「それ」が得られ、訳語単
語位置２の訳語単語「は」が得られる。以下、順に訳語単
語「それら」,「を」,「債権者」,「の」,「ライン」,「の」,「エン
ド」,「に」および「入れるだろう」が得られる。そして、こ
うして得られた各訳語単語を訳語単語位置の昇順に繋げ
て得られた文字列「それはそれらを債権者のラインのエンドに入れるだろ
う」を、図１８に示すように、訳フレーズバッファ３iのフ
レーズ番号１の「訳フレーズ」の欄に格納する。同様にし
て、２番目のフレーズを構成する訳語単語「そして」と、
３番目のフレーズを構成する訳語単語「それ」,「を」,「あ
りそうもない」,「状態に」および「するであろう」とが得ら
れ、得られた訳語単語を訳語単語位置の昇順に繋げて得
られた文字列を訳フレーズバッファ３iにおける該等す
るフレーズ番号の「訳フレーズ」の欄に格納する。その結
果、図１８に示すように、各フレーズ毎の訳が構築され
る。…ステップＳ9,Ｓ10Since the translated word for each phrase thus obtained does not have a translated word that presents a translated word position that needs to be corrected, a translation for each phrase is constructed as follows. First, the translated word positions of the translated words that make up the first phrase are 1, 2, 4, 5, and 5 according to the "translated word position" column of phrase number 1 in the translated word position buffer 3h for each phrase shown in FIG. It is obtained as 6, 7, 8, 9, 10, 11, and 12. Next, the translated word "it" at the translated word position 1 and the translated word "ha" at the translated word position 2 are obtained from the translated word buffer 3e shown in FIG. Below, the translated words "there", "o", "creditor", "no", "line", "no", "end", "ni" and "will enter" are obtained in order. Then, as shown in FIG. 18, the translation phrase buffer is obtained by connecting the translation words thus obtained in the ascending order of the translation word positions, and obtaining the character string "it will put them at the end of the creditor's line". Stored in the "translated phrase" column of phrase number 1 of 3i. Similarly, the translated word "and" that composes the second phrase,
The translation words “it”, “o”, “impossible”, “in the state” and “will” that form the third phrase are obtained, and the obtained translation word is set to the translation word position. The character strings obtained by connecting in ascending order are stored in the "translated phrase" column of the corresponding phrase numbers in the translated phrase buffer 3i. As a result, translations for each phrase are constructed as shown in FIG. … Steps S9 and S10

【００６５】以上のようにして、当該原文の各フレーズ
に対する訳フレーズが得られると、出力形式整形部２Ｂ
によって、図１９に示すように、原文フレーズと訳文フ
レーズの頭の位置が揃えられて出力部５に表示される。
…ステップＳ11,Ｓ12When the translated phrase for each phrase of the original sentence is obtained as described above, the output format shaping unit 2B
Thus, as shown in FIG. 19, the head positions of the original sentence phrase and the translated sentence phrase are aligned and displayed on the output unit 5.
... Steps S11 and S12

【００６６】このように、本実施の形態においては、上
記翻訳モジュール２Ａのターゲット言語生成部２dに、
助動詞文訳出法決定手段１３,助動詞検出手段１４,並列
節検出手段１５および助動詞訳生成手段１８を設け、さ
らに、メインメモリ３に、助動詞検出手段１４によって
各節毎に検出された助動詞の助動詞コードを格納する助
動詞バッファ３kを設けている。そして、入力部１に助
動詞を含む並列節が存在する原文が入力された場合に
は、ターゲット言語生成部２dによって、並列節を成す
夫々の節の訳の末尾が上記助動詞に応じた終止形に変形
されたターゲット言語生成を行うようにしている。した
がって、助動詞を含む並列節が存在する原文「It would
push them to the end of the creditors' lineand ma
ke it unlikely.」に対するフレーズ訳の結果が It would push them to the end of the creditors' line and それがそれらを債権者のラインのエンドに入れるだろうそして make it unlikely. それをありそうもない状態にするであろうの如く得られ、１番目のフレーズの訳に対しても助動詞
「would」の効果が「入れるだろう」と現れて、１番目のフ
レーズの訳文として助動詞「would」の効果が入らないこ
とによる不自然さが解消されるのである。As described above, in the present embodiment, the target language generator 2d of the translation module 2A is
Auxiliary verb sentence translation determining means 13, auxiliary verb detecting means 14, parallel clause detecting means 15 and auxiliary verb translation generating means 18 are provided, and further, auxiliary verb codes of auxiliary verbs detected by the auxiliary verb detecting means 14 in the main memory 3 for each clause. An auxiliary verb buffer 3k for storing is provided. When an original sentence in which a parallel clause including an auxiliary verb exists is input to the input unit 1, the target language generation unit 2d causes the end of the translation of each clause forming the parallel clause to be an end form corresponding to the auxiliary verb. The modified target language is generated. Therefore, the original sentence "It would have a parallel clause containing an auxiliary verb."
push them to the end of the creditors' lineand ma
The result of the phrase translation for "ke it unlikely." It would push them to the end of the creditors' line and that would put them at the end of the creditor's line and make it unlikely. The effect of the auxiliary verb "would" appears on the translation of the first phrase as "I will put it in ", and the effect of the auxiliary verb "would" appears as the translation of the first phrase. The unnaturalness of not entering is eliminated.

【００６７】次に、ある節が他の節から丸ごと抜けたよ
うに分離した構造を有する原文「Keating is the subje
ct of a class-action lawsuit allegingthey misled i
nvestors about the safety of junk bonds.」が、入力
部１から入力された場合を例に、図４および図５に示す
フレーズ単位翻訳処理動作について具体的に説明する。Next, the original sentence "Keating is the subje" has a structure in which one clause is separated from the other clause.
ct of a class-action lawsuit allegingthey misled i
Investors about the safety of junk bonds. "is input from the input unit 1 as an example, the phrase-based translation processing operation shown in FIGS. 4 and 5 will be specifically described.

【００６８】上述の原文の場合と同様にして、辞書引き
・形態素解析処理,構文解析処理および構文変換処理が行
われる。その結果、英語による構造解析木が図２０に示
すように得られる。…ステップＳ1〜Ｓ4 次に、当該原文に並列節は存在しないので、変換後の日
本語による構造解析木における原語単語の配列順に訳語
単語が配列されて、当該原文の各単語の訳語が決定され
る。そして、この決定された訳語が、図２１に示すよう
に訳語バッファ３eに格納される。また、原語・訳語対応
情報バッファ３fに原語・訳語対応情報が図２２に示すよ
うに格納される。…ステップＳ5Similar to the case of the above-mentioned original sentence, the dictionary lookup / morphological analysis process, the syntactic analysis process and the syntactic conversion process are performed. As a result, a structural analysis tree in English is obtained as shown in FIG. Steps S1 to S4 Next, since there are no parallel clauses in the original sentence, the translated words are arranged in the order of arrangement of the original words in the converted Japanese structural analysis tree, and the translated words of each word of the original sentence are determined. It Then, the determined translated word is stored in the translated word buffer 3e as shown in FIG. Further, the original language / translated word correspondence information is stored in the original language / translated word correspondence information buffer 3f as shown in FIG. … Step S5

【００６９】次に、図２０に示すような変換前の構造解
析木における節点「主節(１６)」に対して、図８に示すフ
レーズ分割規則のうち・フレーズ分割規則２の「主節」の節点でまとめられてい
る部分は分離するが適用される。ここで、節点番号“１６"の節点「主節」
でまとめられている部分とは「 they misled investors about the safety of junk bo
nds」であるから、当該原文が単語「alleging」と単語「they」と
の間で分割される。そして、こうして分割された各フレ
ーズの先頭単語の当該原文上における単語位置“１"お
よび“１０"が、フレーズ分割箇所決定の際に適用され
たフレーズ分割規則の番号と共に、図２３に示すように
分割位置バッファ３gに格納される。…ステップＳ6Next, for the node “main clause (16)” in the structure analysis tree before conversion as shown in FIG. 20, among the phrase division rules shown in FIG. The parts that are grouped at the nodes of are separated. Here, the node “main node” with node number “16”
The part summarized in is `` they misled investors about the safety of junk bo
Since it is "nds", the original sentence is divided between the word "alleging" and the word "they". Then, the word positions "1" and "10" in the original sentence of the first word of each of the phrases thus divided are shown in FIG. 23 together with the numbers of the phrase division rules applied when determining the phrase division points. It is stored in the division position buffer 3g. … Step S6

【００７０】次に、図２３に示すような分割位置バッフ
ァ３gにおける「単語位置」の欄および図２２に示すよう
な原語・訳語対応情報バッファ３fが参照されて、当該原
文における各フレーズを構成する訳語単語の訳語単語位
置が得られる。また、訳語単語位置がフレーズ毎の訳語
単語位置バッファ３hに格納されていない訳語単語の訳
語単語位置が決定される。こうして、各フレーズ毎の訳
語単語位置が図２４に示すように決定されて、フレーズ
毎の訳語単語位置バッファ３hに格納される。…ステッ
プＳ7,Ｓ8Next, the "word position" column in the division position buffer 3g as shown in FIG. 23 and the original word / translation word correspondence information buffer 3f as shown in FIG. 22 are referred to to compose each phrase in the original sentence. The translated word position of the translated word is obtained. Further, the translated word position of the translated word whose translated word position is not stored in the translated word position buffer 3h for each phrase is determined. In this way, the translated word position for each phrase is determined as shown in FIG. 24 and stored in the translated word position buffer 3h for each phrase. … Steps S7 and S8

【００７１】次に、上記訳文フレーズ構築部２fにおけ
る中抜けフレーズ検出手段２１,生成位置修正手段２３
および補助記号生成手段２４によって、記憶部４のその
他の翻訳規則メモリ４eに記憶されている修正規則に従
って、各訳語単語位置に対して修正が必要かの判定と修
正操作が以下のようにして行われる。その際における上
記修正規則は、（I）下記の判定規則(１),(２),(３)を総て満たす場合
には、記号「〜」とフレーズｂの末尾の訳語単語とをフレ
ーズａの不連続箇所に生成する。（１）あるフレーズａを形成する訳語単語の訳語単語位
置に不連続な箇所があり、且つ、他のフレーズｂを形成
する訳語単語の訳語単語位置の総てが、フレーズａ内に
おける１つの不連続な箇所に入る。（２）フレーズｂの分割時に適用されたフレーズ分割規
則は、規則番号２である。（３）フレーズｂの末尾の訳語単語に対応する原語単語
は存在しない。である。Next, the blank phrase detection means 21 and the generation position correction means 23 in the translated phrase construction section 2f.
According to the correction rule stored in the other translation rule memory 4e of the storage unit 4, the auxiliary symbol generation means 24 determines whether or not each translated word position needs to be corrected and performs the correction operation as follows. Be seen. In this case, the above correction rule is: (I) If all of the following determination rules (1), (2), and (3) are satisfied, the symbol "~" and the translated word at the end of the phrase b are converted into the phrase a. It is generated at the discontinuous part of. (1) There is a discontinuity in the translated word position of the translated word that forms a certain phrase a, and all the translated word positions of the translated word that form another phrase b are one non-constant in the phrase a. Enter a continuous area. (2) The phrase division rule applied when dividing phrase b is rule number 2. (3) There is no source word corresponding to the translated word at the end of phrase b. It is.

【００７２】先ず、上記中抜けフレーズ検出手段２１に
よって、図２４に示すフレーズ毎の訳語対応情報バッフ
ァ３hに格納された１番目のフレーズにおいて、各訳語
単語位置はソート後の訳語単語位置であるにも拘わらず
訳語単語位置「２」の次に訳語単語位置「１５」が位置して
いるので、この箇所が不連続な箇所であると検出され
る。且つ、この不連続箇所に、２番目のフレーズに属す
る総ての訳語単語(訳語単語位置４〜１４)が入ることが
検知される。したがって、判定規則(１)「あるフレーズａを形成する訳語単語の訳語単語位置に
不連続な箇所があり、且つ、別のフレーズｂを形成する
訳語単語の訳語単語位置の総てが、フレーズａ内におけ
る１つの不連続な箇所内に入る」が満たされると判定される。また、図２３に示す分割位
置バッファ３gの内容が参照されて、２番目のフレーズ
に関するフレーズ分割の際の適用ルール番号は“２"で
ある。したがって、判定規則(２)「フレーズｂの分割時に適用されたフレーズ分割規則は
規則番号２である」が満たされると判定される。また、２番目のフレーズの
末尾に在る訳語単語の訳語単語位置は“１４"である
が、図２２に示す原語・訳語対応情報バッファ３fを参照
すると、訳語単語位置１４の訳語に対応している原語単
語は存在しないことが分かる。したがって、判定規則
(３)「フレーズｂの末尾の訳語単語に対応する原語単語は存
在しない」が満たされると判定される。すなわち、当該原文は、判
定規則(１),(２)および(３)の総てを満たすのでフレー
ズ毎の訳語単語位置バッファ３hの内容に対して修正が
必要であると判定されるのである。First, in the first phrase stored in the translated word correspondence information buffer 3h for each phrase shown in FIG. 24 by the above-mentioned blank phrase detection means 21, each translated word position is a translated word position after sorting. Nevertheless, since the translated word position "15" is located next to the translated word position "2", this part is detected as a discontinuous part. In addition, it is detected that all the translated words (translated word positions 4 to 14) belonging to the second phrase enter this discontinuous portion. Therefore, in the determination rule (1), there is a discontinuous part in the translated word position of the translated word forming one phrase a, and all the translated word positions of the translated word forming another phrase b are the phrase a. “Enter within one discontinuous point in” is satisfied. Further, the content of the division position buffer 3g shown in FIG. 23 is referred to, and the applied rule number at the time of phrase division regarding the second phrase is "2". Therefore, it is determined that the determination rule (2) "The phrase division rule applied when dividing phrase b is rule number 2" is satisfied. The translated word position of the translated word at the end of the second phrase is "14". However, referring to the source word / translated word correspondence information buffer 3f shown in FIG. It can be seen that there is no source word that exists. Therefore, the decision rule
(3) It is determined that "the original word corresponding to the translated word at the end of phrase b does not exist" is satisfied. That is, since the original sentence satisfies all the determination rules (1), (2) and (3), it is determined that the content of the translated word position buffer 3h for each phrase needs to be corrected.

【００７３】そうすると、上記生成位置修正手段２３に
よって、修正規則(I)に従って、２番目のフレーズの末
尾に位置する訳語単語位置１４の訳語単語「と」が、１番
目のフレーズにおける不連続箇所である訳語単語位置１
５の訳語単語「主張する」の前に移動される。つまり、本
実施の形態においては、請求項４における付属語生成手
段をターゲット言語生成部２dの付属語生成手段１７と
訳文フレーズ構築部２fの生成位置修正手段２３で実現
するのである。さらに、訳文フレーズ構築部２fの補助
記号生成手段２４によって、１番目のフレーズの不連続
箇所における訳語単語「と」の前に記号「〜」が生成され
る。この結果、上記フレーズ毎の訳語単語位置バッファ
３hの内容は、図２５に示すように変化する。ここで、
訳語単語位置“−１００"とは訳語単語「〜」の訳語単語
位置であることを意味する。また、訳語単語位置の値が
負の数である場合には、訳語バッファ３eに格納されて
いる訳語単語を使用するのではなく、記憶部４の補助訳
語テーブルメモリ４dに格納されている訳語を使用する
ことを意味している。本例の場合は、図２６に示すよう
な補助訳語テーブルメモリ４dの補助訳語テーブルを参
照して、コードが“−１００"である訳語(記号)「〜」を
得るのである。…ステップＳ9Then, according to the correction rule (I), the translated word "to" at the translated word position 14 located at the end of the second phrase is detected by the generation position correcting means 23 at a discontinuous portion in the first phrase. Certain translated word position 1
Moved in front of the 5 translation word “claim”. That is, in the present embodiment, the adjunct word generating means in claim 4 is realized by the adjunct word generating means 17 of the target language generating section 2d and the generation position correcting means 23 of the translated sentence phrase constructing section 2f. Further, the auxiliary symbol generation means 24 of the translated phrase construction unit 2f generates the symbol "~" before the translated word "to" in the discontinuous portion of the first phrase. As a result, the content of the translated word position buffer 3h for each phrase changes as shown in FIG. here,
The translated word position “−100” means the translated word position of the translated word “˜”. When the value of the translated word position is a negative number, the translated word stored in the translated word table memory 4d of the storage unit 4 is not used instead of using the translated word stored in the translated word buffer 3e. Is meant to be used. In the case of this example, the translated word (symbol) "..." With the code "-100" is obtained by referring to the auxiliary translated word table of the auxiliary translated word table memory 4d as shown in FIG. … Step S9

【００７４】そして、上記フレーズ毎の訳語単語位置バ
ッファ３hの内容に基づいて、訳語バッファ３eと補助訳
語テーブルを参照して、各フレーズ毎の訳が構築され
て、図２７に示すように訳フレーズバッファ３iに格納
される。そして、原文フレーズと訳文フレーズの頭の位
置が揃えられて、図２８に示すように表示される。…ス
テップＳ10〜Ｓ12Then, based on the contents of the translated word position buffer 3h for each phrase, a translation for each phrase is constructed by referring to the translated word buffer 3e and the auxiliary translation table, and as shown in FIG. It is stored in the buffer 3i. Then, the head positions of the original sentence phrase and the translated sentence phrase are aligned and displayed as shown in FIG. ... Steps S10 to S12

【００７５】このように、本実施の形態においては、上
記記憶部４のその他の翻訳規則メモリ４eに訳語単語位
置の修正規則(I)を記憶しておく。そして、訳文フレー
ズ構築部２fにおける中抜けフレーズ検出手段２１によ
って、判定規則(１),(２)および(３)の総てを満たすの
でフレーズ毎の訳語単語位置バッファ３hの内容に対し
て修正が必要であると判定されると、訳文フレーズ構築
部２fの生成位置修正手段２３および補助記号生成手段
２４によって、修正規則(I)に従って、２番目のフレー
ズの末尾に位置する訳語単語である訳語単語位置１４の
訳語単語「と」の前に記号「〜」が付加されて、１番目のフ
レーズにおける不連続箇所である訳語単語位置１５の訳
語単語「主張する」の前に移動される。As described above, in the present embodiment, the translation rule word position correction rule (I) is stored in the other translation rule memory 4e of the storage unit 4. Then, all of the judgment rules (1), (2) and (3) are satisfied by the hollow phrase detecting means 21 in the translated phrase construction unit 2f, so that the contents of the translated word position buffer 3h for each phrase can be corrected. When it is determined that the translation word is necessary, the translation position, which is the translation word located at the end of the second phrase, is corrected by the generation position correction means 23 and the auxiliary symbol generation means 24 of the translation phrase construction unit 2f according to the correction rule (I). The translation word "to" at position 14 is preceded by the symbol "~" and moved to the translation word "assertion" at translation word position 15 which is a discontinuity in the first phrase.

【００７６】したがって、本実施の形態によれば、ある
節が他の節から丸ごと抜けたように分離した構造を有す
る原文「Keating is the subject of a class-action l
awsuit allegingthey misled investors about the saf
ety of junk bonds.」のフレーズ訳の結果が Keating is the subject of a class-action lawsuit alleging キーティングは〜と主張する集団訴訟の主体である they misled investors about the safety of junk bonds. それらがジャンク・ボンドの安全性について投資家を誤らせたの如く得られ、１番目のフレーズにおける「〜と」の箇所
から、その文の格要素である２番目のフレーズが丸ごと
抜けるように分離した文の構造を有していることが分か
り易く、且つ、夫々の訳文フレーズが読み易くなる。Therefore, according to the present embodiment, the original sentence “Keating is the subject of a class-action l” having a structure in which a certain clause is separated from the other clause so as to be completely omitted.
awsuit allegingthey misled investors about the saf
The result of the phrase translation of `` ety of junk bonds. '' is Keating is the subject of a class-action lawsuit alleging they are misled investors about the safety of junk bonds. The structure of the sentence, which was obtained as if the investor was mistaken about the bond safety, and the second phrase, which is a case element of the sentence, was completely separated from the place of "to" in the first phrase. It is easy to understand that each of the translated phrases has a word and each translated phrase is easy to read.

【００７７】次に、分離すると格要素が欠落してしまう
関係節を有する原文「The plan was opposed by some o
f top executiveswho invested in the company's secu
rities.」が、入力部１から入力された場合を例に、図
４および図５に示すフレーズ単位翻訳処理動作について
具体的に説明する。Next, the original sentence "The plan was opposed by some o" having a relational clause in which case elements are missing when separated.
f top executiveswho invested in the company's secu
"Rities." is input from the input unit 1 as an example, and the phrase-based translation processing operation shown in FIGS. 4 and 5 will be specifically described.

【００７８】上述の場合と同様にして、辞書引き・形態
素解析処理,構文解析処理および構文変換処理が行われ
る。その結果、英語による構造解析木が図２９に示すよ
うに得られる。…ステップＳ1〜Ｓ4 次に、当該原文に並列節は存在しないので、変換後の日
本語による構造解析木における原語単語の配列順に訳語
単語が配列されて、当該原文の各単語の訳語が決定され
る。そして、決定された訳語が、図３０に示すように訳
語バッファ３eに格納される。また、原語・訳語対応情報
バッファ３fに原語・訳語対応情報が図３１に示すように
格納される。…ステップＳ5Similar to the above case, the dictionary lookup / morpheme analysis processing, the syntax analysis processing and the syntax conversion processing are performed. As a result, a structural analysis tree in English is obtained as shown in FIG. Steps S1 to S4 Next, since there are no parallel clauses in the original sentence, the translated words are arranged in the order of arrangement of the original words in the converted Japanese structural analysis tree, and the translated words of each word of the original sentence are determined. It Then, the determined translated word is stored in the translated word buffer 3e as shown in FIG. Further, the original language / translated word correspondence information buffer 3f stores the original language / translated word correspondence information as shown in FIG. … Step S5

【００７９】次に、図２９に示す変換前の構造解析木に
おける節点「関係代名詞(１５)」および節点「主語欠け主
節(１６)」に対して、図８に示すフレーズ分割規則のう
ち・フレーズ分割規則５の「格要素が欠けた主節」の節点のすぐ左隣に「関係詞」の接
点がある場合には、その「関係詞」の接点と「格要素が欠
けた主節」の接点とをまとめて一つのフレーズとして分
離するが適用される。ここで、節点番号“１５"の節点「関係代
名詞」と節点番号“１６"の節点「主語欠け主節」とをまと
めた部分とは、「who invested in the company'securi
ties」であるから、当該原文が単語「executives」と単語
「who」との間で分割される。そして、このようにして分
割された各フレーズの先頭単語の当該原文上における単
語位置“１"および“１０"が、フレーズ分割箇所決定の
際に適用されたフレーズ分割規則の番号と共に、図３２
に示すように、分割位置バッファ３gに格納される。…
ステップＳ6Next, among the phrase division rules shown in FIG. 8, for the node “relative pronoun (15)” and the node “subject missing main clause (16)” in the structure analysis tree before conversion shown in FIG. If there is a "relative" contact point to the immediate left of the node of "main clause lacking case element" in Phrase Division Rule 5, then the contact point of "relative clause" and "main clause lacking case element" It is applied that the contact points of and are separated as one phrase. Here, the part in which the node “relative pronoun” with the node number “15” and the node “subject lacking main node” with the node number “16” are summarized is “who invested in the company'securi”.
Since it is "ties", the original sentence is divided between the word "executives" and the word "who". Then, the word positions "1" and "10" in the original sentence of the first word of each of the phrases divided in this way, together with the number of the phrase division rule applied at the time of determining the phrase division portion, are shown in FIG.
As shown in FIG. …
Step S6

【００８０】次に、図３２に示すような分割位置バッフ
ァ３gにおける「単語位置」の欄および図３１に示すよう
な原語・訳語対応情報バッファ３fが参照されて、当該原
文における各フレーズを構成する訳語単語の訳語単語位
置が得られる。また、訳語単語位置がフレーズ毎の訳語
単語位置バッファ３hに格納されていない訳語単語の訳
語単語位置が決定される。こうして、各フレーズ毎の訳
語単語位置が図３３に示すように決定されて、フレーズ
毎の訳語単語位置バッファ３hに格納される。…ステッ
プＳ7,Ｓ8Next, the "word position" column in the division position buffer 3g as shown in FIG. 32 and the original / translated word correspondence information buffer 3f as shown in FIG. 31 are referred to and each phrase in the original sentence is constructed. The translated word position of the translated word is obtained. Further, the translated word position of the translated word whose translated word position is not stored in the translated word position buffer 3h for each phrase is determined. Thus, the translated word position for each phrase is determined as shown in FIG. 33 and stored in the translated word position buffer 3h for each phrase. … Steps S7 and S8

【００８１】次に、上記訳文フレーズ構築部２fにおけ
る関係節検出手段２２,生成位置修正手段２３,補助修飾
語生成手段２５,先行詞取得手段２６および後置語生成
手段２７によって、記憶部４のその他の翻訳規則メモリ
４eに記憶されている修正規則に従って、各訳語単語位
置に対して修正が必要かの判定と修正操作が以下のよう
にして行われる。その際における上記修正規則は（II）下記の判定規則(４),(５)を総て満たす場合に
は、フレーズａ内の関係詞の先行詞にあたる語を直前の
フレーズ内から取得し、その語に適切な修飾語を付け
て、フレーズａの欠落格の位置に生成する。（４）あるフレーズａの分割時に適用されたルール分割
規則は規則番号５である。（５）フレーズａ内に格要素が欠落した「主節」が存在す
る。である。Next, the relative clause detecting means 22, the generation position correcting means 23, the auxiliary modifier generating means 25, the antecedent obtaining means 26 and the postposition generating means 27 in the translated phrase constructing section 2f are used to store the data in the storage section 4. According to the correction rule stored in the other translation rule memory 4e, it is determined whether each translated word position needs to be corrected and the correction operation is performed as follows. The correction rule in that case is (II) if all the following judgment rules (4) and (5) are satisfied, the word corresponding to the antecedent of the relative phrase in phrase a is acquired from the immediately preceding phrase, and An appropriate modifier is added to the word, and the word is generated in the position of the missing case of the phrase a. (4) The rule division rule applied when dividing a phrase a is rule number 5. (5) There is a “main clause” in which the case element is missing in phrase a. It is.

【００８２】先ず、上記関係節検出手段２２によって、
図３２に示す分割位置バッファ３gの内容が参照され
て、２番目のフレーズに関するフレーズ分割の際の適用
ルール番号は“５"であるから、判定規則(４)「あるフレーズａの分割時に適用されたルール分割規則
は規則番号５である」が満たされると判定される。また、図２９に示す変換前
の構造解析木を参照して、２番目のフレーズには格要素
が欠落した主節である節点「主語欠け主節(１６)」が存在
することが分かる。したがって、判定規則(５)「フレーズａ内に格要素が欠落した「主節」が存在する」が満たされると判定される。すなわち、当該原文は、判
定規則(４)および(５)の総てを満たすのでフレーズ毎の
訳語単語位置バッファ３hの内容に対して修正が必要で
あると判定されるのである。First, by the relational clause detecting means 22,
The content of the division position buffer 3g shown in FIG. 32 is referred to, and the application rule number for phrase division regarding the second phrase is "5". Therefore, the determination rule (4) "Apply when division of certain phrase a is performed." The rule division rule is rule number 5, ”is determined to be satisfied. Also, referring to the structure analysis tree before conversion shown in FIG. 29, it can be seen that the second phrase has the node “subject-missing main clause (16)” which is the main clause lacking case elements. Therefore, it is determined that the determination rule (5) "there is a" main clause "in which the case element is missing in phrase a""is satisfied. That is, since the original sentence satisfies all the determination rules (4) and (5), it is determined that the content of the translated word position buffer 3h for each phrase needs to be corrected.

【００８３】そうすると、上記先行詞取得手段２６によ
って、上記修正規則(II)に従って、２番目のフレーズ内
にある関係代名詞「who」の先行詞が１番目のフレーズ内
から探されて、名詞「executives」が目的の先行詞である
と決定される。また、この名詞「executives」は当該原文
において原語単語位置が“９"の単語であるから、図３
１に示す原語・訳語対応情報バッファ３fおよび図３０に
示す訳語バッファ３eを参照して、先行詞である単語「ex
ecutives」に対応する訳語単語として訳語単語位置が
“１１"である「幹部」が得られる。そして、上記訳文フ
レーズ構築部２fの生成位置修正手段２３によって、図
３３に示すフレーズ毎の訳語単語位置バッファ３hにお
けるフレーズ番号２に属する訳語単語の中に、訳語単語
番号１１の訳語単語「幹部」が加えられるのである。Then, the antecedent acquisition unit 26 searches the antecedent of the relative pronoun "who" in the second phrase from the first phrase according to the correction rule (II), and the noun "executives" is searched. Is determined to be the target antecedent. Further, since this noun “executives” is a word whose original word position is “9” in the original sentence,
The source word / translation word correspondence information buffer 3f shown in FIG. 1 and the translation word buffer 3e shown in FIG.
As the translated word corresponding to "ecutives", the "executive" whose translated word position is "11" is obtained. Then, by the generation position correcting means 23 of the translated phrase construction unit 2f, the translated word "stem" of the translated word number 11 is included in the translated word belonging to the phrase number 2 in the translated word position buffer 3h for each phrase shown in FIG. Is added.

【００８４】その際に、２番目のフレーズ内における訳
語単語「幹部」の生成位置を決定する必要があるが、これ
は格文法的な観点に基づいて、通常の訳語生成の場合と
同様にターゲット言語生成部２dによって決定される。
ここで、本実施の形態においては、文の格要素の生成位
置は動詞の辞書に記述されているものとする。そして、
当該原文の場合は、２番目のフレーズを形成する節の中
心となる動詞は「invested」であり、当該節の構文は主語＋動詞＋前置動詞という形が変形(主語が欠落)したものである。このこと
は、図２９に示すように構文解析部２bによって既に決
定されている。また、単語「invest」の訳語も、この構文
解析処理結果に基づいて決定されている。At this time, it is necessary to determine the generation position of the translated word "stem" in the second phrase. This is based on the case grammar and is the same as in the case of the normal translated word generation. It is determined by the language generator 2d.
Here, in the present embodiment, the generation position of the case element of the sentence is described in the verb dictionary. And
In the case of the original sentence, the central verb of the clause forming the second phrase is “invested”, and the syntax of the clause is a modification of the form of subject + verb + prefix verb (missing subject). is there. This has already been determined by the syntax analysis unit 2b as shown in FIG. In addition, the translated word of the word “invest” is also determined based on the result of the syntax analysis processing.

【００８５】図３４は、上記記憶部４における辞書デー
タメモリ４aに格納されている原語単語「invest」におけ
る当該原文において選択された訳語「投資する」に関する
データの概念図である。図３４における「第１格」は当該
原文における格のうちの「主語」であり、図３４における
「第２格」は当該原文における格のうちの「前置詞句」であ
る。そして、日本語生成時における第１格の標準生成位
置は、「１番目」と指定されているものとする。そうする
と、上記先行詞取得手段２６によって取得された訳語単
語「幹部」は、原語単語「invested」に対して「主語」である
から、生成位置修正手段２２は訳語単語「幹部」を２番目
のフレーズ内の最初の位置に関係代名詞「who」の訳語と
して生成するのである。つまり、本実施の形態において
は、請求項６における関係詞訳生成手段を生成位置修正
手段２３で実現するのである。FIG. 34 is a conceptual diagram of data relating to the translated word "invest" selected in the original sentence of the original word "invest" stored in the dictionary data memory 4a in the storage unit 4. The “first case” in FIG. 34 is the “subject” of the case in the original sentence, and the “second case” in FIG. 34 is the “preposition phrase” of the case in the original sentence. Then, it is assumed that the first case standard generation position at the time of generating Japanese is designated as "first". Then, since the translated word “stem” acquired by the antecedent acquisition unit 26 is the “subject” with respect to the original word “invested”, the generation position correction unit 22 sets the translated word “stem” to the second phrase. It is generated as a translation of the relative pronoun "who" at the first position in. That is, in the present embodiment, the relative position translation generation means in claim 6 is realized by the generation position correction means 23.

【００８６】また、日本語の格要素には格助詞などの後
置語がある。この後置語を生成するための情報も辞書デ
ータメモリ４aには格納されている。したがって、後置
語生成手段２７によって、図３４に示す辞書データメモ
リ４aから動詞「invest」の第１格の格名「動作主体」を
得、図３５に示すように記憶部４の後置語テーブルメモ
リ４cに格納されている後置語生成情報を参照して、格
名が「動作主体」であってその格を含む節（２番目のフレ
ーズを形成する節）が主節であるので、訳語単語「幹部」
の後置語として「は」が生成される。さらに、上記訳文フ
レーズ構築部２fの補助修飾語生成手段２５によって、
次のようにして訳語単語「幹部」の修飾語が決定される。
すなわち、訳語単語「幹部」は名詞であるから体言であ
る。したがって、記憶部４の補助訳語テーブルメモリ４
dに図２６に示すように格納されている情報を参照する
ことによって体言の修飾語として定められている「その」
が選ばれ、その生成位置が訳語単語「幹部」の直前と設定
される。Further, Japanese case elements include postpositional words such as case particles. Information for generating this postword is also stored in the dictionary data memory 4a. Therefore, the postpositional word generation means 27 obtains the first case name "action subject" of the verb "invest" from the dictionary data memory 4a shown in FIG. 34, and as shown in FIG. With reference to the postword generation information stored in the table memory 4c, since the case name is the "acting subject" and the section including the case (section forming the second phrase) is the main section, Translated word "Executive"
“Ha” is generated as a postfix. Further, by the auxiliary modifier generation means 25 of the translated phrase construction unit 2f,
The modifier of the translated word "executive" is determined as follows.
That is, the translated word “executive” is a noun because it is a noun. Therefore, the auxiliary translation table memory 4 of the storage unit 4
As shown in FIG. 26 in d, "the" which is defined as a modifier of the word by referring to the stored information
Is selected, and its generation position is set immediately before the translation word “executive”.

【００８７】その結果、上記フレーズ毎の訳語単語位置
バッファ３hの内容は、図３６に示すようになる。尚、
訳語単語位置の値が負の数である場合は、訳語バッファ
３eに格納されている訳語単語を使用するのでなく、記
憶部４の後置語テーブルメモリ４c(図３５)と補助訳語
テーブルメモリ４d(図２６)を参照して、コード“−１
０１"は訳語単語「その」を生成し、コード“−１"は訳語
単語「は」を生成すべきことを意味している。…ステップ
Ｓ9As a result, the contents of the translated word position buffer 3h for each phrase are as shown in FIG. still,
When the value of the translated word position is a negative number, the translated word stored in the translated word buffer 3e is not used, but the postword table memory 4c (FIG. 35) and the auxiliary translated word table memory 4d of the storage unit 4 are used. Referring to FIG. 26, the code "-1"
01 "means that the translated word" that "should be generated, and the code" -1 "means that the translated word" ha "should be generated. … Step S9

【００８８】そして、上記フレーズ毎の訳語単語位置バ
ッファ３hの内容に基づいて、訳語バッファ３e(図３０)
と後置語テーブルメモリ４c(図３５)と補助訳語テーブ
ルメモリ４d(図２６)を参照して、各フレーズ毎の訳が
構築され、図３７に示すように訳フレーズバッファ３i
に格納される。そして、原文フレーズと訳文フレーズの
頭の位置が揃えられて、図３８に示すように表示され
る。…ステップＳ10〜Ｓ12Then, based on the contents of the translated word position buffer 3h for each phrase, the translated word buffer 3e (FIG. 30).
With reference to the postword table memory 4c (FIG. 35) and the auxiliary translation table memory 4d (FIG. 26), the translation for each phrase is constructed, and as shown in FIG.
Stored in. Then, the positions of the heads of the original sentence phrase and the translated sentence phrase are aligned and displayed as shown in FIG. ... Steps S10 to S12

【００８９】このように、本実施の形態においては、上
記記憶部４のその他の翻訳規則メモリ４eに訳語単語位
置の修正規則(II)を記憶しておく。そして、訳文フレー
ズ構築部２fにおける関係節検出手段２２によって、判
定規則(４)及び(５)の総てを満たすのでフレーズ毎の訳
語単語位置バッファ３hの内容に対して修正が必要であ
ると判定されると、訳文フレーズ構築部２fの先行詞取
得手段２６によって、修正規則(II)に従って、２番目の
フレーズ内にある関係代名詞「who」の先行詞である単語
「executives」とその訳語単語「幹部」を得る。そして、上
記訳文フレーズ構築部２fの生成位置修正手段２３によ
ってフレーズ番号２に属する訳語単語の中に訳語単語
「幹部」を生成し、さらに、補助修飾語生成手段２５およ
び後置語生成手段２７によって体言の修飾語「その」と後
置語「は」とを生成して関係詞訳語である訳語単語「幹部」
に付加するようにしている。As described above, in the present embodiment, the translation rule position correction rule (II) is stored in the other translation rule memory 4e of the storage unit 4. Then, the relational clause detecting means 22 in the translated sentence phrase construction unit 2f determines that the contents of the translated word position buffer 3h for each phrase need to be corrected because all the determination rules (4) and (5) are satisfied. Then, the antecedent acquisition unit 26 of the translated phrase constructing unit 2f follows the correction rule (II), and the word "executives" which is the antecedent of the relative pronoun "who" in the second phrase and its translated word " Get executives. Then, the translated word “stem” is generated in the translated word belonging to the phrase number 2 by the generation position correcting means 23 of the translated sentence phrase constructing section 2f, and further, by the auxiliary modifier word generating means 25 and the postfix word generating means 27. Translated word "Executive" which is a relative translation by generating a modifier "no" and a postfix "ha"
It is added to.

【００９０】したがって、本実施の形態によれば、分離
すると格要素が欠落してしまう関係節を有する原文「Th
e plan was opposed by some of top executiveswho in
vested in the company's securities.」のフレーズ訳
の結果が The plan was opposed by some of top executives 計画はいくらかのトップ幹部によって反対された who invested in the company's securities. その幹部は会社の有価証券に投資したの如く得られ、２番目のフレーズに対して欠落した格要
素「その幹部は」が補われて、関係節である２番目のフレ
ーズを単独で読んでも非常に分かり易くなっている。Therefore, according to the present embodiment, the original sentence "Th
e plan was opposed by some of top executiveswho in
The result of the phrase "vested in the company's securities." was the plan by opposed of some of top executives who was invested in the company's securities. As described above, the missing case element "the cadre is" for the second phrase is supplemented, and it is very easy to understand even if the second phrase, which is a relative clause, is read alone.

【００９１】尚、本実施の形態におけるフレーズ単位翻
訳処理動作のアルゴリズムは、図４および図５に示すフ
ローチャートに限定されるものではない。The algorithm for the phrase-based translation processing operation in this embodiment is not limited to the flowcharts shown in FIGS. 4 and 5.

【００９２】[0092]

【発明の効果】以上より明らかなように、請求項１に係
る発明の機械翻訳装置におけるターゲット言語生成部
は、入力文中に特定の格のみを共有する並列節が在る場
合には、当該並列節を構成する各節に関して、末尾の単
語が当該並列節から検出された助動詞に応じた終止形に
変換された訳語列を生成するので、上記特定の格のみを
共有する助動詞を含む並列節を成す節の何れに対しても
当該助動詞に基づく様相を反映させた訳語列を生成する
ことができる。As is apparent from the above, the target language generation unit in the machine translation device according to the first aspect of the present invention, when there is a parallel clause that shares only a specific case in the input sentence, For each of the clauses that make up the clause, a translated word string in which the last word is converted to the final form according to the auxiliary verb detected from the parallel clause is generated, so a parallel clause containing an auxiliary verb that shares only the above specific case is generated. It is possible to generate a translated word string that reflects the aspect based on the auxiliary verb for any of the clauses to be formed.

【００９３】したがって、助動詞を含む並列節が存在す
る原文「It would push them to the end of the credi
tors' lineand make it unlikely.」に対するフレーズ
訳の結果が「It would push them to the end of the creditors' line and それがそれらを債権者のラインのエンドに入れるだろうそして make it unlikely. それをありそうもない状態にするであろう」の如く得られ、１番目のフレーズの訳に対しても助動詞
「would」の効果が「入れるだろう」と現れる。すなわち、
この発明によれば、助動詞を含む並列節を成す各節の訳
文に上記助動詞の効果が現れないことによる不自然さを
解消できるのである。Therefore, the original sentence "It would push them to the end of the credi" in which a parallel clause containing an auxiliary verb exists.
The result of the phrase translation for'tors' lineand make it unlikely. 'is that it would push them to the end of the creditors' line and that would put them at the end of the creditor's line and make it unlikely. The effect of the auxiliary verb "would" appears on the translation of the first phrase as "I will put it in ". That is,
According to the present invention, it is possible to eliminate unnaturalness due to the effect of the auxiliary verb not appearing in the translated sentences of the parallel clauses including the auxiliary verb.

【００９４】また、請求項２に係る発明の機械翻訳装置
におけるターゲット言語生成部は、特定の格のみを共有
する並列節を検出する並列節検出手段と、上記並列節を
成す各節から助動詞を検出する助動詞検出手段と、末尾
の単語が上記検出された助動詞に応じた終止形に変換さ
れた訳語列を生成する助動詞訳生成手段を有しているの
で、入力文中に特定の格のみを共有する助動詞を含む並
列節が存在する場合には、当該並列節を成す各節の何れ
にも当該助動詞に基づく様相を反映させて節毎の翻訳が
行われる。すなわち、この発明によれば、請求項１に係
る発明と同様に、助動詞を含む並列節を成す各節の訳文
に上記助動詞の効果が現れないことによる不自然さを解
消できる。The target language generation unit in the machine translation device according to the second aspect of the present invention includes a parallel clause detecting means for detecting a parallel clause sharing only a specific case, and an auxiliary verb from each clause forming the parallel clause. Since it has auxiliary verb detection means for detecting and auxiliary verb translation generating means for generating a translated word string in which the last word is converted to the final form according to the detected auxiliary verb, only a specific case is shared in the input sentence. When there is a parallel clause including the auxiliary verb, the translation based on the auxiliary verb is performed in each of the clauses forming the parallel clause. That is, according to the present invention, similar to the invention according to claim 1, it is possible to eliminate the unnaturalness due to the effect of the auxiliary verb not appearing in the translated sentence of each clause forming the parallel clause including the auxiliary verb.

【００９５】また、請求項３に係る発明の機械翻訳装置
における訳文フレーズ構築部は、入力文における係りの
語と受けの語とが異なる節に分割された場合には、上記
受けの語を含む節の訳語列における当該受けの語の直前
に上記係りの語を含む節が本来存在することを表す補助
記号を生成するので、上記受けの語を含む節の訳語列中
に上記係りの語を含む節が本来存在することを明示した
各節毎の翻訳を行うことができる。In the machine translation device according to the third aspect of the present invention, the translated phrase construction unit includes the received word when the related word and the received word in the input sentence are divided into different sections. Since a supplementary symbol is generated just before the word of the word in the translation word of the clause, which indicates that the clause including the word of the dependency originally exists, the word of the dependency is included in the translation string of the clause including the word of the phrase. It is possible to perform translation for each clause that clearly indicates that the containing clause originally exists.

【００９６】したがって、ある節が他の節から丸ごと抜
けたような構造を有する原文「Keating is the subject
of a class-action lawsuit allegingthey misled inv
estors about the safety of junk bonds.」に対するフ
レーズ訳の結果が「Keating is the subject of a class-action lawsuit alleging キーティングは〜と主張する集団訴訟の主体である they misled investors about the safety of junk bonds. それらがジャンク・ボンドの安全性について投資家を誤らせた」の如く得られる。すなわち、この発明によれば、上記受
け側の節における上記補助記号の箇所からその節の格要
素である係り側の節が丸ごと抜けていることが分かり易
く、読み易い訳文フレーズを得ることができる。Therefore, the original sentence "Keating is the subject" has a structure in which one clause is completely omitted from another clause.
of a class-action lawsuit alleging they misled inv
The result of the phrase translation for `` estors about the safety of junk bonds. '' is `` Keating is the subject of a class-action lawsuit alleging they misled investors about the safety of junk bonds. They misleaded investors about the safety of junk bonds. " That is, according to the present invention, it is easy to understand that the clause on the engagement side, which is a case element of the clause, is completely omitted from the location of the supplementary symbol in the clause on the reception side, and a translated phrase that is easy to read can be obtained. .

【００９７】また、請求項４に係る発明の機械翻訳装置
における訳文フレーズ構築部は、入力文における係りの
語と受けの語が異なる節に分割されて上記係りの語を含
む節が受けの語を含む節から丸ごと抜けている構造を有
する中抜け節を検出する中抜け節検出手段と、上記受け
の語を含む節の訳語列における当該受けの語の直前に補
助記号を生成する補助記号生成手段と、上記生成された
補助記号の直後に付属語を生成する付属語生成手段を有
しているので、入力文中における係りの語と受けの語と
が異なる節に分割された場合には、上記受けの語側の節
の訳語列に上記係りの語側の節が抜けたことを表す補助
記号を挿入して各節毎の翻訳を行うことができる。すな
わち、この発明によれば、請求項３に係る発明と同様
に、上記受け側の節からその節の格要素である係り側の
節が丸ごと抜けていることが分かり易く、読み易い訳文
フレーズを得ることができる。Further, in the translation phrase construction unit in the machine translation device of the invention according to claim 4, the related word and the receiving word in the input sentence are divided into different clauses, and the clause including the related word is the receiving word. Hollow clause detecting means for detecting a hollow clause having a structure in which it is completely omitted from a clause containing, and auxiliary symbol generation for generating an auxiliary symbol immediately before the word of the word in the translated word string of the clause containing the word of the word. Since it has a means and an adjunct word generating means for generating an adjunct word immediately after the generated auxiliary symbol, when the related word and the receiving word in the input sentence are divided into different clauses, It is possible to perform translation for each clause by inserting an auxiliary symbol indicating that the clause on the word side of the relation has been omitted in the translated word string of the clause on the word side of the reception. That is, according to the present invention, as in the invention according to claim 3, it is easy to understand that the clause on the receiving side, which is a case element of the clause, is completely omitted from the clause on the receiving side, and a translation phrase that is easy to read is provided. Obtainable.

【００９８】また、請求項５に係る発明の機械翻訳装置
における訳文フレーズ構築部は、分離されて格要素が欠
落した関係節の訳語列中における上記欠落した格要素の
位置に当該関係節における関係詞の先行詞の訳語を生成
するので、関係節の訳語列中に関係詞の訳語として先行
詞の訳語を挿入した翻訳結果を得ることができる。Further, the translation phrase construction unit in the machine translation device of the invention according to claim 5 is arranged such that the position of the missing case element in the translated word string of the separated relative case in which the case element is missing is related to the relational clause. Since the translated word of the antecedent of the phrase is generated, it is possible to obtain the translation result in which the translated word of the antecedent is inserted into the translated word string of the relative clause as the translated word of the relative phrase.

【００９９】したがって、分離すると格要素が欠落して
しまう関係節を有する原文「The plan was opposed by
some of top executiveswho invested in the company'
s securities.」に対するフレーズ訳の結果が「The plan was opposed by some of top executives 計画はいくらかのトップ幹部によって反対された who invested in the company's securities. その幹部は会社の有価証券に投資した」の如く得られる。すなわち、この発明によれば、関係節
における欠落した格要素が補われて、単独で読んでも非
常に分かり易い翻訳結果が得られる。Therefore, the original sentence “The plan was opposed by” has a relational clause in which case elements are lost when separated.
some of top executiveswho invested in the company '
The result of the phrase translation for "s securities." is "The plan was opposed by some of top executives who invested in the company's securities. The executives invested in the company's securities." can get. That is, according to the present invention, the missing case element in the relational clause is compensated, and a very easy-to-understand translation result can be obtained even when read alone.

【０１００】また、請求項６に係る発明の機械翻訳装置
における訳文フレーズ構築部は、分離されて各要素が欠
落した関係節を検出する関係節検出手段と、当該関係節
における関係詞の先行詞の訳語を得る先行詞取得手段
と、得られた当該先行詞の訳語を当該関係節の訳語列中
における上記欠落した格要素の位置に当該関係詞の訳語
として生成する関係詞訳生成手段と、生成された当該関
係詞の訳語の直後に後置語を生成する後置語生成手段を
有しているので、分割位置決定部によって分離されて格
要素が欠落した関係節の訳語列中に当該関係詞の訳語と
して先行詞の訳語が挿入された翻訳結果を得ることがで
きる。すなわち、この発明によれば、請求項５に係る発
明と同様に、関係節における欠落した格要素が補われ
て、単独で読んでも非常に分かり易い関係節の翻訳結果
が得られる。Further, the translated phrase construction unit in the machine translation device according to the sixth aspect of the present invention includes a relative clause detecting means for detecting a relative clause in which each element is separated and each element is missing, and an antecedent of the relative clause in the relative clause. An antecedent acquisition unit that obtains a translated word of the relative phrase, and a relative word translation generation unit that generates the translated word of the obtained antecedent at the position of the missing case element in the translation string of the relative clause as a translation of the relative phrase, Since the postposition word generating means for generating a postposition word is provided immediately after the generated translation word of the relative phrase, the translation word string of the relative clause separated by the division position determining unit and having the case element missing It is possible to obtain a translation result in which the translation of the antecedent is inserted as the translation of the relative. That is, according to the present invention, as in the invention according to claim 5, the missing case element in the relational clause is compensated, and the translation result of the relational clause which is very easy to understand even when read alone can be obtained.

【０１０１】また、請求項７に係る発明発明の機械翻訳
装置における上記訳文フレーズ構築部は、上記関係詞訳
生成手段によって生成された当該関係詞の訳語に係る修
飾語を生成する補助修飾語生成手段を有しているので、
当該関係詞の訳語が生成された関係節の可読性を更に向
上させることができる。Further, in the machine translation device of the invention according to claim 7, the translated phrase constructing unit generates an auxiliary modifier generating the modifier associated with the translation of the relative word generated by the relative translation generating means. Because it has the means
It is possible to further improve the readability of the relative clause in which the translated word of the relative word is generated.

[Brief description of the drawings]

【図１】この発明の機械翻訳装置におけるブロック図で
ある。FIG. 1 is a block diagram of a machine translation device of the present invention.

【図２】図１におけるターゲット言語生成部の詳細な構
成図である。FIG. 2 is a detailed configuration diagram of a target language generation unit in FIG.

【図３】図１における訳文フレーズ構築部の詳細な構成
図である。FIG. 3 is a detailed configuration diagram of a translated phrase construction unit in FIG.

【図４】図１における制御部による制御の下に実行され
るフレーズ単位翻訳処理動作のフローチャートである。FIG. 4 is a flowchart of a phrase-based translation processing operation executed under the control of the control unit in FIG.

【図５】図４に示すフレーズ単位翻訳処理動作において
実行されるターゲット言語生成サブルーチンのフローチ
ャートである。5 is a flowchart of a target language generation subroutine executed in the phrase unit translation processing operation shown in FIG.

【図６】機械翻訳による解析レベルの説明図である。FIG. 6 is an explanatory diagram of an analysis level by machine translation.

【図７】図１における原文バッファの内容の一例を示す
概念図である。FIG. 7 is a conceptual diagram showing an example of the contents of the original text buffer in FIG.

【図８】フレーズ分割規則の一例を示す図である。FIG. 8 is a diagram showing an example of a phrase division rule.

【図９】変換前の構文解析木の一例を示す概念図であ
る。FIG. 9 is a conceptual diagram showing an example of a syntax analysis tree before conversion.

【図１０】変換後の構文解析木の一例を示す概念図であ
る。FIG. 10 is a conceptual diagram showing an example of a syntax analysis tree after conversion.

【図１１】図１における助動詞コード表メモリの内容の
一例を示す概念図である。11 is a conceptual diagram showing an example of the contents of an auxiliary verb code table memory in FIG.

【図１２】図１における助動詞バッファの内容の一例を
示す概念図である。FIG. 12 is a conceptual diagram showing an example of contents of an auxiliary verb buffer in FIG.

【図１３】図１における訳語バッファの内容の一例を示
す概念図である。FIG. 13 is a conceptual diagram showing an example of contents of a translated word buffer in FIG. 1.

【図１４】フレーズ訳が指令されない場合における図１
３に対応する訳語バッファの内容を示す概念図である。FIG. 14 is a diagram when the phrase translation is not commanded.
It is a conceptual diagram which shows the content of the translation word buffer corresponding to 3.

【図１５】図１における原語・訳語対応情報バッファの
内容の一例を示す概念図である。FIG. 15 is a conceptual diagram showing an example of contents of an original language / translation word correspondence information buffer in FIG. 1.

【図１６】図１における分割位置バッファの内容の一例
を示す概念図である。16 is a conceptual diagram showing an example of the contents of a division position buffer in FIG.

【図１７】図１におけるフレーズ毎の訳語単語位置バッ
ファの内容の一例を示す概念図である。17 is a conceptual diagram showing an example of contents of a translated word position buffer for each phrase in FIG. 1. FIG.

【図１８】図１における訳フレーズバッファの内容の一
例を示す概念図である。FIG. 18 is a conceptual diagram showing an example of contents of a translated phrase buffer in FIG. 1.

【図１９】フレーズ訳結果の表示例を示す図である。FIG. 19 is a diagram showing a display example of a phrase translation result.

【図２０】図９とは異なる変換前の構文解析木の概念図
である。FIG. 20 is a conceptual diagram of a syntax analysis tree before conversion, which is different from FIG. 9.

【図２１】図１３とは異なる訳語バッファの内容を示す
概念図である。FIG. 21 is a conceptual diagram showing contents of a translation word buffer different from FIG.

【図２２】図１５とは異なる原語・訳語対応情報バッフ
ァの内容を示す概念図である。22 is a conceptual diagram showing the contents of an original language / translation word correspondence information buffer different from FIG.

【図２３】図１６とは異なる分割位置バッファの内容を
示す概念図である。23 is a conceptual diagram showing the contents of a division position buffer different from FIG.

【図２４】図１７とは異なるフレーズ毎の訳語単語位置
バッファの内容を示す概念図である。FIG. 24 is a conceptual diagram showing the contents of a translated word position buffer for each phrase different from FIG.

【図２５】図２４に対して修正が行われた後のフレーズ
毎の訳語単語位置バッファの内容を示す概念図である。FIG. 25 is a conceptual diagram showing the contents of a translated word position buffer for each phrase after correction is made to FIG. 24.

【図２６】図１における補助訳語テーブルメモリの内容
の一例を示す概念図である。FIG. 26 is a conceptual diagram showing an example of the contents of an auxiliary translation table memory in FIG.

【図２７】図１８とは異なる訳フレーズバッファの内容
を示す概念図である。FIG. 27 is a conceptual diagram showing the contents of a translated phrase buffer different from FIG.

【図２８】図１９とは異なるフレーズ訳結果の表示例を
示す図である。28 is a diagram showing a display example of a phrase translation result different from FIG.

【図２９】図９および図２０とは異なる変換前の構文解
析木の概念図である。29 is a conceptual diagram of a syntax analysis tree before conversion, which is different from FIGS. 9 and 20. FIG.

【図３０】図１３および図２１とは異なる訳語バッファ
の内容を示す概念図である。FIG. 30 is a conceptual diagram showing contents of a translation word buffer different from those in FIGS. 13 and 21.

【図３１】図１５および図２２とは異なる原語・訳語対
応情報バッファの内容を示す概念図である。FIG. 31 is a conceptual diagram showing the contents of an original language / translation word correspondence information buffer different from those in FIGS. 15 and 22.

【図３２】図１６および図２３とは異なる分割位置バッ
ファの内容を示す概念図である。FIG. 32 is a conceptual diagram showing contents of a division position buffer different from FIGS. 16 and 23.

【図３３】図１７および図２４とは異なるフレーズ毎の
訳語単語位置バッファの内容を示す概念図である。FIG. 33 is a conceptual diagram showing the contents of a translated word position buffer for each phrase different from FIGS. 17 and 24.

【図３４】図１における辞書メモリの一部を示す概念図
である。34 is a conceptual diagram showing a part of the dictionary memory in FIG.

【図３５】図１における後置語テーブルメモリの内容の
一例を示す概念図である。35 is a conceptual diagram showing an example of the contents of the postword table memory in FIG. 1. FIG.

【図３６】図１７および図２４とは異なるフレーズ毎の
訳語単語位置バッファの内容を示す概念図である。FIG. 36 is a conceptual diagram showing the contents of a translated word position buffer for each phrase different from FIGS. 17 and 24;

【図３７】図１８及び図２７とは異なる訳フレーズバッ
ファの内容を示す概念図である。FIG. 37 is a conceptual diagram showing the contents of a translated phrase buffer different from those in FIGS. 18 and 27.

【図３８】図１９及び図２８とは異なるフレーズ訳結果
の表示例を示す図である。FIG. 38 is a diagram showing a display example of a phrase translation result different from FIGS. 19 and 28.

[Explanation of symbols]

１…入力部、２…翻訳制御部、
２Ａ…翻訳モジュール、２a…辞書引き・形
態素解析部、２b…構文解析部、２c
…構文変換部、２d…ターゲット言語生成部、２
e…分割位置決定部、２f…訳文フレーズ構築部、
３…メインメモリ、３c…変換前の構文バッファ、
３d…変換後の構文バッファ、３e…訳語バッフ
ァ、３f…原語・訳語対応情報バッフ
ァ、３g…分割位置バッファ、３h…フレーズ毎の訳語単
語位置バッファ、３i…訳フレーズバッファ、
３k…助動詞バッファ、４…記憶部、
４a…辞書メモリ、４c…後置語テーブルメモリ、
４d…補助訳語テーブルメモリ、４g…助動詞コ
ード表メモリ、５…出力部、６…制御部、
１１…木構造トレース手段、１３…助
動詞文訳出法決定手段、１４…助動詞検出手段、１
５…並列節検出手段、１８…助動詞訳生成
手段、２１…中抜けフレーズ検出手段、２２…関係
節検出手段、２３…生成位置修正手段、２４
…補助記号生成手段、２５…補助修飾語生成手段、
２６…先行詞取得手段、２７…後置語生成手段。1 ... Input unit, 2 ... Translation control unit,
2A ... translation module, 2a ... dictionary lookup / morphological analysis unit, 2b ... syntactic analysis unit, 2c
… Syntax converter, 2d… Target language generator, 2
e ... division position determination unit, 2f ... translated phrase construction unit,
3 ... main memory, 3c ... syntax buffer before conversion,
3d ... Syntax buffer after conversion, 3e ... Translated word buffer, 3f ... Original language / translated word correspondence information buffer, 3g ... Dividing position buffer, 3h ... Translated word position buffer for each phrase, 3i ... Translated phrase buffer,
3k ... auxiliary verb buffer, 4 ... storage unit,
4a ... dictionary memory, 4c ... postword table memory,
4d ... auxiliary translation table memory, 4g ... auxiliary verb code table memory, 5 ... output section, 6 ... control section,
11 ... Tree structure tracing means, 13 ... Auxiliary verb sentence translation method determining means, 14 ... Auxiliary verb detecting means, 1
5 ... Parallel clause detecting means, 18 ... auxiliary verb translation generating means, 21 ... hollow phrase detecting means, 22 ... relative clause detecting means, 23 ... generation position correcting means, 24
... auxiliary symbol generating means, 25 ... auxiliary modifier generating means,
26 ... Antecedent acquisition means, 27 ... Postfix generation means.

Claims

[Claims]

1. A syntactic analysis unit for performing syntactic analysis of a sentence in a source language input from an input unit, a syntactic conversion unit for converting the obtained syntactic structure into a syntactic structure in a target language, and based on the conversion result. A target language generation unit that generates a translation string in the target language of the input sentence, a division position determination unit that determines the division position when dividing the input sentence into phrases or clauses according to the division rule, and the division result by the division position determination unit. A machine for translating an input sentence into phrases or clauses, having a translation phrase construction unit for obtaining a translation string in the target language for each phrase or clause based on the target language generated by the target language generation unit and the target language. In the translation device, when the input sentence includes a parallel clause sharing only a specific case, each of the parallel clauses forming the parallel clause. Auxiliary verbs are detected from each of the parallel clauses, and the words at the end of each of the parallel clauses are converted to the final form corresponding to the detected auxiliary verb to generate a target language translation string and share only the specific case. A machine translation device characterized by performing a translation for each clause by reflecting a feature based on the auxiliary verb in any of the clauses forming a parallel clause including the auxiliary verb.

2. The machine translation device according to claim 1, wherein the target language generation unit includes a parallel clause that shares only a specific case based on the syntactic structure in the target language obtained by the syntactic conversion unit. A parallel clause detecting means for detecting, an auxiliary verb detecting means for detecting an auxiliary verb from each clause detected by the parallel clause detecting means, and an auxiliary verb information storage means for storing information indicating the auxiliary verb detected by the auxiliary verb detecting means. , For each clause detected by the parallel clause detecting means, an auxiliary verb translation generation for generating a translated word string in a target language in which the last word is converted to an end form according to the information stored in the auxiliary verb information storage means A machine translation device comprising means.

3. A syntax analysis unit for performing syntax analysis of a sentence in a source language input from an input unit, a syntax conversion unit for converting the obtained syntax structure into a syntax structure in a target language, and based on the conversion result. A target language generation unit that generates a translation string in the target language of the input sentence, a division position determination unit that determines the division position when dividing the input sentence into phrases or clauses according to the division rule, and the division result by the division position determination unit. A machine for translating an input sentence into phrases or clauses, having a translation phrase construction unit for obtaining a translation string in the target language for each phrase or clause based on the target language generated by the target language generation unit and the target language. In the translation device, the translated phrase construction unit has a clause in which a dependent word and a dependent word differ depending on the division in the division position determining unit. When divided, an auxiliary symbol indicating that the clause including the related word originally exists is generated immediately before the corresponding word in the translated word string of the clause including the related word. Machine translation device.

4. The machine translation device according to claim 3, wherein the translated phrase construction unit is based on a translated string in the target language obtained for each clause, and has a dependency relation in an input sentence. And a word of the receiving word are divided into different clauses, and a clause containing the related word has a structure in which the clause containing the word of the receiving word is entirely omitted from the clause, and a hollow element detecting means, When the hollowing-out clause is detected by the detecting means, an auxiliary symbol is generated immediately before the word of the receiving word in the translated word string of the clause including the receiving word, which indicates that the clause including the related word is omitted. A machine translation device comprising: a symbol generating unit; and an adjunct word generating unit for generating an adjunct word representing the dependency relation immediately after the auxiliary symbol generated by the auxiliary symbol generating unit.

5. A syntax analysis unit for performing syntax analysis of a sentence in a source language input from an input unit, a syntax conversion unit for converting the obtained syntax structure into a syntax structure in a target language, and based on the conversion result. A target language generation unit that generates a translation string in the target language of the input sentence, a division position determination unit that determines the division position when dividing the input sentence into phrases or clauses according to the division rule, and the division result by the division position determination unit. A machine for translating an input sentence into phrases or clauses, having a translation phrase construction unit for obtaining a translation string in the target language for each phrase or clause based on the target language generated by the target language generation unit and the target language. In the translation device, the translated phrase construction unit is arranged in the translated word sequence of the related clause separated by the division position determination unit and lacking case elements. The position of the missing case elements, the machine translation apparatus and generating a translation of the antecedent relationship lyrics in the relative clause generated by the target language generator.

6. The machine translation device according to claim 5, wherein the translated phrase construction unit is based on the syntactic structure obtained by the syntactic analysis unit.
A relative clause detecting unit that detects a relative clause that is separated by the division position detecting unit and has a missing case element; and an input generated by the target language generating unit when the relative clause is detected by the relative clause detecting unit. An antecedent acquisition unit that obtains a translation of the antecedent of the relative in the relative clause based on the translation of the sentence, and a translation of the antecedent obtained by the antecedent acquisition unit in the translation of the relative clause. Immediately after the relative word translation generating means for generating the relative word at the position of the missing case element, and the relative word translation for the relative word generated by the relative word translation generating means, the case of the translated word of the relative word is displayed. A machine translation device comprising a postword generation unit that generates a postword to represent.

7. The machine translation device according to claim 6, wherein the translated phrase construction unit generates a modifier associated with a translated word of the relative word generated by the relative word translation generation unit, and the relative word. A machine translation device comprising auxiliary modifier generation means for improving the readability of a clause in which the translated word is generated.