JPH07295985A

JPH07295985A - Machine translation device

Info

Publication number: JPH07295985A
Application number: JP6089633A
Authority: JP
Inventors: Takeshi Kuzumi; 毅九津見; Hidezo Kugimiya; 秀造釘宮
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1994-04-27
Filing date: 1994-04-27
Publication date: 1995-11-10

Abstract

PURPOSE:To generate partial symbols of kinds different from the inside and the outside of parentheses by inserting delimiter symbols of kinds different for parts satisfying a prescribed condition from parts not satisfying a prescribed condition to borders of partial structures in a translation sentence. CONSTITUTION:A target language generating section (translation sentence generating section) 15d generates a translation sentence based on a syntax structure and a translated word strings of a target language. A delimiter symbol generating section 15f inserts delimiter symbols of kinds different for a position in the translated word strings corresponding to a border in a part satisfying a prescribed condition from a part not satisfying a prescribed condition to the border of partial structures in the translated sentence generated based on the result of discrimination by an in-sentence condition discrimination section 15e. Thus, plural kinds of delimiter symbols are generated and partial symbols of kinds different from the inside and the outside of parentheses are generated.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は、原文を文単位に翻訳
する際に何らかの条件に応じて文より小さい部分に分割
された翻訳文を出力する機械翻訳装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a machine translation device for outputting a translated sentence divided into smaller parts according to some conditions when translating an original sentence into sentence units.

【０００２】[0002]

【従来の技術】現在の機械翻訳装置は、句点あるいはピ
リオド等を区切りとする一つのまとまりである“文"を
一回の処理単位として翻訳を実施するものが主流であ
る。このような機械翻訳装置は、１つの“文"の内部で
その文の構造を解析し、その解析結果に基づいて、１つ
の文から１つの独立した構造(つまり、構文)を作り上げ
ることによって翻訳を実施する。2. Description of the Related Art Most current machine translation devices perform translation using a "sentence", which is a unit of delimiters such as punctuation marks or periods, as one processing unit. Such a machine translation device analyzes the structure of a sentence within one "sentence" and translates it by constructing one independent structure (that is, syntax) from one sentence based on the analysis result. Carry out.

【０００３】一般に、機械翻訳処理には図６に示すよう
なレベルがある。図中左側は翻訳元の言語(ソース言語)
の解析レベルであり、右側は翻訳先の言語(ターゲット
言語)の生成レベルである。今、ソース言語による文が
入力されると、解析レベルＬ₁での辞書引き,解析レベル
Ｌ₂での形態素解析,解析レベルＬ₃での構文解析…と解
析が進められる。Generally, the machine translation processing has levels as shown in FIG. The left side of the figure is the source language (source language)
Is the analysis level, and the right side is the generation level of the translation destination language (target language). Now, when a sentence in the source language is input, analysis is proceeded with dictionary lookup at the analysis level L ₁ , morphological analysis at the analysis level L ₂ , syntactic analysis at the analysis level L ₃ , and so on.

【０００４】機械翻訳は、上記解析レベルによって大き
く次の２つに分けられる。その１つは、解析レベルＬ₆
のソース言語またはターゲット言語の何れにも依存しな
い概念(中間言語と呼ぶ)まで解析し、そこから生成レベ
ルＬ₇での文脈生成,生成レベルＬ₈での意味生成,生成レ
ベルＬ₉での構文生成,生成レベルＬ₁₀での形態素生成と
生成を進めて、ターゲット言語による文を生成していく
ピボット方式である。もう一つは、上記解析レベルＬ₂
での形態素解析,解析レベルＬ₃での構文解析，解析レベ
ルＬ_４での意味解析,解析レベルＬ₅での文脈解析の何れ
かまで解析を行ってソース言語の内部構造を得る。次
に、この得られたソース言語の内部構造をそれと同じレ
ベルのターゲット言語の内部構造に変換した後、ターゲ
ット言語の文を生成するトランスファー方式である。Machine translation is roughly divided into the following two types according to the above analysis level. One is analysis level L ₆
Source language or in any of the target language analyzed to concepts that are independent (referred to as the intermediate language), the context generation in generation level L ₇ therefrom, generating means at production level L _8, syntax of the generated level L ₉ of This is a pivot method for generating a sentence in a target language by proceeding with morpheme generation and generation at the generation and generation level L ₁₀ . The other is the above analysis level L ₂
Obtaining an internal structure of the source language by performing morphological analysis, parsing the analysis level L _3, semantic analysis of the analysis level L _4, the analysis until either contextual analysis at the analysis level L ₅ in. Next, it is a transfer method in which the obtained internal structure of the source language is converted into the internal structure of the target language at the same level as that, and then a sentence of the target language is generated.

【０００５】ここで、上記解析レベルにおける各解析に
ついて詳細に説明する。・辞書引き形態素解析メモリ等に格納されている辞書の情報を用いて原文を各
形態素(単語：以下、単に単語と言う)に分割し、各単語
の品詞等の文法情報や訳語を得、さらに、時制・人称・数
等を解析する。・構文解析各単語間の係り受け関係等の文の構造を表す構造解析木
を決定する。・意味解析複数の構文解析結果から意味的に正しいものとそうでな
いものとを判別する。・文脈解析話題を理解して、省略されたものの推定や曖昧さの解消
等を行う。以下に述べる機械翻訳装置は、少なくとも解
析レベルＬ₃の構文解析までの解析を行うトランスファ
ー方式によるものとする。Here, each analysis at the above analysis level will be described in detail.・ Dictionary lookup Morphological analysis Dividing the original sentence into morphemes (words: simply called words) using the information of the dictionary stored in the memory etc., and obtaining grammatical information such as the part of speech of each word and translated words, and Analyze the tense, person, number, etc. -Syntactic analysis Determine the structural analysis tree that represents the structure of the sentence such as the dependency relation between each word.・ Semantic analysis Discriminate semantically correct and improper ones from multiple syntactic analysis results.・ Context analysis Understand the topic and estimate the omitted ones and resolve the ambiguity. The machine translation device described below is based on the transfer method for performing at least parsing up to the parsing level L ₃ .

【０００６】上述のような機械翻訳においては、１つの
文から１つの構造(構文)を決定できない場合がある。こ
のような場合、従来の多くの機械翻訳装置では、何らか
の形で１つの文を複数の部分に分割して翻訳することが
可能なようになっている。例えば、特開昭６３−８８６
４号公報に開示されている翻訳装置においては、入力さ
れた１つの文からテーブルに登録された文法規則を用い
て１つの構造を作り上げることに失敗した場合には、当
該文を幾つかに分割した部分構造から成る構造を作り上
げるのである。In the above machine translation, there are cases where one structure (syntax) cannot be determined from one sentence. In such a case, in many conventional machine translation devices, one sentence can be divided into a plurality of parts and translated in some way. For example, JP-A-63-886
In the translation device disclosed in Japanese Patent Laid-Open No. 4 publication, when it is unsuccessful to construct one structure from one input sentence using the grammatical rules registered in the table, the sentence is divided into several parts. The structure consisting of the substructures is created.

【０００７】以下、上述のような、入力された文を幾つ
かに分割した部分構造の形で翻訳文を出力する分割翻訳
処理機能を有する機械翻訳装置についてさらに詳細に説
明する。図７は、上記分割翻訳処理機能を有する機械翻
訳装置のブロック図である。The machine translation device having a division translation processing function for outputting a translation sentence in the form of a partial structure obtained by dividing the input sentence into several pieces as described above will be described in more detail below. FIG. 7 is a block diagram of a machine translation device having the divisional translation processing function.

【０００８】入力部２は、キーボード,マウス,ペン,タ
ブレット,スキャナあるいは文字認識装置等の入力装置
や通信回線と接続されている通信装置で構成され、翻訳
の対象となるソース言語による原文等が入力される。ま
た、出力部３は、ＣＲＴ(陰極線管)あるいはＬＣＤ(液
晶表示装置)等の表示装置やプリンタ等の印字装置や通
信回線と接続されている通信装置で構成され、翻訳結果
等を出力する。翻訳用メインメモリ７は、翻訳処理の際
に得られた情報を一旦格納する原文バッファ７a,辞書引
きバッファ７b,変換前の構文バッファ７c,変換後の構文
バッファ７dおよび訳文バッファ７e等の各種バッファを
有している。The input unit 2 is composed of an input device such as a keyboard, a mouse, a pen, a tablet, a scanner or a character recognition device and a communication device connected to a communication line. Is entered. The output unit 3 is composed of a display device such as a CRT (cathode ray tube) or LCD (liquid crystal display device), a printing device such as a printer, or a communication device connected to a communication line, and outputs a translation result or the like. The translation main memory 7 includes various buffers such as an original sentence buffer 7a, a dictionary lookup buffer 7b, a pre-conversion syntax buffer 7c, a post-conversion syntax buffer 7d, and a translation buffer 7e for temporarily storing information obtained during translation processing. have.

【０００９】翻訳モジュール５は、図７に示すように、
辞書引き・形態素解析部５a,構文解析部５b,構文変換部
５cおよびターゲット言語生成部５dを有し、さらに、タ
ーゲット言語生成部５dは１種類の分割記号を生成する
分割記号生成部５fを有して、図６に示す解析レベルＬ₃
の構文解析までの解析を行う。そして、ソース言語の文
が入力されると、記憶部６に登録されている辞書,文法
規則,木構造変換規則等の翻訳処理の際に使用される各
種規則を用いてソース言語による入力原文をターゲット
言語に変換する。ここで、上記翻訳モジュール５におけ
る構文解析部５b及び構文変換部５cは、分割翻訳に適合
した構文解析処理および構文変換処理を実施するように
なっている。The translation module 5, as shown in FIG.
The dictionary lookup / morpheme analysis unit 5a, the syntax analysis unit 5b, the syntax conversion unit 5c, and the target language generation unit 5d are provided, and the target language generation unit 5d further includes a division symbol generation unit 5f that generates one type of division symbol. Then, the analysis level L ₃ shown in FIG.
Performs parsing up to the parsing of. When a source language sentence is input, the source language input source sentence is converted using various rules used in translation processing such as a dictionary, grammar rules, and tree structure conversion rules registered in the storage unit 6. Convert to target language. Here, the syntactic analysis unit 5b and the syntactic conversion unit 5c in the translation module 5 are adapted to perform syntactic analysis process and syntactic conversion process suitable for divided translation.

【００１０】翻訳制御部１はＣＰＵ(中央演算処理装置)
によって構成され、上記入力部２,出力部３,翻訳モジュ
ール５,記憶部６および翻訳用メインメモリ７を制御し
て翻訳処理を実施する。尚、４はバスラインである。The translation control unit 1 is a CPU (central processing unit)
The input unit 2, the output unit 3, the translation module 5, the storage unit 6 and the translation main memory 7 are controlled to perform translation processing. Incidentally, 4 is a bus line.

【００１１】上記構成の機械翻訳装置は、次のように動
作してソース言語(英語)による入力原文をターゲット言
語(日本語)による翻訳文に変換する。上記翻訳制御部１
の制御の下に、入力部２によって入力された原文「 The demand ( as measured by the aggregate,busines
s investment ) increased all through the year.」は、図８に示すように、翻訳用メインメモリ７における
原文バッファ７aに格納される。こうして、上記原文バ
ッファ７aに格納された原文は、上記翻訳制御部１の制
御の下に、出力部３によって、図９に示すように表示さ
れる。尚、その際には、図中右側における翻訳文の欄は
空欄となっている。The machine translation device having the above configuration operates as follows to convert an input source sentence in a source language (English) into a translated sentence in a target language (Japanese). Translation control unit 1
Under the control of the original sentence "The demand (as measured by the aggregate, busines
s investment) increased all through the year. "is stored in the original text buffer 7a in the main translation memory 7, as shown in FIG. In this way, the original sentence stored in the original sentence buffer 7a is displayed by the output unit 3 as shown in FIG. 9 under the control of the translation control unit 1. In this case, the translated sentence column on the right side of the drawing is blank.

【００１２】上記翻訳制御部１の制御の下に、翻訳モジ
ュール５における辞書引き・形態素解析部５aによって、
原文バッファ７aに格納された原文が読み出され、記憶
部６に格納された辞書の情報を用いて各形態素に分割さ
れる。そして、各単語毎に品詞,訳語,時制,人称,数等の
情報が得られて、翻訳用メインメモリ７における辞書引
きバッファ７bに格納される。その際における品詞情報
は図１０に示すように格納される。ここで、単語「deman
d」は“名詞"と“動詞"との２品詞を有する多品詞語であ
るが、構文解析部５bによって一意に決定されるのであ
る。Under the control of the translation control unit 1, the dictionary lookup / morphological analysis unit 5a in the translation module 5
The original sentence stored in the original sentence buffer 7a is read and divided into each morpheme using the information of the dictionary stored in the storage unit 6. Then, information such as a part of speech, a translated word, a tense, a personal name, and a number is obtained for each word, and stored in the dictionary lookup buffer 7b in the translation main memory 7. The part-of-speech information at that time is stored as shown in FIG. Where the word "deman
"d" is a multi-part word having two parts of speech, "noun" and "verb", but is uniquely determined by the syntactic analysis unit 5b.

【００１３】次に、上記構文解析部５bによって、辞書
引き・形態素解析部５aで得られた情報と記憶部６に格納
された文法規則とに基づいて、各単語間の係り受け関係
を表す英語の構造解析木が図１１に示すように決定され
る。そして、得られた英語の構造解析木は翻訳用メイン
メモリ７における変換前の構文バッファ７cに格納され
る。その際に、構造解析木は“分割要素"という特別の
要素を併用して組み立てられている。Next, the syntactic analysis unit 5b, based on the information obtained by the dictionary lookup / morpheme analysis unit 5a and the grammatical rules stored in the storage unit 6, expresses the dependency relation between each word. 11 is determined as shown in FIG. Then, the obtained English structural analysis tree is stored in the pre-conversion syntax buffer 7c in the translation main memory 7. At that time, the structural analysis tree is constructed by using a special element called "splitting element" together.

【００１４】次に、上記構文変換部５cによって、記憶
部６に格納された木構造変換規則に基づいて、英語の構
造解析木が日本語の構造解析木に変換される。さらに、
翻訳用メインメモリ７における辞書引きバッファ７bに
格納された訳語を用いて、上記日本語の構造解析木にお
ける各要素の訳語が決定される。こうして得られた日本
語の構造解析木とその各要素の訳語とは、図１２に示す
ように、翻訳用メインメモリ７における変換後の構文バ
ッファ７dに格納される。次に、上記ターゲット言語生
成部５dによって、構文バッファ７dに格納された日本語
の構造解析木および辞書引きバッファ７bに格納された
情報を参照して、日本語の構造解析木に付加されている
訳語の列「需要（総計よって測定するように企業投資）
増大するその年間のすべて」に適切な助詞や助動詞が付けられると共に、最適な文型
に変化されて、日本語による翻訳文「需要（総計によって測定されたように‖企業投資）
は、増大した‖ その年の間の全て。」が生成される。こうして生成された翻訳文は、図１３に
示すように、翻訳用メインメモリ７における訳文バッフ
ァ７eに格納される。また、上記訳文バッファ７eに格納
された当該翻訳文は、翻訳制御部１の制御の下に、出力
部３によって、図９に示すように原文と対応付けられて
出力されるのである。Next, the syntax conversion unit 5c converts the English structural analysis tree into a Japanese structural analysis tree based on the tree structure conversion rules stored in the storage unit 6. further,
Using the translation word stored in the dictionary lookup buffer 7b in the translation main memory 7, the translation word of each element in the Japanese structural analysis tree is determined. The Japanese structural analysis tree thus obtained and the translated words of each element thereof are stored in the translated syntax buffer 7d in the translation main memory 7, as shown in FIG. Next, the target language generation unit 5d refers to the Japanese structural analysis tree stored in the syntax buffer 7d and the information stored in the dictionary lookup buffer 7b, and adds them to the Japanese structural analysis tree. Translation column “Demand (corporate investment as measured by aggregate)”
With the appropriate particles and auxiliary verbs added to "all increasing during the year", the sentence is transformed into an optimal sentence pattern, and translated into Japanese "Demand (as measured by total business investment").
Increased ‖ all over the year. Is generated. The translated sentence thus generated is stored in the translated sentence buffer 7e in the main translation memory 7, as shown in FIG. The translated sentence stored in the translated sentence buffer 7e is output by the output unit 3 under the control of the translation control unit 1 in association with the original sentence as shown in FIG.

【００１５】上述のように、上記ターゲット言語生成部
５dによって翻訳文が生成されるに際して、分割記号生
成部５fによって、１種類の分割記号(この場合には
“‖")が生成されるのである。その際における分割記号
は、 (１) ターゲット言語の構造解析木中に“分割要素"ノ
ードを頂点とする部分木が存在する。 (２) 当該訳語は、“分割要素"ノードを頂点とする部
分木における末端ノードであって、右端に位置するノー
ドの訳語(その訳語に付属語が付随している場合にはそ
の付属語も含めた訳語)である。さらに、上記訳語列に
おける末尾の訳語ではない。という生成基準に従って、(２)を満たす訳語が訳文バッ
ファ７eに格納された後に生成されるのである。As described above, when the translated sentence is generated by the target language generating unit 5d, one kind of dividing symbol (in this case, "‖") is generated by the dividing symbol generating unit 5f. . The division symbols used in this case are: (1) There is a subtree having the "partitioning element" node as a vertex in the structure analysis tree of the target language. (2) The target word is the end node of the subtree with the "partitioning element" node as the apex, and the target word of the node located at the right end (if the target word has an attached word, the attached word also It is a translation word including). Furthermore, it is not the last translated word in the translated word string. According to the generation criterion, the translated word satisfying the condition (2) is stored in the translated text buffer 7e and then generated.

【００１６】[0016]

【発明が解決しようとする課題】しかしながら、上記従
来の分割翻訳機能を有する機械翻訳装置においては以下
に述べるような問題がある。第１には、上述のように翻
訳モジュール５におけるターゲット言語生成部５dによ
ってターゲット言語による翻訳文が生成されるに際し
て、上記分割記号生成部５fによって唯１種類の分割記
号(上述の例の場合には“‖")が生成される。したがっ
て、生成される分割記号が有する形状の持つ役割の印象
が翻訳文全体を強く支配することになり、そのために翻
訳文が読みにくい場合が生ずる。However, the above-described conventional machine translation device having a division translation function has the following problems. First, when the target language generation unit 5d of the translation module 5 generates a translated sentence in the target language as described above, the division symbol generation unit 5f causes only one type of division symbol (in the case of the above example, "‖") Is generated. Therefore, the impression of the role of the shape of the generated division symbol strongly controls the entire translated sentence, which may make it difficult to read the translated sentence.

【００１７】例えば、上記分割記号“‖"の場合には、
形状の持つ“区切り"としての役割の印象が翻訳文全体
を強く支配する。したがって、上述の日本文「需要（総計によって測定されたように‖企業投資）
は、増大した‖ その年の間の全て。」は、本来は大きく「需要（総計によって測定されたように‖企業投資）
は、増大した‖」と「その年の間の全て。」との２つに分割されるのであるが、「需要（総計によって測定されたように‖」と「企業投資）は、増大した‖」と「その年の間の全て。」との３つに分割されるように見えることになり、読みに
くいのである。For example, in the case of the division symbol “‖”,
The impression of the shape's role as a "break" strongly controls the entire translation. Therefore, the above-mentioned Japanese sentence "Demand (as measured by aggregate ‖ corporate investment)
Increased ‖ all over the year. Is essentially large "Demand (as measured by aggregate ‖ corporate investment)
Is divided into two: “increased ‖” and “everything during the year”, while “demand (as measured by aggregate ‖” and “corporate investment” has increased ‖ It seems to be divided into three parts, "and all of that year." It is difficult to read.

【００１８】第２には、次のような問題がある。すなわ
ち、上記ターゲット言語生成部５dによって生成された
ターゲット言語による翻訳文中に特定の一対の文字記号
(例えば、““，”"等)によって挟まれた部分が存在す
る場合がある。このような文字記号を使用するのは、そ
の特定の一対の文字記号によって挟まれた部分が構文的
にあるいは意味的に他とは異なる点があるのでその点を
強調する目的のためである。その際に、構文解析によっ
て得られた“分割要素"ノードを頂点とする部分木の境
界を示すための分割記号として“区切り"の印象の強い
分割記号（例えば、“‖")を用いると、ある部分を強調
するために挿入した特定の一対の文字記号による強調の
度合が弱くなってしまう。しかしながら、そうであるか
らと言って、“区切り"としての印象の弱い分割記号を
用いると、構文的にあるいは意味的に強調する部分が無
い翻訳文の場合には分割記号が目立たず、分割記号の効
果が充分に活かされないことになる。Secondly, there is the following problem. That is, a specific pair of character symbols is included in the translated sentence in the target language generated by the target language generation unit 5d.
There may be a part sandwiched by (for example, ““, ””, etc.). The use of such a character symbol is for the purpose of emphasizing that the part sandwiched by the specific pair of character symbols is syntactically or semantically different from the others. At that time, if a partition symbol with a strong impression of “break” (for example, “‖”) is used as the partition symbol for indicating the boundary of the subtree having the “partition element” node as a vertex obtained by the syntactic analysis, The degree of emphasis made by a specific pair of character symbols inserted to emphasize a certain portion becomes weak. However, even if this is the case, if a partition symbol that does not give a strong impression as a "delimiter" is used, the partition symbol is not conspicuous in the case of a translated sentence that has no syntactically or semantically emphasized part, and the partition symbol The effect of will not be fully utilized.

【００１９】例えば、図１３に例示するような翻訳文の
場合には上記特定の一対の文字記号としての括弧によっ
て挟まれた部分が存在し、この括弧で挟まれた部分とそ
れ以外の部分とに夫々分割記号が存在する。元来、括弧
(例えば、“（，）"や“〔，〕”等)を含んだ翻訳文に
おいては、括弧が意味上の大きな区切りとなっている。
ところが、図９に示されるように翻訳文が表示された場
合には、分割記号における“区切り"としての印象が強
すぎるために、括弧によって挟まれた部分のまとまりの
印象が弱められてしまう。しかしながら、そうであるか
らと言って、“区切り"としての印象の弱い分割記号を
用いると、括弧によって挟まれていない部分や括弧の無
い翻訳文の場合には分割記号が目立たず、分割記号の効
果が充分に活かされないことになってしまうのである。For example, in the case of the translated sentence illustrated in FIG. 13, there is a part sandwiched between parentheses as the above-mentioned specific pair of character symbols, and the part sandwiched between the parentheses and the other part. There is a split sign in each. Originally parenthesis
In a translated sentence that includes (for example, "(,)" or "[,]"), parentheses serve as a significant delimiter.
However, when the translated text is displayed as shown in FIG. 9, the impression of the “delimiter” in the division symbol is too strong, and the impression of the unity between the brackets is weakened. However, even if this is the case, if you use a delimiter that does not give the impression of being a "delimiter," the delimiter will not be conspicuous in the part that is not enclosed by parentheses or in a translation that does not have parentheses. The effect will not be fully utilized.

【００２０】そこで、この発明の目的は、複数種類の分
割記号を生成可能であり、括弧を含む特定の一対の記号
によって挟まれた部分とその他の部分とで異なる種類の
分割記号を生成可能な機械翻訳装置を提供することにあ
る。Therefore, an object of the present invention is to be able to generate a plurality of types of division symbols, and to generate different types of division symbols between a portion sandwiched by a specific pair of symbols including parentheses and other portions. To provide a machine translation device.

【００２１】[0021]

【課題を解決するための手段】上記目的を達成するた
め、請求項１に係る発明は、入力部からの入力文に対し
て形態素解析部によって形態素解析処理を実施し,構文
解析部によって上記形態素解析処理結果に基づいてソー
ス言語の構文構造を得る構文解析処理を実施し,構文変
換部によって上記ソース言語の構文構造をターゲット言
語の構文構造に変換すると共にそのターゲット言語の構
文構造に対応する訳語列を得る構文変換処理を実施し,
翻訳文生成部によって上記ターゲット言語の構文構造と
訳語列とに基づいて翻訳文を生成する翻訳文生成処理を
実施して,上記入力文を翻訳文に変換するに際して,上記
構文解析部,構文変換部および翻訳文生成部は,当該入力
文が所定の翻訳条件に合致する場合には当該入力文を複
数の部分構造に分割して処理する分割翻訳処理を実施す
る機械翻訳装置において、上記分割翻訳処理に際して,
上記構文変換部によって得られた訳語列中に所定の条件
を満たす部分が存在するか否かを判定する文中条件判定
部と、上記文中条件判定部による判定結果に基づいて,
上記翻訳文生成部で上記訳語列に基づいて生成された翻
訳文における上記部分構造の境界位置に,その境界位置
に対応する上記訳語列における位置が上記所定の条件を
満たす部分に在る場合と上記所定の条件を満たさない部
分に在る場合とで異なる種類の分割記号を挿入する分割
記号生成部を備えたことを特徴としている。In order to achieve the above object, the invention according to claim 1 is such that a morpheme analysis unit performs a morpheme analysis process on an input sentence from an input unit, and a syntactic analysis unit performs the morpheme analysis. The parsing process is performed to obtain the syntactic structure of the source language based on the result of the parsing process, and the syntactic conversion unit converts the syntactic structure of the source language into the syntactic structure of the target language and the translated word corresponding to the syntactic structure of the target language. Perform the syntax conversion process to get the sequence,
A translation sentence generation process is performed by the translation sentence generation unit to generate a translation sentence based on the syntactic structure of the target language and a translated word string, and when the input sentence is converted into a translation sentence, the syntax analysis unit and the syntax conversion unit In the machine translation device that executes a divided translation process in which the input sentence is divided into a plurality of partial structures and processed when the input sentence matches a predetermined translation condition, the division translation When processing,
Based on the determination result by the in-sentence condition determination unit that determines whether or not a portion satisfying a predetermined condition exists in the translated word string obtained by the syntax conversion unit, and the determination result by the in-sentence condition determination unit,
At the boundary position of the partial structure in the translated sentence generated based on the translated word string by the translated sentence generation unit, the position in the translated word string corresponding to the boundary position is in a portion that satisfies the predetermined condition. It is characterized in that a division symbol generation unit for inserting a division symbol of a different type from that in the case where the predetermined condition is not satisfied is provided.

【００２２】また、請求項２に係る発明は、請求項１に
係る発明の機械翻訳装置において、上記文中条件判定部
は,上記構文変換部で得られた訳語列中に特定の一対の
文字記号によって挟まれた部分が存在するか否かを判定
する文字記号判定手段を有して、上記分割記号生成部
は,上記翻訳文における上記特定の一対の文字記号によ
って挟まれた部分と上記特定の一対の文字記号によって
挟まれていない部分とで異なる種類の分割記号を挿入す
ることを特徴としている。The invention according to claim 2 is the machine translation device according to claim 1, wherein the in-sentence condition determination part is a pair of specific character symbols in the translated word string obtained by the syntax conversion part. The division symbol generation unit has a character / symbol determination means for determining whether or not there is a portion sandwiched between, and the division symbol generation unit includes the portion sandwiched by the specific pair of character symbols in the translation and the specific It is characterized in that different types of division symbols are inserted in a portion that is not sandwiched by a pair of character symbols.

【００２３】また、請求項３に係る発明は、請求項２に
記載の機械翻訳装置において、上記文字記号判定手段
は,上記構文変換部で得られた訳語列中に括弧によって
挟まれた部分が存在するか否かを判定する括弧判定手段
であって、上記分割記号生成部は,上記翻訳文における
括弧によって挟まれた部分と括弧によって挟まれていな
い部分とで異なる種類の分割記号を挿入することを特徴
としている。The invention according to claim 3 is the machine translation device according to claim 2, wherein the character symbol determination means includes a portion between parentheses in the translated word string obtained by the syntax conversion section. Parenthesis determination means for determining whether or not they exist, wherein the division symbol generation unit inserts different types of division symbols between the portion enclosed by the parentheses and the portion not enclosed by the parentheses in the translated sentence. It is characterized by that.

【００２４】[0024]

【作用】請求項１に係る発明では、入力部からソース言
語による文が入力されると、形態素解析部によって形態
素解析処理が実施される。そして、構文解析部による構
文解析処理によって、上記形態素解析処理結果に基づい
て、当該入力文が所定の翻訳条件に合致する場合には複
数の部分構造に分割されてソース言語の構文構造が得ら
れる。さらに、構文変換部による構文変換処理によっ
て、上記ソース言語の構文構造がターゲット言語の構文
構造に変換されると共に、そのターゲット言語の構文構
造に対応する訳語列が得られる。そうすると、文中条件
判定部によって、上記構文変換部で得られた訳語列中に
所定の条件を満たす部分が存在するか否かが判定され
る。そして、上記翻訳文生成部によって上記ターゲット
言語の構文構造および訳語列に基づいて翻訳文が生成さ
れる際に、生成された翻訳文における上記部分構造の境
界位置に、上記文中条件判定部による判定結果に基づい
て、分割記号生成部によって、上記境界位置に対応する
上記訳語列中における位置が上記所定の条件を満たす部
分に在る場合と満たさない部分に在る場合とで異なる種
類の分割記号が挿入される。In the invention according to claim 1, when a sentence in the source language is input from the input unit, the morpheme analysis process is executed by the morpheme analysis unit. Then, by the syntactic analysis process by the syntactic analysis unit, based on the result of the morphological analysis process, when the input sentence matches a predetermined translation condition, it is divided into a plurality of partial structures to obtain the syntactic structure of the source language. . Further, by the syntax conversion processing by the syntax conversion unit, the syntax structure of the source language is converted into the syntax structure of the target language, and a translated word string corresponding to the syntax structure of the target language is obtained. Then, the in-sentence condition determination unit determines whether or not there is a portion satisfying a predetermined condition in the translated word string obtained by the syntax conversion unit. Then, when the translated sentence is generated by the translated sentence generation unit based on the syntactic structure and the translated word string of the target language, determination is made by the in-sentence condition determination unit at the boundary position of the partial structure in the generated translated sentence. Based on the result, by the division symbol generation unit, different types of division symbols depending on whether the position in the translation string corresponding to the boundary position is in a portion that satisfies the predetermined condition or not. Is inserted.

【００２５】こうして、上記翻訳文生成部から出力され
る翻訳文における上記部分構造の境界位置には、上記所
定の条件を満たす部分と満たさない部部とで異なる種類
の分割記号が挿入される。In this way, different kinds of division symbols are inserted at the boundary position of the partial structure in the translated text output from the translated text generation section, for the part that satisfies the predetermined condition and the part that does not.

【００２６】また、請求項２に係る発明では、上記文中
条件判定部の文字記号判定手段によって、上記構文変換
部で得られた訳語列中に特定の一対の文字記号で挟まれ
た部分が存在するか否かが判定される。そして、上記分
割記号生成部によって、上記文中条件判定部による判定
結果に基づいて、上記翻訳文生成部で生成された翻訳文
における上記部分構造の境界位置に、上記特定の一対の
文字記号によって挟まれた部分と挟まれていない部分と
で異なる種類の分割記号が挿入される。In the invention according to claim 2, there is a portion sandwiched by a specific pair of character symbols in the translated word string obtained by the syntax conversion unit by the character symbol determination means of the sentence condition determination unit. It is determined whether or not to do it. Then, based on the determination result by the in-sentence condition determination unit, the division symbol generation unit inserts the specific pair of character symbols at the boundary position of the partial structure in the translation sentence generated by the translation sentence generation unit. Different types of division symbols are inserted between the part that is sandwiched and the part that is not sandwiched.

【００２７】また、請求項３に係る発明では、上記文中
条件判定部における括弧判定手段によって、上記構文変
換部で得られた訳語列中に括弧で挟まれた部分が存在す
るか否かが判定される。そして、上記分割記号生成部に
よって、上記文中条件判定部による判定結果に基づい
て、上記翻訳文生成部で生成された翻訳文における上記
部分構造の境界位置に、括弧によって挟まれた部分と挟
まれていない部分とで異なる種類の分割記号が挿入され
る。Further, in the invention according to claim 3, the parenthesis determining means in the sentence condition determining section determines whether or not there is a portion enclosed by parentheses in the translated word string obtained by the syntax converting section. To be done. Then, based on the determination result by the in-sentence condition determination unit, the division symbol generation unit inserts a portion sandwiched between parentheses at the boundary position of the partial structure in the translated sentence generated by the translated sentence generation unit. Different types of division symbols are inserted depending on the part that is not filled.

【００２８】[0028]

【実施例】以下、この発明を図示の実施例により詳細に
説明する。図１は本実施例の機械翻訳装置におけるブロ
ック図である。本機械翻訳装置における入力部１２,出
力部１３,バスライン１４および記憶部１６は、図７に
示す従来の機械翻訳装置における入力部２,出力部３,バ
スライン４および記憶部６と同一構成を成しており、詳
細な説明は省略する。The present invention will be described in detail below with reference to the embodiments shown in the drawings. FIG. 1 is a block diagram of the machine translation apparatus of this embodiment. The input unit 12, the output unit 13, the bus line 14, and the storage unit 16 in this machine translation device have the same configuration as the input unit 2, the output unit 3, the bus line 4, and the storage unit 6 in the conventional machine translation device shown in FIG. The detailed description is omitted.

【００２９】翻訳用メインメモリ１７は、図７に示す従
来の機械翻訳装置における翻訳用メインメモリ７と同様
に原文バッファ１７a,辞書引きバッファ１７b,変換前の
構文バッファ１７c,変換後の構文バッファ１７dおよび
訳文バッファ１７eを有して、翻訳処理の際に得られる
情報を一旦格納する。さらに、日本語の構造解析木にお
ける各要素の訳語を指示する訳語ポインタＴＷ、およ
び、開き括弧が出現したことを表す括弧フラグＫを有し
ている。また、翻訳モジュール１５は、図７に示す従来
の機械翻訳装置における翻訳モジュール５と同様に、辞
書引き・形態素解析部１５a,構文解析部１５b,構文変換
部１５cおよび上記翻訳文生成部としてのターゲット言
語生成部１５dを有しており、図６に示す解析レベルＬ₃
の構文解析までの解析を行って、ソース言語による入力
文をターゲット言語による翻訳文に変換する。The translation main memory 17 is similar to the translation main memory 7 in the conventional machine translation device shown in FIG. 7, and includes an original sentence buffer 17a, a dictionary lookup buffer 17b, a pre-conversion syntax buffer 17c, and a post-conversion syntax buffer 17d. And a translation buffer 17e for temporarily storing information obtained during translation processing. Further, it has a translation word pointer TW for pointing a translation word of each element in the Japanese structural analysis tree, and a parenthesis flag K indicating that an opening parenthesis has appeared. Further, the translation module 15 is the same as the translation module 5 in the conventional machine translation device shown in FIG. It has a language generation unit 15d and has an analysis level L ₃ shown in FIG.
The input sentence in the source language is converted into the translated sentence in the target language by performing the parsing up to the parsing.

【００３０】さらに、本実施例における翻訳モジュール
１５のターゲット言語生成部１５dは、本実施例におけ
る特徴である文中条件判定部１５eおよび複数種類の分
割記号を生成する分割記号生成部１５fを有している。
ここで、上記構文変換部１５cによって決定されたター
ゲット言語による訳語列中に特定の一対の文字記号が存
在するか否かを判定可能なように文中条件判定部１５e
を構成することによって、分割記号生成部１５fは翻訳
文における特定の一対の文字記号によって挟まれている
部分か否かに応じて異なった分割記号を生成することが
可能となる。Further, the target language generation unit 15d of the translation module 15 in this embodiment has a sentence condition determination unit 15e, which is a feature of this embodiment, and a division symbol generation unit 15f that generates a plurality of types of division symbols. There is.
Here, the in-sentence condition determination unit 15e is provided so that it can be determined whether or not a specific pair of character symbols is present in the translated word string in the target language determined by the syntax conversion unit 15c.
With this configuration, the division symbol generation unit 15f can generate different division symbols depending on whether or not the portion is sandwiched by a specific pair of character symbols in the translated sentence.

【００３１】特に、上記特定の一対の文字記号の一つで
ある一対の括弧が上記ターゲット言語による訳語列中に
存在するか否かを判定可能なように文中条件判定部１５
eを構成すれば、分割記号生成部１５fは、翻訳文中にお
ける一対の括弧によって挟まれている部分か否かに応じ
て異なった分割記号を生成できのである。尚、本実施例
の機械翻訳装置における文中条件判定部１５eは、上記
訳語列中に一対の括弧が存在するか否かを判定するよう
に構成されているものとする。また、以下の説明におい
ては、ソース言語を英語とする一方、ターゲット言語を
日本語とする。In particular, the in-sentence condition determining unit 15 is provided so that it can be determined whether or not a pair of parentheses, which is one of the specific pair of character symbols, exists in the translated word string in the target language.
By configuring e, the division symbol generation unit 15f can generate different division symbols depending on whether or not the portion is sandwiched by a pair of parentheses in the translated sentence. The in-sentence condition determination unit 15e in the machine translation device of the present embodiment is configured to determine whether or not a pair of parentheses are present in the translated word string. In the following description, the source language is English and the target language is Japanese.

【００３２】図１に示す機械翻訳装置は、翻訳制御部１
１の制御の下に、入力部１２,出力部１３,翻訳モジュー
ル１５および記憶部１６によって、次ぎのようにして分
割翻訳処理を実施する。図２および図３は分割翻訳処理
動作のフローチャートである。以下、図２および図３に
従って分割翻訳処理動作について詳細に説明する。The machine translation device shown in FIG.
Under the control of No. 1, the input unit 12, the output unit 13, the translation module 15, and the storage unit 16 perform the divided translation process as follows. 2 and 3 are flowcharts of the divided translation processing operation. The divided translation processing operation will be described in detail below with reference to FIGS. 2 and 3.

【００３３】ステップＳ1で、上記入力部１２によって
読み込まれた英語による原文が、翻訳用メインメモリ１
７における原文バッファ１７aに格納される。尚、本実
施例における原文は、従来の分割翻訳処理機能を有する
機械翻訳装置の動作説明の際に用いた原文と同じ「 The demand ( as measured by the aggregate,busines
s investment ) increased all through the year.」であるとする。In step S1, the original text in English read by the input unit 12 is converted into the main memory 1 for translation.
7 is stored in the original text buffer 17a. The original sentence in the present embodiment is the same as the original sentence used when explaining the operation of the conventional machine translation device having the divided translation processing function, `` The demand (as measured by the aggregate, busines
s investment) increased all through the year. "

【００３４】ステップＳ2で、上記翻訳モジュール１５
の辞書引き・形態素解析部１５aによって、原文バッファ
１７aに格納された原文が読み出され、図７に示す翻訳
モジュール５における辞書引き・形態素解析部５aの場合
と同様の形態素解析が実行される。そして、得られた情
報が翻訳用メインメモリ１７における辞書引きバッファ
１７bに格納される。ステップＳ3で、上記構文解析部１
５bによって、図７に示す構文解析部５bの場合と同様の
構文解析が実行されて、図１１に示すような英語の構造
解析木が決定される。そして、得られた英語の構造解析
木は、翻訳用メインメモリ１７における変換前の構文バ
ッファ１７cに格納される。その際に、当該原文は、例
えば“登録された文法規則を用いて構造解析木を決定で
きない"という翻訳条件に合致するので、当該構造解析
木は“分割要素"という特別の要素を併用して組み立て
られている。ステップＳ4で、上記構文変換部１５cによ
って、記憶部１６に格納された木構造変換規則に基づい
て、図７に示す構文変換部５cの場合と同様の構文変換
が実行されて、英語の構造解析木が図１２に示すような
日本語の構造解析木に変換される。そして、得られた日
本語の構造解析木は翻訳用メインメモリ１７における変
換後の構文バッファ１７dに格納される。In step S2, the translation module 15
The original sentence stored in the original sentence buffer 17a is read by the dictionary lookup / morpheme analysis unit 15a, and the same morpheme analysis as that of the dictionary lookup / morpheme analysis unit 5a in the translation module 5 shown in FIG. 7 is executed. Then, the obtained information is stored in the dictionary lookup buffer 17b in the translation main memory 17. In step S3, the parser 1
5b executes the same syntactic analysis as in the syntactic analysis unit 5b shown in FIG. 7 to determine the English structural parse tree as shown in FIG. Then, the obtained English structural analysis tree is stored in the pre-conversion syntax buffer 17c in the translation main memory 17. At that time, since the original sentence matches the translation condition that "the structural analysis tree cannot be determined using the registered grammatical rule", the structural analysis tree uses a special element called "dividing element" together. It is assembled. In step S4, the syntactic conversion unit 15c executes syntactic conversion similar to that of the syntactic conversion unit 5c shown in FIG. 7 based on the tree structure conversion rule stored in the storage unit 16 to analyze the English structure. The tree is converted into a Japanese structural analysis tree as shown in FIG. Then, the obtained Japanese structural analysis tree is stored in the translated syntax buffer 17d in the translation main memory 17.

【００３５】以下、上記ターゲット言語生成部１５dに
よる翻訳文生成処理が実行される(ステップＳ5〜ステッ
プＳ23)。但し、そのうちの、ステップＳ7〜ステップＳ
10は文中条件判定部１５eによる文中条件判定処理であ
り、ステップＳ19〜ステップＳ21は分割記号生成部１５
fによる分割記号生成処理である。ステップＳ5で、上記
翻訳用メインメモリ１７に設けられた訳語ポインタＴＷ
に“１"がセットされる。ステップＳ6で、上記翻訳用メ
インメモリ１７における括弧フラグＫに“０"がセット
される。Thereafter, the translated sentence generation process by the target language generation unit 15d is executed (steps S5 to S23). However, among them, steps S7 to S
Reference numeral 10 denotes a sentence condition determination process by the sentence condition determination unit 15e, and steps S19 to S21 are division symbol generation unit 15
This is a division symbol generation process by f. In step S5, the translation word pointer TW provided in the translation main memory 17
Is set to "1". In step S6, the parenthesis flag K in the translation main memory 17 is set to "0".

【００３６】ステップＳ7で、上記変換後の構文バッフ
ァ１７dに日本語の構造解析木に付加されて格納されて
いる訳語列の内容が参照されて、訳語ポインタＴＷによ
って指示される“ＴＷ"番目の訳語が開き括弧“（"であ
るか否かが判別される。その結果、開き括弧であればス
テップＳ8に進み、そうでなければステップＳ9に進む。
ステップＳ8で、上記括弧フラグＫに“１"がセットされ
る。ステップＳ9で、同様にして“ＴＷ"番目の訳語が閉
じ括弧“）"であるか否かが判別される。その結果、閉
じ括弧であればステップＳ10に進み、そうでなければス
テップＳ11に進む。ステップＳ10で、上記括弧フラグＫ
に“０"がセットされる。すなわち、上記ステップＳ7お
よびステップＳ9によって上記括弧判定手段を構成する
のである。In step S7, the contents of the translated word string added to the Japanese structural analysis tree and stored in the converted syntax buffer 17d is referred to, and the "TW" -th point designated by the translated word pointer TW is displayed. It is determined whether or not the translated word is an open parenthesis "(". As a result, if it is an open parenthesis, the process proceeds to step S8, and if not, the process proceeds to step S9.
In step S8, the parenthesis flag K is set to "1". In step S9, it is similarly determined whether or not the "TW" th translated word is a closing bracket ")". As a result, if it is a closing parenthesis, the process proceeds to step S10, and if not, the process proceeds to step S11. In step S10, the parenthesis flag K
Is set to "0". That is, the parenthesis determination means is constituted by the steps S7 and S9.

【００３７】ステップＳ11で、上記辞書引きバッファ１
７b内の情報が参照されて“ＴＷ"番目の訳語は型変化が
必要な訳語であるか否かが判別される。その結果、型変
化が必要な訳語であればステップＳ12に進み、そうでな
ければステップＳ13に進む。ステップＳ12で、上記“Ｔ
Ｗ"番目の訳語が型変化される。ステップＳ13で、上記
“ＴＷ"番目の訳語(当該訳語が型変化が必要な訳語であ
れば上記ステップＳ12において型変化された後の訳語)
が、翻訳用メインメモリ１７における訳文バッファ１７
eに格納される。その際に、既に訳文バッファ１７eに訳
語列が格納されている場合には、その訳語列の最後尾に
格納される。In step S11, the dictionary lookup buffer 1
By referring to the information in 7b, it is judged whether or not the "TW" th translation word is a translation word that requires a pattern change. As a result, if the translated word requires a pattern change, the process proceeds to step S12, and if not, the process proceeds to step S13. In step S12, the above "T
The W "-th translated word is changed in type. In step S13, the" TW "-th translated word (if the translated word requires a type change, the translated word after the type is changed in step S12).
Is a translation buffer 17 in the main memory 17 for translation.
Stored in e. At that time, if the translated word string is already stored in the translated text buffer 17e, it is stored at the end of the translated word string.

【００３８】ステップＳ14で、上記変換後の構文バッフ
ァ１７dの内容および辞書引きバッファ１７b内の情報が
参照されて、“ＴＷ"番目の訳語には付属語が必要であ
るか否かが判別される。その結果、必要であればステッ
プＳ15に進み、そうでなければステップＳ16に進む。ス
テップＳ15で、上記“ＴＷ"番目の訳語に付加すべき付
属語が、訳文バッファ１７eにおける上記ステップＳ13
において格納された“ＴＷ"番目の訳語の次の位置に格
納される。In step S14, the contents of the converted syntax buffer 17d and the information in the dictionary lookup buffer 17b are referred to, and it is determined whether or not the "TW" -th translation word requires an adjunct word. . As a result, if necessary, the process proceeds to step S15, and if not, the process proceeds to step S16. In step S15, the adjunct word to be added to the "TW" th translated word is the step S13 in the translated text buffer 17e.
It is stored in the position next to the "TW" -th translated word stored in.

【００３９】ステップＳ16で、上記変換後の構文バッフ
ァ１７dに格納されている日本語の構造解析木を上記
“ＴＷ"番目の訳語から上位に溯った際に“分割要素"ノ
ードに至るか否か、つまり、当該日本語の構造解析木に
おける“ＴＷ"番目の訳語の上位に“分割要素"ノードが
在るか否かが判別される。その結果、在ればステップＳ
17に進み、無ければステップＳ22に進む。ステップＳ17
で、上記“ＴＷ"番目の訳語は、日本語の構造解析木を
上位に溯った際に最初に遭遇する“分割要素"ノードを
頂点とする部分木における右端に位置するノードである
か否かが判別される。その結果、部分木における右端の
ノードであればステップＳ18に進み、そうでなければス
テップＳ22に進む。ステップＳ18で、上記“ＴＷ"番目
の訳語は、変換後の構文バッファ１７dに格納されてい
る訳語列における最終訳語であるか否かが判別される。
その結果、最終訳語であればステップＳ22に進み、そう
でなければステップＳ19に進む。In step S16, when the Japanese structural analysis tree stored in the converted syntax buffer 17d is traced upward from the "TW" th translated word, it is determined whether the "partition element" node is reached. That is, it is determined whether or not there is a “partition element” node above the “TW” -th translated word in the Japanese structural analysis tree. As a result, if there is, step S
If not, go to step S22. Step S17
Whether the "TW" th translated word is the node located at the right end of the subtree whose vertex is the "splitting element" node encountered first when the Japanese structural analysis tree is descended Is determined. As a result, if the node is the rightmost node in the subtree, the process proceeds to step S18, and if not, the process proceeds to step S22. In step S18, it is determined whether or not the "TW" th translation word is the final translation word in the translation word string stored in the converted syntax buffer 17d.
As a result, if it is the final translated word, the process proceeds to step S22, and if not, the process proceeds to step S19.

【００４０】ステップＳ19で、上記翻訳用メインメモリ
１７における括弧フラグＫの内容は“０"であるか否か
が判別される。その結果“０"であればステップＳ20に
進み、そうでなければステップＳ21に進む。ステップＳ
20で、開き括弧“（"と閉じ括弧“）"とで挟まれた部分
以外における“分割要素"ノードを頂点とする部分木の
境界に挿入する第１の分割記号“‖"が、訳文バッファ
１７eにおける上記ステップＳ13において格納された
“ＴＷ"番目の訳語あるいは上記ステップＳ15において
格納された付属語の次の位置に格納される。ここで、上
記第１の分割記号は開き括弧と閉じ括弧とで挟まれた部
分以外における区切りの箇所に挿入されるのであるか
ら、括弧“（，）"の形状が有する区切りとしての印象
よりも強い印象を有する形状の記号“‖"を第１の分割
記号とするのである。In step S19, it is determined whether or not the content of the parenthesis flag K in the translation main memory 17 is "0". If the result is "0", the process proceeds to step S20, and if not, the process proceeds to step S21. Step S
At 20, the first partition symbol “‖” to be inserted at the boundary of the subtree whose vertex is the “partition element” node other than the part between the opening parenthesis “(” and the closing parenthesis “)” is the translation buffer. It is stored in the position next to the "TW" -th translated word stored in step S13 in step 17e or the attached word stored in step S15. Here, since the first division symbol is inserted at a delimiter portion other than the portion sandwiched between the opening parenthesis and the closing parenthesis, it is more preferable than the impression that the shape of the parentheses “(,)” is the delimiter. The symbol “‖” having a strong impression is used as the first division symbol.

【００４１】ステップＳ21で、開き括弧“（”と閉じ括
弧“）”とで挟まれた部分における“分割要素"ノード
を頂点とする部分木の境界に挿入する第２の分割記号
“…”が、訳文バッファ１７eにおける上記“ＴＷ"番目
の訳語あるいは上記付属語の次の位置に格納される。こ
こで、上記第２の分割記号は開き括弧と閉じ括弧とで挟
まれた部分における区切りの箇所に挿入されるのである
から、括弧“（，）"の形状が有する区切りとしての印
象よりも弱い印象を有する形状の記号“…"を第２の分
割記号とするのである。In step S21, the second division symbol "..." to be inserted into the boundary of the subtree having the "partitioning element" node in the portion sandwiched between the opening parenthesis "(" and the closing parenthesis ")" is , Is stored in the translated text buffer 17e at a position next to the "TW" th translated word or the adjunct word. Here, since the second division symbol is inserted at the delimiter in the part sandwiched between the opening parenthesis and the closing parenthesis, it is weaker than the impression that the shape of the parentheses "(,)" is as a delimiter. The symbol "..." Having an impression is used as the second division symbol.

【００４２】ステップＳ22で、上記“ＴＷ"番目の訳語
は、変換後の構文バッファ１７dに格納されている訳語
列における最終訳語であるか否かが判別される。その結
果、最終訳語であればステップＳ24に進み、そうでなけ
ればステップＳ23に進む。ステップＳ23で、訳語ポイン
タＴＷの内容がインクリメントされて上記ステップＳ7
に戻り、上記訳語列における次ぎの訳語に対する処理に
移行する。ステップＳ24で、上記出力部１３によって、
図４に示すように翻訳用メインメモリ１７における訳文
バッファ１７eに格納されている日本語による翻訳文が
読み出されて、図５に示すように原文と対応付けられて
出力される。そうした後、分割翻訳処理動作を終了す
る。In step S22, it is determined whether or not the "TW" th translation word is the final translation word in the translation word string stored in the converted syntax buffer 17d. As a result, if it is the final translated word, the process proceeds to step S24, and if not, the process proceeds to step S23. In step S23, the content of the translated word pointer TW is incremented and the above step S7 is executed.
The process returns to the process for the next translated word in the translated word sequence. In step S24, the output unit 13 causes
As shown in FIG. 4, the Japanese translated sentence stored in the translated sentence buffer 17e in the translation main memory 17 is read out and is output in association with the original sentence as shown in FIG. After that, the divided translation processing operation ends.

【００４３】次に、上記ターゲット言語生成部１５dに
よって実行される翻訳文生成処理について、当該原文「 The demand ( as measured by the aggregate,busines
s investment ) increased all through the year.」に則して、より具体的に説明する。ここで、既に、上記
翻訳モジュール１５の辞書引き・形態素解析部１５aによ
る形態素解析,構文解析部１５bによる構文解析および構
文変換部１５cによる構文変換が上述のように実行され
て、得られた日本語の構造解析木がこの日本語の構造解
析木における各要素の訳語の列「需要（総計よって測定するように企業投資）
増大するその年間のすべて」と共に、翻訳用メインメモリ１７における変換後の構文
バッファ１７dに格納されているものとする。Next, regarding the translated sentence generation processing executed by the target language generation unit 15d, the original sentence "The demand (as measured by the aggregate, busines
s investment) increased all through the year. " Here, the morphological analysis by the dictionary lookup / morphological analysis unit 15a, the syntactic analysis by the syntactic analysis unit 15b, and the syntactic conversion by the syntactic conversion unit 15c of the translation module 15 have already been performed as described above, and the obtained Japanese is obtained. The structural analysis tree of is the sequence of translated words of each element in this Japanese structural analysis tree "Demand (Corporate investment as measured by total)".
It is assumed that it is stored in the post-conversion syntax buffer 17d in the translation main memory 17 together with "all the increasing year."

【００４４】先ず、上記変換後の構文バッファ１７dに
格納されている訳語列のうち、(ＴＷ＝１)番目の訳語
「需要」に関する処理が実施される。訳語「需要」は、開き
括弧および閉じ括弧の何れでもなく、名詞であるから型
変化の必要がないので、訳語「需要」がそのまま訳文バッ
ファ１７eに格納される。…ステップＳ5〜ステップＳ7,
ステップＳ9,Ｓ11,Ｓ13 さらに、上記訳語「需要」は、付属語を必要とはせず、日
本語の構造解析木における上位に“分割要素"ノードが
存在し、その“分割要素"ノードを頂点とする部分木の
右端のノードではなく、上記訳語列における最終訳語で
はないので、１番目の訳語「需要」に関する処理を終了す
る。…ステップＳ14,Ｓ16,Ｓ17,Ｓ22First, of the translated word strings stored in the converted syntax buffer 17d, the processing for the (TW = 1) th translated word "demand" is executed. Since the translated word "demand" is neither an open parenthesis nor a closed parenthesis and is a noun and does not need to be changed in shape, the translated word "demand" is stored in the translated text buffer 17e as it is. ... Step S5 to Step S7,
Steps S9, S11, S13 Furthermore, the translated word "demand" does not require an adjunct word, and there is a "partitioning element" node at the upper level in the Japanese structural analysis tree, and the "partitioning element" node is the vertex. Since it is not the node at the right end of the subtree and is not the final translated word in the translated word string, the process relating to the first translated word "demand" ends. ... Steps S14, S16, S17, S22

【００４５】次に、(ＴＷ＝２)番目の訳語「（」は開き括
弧であるので括弧フラグＫに“１"がセットされる。ま
た、閉じ括弧ではなく型変化の必要がないために、訳語
「（」が訳文バッファ１７eに格納される。その結果、訳
文バッファ１７eの内容は「需要（」となる。さらに、訳語「（」は、付属語を必要とはせず、
日本語の構造解析木における上位に“分割要素"ノード
が存在し、その“分割要素"ノードを頂点とする部分木
の右端のノードではなく、上記訳語列における最終訳語
ではないので、２番目の訳語「（」に関する処理を終了す
る。…ステップＳ23,ステップＳ7〜ステップＳ9,ステッ
プＳ11,Ｓ13,Ｓ14,Ｓ16,Ｓ17,Ｓ22Next, since the (TW = 2) th translated word "(" is an open parenthesis, "1" is set in the parenthesis flag K. Moreover, since it is not a closing parenthesis and there is no need to change the type, The translated word “(” is stored in the translated text buffer 17e. As a result, the content of the translated text buffer 17e becomes “demand (). Furthermore, the translated word“ (”does not require an adjunct word,
Since there is a "partitioning element" node in the upper part of the Japanese structural analysis tree, and it is not the node at the right end of the subtree with the "partitioning element" node as the vertex, and is not the final translation word in the above translation string, the second The process for the translated word "(" is finished .... Step S23, Step S7 to Step S9, Steps S11, S13, S14, S16, S17, S22.

【００４６】次に、(ＴＷ＝３)番目の訳語「統計」は、開
き括弧でも閉じ括弧でもなく、型変化の必要がないため
に、訳語「統計」が訳文バッファ１７eに格納される。ま
た、訳語「統計」には付属語「に」が必要なので、付属語
「に」が訳文バッファ１７eに格納される。その結果、訳
文バッファ１７eの内容は「需要（統計に」となる。さらに、上記訳語「統計」は、日本語の構造解析
木における上位に“分割要素"ノードが存在し、その
“分割要素"ノードを頂点とする部分木の右端のノード
ではなく、上記訳語列における最終訳語ではないので、
３番目の訳語「統計」に関する処理を終了する。…ステッ
プＳ23,Ｓ7,Ｓ9,Ｓ11,ステップＳ13〜ステップＳ17,Ｓ
２２Next, since the (TW = 3) th translated word "statistics" is neither an open parenthesis nor a closed parenthesis, and there is no need to change the type, the translated word "statistics" is stored in the translated text buffer 17e. Further, since the translated word "statistical" requires the adjunct word "ni", the adjunct word "ni" is stored in the translated sentence buffer 17e. As a result, the content of the translated text buffer 17e becomes "demand (statistically). Furthermore, the translated word" statistical "has a" partitioning element "node at the upper level in the Japanese structural analysis tree, and the" partitioning element " Since it is not the node at the right end of the subtree with the node as the vertex and it is not the final translation in the above translation string,
The process for the third translated word "statistics" is ended. ... Steps S23, S7, S9, S11, Steps S13 to S17, S
22

【００４７】次に、（ＴＷ＝４)番目の訳語「よって」に
関する処理が１番目の訳語「需要」の場合と同様にして実
施されて、訳語「よって」が訳文バッファ１７eに格納さ
れる。その結果、訳文バッファ１７eの内容は「需要（統計によって」となる。次に、(ＴＷ＝５)番目の訳語「測定する」は、そ
の構文条件から、上記ステップＳ11において型変化の必
要があると判定され、ステップＳ14において付属語「た」
が必要であると判定されて、型変化後の訳語「測定され」
と付属語「た」とが訳文バッファ１７eに格納される。そ
の結果、訳文バッファ１７eの内容は「需要（統計によって測定された」となる。Next, the processing for the (TW = 4) th translated word "by" is performed in the same manner as the case of the first translated word "demand", and the translated word "by" is stored in the translated text buffer 17e. As a result, the content of the translated text buffer 17e becomes "demand (according to statistics). Next, the type of the (TW = 5) th translated word" measure "needs to be changed in the above step S11 due to its syntax condition. Is determined, and in step S14 the attached word "ta"
Is determined to be necessary, and the translated word "measured" after the type change
And the adjunct “ta” are stored in the translated text buffer 17e. As a result, the content of the translated text buffer 17e becomes "demand (measured by statistics").

【００４８】次に、(ＴＷ＝６)番目の訳語「ように」は、
開き括弧でも閉じ括弧でもなく、型変化の必要がないた
めに、訳語「ように」が訳文バッファ１７eに格納され
る。また、訳語「ように」は、付属語を必要とはせず、日
本語の構造解析木における上位に“分割要素"ノードが
存在し、その“分割要素"ノードを頂点とする部分木の
右端のノードであり、上記訳語列における最終訳語では
なく、括弧フラグＫの内容は“０"ではない(２番目の訳
語「（」に関する処理の際に“１"がセットされている)の
で、第２の分割記号“…"が訳文バッファ１７eに格納さ
れる。その結果、訳文バッファ１７eの内容は「需要（統計によって測定されたように…」となる。…ステップＳ23,Ｓ7,Ｓ9,Ｓ11,Ｓ13,Ｓ14,ステ
ップＳ16〜ステップＳ19,ステップＳ21,Ｓ22Next, the (TW = 6) th translated word "youyo" is
Since it is neither an opening parenthesis nor a closing parenthesis, and there is no need to change the type, the translated word "Yoyo" is stored in the translated text buffer 17e. In addition, the translated word "yoyo" does not require an adjunct word, and there is a "partition element" node at the upper level in the Japanese structural analysis tree, and the right end of the subtree whose vertex is the "partition element" node Node, which is not the final translated word in the translated word string, and the content of the parenthesis flag K is not “0” (“1” is set in the process for the second translated word “(”). The division symbol "..." of 2 is stored in the translated text buffer 17e, and as a result, the content of the translated text buffer 17e becomes "demand (as measured by statistics ...) .... Steps S23, S7, S9, S11 ,. S13, S14, Step S16 to Step S19, Step S21, S22

【００４９】次に、(ＴＷ＝７)番目の訳語「企業投資」に
関する処理が、１番目の訳語「需要」や４番目の「よって」
の場合と同様にして実施されて、訳語「企業投資」が訳文
バッファ１７eに格納される。その結果、訳文バッファ
１７eの内容は「需要（統計によって測定されたように…企業投資」となる。次に、(ＴＷ＝８)番目の訳語「）」は、上記ステ
ップＳ9において閉じ括弧であると判定される。そし
て、ステップＳ10において括弧フラグＫに“０"がセッ
トされる。また、ステップＳ14において付属語「は、」が
必要であると判定されて、訳語「）」と付属語「は、」とが
訳文バッファ１７eに格納される。その結果、訳文バッ
ファ１７eの内容は「需要（統計によって測定されたように…企業投資）
は、」となる。Next, the processing relating to the (TW = 7) th translation word "corporate investment" is performed by the first translation word "demand" and the fourth translation word "by".
The translation word "corporate investment" is stored in the translation sentence buffer 17e in the same manner as in the above case. As a result, the content of the translated text buffer 17e becomes "demand (as measured by statistics ... corporate investment". Next, the (TW = 8) th translated word ")" is a closing parenthesis in step S9. Is determined. Then, in step S10, the parenthesis flag K is set to "0". In step S14, it is determined that the attached word "ha" is necessary, and the translated word ")" and the attached word "ha" are stored in the translated text buffer 17e. As a result, the content of the translation buffer 17e is "demand (as measured by statistics ... corporate investment).
Will be ".

【００５０】次に、(ＴＷ＝９)番目の訳語「増大する」
は、開き括弧でも閉じ括弧でもなく、型変化の必要があ
り、型変化後の訳語「増大し」には付属語「た」が必要であ
るために、型変化後の訳語「増大し」と付属語「た」が訳文
バッファ１７eに格納される。また、上記訳語「増大す
る」は、日本語の構造解析木における上位に“分割要素"
ノードが存在し、その“分割要素"ノードを頂点とする
部分木の右端のノードであり、上記訳語列における最終
訳語ではなく、括弧フラグＫの内容は“０"である(８番
目の訳語「）」に関する処理の際に“０"がセットされて
いる)ので、第１の分割記号“‖"が訳文バッファ１７e
に格納される。その結果、訳文バッファ１７eの内容は「需要（統計によって測定されたように…企業投資）
は、増大した‖」となる。…ステップＳ23,Ｓ7,Ｓ9,ステップＳ11〜ステ
ップＳ20,Ｓ22Next, the (TW = 9) th translated word "increase"
Is neither an open parenthesis nor a closing parenthesis, and it requires a type change, and the translated word "increase" after the type change requires the adjunct "ta". The adjunct word "ta" is stored in the translated text buffer 17e. In addition, the above-mentioned translated word "increase" means "partitioning element" at the top of the Japanese structural analysis tree.
There is a node, which is the node at the right end of the subtree whose vertex is the "splitting element" node, and is not the final translated word in the translated word string, but the content of the parenthesis flag K is "0" (the eighth translated word " ) ”Is set to“ 0 ”during the process), the first division symbol“ ‖ ”is displayed in the translation buffer 17e.
Stored in. As a result, the content of the translation buffer 17e is "demand (as measured by statistics ... corporate investment).
Is increased ‖. " ... Steps S23, S7, S9, Steps S11 to S20, S22

【００５１】次に、(ＴＷ＝１０)番目の訳語「その」に関
する処理が、１番目の訳語「需要」等の場合と同様に実施
されて、訳語「その」が訳文バッファ１７eに格納され
る。その結果、訳文バッファ１７eの内容は「需要（統計によって測定されたように…企業投資）
は、増大した‖その」となる。次に、(ＴＷ＝１１)番目の訳語「年」に関する処
理が、３番目の訳語「統計」の場合と同様に実施されて、
訳語「年」と付属語「の」とが訳文バッファ１７eに格納さ
れる。その結果、訳文バッファ１７eの内容は「需要（統計によって測定されたように…企業投資）
は、増大した‖ その年の」となる。次に、(ＴＷ＝１２)番目の訳語「間の」および
(ＴＷ＝１３)番目の訳語「すべて」に関する処理が、１番
目の訳語「需要」等の場合と同様に実施されて、訳語「間
の」および訳語「全て。」が訳文バッファ１７eに格納され
る。その際に、最終訳語である「すべて」には句点が付加
される。その結果、訳文バッファ１７eの内容は「需要（統計によって測定されたように…企業投資）
は、増大した‖ その年の間の全て。」となる。そして、上記ステップＳ22において１３番目の
訳語は最終訳語であると判別されて、翻訳文生成処理が
終了するのである。Next, the process for the (TW = 10) th translated word "that" is executed in the same manner as the case of the first translated word "demand", etc., and the translated word "that" is stored in the translated sentence buffer 17e. . As a result, the content of the translation buffer 17e is "demand (as measured by statistics ... corporate investment).
Has increased ‖ that. " Next, the processing for the (TW = 11) th translated word “year” is performed in the same manner as the case of the third translated word “statistical”,
The translated word "year" and the attached word "no" are stored in the translated text buffer 17e. As a result, the content of the translation buffer 17e is "demand (as measured by statistics ... corporate investment).
Will increase that year. ” Next, the (TW = 12) th translated word "ma" and
The process for the (TW = 13) th translated word "all" is performed in the same manner as the case of the first translated word "demand", and the translated words "between" and the translated word "all." Are stored in the translated text buffer 17e. It At that time, punctuation marks are added to the final translated word "all". As a result, the content of the translation buffer 17e is "demand (as measured by statistics ... corporate investment).
Increased ‖ all over the year. Will be Then, in step S22, the 13th translated word is determined to be the final translated word, and the translated sentence generation process ends.

【００５２】上述のように、本実施例の機械翻訳装置
は、上記翻訳モジュール１５におけるターゲット言語生
成部１５dは文中条件判定部１５eおよび分割記号生成部
１５fを有する一方、翻訳用メインメモリ１７には括弧
フラグＫを設けている。そして、上記文中条件判定部１
５eによって、翻訳モジュール１５における構文変換部
１５cで得られた日本語の構造解析木における各要素の
訳語中に“開き括弧"が存在する場合には括弧フラグＫ
に“１"をセットする一方、“閉じ括弧"が存在する場合
には括弧フラグＫに“０"をセットする。そして、分割
記号生成部１５fは、構文変換部１５cによって得られた
日本語の構造解析木における“分割要素"ノードを頂点
とする部分木の境界に挿入する分割記号を生成するに際
して、括弧フラグＫの内容が“１"である場合には、括
弧の形状が有する区切りとしての印象よりも弱い印象を
有する第２の分割記号“…"を生成する一方、括弧フラ
グＫの内容が“０"である場合には、括弧の形状が有す
る区切りとしての印象よりも強い印象を有する第１の分
割記号“‖"を生成する。As described above, in the machine translation apparatus of this embodiment, the target language generation unit 15d in the translation module 15 has the sentence condition determination unit 15e and the division symbol generation unit 15f, while the translation main memory 17 has A parenthesis flag K is provided. Then, the sentence condition determination unit 1
If the "open parenthesis" is present in the translated word of each element in the Japanese structural analysis tree obtained by the syntax conversion unit 15c in the translation module 15 by 5e, the parenthesis flag K
Is set to "1", while "closed parenthesis" is present, the parenthesis flag K is set to "0". Then, the split symbol generation unit 15f generates the split symbol to be inserted into the boundary of the subtree having the “split element” node in the Japanese structural analysis tree obtained by the syntax conversion unit 15c as the vertex. Is "1", the second division symbol "...", which has a weaker impression than the shape of the parenthesis, is generated, while the content of the parenthesis flag K is "0". In some cases, a first split symbol "|" is generated that has a stronger impression than the impression as a delimiter that the shape of the brackets has.

【００５３】すなわち、本実施例によれば、上記ターゲ
ット言語生成部１５dによって生成される翻訳文におけ
る括弧内には括弧よりも弱い印象の第２の分割記号
“…"が挿入される一方、括弧外には括弧よりも強い印
象の第１の分割記号“‖"が挿入されることになる。し
たがって、出力部１３によって、図５に示すように出力
された翻訳文「需要（統計によって測定されたように…企業投資）
は、増大した‖ その年の間の全て。」においては、当該翻訳文は、大きく「需要（統計によって測定されたように…企業投資）
は、増大した‖」と「その年の間の全て。」との２つに分割され、そのうちの前半「需要（統計によって測定されたように…企業投資）
は、増大した‖」には挿入句「統計によって測定されたように…企業投資」が存在し、さらにこの挿入句「統計によって測定された
ように…企業投資」は、「統計によって測定されたように」と「企業投資」との２つに分割される構造になっていることが一目で分
かる。その結果、当該翻訳文は読み易く、翻訳文中の括
弧内に挿入された分割記号によって括弧内のまとまりの
印象が弱められることがなく、しかも、翻訳文における
括弧外の分割記号の効果が十分活かされるのである。That is, according to the present embodiment, the second division symbol "...", which is weaker than the parentheses, is inserted into the parentheses in the translated sentence generated by the target language generating unit 15d, while the parentheses are used. The first division symbol “‖”, which is stronger than the parentheses, will be inserted outside. Therefore, the output section 13 outputs the translated sentence "Demand (as measured by statistics ... Corporate investment) output as shown in FIG.
Increased ‖ all over the year. ", The translation is largely" demand (as measured by statistics ... corporate investment).
Is divided into two: “increased ‖” and “everything during the year.” The first half of them, “Demand (as measured by statistics… corporate investment).
"Increased ‖" has the introductory phrase "as measured by statistics ... corporate investment", and this introductory phrase "as measured by statistics ... corporate investment" is "measured by statistical It can be seen at a glance that it has a structure that is divided into two parts, such as “” and “corporate investment”. As a result, the translated text is easy to read, the impression of the unity inside the parentheses is not weakened by the division symbols inserted in the parentheses in the translation, and the effect of the division symbols outside the parentheses in the translation is sufficiently utilized. It is done.

【００５４】尚、上記分割記号生成部１５fによって生
成される第１の分割記号と第２の分割記号の種類は上記
実施例に限定されるものではない。要は、第１の分割記
号は括弧よりも強い印象の形状を有する一方、第２の分
割記号は括弧よりも弱い印象の形状を有していればよい
のである。The types of the first division symbol and the second division symbol generated by the division symbol generation unit 15f are not limited to those in the above embodiment. The point is that the first division symbol has a shape with a stronger impression than the parenthesis, while the second division symbol has a shape with a lower impression than the parentheses.

【００５５】また、上記実施例における文中条件判定部
１５eを、構文変換部１５cで得られたターゲット言語の
構造解析木に付加されている訳語列に括弧以外の特定の
一対の文字記号が存在することを判別可能に構成すれ
ば、翻訳文中における上記特定の一対の文字記号の内と
外とで分割記号の種類を変えることができる。そうする
ことによって、読み易く、翻訳文中の特定の一対の文字
記号内に挿入された分割記号によって当該一対の文字記
号による強調の度合が弱められることがなく、しかも、
強調する箇所が無い翻訳文における分割記号の効果が十
分活かされるのである。その際には、上述した分割翻訳
処理動作のフローチャートにおける上記ステップＳ7で
は“ＴＷ"番目の訳語が特定の一対の文字記号を構成す
る前方の文字記号(““"等)であるか否かを判別し、上
記ステップＳ9では“ＴＷ"番目の訳語が後方の文字記号
(“”"等)であるか否かを判別すればよい。In the in-sentence condition determining unit 15e in the above embodiment, the translated word string added to the structure analysis tree of the target language obtained by the syntax converting unit 15c has a specific pair of character symbols other than parentheses. If it is configured such that it can be discriminated, it is possible to change the type of division symbol between inside and outside of the specific pair of character symbols in the translated sentence. By doing so, it is easy to read, and the degree of emphasis by the pair of character symbols is not weakened by the division symbol inserted in the specific pair of character symbols in the translated sentence, and
The effect of the division symbol in a translated sentence that does not have a portion to be emphasized is fully utilized. At that time, in step S7 in the flowchart of the divided translation processing operation described above, it is determined whether or not the "TW" th translated word is a forward character symbol ("""or the like) forming a specific pair of character symbols. In step S9 above, the "TW" th translated word is the backward character symbol.
It suffices to determine whether it is ("""or the like).

【００５６】[0056]

【発明の効果】以上より明らかなように、請求項１に係
る発明の機械翻訳装置は、入力部からの入力文に対して
形態素解析部,構文解析部,構文変換部および翻訳文生成
部によって分割翻訳処理を実施するに際して、文中条件
判定部によって、上記構文変換部で得られた訳語列中に
所定の条件を満たす部分が存在するか否かを判定し、こ
の判定結果に基づいて、分割記号生成部によって、上記
翻訳文生成部で上記訳語列に基づいて生成された翻訳文
における上記部分構造の境界位置に、その境界位置に対
応する上記訳語列中における位置が上記所定の条件を満
たす部分に在る場合と満たさない部分に在る場合とで異
なる種類の分割記号を挿入するので、本機械翻訳装置に
よって得られる翻訳文における上記部分構造の境界位置
には、上記所定の条件を満たす部分と満たさない部分と
で異なる種類の分割記号が挿入される。したがって、こ
の発明によれば、翻訳文における上記所定の条件を満た
す部分が他の部分と区別する部分である場合には、上記
所定の条件を満たす部分に挿入される分割信号を他の部
分に挿入される分割信号よりも印象の弱い種類の分割信
号にすることによって、上記翻訳文における上記所定の
条件を満たす部分の他の部分との区別効果が上記分割信
号の挿入によって低下することがなく、理解しやすい翻
訳文を生成できる。As is apparent from the above, the machine translation device according to the first aspect of the present invention uses the morphological analysis unit, the syntactic analysis unit, the syntactic conversion unit and the translation sentence generation unit for the input sentence from the input unit. When performing the divided translation process, the sentence condition determination unit determines whether or not there is a portion satisfying a predetermined condition in the translated word string obtained by the syntax conversion unit, and based on this determination result, the At the boundary position of the partial structure in the translated sentence generated based on the translated word sequence by the translated sentence generation unit by the symbol generation unit, the position in the translated word sequence corresponding to the boundary position satisfies the predetermined condition. Since different kinds of division symbols are inserted depending on whether it is in a part or in a part that does not satisfy it, at the boundary position of the partial structure in the translation obtained by the machine translation device, the predetermined Different types of split symbols are inserted in the portion not filled with the part that meets the matter. Therefore, according to the present invention, when the portion of the translated sentence that satisfies the predetermined condition is a portion that distinguishes it from other portions, the divided signal inserted in the portion that satisfies the predetermined condition is set to the other portion. By using a divided signal of a kind having a weaker impression than the inserted divided signal, the effect of distinguishing the portion satisfying the predetermined condition in the translated sentence from other portions is not deteriorated by the insertion of the divided signal. , It is possible to generate a translation that is easy to understand.

【００５７】また、請求項２に係る発明の機械翻訳装置
における上記文中条件判定部は、上記構文変換部で得ら
れた訳語列中に特定の一対の文字記号によって挟まれた
部分が存在するか否かを判定する文字記号判定手段を有
して、上記分割記号生成部によって、翻訳文における上
記特定の一対の文字記号によって挟まれた部分と挟まれ
ていない部分とで異なる種類の分割記号を挿入するの
で、本機械翻訳装置で得られる翻訳文における上記部分
構造の境界位置には上記特定の一対の文字記号によって
挟まれている部分と挟まれていない部分とで異なる種類
の分割記号が挿入される。したがって、この発明によれ
ば、翻訳文中の特定の一対の文字記号内に挿入される分
割記号を他の部分に挿入される分割信号よりも弱い印象
の分割記号にすれば、翻訳文における上記特定の一対の
文字記号内に挿入される分割記号によって当該一対の文
字記号による強調の度合が弱められることがなく、且
つ、上記特定の一対の文字記号によって強調される部分
が無い翻訳文における分割記号の効果が十分活かされ
て、読みやすい翻訳文が生成されるのである。Further, in the sentence condition determination unit in the machine translation device according to the second aspect of the present invention, does the translated word sequence obtained by the syntax conversion unit include a portion sandwiched by a specific pair of character symbols? A character / symbol determination unit for determining whether or not the divided symbol generation unit determines different types of divided symbols in a portion sandwiched by the specific pair of character symbols and a portion not sandwiched in the translated sentence. Since it is inserted, at the boundary position of the partial structure in the translated text obtained by the machine translation device, different kinds of division symbols are inserted between the part sandwiched by the specific pair of character symbols and the part not sandwiched. To be done. Therefore, according to the present invention, if the division symbol inserted in a specific pair of character symbols in a translated sentence is made to be a division symbol having a weaker impression than the division signal inserted in another part, The division symbol inserted in the pair of character symbols does not weaken the degree of emphasis by the pair of character symbols, and the division symbol in the translated sentence in which there is no portion emphasized by the specific pair of character symbols The effect of is fully utilized and a legible translated sentence is generated.

【００５８】また、請求項３に係る発明の機械翻訳装置
における上記文中条件判定部の文字記号判定手段は、上
記構文変換部で得られた訳語列中に括弧によって挟まれ
た部分が存在するか否かを判定する括弧判定手段であっ
て、上記分割記号生成部によって、翻訳文における括弧
内の部分と括弧外の部分とで異なる種類の分割記号を挿
入するので、本機械翻訳装置によって得られる翻訳文に
おける上記部分構造の境界位置には括弧内の部分と括弧
外の部分とで異なる種類の分割記号が挿入される。した
がって、この発明によれば、翻訳文中の括弧内に挿入さ
れる分割記号を括弧外の部分に挿入される分割信号より
も弱い印象の分割記号にすれば、翻訳文における括弧内
に挿入された分割記号によって括弧内のまとまりの印象
が弱められることがなく、且つ、括弧外の分割記号の効
果が十分活かされて、読みやすい翻訳文が生成されるの
である。Further, in the character symbol determination means of the sentence condition determination unit in the machine translation device according to the third aspect of the present invention, is there a portion between parentheses in the translated word string obtained by the syntax conversion unit? Parenthesis determination means for determining whether or not the division symbol generation unit inserts different types of division symbols in the portion inside the parentheses and the portion outside the parentheses in the translated sentence, and thus is obtained by the machine translation device. At the boundary position of the partial structure in the translated sentence, different kinds of division symbols are inserted between the portion inside the parentheses and the portion outside the parentheses. Therefore, according to the present invention, if the division symbol inserted in the parentheses in the translated sentence is made to have a weaker impression than the division signal inserted in the portion outside the parentheses, the division symbol inserted in the parentheses in the translated sentence is inserted. The division symbols do not weaken the impression of the unity inside the parentheses, and the effects of the division symbols outside the parentheses are sufficiently utilized to generate an easy-to-read translated sentence.

[Brief description of drawings]

【図１】この発明の機械翻訳装置におけるブロック図で
ある。FIG. 1 is a block diagram of a machine translation device of the present invention.

【図２】図１における翻訳制御部の制御の下に実施され
る分割翻訳処理動作のフローチャートである。FIG. 2 is a flowchart of a division translation processing operation performed under the control of a translation control unit in FIG.

【図３】図２に続く分割翻訳処理動作のフローチャート
である。FIG. 3 is a flowchart of a divided translation processing operation subsequent to FIG.

【図４】図１における訳文バッファに格納された翻訳文
の一例を示す図である。4 is a diagram showing an example of translated texts stored in a translated text buffer in FIG.

【図５】図１における出力部による出力内容の一例を示
す図である。FIG. 5 is a diagram showing an example of output contents by an output unit in FIG.

【図６】翻訳レベルの説明図である。FIG. 6 is an explanatory diagram of a translation level.

【図７】従来の分割翻訳処理機能を有する機械翻訳装置
のブロック図である。FIG. 7 is a block diagram of a conventional machine translation device having a divisional translation processing function.

【図８】図１および図７における原文バッファの概念を
示す図である。FIG. 8 is a diagram showing a concept of an original text buffer in FIGS. 1 and 7.

【図９】図７における出力部による出力内容の一例を示
す図である。9 is a diagram showing an example of output contents by an output unit in FIG.

【図１０】図１および図７における辞書引きバッファの
概念を示す図である。10 is a diagram showing the concept of a dictionary lookup buffer in FIGS. 1 and 7. FIG.

【図１１】図１および図７における変換前の構文バッフ
ァに格納されているソース言語による構造解析木の概念
を示す図である。11 is a diagram showing the concept of a structure analysis tree in the source language stored in the syntax buffer before conversion in FIGS. 1 and 7. FIG.

【図１２】図１および図７における変換後の構文バッフ
ァに格納されているターゲット言語による構造解析木の
概念を示す図である。12 is a diagram showing the concept of a structure analysis tree in the target language stored in the syntax buffer after conversion in FIGS. 1 and 7. FIG.

【図１３】図７における訳文バッファに格納された翻訳
文の一例を示す図である。FIG. 13 is a diagram showing an example of translated sentences stored in a translated sentence buffer in FIG. 7.

[Explanation of symbols]

１１…翻訳制御部、１２…入力部、１
３…出力部、１５…翻訳モジュー
ル、１５a…辞書引き・形態素解析部、１５b…構文
解析部、１５c…構文変換部、１５d…
ターゲット言語生成部、１５e…文中条件判定部、
１５f…分割記号生成部、１６…記憶部、
１７…翻訳用メインメモリ、１７a…原
文バッファ、１７b…辞書引きバッファ、
１７c…変換前の構文バッファ、１７d…変換後の
構文バッファ、１７e…訳文バッファ、Ｔ
Ｗ…訳語ポインタ、Ｋ…括弧フラグ。11 ... Translation control unit, 12 ... Input unit, 1
3 ... Output unit, 15 ... Translation module, 15a ... Dictionary lookup / morpheme analysis unit, 15b ... Syntax analysis unit, 15c ... Syntax conversion unit, 15d ...
Target language generation unit, 15e ... Sentence condition determination unit,
15f ... division symbol generation unit, 16 ... storage unit,
17 ... Main memory for translation, 17a ... Original text buffer, 17b ... Dictionary lookup buffer,
17c ... Syntax buffer before conversion, 17d ... Syntax buffer after conversion, 17e ... Translation buffer, T
W ... translation pointer, K ... parenthesis flag.

Claims

[Claims]

1. A morphological analysis process is performed on an input sentence from an input unit by a morphological analysis unit, and a syntactic analysis process is performed by a syntactic analysis unit to obtain a syntactic structure of a source language based on the result of the morphological analysis process. , Performing a syntax conversion process of converting the syntax structure of the source language into the syntax structure of the target language by the syntax conversion unit and obtaining a translation string corresponding to the syntax structure of the target language, and the translation sentence generation unit of the target language. When performing a translation sentence generation process for generating a translation sentence based on a syntactic structure and a translated word string and converting the input sentence into a translation sentence, the syntax analysis unit, the syntax conversion unit, and the translation sentence generation unit are In a machine translation device that performs a divided translation process that divides an input sentence into a plurality of partial structures and processes the input sentence when the input sentence meets a predetermined translation condition, In the divided translation process, based on the determination result by the in-sentence condition determination unit that determines whether or not a portion satisfying a predetermined condition exists in the translated word string obtained by the syntax conversion unit, and the determination result by the in-sentence condition determination unit, At the boundary position of the partial structure in the translated sentence generated based on the translated word string by the translated sentence generation unit, a position in the translated word string corresponding to the boundary position is in a portion that satisfies the predetermined condition. A machine translation device comprising: a division symbol generation unit that inserts a division symbol of a different type from that in a portion that does not satisfy the predetermined condition.

2. The machine translation device according to claim 1, wherein the in-sentence condition determination section includes a portion sandwiched by a specific pair of character symbols in the translated word string obtained by the syntax conversion section. The divided symbol generation unit has a character / symbol determination unit for determining whether or not the portion sandwiched by the specific pair of character symbols in the translated sentence and the specific pair of character symbols in the translated sentence are not sandwiched. A machine translation device, characterized in that different types of division symbols are inserted between parts.

3. The machine translation device according to claim 2, wherein the character / symbol determination means determines whether or not there is a portion sandwiched between parentheses in the translated word string obtained by the syntax conversion unit. A machine translation device, which is parentheses determination means, wherein the division symbol generation unit inserts different types of division symbols in a portion sandwiched between parentheses and a portion not sandwiched between parentheses in the translated sentence. .