JP5548176B2

JP5548176B2 - Translation optimization apparatus, method, and program

Info

Publication number: JP5548176B2
Application number: JP2011203479A
Authority: JP
Inventors: ドゥケヴィン; 元塚田; 克仁須藤
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2011-09-16
Filing date: 2011-09-16
Publication date: 2014-07-16
Anticipated expiration: 2031-09-16
Also published as: JP2013065202A

Description

本発明は、翻訳最適化装置、方法、及びプログラムに係り、特に、翻訳処理で用いる複数の翻訳特徴モデルに対する重みを最適化する翻訳最適化装置、方法、及びプログラムに関する。 The present invention relates to a translation optimization device, method, and program, and more particularly, to a translation optimization device, method, and program that optimize weights for a plurality of translation feature models used in translation processing.

通常、統計翻訳システムは、複数の知識ベースを活用する。例えば、日本語から英語への翻訳の場合、翻訳モデル（入力となる日本語単語に対して最も可能性の高い英語訳を表す知識ベース）、言語モデル（出力となる英語表現の自然さを表す知識ベース）、及び並び換えモデル（入出力言語間の文法構造変換の尤もらしさ表す知識ベース）などの様々なモデルを統合した上で、入力文（日本語）に対して最も尤もらしい翻訳文（英語）を生成する。 A statistical translation system typically uses multiple knowledge bases. For example, in the case of translation from Japanese to English, a translation model (a knowledge base representing the most likely English translation for the input Japanese word) and a language model (representing the naturalness of the output English expression) The most likely translation for the input sentence (Japanese) after integrating various models such as knowledge base) and rearrangement model (knowledge base representing the likelihood of grammatical structure conversion between input and output languages) English).

翻訳文の尤もらしさはモデル特徴量の重みつき組み合わせで表現される。ここで、最も重要なステップの一つは、この重みの最適化（ＷｅｉｇｈｔＯｐｔｉｍｉｚａｔｉｏｎ）である。従来手法では、重みの最適化の際、ただ１つの翻訳品質評価尺度（ｅｖａｌｕａｔｉｏｎｍｅｔｒｉｃ）を目的関数として用いている。 The likelihood of the translation is expressed by a weighted combination of model features. Here, one of the most important steps is the optimization of the weight (Weight Optimization). The conventional method uses only one translation quality evaluation metric as an objective function when optimizing the weights.

重みの最適化の流れは以下の通りである。まず開発データとして、例えば日本語の文章（入力)と正しい英語訳（出力）を用意する。（１）機械翻訳システムは、入力文を翻訳して、出力結果と正しい訳文を比較する。（２）モデルの重みを調節する。この（１）〜（２）を何回も行い、最終的に、出力と正しい訳文が十分に一致したときの重みを求める。このような統計翻訳の最適化には、次の二つの代表的な手法が使われてきた。 The flow of weight optimization is as follows. First, for example, Japanese text (input) and correct English translation (output) are prepared as development data. (1) The machine translation system translates the input sentence and compares the output result with the correct translation. (2) Adjust the model weight. This (1) to (2) is repeated many times, and finally a weight is obtained when the output and the correct translation sufficiently match. The following two representative methods have been used for such optimization of statistical translation.

一つの手法はＭＥＲＴ（ＭｉｎｉｍｕｍＥｒｒｏｒＲａｔｅＴｒａｉｎｉｎｇ）であり（非特許文献１）、もう一つの手法はＭＩＲＡ（ＭａｒｇｉｎＩｎｆｕｓｅｄＲｅｌａｘｅｄＡｌｇｏｒｉｔｈｍ）である（非特許文献２）。どちらの方法も、繰り返し翻訳して、翻訳品質評価尺度の観点で最もよい翻訳を生成するように重みを調整する。 One technique is MERT (Minimum Error Rate Training) (Non-Patent Document 1), and another technique is MIRA (Margin Infused Relaxed Algorithm) (Non-Patent Document 2). Both methods translate repeatedly and adjust the weights to produce the best translation in terms of a translation quality rating scale.

このように重みの最適化のために、評価尺度は重要な役割を果たす。いい評価尺度は、正しい翻訳に高いスコアを与え、悪い翻訳に低いスコア与える。現在一番広く使われている評価尺度がＢＬＥＵという尺度である（非特許文献３）。ＢＬＥＵは、出力と正しい訳文との間における連続する部分単語列の一致度に基づいて計算される（よい翻訳のときＢＬＥＵ値は高くなる）。もう一つの代表的な評価尺度はＴＥＲ（ＴｒａｎｓｌａｔｉｏｎＥｄｉｔＲａｔｅ）である（非特許文献４）。ＴＥＲは、翻訳文を正しい訳文に書き換えるときに何ステップの作業が必要かに基づいてと計算される（よい翻訳のときにＴＥＲ値は低くなる）。
これまで数多くの評価尺度が提案されてきたが、人間の評価と一致する評価尺度は知られていない。実際は、評価尺度はそれぞれ違う観点で翻訳品質を評価しており、その意味で全ての評価尺度は一面の合理性を有している。そのため、機械翻訳の開発者は、複数の評価尺度を使ってシステムの性能を測っているのが実状である。 Thus, the evaluation scale plays an important role for optimizing the weight. A good rating scale gives high scores for correct translations and low scores for bad translations. The most widely used evaluation scale at present is a scale called BLEU (Non-patent Document 3). BLEU is calculated based on the degree of coincidence between successive partial word strings between the output and the correct translation (the BLEU value is high for good translation). Another representative evaluation scale is TER (Translation Edit Rate) (Non-patent Document 4). The TER is calculated based on how many steps are required when rewriting a translated sentence into a correct translation (the TER value is low when the translation is good).
Many evaluation scales have been proposed so far, but no evaluation scale that matches human evaluation is known. Actually, each evaluation scale evaluates translation quality from different viewpoints. In this sense, all evaluation scales have a certain level of rationality. Therefore, the reality is that machine translation developers measure system performance using multiple evaluation measures.

Franz Och, “Minimum Error Training in Statistical Machine Translation,” Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2003.Franz Och, “Minimum Error Training in Statistical Machine Translation,” Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2003. Taro Watanabe, Jun Suzuki, Hajime Tsukada, Hideki Isozaki, “Oneline Large-Margin Training for Statistical Machine Translation,” Proceedings of the Conference on Empirical Methods for Natural Language Processing (EMNLP-CoNLL), 2007.Taro Watanabe, Jun Suzuki, Hajime Tsukada, Hideki Isozaki, “Oneline Large-Margin Training for Statistical Machine Translation,” Proceedings of the Conference on Empirical Methods for Natural Language Processing (EMNLP-CoNLL), 2007. Kishore Papineni, Salim Roukas, Todd Ward, Wei-jing Zhu, “BLEU: A method for automatic evaluation of machine translation,” Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2002.Kishore Papineni, Salim Roukas, Todd Ward, Wei-jing Zhu, “BLEU: A method for automatic evaluation of machine translation,” Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), 2002. Matt Snover, Bonnie Dorr, Richard Schart, Linnea Micciulla, John Makhoul, “A study of Translation Edit Rate with Targeted Human Annotation,” Proceedings of Meeting of Association for Machine Translation for the Americas (AMTA), 2006.Matt Snover, Bonnie Dorr, Richard Schart, Linnea Micciulla, John Makhoul, “A study of Translation Edit Rate with Targeted Human Annotation,” Proceedings of Meeting of Association for Machine Translation for the Americas (AMTA), 2006.

しかしながら、上記非特許文献１、２に記載されたＭＥＲＴやＭＩＲＡを用いた最適化手法では一つだけの評価尺度しか利用できなかった。そのため、ある観点でよい翻訳を生成できるように最適化されるが、他の観点では適切な翻訳となることが保障されない、という問題がある。これは、開発者が、評価尺度として例えばＢＬＥＵかＴＥＲのどちらかしか使えないことに相当する。ＢＬＥＵで最適化すればＴＥＲが最適ではないかもしれない。同じように、ＴＥＲで最適化すればＢＬＥＵが悪くなるかもしれない。一つ評価尺度がよい一方で他の評価尺度が悪いという不均衡があると、人間が評価したときに決してよい翻訳にならない。 However, the optimization method using MERT and MIRA described in Non-Patent Documents 1 and 2 described above can use only one evaluation measure. Therefore, although it is optimized so that a good translation can be generated from a certain viewpoint, there is a problem that it is not guaranteed that the translation is appropriate from another viewpoint. This corresponds to the fact that the developer can use only BLEU or TER as an evaluation scale, for example. If optimized with BLEU, TER may not be optimal. Similarly, BLEU may worsen if optimized with TER. If there is an imbalance that one rating scale is good while the other rating scale is bad, it will never be a good translation when humans evaluate.

本発明は、上記の事情を鑑みてなされたもので、複数の評価尺度を同時に用いて、複数の翻訳特徴モデルに対する重みの最適化を行うことができる翻訳最適化装置、方法、及びプログラムを提供することを目的とする。 The present invention has been made in view of the above circumstances, and provides a translation optimization device, method, and program capable of optimizing weights for a plurality of translation feature models by simultaneously using a plurality of evaluation scales. The purpose is to do.

上記の目的を達成するために本発明に係る翻訳最適化装置は、複数種類の翻訳特徴モデル及び前記複数種類の翻訳特徴モデルの各々に対する重みに基づいて、翻訳元言語の単語列を、翻訳先言語の単語列の複数の候補ｈに翻訳する翻訳処理手段と、前記翻訳処理手段によって翻訳された前記翻訳先言語の単語列の複数の候補ｈの各々について、複数種類の翻訳評価尺度を示す複数のスコアＭ _１（ｈ）、Ｍ _２（ｈ）を算出する翻訳評価手段と、前記複数の候補ｈの各々について算出された前記複数のスコアＭ _１（ｈ）、Ｍ _２（ｈ）に基づいて、以下の式を用いて、変数αを０から無限大まで徐々に増加させながら、変数αの各値において、前記複数の候補の集合Ｌから、パレート最適性を満たす候補ｈ ^＊を抽出すると共に、前記抽出された候補ｈ ^＊より前記スコアＭ _２（ｈ）が小さくなる候補を、前記パレート最適性を満たさない候補として識別するパレート識別手段と、前記パレート識別手段による識別結果に応じて、前記複数種類の翻訳特徴モデルの各々に対する重みを更新する重み更新手段と、前記翻訳処理手段による翻訳、前記翻訳評価手段による算出、前記パレート識別手段による識別、及び前記重み更新手段による更新を繰り返すことで、前記複数種類の翻訳特徴モデルの各々に対する重みを最適化する最適化手段と、を含んで構成されている。

In order to achieve the above object, a translation optimization apparatus according to the present invention converts a translation source language word string into a translation destination based on a plurality of types of translation feature models and weights for each of the plurality of types of translation feature models. multiple illustrating the translation processing means for translating the plurality of candidate h word string language, for each of a plurality of candidate h word string of the target language translated by said translation processing means, a plurality of types of translation rating scale score M ₁ _(h), a translation evaluation means for calculating the M 2 (h), the score of each of the plurality of calculated for a plurality of candidate h M ₁ _(h), based on M 2 (h) Using the following formula, while gradually increasing the variable α from 0 to infinity, a candidate h ^* satisfying Pareto optimality is extracted from the plurality of candidate sets L at each value of the variable α. The extracted Pareto identifying means for identifying a candidate whose score M ₂ (h) is smaller than the candidate h ^{* as} a candidate that does not satisfy the Pareto optimality, and the plurality of types of translation features according to the identification result by the Pareto identifying means Repeating the weight update means for updating the weight for each of the models, the translation by the translation processing means, the calculation by the translation evaluation means, the identification by the Pareto identification means, and the update by the weight update means, the plurality of types Optimizing means for optimizing the weight for each translation feature model.

本発明に係る翻訳最適化方法は、翻訳処理手段、翻訳評価手段、パレート識別手段、重み更新手段、及び最適化手段を含む翻訳最適化装置における翻訳最適化方法であって、前記翻訳最適化装置は、前記翻訳処理手段によって、複数種類の翻訳特徴モデル及び前記複数種類の翻訳特徴モデルの各々に対する重みに基づいて、翻訳元言語の単語列を、翻訳先言語の単語列の複数の候補ｈに翻訳するステップと、前記翻訳評価手段によって、前記翻訳処理手段によって翻訳された前記翻訳先言語の単語列の複数の候補ｈの各々について、複数種類の翻訳評価尺度を示す複数のスコアＭ _１（ｈ）、Ｍ _２（ｈ）を算出するステップと、前記パレート識別手段によって、前記複数の候補ｈの各々について算出された前記複数のスコアＭ _１（ｈ）、Ｍ _２（ｈ）に基づいて、以下の式を用いて、変数αを０から無限大まで徐々に増加させながら、変数αの各値において、前記複数の候補の集合Ｌから、パレート最適性を満たす候補ｈ ^＊を抽出すると共に、前記抽出された候補ｈ ^＊より前記スコアＭ _２（ｈ）が小さくなる候補を、前記パレート最適性を満たさない候補として識別するステップと、前記重み更新手段によって、前記パレート識別手段による識別結果に応じて、前記複数種類の翻訳特徴モデルの各々に対する重みを更新するステップと、前記最適化手段によって、前記翻訳処理手段による翻訳、前記翻訳評価手段による算出、前記パレート識別手段による識別、及び前記重み更新手段による更新を繰り返すことで、前記複数種類の翻訳特徴モデルの各々に対する重みを最適化するステップと、を含んで実行することを特徴とする。

A translation optimizing method according to the present invention is a translation optimizing method in a translation optimizing device including a translation processing unit, a translation evaluating unit, a Pareto identifying unit, a weight updating unit, and an optimizing unit, wherein the translation optimizing device It is by the translation processing unit, based on the weight for each of a plurality of types of translation feature model and the plurality of types of translation feature model, a word string of the source language, a plurality of candidate h word string of target language A plurality of scores M ₁ (h) indicating a plurality of types of translation evaluation scales for each of a plurality of candidates h of the word string of the translation target language translated by the translation processing means by the translation evaluating means; ), M ₂ (h) , and the plurality of scores M ₁ (h), M calculated for each of the plurality of candidates h by the Pareto identification means. ₂ Based on (h) , the Pareto optimality is satisfied from the set L of the plurality of candidates at each value of the variable α while gradually increasing the variable α from 0 to infinity using the following formula: extracts a candidate h ^*, the score M _{2 (h)} is smaller candidates from the extracted candidate h ^*, identifying as a candidate that does not meet the Pareto optimality, by the weight updating unit, wherein Updating the weights for each of the plurality of types of translation feature models according to the identification result by the Pareto identification means, the translation by the translation processing means by the optimization means, the calculation by the translation evaluation means, and the Pareto identification The weight for each of the plurality of types of translation feature models is optimized by repeating the identification by the means and the update by the weight update means. And executes includes a step, a.

本発明によれば、翻訳処理手段によって、複数種類の翻訳特徴モデル及び前記複数種類の翻訳特徴モデルの各々に対する重みに基づいて、翻訳元言語の単語列を、翻訳先言語の単語列の複数の候補に翻訳する。前記翻訳評価手段によって、前記翻訳処理手段によって翻訳された前記翻訳先言語の単語列の複数の候補の各々について、複数種類の翻訳評価尺度を示す複数のスコアを算出する。 According to the present invention, the translation processing means converts the plurality of types of translation feature models and the plurality of types of translation feature models into weights of the source language word strings and the translation destination language word strings. Translate to candidate. The translation evaluation means calculates a plurality of scores indicating a plurality of types of translation evaluation scales for each of a plurality of candidates for the word string in the translation target language translated by the translation processing means.

そして、パレート識別手段によって、前記複数の候補の各々について算出された前記複数のスコアに基づいて、前記複数の候補の各々が、パレート最適性を満たすか否かを識別する。重み更新手段によって、前記パレート識別手段による識別結果に応じて、前記複数種類の翻訳特徴モデルの各々に対する重みを更新する。 Then, the Pareto identifying means identifies whether each of the plurality of candidates satisfies Pareto optimality based on the plurality of scores calculated for each of the plurality of candidates. The weight updating unit updates the weight for each of the plurality of types of translation feature models according to the identification result by the Pareto identification unit.

そして、最適化手段によって、前記翻訳処理手段による翻訳、前記翻訳評価手段による算出、前記パレート識別手段による識別、及び前記重み更新手段による更新を繰り返すことで、前記複数種類の翻訳特徴モデルの各々に対する重みを最適化する。 Then, the optimization unit repeats the translation by the translation processing unit, the calculation by the translation evaluation unit, the identification by the Pareto identification unit, and the update by the weight update unit. Optimize the weights.

このように、翻訳先言語の単語列の複数の候補の各々について複数種類の翻訳評価尺度を示す複数のスコアを算出して、パレート最適性を満たすか否かを識別し、識別結果に応じて、各翻訳特徴モデルに対する重みを更新することにより、複数の評価尺度を同時に用いて、複数の翻訳特徴モデルに対する重みの最適化を行うことができる。 In this way, by calculating a plurality of scores indicating a plurality of types of translation evaluation scales for each of a plurality of candidates for the word string in the translation target language, it is identified whether or not Pareto optimality is satisfied, and according to the identification result By updating the weights for each translation feature model, it is possible to optimize the weights for a plurality of translation feature models by simultaneously using a plurality of evaluation scales.

本発明に係るパレート識別手段は、前記複数の候補ｈの各々について算出された前記複数のスコアＭ_１（ｈ）、Ｍ_２（ｈ）に基づいて、以下の式を用いて、変数αを０から無限大まで徐々に増加させながら、変数αの各値において、前記複数の候補の集合Ｌから、パレート最適性を満たす候補ｈ^＊を抽出すると共に、前記抽出された候補ｈ^＊より前記スコアＭ_２（ｈ）が小さくなる候補を、前記パレート最適性を満たさない候補として識別するようにする。 The Pareto identification means according to the present invention sets the variable α to 0 using the following formula based on the plurality of scores M ₁ (h) and M ₂ (h) calculated for each of the plurality of candidates h. The candidate h ^* satisfying the Pareto optimality is extracted from the plurality of candidate sets L at each value of the variable α while gradually increasing from 1 to infinity, and the score M is extracted from the extracted candidate h ^*. the _{candidate 2 (h)} is smaller, you to be identified as candidates that do not satisfy the Pareto optimality.

本発明に係る翻訳処理手段は、翻訳元言語の複数の単語列の各々を、翻訳先言語の単語列の複数の候補に翻訳し、前記重み更新手段は、前記パレート識別手段による識別結果に応じて、前記翻訳先言語の複数の単語列に対する各候補について、パレート最適性を満たすと識別された場合に、該候補に対する各スコアとして、該スコアの最大値を与えて、前記複数の単語列に対する各候補について、各スコアの合計値を累積した累計スコアを算出し、前記算出された累計スコアに基づいて、前記複数種類の翻訳特徴モデルの各々に対する重みを更新するようにすることができる。 The translation processing means according to the present invention translates each of the plurality of word strings of the translation source language into a plurality of candidates of the word strings of the translation destination language, and the weight update means responds to the identification result by the Pareto identification means. When each candidate for a plurality of word strings in the target language is identified as satisfying Pareto optimality, a maximum value of the score is given as each score for the candidate, For each candidate, a cumulative score obtained by accumulating the total value of the scores can be calculated, and the weight for each of the plurality of types of translation feature models can be updated based on the calculated cumulative score.

本発明に係る重み更新手段は、前記パレート識別手段による識別結果に応じて、前記翻訳先言語の単語列に対する各候補の少なくとも一つが、パレート最適性を満たさないと識別された場合に、前記複数種類の翻訳特徴モデルの各々に対する重みを更新するようにすることができる。 The weight updating unit according to the present invention is configured such that when at least one of the candidates for the word string in the translation destination language is identified as not satisfying the Pareto optimality according to the identification result by the Pareto identifying unit, The weight for each type of translation feature model can be updated.

本発明に係るプログラムは、コンピュータを、上記の翻訳最適化装置の各手段として機能させるためのプログラムである。 The program according to the present invention is a program for causing a computer to function as each unit of the translation optimization device.

以上説明したように、本発明の翻訳最適化装置、方法、及びプログラムによれば、翻訳先言語の単語列の複数の候補の各々について複数種類の翻訳評価尺度を示す複数のスコアを算出して、パレート最適性を満たすか否かを識別し、識別結果に応じて、各翻訳特徴モデルに対する重みを更新することにより、複数の評価尺度を同時に用いて、複数の翻訳特徴モデルに対する重みの最適化を行うことができる、という効果が得られる。 As described above, according to the translation optimization device, method, and program of the present invention, a plurality of scores indicating a plurality of types of translation evaluation scales are calculated for each of a plurality of candidates for a word string of a translation target language. , Identify whether or not Pareto optimality is satisfied, and update the weights for each translation feature model according to the identification result, thereby simultaneously optimizing the weights for multiple translation feature models using multiple evaluation measures The effect that it can be performed is acquired.

本発明の第１の実施の形態に係る翻訳最適化装置の構成を示す概略図である。It is the schematic which shows the structure of the translation optimization apparatus which concerns on the 1st Embodiment of this invention. パレート最適性を説明するための図である。It is a figure for demonstrating Pareto optimality. パレートフロンティアを求めるアルゴリズムを示す図である。It is a figure which shows the algorithm which calculates | requires a Pareto frontier. 本発明の第１の実施の形態に係る翻訳最適化装置における翻訳最適化処理ルーチンの内容を示すフローチャートである。It is a flowchart which shows the content of the translation optimization process routine in the translation optimization apparatus which concerns on the 1st Embodiment of this invention.

以下、図面を参照して本発明の実施の形態を詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

＜第１の実施の形態＞
＜システム構成＞
本発明の第１の実施の形態に係る翻訳最適化装置１００は、翻訳元言語の入力文と翻訳先言語の正しい出力文とからなる開発データが複数入力され、複数の特徴モデルに対する重みを最適化する。この翻訳最適化装置１００は、ＣＰＵと、ＲＡＭと、後述する翻訳最適化処理ルーチンを実行するためのプログラムを記憶したＲＯＭとを備えたコンピュータで構成され、機能的には次に示すように構成されている。図１に示すように、翻訳最適化装置１００は、入力部１０と、演算部２０とを備えている。 <First Embodiment>
<System configuration>
The translation optimizing apparatus 100 according to the first embodiment of the present invention receives a plurality of development data composed of an input sentence in a source language and a correct output sentence in a destination language, and optimizes weights for a plurality of feature models. Turn into. This translation optimizing apparatus 100 is composed of a computer including a CPU, a RAM, and a ROM storing a program for executing a translation optimizing process routine to be described later, and functionally configured as shown below. Has been. As shown in FIG. 1, the translation optimization apparatus 100 includes an input unit 10 and a calculation unit 20.

入力部１０は、入力された開発データとして、翻訳元の日本語の入力文（単語列)と正しい英語訳である出力文（単語列）とからなるデータセットを複数受け付ける。 The input unit 10 accepts a plurality of data sets composed of a Japanese input sentence (word string) as a translation source and an output sentence (word string) as a correct English translation as input development data.

演算部２０は、開発データ記憶部２１、特徴量重み記憶部２２、翻訳部２３、翻訳評価部２４、パレート識別部２５、及び最適化部２６を備えている。なお、最適化部２６が、重み更新手段及び最適化手段の一例である。 The computing unit 20 includes a development data storage unit 21, a feature amount weight storage unit 22, a translation unit 23, a translation evaluation unit 24, a Pareto identification unit 25, and an optimization unit 26. The optimization unit 26 is an example of a weight update unit and an optimization unit.

開発データ記憶部２１は、入力部１０により受け付けた複数の開発データを記憶する。 The development data storage unit 21 stores a plurality of development data received by the input unit 10.

特徴量重み記憶部２２は、翻訳部２３による翻訳処理で参照される複数の翻訳特徴モデル（例えば、翻訳モデル、言語モデル、及び並び替えモデル）の各々に対する重みからなる重みベクトルＷ（＝（ｗ1、ｗ2、ｗ3））を記憶する。 The feature amount weight storage unit 22 is a weight vector W (= (w1) comprising weights for each of a plurality of translation feature models (for example, a translation model, a language model, and a rearrangement model) referred to in the translation processing by the translation unit 23. , W2, w3)).

翻訳部２３は、各開発データについて、以下のように、当該開発データの入力文を翻訳し、翻訳文の出力候補リストを出力する。 For each development data, the translation unit 23 translates the input sentence of the development data as follows, and outputs a translation sentence output candidate list.

まず、翻訳部２３は、開発データの入力文について、翻訳処理を行い、複数の翻訳候補を求める。なお、翻訳処理については、従来既知の手法を用いればよいため、詳細な説明を省略する。翻訳部２３は、各翻訳候補について、複数の翻訳特徴モデルの各々に対する特徴量を算出し、各翻訳特徴モデルの特徴量からなる特徴ベクトルＦ（＝（ｆ1、ｆ2、ｆ3））を算出する。翻訳部２３は、各翻訳候補について、特徴量重み記憶部２２に記憶された各翻訳特徴モデルに対する重みからなる重みベクトルＷを用いて、各翻訳特徴モデルの特徴量の重み付き加算値Ｆ・Ｗ（＝ｗ1・ｆ1＋ｗ2・ｆ2＋ｗ3・ｆ3））を、翻訳特徴スコアとして算出する。翻訳部２３は、各翻訳候補を、翻訳特徴スコアの降順に並べ、上位ｋ個の翻訳候補を特定し、特定された上位ｋ個の翻訳候補を、翻訳文の出力候補リストとして出力する。 First, the translation unit 23 performs a translation process on the input sentence of the development data to obtain a plurality of translation candidates. In addition, about a translation process, since a conventionally known method should just be used, detailed description is abbreviate | omitted. The translation unit 23 calculates a feature amount for each of a plurality of translation feature models for each translation candidate, and calculates a feature vector F (= (f1, f2, f3)) composed of the feature amounts of each translation feature model. The translation unit 23 uses, for each translation candidate, a weight vector W composed of weights for each translation feature model stored in the feature amount weight storage unit 22 to add a weighted addition value F · W of the feature amount of each translation feature model. (= W 1 · f 1 + w 2 · f 2 + w 3 · f 3)) is calculated as a translation feature score. The translation unit 23 arranges the translation candidates in descending order of the translation feature score, identifies the top k translation candidates, and outputs the identified top k translation candidates as a translation output candidate list.

翻訳評価部２４は、翻訳部２３により出力された翻訳文の出力候補リストの中の各翻訳候補について、開発データに含まれる正しい翻訳文と比較して、複数種類の評価尺度（例えば、ＢＬＥＵ及びＴＥＲ）の各々に対する複数種類の評価尺度スコア（ＢＬＥＵ値、１−ＴＥＲ値）を算出する。 The translation evaluation unit 24 compares each translation candidate in the translation sentence output candidate list output by the translation unit 23 with a correct translation sentence included in the development data, and compares a plurality of types of evaluation scales (for example, BLEU and A plurality of types of evaluation scale scores (BLEU value, 1-TER value) are calculated for each of (TER).

パレート識別部２５は、翻訳評価部２４によって算出された複数種類の評価尺度スコアに基づいて、翻訳文の出力候補リストの中の各翻訳候補について、パレート最適性を満たすか否かを識別し、パレート最適性を満たす翻訳候補からなる、出力候補リストのサブセットを出力する。 The Pareto identifying unit 25 identifies whether or not the Pareto optimality is satisfied for each translation candidate in the output candidate list of the translation sentence based on the plurality of types of evaluation scale scores calculated by the translation evaluating unit 24. Output a subset of the output candidate list consisting of translation candidates that satisfy Pareto optimality.

ここで、非特許文献（Marler and Arora, “Survey of multi-objective optimization methods for Engineering”, in Journal of Structural and Multidisciplinary Optimization, vol26., 2004）に記載されているパレートの概念を説明する。説明を簡単にするため、翻訳部２３は、二つの翻訳候補(Ｈ１、Ｈ２)だけを出力するものとし、この中のどちらが有効な翻訳かを決める問題として説明する。 Here, the concept of Pareto described in non-patent literature (Marler and Arora, “Survey of multi-objective optimization methods for Engineering”, in Journal of Structural and Multidisciplinary Optimization, vol. 26, 2004) will be described. In order to simplify the explanation, the translation unit 23 outputs only two translation candidates (H1, H2), and this will be described as a problem that determines which of these is an effective translation.

翻訳評価部２４によって、出力候補リストに含まれる各翻訳候補のスコアベクトル（各要素は、出力候補のＢＬＥＵ値と（１−ＴＥＲ値）とからなる）を計算する。ここで、翻訳候補Ｈ１のスコアベクトルをＭ（Ｈ１）で表わし、翻訳候補Ｈ２のスコアベクトルをＭ（Ｈ２）で表す。スコアベクトルの各要素の値が高いほど、よい翻訳であると考えられる。もしＭ（Ｈ１）のすべての要素がＭ（Ｈ２）のすべての要素より大きければ、翻訳候補Ｈ１はすべての評価尺度で、翻訳候補Ｈ２より良い翻訳であるといえる。逆に、スコアベクトルＭ（Ｈ１）のすべての要素と比べてスコアベクトルＭ（Ｈ２）のすべての要素が大きい場合、翻訳候補Ｈ１より翻訳候補Ｈ２の方が良い翻訳といえる。しかし、ある要素はスコアベクトルＭ（Ｈ１）の方が高く、他の要素はＭ（Ｈ２）の方が高い時、翻訳候補Ｈ１とＨ２はどちらが良い翻訳かは一概にいえない。このような場合、翻訳候補Ｈ１とＨ２は同程度の品質の翻訳文と考えられる。このような直感を定式化したのがパレート最適性である。以下に、弱パレート最適性（ＷｅａｋＰａｒｅｔｏ−ｏｐｔｉｍａｌｉｔｙ）、及びパレート最適性（Ｐａｒｅｔｏ−ｏｐｔｉｍａｌｉｔｙ）の定義について説明する。 The translation evaluation unit 24 calculates a score vector of each translation candidate included in the output candidate list (each element is composed of a BLEU value of the output candidate and (1-TER value)). Here, the score vector of translation candidate H1 is represented by M (H1), and the score vector of translation candidate H2 is represented by M (H2). The higher the value of each element of the score vector, the better the translation. If all the elements of M (H1) are larger than all the elements of M (H2), it can be said that the translation candidate H1 is a better translation than the translation candidate H2 on all evaluation scales. Conversely, if all the elements of the score vector M (H2) are larger than all the elements of the score vector M (H1), it can be said that the translation candidate H2 is a better translation than the translation candidate H1. However, when a certain element has a higher score vector M (H1) and other elements have a higher M (H2), it cannot be generally said which translation candidate H1 or H2 is a better translation. In such a case, the translation candidates H1 and H2 are considered to be translated sentences having the same quality. It is Pareto optimality that formulated such intuition. Below, the definition of weak pareto optimality (Weak Pareto-optimality) and pareto optimality (Pareto-optimality) is demonstrated.

（定義１）弱パレート最適性：翻訳文の出力候補リストＬが与えられており、すべての要素でＭ（Ｈ’）＞Ｍ（Ｈ）を満たすＨ’が存在しない場合かつその場合のみ、Ｈは弱パレート最適性を満たすという。 (Definition 1) Weak Pareto Optimality: A translation sentence output candidate list L is given, and there is no H ′ that satisfies M (H ′)> M (H) in all elements, and only in that case, H Is said to satisfy weak Pareto optimality.

（定義２）パレート最適性：翻訳文の出力候補リストＬが与えられており、すべての要素でＭ（Ｈ’）≧Ｍ（Ｈ）であり、かつ、ある要素ｉでＭ_ｉ（Ｈ’）＞Ｍ_ｉ（Ｈ）を満たすＨ’が存在しない場合、かつその場合のみ、Ｈはパレート最適性を満たすという。 (Definition 2) Pareto Optimality: A translation output candidate list L is given, M (H ′) ≧ M (H) for all elements, and M _i (H ′) for a certain element i It is said that H satisfies Pareto optimality when and only when H ′ that satisfies> M _i (H) does not exist.

上記の定義に関して、１０個の翻訳候補と２つの評価尺度（ｍｅｔｒｉｃ１、２）を例として、図２にて簡単に説明する。上記図２で、「○」で示した翻訳候補はパレート最適性（および弱パレート最適性）を満たしている。一方で「△」で示した候補は、パレート最適性を満たさないが弱パレート最適性を満たしている。「＋」を示される弱パレート最適性を満たさない翻訳候補は非パレートポイント（Ｎｏｎ−ｐａｒｅｔｏｐｏｉｎｔ）という。例えば、ポイント（０．４、０．７）はパレート最適性を満たした翻訳候補を示す。なぜなら、（ｍｅｔｒｉｃ１＞＝０．４，かつ、ｍｅｔｒｉｃ２＞＝０．７）となる他の翻訳候補が存在しないからである。一方で、ポイント（０．６，０．４）は他のポイント（０．７、０．６）と比べたとき、ｍｅｔｒｉｃ１０．７＞０．６、ｍｅｔｒｉｃ２０．６＞０．４となるため、非パレートポイントとなる。 The above definition will be briefly described with reference to FIG. 2 using ten translation candidates and two evaluation scales (metric 1 and 2) as examples. In FIG. 2, the translation candidates indicated by “◯” satisfy Pareto optimality (and weak Pareto optimality). On the other hand, candidates indicated by “Δ” do not satisfy Pareto optimality but satisfy weak Pareto optimality. A translation candidate that does not satisfy the weak Pareto optimality indicated by “+” is referred to as a non-pareto point. For example, the point (0.4, 0.7) indicates a translation candidate that satisfies the Pareto optimality. This is because there are no other translation candidates that satisfy (metric1> = 0.4 and metric2> = 0.7). On the other hand, the point (0.6, 0.4) becomes metric 1 0.7> 0.6 and metric 2 0.6> 0.4 when compared with other points (0.7, 0.6). Therefore, it becomes a non-Pareto point.

つまり、ある翻訳候補に対して、他の翻訳候補がすべての評価尺度で勝つ場合が無ければ、弱パレート最適性を満たす。ある翻訳候補に対して、全ての評価尺度で同等以上の評価値を持ち、かつ、少なくとも一つ以上の評価尺度でより大きい（勝る）評価値を持つ他の翻訳候補が存在する場合は、当該ある翻訳候補は、パレート最適性を満たさない。逆に、そのような他の翻訳候補が全く存在しない場合は、当該ある翻訳候補はパレート最適性を満たす。 That is, the weak Pareto optimality is satisfied if there is no case where other translation candidates win on all evaluation scales with respect to a certain translation candidate. For a translation candidate, if there is another translation candidate that has an evaluation value that is equal to or greater than that of all evaluation scales and that has a higher (greater) evaluation value in at least one evaluation scale, Some translation candidates do not satisfy Pareto optimality. Conversely, when there is no such other translation candidate, the certain translation candidate satisfies the Pareto optimality.

また、翻訳文の出力候補リストＬが与えられたとき、パレート最適性を満たす全ての翻訳候補の集合をパレートフロンティア（ＰａｒｅｔｏＦｒｏｎｔｉｅｒ）という。 When a translation sentence output candidate list L is given, a set of all translation candidates satisfying the Pareto optimality is referred to as a Pareto Frontier.

パレート識別部２５の目的は、パレートフロンティアを効率的に見つけることである。従来より、いくつかのアルゴリズムが提案されているが、本実施の形態では、一例として、ＣｏｎｖｅｘＨｕｌｌＡｌｇｏｒｉｔｈｍ（凸包アルゴリズム）を拡張したアルゴリズムを用いる。この拡張により、パレートフロンティアの近似解を高速に求めることができる。本実施の形態で用いるアルゴリズムは、評価尺度が２つである場合に用いることができる。 The purpose of the Pareto identification unit 25 is to efficiently find the Pareto frontier. Conventionally, several algorithms have been proposed, but in the present embodiment, as an example, an algorithm obtained by extending Convex Hull Algorithm (convex hull algorithm) is used. With this extension, an approximate Pareto frontier solution can be obtained at high speed. The algorithm used in this embodiment can be used when there are two evaluation measures.

本実施の形態で用いるアルゴリズムでは、以下の（１）式に従って、２つの評価尺度のあらゆる線形和の各々について、最大となるポイント（翻訳候補）ｈ^*を求める。 In the algorithm used in the present embodiment, the maximum point (translation candidate) h ^* is obtained for each of the linear sums of the two evaluation measures according to the following equation (1).

ここで、αの値をゼロから無限へ変化させて、αの各値で求めたすべての翻訳候補ｈ^*は、弱パレート最適性を満たしていることが証明できる。 Here, by changing the value of α from zero to infinity, it can be proved that all translation candidates h ^* obtained for each value of α satisfy the weak Pareto optimality.

上記（１）式は、αに関して区分的定数（ｐｉｅｃｅｗｉｓｅ−ｃｏｎｓｔａｎｔ）であるため、パレート識別部２５は、すべての０≦α＜∞に関する値を検索するのに高速なアリゴリズムが利用できる。擬似-コードは図３で示される。 Since the equation (1) is a piecewise constant with respect to α, the Pareto identification unit 25 can use a high-speed algorithm to search for all values related to 0 ≦ α <∞. The pseudo-code is shown in FIG.

パレート識別部２５は、図３に示すように、複数の翻訳候補ｈの各々について算出された評価尺度スコアＭ_１（ｈ）、Ｍ_２（ｈ）に基づいて、上記（１）式を用いて、変数α＝０を初期値として、翻訳文の出力候補リストＬから、パレート最適性を満たす翻訳候補ｈ^＊を抽出する。そして、抽出された翻訳候補ｈ^＊より評価尺度スコアＭ２（ｈ）が小さくなる翻訳候補を、非パレートポイントとして識別する。 As shown in FIG. 3, the Pareto identification unit 25 uses the above formula (1) based on the evaluation scale scores M ₁ (h) and M ₂ (h) calculated for each of the plurality of translation candidates h. Then, a translation candidate h ^* satisfying the Pareto optimality is extracted from the translation sentence output candidate list L with the variable α = 0 as an initial value. Then, a translation candidate whose evaluation scale score M2 (h) is smaller than the extracted translation candidate h ^* is identified as a non-Pareto point.

そして、残りの翻訳候補ｈの各々について、以下の（２）式を用いて、α（ｈ）を算出し、α（ｈ）の最小値を、次の変数αとして、上記の処理を繰り返す。変数αが無限大になるまで上記の処理を繰り返し、処理が終了した時点で、パレート最適性を満たす翻訳候補ｈ^＊として抽出された翻訳候補の集合を、パレートフロンティアとする。 For each remaining translation candidate h, α (h) is calculated using the following equation (2), and the above processing is repeated with the minimum value of α (h) as the next variable α. The above processing is repeated until the variable α becomes infinite. When the processing is completed, a set of translation candidates extracted as translation candidates h ^* satisfying Pareto optimality is defined as a Pareto frontier.

最適化部２６は、以下に説明するように、翻訳評価部２４により算出された評価尺度スコア及びパレート識別部２５による識別結果に基づいて、複数の翻訳特徴モデルに対する重みを最適化する。 As will be described below, the optimization unit 26 optimizes the weights for the plurality of translation feature models based on the evaluation scale score calculated by the translation evaluation unit 24 and the identification result by the Pareto identification unit 25.

パレート最適性を満たす翻訳候補は、複数の評価尺度スコアを考慮して得られる翻訳であるため、良い翻訳であると考えられる。そこで、最適化部２６は、翻訳候補リストの中にあるパレート最適性を満たす翻訳候補に、高い翻訳尺度スコアがつくように、各翻訳特徴モデルの重みを調整する。本実施の形態では、最適化部２６において、ＭＥＲＴを拡張した多目的最適化手法を用いる。ＭＥＲＴはＰＯＷＥＬＬ法などの勾配なしアルゴリズム（Ｇｒａｄｉｅｎｔ−Ｆｒｅｅｍｅｔｈｏｄ)を使用して重みを更新する最適化手法である。 A translation candidate that satisfies the Pareto optimality is a translation obtained in consideration of a plurality of evaluation scale scores, and is therefore considered to be a good translation. Therefore, the optimization unit 26 adjusts the weight of each translation feature model so that a translation candidate satisfying the Pareto optimality in the translation candidate list has a high translation scale score. In the present embodiment, the optimization unit 26 uses a multi-objective optimization technique obtained by extending MERT. MERT is an optimization method for updating weights using a gradient-free method such as the POWELL method.

本実施の形態で用いる多目的最適化手法は以下の手順で動作する。 The multi-objective optimization method used in the present embodiment operates according to the following procedure.

まず、ランダムに決定された重みベクトルＷを用いて、各開発データについて、翻訳候補リストを各々求める。そして、複数の開発データの各々について翻訳評価部２４により算出された評価尺度スコア及びパレート識別部２５による識別結果に基づいて、以下のように、累計スコアを算出する。 First, a translation candidate list is obtained for each development data using a randomly determined weight vector W. Then, based on the evaluation scale score calculated by the translation evaluation unit 24 and the identification result by the Pareto identification unit 25 for each of a plurality of development data, a cumulative score is calculated as follows.

パレート最適性を満たさない翻訳候補に対しては、Ｍ_１（Ｈ）＋Ｍ_２（Ｈ）の合計スコアを与え、一方、パレートフロンティアに含まれる翻訳候補（パレート最適性を満たす翻訳候補）に対しては、Ｍ_１＿ＭＡＸ＋Ｍ_２＿ＭＡＸの合計スコアを与える。ここで、Ｍ_１＿ＭＡＸは翻訳候補リスト中で、一方の評価尺度の最大スコアであり、Ｍ_２＿ＭＡＸは、翻訳候補リスト中で他方の評価尺度の最大スコアを表す。 For translation candidates that do not satisfy Pareto optimality, a total score of M ₁ (H) + M ₂ (H) is given, while for translation candidates included in Pareto frontier (translation candidates that satisfy Pareto optimality) Gives the total score of M ₁ _MAX + M ₂ _MAX. Here, M ₁ _MAX is the maximum score of one evaluation scale in the translation candidate list, and M ₂ _MAX represents the maximum score of the other evaluation scale in the translation candidate list.

各候補リストに対して、上記の合計スコアを算出し、全開発データにおける合計スコアを累計することにより、累計スコアを算出する。 For each candidate list, the above total score is calculated, and the total score in all the development data is totaled to calculate the total score.

重みベクトルＷを繰り返しランダムに決定して、重みベクトルＷを毎回更新し、累計スコアを算出する。累計スコアが最大となる重みベクトルＷを特定し、最適化された重みベクトルＷとして、特徴量重み記憶部２２に記憶されている重みベクトルを更新する。 The weight vector W is repeatedly determined at random, the weight vector W is updated each time, and the cumulative score is calculated. The weight vector W with the maximum cumulative score is specified, and the weight vector stored in the feature amount weight storage unit 22 is updated as the optimized weight vector W.

翻訳最適化装置１００は、翻訳部２３、翻訳評価部２４、パレート識別部２５、及び最適化部２６による一連の処理を繰り返し行う。重みベクトルを更新する毎に、累計スコアの最大値が次第に改善される。累計スコアの最大値の改善が止まったら、一連の処理の繰り返しを終了する。例えば、所定回数以上繰り返しても累計スコアの改善が止まったままである場合に、一連の処理の繰り返しを終了する。そして、特徴量重み記憶部２２に最終的に記憶されている、最大の累計スコアを与える重みベクトルを、最適化の結果とする。 The translation optimization apparatus 100 repeatedly performs a series of processes by the translation unit 23, the translation evaluation unit 24, the Pareto identification unit 25, and the optimization unit 26. Every time the weight vector is updated, the maximum value of the cumulative score is gradually improved. When the improvement of the maximum value of the cumulative score stops, the repetition of the series of processes is terminated. For example, when the improvement of the cumulative score has stopped even after being repeated a predetermined number of times or more, the series of processing is terminated. Then, the weight vector that finally stores the feature amount weight storage unit 22 and gives the maximum cumulative score is set as the optimization result.

＜翻訳最適化装置の作用＞
次に、本実施の形態に係る翻訳最適化装置１００の作用について説明する。まず、日本語の入力文と英語の正しい翻訳文とからなる開発データが、翻訳最適化装置１００に複数入力されると、翻訳最適化装置１００によって、入力された複数の開発データが、開発データ記憶部２１へ格納される。そして、翻訳最適化装置１００によって、図４に示す翻訳最適化処理ルーチンが実行される。 <Operation of translation optimization device>
Next, the operation of translation optimization apparatus 100 according to the present embodiment will be described. First, when a plurality of development data composed of a Japanese input sentence and a correct English translation is input to the translation optimization apparatus 100, the plurality of development data input by the translation optimization apparatus 100 is converted into the development data. It is stored in the storage unit 21. Then, the translation optimization apparatus 100 executes the translation optimization processing routine shown in FIG.

まず、ステップＳ１０１において、開発データ記憶部２１から複数の開発データの全てを取得する。そして、ステップＳ１０２において、各翻訳特徴モデルに対する重みをランダムに決定し、特徴量重み記憶部２２に格納する。ステップＳ１０３では、上記ステップＳ１０１において取得した各開発データの入力文について、翻訳処理を行い、翻訳特徴モデルに対する特徴量の重み付き加算による得られる翻訳スコアが上位ｋ個となる翻訳候補からなる翻訳文の出力候補リストを、各開発データについて生成する。 First, in step S101, all of a plurality of development data is acquired from the development data storage unit 21. In step S <b> 102, a weight for each translation feature model is determined at random and stored in the feature amount weight storage unit 22. In step S103, the input sentence of each development data acquired in step S101 is subjected to translation processing, and a translation sentence composed of translation candidates whose translation score obtained by weighted addition of feature amounts to the translation feature model is the top k. Output candidate list is generated for each development data.

次のステップＳ１０４では、各開発データに対して、当該開発データについて上記ステップＳ１０３で生成された翻訳文の出力候補リストに含まれる各翻訳候補について、複数種類の翻訳評価尺度スコアを計算する。 In the next step S104, for each development data, a plurality of types of translation evaluation scale scores are calculated for each translation candidate included in the translation candidate output list generated in step S103 for the development data.

そして、ステップＳ１０５において、各開発データに対して、当該開発データの各翻訳候補について上記ステップＳ１０４で算出された複数種類の翻訳評価尺度スコアに基づいて、翻訳候補リストに含まれる各翻訳候補がパレート最適性を満たすか否かを識別し、各開発データに対して、パレートフロンティアを求める。 In step S105, for each development data, each translation candidate included in the translation candidate list is Pareto based on the plurality of types of translation evaluation scale scores calculated in step S104 for each translation candidate of the development data. Identify whether or not the optimality is satisfied, and obtain a Pareto frontier for each development data.

次のステップＳ１０６では、各開発データに対して、上記ステップＳ１０４で計算された各翻訳候補の翻訳評価尺度スコアと、上記ステップＳ１０５で求められたパレートフロンティアとに基づいて、各翻訳候補の合計スコアを計算し、各開発データの合計スコアを累計して、累計スコアを算出する。算出した累計スコアに基づいて、最大の累計スコアを与える、各翻訳特徴モデルに対する重みを特定する。そして、ステップＳ１０７において、特徴量重み記憶部２２に記憶された各翻訳特徴モデルの重みを、上記ステップＳ１０６で特定された各翻訳特徴モデルに対する重みに更新する。 In the next step S106, for each development data, based on the translation evaluation scale score of each translation candidate calculated in step S104 and the Pareto frontier obtained in step S105, the total score of each translation candidate Is calculated, and the total score of each development data is accumulated to calculate the cumulative score. Based on the calculated cumulative score, the weight for each translation feature model that gives the maximum cumulative score is specified. In step S107, the weight of each translation feature model stored in the feature amount weight storage unit 22 is updated to the weight for each translation feature model specified in step S106.

そして、ステップＳ１０８において、繰り返し処理を終了するか否かを判定する。例えば、上記ステップＳ１０６で算出された累計スコアの最大値が所定回数連続して改善していない場合には、繰り返し処理を終了すると判定し、翻訳最適化処理ルーチンを終了する。このとき特徴量重み記憶部２２に記憶された各翻訳特徴モデルの重みが、最適化された重みとなる。 In step S108, it is determined whether or not to end the repetition process. For example, if the maximum value of the cumulative score calculated in step S106 is not continuously improved a predetermined number of times, it is determined that the iterative process is to be terminated, and the translation optimization process routine is terminated. At this time, the weight of each translation feature model stored in the feature amount weight storage unit 22 is an optimized weight.

一方、繰り返し処理を終了しないと判定された場合には、上記ステップＳ１０２へ戻り、各翻訳特徴モデルに対する重みをランダムに決定して、上記ステップＳ１０３〜ステップＳ１０７の処理を繰り返す。 On the other hand, if it is determined not to end the repetitive process, the process returns to step S102, the weight for each translation feature model is determined at random, and the processes of steps S103 to S107 are repeated.

特徴量重み記憶部２２に記憶された各翻訳特徴モデルの重みが出力されて、例えば、機械翻訳装置（図示省略）に入力され、各翻訳特徴モデルの重みを用いて、機械翻訳処理が行われる。 The weight of each translation feature model stored in the feature amount weight storage unit 22 is output and input to, for example, a machine translation device (not shown), and machine translation processing is performed using the weight of each translation feature model. .

＜実験結果＞
次に、１６個の開発データに対して実験を行った結果について説明する。４つのタイプ(ＥＭＥＡ:医療翻訳、Ｏｐｅｎｓｕｂ:映画字幕翻訳、Ｅｕｒｏｐａｒｌ:国会スピーチの翻訳、ＮＩＳＴ:新聞やブログの翻訳)の開発データを用いて実験を行った。翻訳元言語は６種類(ＤＥ:ドイツ語、ＥＳ:スペイン語、ＦＲ:フランス語、ＮＬ:オランダ語、ＳＶ:スウェーデン、ＺＨ:中国語)とし、翻訳先言語を、英語とした。上記の第１の実施の形態で説明した最適化方法を含めて５つの最適化法を比較した。 <Experimental result>
Next, the results of experiments performed on 16 pieces of development data will be described. Experiments were conducted using development data of four types (EMEA: medical translation, Opensub: movie subtitle translation, Europarl: translation of Diet speech, NIST: translation of newspapers and blogs). There were six translation languages (DE: German, ES: Spanish, FR: French, NL: Dutch, SV: Sweden, ZH: Chinese), and the target language was English. Five optimization methods were compared including the optimization method described in the first embodiment.

比較対象として、ＢＬＥＵの評価尺度スコアを目的関数にしたＭＥＲＴ最適化手法、ＴＥＲの評価尺度スコアを目的関数にしたＭＥＲＴ最適化手法、ＢＬＥＵの評価尺度スコアとＴＥＲの評価尺度スコアを目的関数にしたＭＥＲＴ最適化手法を用いた実験を行った。また、パレート識別部２５で、後述するスカイラインアルゴリズムを用いた多目的最適化手法についても、同様に実験を行った。 As comparison targets, the MERT optimization method using the BLEU evaluation scale score as an objective function, the MERT optimization method using the TER evaluation scale score as an objective function, and the BLEU evaluation scale score and the TER evaluation scale score as an objective function Experiments using the MERT optimization technique were conducted. In addition, the Pareto identification unit 25 similarly performed an experiment on a multi-objective optimization method using a skyline algorithm described later.

医療翻訳の開発データについて実験を行った結果を以下の表１に示す。実験の結果として、ＢＬＥＵ値と１−ＴＥＲ値(高いスコアがよりよい)を、各最適化法について求めた。 The results of experiments on medical translation development data are shown in Table 1 below. As a result of the experiment, BLEU values and 1-TER values (higher scores are better) were determined for each optimization method.

上記表１で、「COMBINED」は、ＢＬＥＵの評価尺度スコアとＴＥＲの評価尺度スコアを目的関数にしたＭＥＲＴ最適化手法を用いた場合を示す。「SKYLINE」は、スカイラインアルゴリズムを使用した多目的最適化手法を用いた場合を示す。 In Table 1 above, “COMBINED” indicates a case where the MERT optimization method using the BLEU evaluation scale score and the TER evaluation scale score as an objective function is used. “SKYLINE” indicates the case where a multi-objective optimization method using the skyline algorithm is used.

「CVXHULL」は、上記の第１の実施の形態で説明した凸包アルゴリズムを使用した多目的最適化手法を用いた場合を示す。「MERT(BLEU)」は、ＢＬＥＵの評価尺度スコアを目的関数にしたＭＥＲＴ最適化手法を用いた場合を示す。「MERT(TER)」は、ＴＥＲの評価尺度スコアを目的関数にしたＭＥＲＴ最適化手法を用いた場合を示す。 “CVXHULL” indicates a case where the multi-objective optimization method using the convex hull algorithm described in the first embodiment is used. “MERT (BLEU)” indicates a case where a MERT optimization method using an evaluation function score of BLEU as an objective function is used. “MERT (TER)” indicates a case where a MERT optimization method using an evaluation scale score of TER as an objective function is used.

映画字幕翻訳の開発データについて実験を行った結果を以下の表２に示す。 Table 2 below shows the results of experiments conducted on development data for movie subtitle translation.

国会スピーチの翻訳の開発データ、及び新聞やブログの翻訳の開発データについて実験を行った結果を以下の表３に示す。 Table 3 below shows the results of experiments conducted on development data for translations of Diet speech and development data for translations of newspapers and blogs.

上記表１〜表３において、＊は最もよいシステムを示す。また、太字（ボールド）は両方の評価尺度を使用したシステム（Combined, Skyline, CvxHull）のなかで一番良いものを示す。 In Tables 1 to 3, * indicates the best system. Bold (bold) indicates the best of the systems (Combined, Skyline, CvxHull) using both evaluation scales.

上記の実験結果から、複数の目的関数を使う場合、翻訳元言語と評価尺度の組み合わせのほとんどにおいて、上記の第１の実施の形態で提案した手法（スカイラインアルゴリズムを用いた手法も含む）で、「Combined」よりもすぐれた最適化が実現できていることがわかった。特に、上記の第１の実施の形態で提案した手法は、１−ＴＥＲ値で、「MERT(TER)」よりいい結果を出していることは特筆に値する。以上の実験結果から、上記の第１の実施の形態で提案した、パレート最適性に基づく最適化が有効な方法であることが分かった。 From the above experimental results, when using a plurality of objective functions, most of the combinations of the source language and the evaluation scale are the methods proposed in the first embodiment (including the method using the skyline algorithm), It turned out that the optimization which was superior to "Combined" was realized. In particular, it is worthy of special mention that the method proposed in the first embodiment produces a better result than “MERT (TER)” with a 1-TER value. From the above experimental results, it was found that the optimization based on the Pareto optimality proposed in the first embodiment is an effective method.

以上説明したように、本実施の形態に係る翻訳最適化装置によれば、翻訳文の出力候補リストの各翻訳候補について複数種類の評価尺度スコアを算出して、パレート最適性を満たすか否かを識別し、識別結果に応じて、各翻訳特徴モデルに対する重みを更新することにより、複数の評価尺度スコアを同時に用いて、複数の翻訳特徴モデルに対する重みの最適化を行うことができる。 As described above, according to the translation optimization device according to the present embodiment, whether or not the Pareto optimality is satisfied by calculating a plurality of types of evaluation scale scores for each translation candidate in the output candidate list of the translation sentence. By updating the weights for each translation feature model according to the identification result, the weights for the plurality of translation feature models can be optimized using a plurality of evaluation scale scores simultaneously.

複数の評価尺度についてパレート最適性を満たす翻訳候補（パレートポイント）に高い評価スコアがつくように翻訳特徴モデルの特徴量の重みを調整することにより、複数の評価尺度の観点で、バランスの取れた機械翻訳システムを構築することができる。 By adjusting the feature weights of the translation feature model so that translation candidates that satisfy Pareto optimality (Pareto points) for multiple evaluation scales have a high evaluation score, a balance can be achieved from the viewpoint of multiple evaluation scales. A machine translation system can be constructed.

なお、上記の実施の形態では、翻訳部２３、翻訳評価部２４、パレート識別部２５、及び最適化部２６による一連の処理を繰り返し行う毎に、毎回、翻訳特徴モデルに対する重みをランダムに決定する場合を例に説明したが、これに限定されるものではない。翻訳評価部２４、パレート識別部２５、及び最適化部２６による一連の処理を繰り返し行う毎に、算出された累計スコアに基づいて、例えばＰＯＷＥＬＬ法を用いて、最適化方向に翻訳特徴モデルに対する重みを更新するようにしてもよい。 In the above embodiment, each time a series of processes by the translation unit 23, the translation evaluation unit 24, the Pareto identification unit 25, and the optimization unit 26 are repeated, a weight for the translation feature model is randomly determined. Although the case has been described as an example, the present invention is not limited to this. Each time a series of processes by the translation evaluation unit 24, the Pareto identification unit 25, and the optimization unit 26 are repeatedly performed, the weight for the translation feature model is optimized in the optimization direction based on the calculated cumulative score, for example, using the POWELL method. May be updated.

＜第２の実施の形態＞
次に、第２の実施の形態について説明する。なお、第２の実施の形態に係る翻訳最適化装置は、第１の実施の形態と同様の構成であるため、同一符号を付して説明を省略する。 <Second Embodiment>
Next, a second embodiment will be described. Note that the translation optimizing device according to the second embodiment has the same configuration as that of the first embodiment, and thus the same reference numerals are given and description thereof is omitted.

第２の実施の形態では、各翻訳特徴モデルに対する重みを最適化するアルゴリズムが、第１の実施の形態と異なっている。 In the second embodiment, an algorithm for optimizing the weight for each translation feature model is different from that in the first embodiment.

第２の実施の形態に係る翻訳最適化装置の最適化部２６は、ＭＩＲＡを拡張した多目的最適化手法を用いる。ここで想定する識別器は、翻訳候補をポジティブクラスとネガティブクラスに二値分類するものである。ＭＩＲＡを拡張した多目的最適化法は、当該識別器の特徴量の重みをオンライン的に更新するもので、以下の手順で動作する。 The optimization unit 26 of the translation optimization apparatus according to the second embodiment uses a multi-objective optimization technique that extends MIRA. The classifier assumed here is one that binary classifies translation candidates into a positive class and a negative class. The multi-objective optimization method, which is an extension of MIRA, updates the feature weights of the classifier online, and operates according to the following procedure.

まず、複数の開発データの各々について、パレート識別部２５による識別結果に基づいて、パレート最適性を満たさない翻訳候補に対しては、ネガティブのクラスに分類し、パレートフロンティアに含まれる翻訳候補（パレート最適性を満たす翻訳候補）に対しては、ポジティブのクラスに分類する。各開発データの翻訳候補リストに含まれる各翻訳候補が、ポジティブである場合には、重みの更新を行わない。一方、ネガティブである翻訳候補が存在する場合には、各翻訳特徴モデルの重みを更新する。重みの更新方法は、単なる勾配法でもいいし、複雑な方法（例えば、ｍａｒｇｉｎ−ｉｎｆｕｓｅｄｕｐｄａｔｅ）を用いてもよい。 First, for each of a plurality of development data, based on the identification result by the Pareto identification unit 25, translation candidates that do not satisfy the Pareto optimality are classified into a negative class, and the translation candidates (Pareto) included in the Pareto frontier Translation candidates that satisfy the optimality are classified into positive classes. If each translation candidate included in the translation candidate list of each development data is positive, the weight is not updated. On the other hand, when there is a negative translation candidate, the weight of each translation feature model is updated. The weight update method may be a simple gradient method or a complicated method (for example, margin-infused update).

翻訳最適化装置１００は、翻訳部２３、翻訳評価部２４、パレート識別部２５、及び最適化部２６による一連の処理を繰り返し行う。重みベクトルが更新されなくなるまで、一連の処理を繰り返す。 The translation optimization apparatus 100 repeatedly performs a series of processes by the translation unit 23, the translation evaluation unit 24, the Pareto identification unit 25, and the optimization unit 26. A series of processing is repeated until the weight vector is not updated.

第２の実施の形態に係る翻訳最適化処理ルーチンでは、まず、開発データ記憶部２１から複数の開発データの全てを取得する。そして、各翻訳特徴モデルに対する重みをランダムに決定し、特徴量重み記憶部２２に格納する。上記で取得した各開発データの入力文について、翻訳処理を行い、翻訳特徴モデルに対する特徴量の重み付き加算による得られる翻訳スコアが上位ｋ個となる翻訳候補からなる出力候補リストを、各開発データについて生成する。 In the translation optimization processing routine according to the second embodiment, first, all of a plurality of development data is acquired from the development data storage unit 21. Then, a weight for each translation feature model is randomly determined and stored in the feature amount weight storage unit 22. The input sentence of each development data acquired above is subjected to translation processing, and an output candidate list consisting of translation candidates with the top k translation scores obtained by weighted addition of feature amounts to the translation feature model is displayed for each development data. Generate about.

次に、各開発データに対して、当該開発データについて上記で生成された出力候補リストに含まれる各翻訳候補について、複数種類の翻訳評価尺度スコアを計算する。 Next, for each development data, a plurality of types of translation evaluation scale scores are calculated for each translation candidate included in the output candidate list generated above for the development data.

そして、各開発データに対して、当該開発データの各翻訳候補について上記で算出された複数種類の翻訳評価尺度スコアに基づいて、翻訳候補リストに含まれる各翻訳候補がパレート最適性を満たすか否かを識別し、各開発データに対して、パレートフロンティアを求める。 For each development data, whether or not each translation candidate included in the translation candidate list satisfies the Pareto optimality based on the plurality of types of translation evaluation scale scores calculated above for each translation candidate of the development data And identify the Pareto frontier for each development data.

次に、各開発データに対して、上記で求められたパレートフロンティアに基づいて、各翻訳候補のクラスを分類し、ネガティブとなる翻訳候補が存在する場合には、各翻訳特徴モデルに対する重みを最適化する。そして、特徴量重み記憶部２２に記憶された各翻訳特徴モデルの重みを、上記で最適化された各翻訳特徴モデルに対する重みに更新する。 Next, for each development data, classify each translation candidate class based on the Pareto frontier obtained above, and if there is a negative translation candidate, assign the optimal weight to each translation feature model Turn into. And the weight of each translation feature model memorize | stored in the feature-value weight memory | storage part 22 is updated to the weight with respect to each translation feature model optimized above.

そして、繰り返し処理を終了するか否かを判定する。例えば、重みが更新されなくなった場合には、繰り返し処理を終了すると判定し、翻訳最適化処理ルーチンを終了する。このとき特徴量重み記憶部２２に記憶された各翻訳特徴モデルの重みが、最適化された重みとなる。 Then, it is determined whether or not to end the repetition process. For example, when the weight is not updated, it is determined that the iterative process is finished, and the translation optimization process routine is finished. At this time, the weight of each translation feature model stored in the feature amount weight storage unit 22 is an optimized weight.

一方、繰り返し処理を終了しないと判定された場合には、上記で更新された各翻訳特徴モデルの重みを用いて上記の処理を繰り返す。 On the other hand, if it is determined not to end the repetitive process, the above process is repeated using the weight of each translation feature model updated as described above.

なお、第２の実施の形態に係る翻訳最適化装置の他の構成及び作用については、第１の実施の形態と同様であるため、説明を省略する。 Note that other configurations and operations of the translation optimization apparatus according to the second embodiment are the same as those of the first embodiment, and thus the description thereof is omitted.

このように、翻訳文の出力候補リストの各翻訳候補について識別した、パレート最適性を満たすか否かの識別結果を用いて、ＭＩＲＡを拡張した多目的最適化手法により、各翻訳特徴モデルに対する重みを更新することにより、複数の評価尺度スコアを同時に用いて、複数の翻訳特徴モデルに対する重みの最適化を行うことができる。 In this way, by using the identification result of whether or not the Pareto optimality identified for each translation candidate in the translated sentence output candidate list is used, a weight for each translation feature model is obtained by a multi-objective optimization method that extends MIRA. By updating, weights for a plurality of translation feature models can be optimized using a plurality of evaluation scale scores simultaneously.

なお、本発明は、上述した実施形態に限定されるものではなく、この発明の要旨を逸脱しない範囲内で様々な変形や応用が可能である。 The present invention is not limited to the above-described embodiment, and various modifications and applications are possible without departing from the gist of the present invention.

例えば、パレート識別部において、スカイラインアルゴリズムを用いて、パレートフロンティアを求めるようにしてもよい。なお、スカイラインアルゴリズムとしては、非特許文献（Parke Godfrey, Ryan Shipley, Jarek Gyrz, “Algorithms and analyses for maximal vector computation”, VLDB Journal, vol16, 2006）に記載された手法と同様のものを用いればよいため、詳細な説明を省略する。 For example, the Pareto identification unit may obtain the Pareto frontier using a skyline algorithm. The skyline algorithm may be the same as that described in non-patent literature (Parke Godfrey, Ryan Shipley, Jarek Gyrz, “Algorithms and analyzes for maximal vector computation”, VLDB Journal, vol16, 2006). Therefore, detailed description is omitted.

また、本願明細書中において、プログラムが予めインストールされている実施形態として説明したが、当該プログラムを、コンピュータ読み取り可能な記録媒体に格納して提供することも可能である。 In the present specification, the embodiment has been described in which the program is installed in advance. However, the program can be provided by being stored in a computer-readable recording medium.

１０入力部
２０演算部
２１開発データ記憶部
２２特徴量重み記憶部
２３翻訳部
２４翻訳評価部
２５パレート識別部
２６最適化部
１００翻訳最適化装置 DESCRIPTION OF SYMBOLS 10 Input part 20 Calculation part 21 Development data storage part 22 Feature-value weight storage part 23 Translation part 24 Translation evaluation part 25 Pareto identification part 26 Optimization part 100 Translation optimization apparatus

Claims

A translation processing means for translating a word string in the source language into a plurality of candidates h in a word string in the target language, based on a plurality of types of translation feature models and weights for each of the plurality of types of translation feature models;
Translation for calculating a plurality of scores M ₁ (h) and M ₂ (h) indicating a plurality of types of translation evaluation scales for each of a plurality of candidates h of the word string in the target language translated by the translation processing means An evaluation means;
Based on the plurality of scores M ₁ (h) and M ₂ (h) calculated for each of the plurality of candidates h , the variable α is gradually increased from 0 to infinity using the following equation: For each value of the variable α, candidates h ^* satisfying Pareto optimality are extracted from the plurality of candidate sets L, and candidates whose score M ₂ (h) is smaller than the extracted candidates h ^* are selected. Pareto identifying means for identifying as a candidate that does not satisfy the Pareto optimality ;
Weight updating means for updating weights for each of the plurality of types of translation feature models according to the identification result by the Pareto identification means;
Optimization for optimizing the weight for each of the plurality of types of translation feature models by repeating the translation by the translation processing unit, the calculation by the translation evaluation unit, the identification by the Pareto identification unit, and the update by the weight update unit Means,
Translation optimization device including

The translation processing means translates each of the plurality of word strings in the translation source language into a plurality of candidates for the word string in the translation destination language,
The weight updating means, when each candidate for a plurality of word strings of the translation destination language is identified as satisfying Pareto optimality according to the identification result by the Pareto identification means, Given the maximum value of the score, for each candidate for the plurality of word strings, calculate a cumulative score by accumulating the total value of each score, and based on the calculated cumulative score, the plurality of types of translation feature models claim 1 Symbol placement translation optimization apparatus updates the weighting for each.

The weight updating means determines the plurality of types of translations when at least one of the candidates for the translation target language word string is determined not to satisfy Pareto optimality according to the identification result by the Pareto identification means. claim 1 Symbol placement translation optimization apparatus updates the weights for each of the feature model.

A translation optimization method in a translation optimization device including a translation processing means, a translation evaluation means, a Pareto identification means, a weight update means, and an optimization means,
The translation optimization device includes:
The translation processing means translates a word string in the source language into a plurality of candidate words h in the target language based on the weights for each of the plurality of types of translation feature models and the plurality of types of translation feature models. Steps,
A plurality of scores M ₁ (h) and M ₂ ( M ₂ ) indicating a plurality of types of translation evaluation scales for each of a plurality of candidates h of the word string in the target language translated by the translation processing unit by the translation evaluation unit. h) calculating;
Based on the plurality of scores M ₁ (h) and M ₂ (h) calculated for each of the plurality of candidates h by the Pareto identification unit , the variable α is changed from 0 to infinity using the following equation: while gradually increasing until, at each value of the variable alpha, the set L of the plurality of candidate, extracts the candidate h ^* satisfying Pareto optimality, the more the extracted candidate h ^* score M _{2 (h} ) For which the candidate is smaller as a candidate that does not satisfy the Pareto optimality ;
Updating the weight for each of the plurality of types of translation feature models by the weight updating means according to the identification result by the Pareto identifying means;
The optimization unit repeats the translation by the translation processing unit, the calculation by the translation evaluation unit, the identification by the Pareto identification unit, and the update by the weight update unit, whereby weights for each of the plurality of types of translation feature models are obtained. Steps to optimize
A translation optimizing method comprising:

The step of translating by the translation processing means translates each of the plurality of word strings in the source language into a plurality of candidates for the word strings in the target language,
The step of updating by the weight updating means is performed when each candidate for a plurality of word strings in the translation destination language is identified as satisfying Pareto optimality according to the identification result by the Pareto identifying means. For each candidate for the plurality of word strings, a cumulative score obtained by accumulating the total value of the scores is calculated as each score for the plurality of word strings, and the plurality of scores are calculated based on the calculated cumulative score. The translation optimization method according to claim 4 , wherein the weight for each of the types of translation feature models is updated.

The step of updating by the weight updating means is performed when at least one of the candidates for the word string of the translation target language is identified as not satisfying the Pareto optimality according to the identification result by the Pareto identifying means. 4. Symbol placement translation optimization method to update the weight for each of the plurality of types of translation feature model.

The program for functioning a computer as each means of the translation optimization apparatus of any one of Claims 1-3 .