JP6649124B2

JP6649124B2 - Machine translation method, machine translation device and program

Info

Publication number: JP6649124B2
Application number: JP2016039350A
Authority: JP
Inventors: 菜々美藤原; 山内　真樹; 真樹山内
Original assignee: Panasonic Intellectual Property Corp of America
Current assignee: Panasonic Intellectual Property Corp of America
Priority date: 2015-05-25
Filing date: 2016-03-01
Publication date: 2020-02-19
Anticipated expiration: 2036-03-01
Also published as: JP2016218995A

Description

本発明は、複数言語間の翻訳を行う装置、複数言語間の翻訳を行う方法、及び、複数言語間の翻訳を行うシステムに関する。 The present invention relates to a device for translating between languages, a method for translating between languages, and a system for translating between languages.

近年のグローバル化に伴い、異なる言語を母国語とするユーザ同士のコミュニケーションを可能にする機械翻訳装置、機械翻訳システムの開発が行われている。また、機械翻訳機能を提供するサービスの運用も開始されており、例えば旅行会話などの場面で実際に利用されつつある。 With the recent globalization, a machine translation device and a machine translation system that enable communication between users having different languages as their native languages are being developed. In addition, the operation of a service that provides a machine translation function has also been started, and is being used in situations such as travel conversations.

特許第５６５３３９２号公報Japanese Patent No. 5653392 特開２００５−７８３１８号公報JP 2005-78318 A 特許第５０９７３４０号公報Japanese Patent No. 5097340 国際公開第２０１３／０１４８７７号WO 2013/014877

Ｐａｐｉｎｅｎｉ，Ｋ．，Ｒｏｕｋｏｓ，Ｓ．，Ｗａｒｄ，Ｔ．，ａｎｄＺｈｕ，Ｗ．Ｊ．「ＢＬＥＵ：ａｍｅｔｈｏｄｆｏｒａｕｔｏｍａｔｉｃｅｖａｌｕａｔｉｏｎｏｆｍａｃｈｉｎｅｔｒａｎｓｌａｔｉｏｎ」，Ｐｒｏｃ．ｏｆｔｈｅＡｎｎｕａｌＭｅｅｔｉｎｇｏｆｔｈｅＡｓｓｏｃｉａｔｉｏｎｏｆＣｏｍｐｕｔａｔｉｏｎａｌＬｉｎｇｕｉｓｔｉｃｓ（ＡＣＬ），ｐｐ．３１１−３１８，２００２Papineni, K .; Roukos, S .; , Ward, T.W. , And Zhu, W.W. J. "BLEU: a method for automatic evaluation of machine translation", Proc. pp. of the Annual Meeting of the Association of Computational Linguistics (ACL), 311-318, 2002 高地なつめ、磯崎秀樹、「スクランブリングを考慮した和訳の自動評価法のＮＴＣＩＲ−９データによる検証」、第２１９回自然言語処理研究発表会、岡山県立大、２０１４Natsume Takachi and Hideki Isozaki, "Verification of Automatic Evaluation of Japanese Translation Considering Scrambling Using NTCIR-9 Data", 219th Natural Language Processing Research Conference, Okayama Prefectural University, 2014 ＩｌｙａＳｕｔｓｋｅｖｅｒＰａｐｉｎｅｎｉ他、「ＳｅｑｕｅｎｃｅｔｏＳｅｑｕｅｎｃｅＬｅａｒｎｉｎｇｗｉｔｈＮｅｕｒａｌＮｅｔｗｏｒｋｓ」，ＡｄｖａｎｃｅｓｉｎＮｅｕｒａｌＩｎｆｏｒｍａｔｉｏｎＰｒｏｃｅｓｓｉｎｇＳｙｓｔｅｍｓ２７，ｐｐ．３１０４−３１１２，２０１４"Sequence to Sequence Learning with Neural Networks", Ilya Sustakever Papineni, and others in Advances in Neural Information Processing Systems, 27. 3104-3112, 2014 ＤｚｍｉｔｒｙＢａｈｄａｎａｕ他、「ＮｅｕｒａｌＭａｃｈｉｎｅＴｒａｎｓｌａｔｉｏｎｂｙＪｏｉｎｔｌｙＬｅａｒｎｉｎｇｔｏＡｌｉｇｎａｎｄＴｒａｎｓｌａｔｅ」，ａｒＸｉｖ：１４０９．０４７３ｖ５，ＩＣＬＲ２０１５Dzmitry Bahdanau et al., "Neural Machine Translation by Jointly Learning to Align and Translate", arXiv: 1409.0473v5, ICLR 2015

しかし、上記の機械翻訳装置、機械翻訳システムには、更なる改善が必要であった。 However, the above-mentioned machine translation apparatus and machine translation system required further improvement.

上記課題を解決するための、本発明の一態様は、言語情報を出力する情報出力装置へ接続し、第１言語と第２言語との間の翻訳処理を行なう機械翻訳システムにおける機械翻訳方法であって、前記第１言語の翻訳対象文を受信し、受信した前記翻訳対象文を前記第２言語へ翻訳した複数の異なる順翻訳文を生成し、前記複数の異なる前記順翻訳文の各々について前記第１言語へ逆翻訳した複数の逆翻訳文を生成し、前記情報出力装置において前記複数の逆翻訳文を出力しているときに、前記複数の逆翻訳文から一の逆翻訳文を選択する操作を受け付けた場合、前記一の逆翻訳文に対応する前記順翻訳文を出力する。 One embodiment of the present invention for solving the above problem is a machine translation method in a machine translation system connected to an information output device that outputs linguistic information and performing a translation process between a first language and a second language. Receiving the translation target sentence in the first language, generating a plurality of different forward translation sentences by translating the received translation target sentence into the second language, and for each of the plurality of different forward translation sentences Generating a plurality of back-translated sentences back-translated into the first language and selecting one back-translated sentence from the plurality of back-translated sentences when the information output device is outputting the plurality of back-translated sentences; When receiving the operation to perform the forward translation, the forward translation corresponding to the one reverse translation is output.

上記態様により、更なる改善が実現できた。 According to the above aspect, further improvement was realized.

本実施の形態における、機械翻訳システムの全体構成の一例を示す図である。FIG. 1 is a diagram illustrating an example of an overall configuration of a machine translation system according to the present embodiment. 本実施の形態における、情報表示端末構成を示すブロック図である。FIG. 3 is a block diagram illustrating an information display terminal configuration according to the present embodiment. 本実施の形態における、翻訳サーバの構成を示すブロック図である。FIG. 3 is a block diagram illustrating a configuration of a translation server in the present embodiment. 本実施の形態における、機械翻訳システムのハードウェア構成を示す図である。FIG. 2 is a diagram illustrating a hardware configuration of a machine translation system according to the present embodiment. 本実施の形態における、機械翻訳システムの動作を示すフローチャートである。5 is a flowchart illustrating an operation of the machine translation system according to the present embodiment. 本実施の形態における、翻訳文選択処理の具体的な動作を示すフローチャートである。It is a flowchart in this Embodiment which shows the specific operation | movement of the translation sentence selection process. 本実施の形態における、逆翻訳文選択処理の具体的な動作を示すフローチャートである。It is a flowchart in this Embodiment which shows the specific operation | movement of the reverse translation sentence selection process. 本実施の形態における、フレーズ評価処理の具体的な動作を示すフローチャートである。It is a flowchart in this Embodiment which shows the specific operation | movement of the phrase evaluation process. 本実施の形態における、学習処理の具体的な動作を示すフローチャートである。It is a flowchart in this Embodiment which shows the specific operation | movement of a learning process. 本実施の形態における、学習処理の具体的な動作を示すフローチャートである。It is a flowchart in this Embodiment which shows the specific operation | movement of a learning process. 本実施の形態における、一般的なフレーズテーブルの例である。It is an example of a general phrase table in the present embodiment. 本実施の形態における、フレーズ分割の概要を表わす説明図である。FIG. 3 is an explanatory diagram showing an outline of phrase division in the present embodiment. （Ａ）、（Ｂ）、（Ｃ）は、それぞれ、本実施の形態における、表示画面の一例を示す図である。(A), (B), (C) is a figure which shows an example of the display screen in this Embodiment, respectively. 本実施の形態における、表示画面の一例を示す図である。FIG. 3 is a diagram illustrating an example of a display screen in the present embodiment.

以下で説明する実施の形態は、いずれも本発明の一具体例を示すものである。以下の実施の形態で示される数値、形状、構成要素、ステップ、ステップの順序などは、一例であり、本発明を限定する主旨ではない。また、以下の実施の形態における構成要素のうち、最上位概念を示す独立請求項に記載されていない構成要素については、任意の構成要素として説明される。また全ての実施の形態において、各々の内容を組み合わせることも出来る。 Each of the embodiments described below shows a specific example of the present invention. Numerical values, shapes, components, steps, order of steps, and the like shown in the following embodiments are merely examples, and do not limit the present invention. In addition, among the components in the following embodiments, components not described in the independent claims indicating the highest concept are described as arbitrary components. Further, in all the embodiments, the respective contents can be combined.

（本発明に至った知見）
翻訳機器の開発が盛んに行われている状況において、異なる言語での円滑なコミュニケーションを実現するには、機械翻訳の精度が完全であることが望まれる。しかし、現状の機械翻訳においては、任意の文を誤りなく翻訳することは極めて困難であり、翻訳できる分野（ドメイン）を旅行会話といった形で限定することで翻訳精度を高めているが、完全には程遠い状況である。 (Knowledge that led to the present invention)
In a situation where translation devices are being actively developed, it is desired that the accuracy of machine translation be perfect in order to realize smooth communication in different languages. However, in the current machine translation, it is extremely difficult to translate an arbitrary sentence without error, and the translation accuracy is enhanced by limiting the field (domain) that can be translated in a form such as travel conversation. Is far from the situation.

ここで、機械翻訳技術は大きく３種類に分類される。本編ではそれぞれ、１）ルールベース機械翻訳（ＲＢＭＴ：Ｒｕｌｅ−ＢａｓｅｄＭａｃｈｉｎｅＴｒａｎｓｌａｔｉｏｎ）、２）統計的機械翻訳（ＳＭＴ：ＳｔａｓｔｉｓｔｉｃａｌＭａｃｈｉｎｅＴｒａｎｓｌａｔｉｏｎ）、３）ディープニューラルネットによるモデル獲得型の機械翻訳（ＤＮＮＭＴ：ＤｅｅｐＮｅｕｒａｌＮｅｔｗｏｒｋＭａｃｈｉｎｅＴｒａｎｓｌａｔｉｏｎ）と呼ぶことにする。 Here, machine translation techniques are roughly classified into three types. In this volume, 1) Rule-based Machine Translation (RBMT), 2) Statistical Machine Translation (SMT), 3) Model Acquisition Machine Translation by Deep Neural Network (DNNMT: Deep) It will be referred to as “Neural Network Machine Translation”.

１）ルールベース機械翻訳（ＲＢＭＴ）
ルールベース機械翻訳（ＲＢＭＴ）は、人手によって構築された変換規則（訳語の対をデータベースとして記憶したもの）を元に翻訳を行うシステムであり、原文と訳文のデータベースは翻訳メモリと表現されることもある。 1) Rule-based machine translation (RBMT)
Rule-based machine translation (RBMT) is a system that translates based on conversion rules (translation word pairs stored as a database) constructed by hand. The database of original and translated sentences is expressed as a translation memory. There is also.

長所として、「規則（翻訳するパターン）を厳密に定義できるため、原文（の一部）が当該規則内に存在する場合には、対応する部分の翻訳の精度が高くなり、規則内の翻訳表現を予め揃えておくことで、翻訳出力の翻訳表現についても一貫性も保ちやすい」ことが挙げられる。 An advantage is that, because the rules (patterns to be translated) can be strictly defined, if (a part of) the original text exists in the rules, the translation accuracy of the corresponding parts is increased, and the translation expression in the rules is increased. Are prepared in advance, it is easy to maintain consistency in the translation expression of the translation output. "

短所は、「規則が存在しない場合には翻訳精度が非常に低くなる、もしくは全く翻訳できず、想定以外の業界・分野への適応性も極めて低い」ことが挙げられる。また、基本的には人手によって規則を構築・拡充するため、開発に掛かるコストも高くなる。更に、翻訳性能を向上させるためには規則を追加していく必要があるが、ユーザ側で規則を構築して翻訳システムをカスタマイズするには、規則の設計ルールについて相当の知識が要求されるため、一般ユーザが気軽に使えるものとはなっていない。このため、ルールベース翻訳（ＲＢＭＴ）の主な展開先は、業務用翻訳（特許翻訳など）となっており、市販の業務用翻訳ソフトは、このルールベース翻訳や翻訳メモリを用いているものが多い。 Disadvantages include: "If there are no rules, translation accuracy will be very low, or translation will not be possible at all, and adaptability to unexpected industries and fields will be extremely low." In addition, since the rules are basically constructed and expanded manually, the cost for development is also high. Furthermore, it is necessary to add rules in order to improve translation performance, but considerable knowledge is required for rule design rules in order to construct rules on the user side and customize the translation system. However, it is not intended to be easily used by general users. For this reason, the main destination of rule-based translation (RBMT) is business translation (patent translation, etc.), and commercial translation software that uses this rule-based translation or translation memory is not available. Many.

２）統計的機械翻訳（ＳＭＴ）
統計的機械翻訳（ＳＭＴ）は、ＲＢＭＴのような規則（ルール）を作成せず、対訳コーパスと呼ばれる「原言語と目的言語双方での訳文対」を大量に準備し、この対訳コーパスから統計的に翻訳確率を計算し訳文を生成する手法である。グーグル（登録商標）やマイクロソフト（登録商標）等がウェブ上で提供する翻訳エンジンに広く用いられている。 2) Statistical machine translation (SMT)
Statistical machine translation (SMT) does not create rules like RBMT, but prepares a large number of “translation pairs in both the source language and target language” called a bilingual corpus, and uses this bilingual corpus for statistical Is a method of calculating the translation probability and generating a translated sentence. Google (registered trademark), Microsoft (registered trademark) and the like are widely used in translation engines provided on the web.

この手法は、翻訳確率を計算するにあたり、言語モデル（単言語における単語間の出現確率をｎ−ｇｒａｍモデルで表現するものが一般的）、及び、翻訳モデル（双言語間での、単語もしくはフレーズ単位での対応関係をアライメントモデルとして表現し、各単語間の語彙レベルでの訳語対関係を語彙モデルとして表現するものが一般的である）の２つのモデルについて、それぞれ対訳コーパスから統計的に確率を計算し、各モデルのパラメータ（確率値）を得ることで、統計的な翻訳を実現する手法である。一般的にＩＢＭモデルと呼ばれている。（日本語を含む場合は、「並び替えモデル」と呼ばれるモデルを構文解析等により追加する場合がある）。 When calculating the translation probability, this method uses a language model (generally expressing the occurrence probability between words in a monolingual language with an n-gram model) and a translation model (word or phrase between bilingual languages). In general, the correspondence between units is expressed as an alignment model, and the translated word pair relation between words at the vocabulary level is generally expressed as a vocabulary model.) Is calculated, and a parameter (probability value) of each model is obtained to realize statistical translation. It is generally called an IBM model. (If Japanese is included, a model called “rearranged model” may be added by parsing or the like).

このＳＭＴの長所として、ＲＢＭＴのようなルールを用いないことから、「『規則が存在しない場合に翻訳精度が非常に低くなる』と言った問題を未然に回避でき、汎化能力のある翻訳エンジンを構築できる」点が挙げられる。 The advantage of this SMT is that it does not use rules like RBMT, so it is possible to avoid the problem that "the translation accuracy becomes very low when there are no rules", and a translation engine with generalization ability. Can be constructed. "

一方、短所として、「確率的な表現に基づくため、確率計算の基となる対訳コーパスを大量に準備する必要がある」点がある。また、汎化性能が高くなる一方で、翻訳結果は「確率的なスコアが高い」出力に過ぎなくなるため、ＲＢＭＴで上手く翻訳できた事例についてＳＭＴでも同等の翻訳精度が得られる保証は無くなり、「全体／平均としては、比較的良い」翻訳では有るが、個別の翻訳ケースでは確実性に劣る場面も発生する。 On the other hand, the disadvantage is that it is necessary to prepare a large amount of bilingual corpora on which the probability calculation is based because it is based on stochastic expressions. In addition, while the generalization performance is improved, the translation result is merely an output having a “high stochastic score”. Therefore, there is no guarantee that the same translation accuracy can be obtained in the SMT for the case where the translation was successfully performed in the RBMT. Although the translation is relatively good as a whole / average, there are some cases where the individual translation cases are less reliable.

更に、確率計算を行う際には、各内部モデル（言語モデル、翻訳モデル等）が出力する確率値を掛け合わせる必要がある。この際、ＳＭＴの翻訳エンジン全体をチューニングするため、各モデルからの確率値に重み付けをして重みをパラメータとした機械学習を行うが、ここで用いられる機械評価値が文字通り「機械的な評価値（例えば、ＢＬＥＵと呼ばれる評価値）」となるため、必ずしもユーザの主観評価と一致しないことが報告されている（例えば、非特許文献２参照）。つまり、「機械評価値は高いがユーザの実際の評価とは結びつかない」と言った構造的な欠点も有している。 Further, when performing the probability calculation, it is necessary to multiply the probability values output from the internal models (language model, translation model, etc.). At this time, in order to tune the entire SMT translation engine, machine learning is performed by weighting the probability value from each model and using the weight as a parameter. The machine evaluation value used here is literally “mechanical evaluation value”. (For example, an evaluation value called BLEU) ", it is reported that the evaluation value does not always match the subjective evaluation of the user (for example, see Non-Patent Document 2). In other words, it has a structural drawback such as "The mechanical evaluation value is high but is not linked to the user's actual evaluation".

また、ユーザ側やシステム側で性能向上を図ろうとして、仮に対訳コーパスを数文追加したとしても、翻訳出力は、対訳コーパス全体を統計処理した上での確率的な振る舞いに左右されるため、性能向上に直結しない（性能が下がることも有り得る）と言った点も別の構造的な欠点として挙げられる。 Also, even if we try to improve the performance on the user side or system side, even if we add a few translation corpora, the translation output is affected by the stochastic behavior after statistically processing the entire translation corpus. Another structural disadvantage is that it does not directly lead to performance improvement (performance may decrease).

３）ディープニューラルネットによるモデル獲得型の機械翻訳（ＤＮＮＭＴ）
ディープニューラルネットによるモデル獲得型の機械翻訳（ＤＮＮＭＴ）は、ディープラーニング（ＤＬ：ＤｅｅｐＬｅａｒｎｉｎｇ）技術を機械翻訳に適用した比較的新しい翻訳技術である（例えば、非特許文献３，非特許文献４参照）。 3) Model-based machine translation using deep neural network (DNNMT)
Model acquisition type machine translation (DNNMT) using a deep neural network is a relatively new translation technology that applies deep learning (DL) technology to machine translation (for example, see Non-Patent Documents 3 and 4). ).

ＲＢＭＴやＳＭＴのような「人によって設計されたルールやモデル」に入力文や対訳コーパスを当てはめ／統計処理する翻訳手法とは異なり、システムが適切なモデル自体を学習することを特徴としている。現時点では黎明期の技術であり、機械翻訳としては実用化には至っていないが、類似技術として音声認識では既に、アップル社のＳｉｒｉ（登録商標）などで実用化されている。 Unlike translation methods that apply / statistically apply an input sentence or a bilingual corpus to “rules and models designed by humans” such as RBMT and SMT, the system is characterized by learning an appropriate model itself. At the present time, it is an early technology and has not yet been put into practical use as machine translation, but similar technology has already been put to practical use in Siri (registered trademark) of Apple Inc. in speech recognition.

長所として、適切なモデルの学習に成功した場合には、翻訳性能の向上が期待されており、特にＳＭＴを超える汎化性能や、ＲＢＭＴやＳＭＴが不得手とする比較的長い文（例えば４０以上の単語で構成される文）について翻訳性能をそれほど劣化させずに翻訳結果を出力することが期待されている。 As an advantage, when learning of an appropriate model is successful, improvement in translation performance is expected. In particular, generalization performance exceeding SMT, and relatively long sentences that RBMT or SMT are weak (for example, 40 or more It is expected that the translation result will be output without significantly degrading the translation performance of the sentence composed of the words of

短所としては、ＤＮＮＭＴ自身がどのようにモデルを学習するかについて、外部制御が困難な点が挙げられる。いわゆる「パラメータを用いたチューニング」に相当するような「パラメータ」が明示的に存在せず、内部の変数は非常に多いものの、どの変数が翻訳性能と直結しているかが不明である。つまり、ニューラルネットワークの構成と入力データ・教師データを決めた後は、どうシステムが学習するか・どのような性能が出るか制御困難であり、原理的にどうすれば性能向上が図れるのかが不明である。一般的なチューニング手法が適用困難であり、ＳＭＴ以上にユーザ側やシステム側で性能向上を図ろうとしても対応が困難である。仮に対訳コーパスを数文追加したとしても、翻訳出力はニューラルネットの学習に左右され、何が出てくるかはニューラルネットワークの内部変数が決まらないと分からない、という構造的な欠点が存在する。 The disadvantage is that external control of how DNNMT itself learns the model is difficult. There is no explicit "parameter" corresponding to so-called "tuning using parameters", and although there are many internal variables, it is unclear which variable is directly connected to translation performance. In other words, after determining the configuration of the neural network and the input data and teacher data, it is difficult to control how the system learns and what kind of performance it will produce, and it is unclear in principle how to improve the performance. . It is difficult to apply a general tuning method, and it is difficult to improve performance on the user side or the system side more than SMT. Even if a few sentences are added to the bilingual corpus, the translation output is affected by the learning of the neural network, and there is a structural disadvantage that what is to be output cannot be known unless the internal variables of the neural network are determined.

以上が、機械翻訳技術を大きく３種類に分類したときの、それぞれの特徴・長所・短所である。 The above are the features, advantages, and disadvantages when machine translation techniques are roughly classified into three types.

一方でユーザ観点から機械翻訳技術を見ると、翻訳したい言語である目的言語について、ユーザが見識を持っていれば、機械翻訳の精度があまり高くない場合でも、翻訳出力結果を元に会話を進めることができる可能性がある。つまり、機械翻訳の出力について、ユーザが「正しい／正しくない」を判断でき、有用な翻訳部分についてはそれを利用する、といった使い方が期待できる。 On the other hand, when looking at machine translation technology from the user's point of view, if the user has insight into the target language, which is the language to be translated, even if the accuracy of the machine translation is not very high, the conversation will proceed based on the translation output result Could be possible. In other words, it can be expected that the user can judge "correct / incorrect" for the output of the machine translation, and use the useful translation part.

しかし実際には、機械翻訳の出力結果をユーザが参照したとしても、その翻訳結果が示している内容を理解できないケースも容易に想像される。例えば、ユーザを日本人と想定した場合に、日本語をメジャーな言語（英語など）へ翻訳したいケースも有れば、比較的マイナーな言語（例えば、マレー語やベトナム語など）へ翻訳したいケースも有る。例えば、目的言語が英語である場合、英語に関して知見が有るユーザは、機械翻訳の結果を参考にして自らの英語での会話に役立てることが可能かも知れない。一方で、目的言語がマイナーな言語である場合には、ユーザに目的言語についての知見が全く無いことが多く、ユーザは機械翻訳結果の内容が正しいかどうかについて、何の情報も得ることが出来ない。つまり、目的言語の訳文が機械翻訳によって提示されたとしても、その内容を全く理解することが出来ないケースが発生する。言語は数千種類も存在するとされており、ほとんどのユーザにとって、知見のない言語の方が大多数であるとさえ言える。このように、目的言語の翻訳結果について、その意味内容を理解できない場合には、相手に伝わる意図を確認できないまま訳文を提示してしまうことにもつながり、コミュニケーションが破綻する恐れがある。 However, in practice, even if the user refers to the output result of the machine translation, it is easy to imagine a case where the contents indicated by the translation result cannot be understood. For example, if the user is assumed to be Japanese, there may be cases where we want to translate Japanese into major languages (such as English), and cases where we want to translate into relatively minor languages (such as Malay or Vietnamese). There is also. For example, when the target language is English, a user who has knowledge about English may be able to use the result of the machine translation to help his or her conversation in English. On the other hand, if the target language is a minor language, the user often has no knowledge of the target language, and the user can obtain no information about whether the contents of the machine translation result are correct. Absent. That is, even if a translation of the target language is presented by machine translation, there are cases where the contents cannot be understood at all. It is said that there are thousands of languages, and it can be said that even for most users, the language without knowledge is the majority. As described above, if the meaning of the translation result of the target language cannot be understood, a translation may be presented without confirming the intention to be transmitted to the other party, and communication may be broken.

このような現状の機械翻訳精度を少しでも向上させるためには、機械翻訳システムそのものが自動的に学習し、性能向上していくことが望まれる。機械翻訳システムが自動的に学習する際には、ユーザ（原言語話者）に負担をかけないことが期待され、その上で、ユーザが相手に伝わる意図を簡便な方法で確認しながら翻訳文を生成・利用できるように機械翻訳システムを構築する必要がある。機械翻訳システムには、ユーザの利用結果を用いて、自動学習することが要求される。また、当然のこととして、システムそのものに要求される計算機資源や開発コストを極力少なくすることも同時に求められる。 In order to improve the accuracy of the current machine translation as much as possible, it is desired that the machine translation system itself automatically learns and improves the performance. When the machine translation system automatically learns, it is expected that the user (source language speaker) will not be burdened. It is necessary to build a machine translation system that can generate and use text. The machine translation system is required to perform automatic learning using the usage results of the user. Naturally, it is also required to minimize computer resources and development costs required for the system itself.

すなわち、機械翻訳システムに求められることは、「（要件１）ユーザに負担を掛けず、ユーザが相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用できること」、「（要件２）ユーザの利用結果を用いて（新たにユーザに負担をかけること無く）、自動的に学習できること」、「（要件３）要件１・要件２の実現と同時に、計算機資源や開発コストを下げること」の３要件である。これらの３要件を同時に満たした機械翻訳システムの実現が課題となっている。 That is, what is required of a machine translation system is “(requirement 1) that a user can generate and use a translated sentence without burdening the user and confirming the intention of the user in a simple manner”, “(requirement 1) 2) To be able to learn automatically using the usage results of the user (without putting a new burden on the user) "," (Requirement 3) Realization of Requirement 1 and Requirement 2 and to reduce computer resources and development cost That is the three requirements. The realization of a machine translation system that satisfies these three requirements simultaneously has become an issue.

この課題を解決する手段として、例えば、ＲＢＭＴのように、原言語用例と目的言語用例を対応づけた用例をデータベースや規則・ルールとして持っておき、発話された入力文に対して類似する用例を取得し、原言語・目的言語両方で提示するもの（例えば、特許文献１）、入力文と逆翻訳（目的言語から原言語への翻訳）結果の距離を翻訳信頼度として算出し、必要に応じて信頼度と逆翻訳結果をあわせて提示し、ユーザに言い直しや言い換えを行わせるもの（例えば、特許文献４）、統計的機械翻訳（ＳＭＴ）システムの性能を評価する手法であるＢＬＥＵ（ＢｉＬｉｎｇｕａｌＥｖａｌｕａｔｉｏｎＵｎｄｅｒｓｔｕｄｙ）を用いるもの（例えば、非特許文献１）、入力文に対し、逆翻訳文をＮ個取得し、入力文と逆翻訳文の比較評価を行うもの（例えば、特許文献２）、対訳語句を対訳辞書に登録する際、対象とする翻訳装置に有効であるか否かを判別した結果を機械学習するもの（例えば、特許文献３）などがある。 As a means for solving this problem, for example, as in RBMT, an example in which a source language example and a target language example are associated with each other is stored as a database or a rule / rule, and an example similar to an uttered input sentence is used. Obtain and present in both the source language and the target language (for example, Patent Document 1), calculate the distance between the input sentence and the result of reverse translation (translation from the target language to the source language) as the translation reliability, and if necessary, (For example, Patent Literature 4), and BLEU (BiLingual), a technique for evaluating the performance of a statistical machine translation (SMT) system. Evaluation Understudy (for example, Non-Patent Document 1), N back-translated sentences are obtained for an input sentence, and a comparison between the input sentence and the back-translated sentence (For example, Patent Literature 2), and machine learning of the result of determining whether a translation is effective for a target translation device when registering a bilingual phrase in a bilingual dictionary (for example, Patent Literature 3) and so on.

特許文献１には、予め、原言語の用例と、原言語の用例の翻訳文である目的言語の用例とが対応づけられて用例格納部に格納されている内容が開示されている。 Patent Literature 1 discloses a content stored in an example storage unit in which an example of a source language and an example of a target language which is a translation of the example of the source language are associated with each other in advance.

また、特許文献１には、前述の用例格納部に格納された情報に基づいて、入力された原言語の文字列と、原言語の文字列に対応する目的言語の文字列とが、表示部において、それぞれ異なる表示エリアに表示される内容が開示されている。 Japanese Patent Application Laid-Open No. H11-163873 discloses, based on information stored in the above-described example storage unit, a character string of an input source language and a character string of a target language corresponding to the source language character string on a display unit. Discloses contents displayed in different display areas.

具体的には、入力された原言語の文字列に対して、その原言語の文字列に類似する複数の類似用例の文字列が同一の表示エリアに表示され、これに対して、入力された原言語の文字列および複数の類似用例の文字列の各々に対応する目的言語の文字列が、異なる表示エリアに表示される。 Specifically, for the input source language character string, a plurality of similar example character strings similar to the source language character string are displayed in the same display area. Character strings of the target language corresponding to the character strings of the source language and the character strings of the plurality of similar examples are displayed in different display areas.

原言語話者および目的言語話者は、入力文に対して意味を確認したい時に、用例を確認することができる。また、目的言語話者、または原言語話者が選択した類似文用例が相手の話者にもハイライト等で表示される。 The source language speaker and the target language speaker can check the example when they want to check the meaning of the input sentence. The similar sentence example selected by the target language speaker or the source language speaker is also displayed as a highlight or the like to the other speaker.

特許文献１では、入力文である原言語の文字列を取得し、データベース内の対訳例中にある類似用例を検索し、類似度が閾値以上の用例であれば類似用例であると判断して、入力文に対する類似用例として出力する。このとき、出力された類似用例に対するユーザの選択動作が行なわれた場合には、選択された用例をハイライト表示する内容も開示されている。 In Patent Document 1, a character string in the source language, which is an input sentence, is acquired, a similar example in a bilingual example in a database is searched, and if the similarity is equal to or more than a threshold, it is determined that the example is a similar example. Is output as a similar example to the input sentence. At this time, when the user performs an operation of selecting the output similar example, the content of highlighting the selected example is also disclosed.

特許文献２には、原文に対して順方向機械翻訳部１１で出力された順方向翻訳文を受け、逆方向翻訳部１２ａ、１２ｂ、１２ｃにおいて、逆方向翻訳文Ａ、Ｂ、Ｃを評価部１３で評価する内容が開示されている。 In Patent Document 2, a forward translation sentence output from a forward machine translation unit 11 is received for an original sentence, and backward translation units A, B and C are evaluated by backward translation units 12a, 12b and 12c by an evaluation unit. The contents to be evaluated at 13 are disclosed.

このときの、評価手法としては、非特許文献１に開示されている内容が一般的に知られている。非特許文献１においては、参照訳（人手で作成した正解訳）と機械翻訳で出力された訳との間のＮ−ｇｒａｍが一致した数を計算した上で、参照訳の長さの影響を加味した補正を行ってＢＬＥＵ値を取得する。ＢＬＥＵは翻訳精度の評価手法としてよく用いられているが、特に、日英・英日のような語順が大きく異なる言語間の翻訳においては、人手評価と相関が低いことが知られている（例えば、非特許文献２参照）。 As the evaluation method at this time, the contents disclosed in Non-Patent Document 1 are generally known. In Non-Patent Document 1, after calculating the number of N-grams between the reference translation (correct translation created manually) and the translation output by machine translation, the influence of the length of the reference translation is calculated. The BLEU value is obtained by performing the added correction. BLEU is often used as an evaluation method of translation accuracy, but it is known that especially in the translation between languages having greatly different word orders such as Japanese, English, and English and Japanese, the correlation with the manual evaluation is low (for example, , Non-Patent Document 2).

特許文献２には、さらに、逆翻訳文Ａ、Ｂ、Ｃと原文のＤＰマッチングを行なった後に、最大スコアを有する逆方向翻訳文と原文を出力する内容が開示されている。評価者がこれらを比較することによって順方向翻訳文の主観評価が可能となる。 Patent Literature 2 further discloses that after performing DP matching between the back-translated sentences A, B, and C and the original, the backward-translated sentence having the maximum score and the original are output. By the evaluator comparing these, the subjective evaluation of the forward-translated sentence becomes possible.

以上で述べたように、機械翻訳精度が完全でない状況下において翻訳を行う場合、ユーザ（原言語話者）に繰り返し入力などの負担をかけることなく、簡単にかつ相手に伝わる意図を確認しながら翻訳発話文を生成し、ユーザの選択を評価としてシステムそのものが評価／学習することが必要であった。 As described above, when performing translation in a situation where the machine translation accuracy is not perfect, the user (source language speaker) can easily and easily confirm the intention to be transmitted to the other party without burdening the user with repeated input. It was necessary for the system itself to evaluate / learn the translated utterance sentence and evaluate the user's selection as an evaluation.

機械翻訳システムに求められることを再掲すると、「（要件１）ユーザに負担を掛けず、ユーザが相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用できること」、「（要件２）ユーザの利用結果を用いて（新たにユーザに負担をかけること無く）、自動的に学習できること」、「（要件３）要件１・要件２の実現と同時に、計算機資源や開発コストを下げること」、の３要件を同時に満たすことである。 To restate what is required of a machine translation system, "(Requirement 1) A user can generate and use a translated sentence without burdening the user and confirming the intention of the user in a simple manner", "(Requirement 1) 2) To be able to learn automatically using the usage results of the user (without putting a new burden on the user) "," (Requirement 3) Realization of Requirement 1 and Requirement 2 and to reduce computer resources and development cost That is, to satisfy the three requirements at the same time.

この課題を解決する手段として、先に挙げた３種類の技術では次のようなアプローチが取られている。 As means for solving this problem, the following approaches are taken in the above three types of technologies.

１）ルールベース機械翻訳（ＲＢＭＴ）
前述のように、特許文献１では、入力された原言語の文字列から、類似する原言語の類似文を出力し、出力された原言語の類似文に対しても対応する翻訳文を選択可能に出力する。これによって、例えば、原言語の文字列が音声入力される場合などには、発話入力時の音声認識誤りなどによる入力ミスの影響を減少させ、伝えたい意図を簡便にユーザが選択できる。 1) Rule-based machine translation (RBMT)
As described above, in Patent Literature 1, a similar sentence of a similar source language is output from a character string of an input source language, and a corresponding translated sentence can be selected for the output similar sentence of the source language. Output to Thus, for example, when a character string in the source language is input by voice, the effect of an input error due to a voice recognition error at the time of inputting an utterance is reduced, and the user can easily select an intention to convey.

特許文献１の段落番号［００１２］には、「用例格納部１０５は、原言語の用例（以下、原言語用例ともいう）と目的言語の用例（以下、目的言語用例ともいう）とを関連づけて格納する。」との記述があり、用例格納部１０５が用例検索を行うデータベースとなって原言語と目的言語の対を格納している。この部分がルールベース翻訳のデータベースに相当している。 The paragraph number [0012] of Patent Document 1 states that “the example storage unit 105 associates an example of a source language (hereinafter also referred to as an example of a source language) with an example of a target language (hereinafter also referred to as an example of a target language). The example storage unit 105 serves as a database for performing an example search and stores pairs of the source language and the target language. This part corresponds to a rule-based translation database.

なお、特許文献１の段落番号［００１１］には「機械翻訳部１０３は、音声認識部１０２から原言語文字列を受け取り、原言語文字列を目的言語（第２言語ともいう）の文字列に機械翻訳し、翻訳結果の文字列である目的言語文字列を得る。機械翻訳の具体的な処理は、一般的な処理を行えばよいため、ここでの説明を省略する。」との記述があり、ここで機械翻訳部１０３について、ＳＭＴやＤＮＮＭＴを用いることによる利点や課題は特許文献１で開示されておらず、特許文献１における音声翻訳システム全体としては、用例格納部１０５によるＲＢＭＴとしての働きが特徴的に開示されているに過ぎない。 The paragraph number [0011] of Patent Document 1 states that “the machine translation unit 103 receives the source language character string from the speech recognition unit 102 and converts the source language character string into a target language (also referred to as a second language) character string. Machine translation is performed to obtain a target language character string which is a character string of the translation result. Since specific processing of machine translation may be performed by general processing, description thereof will be omitted here. " There is no advantage or problem of using the SMT or DNNMT for the machine translation unit 103 in Patent Literature 1, and the entire speech translation system in Patent Literature 1 uses the RBMT as the RBMT by the example storage unit 105. The work is only disclosed characteristically.

先に挙げたように、ＲＢＭＴは規則（ルール）に記述されていない入力文や、対応していない業界・分野の入力が有った場合に翻訳精度が著しく低下する、もしくは全く翻訳が出来ないという問題を有している。特許文献１においても、発話入力に対して用例格納部１０５に類似例が記述されていない場合や分野が異なる場合、翻訳精度が極めて低い翻訳文を提示する可能性が有り、目的言語に関する知識が無いユーザは相手に伝わる意図を確認できないまま訳文を提示してしまう（もしくは相手に伝わる意図を誤解したまま提示してしまう）ことに繋がりかねないが、こういった課題は開示されておらず解決策も記されていない。 As mentioned above, the translation accuracy of RBMT is significantly reduced when there is an input sentence not described in the rules or an input of an unsupported industry or field, or translation cannot be performed at all. There is a problem that. Also in Patent Document 1, when a similar example is not described in the example storage unit 105 for an utterance input or when the field is different, there is a possibility that a translated sentence with extremely low translation accuracy may be presented, and knowledge about the target language is not obtained. If there is no user, it may lead to presenting the translation without confirming the intention transmitted to the other party (or presenting the misunderstanding of the intention transmitted to the other party), but such a problem is not disclosed and solved No measures are described.

更に、用例格納部１０５に類似例が存在しなかった場合は、類似例そのものの提示が為されず、ユーザは音声認識結果の（誤りの多い）入力文を用いる他無い状況となる。翻訳結果についても、音声認識結果による入力文に対する機械翻訳部１０３の翻訳結果のみとなる。この場合、ユーザは「適切に伝えたい意図に則した文章」を「簡便に選択」することが出来ず、この課題についての解決策も特許文献１では開示されていない。 Furthermore, when no similar example exists in the example storage unit 105, the similar example itself is not presented, and the user has no choice but to use the input sentence (of which there are many errors) of the speech recognition result. The translation result is also only the translation result of the machine translation unit 103 for the input sentence based on the speech recognition result. In this case, the user cannot “simplely select” the “sentence according to the intention to convey appropriately”, and Patent Literature 1 does not disclose a solution to this problem.

すなわち、（要件１）について、「ユーザが（文章を）簡便に選択」する手段は、複数の類似文検索結果を提示し選択する手段を提供することで部分的に開示されているとも言えるが、ＲＢＭＴに相当する用例格納部１０５に類似例が存在しなかった場合は、選択肢そのものが失われる結果となり、完全には「ユーザが（文章を）簡便に選択」する手段は提供されていない。 That is, regarding (Requirement 1), it can be said that the means for “the user to easily select (sentence)” is partially disclosed by providing a means for presenting and selecting a plurality of similar sentence search results. If there is no similar example in the example storage unit 105 corresponding to the RBMT, the result is that the option itself is lost, and there is no completely provided means for "the user to select (sentence) simply".

更に「相手に伝わる意図の確認」についても、あくまで用例格納部１０５が保持している内容に検索がヒットした場合においてのみ限定的に解決されているに過ぎず、機械翻訳部１０３の翻訳結果出力には触れられていない。すなわち、機械翻訳部１０３の翻訳結果が、元の原言語文字列の内容に沿っているかどうかについては言及されておらず、（特許文献１の段落番号［００１１］に「機械翻訳部１０３は、音声認識部１０２から原言語文字列を受け取り、原言語文字列を目的言語（第２言語ともいう）の文字列に機械翻訳し、翻訳結果の文字列である目的言語文字列を得る。機械翻訳の具体的な処理は、一般的な処理を行えばよいため、ここでの説明を省略する。」との記載があり、機械翻訳部１０３の出力の正当性は担保されていない）、原言語文字列の内容とかけ離れた内容が機械翻訳部１０３の出力として出てきていたとしても、ユーザはそれを知る由もない。つまり、先の「目的言語の知識がないユーザは相手に伝わる意図を確認できないまま訳文を提示してしまう」という課題について、開示も解決もなされていない。 Further, "confirmation of intention to be transmitted to the other party" is only limitedly solved when a search hits the content held in the example storage unit 105, and the translation result output by the machine translation unit 103 is output. Is not touched. That is, it is not mentioned whether or not the translation result of the machine translation unit 103 is in accordance with the contents of the original source language character string. The source language character string is received from the voice recognition unit 102, the source language character string is machine-translated into a character string of a target language (also referred to as a second language), and a target language character string as a character string of a translation result is obtained. Since the general process may be a general process, the description is omitted here. ", And the validity of the output of the machine translation unit 103 is not guaranteed.) Even if the content that is far from the content of the character string appears as an output of the machine translation unit 103, the user has no way of knowing it. In other words, neither the disclosure nor the solution to the above-mentioned problem of “a user without knowledge of the target language presents a translation without confirming the intention transmitted to the other party” has not been made.

また、（要件２）・（要件３）について、開示も示唆もされていない。特に要件３については、システム全体として軽量化するどころか、機械翻訳部１０３に加え、新たにＲＢＭＴに相当する用例格納部１０５が必要となっているため、計算機資源的にも開発工数的にも増大している。 Neither (Requirement 2) nor (Requirement 3) is disclosed or suggested. Especially for requirement 3, in addition to reducing the weight of the system as a whole, in addition to the machine translation unit 103, a new example storage unit 105 corresponding to RBMT is required, which increases both computer resources and development man-hours. are doing.

ＲＢＭＴにおける「（要件２）ユーザの利用結果を用いて、新たにユーザに負担をかけること無く、自動的に学習できること」への対応については、特許文献３のような、入力文と翻訳文をそれぞれ形態素解析し、対訳語句を対訳辞書に登録する際に、対象とする翻訳装置に有効であるか否かを判別した結果を機械学習するアプローチが開示されている。 Regarding RBMT's response to "(Requirement 2) that learning can be performed automatically without imposing a new burden on the user by using the usage result of the user", an input sentence and a translated sentence as in Patent Document 3 are used. An approach is disclosed in which, when performing morphological analysis and registering a bilingual phrase in a bilingual dictionary, the result of determining whether or not the translation is effective for a target translation device is machine-learned.

特許文献３の段落番号［００２０］では、「５は機械翻訳エンジンであって、例えば規則主導型機械翻訳では、」として、規則主導型機械翻訳という名称でＲＢＭＴを翻訳システムとして例示し、対訳語句を対訳辞書に登録する際の有効性の判別方法を開示している。ここでは、ルールベースの対訳辞書に新たな語句を登録する場合に、その語句を登録することがシステム上有効かどうかを、それまでに登録された対訳対を形態素解析した上で、サポートベクトルマシンを用いて識別した判定空間に照らし合わせることで判断を行っている。すなわち、「ルールベースへの新たな語句の登録」は、ユーザの翻訳利用とは関係なく行われており、ユーザの利用結果を用いた自動学習については課題の開示も解決もなされていない。 In paragraph number [0020] of Patent Document 3, "5 is a machine translation engine, for example, in rule-driven machine translation," and RBMT is exemplified as a translation system with the name of rule-driven machine translation. Discloses a method of determining the validity when registering a. In a bilingual dictionary. Here, when registering a new phrase in the rule-based bilingual dictionary, whether the registration of the phrase is valid on the system is determined by performing a morphological analysis on the previously registered bilingual pair, and then using the support vector machine. The judgment is made by referring to the judgment space identified using. That is, “registration of a new phrase in the rule base” is performed irrespective of the use of translation by the user, and no problem is disclosed or solved for automatic learning using the use result of the user.

一般にＲＢＭＴは、「規則が存在しない場合には翻訳精度が非常に低くなる、もしくは全く翻訳できず、想定以外の業界・分野への適応性も極めて低い」という短所を有しており、この短所は本質的に解決されていない（例えば特許文献１）。 In general, RBMT has the disadvantage that, if there are no rules, the translation accuracy will be very low or cannot be translated at all, and the adaptability to industries and fields other than those expected is extremely low. Has not been essentially solved (for example, Patent Document 1).

また翻訳性能を向上させるためには規則を追加していく必要があるが、これには開発コストが必要となる。更にユーザ側で規則を構築して翻訳システムをカスタマイズするには、規則の設計ルールについて相当の知識が要求されるため、一般ユーザが気軽に使えるものとはなっていない。すなわち、「ＲＢＭＴの持つルールが増加するに従い、ルール同士の干渉・副作用が発生する可能性が飛躍的に増大し、ある文例で有効なルールを追記することで、他の文例で不都合が生じるといった現象が頻発するようになる」が、これを回避するためには、「システムが有している全てのルールを把握した上で、新たなルール追記が必要となる」ため、「ユーザの利用結果を用いた自動学習」はおろか、簡便なルール追加すら困難である。例えば、先に挙げた特許文献３では、ＲＢＭＴが持つ翻訳ルールのうち、単語訳対のみについて、その有効性を自動で判断しようとしているに過ぎず、構文的な翻訳ルールについての干渉回避には至っていない。 In order to improve translation performance, it is necessary to add rules, but this requires development costs. Further, in order to customize the translation system by constructing rules on the user side, considerable knowledge of rule design rules is required, so that general users cannot easily use them. In other words, as the number of rules in RBMT increases, the possibility of interference and side effects between rules increases dramatically, and adding a valid rule in one sentence example causes inconvenience in another sentence example. The phenomenon will occur frequently. "To avoid this, it is necessary to add new rules after grasping all the rules that the system has." It is difficult to add even simple rules, let alone automatic learning using. For example, in Patent Literature 3 mentioned above, among the translation rules of the RBMT, only the validity of the word translation pair is determined automatically. Not reached.

ＲＢＭＴはこのような原理的な短所を有しているため、「ユーザによるＲＢＭＴのデータベースへの単語・用例登録」の効率化・簡便化がＲＢＭＴの課題として開示されている。これらは、ユーザに対して明示的に学習用データの入力や判断を求めるものであり、このような要求をユーザに対して行わないこと、すなわち、先の課題である「ユーザの利用結果を用いた自動学習」については、課題の開示も解決もなされていない状況である。 Since the RBMT has such a fundamental disadvantage, the efficiency and simplification of “user's registration of words and examples in the RBMT database” have been disclosed as issues of the RBMT. These are for explicitly requesting the user to input and judge learning data, and that such a request should not be made to the user. With regard to “automatic learning,” no issues have been disclosed nor solved.

２）統計的機械翻訳（ＳＭＴ）
ＳＭＴでは、入力文と逆翻訳（入力文を目的言語に一旦翻訳し、更に目的言語から原言語に再度翻訳したもの）の結果の距離を翻訳信頼度として算出し、必要に応じて信頼度と逆翻訳結果をあわせて提示し、ユーザに言い直しや言い換えを行わせるもの（例えば、特許文献４参照。ただし特許文献４ではＳＭＴを前提としない場合についても言及されており、ＲＢＭＴも機械翻訳エンジンとして想定している。ＲＢＭＴについて、特許文献４では段落番号［０００９］にて、「文法規則型翻訳」としている。）、ＳＭＴの翻訳精度を評価する手法として、単語ｎ−ｇｒａｍ単位での類似度（ＢＬＥＵ値）を用いて自動学習をおこなうもの（非特許文献１参照）、入力文に対して得られた翻訳文に対する逆翻訳文をＮ個生成し、入力文と逆翻訳文の類似度から翻訳文の良否を比較評価するもの（例えば、特許文献２）、などがある。 2) Statistical machine translation (SMT)
In SMT, the distance between the input sentence and the result of reverse translation (the input sentence is translated once into the target language and then translated again from the target language to the source language) is calculated as the translation reliability. One that presents the reverse translation result together and causes the user to rephrase or paraphrase it (for example, refer to Patent Literature 4. However, Patent Literature 4 also mentions a case where SMT is not premised. The RBMT is referred to as “grammatical rule-type translation” in paragraph number [0009] in Patent Literature 4.) As a method of evaluating the translation accuracy of the SMT, a similarity in word n-gram units is used. Automatic learning using the degree (BLEU value) (see Non-Patent Document 1), N reverse translations for a translation obtained from an input sentence are generated, and the input sentence and the reverse translation are generated. Those comparative evaluation the quality of the translation from the similarity (e.g., Patent Document 2), and the like.

ここで、「（要件１）ユーザに負担を掛けず、ユーザが相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用できること」について、これらの例では、例えば、ユーザに逆翻訳結果を提示すること（特許文献４）で、「相手に伝わる意図」をユーザに確認する手法を開示している。しかしながら、「簡便な形で翻訳文を生成・利用できること」については、逆翻訳結果に応じてユーザに再入力や言い換えを要求しており、課題解決には至っていない。 In these examples, “(Requirement 1) that a user can generate and use a translated sentence while confirming the intention to be transmitted to the user in a simple manner without putting a burden on the user” is described in these examples. A technique of presenting a translation result (Patent Literature 4) to confirm "intention transmitted to the other party" to the user is disclosed. However, regarding "the ability to generate and use a translated sentence in a simple form", the user is required to re-input or paraphrase according to the result of the reverse translation, and the problem has not been solved.

具体的には、特許文献４の段落番号［００１３］では、「適切に翻訳結果に対する信頼度を得ることができ、かつ低信頼度であった場合に入力側ユーザに適切に再入力を促すことができる信頼度算出装置、翻訳信頼度算出利用方法および翻訳エンジン用プログラムを提供する」と開示されている。すなわち、翻訳結果に対する信頼度が低いとシステムが判断した場合、ユーザは原文の再入力を強いられるのみならず、充分に高い信頼度を持つ翻訳文が出力されるまで、原文の表現を変えるなどの試行錯誤を行いながら翻訳システムへの入力と訳文出力の確認作業を実行し続ける必要がある。更にこの時に留意しなければならないことは、使用するユーザにとって、当該翻訳システムが「どういった内部動作で翻訳文を生成」し、「どういった基準で信頼度を算出」し、「どうすれば信頼度が高い翻訳文を得ることができるのか」という点について、何の知見も有していないということである。ユーザは指針も無く、ただ信頼度の高い翻訳結果を得るために、文章を様々に言い換えながらの入力を強いられることとなり、これは実用上の観点から見ると、極めて使いにくいシステムとなってしまう。特許文献４では、これらについての課題提起も解決法も提示されておらず、「簡便な形で翻訳文を生成・利用できること」という点において不十分であると言える。 Specifically, in the paragraph number [0013] of Patent Literature 4, "It is possible to appropriately obtain the reliability of the translation result, and to prompt the input user to re-input properly when the reliability is low. And a method for calculating and using translation reliability and a program for a translation engine. " That is, if the system determines that the reliability of the translation result is low, the user is forced not only to re-input the original text but also to change the expression of the original text until a translated text with sufficiently high reliability is output. It is necessary to keep checking the input to the translation system and the output of the translation while performing trial and error. It is also important to keep in mind that, at this time, the translation system "generates a translated sentence by what kind of internal operation", "calculates the reliability based on what criteria", and " Do you have any knowledge about "Can you get a high degree of translation?" There is no guideline, and the user is forced to input while rephrasing sentences in order to obtain a highly reliable translation result, which is a very difficult system to use from a practical point of view. . Patent Literature 4 does not propose any problem nor solve the problem, and it can be said that it is insufficient in that “a translation can be generated and used in a simple form”.

また、「（要件２）ユーザの利用結果を用いて（新たにユーザに負担をかけること無く）、自動的に学習できること」については、これまで「（要件１）ユーザに負担を掛けず、ユーザが相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用できること」に関して述べたように、ＲＢＭＴ・ＳＭＴ双方において、ユーザが負担を受けずに「相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用できること」が実現されておらず、そのため、（要件１）を満たすような「ユーザの利用結果」を得る手法が開示されていないため、（要件１）を満たす「ユーザの利用結果」を用いた自動的な学習についても、課題の開示や解決法の提示は為されていない。 Regarding “(Requirement 2) that learning can be performed automatically using the usage result of the user (without imposing a new burden on the user)”, “(Requirement 1) the user is not burdened, That a user can generate and use a translated sentence while easily confirming the intention to be transmitted to the other party "in both RBMT and SMT. That a translation can be generated and used while confirming the above ”has not been realized, and a method of obtaining a“ user usage result ”that satisfies (Requirement 1) has not been disclosed. Regarding the automatic learning using the “user usage result” that satisfies the condition, no problem is disclosed or no solution is presented.

この「（要件１）を満たすようなユーザの利用結果を得る手法が無い」ことを前提とした上で、何らかの別の学習用データが与えられた場合の翻訳システムの自動評価・学習法という観点で、先に述べた、「統計的機械翻訳システムの性能を評価する手法であるＢＬＥＵ値による自動学習をおこなうもの（非特許文献１参照）」、「入力文に対し、逆翻訳文をＮ個取得し、入力文と逆翻訳文の類似度から比較評価を行うもの（例えば、特許文献２）、「対訳語句を対訳辞書に登録する際、対象とする翻訳装置に有効であるか否かを判別した結果を機械学習するもの（例えば、特許文献３参照）」等が存在する。 Based on the premise that "there is no method for obtaining a user's use result that satisfies (Requirement 1)", a viewpoint of an automatic evaluation and learning method of a translation system when some other learning data is given As described above, "the method of performing automatic learning based on the BLEU value, which is a method for evaluating the performance of a statistical machine translation system (see Non-Patent Document 1)", and "N reverse translation sentences for an input sentence" One that acquires and compares and evaluates the similarity between an input sentence and a back-translated sentence (for example, Patent Document 2). “When registering a bilingual phrase in a bilingual dictionary, it is necessary to determine whether the translation is valid for the target translation device. A machine learning the result of the discrimination (for example, see Patent Document 3) "and the like.

非特許文献１は、事前に入力文に対する参照訳（正解データ）を準備し、翻訳エンジンの翻訳出力結果と正解データとを単言語でのｎ−ｇｒａｍをベースに比較することで、その比較結果を値（ＢＬＥＵスコア）として機械的に算出し、その値が高くなるようにシステムをチューニングする、というものである。事前に入力文と、正解データとなる参照訳（翻訳文）を準備する必要があり、本質的に翻訳エンジンの内部モデルはこのチューニングによって何ら変化せず単に重みが変わるだけのため、モデルそのものを学習したい場合や、正解データが与えられない・正解データが一意に決定されない場合には、本手法は適用できない。 Non-Patent Document 1 prepares a reference translation (correct data) for an input sentence in advance and compares the translation output result of the translation engine with the correct data based on a monolingual n-gram, and the comparison result is obtained. Is calculated mechanically as a value (BLEU score), and the system is tuned to increase the value. It is necessary to prepare an input sentence and a reference translation (translation sentence) as the correct answer in advance. Since the internal model of the translation engine does not change at all by this tuning but merely changes the weight, the model itself must be prepared. This method cannot be applied when learning is desired, or when correct data is not given / correct data is not uniquely determined.

仮に「要件１を満たすユーザの利用結果」が得られたとしても、その結果は「ユーザが、相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用した」結果である。この「結果」のうち、「どれが正解データであるかどうか」、「どう一意に正解データとするか」、「その後どうモデルの学習を行うか」について、何ら課題の開示も解決も示されていない。 Even if the "use result of the user who satisfies the requirement 1" is obtained, the result is a result that "the user has generated and used the translated sentence while confirming the intention to be transmitted to the other party in a simple manner". Of these "results", there are no issues to be disclosed or solved regarding "whether or not the correct data", "how to uniquely make the correct data", and "how to learn the model afterwards". Not.

特許文献２には、翻訳結果を複数の逆方向の翻訳機で原言語に戻し、入力文と複数の逆翻訳文との間で文の類似度を機械的に算出することで、元の翻訳結果の良否を評価する内容が開示されている。特許文献２では、複数の逆翻訳文を生成することで、元の翻訳結果の評価を行うことが開示されているが、特許文献２において解決を図っている課題は、翻訳結果についての何らかの自動的な評価を行うことであり、この点において本質的に先の非特許文献１との差は存在しない。すなわち、特許文献２には、何らかの正解文（非特許文献１では参照訳、特許文献２では入力文に相当）と翻訳結果文（非特許文献１では翻訳文、特許文献２では翻訳結果を逆翻訳した逆翻訳文に相当）との間で、一致もしくは類似度を表すスコア（非特許文献１のＢＬＥＵ値に代表されるスコア）を算出し、その値で翻訳結果の良否を評価することのみが開示されている。 Patent Document 2 discloses that the translation result is returned to the source language by a plurality of backward translators, and the similarity of the sentence between the input sentence and the plurality of backward-translated sentences is mechanically calculated. The contents for evaluating the quality of the result are disclosed. Patent Literature 2 discloses that the original translation result is evaluated by generating a plurality of reverse-translated sentences. However, the problem addressed in Patent Literature 2 is that some automatic In this respect, there is essentially no difference from the above-mentioned non-patent document 1. That is, in Patent Literature 2, some correct sentence (corresponding to a reference translation in Non-Patent Literature, equivalent to an input sentence in Patent Literature 2) and a translation result sentence (Non-Patent Literature 1 translates, and Patent Literature 2 reverses the translation A score representing a match or similarity (e.g., a score represented by the BLEU value in Non-Patent Document 1) is calculated with respect to the translated back-translation sentence), and the quality of the translation result is evaluated using the calculated value. Is disclosed.

なお、特許文献２では翻訳システム自体の学習には言及されていないが、特許文献２の段落番号［００４８］にて、「・・・３つの逆方向翻訳文と原文とのＤＰマッチングを行ない、最大スコアを順方向翻訳文の自動評価の結果とすることができるので、例文の翻訳文だけでなくすべての翻訳文に対する評価を可能とし、評価の労力を少なくし、かつ評価の信頼性を高くすることできる」と記載されており、その目的を、段落番号［０００９］にて、「例文の翻訳文だけでなくすべての翻訳文に対する評価が可能で、評価の信頼性が高く、かつ労力が少ない機械翻訳文の評価方法、および機械翻訳文の評価装置」の構築としている。非特許文献１では事前に必要とされていた正解文（参照訳）が無い場合においても、特許文献２では、逆翻訳文と入力原文との機械的なマッチングスコアによる評価ができることを開示しており、特許文献２では学習について言及されていないが、このスコアを学習に用いることは、非特許文献１との組み合わせにおいて示唆される。 Note that although Patent Document 2 does not mention learning of the translation system itself, paragraph number [0048] of Patent Document 2 states that "... DP matching between three backward-translated sentences and the original sentence is performed. Since the maximum score can be the result of automatic evaluation of forward translations, it is possible to evaluate not only translations of example sentences but also all translations, reducing evaluation effort and improving reliability of evaluation. The purpose is described in paragraph [0009], "Evaluation is possible not only for translations of example sentences, but also for all translations, and the reliability of evaluation is high and labor is high. A method for evaluating a small number of machine translated sentences and a machine translation sentence evaluation device are to be constructed. Non-Patent Document 1 discloses that even when there is no correct sentence (reference translation) required in advance, Patent Document 2 discloses that evaluation can be performed by using a mechanical matching score between a back-translated sentence and an input original sentence. Thus, Patent Document 2 does not mention learning, but using this score for learning is suggested in combination with Non-Patent Document 1.

しかしながら、特許文献２と非特許文献１を組み合わせた場合においても、モデルそのものを学習したい場合や、マッチングスコアが不正な場合や一意に決定されない場合には、依然としてチューニングすることができない。 However, even in the case where Patent Literature 2 and Non-Patent Literature 1 are combined, tuning cannot be performed if the user wants to learn the model itself, or if the matching score is incorrect or cannot be uniquely determined.

また、非特許文献１と同様に、仮に「要件１を満たすユーザの利用結果」が得られたとしても、その結果は「ユーザが、相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用した」結果である。このような結果をどう評価値として判断するか、さらにどのようにその評価値でモデルの学習を行うかについては、何ら課題の開示も解決も示されていない。 Also, as in Non-Patent Document 1, even if “the use result of the user who satisfies the requirement 1” is obtained, the result is “the user confirms the intention to be transmitted to the other party in a simple form while confirming the translated sentence. Is generated and used. " Regarding how to judge such a result as an evaluation value, and how to perform model learning using the evaluation value, no problem is disclosed or solved.

すなわち、「ユーザが、相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用」することについては、課題の開示も解法の開示もなされておらず、それを用いた学習についても、先の非特許文献１と同様に何ら課題の開示も解決も示されていない。 In other words, as for "generating and using a translated sentence while the user confirms the intention to be transmitted to the other party in a simple manner", neither the problem nor the solution is disclosed. However, as in Non-Patent Document 1, neither disclosure nor solution of the problem is disclosed.

また、「（要件３）要件１・要件２の実現と同時に、計算機資源や開発コストを下げること」については、非特許文献１ではＢＬＥＵ値の算出、特許文献２ではマッチングスコアの算出が新たに必要となり、計算機資源・開発コストは増大している。 Regarding “(Requirement 3) Reducing computer resources and development costs while realizing requirements 1 and 2”, Non-Patent Document 1 newly calculates a BLEU value, and Patent Document 2 newly calculates a matching score. It becomes necessary, and computer resources and development costs are increasing.

３）ディープニューラルネットによるモデル獲得型の機械翻訳（ＤＮＮＭＴ）
非特許文献３及び非特許文献４は、ニューラルネットによる機械翻訳として、ディープニューラルネット（ＤＮＮ：ＤｅｅｐＮｕｅｒａｌＮｅｔ）のうちＲＮＮ（ＲｅｃｕｒｒｅｎｔＮｅｕｒａｌＮｅｔｗｏｒｋ）およびＲＮＮの一種とされるＬＳＴＭ（ＬｏｎｇＳｈｏｒｔＴｅｒｍＭｅｍｏｒｙ）を用いたＤＮＮＭＴを例示している。いずれの手法も、対訳コーパスをニューラルネットの入力層および出力層に対する正解データ（正例や負例）として用い、ニューラルネットの中間層を直接学習させることにより、ニューラルネットの内部に翻訳モデルを直接構築するものである。ＤＮＮが内部にどのような形で翻訳モデルを持つかは、学習データの種類や与え方、学習回数、ＤＮＮ自身のネットワーク構成などに依存する。いずれのケースにおいても、どのように内部状態を変更すると、どのように翻訳性能が変化するのかについては開示されていない（学術的にも解明されていない）。本来、ニューラルネットの特性として、非線形な出力を学習できるという点があるが、ＤＮＮとなることで非線形性は飛躍的に増しており、何らかの内部パラメータと出力性能との線形的な因果関係は見いだせていないというのが現状である。 3) Model-based machine translation using deep neural network (DNNMT)
Non-Patent Literature 3 and Non-Patent Literature 4 describe LSTM (Long Short Term Memory), which is a kind of deep neural net (DNN: Recursive Neural Network) and RNN, as machine translation by a neural network. Is illustrated using DNNMT. Both methods use the bilingual corpus as the correct answer data (positive and negative examples) for the input and output layers of the neural network, and directly train the intermediate layer of the neural network, so that the translation model is directly stored inside the neural network. To build. The form in which the DNN has the translation model inside depends on the type and manner of providing the learning data, the number of times of learning, the network configuration of the DNN itself, and the like. In any case, how the internal state is changed and how the translation performance changes is not disclosed (it is not clarified academically). Originally, the characteristic of a neural network is that it can learn nonlinear output. However, by using DNN, nonlinearity has dramatically increased, and a linear causal relationship between some internal parameters and output performance can be found. It is not at present.

言い換えると、ＤＮＮＭＴは「入力文に対して何らかの翻訳結果を返す」という意味では、先のＲＢＭＴやＳＭＴと同じである。しかし、「なぜその翻訳結果が得られたのか」に点においては、ＲＢＭＴはそのルールを記述したデータベースを参照すれば翻訳結果が得られた理由（元となったルール）が分かり、ＳＭＴでは翻訳モデル（各単語・句の発生確率、アライメント確率等）と言語モデル（ｎ−ｇｒａｍ確率）から最大確率だったものが選ばれた翻訳結果であることが分かるのに対し、ＤＮＮＭＴでは、ルールやモデルに相当するものをニューラルネットが自ら構築するため、ニューラルネットの出力層が出してきた結果が翻訳結果の文であった、ということ以上に内部モデル・動作についての知見は得られない。 In other words, DNNMT is the same as RBMT or SMT in the sense that “returns some translation result for the input sentence”. However, in terms of "why the translation result was obtained", the RBMT can refer to the database describing the rules to find out why the translation result was obtained (the original rule). It can be seen that the translation result of which the maximum probability was selected from the model (the occurrence probability of each word / phrase, the alignment probability, etc.) and the language model (n-gram probability) is the translation result, whereas the DNNMT uses the rules and models. Since the neural network constructs itself equivalent to the above, there is no knowledge about the internal model or operation beyond the fact that the result output from the output layer of the neural network was a translated result sentence.

このため、ＤＮＮＭＴは学術研究が中心となっており、実用フェーズには至っておらず「（要件１）ユーザに負担を掛けず、ユーザが相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用できること」のような、実利用面からの課題にまでは至っていない。 For this reason, DNNMT is mainly focused on academic research, and has not reached the practical phase. ((Requirement 1) The translation sentence is not burdened on the user and the user confirms the intention to be transmitted to the other party in a simple manner. To be able to generate and use the same. "

また、「（要件２）ユーザの利用結果を用いて（新たにユーザに負担をかけること無く）、自動的に学習できること」についても、仮に「（要件１）を満たすユーザの利用結果」が得られたとしても、その結果を用いて「その後どうモデルの学習を行うか」については、内部動作の解明が必要であり、当然ながら何ら課題の開示も解決も示されていない。 As for “(requirement 2) that learning can be automatically performed using the user's usage result (without imposing a new burden on the user)”, a provisional “user usage result that satisfies (requirement 1)” is obtained. Even if it is done, it is necessary to clarify the internal operation of "how to learn the model after that" using the result, and no problem is disclosed or solved.

以上をまとめると、従来技術では以下の点で課題を有していた。 To summarize the above, the prior art has problems in the following points.

・対訳データベースに類似例や訳文が記述されていない場合、及び、分野が異なる場合に、翻訳精度が極めて低い翻訳文を提示する可能性、もしくは全く翻訳できない可能性が有る。 -When similar examples or translations are not described in the bilingual database, or when the field is different, there is a possibility that a translation with extremely low translation accuracy may be presented, or translation may not be performed at all.

・翻訳文の内容をユーザが簡便に確認し選択する方法がないため、入力文に対する逆翻訳文の提示や翻訳品質（信頼度）の提示と言った方法では、提示内容の品質・信頼度が低かった場合に、ユーザは再入力を強いられるが、再入力によって品質が向上する保証はなく、ユーザは入力を試行錯誤する他ない。 -Since there is no way for the user to easily check and select the content of the translated sentence, the method of presenting a reverse-translated sentence to the input sentence or presenting the translation quality (reliability) reduces the quality and reliability of the presented content. If it is low, the user is forced to re-input, but there is no guarantee that the quality will be improved by the re-input, and the user will have to try and input the input.

・ＢＬＵＥ値など、何らかの方法で機械的に算出したスコアに基づいた翻訳システムの自動チューニングは先行例において開示されているが、「ユーザが、相手に伝わる意図を簡便な形で確認をしながら翻訳文を生成・利用した」場合、その結果に基づく評価、及び、学習をどのように行なうかについては、いずれも課題が開示されておらず、解決法も未開示である。 -Automatic tuning of a translation system based on a score calculated mechanically by a method such as a BLUE value has been disclosed in a prior example, but "translation is performed while the user confirms the intention transmitted to the other party in a simple manner. In the case of "generating and using a sentence", no problem is disclosed in any of evaluations based on the results and how to perform learning, and no solution is disclosed.

・類似用例やチューニング用のデータ（評価スコアを算出等）を生成する計算機資源が必要である。また類似用例を作成する開発・人的コストも必要となる。 Computer resources for generating similar examples and data for tuning (calculation of evaluation score, etc.) are required. Also, development and human costs for creating similar examples are required.

そこで、機械翻訳システムの機能向上のため、以下の改善策を検討した。 Therefore, to improve the function of the machine translation system, the following improvement measures were examined.

機械翻訳システムの機械翻訳方法の一態様は、言語情報を出力する情報出力装置へ接続し、第１言語と第２言語との間の翻訳処理を行なう機械翻訳システムにおける機械翻訳方法であって、前記第１言語の翻訳対象文を受信し、受信した前記翻訳対象文を前記第２言語へ翻訳した複数の異なる順翻訳文を生成し、前記複数の異なる前記順翻訳文の各々について前記第１言語へ逆翻訳した複数の逆翻訳文を生成し、前記情報出力装置において前記複数の逆翻訳文を出力しているときに、前記複数の逆翻訳文から一の逆翻訳文を選択する操作を受け付けた場合、前記一の逆翻訳文に対応する前記順翻訳文を出力する。 One aspect of the machine translation method of the machine translation system is a machine translation method in a machine translation system that connects to an information output device that outputs language information and performs a translation process between a first language and a second language. Receiving the translation target sentence of the first language, generating a plurality of different forward translation sentences by translating the received translation target sentence into the second language, and generating the first translation target sentence for each of the plurality of different forward translation sentences; Generating a plurality of back-translated sentences back-translated into a language, and selecting one back-translated sentence from the plurality of back-translated sentences when the plurality of back-translated sentences are being output in the information output device. If received, the forward-translated sentence corresponding to the one backward-translated sentence is output.

上記態様によると、第１言語の翻訳対象文を第２言語へ翻訳した複数の異なる順翻訳文を生成し、複数の順翻訳文の各々について第１言語へ逆翻訳した複数の逆翻訳文を生成し、情報出力装置において複数の逆翻訳文を出力しているときに、複数の逆翻訳文から一の逆翻訳文を選択する操作を受け付けた場合、前記一の逆翻訳文に対応する順翻訳文を出力する。 According to the above aspect, a plurality of different forward-translated sentences generated by translating the translation target sentence of the first language into the second language are generated, and a plurality of reverse-translated sentences which are back-translated to the first language for each of the plurality of forward-translated sentences are generated. When generating and outputting a plurality of back-translated sentences on the information output device, when an operation of selecting one back-translated sentence from the plurality of back-translated sentences is received, the order corresponding to the one back-translated sentence is received. Output the translation.

例えば、受信した翻訳対象文に対応する複数の逆翻訳文をユーザへ提示し、その中からユーザに選択された逆翻訳文に対応する順翻訳文を提示する。従って、ユーザは、複数の逆翻訳文の中から、自身が入力した翻訳対象文の意図に一番近い逆翻訳文を選択することとなるため、例えば、第１言語で入力された翻訳対象文を第２言語へ翻訳した一の翻訳文、および当該翻訳文に対応する一の逆翻訳文のみが提示されるシステムと比較すると、逆翻訳文が翻訳対象文の意図する内容と異なることで、翻訳対象文の修正または入力のやり直しなどが求められる場面が少なくなる。 For example, a plurality of backward-translated sentences corresponding to the received translation target sentence are presented to the user, and the user is presented with a forward-translated sentence corresponding to the backward-translated sentence selected from the plurality of backward-translated sentences. Therefore, the user selects the reverse translation sentence closest to the intention of the translation target sentence inputted by himself / herself from the plurality of reverse translation sentences. When compared with a system in which only one translation sentence translated into a second language and one reverse translation sentence corresponding to the translation are presented, the reverse translation differs from the intended content of the translation target sentence. The number of situations where correction of the translation target sentence or re-input is required is reduced.

また、例えば、ユーザによって複数の逆翻訳文の中から一の逆翻訳文が選択されるため、機械翻訳システムは、入力された翻訳対象文の意図する内容として、提示した複数の逆翻訳文の中ではどの逆翻訳文が一番妥当であるか、もしくはユーザの好みの表現に合うか、などのフィードバックを得ることが可能である。そのため、例えば、上記態様における機械翻訳システムに機械学習を適用する場合には、入力された翻訳対象文に対して提示した逆翻訳文が妥当であるか否かという評価に加えて、提示した複数の逆翻訳文の中ではいずれの逆翻訳文が妥当であるかという評価も得られる。このとき、機械翻訳システムにおける一度の翻訳動作によって、複数の逆翻訳文に対するフィードバックが得られるので、機械翻訳システムの高い学習効率を実現できる。 In addition, for example, since one backward-translated sentence is selected from a plurality of backward-translated sentences by the user, the machine translation system determines the intended backward-translated sentence Among them, it is possible to obtain feedback such as which reverse-translated sentence is the most appropriate or whether it matches the user's favorite expression. Therefore, for example, when machine learning is applied to the machine translation system in the above aspect, in addition to evaluating whether or not the reverse-translated sentence presented for the input translation target sentence is appropriate, It is also possible to obtain an evaluation as to which of the back-translated sentences is appropriate among the back-translated sentences. At this time, a single translation operation in the machine translation system can provide feedback on a plurality of backward-translated sentences, so that high learning efficiency of the machine translation system can be realized.

さらに、上記態様によると、機械翻訳システムにおける機械学習について、ＢＬＥＵ値の算出による翻訳精度の評価、または入力された翻訳対象文と逆翻訳文との機械的なマッチングスコアによる翻訳精度の評価などが必要なく、ユーザによる逆翻訳文の選択によって学習用データが生成される。そのため、学習用データの生成に新たな計算機資源を必要とせず、開発コストを抑えることもできる。 Further, according to the above aspect, for the machine learning in the machine translation system, the evaluation of the translation accuracy by calculating the BLEU value or the evaluation of the translation accuracy by the mechanical matching score between the input translation target sentence and the reverse translation sentence are performed. There is no need to generate learning data by selecting a backward-translated sentence by the user. Therefore, no new computer resources are required for generating the learning data, and the development cost can be reduced.

上記態様において、例えば、前記機械翻訳システムは、さらに、ユーザによる音声入力を受け付ける音声入力装置、およびユーザによるテキスト入力を受け付けるテキスト入力装置と接続し、前記翻訳対象文は、当該翻訳文を表す音声情報、またはテキスト情報の形態で受信し、前記音声情報、または前記テキスト情報のいずれの形態で前記翻訳対象文を受信したかに応じて、前記一の逆翻訳文に対応する前記順翻訳文の出力の形態を変更するとしてもよい。 In the above aspect, for example, the machine translation system is further connected to a voice input device that receives a voice input by a user and a text input device that receives a text input by a user, and the translation target sentence is a voice representing the translated sentence. Information, or received in the form of text information, and depending on which of the audio information or the text information received the translation target sentence, the forward-translated sentence corresponding to the one reverse-translated sentence The output form may be changed.

上記態様によると、翻訳対象文を、音声情報、またはテキスト情報のいずれの形態で受信したかに応じて、順翻訳文の出力の形態を変更する。これによって、例えば、入力モーダルに応じて出力のモーダルが決定されるため、ユーザは入力の形態を変えることで出力の形態を自由に決定できる。 According to the above aspect, the output form of the forward-translated sentence is changed depending on whether the sentence to be translated is received in the form of voice information or text information. Thereby, for example, since the output modal is determined according to the input modal, the user can freely determine the output form by changing the input form.

上記態様において、例えば、前記情報出力装置は、音声出力装置およびディスプレイを有し、前記翻訳対象文を音声情報の形態で受信した場合は、前記一の逆翻訳文に対応する前記順翻訳文を、前記音声出力装置を介して出力し、前記翻訳対象文をテキスト情報の形態で受信した場合は、前記一の逆翻訳文に対応する前記順翻訳文を、前記ディスプレイを介して出力するとしてもよい。 In the above aspect, for example, the information output device has an audio output device and a display, and when the translation target sentence is received in the form of audio information, the information output device outputs the forward translation sentence corresponding to the one reverse translation sentence. If the sentence to be translated is output via the audio output device and the sentence to be translated is received in the form of text information, the forward translated sentence corresponding to the one reverse translated sentence may be output via the display. Good.

これによると、入力の形式と出力の形式がそれぞれ同じモーダルで対応しているため、ユーザは、自身の希望する出力形態で翻訳対象文を入力すれば良く、いずれの入力形態で入力すれば希望する出力形態で翻訳文が出力されるかの混乱が生じない。 According to this, since the input format and the output format correspond to each other in the same modal, the user only needs to input the translation target sentence in his / her desired output format. There is no confusion as to whether a translated sentence is output in an output form that does.

上記態様において、例えば、前記翻訳対象文は、当該翻訳対象文を示すテキスト情報で受信し、前記テキスト情報に基づいて、前記翻訳対象文を前記第２言語へ翻訳した複数の異なる順翻訳文を生成するとしてもよい。 In the above aspect, for example, the translation target sentence is received as text information indicating the translation target sentence, and a plurality of different forward-translated sentences obtained by translating the translation target sentence into the second language based on the text information are received. It may be generated.

上記態様において、例えば、前記機械翻訳システムは、さらに、ユーザによるテキスト入力を受け付けるテキスト入力装置と接続し、前記翻訳対象文は、前記テキスト入力装置から、当該翻訳対象文を示すテキスト情報で受信するとしてもよい。 In the above aspect, for example, the machine translation system is further connected to a text input device that receives text input by a user, and the translation target sentence is received from the text input device as text information indicating the translation target sentence. It may be.

上記態様において、例えば、前記翻訳対象文は、当該翻訳対象文を表す音声情報で受信し、受信した前記音声情報に対して音声認識処理を行って前記翻訳対象文を示すテキスト情報を生成し、前記テキスト情報に基づいて、前記翻訳対象文を前記第２言語へ翻訳した複数の異なる順翻訳文を生成するとしてもよい。 In the above aspect, for example, the translation target sentence is received as voice information representing the translation target sentence, and performs voice recognition processing on the received voice information to generate text information indicating the translation target sentence, Based on the text information, a plurality of different forward-translated sentences obtained by translating the sentence to be translated into the second language may be generated.

これによって、音声を用いた翻訳対象文の入力が可能となるので、例えば、翻訳対象文をキーボード、タッチディスプレイなどを用いて入力する必要が無いので、ユーザは簡単に翻訳対象文を入力できる。 This allows the user to input the translation target sentence using voice. For example, there is no need to input the translation target sentence using a keyboard, a touch display, or the like, so that the user can easily input the translation target sentence.

上記態様において、例えば、前記機械翻訳システムは、さらに、ユーザの音声の入力を受け付ける音声入力装置と接続し、前記翻訳対象文は、前記音声入力装置から、当該翻訳対象文を表す音声情報で受信するとしてもよい。 In the above aspect, for example, the machine translation system is further connected to a voice input device that receives a user's voice input, and the translation target sentence is received from the voice input device as voice information representing the translation target sentence. You may do it.

上記態様において、例えば、前記情報出力装置はディスプレイを有し、前記複数の逆翻訳文は、前記ディスプレイの第１領域に表示され、前記ディスプレイの第１領域とは異なる第２領域に、前記翻訳対象文が表示されるとしてもよい。 In the above aspect, for example, the information output device has a display, and the plurality of reverse-translated sentences are displayed in a first area of the display, and the translation is performed in a second area different from the first area of the display. The target sentence may be displayed.

上記態様によると、逆翻訳文と翻訳対象文とで表示する領域を分ける。逆翻訳文と翻訳対象文とは同じ言語の文章であるため、ユーザは、いずれが逆翻訳文でいずれが翻訳対象文であるかを簡単に見分けることができ、混同することがない。 According to the above aspect, the area to be displayed is divided between the reverse translation sentence and the translation target sentence. Since the back-translation sentence and the translation target sentence are sentences in the same language, the user can easily distinguish which is the reverse-translation sentence and which is the translation-target sentence, and is not confused.

上記態様において、例えば、前記ディスプレイの第３領域に、前記一の逆翻訳文に対応する前記順翻訳文が表示されるとしてもよい。 In the above aspect, for example, the forward-translated sentence corresponding to the one backward-translated sentence may be displayed in a third area of the display.

これによって、逆翻訳文、翻訳対象文、順翻訳文のそれぞれが異なる領域に表示されるため、それぞれいずれの文章であるか、ユーザにとって分かりやすい。 As a result, each of the backward-translated sentence, the translation target sentence, and the forward-translated sentence is displayed in a different area, so that it is easy for the user to identify which sentence each is.

上記態様において、例えば、前記情報出力装置に対する操作に応じて、前記一の逆翻訳文に対応する前記順翻訳文の表示の向きが変更されるとしてもよい。 In the above aspect, for example, the display direction of the forward-translated sentence corresponding to the one backward-translated sentence may be changed according to an operation on the information output device.

これによって、例えば、それぞれ異なる言語を話す、ユーザＡおよびユーザＢが向かい合って会話をする場合、ユーザＡが話す言語で入力した翻訳対象文に対応してユーザＢが話す言語で出力された順翻訳文が情報出力装置に表示されているときに順翻訳文の向きを、例えば、逆向きに変更できれば、ユーザＡが順翻訳文を読み上げたり、情報出力装置自体の向きを変更したりして、ユーザＢに順翻訳文の内容を伝える必要が無く、向かい合った二人のユーザが情報出力装置を上から覗き込むようにして異なる言語間のコミュニケーションを図ることが可能である。 Thus, for example, when the user A and the user B speak face-to-face and have a face-to-face conversation, the forward translation output in the language spoken by the user B corresponding to the translation target sentence input in the language spoken by the user A When the sentence is displayed on the information output device, if the direction of the forward-translated sentence can be changed, for example, in the reverse direction, the user A reads out the forward-translated sentence or changes the direction of the information output device itself, There is no need to inform the user B of the contents of the forward-translated sentence, and communication between different languages can be achieved by allowing two opposing users to look into the information output device from above.

上記態様において、例えば、前記順翻訳文の表示の向きは、前記第１領域に表示される前記複数の逆翻訳文の表示の向きとは異なる向きへ変更されるとしてもよい。 In the above aspect, for example, the display direction of the forward-translated sentence may be changed to a direction different from the display direction of the plurality of reverse-translated sentences displayed in the first area.

上記態様において、例えば、前記順翻訳文の表示の向きは、前記第１領域に表示される前記複数の逆翻訳文の表示の向きと同じ向きへ変更されるとしてもよい。 In the above aspect, for example, the display direction of the forward-translated sentence may be changed to the same direction as the display direction of the plurality of reverse-translated sentences displayed in the first area.

上記態様において、例えば、前記一の逆翻訳文に対応する前記順翻訳文は、前記第１領域に表示される前記複数の逆翻訳文とは異なる向きで表示されるとしてもよい。 In the above aspect, for example, the forward-translated sentence corresponding to the one backward-translated sentence may be displayed in a different direction from the plurality of backward-translated sentences displayed in the first area.

上記態様において、例えば、前記機械翻訳システムは、受信した前記翻訳対象文を前記第２言語へ翻訳した前記順翻訳文の集合であって、前記複数の異なる順翻訳文を含む順翻訳文群を生成し、前記順翻訳文群に含まれる前記順翻訳文の各々について、疑問文、肯定文、否定文、命令文の中のいずれの形態に分類されるかを判断し、前記複数の異なる順翻訳文は、分類された前記形態に基づいて、前記順翻訳文群の中から選択されるとしてもよい。 In the above aspect, for example, the machine translation system is a set of the forward-translated sentences obtained by translating the received sentence to be translated into the second language, and includes a forward-translated sentence group including the plurality of different forward-translated sentences. It is determined whether each of the forward-translated sentences included in the forward-translated sentence group is classified into a question sentence, an affirmative sentence, a negative sentence, or a command sentence. The translated sentence may be selected from the forward-translated sentence group based on the classified form.

上記態様によると、順翻訳文群の中から、文章の形態に基づいて複数の異なる順翻訳文が選択されるため、例えば、翻訳対象文に基づいて機械的に生成された順翻訳文群から、翻訳対象文の形態と同じ形態である順翻訳文のみを選択することができ、最終的な翻訳精度を向上させることができる。また、例えば、翻訳対象文の形態と異なる形態である順翻訳文を含んで複数の順翻訳文を選択してもよく、これによって、順翻訳文に基づいて生成され、ユーザに提示される、複数の逆翻訳文のバリエーションを増やすことができる。そのため、例えば、機械翻訳システムに機械学習を適用させる際には、似通った内容の複数の逆翻訳文からユーザに一の逆翻訳文を選択させると、選択されなかった逆翻訳文でもユーザの入力した翻訳対象文の示す意図を表わせていた場合に、選択されなかったことで間違った逆翻訳文であると機械翻訳システムが学習してしまうことを防止できる。 According to the above aspect, since a plurality of different forward-translated sentences are selected from the forward-translated sentence group based on the form of the sentence, for example, from the forward-translated sentence group automatically generated based on the translation target sentence, In addition, it is possible to select only the forward-translated sentence having the same form as the form of the translation target sentence, and it is possible to improve the final translation accuracy. Further, for example, a plurality of forward-translated sentences including a forward-translated sentence having a form different from the form of the translation target sentence may be selected, whereby the forward-translated sentence is generated based on the forward-translated sentence and presented to the user. Variations of multiple back-translated sentences can be increased. Therefore, for example, when applying machine learning to a machine translation system, if the user selects one backward-translated sentence from a plurality of backward-translated sentences having similar contents, even if the backward-translated sentence that is not selected is input by the user. In the case where the intention indicated by the translated target sentence is expressed, it is possible to prevent the machine translation system from learning that it is an incorrect reverse-translated sentence because it was not selected.

上記態様において、例えば、前記複数の異なる順翻訳文は、各々異なる前記形態に分類された少なくとも２以上の前記順翻訳文を含むとしてもよい。 In the above aspect, for example, the plurality of different forward-translated sentences may include at least two or more forward-translated sentences classified into the different forms.

これによって、順翻訳文に基づいて生成され、ユーザに提示される、複数の逆翻訳文のバリエーションを増やすことができる。そのため、例えば、機械翻訳システムに機械学習を適用させる際には、似通った内容の複数の逆翻訳文からユーザに一の逆翻訳文を選択させると、選択されなかった逆翻訳文でもユーザの入力した翻訳対象文の示す意図を表わせていた場合に、選択されなかったことで間違った逆翻訳文であると機械翻訳システムが学習してしまうことを防止できる。 This makes it possible to increase the variations of a plurality of backward-translated sentences generated based on the forward-translated sentences and presented to the user. Therefore, for example, when applying machine learning to a machine translation system, if the user selects one backward-translated sentence from a plurality of backward-translated sentences having similar contents, even if the backward-translated sentence that is not selected is input by the user. In the case where the intention indicated by the translated target sentence is expressed, it is possible to prevent the machine translation system from learning that it is an incorrect reverse-translated sentence because it was not selected.

上記態様において、例えば、前記機械翻訳システムは、受信した前記翻訳対象文を前記第２言語へ翻訳した前記順翻訳文の集合である順翻訳文群を生成し、前記順翻訳文群は、前記複数の異なる順翻訳文を含み、前記順翻訳文群に含まれる前記順翻訳文各々の主語、または述語を判断し、前記複数の異なる順翻訳文は、判断された前記主語または前記述語に基づいて、前記順翻訳文群の中から選択されるとしてもよい。 In the above aspect, for example, the machine translation system generates a forward-translated sentence group that is a set of forward-translated sentences obtained by translating the received translation target sentence into the second language, and the forward-translated sentence group is A plurality of different forward-translated sentences are included, and the subject or predicate of each of the forward-translated sentences included in the forward-translated sentence group is determined. Based on the selection, it may be selected from the forward-translated sentence group.

上記態様によると、順翻訳文群の中から、主語または述語に基づいて複数の異なる順翻訳文が選択されるため、例えば、翻訳対象文に基づいて機械的に生成された順翻訳文群から、翻訳対象文と同じ主語または述語を有する順翻訳文のみを選択することができ、最終的な翻訳精度を向上させることができる。また、例えば、翻訳対象文と異なる主語または述語を有する順翻訳文を含んで複数の順翻訳文を選択してもよく、これによって、順翻訳文に基づいて生成され、ユーザに提示される、複数の逆翻訳文のバリエーションを増やすことができる。そのため、例えば、機械翻訳システムに機械学習を適用させる際には、似通った内容の複数の逆翻訳文からユーザに一の逆翻訳文を選択させると、選択されなかった逆翻訳文でもユーザの入力した翻訳対象文の示す意図を表わせていた場合に、選択されなかったことで間違った逆翻訳文であると機械翻訳システムが学習してしまうことを防止できる。 According to the above aspect, since a plurality of different forward-translated sentences are selected from the forward-translated sentences based on the subject or the predicate, for example, from the forward-translated sentences that are automatically generated based on the translation target sentence, Only the forward translation having the same subject or predicate as the translation target sentence can be selected, and the final translation accuracy can be improved. Further, for example, a plurality of forward-translated sentences including a forward-translated sentence having a subject or a predicate different from the translation target sentence may be selected, whereby the forward-translated sentences are generated based on the forward-translated sentences and presented to the user. Variations of multiple back-translated sentences can be increased. Therefore, for example, when applying machine learning to a machine translation system, if the user selects one backward-translated sentence from a plurality of backward-translated sentences having similar contents, even if the backward-translated sentence that is not selected is input by the user. In the case where the intention indicated by the translated target sentence is expressed, it is possible to prevent the machine translation system from learning that it is an incorrect reverse-translated sentence because it was not selected.

上記態様において、例えば、前記複数の異なる順翻訳文は、各々異なる主語、または述語を含むと判断された少なくとも２以上の順翻訳文を含むとしてもよい。 In the above aspect, for example, the plurality of different forward-translated sentences may include at least two or more forward-translated sentences each determined to include a different subject or a predicate.

上記態様において、例えば、前記複数の異なる順翻訳文は、各々同一の主語、または述語を含むと判断された順翻訳文であるとしてもよい。 In the above aspect, for example, the plurality of different forward-translated sentences may be forward-translated sentences each determined to include the same subject or predicate.

これによって、例えば、翻訳対象文に基づいて機械的に生成された順翻訳文群から、翻訳対象文と同じ主語または述語を有する順翻訳文のみを選択することができ、最終的な翻訳精度を向上させることができる。 Thereby, for example, from a group of forward-translated sentences mechanically generated based on the translation-target sentence, it is possible to select only a forward-translated sentence having the same subject or predicate as the translation-target sentence, and to improve the final translation accuracy. Can be improved.

上記態様において、例えば、前記機械翻訳システムは、前記複数の異なる前記順翻訳文の各々に対して少なくとも一以上生成した前記逆翻訳文の集合であって、前記複数の逆翻訳文を含む逆翻訳文群を生成し、前記逆翻訳群に含まれる前記逆翻訳文の各々について、前記翻訳対象文との類似度を評価した評価値を算出し、前記複数の逆翻訳文は、前記評価値に基づいて、前記逆翻訳文群の中から選択されるとしてもよい。 In the above aspect, for example, the machine translation system is a set of the backward-translated sentences generated at least one or more for each of the plurality of different forward-translated sentences, and the backward-translation including the plurality of backward-translated sentences. A sentence group is generated, and for each of the back-translated sentences included in the back-translated group, an evaluation value that evaluates the similarity with the translation target sentence is calculated, and the plurality of back-translated sentences are included in the evaluation value. Based on this, it may be selected from the group of reverse-translated sentences.

上記態様によると、逆翻訳文群の中から、翻訳対象文との類似度に基づいて複数の異なる逆翻訳文が選択されるため、例えば、翻訳対象文との類似度が高い逆翻訳文のみを選択することができ、最終的な翻訳精度を向上させることができる。また、例えば、翻訳対象文との類似度が低い逆翻訳文を含んで複数の逆翻訳文を選択してもよく、これによって、ユーザに提示される、複数の逆翻訳文のバリエーションを増やすことができる。そのため、例えば、機械翻訳システムに機械学習を適用させる際には、似通った内容の複数の逆翻訳文からユーザに一の逆翻訳文を選択させると、選択されなかった逆翻訳文でもユーザの入力した翻訳対象文の示す意図を表わせていた場合に、選択されなかったことで間違った逆翻訳文であると機械翻訳システムが学習してしまうことを防止できる。 According to the above aspect, a plurality of different backward-translated sentences are selected from the backward-translated sentence group based on the similarity to the translation target sentence. Can be selected, and the final translation accuracy can be improved. In addition, for example, a plurality of backward-translated sentences including a backward-translated sentence having a low degree of similarity to the translation target sentence may be selected, thereby increasing the variations of the backward-translated sentences presented to the user. Can be. Therefore, for example, when applying machine learning to a machine translation system, if the user selects one backward-translated sentence from a plurality of backward-translated sentences having similar contents, even if the backward-translated sentence that is not selected is input by the user. In the case where the intention indicated by the translated target sentence is expressed, it is possible to prevent the machine translation system from learning that it is an incorrect reverse-translated sentence because it was not selected.

上記態様において、例えば、前記機械翻訳システムは、前記複数の異なる前記順翻訳文の各々に対して少なくとも一以上生成した前記逆翻訳文の集合であって、前記複数の逆翻訳文を含む逆翻訳文群を生成し、前記逆翻訳文群に含まれる前記逆翻訳文の各々について、疑問文、肯定文、否定文、命令文の中のいずれの形態に分類されるかを判断し、前記複数の逆翻訳文は、分類された前記形態に基づいて、前記逆翻訳文群の中から選択されるとしてもよい。 In the above aspect, for example, the machine translation system is a set of the backward-translated sentences generated at least one or more for each of the plurality of different forward-translated sentences, and the backward-translation including the plurality of backward-translated sentences. A sentence group is generated, and for each of the back-translated sentences included in the back-translated sentence group, it is determined whether the sentence is classified into a question sentence, a positive sentence, a negative sentence, or a command sentence. May be selected from the group of back-translated sentences based on the classified form.

上記態様によると、逆翻訳文群の中から、文章の形態に基づいて複数の異なる逆翻訳文が選択されるため、例えば、逆翻訳文群から、翻訳対象文の形態と同じ形態である逆翻訳文のみを選択することができ、最終的な翻訳精度を向上させることができる。また、例えば、翻訳対象文の形態と異なる形態である逆翻訳文を含んで複数の逆翻訳文を選択してもよく、これによって、ユーザに提示される複数の逆翻訳文のバリエーションを増やすことができる。そのため、例えば、機械翻訳システムに機械学習を適用させる際には、似通った内容の複数の逆翻訳文からユーザに一の逆翻訳文を選択させると、選択されなかった逆翻訳文でもユーザの入力した翻訳対象文の示す意図を表わせていた場合に、選択されなかったことで間違った逆翻訳文であると機械翻訳システムが学習してしまうことを防止できる。 According to the above aspect, since a plurality of different backward-translated sentences are selected from the backward-translated sentence group based on the form of the sentence, for example, the reverse-translated sentence group having the same form as the form of the translation target sentence is selected from the backward-translated sentence group. Only the translation can be selected, and the final translation accuracy can be improved. In addition, for example, a plurality of backward-translated sentences including a backward-translated sentence having a form different from the form of the translation target sentence may be selected, thereby increasing the variation of the backward-translated sentences presented to the user. Can be. Therefore, for example, when applying machine learning to a machine translation system, if the user selects one backward-translated sentence from a plurality of backward-translated sentences having similar contents, even if the backward-translated sentence that is not selected is input by the user. In the case where the intention indicated by the translated target sentence is expressed, it is possible to prevent the machine translation system from learning that it is an incorrect reverse-translated sentence because it was not selected.

上記態様において、例えば、前記複数の逆翻訳文は、各々異なる前記形態に分類された少なくとも２以上の前記逆翻訳文を含むとしてもよい。 In the above aspect, for example, the plurality of back-translated sentences may include at least two or more of the back-translated sentences classified into the different forms.

これによって、ユーザに提示される複数の逆翻訳文のバリエーションを増やすことができる。そのため、例えば、機械翻訳システムに機械学習を適用させる際には、似通った内容の複数の逆翻訳文からユーザに一の逆翻訳文を選択させると、選択されなかった逆翻訳文でもユーザの入力した翻訳対象文の示す意図を表わせていた場合に、選択されなかったことで間違った逆翻訳文であると機械翻訳システムが学習してしまうことを防止できる。 As a result, it is possible to increase the variations of a plurality of reverse-translated sentences presented to the user. Therefore, for example, when applying machine learning to a machine translation system, if the user selects one backward-translated sentence from a plurality of backward-translated sentences having similar contents, even if the backward-translated sentence that is not selected is input by the user. In the case where the intention indicated by the translated target sentence is expressed, it is possible to prevent the machine translation system from learning that it is an incorrect reverse-translated sentence because it was not selected.

上記態様において、例えば、前記機械翻訳システムは、前記複数の異なる前記順翻訳文の各々に対して少なくとも一以上生成した前記逆翻訳文の集合であって、前記複数の逆翻訳文を含む逆翻訳文群を生成し、前記逆翻訳文群に含まれる前記逆翻訳文各々の主語、または述語を判断し、前記複数の逆翻訳文は、判断された前記主語または前記述語に基づいて、前記逆翻訳文群の中から選択されるとしてもよい。 In the above aspect, for example, the machine translation system is a set of the backward-translated sentences generated at least one or more for each of the plurality of different forward-translated sentences, and the backward-translation including the plurality of backward-translated sentences. A sentence group is generated, and the subject or predicate of each of the back-translated sentences included in the back-translated sentence group is determined, and the plurality of back-translated sentences are based on the determined subject or predescriptor, It may be selected from a group of reverse-translated sentences.

上記態様によると、逆翻訳文群の中から、主語または述語に基づいて複数の異なる逆翻訳文が選択されるため、例えば、逆翻訳文群から、翻訳対象文と同じ主語または述語を有する逆翻訳文のみを選択することができ、最終的な翻訳精度を向上させることができる。また、例えば、翻訳対象文と異なる主語または述語を有する逆翻訳文を含んで複数の逆翻訳文を選択してもよく、これによって、ユーザに提示される複数の逆翻訳文のバリエーションを増やすことができる。そのため、例えば、機械翻訳システムに機械学習を適用させる際には、似通った内容の複数の逆翻訳文からユーザに一の逆翻訳文を選択させると、選択されなかった逆翻訳文でもユーザの入力した翻訳対象文の示す意図を表わせていた場合に、選択されなかったことで間違った逆翻訳文であると機械翻訳システムが学習してしまうことを防止できる。 According to the above aspect, since a plurality of different backward-translated sentences are selected from the backward-translated sentence group based on the subject or the predicate, for example, a reverse-translated sentence having the same subject or predicate as the translation target sentence is selected from the backward-translated sentence group. Only the translation can be selected, and the final translation accuracy can be improved. Further, for example, a plurality of back-translated sentences including a back-translated sentence having a subject or a predicate different from the translation target sentence may be selected, thereby increasing the variation of the plurality of back-translated sentences presented to the user. Can be. Therefore, for example, when applying machine learning to a machine translation system, if the user selects one backward-translated sentence from a plurality of backward-translated sentences having similar contents, even if the backward-translated sentence that is not selected is input by the user. In the case where the intention indicated by the translated target sentence is expressed, it is possible to prevent the machine translation system from learning that it is an incorrect reverse-translated sentence because it was not selected.

上記態様において、例えば、前記複数の逆翻訳文は、各々異なる主語、または述語を含むと判断された少なくとも２以上の逆翻訳文を含むとしてもよい。 In the above aspect, for example, the plurality of back-translated sentences may include at least two or more back-translated sentences determined to include different subjects or predicates.

上記態様において、例えば、前記複数の逆翻訳文は、各々同一の主語、または述語を含むと判断された逆翻訳文であるとしてもよい。 In the above aspect, for example, the plurality of back-translated sentences may be back-translated sentences each determined to include the same subject or predicate.

これによって、例えば、逆翻訳文群から、翻訳対象文と同じ主語または述語を有する逆翻訳文のみを選択することができ、最終的な翻訳精度を向上させることができる。 As a result, for example, only a reverse translation sentence having the same subject or predicate as the translation target sentence can be selected from the reverse translation sentence group, and the final translation accuracy can be improved.

上記態様において、例えば、前記機械翻訳システムは、前記翻訳処理において参照する確率モデルを管理し、前記翻訳処理において、機械学習を適応し、前記複数の逆翻訳文の中のいずれの前記逆翻訳文が前記一の逆翻訳文として選択されたかを示す情報に基づいて、前記機械学習を行なって、前記確率モデルのパラメータを更新するとしてもよい。 In the above aspect, for example, the machine translation system manages a probabilistic model referred to in the translation processing, adapts machine learning in the translation processing, and selects any one of the plurality of back-translated sentences from the plurality of back-translated sentences. May be updated based on the information indicating whether or not is selected as the one back-translated sentence to update the parameters of the probability model.

上記態様によると、前記複数の逆翻訳文の中のいずれの前記逆翻訳文が前記一の逆翻訳文として選択されたかを示す情報に基づいて、前記機械学習を行なって、前記確率モデルのパラメータを更新する。これによって、翻訳対象文に対して提示された複数の逆翻訳文の中のいずれの逆翻訳文が選択されたかを示す情報がシステムに反映されるため、機械翻訳システムが使用されることで翻訳精度を向上させることが可能である。 According to the above aspect, the machine learning is performed based on information indicating which of the plurality of back-translated sentences has been selected as the one back-translated sentence, and the parameter of the probability model is To update. As a result, information indicating which one of the plurality of back-translated sentences presented for the sentence to be translated is selected is reflected in the system. It is possible to improve accuracy.

上記態様において、例えば、前記確率モデルは、前記翻訳処理に用いられる単語またはフレーズ毎に付与される重み値を含み、前記機械翻訳システムは、前記一の逆翻訳文に対応する順翻訳文である選択順翻訳文に含まれる単語またはフレーズと、前記一の逆翻訳文以外の逆翻訳文に対応する順翻訳文である非選択翻訳文に含まれる単語またはフレーズと、を比較し、前記選択順翻訳文にのみ含まれる単語またはフレーズと、前記非選択順翻訳文にのみ含まれる単語またはフレーズと、前記選択順翻訳文と前記非選択順翻訳文の双方に含まれる単語またはフレーズとに対して、各々異なる前記重み値の更新方法を適応して前記重み値を更新し、更新された前記重み値と、更新された前記重み値に対応する前記単語または前記フレーズを教師データとして用いて前記機械学習を行なうとしてもよい。 In the above aspect, for example, the probability model includes a weight value assigned to each word or phrase used in the translation processing, and the machine translation system is a forward translation corresponding to the one reverse translation. Comparing the words or phrases included in the selected-order translated sentence with the words or phrases included in the unselected translated sentence that is a forward-translated sentence corresponding to the reverse-translated sentence other than the one reverse-translated sentence; For words or phrases included only in the translated sentence, words or phrases included only in the unselected ordered translated sentence, and words or phrases included in both the selected ordered translated sentence and the unselected ordered translated sentence Updating the weights by applying different methods of updating the weights, and updating the weights and the words or phrases corresponding to the updated weights with teacher data. May perform the machine learning is used as data.

これによって、例えば、選択順翻訳文に含まれる単語またはフレーズと、選択順翻訳文に含まれない単語またはフレーズとで、スコアに差を付けて機械学習を行うことができるので、非選択順翻訳文の中に含まれる単語またはフレーズであっても、重み値の更新においてプラスの評価が行なわれる場合がある。そのため、非選択順翻訳文において、部分的に正しい翻訳が行なわれていた場合に、その部分を正しく評価可能であり、ユーザの評価結果を反映できる。 Thus, for example, machine learning can be performed with a difference in score between words or phrases included in the selected-order translated sentence and words or phrases not included in the selected-order translated sentence. Even in the case of a word or a phrase included in a sentence, a positive evaluation may be performed in updating the weight value. For this reason, when a partially correct translation is performed in the non-selection order translated sentence, that part can be correctly evaluated, and the evaluation result of the user can be reflected.

さらに、機械学習によって、確率モデルに対して、逐次、単語またはフレーズ単位でユーザの選択結果を反映させながら、学習させることが可能となり、翻訳精度を向上させることができる。 Further, by machine learning, learning can be performed on the probability model while sequentially reflecting the user's selection result in units of words or phrases, and translation accuracy can be improved.

上記態様において、例えば、前記確率モデルは、前記翻訳処理に用いられる単語またはフレーズ毎に付与される重み値を含み、前記機械翻訳システムは、前記一の逆翻訳文に含まれる単語またはフレーズと、前記一の逆翻訳文以外の逆翻訳文である非選択逆翻訳文に含まれる単語またはフレーズと、を比較し、前記一の逆翻訳文にのみ含まれる単語またはフレーズと、前記非選択逆翻訳文にのみ含まれる単語またはフレーズと、前記一の逆翻訳文と前記非選択逆翻訳文の双方に含まれる単語またはフレーズとに対して、各々異なる前記重み値の更新方法を適応して前記重み値を更新し、更新された前記重み値と、更新された前記重み値に対応する前記単語または前記フレーズを教師データとして用いて前記機械学習を行なうとしてもよい。 In the above aspect, for example, the probability model includes a weight value assigned to each word or phrase used in the translation processing, and the machine translation system includes a word or phrase included in the one reverse-translated sentence, A word or a phrase included in a non-selected reverse-translated sentence that is a reverse-translated sentence other than the one reverse-translated sentence is compared, and a word or a phrase included only in the one reverse-translated sentence is compared with the non-selected reverse-translated sentence For each of the words or phrases included only in the sentence and the words or phrases included in both the one back-translated sentence and the unselected back-translated sentence, a different updating method of the weight value is applied, and the weight is applied. A value may be updated, and the machine learning may be performed using the updated weight value and the word or the phrase corresponding to the updated weight value as teacher data.

これによって、例えば、選択された一の逆翻訳文に含まれる単語またはフレーズと、選択された一の逆翻訳文に含まれない単語またはフレーズとで、スコアに差を付けて機械学習を行うことができるので、非選択逆翻訳文の中に含まれる単語またはフレーズであっても、重み値の更新においてプラスの評価が行なわれる場合がある。そのため、非選択逆翻訳文において、部分的に正しい翻訳が行なわれていた場合に、その部分を正しく評価可能であり、ユーザの評価結果を反映できる。 Thus, for example, performing machine learning with a difference in score between a word or phrase included in one selected back-translated sentence and a word or phrase not included in the selected one back-translated sentence Therefore, even if the word or phrase is included in the unselected reverse-translated sentence, a positive evaluation may be performed in updating the weight value. Therefore, when a correct translation is partially performed in the non-selected reverse-translated sentence, that portion can be correctly evaluated, and the evaluation result of the user can be reflected.

上記態様において、例えば、前記機械翻訳システムは、前記一の逆翻訳文に含まれる単語またはフレーズと、前記一の逆翻訳文以外の逆翻訳文である非選択逆翻訳文に含まれる単語またはフレーズと、を比較し、前記一の逆翻訳文にのみ含まれる単語またはフレーズと、前記非選択逆翻訳文にのみ含まれる単語またはフレーズと、前記一の逆翻訳文と前記非選択逆翻訳文の双方に含まれる単語またはフレーズとに対して、各々異なる前記重み値の更新方法を適応して前記重み値を更新し、更新された前記重み値と、更新された前記重み値に対応する前記単語または前記フレーズを教師データとして用いて前記機械学習を行なうとしてもよい。 In the above aspect, for example, the machine translation system may include a word or a phrase included in the one back-translated sentence and a word or phrase included in a non-selected reverse-translated sentence other than the one back-translated sentence. And a word or phrase included only in the one back-translated sentence, a word or phrase included only in the unselected back-translated sentence, and the one back-translated sentence and the non-selected back-translated sentence. For each word or phrase included in both, the weight value is updated by applying a different update method of the weight value, and the updated weight value and the word corresponding to the updated weight value are updated. Alternatively, the machine learning may be performed using the phrase as teacher data.

上記態様において、例えば、前記重み値は、前記一の逆翻訳文のみに対応する前記単語について、正例としての値であり、前記一の逆翻訳文以外の文のみに対応する前記単語について、負例としての値であるとしてもよい。 In the above aspect, for example, the weight value is a value as a positive example for the word corresponding to only the one back-translated sentence, and for the word corresponding to only a sentence other than the one back-translated sentence, It may be a negative value.

これによって、前記重み値に対して、プラスの評価とマイナスの評価の双方を反映できる。 Thereby, both the positive evaluation and the negative evaluation can be reflected on the weight value.

上記態様において、例えば、前記機械学習は、強化学習、識別学習、ニューラルネット学習の中の少なくとも一を用いた学習であるとしてもよい。 In the above aspect, for example, the machine learning may be learning using at least one of reinforcement learning, identification learning, and neural network learning.

また、他の態様において、第１言語と第２言語との間の翻訳処理を行う機械翻訳装置であって、前記第１言語の翻訳対象文の入力を受け付ける入力部と、前記翻訳対象文を前記第２言語へ翻訳した順翻訳文、前記順翻訳文を前記第１言語へ逆翻訳した逆翻訳文、を生成する翻訳部と、前記逆翻訳文および前記一の逆翻訳文に対応する前記順翻訳文を出力する出力部と、ユーザの入力を受け付けるユーザ入力部と、を備え、前記翻訳部は、前記翻訳対象文について複数の異なる前記順翻訳文を生成し、前記複数の異なる前記順翻訳文の各々に対応する複数の逆翻訳文を生成し、前記出力部は、前記複数の逆翻訳文を出力しているときに、前記ユーザ入力部において、前記複数の逆翻訳文から一の逆翻訳文を選択する入力を受け付けた場合、前記一の逆翻訳文に対応する前記順翻訳文を出力する。 Further, in another aspect, there is provided a machine translation apparatus that performs a translation process between a first language and a second language, wherein the input unit receives an input of the translation target sentence of the first language, and A translation unit that generates a forward-translated sentence translated into the second language, and a reverse-translated sentence reversely translated from the forward-translated sentence into the first language; and a translation unit corresponding to the reverse-translated sentence and the one reverse-translated sentence. An output unit that outputs a forward-translated sentence; and a user input unit that receives a user's input, wherein the translating unit generates a plurality of different forward-translated sentences for the translation target sentence, and generates the plurality of different forward-ordered sentences. The output unit generates a plurality of backward-translated sentences corresponding to each of the translated sentences, and the output unit outputs one of the backward-translated sentences from the plurality of backward-translated sentences in the user input unit when outputting the plurality of backward-translated sentences. If an input to select a back translation is received, To output the forward translation sentence corresponding to the reverse translation of.

また、第２の他の態様において、情報出力装置へ接続し、第１言語と第２言語との間の翻訳処理を行う機械翻訳装置の動作を制御するプログラムであって、前記機械翻訳装置のコンピュータに対して、前記第１言語の翻訳対象文を受信させ、受信した前記翻訳対象文を前記第２言語へ翻訳した複数の異なる順翻訳文を生成させ、前記複数の異なる前記順翻訳文の各々について前記第１言語へ逆翻訳した複数の逆翻訳文を生成させ、前記情報出力装置において、前記複数の逆翻訳文を表示させているときに、前記複数の逆翻訳文から一の逆翻訳文を選択する操作を受け付けた場合、前記一の逆翻訳文を前記第２言語へ翻訳した順翻訳文を出力させる。 Further, in a second other aspect, there is provided a program for controlling an operation of a machine translation device connected to an information output device and performing a translation process between a first language and a second language, wherein Causing the computer to receive the translation target sentence in the first language, generate a plurality of different forward translated sentences obtained by translating the received translation target sentence into the second language, A plurality of back-translated sentences, each of which is back-translated into the first language, are generated, and when the plurality of back-translated sentences are displayed on the information output device, one back-translation from the plurality of back-translated sentences is generated. When an operation for selecting a sentence is received, a forward-translated sentence obtained by translating the one backward-translated sentence into the second language is output.

（実施の形態）
以下本発明の実施の形態について、図面を参照しながら説明する。 (Embodiment)
Hereinafter, embodiments of the present invention will be described with reference to the drawings.

なお以下の実施の形態では、翻訳前の言語である原言語を日本語、翻訳後の言語である目的言語を英語として説明している箇所があるが、これらは一例であり、原言語と目的語の対はどのような組み合わせの言語対であっても構わない。 In the following embodiments, the source language as the language before translation is described as Japanese, and the target language as the language after translation is described as English. The word pairs can be any combination of language pairs.

原言語から目的言語への翻訳で得られた翻訳文を順翻訳文、目的言語から原言語への翻訳で得られた翻訳文を逆翻訳文と表記する。 The translation obtained from the translation from the source language to the target language is referred to as a forward translation, and the translation obtained from the translation from the target language to the source language is referred to as a reverse translation.

また、ユーザに提示される逆翻訳文を、ユーザ提示文と表記し、ユーザが選択した文を（ユーザ）選択文、選択しなかった逆翻訳文を（ユーザ）非選択文と表記する。 In addition, a backward-translated sentence presented to the user is referred to as a user-provided sentence, a sentence selected by the user is referred to as a (user) selected sentence, and a reverse-translated sentence not selected is referred to as a (user) non-selected sentence.

図１は、本実施の形態における、システムの全体構成の一例を示す図である。情報表示端末１００、ネットワーク２００、翻訳サーバ３００、マイク４００、スピーカー５００を備える。情報表示端末１００の例としては、スマートフォンやタブレット端末、専用表示機器端末、パーソナルコンピュータ（ＰＣ）などが挙げられる。ここに挙げたもの以外でも、ユーザと情報のやりとりができる端末であれば何でもよい。 FIG. 1 is a diagram illustrating an example of the overall configuration of a system according to the present embodiment. It includes an information display terminal 100, a network 200, a translation server 300, a microphone 400, and a speaker 500. Examples of the information display terminal 100 include a smartphone, a tablet terminal, a dedicated display device terminal, and a personal computer (PC). Any terminal other than those listed here may be used as long as it can exchange information with the user.

また、情報表示端末１００におけるユーザの入力操作は、テキストでの入力、音声による入力などが想定される。テキストでの入力においては、例えば、タッチパネルによる入力や、キーボードによる入力が考えられる。また、音声による入力の場合、例えば、マイクによる入力が考えられる。この他にも、例えば、ジェスチャによる入力などを用いるとしてもよい。 In addition, the user's input operation on the information display terminal 100 is assumed to be text input, voice input, or the like. In text input, for example, input using a touch panel or input using a keyboard can be considered. In the case of input by voice, for example, input by a microphone can be considered. In addition to this, for example, a gesture input may be used.

情報表示端末１００において、機械翻訳結果等を出力する場合、ディスプレイを介して結果を出力してもよいし、音声を用いて結果を出力するとしてもよい。 When the information display terminal 100 outputs a machine translation result or the like, the result may be output via a display or the result may be output using voice.

ネットワーク２００は、情報表示端末１００、翻訳サーバ３００、マイク４００、スピーカー５００が接続される。接続方法の一例として、有線、無線によるＬＡＮ接続などが挙げられるが、各構成要素を通信可能に接続するものであれば、これに限らない。 The information display terminal 100, the translation server 300, the microphone 400, and the speaker 500 are connected to the network 200. An example of the connection method includes a wired or wireless LAN connection, but the connection method is not limited to this as long as each component is communicably connected.

翻訳サーバ３００は、情報表示端末１００から受信した翻訳対象文に対して機械翻訳処理を行う。例えば、情報表示端末１００から入力された原言語の文字列を受信し、機械翻訳処理を行う。また、機械翻訳結果についてユーザからのフィードバックを受けて、機械学習を行なう機能も有する。翻訳サーバ３００の詳細な構成は後述する。 The translation server 300 performs a machine translation process on the translation target sentence received from the information display terminal 100. For example, a character string in the source language input from the information display terminal 100 is received, and a machine translation process is performed. It also has a function of receiving machine feedback from the user on machine translation results and performing machine learning. The detailed configuration of the translation server 300 will be described later.

なお、例えば、情報表示端末１００と翻訳サーバ３００が一体となって実現されてもよい。 Note that, for example, the information display terminal 100 and the translation server 300 may be realized integrally.

マイク４００は、機械翻訳システムに対して、音声による入力を行なう。マイク４００は、情報表示端末１００に付属していてもよいし、単独でネットワーク２００に接続する機能を備えているとしてもよい。また、機械翻訳システムに対して、音声による入力が行われない場合は、マイク４００の構成は必須ではない。 Microphone 400 performs voice input to the machine translation system. The microphone 400 may be attached to the information display terminal 100 or may have a function of connecting to the network 200 alone. Further, when no voice input is performed to the machine translation system, the configuration of the microphone 400 is not essential.

スピーカー５００は、機械翻訳システムにおいて、音声による出力を行なう。スピーカー５００は、情報表示端末１００に付属していてもよいし、単独でネットワーク２００に接続する機能を備えているとしてもよい。また、機械翻訳システムにおいて、音声による出力が行われない場合は、スピーカー５００の構成は必須ではない。 The speaker 500 performs audio output in the machine translation system. The speaker 500 may be attached to the information display terminal 100 or may have a function of connecting to the network 200 alone. In the case where no voice output is performed in the machine translation system, the configuration of the speaker 500 is not essential.

機械翻訳システムの入力・出力モダリティは、音声による入出力、またはテキストでの入出力のいずれか一方のみを備えていてもよいし、両方を備えていてもよい。ユーザから、機械翻訳システムに対して音声による入力が行われた場合、音声による出力を行う。また、ユーザからテキスト形式で入力された場合は、テキスト（画面表示）による出力を行う。 The input / output modality of the machine translation system may include only one of input and output by voice and input and output by text, or may include both. When a voice input is performed from the user to the machine translation system, a voice output is performed. When a text is input by a user, the text (screen display) is output.

図２は、本実施の形態における、情報表示端末１００の構成を示すブロック図である。 FIG. 2 is a block diagram showing a configuration of information display terminal 100 in the present embodiment.

情報表示端末１００は、通信部１０１、入力部１０２、出力部１０３、制御部１０４、選択文検出部１０５、記憶部１０６を備える。 The information display terminal 100 includes a communication unit 101, an input unit 102, an output unit 103, a control unit 104, a selected sentence detection unit 105, and a storage unit 106.

通信部１０１は、翻訳サーバ３００との通信を行い、情報表示端末１００において入力された翻訳対象文の送信、後述する翻訳文および逆翻訳文の受信などを行なう。また、これらの情報に限らず、翻訳サーバ３００と各種の情報の送受信を行なう。 The communication unit 101 performs communication with the translation server 300, and performs transmission of a translation target sentence input in the information display terminal 100, reception of a later-described translated sentence and a reverse-translated sentence, and the like. In addition, not only these information but also various kinds of information are transmitted and received with the translation server 300.

入力部１０２は、ユーザからの入力を受け付ける。入力部１０２は、翻訳対象文の入力、後述する逆翻訳文の選択入力などの入力を受け付ける。入力の形態としては、音声入力、テキスト形式での入力が考えられる。音声入力が用いられる場合、音声によって入力された翻訳対象文に対して音声認識処理が行われ、音声認識処理の結果出力される文字列が入力文として機械翻訳システムに入力される。テキスト形式での入力が用いられる場合、キーボード、マウス、タッチパネルなどによる文字列の入力を受け付ける。 The input unit 102 receives an input from a user. The input unit 102 receives inputs such as an input of a translation target sentence and a selection input of a reverse translation sentence described later. As a form of the input, a voice input or an input in a text format can be considered. When speech input is used, a speech recognition process is performed on a translation target sentence input by speech, and a character string output as a result of the speech recognition process is input to the machine translation system as an input sentence. When text input is used, input of a character string by a keyboard, a mouse, a touch panel, or the like is accepted.

出力部１０３は、入力部１０２において入力された翻訳対象文、通信部１０１を介して受信した複数の逆翻訳文、翻訳結果などを出力する。なお、出力部１０３は、ディスプレイなど、画面表示を実行する表示部として実現されてもよく、例えば、スマートフォン、タブレット端末などに用いられるタッチパネル式のディスプレイまたはモニタが想定される。また、スピーカーなど、音声を出力する音声出力部として実現されてもよい。制御部１０４は、通信部１０１、入力部１０２、出力部１０３、制御部１０４、選択文検出部１０５、記憶部１０６の動作を制御する。 The output unit 103 outputs the translation target sentence input at the input unit 102, a plurality of reverse-translated sentences received via the communication unit 101, a translation result, and the like. The output unit 103 may be realized as a display unit that performs screen display, such as a display. For example, a touch panel display or monitor used for a smartphone, a tablet terminal, or the like is assumed. Further, it may be realized as an audio output unit that outputs audio, such as a speaker. The control unit 104 controls operations of the communication unit 101, the input unit 102, the output unit 103, the control unit 104, the selected sentence detection unit 105, and the storage unit 106.

選択文検出部１０５は、出力部１０３によって出力された複数の逆翻訳文に対して、ユーザがどの逆翻訳文を選択したかを検出する。例えば、入力部１０２において、複数の逆翻訳文から一の逆翻訳文を選択する旨の入力が行なわれた場合、どの逆翻訳文が選択されたかを示すユーザ選択情報が選択文検出部１０５において検出される。検出したユーザ選択情報は、通信部１０１を介して翻訳サーバ３００へ送信される。また、ユーザ選択情報に基づいて、出力部１０３の出力内容を制御してもよい。例えば、出力部１０３がディスプレイによって実現されている場合、ユーザが選択した逆翻訳文を強調表示する、もしくは、ユーザが選択しなかった逆翻訳文を表示画面から消去する制御を行なうとしてもよい。 The selected sentence detection unit 105 detects which one of the plurality of back-translated sentences output by the output unit 103 has been selected by the user. For example, when an input to select one backward-translated sentence from a plurality of backward-translated sentences is performed in the input unit 102, user selection information indicating which backward-translated sentence is selected is output to the selected sentence detecting unit 105. Is detected. The detected user selection information is transmitted to the translation server 300 via the communication unit 101. Further, the output content of the output unit 103 may be controlled based on the user selection information. For example, when the output unit 103 is implemented by a display, control may be performed to highlight the reverse-translated sentence selected by the user or to erase the reverse-translated sentence not selected by the user from the display screen.

ここで、ユーザによって選択された逆翻訳文、およびその逆翻訳文に対応する順翻訳文をユーザ選択文とする。また、ユーザが選択しなかった逆翻訳文、およびその逆翻訳文に対応する順翻訳文をユーザ非選択文とする。 Here, the backward-translated sentence selected by the user and the forward-translated sentence corresponding to the backward-translated sentence are defined as a user-selected sentence. In addition, the backward-translated sentence not selected by the user and the forward-translated sentence corresponding to the backward-translated sentence are defined as user non-selected sentences.

記憶部１０６は、翻訳サーバ３００から受信した情報の一時的な記憶、情報表示端末１００において実行される各種のアプリケーションプログラムの記憶などを行なう。 The storage unit 106 performs temporary storage of information received from the translation server 300, storage of various application programs executed in the information display terminal 100, and the like.

図３は、本実施の形態における、翻訳サーバ３００の構成を示すブロック図である。翻訳サーバ３００は、通信部２１０、制御部２２０、機械翻訳部２３０、記憶部２４０を備える。さらに、機械翻訳部２３０は、順翻訳部２３１、順翻訳文選択部２３２、逆翻訳部２３３、逆翻訳文選択部２３４、選択文判断部２３５、フレーズ分割部２３６、選択結果評価部２３７、学習部２３８を有する。 FIG. 3 is a block diagram showing a configuration of translation server 300 in the present embodiment. The translation server 300 includes a communication unit 210, a control unit 220, a machine translation unit 230, and a storage unit 240. Further, the machine translation unit 230 includes a forward translation unit 231, a forward translation sentence selection unit 232, a reverse translation unit 233, a reverse translation sentence selection unit 234, a selection sentence determination unit 235, a phrase division unit 236, a selection result evaluation unit 237, and learning. It has a part 238.

通信部２１０は、情報表示端末１００との通信を行い、情報表示端末１００において入力された翻訳対象文の受信、後述する翻訳文および逆翻訳文の送信などを行なう。また、これらの情報に限らず、情報表示端末１００と各種の情報の送受信を行なう。 The communication unit 210 performs communication with the information display terminal 100, and receives a translation target sentence input in the information display terminal 100, transmits a later-described translated sentence and a reverse translated sentence, and the like. In addition to the above information, transmission and reception of various information with the information display terminal 100 are performed.

制御部２２０は、通信部２１０、機械翻訳部２３０、記憶部２４０の各種の動作を制御する。 The control unit 220 controls various operations of the communication unit 210, the machine translation unit 230, and the storage unit 240.

記憶部２４０は、機械翻訳部２３０が各種の翻訳処理、フレーズ分割処理などにおいて参照するフレーズテーブルを格納する。フレーズテーブルについては後述する。 The storage unit 240 stores a phrase table that the machine translation unit 230 refers to in various types of translation processing, phrase division processing, and the like. The phrase table will be described later.

機械翻訳部２３０は、通信部を介して受信した翻訳対象文に対して機械翻訳処理を実行する。機械翻訳部２３０では、ルールベース機械翻訳（ＲＢＭＴ）、統計的機械翻訳（ＳＭＴ）、ディープニューラルネットワークによるモデル獲得型の機械翻訳（ＤＮＮＭＴ）などによって機械翻訳が行われる。機械翻訳部２３０は、翻訳結果を評価して、自動評価スコア（ＢＬＥＵなど）、内部スコア（人手による評価など）などのスコアを取得する。 The machine translation unit 230 executes a machine translation process on the translation target sentence received via the communication unit. The machine translation unit 230 performs machine translation by a rule-based machine translation (RBMT), a statistical machine translation (SMT), a model acquisition type machine translation using a deep neural network (DNNMT), or the like. The machine translation unit 230 evaluates the translation result and acquires a score such as an automatic evaluation score (such as BLEU) and an internal score (such as evaluation by hand).

また、ユーザによる選択結果を機械学習へ反映させるため、翻訳手法によっては必要に応じて図１１に示すような、フレーズの対を予め示した、フレーズテーブルを用意する。 In addition, in order to reflect the result of the selection by the user to machine learning, a phrase table in which pairs of phrases are shown in advance as shown in FIG.

ルールベース機械翻訳（ＲＢＭＴ）は、人手によって構築された変換規則（訳語の対をデータベースとして記憶したもの）を元に翻訳を行うため、図１１のようなフレーズテーブルを保持していない可能性がある。ただし、句または単語単位での対訳データベースが存在する場合は、対訳データベースに学習結果を反映してもよいし、フレーズテーブルを別途用意してもよい。 Since rule-based machine translation (RBMT) performs translation based on conversion rules (stored pairs of translated words as a database) constructed manually, there is a possibility that the phrase table as shown in FIG. 11 is not held. is there. However, if there is a bilingual database for each phrase or word, the learning result may be reflected in the bilingual database, or a phrase table may be separately prepared.

統計的機械翻訳（ＳＭＴ）では、図１１のようなフレーズテーブルを予め保持しているため、これを使用すればよい。 In the statistical machine translation (SMT), since a phrase table as shown in FIG. 11 is held in advance, it may be used.

ディープニューラルネットによるモデル獲得型の機械翻訳（ＤＮＮＭＴ）では、モデル自体を自動で構築するため、フレーズテーブルを保持していないことが多い。よって、別途フレーズテーブルを用意してもよい。 In machine acquisition type machine translation (DNNMT) using a deep neural network, a phrase table is often not held because a model itself is automatically constructed. Therefore, a separate phrase table may be prepared.

さらに、ユーザの選択を学習結果に反映する対象はこれだけに限らず、例えば、原言語同士の言い換えの対を表すようなデータベースを持っていてもよい。なお、機械翻訳処理の詳細については、後述する。 Further, the object that reflects the user's selection in the learning result is not limited to this, and for example, a database that represents a pair of paraphrases between source languages may be provided. The details of the machine translation process will be described later.

順翻訳部２３１は、通信部２１０を介して受信した翻訳対象文の言語（原言語）から、翻訳対象文を翻訳した結果出力される言語（目的言語）への機械翻訳処理を実行する。ここで、原言語から目的言語への翻訳を、順翻訳とし、順翻訳によって得られる翻訳文を順翻訳文とする。このとき、順翻訳処理によって、翻訳対象文に対して複数の順翻訳文が生成される。また、順翻訳部２３１は、記憶部２４０に格納されているフレーズテーブルを参照して機械翻訳処理を行う。順翻訳部２３１において生成された複数の順翻訳文を、順翻訳文群とする。順翻訳部２３１は、生成した順翻訳文群を、順翻訳文選択部２３２へ出力する。 The forward translation unit 231 executes a machine translation process from the language (source language) of the translation target sentence received via the communication unit 210 to a language (target language) output as a result of translating the translation target sentence. Here, the translation from the source language to the target language is referred to as a forward translation, and the translation obtained by the forward translation is referred to as a forward translation. At this time, a plurality of forward translation sentences are generated for the translation target sentence by the forward translation process. The forward translation unit 231 performs a machine translation process with reference to the phrase table stored in the storage unit 240. The plurality of forward-translated sentences generated by the forward-translating unit 231 are referred to as a forward-translated sentence group. The forward translation unit 231 outputs the generated forward translation sentence group to the forward translation sentence selection unit 232.

順翻訳文選択部２３２は、順翻訳部２３１によって生成された順翻訳文群の中から、Ｎ個の順翻訳文を選択する順翻訳文選択処理を行う。この順翻訳文選択処理の詳細については後述する。順翻訳文選択部２３２は、選択したＮ個の順翻訳文を逆翻訳部２３３へ出力する。 The forward-translated sentence selection unit 232 performs a forward-translated sentence selection process of selecting N forward-translated sentences from the forward-translated sentence group generated by the forward-translation unit 231. Details of this forward translation sentence selection processing will be described later. The forward-translated sentence selection unit 232 outputs the selected N forward-translated sentences to the reverse-translation unit 233.

逆翻訳部２３３は、順翻訳文選択部２３２において選択されたＮ個の順翻訳文の各々について、順翻訳文の言語（目的言語）から、翻訳対象文の言語（原言語）への機械翻訳処理を実行する。ここで、目的言語から原言語への翻訳を、逆翻訳とし、逆翻訳によって得られる翻訳文を逆翻訳文とする。このとき、逆翻訳処理によって、各々の順翻訳文に対して一以上の逆翻訳文が生成される。そのため、結果として、複数の逆翻訳文が生成される。また、逆翻訳部２３３は、記憶部２４０に格納されているフレーズテーブルを参照して機械翻訳処理を行う。逆翻訳部２３３において生成された複数の逆翻訳文を逆翻訳文群とする。逆翻訳部２３３は、生成した逆翻訳文群を逆翻訳文選択部２３４へ出力する。 For each of the N forward-translated sentences selected by the forward-translated sentence selecting unit 232, the backward translating unit 233 performs machine translation from the language of the forward-translated sentence (target language) to the language of the sentence to be translated (source language). Execute the process. Here, a translation from the target language to the source language is referred to as a reverse translation, and a translation obtained by the reverse translation is referred to as a reverse translation. At this time, one or more backward-translated sentences are generated for each forward-translated sentence by the backward-translation processing. Therefore, as a result, a plurality of back-translated sentences are generated. The reverse translation unit 233 performs a machine translation process with reference to the phrase table stored in the storage unit 240. The plurality of backward-translated sentences generated by the backward-translating unit 233 are regarded as a backward-translated sentence group. The back translation unit 233 outputs the generated back translation sentence group to the back translation sentence selection unit 234.

逆翻訳文選択部２３４は、逆翻訳部２３３によって生成された逆翻訳文群の中から、Ｍ個の逆翻訳文を選択する逆翻訳文選択処理を行う。この逆翻訳文選択処理については、後述する。逆翻訳文選択部２３４は、通信部２１０を介して、選択したＭ個の逆翻訳文を情報表示端末１００へ送信する。情報表示端末１００の出力部１０３において、Ｍ個の逆翻訳文は選択可能に出力される。 The backward-translated sentence selecting unit 234 performs a backward-translated sentence selection process of selecting M backward-translated sentences from the backward-translated sentence group generated by the backward-translating unit 233. This reverse translation sentence selection processing will be described later. The backward-translated sentence selection unit 234 transmits the selected M backward-translated sentences to the information display terminal 100 via the communication unit 210. On the output unit 103 of the information display terminal 100, the M reverse-translated sentences are selectively output.

選択文判断部２３５は、通信部２１０を介して情報表示端末１００から受信したユーザ選択情報に基づいて、逆翻訳文選択部２３４において選択されたＭ個の逆翻訳文の中からユーザがどの逆翻訳文を選択したかを判断し、判断した情報をフレーズ分割部２３６へ出力する。 Based on the user selection information received from the information display terminal 100 via the communication unit 210, the selected sentence determination unit 235 determines which of the M reverse-translated sentences selected by the backward-translated It is determined whether a translation has been selected, and the determined information is output to the phrase division unit 236.

フレーズ分割部２３６は、逆翻訳文選択部２３４から入力された複数の逆翻訳文に基づいて、複数の逆翻訳文の各々について、逆翻訳文を句または単語単位に分割する。また、逆翻訳文に対応する順翻訳分についても、句または単語単位に分割する。このとき、選択文判断部２３５から入力された複数の逆翻訳文からいずれの逆翻訳文が選択されたかを示す情報もあわせて用いるとしてもよい。また、記憶部２４０に格納されているフレーズテーブルを用いるとしてもよい。逆翻訳文および順翻訳文を句または単語単位に分割した情報と、ユーザ選択情報とが選択結果評価部２３７へ出力される。 The phrase division unit 236 divides the back-translated sentence into phrases or words for each of the plurality of back-translated sentences based on the plurality of back-translated sentences input from the back-translated sentence selection unit 234. Further, the forward translation corresponding to the backward translation is also divided into phrases or words. At this time, information indicating which back-translation sentence was selected from the plurality of back-translation sentences input from the selection sentence determination unit 235 may be used together. Further, a phrase table stored in the storage unit 240 may be used. Information obtained by dividing the backward-translated sentence and the forward-translated sentence into units of phrases or words, and user selection information are output to the selection result evaluation unit 237.

フレーズ分割には、統計的機械翻訳（ＳＭＴ）に示されるような、双言語間での、句または単語単位での対応関係を示すフレーズテーブルを用いることが多いが、必ずしも決まったフレーズテーブルを用いる必要はなく、それに類するものであってもよい。機械翻訳においてフレーズテーブルが用いられている場合は、そのフレーズテーブルを用いて分割を行ってもよい。または、別途用意したフレーズテーブルなどを用いてもよいし、対訳辞書などがあればそれを用いてもよい。 For phrase division, a phrase table indicating correspondence between phrases or words between bilingual languages as shown in Statistical Machine Translation (SMT) is often used, but a fixed phrase table is always used. It is not necessary and may be similar. When a phrase table is used in machine translation, division may be performed using the phrase table. Alternatively, a separately prepared phrase table or the like may be used, or if there is a bilingual dictionary or the like, it may be used.

選択結果評価部２３７では、フレーズ分割部２３６から入力される情報に基づいて、順翻訳文および逆翻訳文に対して評価を行なう。このとき、ユーザ選択情報に基づいて、ユーザ選択文とユーザ非選択文とで異なる評価を行うとしてもよい。詳細な評価方法については、後述する。選択結果評価部２３７は、順翻訳文および逆翻訳文を評価した評価情報を学習部２３８へ出力する。 The selection result evaluation unit 237 evaluates the forward-translated sentence and the backward-translated sentence based on the information input from the phrase dividing unit 236. At this time, different evaluations may be performed for the user-selected sentence and the user-unselected sentence based on the user selection information. A detailed evaluation method will be described later. The selection result evaluation unit 237 outputs evaluation information obtained by evaluating the forward-translated sentence and the backward-translated sentence to the learning unit 238.

学習部２３８は、選択結果評価部２３７から入力された評価情報に基づいて、記憶部２４０に格納されているフレーズテーブルを更新することで機械翻訳処理における機械学習を行なう。すなわち、評価情報をフレーズテーブルに反映する。機械学習の対象としては、順翻訳部２３１が参照するフレーズテーブルでもよいし、逆翻訳部２３３が参照するフレーズテーブルでもよい。また、必ずしも評価情報をフレーズテーブルに反映する必要はなく、例えば、言い換えの辞書や単語の辞書などに結果を反映し、機械翻訳処理における機械学習を行なうとしてもよい。フレーズテーブルに対する評価情報の詳細な反映方法については、後述する。 The learning unit 238 performs machine learning in the machine translation process by updating the phrase table stored in the storage unit 240 based on the evaluation information input from the selection result evaluation unit 237. That is, the evaluation information is reflected in the phrase table. The target of machine learning may be a phrase table referred to by the forward translator 231 or a phrase table referred to by the reverse translator 233. It is not always necessary to reflect the evaluation information in the phrase table. For example, the result may be reflected in a paraphrase dictionary or a dictionary of words, and machine learning in the machine translation process may be performed. A detailed method of reflecting the evaluation information on the phrase table will be described later.

図４は、情報表示端末の各部の機能をプログラムにより実現するコンピュータのハードウェア構成を示す図である。このコンピュータ１０００は、入力ボタン、タッチパッドなどの入力装置１００１、ディスプレイ、スピーカーなどの出力装置１００２、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）１００３、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）１００４、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）１００５などを備える。また、コンピュータ１０００は、ハードディスク装置、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）などの記憶装置１００６、ＤＶＤ−ＲＯＭ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｋＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＵＳＢ（ＵｎｉｖｅｒｓａｌＳｅｒｉａｌＢｕｓ）メモリなどの記録媒体から情報を読み取る読取装置１００７、ネットワークを介して通信を行う送受信装置１００８を備えるとしてもよい。上述した各部は、バス１００９により接続される。 FIG. 4 is a diagram illustrating a hardware configuration of a computer that realizes functions of each unit of the information display terminal by a program. The computer 1000 includes an input device 1001 such as an input button and a touch pad, an output device 1002 such as a display and a speaker, a CPU (Central Processing Unit) 1003, a ROM (Read Only Memory) 1004, a RAM (Random Access Memory) 1005, and the like. Prepare. The computer 1000 reads information from a recording device such as a hard disk device, a storage device 1006 such as an SSD (Solid State Drive), a DVD-ROM (Digital Versatile Disk Read Only Memory), or a USB (Universal Serial Bus) memory. 1007, a transmission / reception device 1008 for performing communication via a network may be provided. The above-described units are connected by a bus 1009.

そして、読取装置１００７は、上記各部の機能を実現するためのプログラムを記録した記録媒体からそのプログラムを読み取り、記憶装置１００６に記憶させる。あるいは、送受信装置１００８が、ネットワークに接続されたサーバ装置と通信を行い、サーバ装置からダウンロードした上記各部の機能を実現するためのプログラムを記憶装置１００６に記憶させる。 Then, the reading device 1007 reads the program from a recording medium on which the program for realizing the function of each unit is recorded, and stores the program in the storage device 1006. Alternatively, the transmission / reception device 1008 communicates with a server device connected to the network, and causes the storage device 1006 to store a program downloaded from the server device for realizing the function of each unit.

そして、ＣＰＵ１００３が、記憶装置１００６に記憶されたプログラムをＲＡＭ１００５にコピーし、そのプログラムに含まれる命令をＲＡＭ１００５から順次読み出して実行することにより、上記各部の機能が実現される。また、プログラムを実行する際、ＲＡＭ１００５または記憶装置１００６には、各実施の形態で述べた各種処理で得られた情報が記憶され、適宜利用される。 Then, the CPU 1003 copies the program stored in the storage device 1006 to the RAM 1005, and sequentially reads out and executes the instructions included in the program from the RAM 1005, thereby realizing the functions of the respective units. When the program is executed, information obtained by the various processes described in the embodiments is stored in the RAM 1005 or the storage device 1006, and is appropriately used.

図５は、本実施の形態における、機械翻訳システムの動作を示すフローチャートである。簡単のため、本フローチャートでは、情報表示端末１００の入力部１０２および出力部１０３は、タッチパネル式ディスプレイによって実現されるとする。情報表示端末１００において、タッチパネル式ディスプレイを介して、ユーザの入力および翻訳結果等の出力が行なわれるとするが、例えば、キーボードとディスプレイなど、入力部１０２と出力部１０３がそれぞれ独立している構成であってもよい。また、音声による入出力が行なわれてもよい。また説明のため、原言語（母国語）を日本語、目的言語を英語として説明を行っている箇所がある。ただしこれらは一例であり、原言語および目的言語はどのような組合せであっても構わない。 FIG. 5 is a flowchart illustrating the operation of the machine translation system according to the present embodiment. For simplicity, in this flowchart, the input unit 102 and the output unit 103 of the information display terminal 100 are realized by a touch panel display. In the information display terminal 100, it is assumed that the input of the user and the output of the translation result and the like are performed via the touch panel display. For example, the input unit 102 and the output unit 103 such as a keyboard and a display are independent. It may be. Further, input / output by voice may be performed. For the sake of explanation, there are some places where the explanation is made with the source language (native language) being Japanese and the target language being English. However, these are merely examples, and the source language and the target language may be in any combination.

まず、ステップＳ４０１において、ユーザによって入力された翻訳対象文を取得する。ステップＳ４０２において、翻訳対象文に対する機械翻訳処理を行う。ここでの機械翻訳処理は、原言語の翻訳対象文を目的言語の文字列（順翻訳文）に翻訳する順翻訳処理である。このとき、例えば、統計的機械翻訳（ＳＭＴ）は、翻訳モデルと言語モデルから翻訳としての確からしさをスコア化する。翻訳モデルとは、訳語の尤もらしさを規定する統計モデルであり、言語モデルとは、出力言語の単語の並びの尤もらしさを規定する統計モデルである。これら２つのモデルから、翻訳としての確からしさをスコア化し、そのスコア順に翻訳結果を出力することにより、複数の順翻訳文が生成されるが、この複数の順翻訳文を、説明の便宜上、順翻訳文群とする。 First, in step S401, a translation target sentence input by the user is acquired. In step S402, a machine translation process is performed on the translation target sentence. The machine translation process here is a forward translation process for translating a translation target sentence in the source language into a character string (forward translation sentence) in the target language. At this time, for example, statistical machine translation (SMT) scores the likelihood of translation from a translation model and a language model. The translation model is a statistical model that defines the likelihood of the translated word, and the language model is a statistical model that defines the likelihood of the arrangement of words in the output language. From these two models, a plurality of forward-translated sentences are generated by scoring the certainty as translation and outputting the translation results in the order of the scores. A group of translated sentences.

本実施の形態における機械翻訳の具体的な処理には、一般的な機械翻訳の処理であるため、ここでの説明は省略する。 The specific processing of the machine translation according to the present embodiment is a general machine translation processing, and a description thereof will be omitted.

ステップＳ４０３において、順翻訳文群から、所定の基準に基づいてＮ個の順翻訳文が選択される。例えば、順翻訳文群に含まれる複数の順翻訳文の各々に対して評価スコアを付与し、評価スコアの高いものからＮ個選択するとしてもよい。また、評価スコアに関係なく、ランダムにＮ個選択するとしてもよい。などである。また、順翻訳文群に含まれる複数の順翻訳文が示す内容を考慮して、選択したＮ個の順翻訳文の中に同趣旨のものが含まれないように選択するとしてもよい。また、順翻訳文群に含まれる複数の順翻訳文の中に異なる趣旨のものが少ない場合には、異なる趣旨を持つ順翻訳文を追加する処理を、必要に応じて行ってもよい。順翻訳文の詳細な選択方法については、図６を用いて後述する。 In step S403, N forward-translated sentences are selected from the forward-translated sentence group based on a predetermined criterion. For example, an evaluation score may be given to each of a plurality of forward-translated sentences included in the group of forward-translated sentences, and N items may be selected from those having the highest evaluation scores. Alternatively, N items may be selected at random regardless of the evaluation score. And so on. In addition, in consideration of the contents indicated by a plurality of forward-translated sentences included in the forward-translated sentence group, the selected N forward-translated sentences may be selected so as not to have the same meaning. In addition, when a plurality of forward-translated sentences included in the forward-translated sentence group include only a small number of forward-translated sentences having different purposes, a process of adding forward-translated sentences having different purposes may be performed as necessary. A detailed method of selecting a forward-translated sentence will be described later with reference to FIG.

ステップＳ４０３の処理が行われたのちに、順翻訳文を増やす必要があると判断された場合（Ｓ４０４のＹＥＳ）、ステップＳ４０２に戻り、再び順翻訳処理を実行する。この時、すでに得られている順翻訳文とは異なる順翻訳文を得るために、先ほどよりもスコアの低いものを選択する。また、別の翻訳指標を用いても良い（例えば、ＲＩＢＥＳ：ｒａｎｋ−ｂａｓｅｄｉｎｔｕｉｔｉｖｅｂｉｌｉｎｇｕａｌｅｖａｌｕａｔｉｏｎｓｃｏｒｅ）。また、原言語の言い換えのデータベースを保持していれば、入力文に対してそれらを適用することで類似文を作成し、再び順翻訳処理を実行することもできる。これにより、表層的には異なるが、入力文と同じ意味を持った文が入力されるため、異なる順翻訳文を得ることができる。 If it is determined that it is necessary to increase the number of forward-translated sentences after the processing of step S403 is performed (YES in S404), the process returns to step S402 and performs the forward-translation processing again. At this time, in order to obtain a forward-translated sentence that is different from the already-acquired forward-translated sentence, one having a lower score than the previous one is selected. Alternatively, another translation index may be used (for example, rank-based intuitive bilingual evaluation score). Further, if a database of paraphrases of the source language is held, a similar sentence can be created by applying them to the input sentence, and the forward translation process can be executed again. As a result, a sentence having a different surface but having the same meaning as the input sentence is input, so that a different forward-translated sentence can be obtained.

再び順翻訳処理が実行されたのちに、ステップＳ４０３において順翻訳文選択処理が実行される場合には、前回と異なる基準で順翻訳文を選択してもよいし、同じ基準で順翻訳文を選択してもよい。 When the forward-translated sentence selection processing is executed in step S403 after the forward-translated processing is executed again, the forward-translated sentence may be selected based on a different criterion from the previous one, or the forward-translated sentence may be selected based on the same criteria. You may choose.

ステップＳ４０３の処理が行われたのちに、順翻訳文を増やす必要がないと判断された場合（Ｓ４０４のＮｏ）、ステップＳ４０５の逆翻訳処理へフローを進める。ステップＳ４０５では、ステップＳ４０３で得られたＮ個の順翻訳文に対して逆翻訳を行う逆翻訳処理が実行される。原言語から目的言語への翻訳を順方向の翻訳とすると、逆翻訳とは、目的言語から原言語への逆方向の翻訳である。Ｎ個の順翻訳文それぞれにおいて、任意の逆翻訳文を生成する逆翻訳処理を行う。任意の逆翻訳文を生成する逆翻訳処理とは、Ｎ個の順翻訳文の各々について一対一で対応する逆翻訳文を生成する逆翻訳処理、Ｎ個の順翻訳文の中に逆翻訳処理が行われない順翻訳文がある逆翻訳処理、一つの順翻訳文に対して複数の逆翻訳文を生成する逆翻訳処理、などを意味する。この逆翻訳処理によって、複数の逆翻訳文が生成される。この複数の逆翻訳文を、説明の便宜上、逆翻訳文群とする。 If it is determined that it is not necessary to increase the number of forward-translated sentences after the processing in step S403 is performed (No in S404), the flow proceeds to the reverse translation processing in step S405. In step S405, a reverse translation process for performing reverse translation on the N forward-translated sentences obtained in step S403 is executed. Assuming that the translation from the source language to the target language is a forward translation, the reverse translation is a reverse translation from the target language to the source language. For each of the N forward-translated sentences, a reverse translation process for generating an arbitrary backward-translated sentence is performed. The reverse translation process of generating an arbitrary reverse translated sentence is a reverse translation process of generating a one-to-one corresponding reverse translated sentence for each of the N forward translated sentences, and a reverse translation process in the N forward translated sentences. Means a reverse translation process in which there is a forward translation sentence not performed, a reverse translation process of generating a plurality of reverse translations for one forward translation, and the like. By this back translation process, a plurality of back translation sentences are generated. The plurality of back-translated sentences are referred to as a back-translated sentence group for convenience of explanation.

また、どのような基準で逆翻訳文を出力するかに対しては、何らかのシステムの基準で決めてもよいし、ユーザがそれらを決定してもよい。ここでのシステムの基準とは、ＢＬＥＵなどの評価や人手評価などを用いて順翻訳文のスコアを算出し、例えば、スコアが低い順翻訳文に関しては逆翻訳処理を行わない（ある順翻訳文に関しては生成する逆翻訳文が０個）、スコアが高い順翻訳文は任意の数の逆翻訳文を得る（ある順翻訳文に関しては逆翻訳文が複数個生成される）などである。ユーザが逆翻訳文の数を決定する場合には、一例として、一つの順翻訳文に対して逆翻訳文をいくつ生成するかを設定する、などが考えられるが、これに限らない。 In addition, the criteria for outputting the reverse-translated sentence may be determined based on some system criteria, or may be determined by the user. The standard of the system here means that the score of a forward-translated sentence is calculated using an evaluation such as BLEU or manual evaluation, and for example, a forward-translated sentence having a low score is not subjected to a reverse translation process (a certain forward-translated sentence). For example, for a forward-translated sentence having a high score, an arbitrary number of backward-translated sentences are obtained (for a given forward-translated sentence, a plurality of backward-translated sentences are generated). When the user determines the number of backward-translated sentences, for example, the number of backward-translated sentences to be generated for one forward-translated sentence may be set. However, the present invention is not limited to this.

ステップＳ４０６は、ステップＳ４０５で得られた逆翻訳文群の中からＭ個の逆翻訳文を選択する逆翻訳文選択処理である。逆翻訳文選択処理では、ステップＳ４０３の順翻訳文選択処理とほぼ同様の処理を行う。詳細な選択方法については、図６、図７を用いて後述する。 Step S406 is a backward-translated sentence selection process of selecting M backward-translated sentences from the backward-translated sentence group obtained in step S405. In the reverse translation sentence selection processing, processing that is substantially the same as the forward translation sentence selection processing in step S403 is performed. A detailed selection method will be described later with reference to FIGS.

ステップＳ４０６の逆翻訳文選択処理が実行されたのちに、逆翻訳文を増やす必要があると判断された場合（Ｓ４０７のＹＥＳ）、ステップＳ４０２に戻り、再び順翻訳処理を行う。再び順翻訳処理が実行されたのちに、ステップＳ４０３において順翻訳文選択処理が実行される場合には、前回と異なる基準で順翻訳文を選択してもよいし、同じ基準で順翻訳文を選択してもよい。 If it is determined that it is necessary to increase the number of backward-translated sentences after the backward-translated sentence selection process of step S406 is performed (YES in S407), the process returns to step S402 and performs the forward translation process again. When the forward-translated sentence selection processing is executed in step S403 after the forward-translated processing is executed again, the forward-translated sentence may be selected based on a different criterion from the previous one, or the forward-translated sentence may be selected based on the same criteria. You may choose.

ステップＳ４０６の逆翻訳処理が行われたのちに、逆翻訳文を増やす必要がないと判断された場合（Ｓ４０７のＮｏ）、逆翻訳文群から選択されたＭ個の逆翻訳文がユーザ提示文として次のステップで情報表示端末１００において出力される。 If it is determined that there is no need to increase the number of back-translated sentences after the back-translation process of step S406 (No in S407), the M backward-translated sentences selected from the back-translated sentence group are sent to the user presented sentence. Is output from the information display terminal 100 in the next step.

ステップＳ４０８において、Ｍ個の逆翻訳文が情報表示端末１００へ送信され、タッチパネル式ディスプレイに表示される。 In step S408, M reverse-translated sentences are transmitted to the information display terminal 100 and displayed on the touch panel display.

ステップＳ４０９では、選択文検出部１０５において、情報表示端末１００のタッチパネル式ディスプレイに表示されたＭ個の逆翻訳文から一の逆翻訳文が選択されたか否かを検出する。 In step S409, the selected sentence detection unit 105 detects whether one of the M back-translated sentences displayed on the touch panel display of the information display terminal 100 has been selected.

一定期間、逆翻訳文の選択がないと判断された場合、機械翻訳システムは初期状態に戻り、ユーザの入力文を受け付ける（Ｓ４０９のＮｏ）。このとき、タッチパネル式ディスプレイの表示画面がリセットされる。 When it is determined that there is no selection of a reverse translation sentence for a certain period, the machine translation system returns to the initial state and accepts the input sentence of the user (No in S409). At this time, the display screen of the touch panel display is reset.

また、ユーザが何らかのリセット操作を実行した場合も、機械翻訳システムは同様に初期状態へ戻ってユーザの入力を受け付ける。 Also, when the user performs some kind of reset operation, the machine translation system similarly returns to the initial state and accepts the user's input.

選択文検出部１０５において、いずれかの逆翻訳文が選択されたことが検出された場合（Ｓ４０９のＹｅｓ）、いずれの逆翻訳文が選択されたかを示すユーザ選択情報が翻訳サーバ３００へ送信される。 When the selected sentence detection unit 105 detects that any of the backward-translated sentences has been selected (Yes in S409), user selection information indicating which of the backward-translated sentences has been selected is transmitted to the translation server 300. You.

ステップＳ４１１において、選択文判断部２３５は、逆翻訳文選択部２３４からＭ個の逆翻訳文を取得し、情報表示端末１００から受信したユーザ選択情報に基づいて、情報表示端末１００においてＭ個の逆翻訳文の中のいずれの逆翻訳文が選択されたかを判断する。Ｍ個の逆翻訳文の中の選択された逆翻訳文を選択翻訳文、選択されなかった逆翻訳文を非選択逆翻訳文として、以降のステップを説明する。選択文判断部２３５は、選択逆翻訳文および非選択逆翻訳文をフレーズ分割部２３６へ出力する。 In step S411, the selected sentence determination unit 235 obtains M reverse-translated sentences from the backward-translated sentence selecting unit 234, and based on the user selection information received from the information display terminal 100, the information display terminal 100 It is determined which of the back-translated sentences is selected. The following steps will be described with the selected backward-translated sentence among the M backward-translated sentences as the selected translated sentence and the unselected backward-translated sentence as the unselected backward-translated sentence. The selected sentence determination unit 235 outputs the selected backward-translated sentence and the unselected backward-translated sentence to the phrase division unit 236.

ステップＳ４１２において、選択逆翻訳文と非選択逆翻訳文、および選択逆翻訳文と非選択逆翻訳文の各々に対応する順翻訳文に対して、フレーズ分割処理を行う。 In step S412, a phrase division process is performed on the selected backward-translated sentence and the unselected backward-translated sentence, and the forward-translated sentence corresponding to each of the selected backward-translated sentence and the unselected backward-translated sentence.

フレーズ分割とは、対象となる文を、より短い単位である句または単語に分割することである。フレーズ分割処理の具体例は後述する。 Phrase division is to divide a target sentence into phrases or words that are shorter units. A specific example of the phrase division processing will be described later.

図１１は、本実施の形態における、一般的なフレーズテーブルの例である。フレーズテーブルとは、原言語と目的言語の双言語間において、句または単語単位での対応関係を表したテーブルである。 FIG. 11 is an example of a general phrase table in the present embodiment. The phrase table is a table that represents a correspondence between phrases and words between the source language and the target language.

図１１では、原言語を日本語、目的言語を英語とした場合、左から、日本語のフレーズ、英語のフレーズ、フレーズの英日翻訳確率（英語のフレーズが日本語のフレーズに翻訳される確率）、英日方向の単語の翻訳確率の積（英語が日本語に翻訳される時の、フレーズ内の単語ごとの翻訳確率の積）、フレーズの日英翻訳確率（日本語のフレーズが英語のフレーズに翻訳される確率）、日英方向の単語の翻訳確率の積（日本語が英語に翻訳される時の、フレーズ内の単語ごとの翻訳確率の積）を表している。ただし、フレーズテーブルは、これらすべての情報を必ずしも含んでいる必要はなく、この表記方法に限らない。このフレーズテーブルは、翻訳確率を含むため、広義では確率モデルとも呼ばれる。 In FIG. 11, when the source language is Japanese and the target language is English, from the left, the Japanese phrase, the English phrase, and the English-to-Japanese translation probability of the phrase (the probability that the English phrase is translated into a Japanese phrase) ), The product of the translation probabilities of words in the English-Japanese direction (the product of the translation probabilities of each word in the phrase when English is translated into Japanese), the Japanese-English translation probability of the phrase (the Japanese phrase is (Probability of being translated into a phrase), and the product of the translation probabilities of words in the Japanese and English directions (the product of the translation probabilities of each word in the phrase when Japanese is translated into English). However, the phrase table does not necessarily need to include all of this information, and is not limited to this notation. Since this phrase table includes translation probabilities, it is also called a probability model in a broad sense.

例えば、図１１に示すフレーズテーブルでは、フレーズＰＨ１が、フレーズＰＨ１へ翻訳される確率が０．３８、単語ＳＤ３が単語ＳＤ１に翻訳される確率と、単語ＳＤ４が単語ＳＤ４へ翻訳される確率との積が０．０４、フレーズＰＨ１がフレーズＰＨ２へ翻訳される確率が０．０５、単語ＳＤ１が単語ＳＤ３へ翻訳される確率と、単語ＳＤ２が単語ＳＤ４へ翻訳される確率との積が０．０２であることを表している。 For example, in the phrase table shown in FIG. 11, the probability that the phrase PH1 is translated into the phrase PH1 is 0.38, the probability that the word SD3 is translated into the word SD1, and the probability that the word SD4 is translated into the word SD4 are: The product is 0.04, the probability that the phrase PH1 is translated into the phrase PH2 is 0.05, and the product of the probability that the word SD1 is translated into the word SD3 and the probability that the word SD2 is translated into the word SD4 is 0.02. It represents that.

このようなフレーズテーブルを用いて、選択逆翻訳文、非選択逆翻訳文、およびこれらの逆翻訳文の各々に対応する順翻訳文に対してフレーズ分割処理を行なう。 Using such a phrase table, a phrase division process is performed on the selected backward-translated sentence, the unselected backward-translated sentence, and the forward-translated sentence corresponding to each of these backward-translated sentences.

図１２は、フレーズ分割の概要を表す説明図である。 FIG. 12 is an explanatory diagram illustrating an outline of phrase division.

図１２には、原言語で表された逆翻訳文ＲＳ１０、ＲＳ２０、ＲＳ３０と、これらの逆翻訳文の各々に対応する目的言語の順翻訳文ＴＳ１０、ＴＳ２０、ＴＳ３０が示されている。原言語の逆翻訳文に対するフレーズ分割処理が実行される場合、例えば、逆翻訳文ＲＳ１０をフレーズ分割すると、フレーズＰＨ１１、ＰＨ１２、ＰＨ１２という３つのフレーズに分割される。また、目的言語の順翻訳文に対するフレーズ分割処理が実行される場合、例えば、順翻訳文ＴＳ１０をフレーズ分割すると、フレーズＰＨ１４、ＰＨ１５、ＰＨ１６という３つのフレーズ（単語）に分割される。 FIG. 12 shows reverse-translated sentences RS10, RS20, and RS30 expressed in the source language, and forward-translated sentences TS10, TS20, and TS30 in the target language corresponding to each of these backward-translated sentences. When the phrase division processing is performed on the back-translation sentence of the source language, for example, when the back-translation sentence RS10 is divided into phrases, the back-translation sentence RS10 is divided into three phrases of phrases PH11, PH12, and PH12. When a phrase division process is performed on a forward-translated sentence in the target language, for example, when the forward-translated sentence TS10 is phrase-divided, the forward-translated sentence TS10 is divided into three phrases (words) of phrases PH14, PH15, and PH16.

フレーズ分割処理を行なう対象文がどのようなフレーズに分割されるかについては、フレーズテーブルに表記されている原言語および目的言語の文字列に依存するため、一意には決まらないことがある。 The phrase into which the target sentence to be subjected to the phrase division processing is divided into phrases depends on the character strings of the source language and the target language described in the phrase table, and thus may not be uniquely determined.

ステップＳ４１３は、フレーズ分割処理によって出力される各フレーズに対して、所定の基準に従ってスコアを評価するフレーズ評価処理である。このフレーズ評価処理の詳細については、後述する。 Step S413 is a phrase evaluation process of evaluating a score for each phrase output by the phrase division process according to a predetermined standard. Details of the phrase evaluation processing will be described later.

ステップＳ４１４では、Ｓ４１１で判断された選択逆翻訳文に対応する順翻訳文を情報表示端末１００へ送信し、翻訳結果としてタッチパネル式ディスプレイに表示する。このとき、タッチパネル式ディスプレイに表示されている選択逆翻訳文を強調表示してもよい。また、タッチパネル式ディスプレイにおいて非選択逆翻訳文の表示を消去してもよく、表示された翻訳結果がユーザの選択した選択逆翻訳文に対応する順翻訳文であるということを明示できれば、どのような表示を行っても構わない。 In step S414, the forward-translated sentence corresponding to the selected reverse-translated sentence determined in S411 is transmitted to the information display terminal 100, and is displayed on the touch panel display as a translation result. At this time, the selected reverse translation sentence displayed on the touch panel display may be highlighted. In addition, the display of the unselected reverse-translated sentence may be deleted on the touch-panel display, and if it is possible to clearly indicate that the displayed translation result is a forward-translated sentence corresponding to the selected reverse-translated sentence selected by the user, Display may be performed.

なお、ステップＳ４１４とステップＳ４１２〜Ｓ４１４の一連の処理は、並列的に動作可能であるため、ステップＳ４１４はステップＳ４１２のフレーズ分割処理の前からステップＳ４１５の学習処理の後の間であれば、いずれのタイミングで実行されるとしてもよい。 Since a series of processes in step S414 and steps S412 to S414 can be operated in parallel, step S414 may be performed before the phrase dividing process in step S412 and after the learning process in step S415. It may be executed at the timing of.

ステップＳ４１５において、ステップＳ４１３で得られたフレーズ毎のスコアに基づいて、強化学習、識別学習、ニューラルネット学習などの機械学習を行う。この処理の詳細に関しては、図８、９を用いて説明する。 In step S415, machine learning such as reinforcement learning, identification learning, and neural network learning is performed based on the score for each phrase obtained in step S413. Details of this processing will be described with reference to FIGS.

図６は、本実施の形態における、翻訳文選択処理の具体的な動作を示すフローチャートである。 FIG. 6 is a flowchart showing a specific operation of the translation sentence selection process in the present embodiment.

図６を用いて、ステップＳ４０３において順翻訳文選択部２３２で実行される順翻訳文選択処理と、ステップＳ４０６において逆翻訳文選択部２３４で実行される逆翻訳文選択処理の具体的な処理を説明する。説明の便宜上、順翻訳文選択処理および逆翻訳文選択処理の２つの処理をまとめて翻訳文選択処理とする。 Referring to FIG. 6, specific processes of the forward translation sentence selection process executed by forward translation sentence selection unit 232 in step S403 and the reverse translation sentence selection process executed by reverse translation sentence selection unit 234 in step S406 will be described. explain. For convenience of explanation, the two processes of the forward translation sentence selection process and the backward translation sentence selection process are collectively referred to as a translation sentence selection process.

順翻訳文選択部２３２では、順翻訳部２３１で生成された順翻訳文群から、Ｎ個の順翻訳文を選択する順翻訳文選択処理が実行され、逆翻訳文選択部２３４では、逆翻訳部２３３で生成された逆翻訳文群から、Ｍ個の逆翻訳文を選択する逆翻訳文選択処理が実行される。順翻訳文群に含まれる複数の順翻訳文の各々の評価スコアに基づいて、Ｎ個の順翻訳文が選択され、逆翻訳文群に含まれる複数の逆翻訳文の各々の評価スコアに基づいて、Ｍ個の逆翻訳文が選択される。 The forward-translated sentence selection unit 232 performs a forward-translated sentence selection process of selecting N forward-translated sentences from the forward-translated sentence group generated by the forward-translation unit 231, and the backward-translated sentence selection unit 234 performs reverse-translation sentence selection. A backward-translated sentence selection process of selecting M backward-translated sentences from the backward-translated sentence group generated by the unit 233 is executed. N forward-translated sentences are selected based on the evaluation scores of the plurality of forward-translated sentences included in the forward-translated sentence group, and based on the evaluation scores of the plurality of backward-translated sentences included in the backward-translated sentence group. Thus, M backward-translated sentences are selected.

以下の説明は、順翻訳文選択処理、逆翻訳文選択処理のどちらの場合でもあてはまる内容であるため、翻訳文選択処理として説明し、順翻訳文と逆翻訳文とをまとめて翻訳文と表記する。また、順翻訳文群と逆翻訳文群は翻訳文群と表記する。さらに、実際には、順翻訳文はＮ個選択され、逆翻訳文はＭ個選択されるが、以下の説明では区別なくＮ個として表記して説明する。 The following description applies to both forward-translation sentence selection processing and reverse-translation sentence selection processing, so it is described as translation-sentence selection processing, and forward-translated sentences and reverse-translated sentences are collectively referred to as translated sentences. I do. The forward-translated sentences and the backward-translated sentences are referred to as translated sentences. Further, in practice, N forward-translated sentences are selected and M backward-translated sentences are selected, but in the following description, they are described as N without distinction.

ステップＳ５０１において、翻訳文群の中から評価スコアの高い翻訳文をＮ−ｋ個（１＜＝Ｎ、０＜＝ｋ＜＝Ｎ）選択する。ここで、評価スコアの例としては、翻訳精度を評価する手法としてよく用いられるＢＬＥＵが挙げられる。他にも、翻訳精度を評価する手法として、ＷＥＲ（ＷｏｒｄＥｒｒｏｒＲａｔｅ）、ＭＥＴＥＯＲ（ＭｅｔｒｉｃｆｏｒＥｖａｌｕａｔｉｏｎｏｆＴｒａｎｓｌａｔｉｏｎｗｉｔｈＥｘｐｌｉｃｉｔＯＲｄｅｒｉｎｇ）、ＲＩＢＥＳ（Ｒａｎｋ−ｂａｓｅｄＩｎｔｕｉｔｉｖｅＢｉｌｉｎｇｕａｌＥｖａｌｕａｔｉｏｎＳｃｏｒｅ）などがあるが、評価の手法はこれらのいずれを用いてもよいし、これらに限らず他の手法を用いてもよい。 In step S501, Nk translation sentences (1 <= N, 0 <= k <= N) having a high evaluation score are selected from the translation sentence group. Here, as an example of the evaluation score, BLEU, which is often used as a technique for evaluating translation accuracy, can be mentioned. Other methods for evaluating translation accuracy include WER (Word Error Rate), METEOR (Metric for Evaluation of Translation with Explicit Ordering), RIBES (Rank-based Intuitive), and RIBES (Rank-based Integrative Evaluation). Any of these may be used, and other methods may be used without being limited thereto.

翻訳文群の中でステップＳ５０１において選択されなかった翻訳文から、残りのｋ個を選択する（Ｓ５０２）。 The remaining k sentences are selected from among the translated sentences not selected in step S501 in the translated sentence group (S502).

ｋ個の翻訳文を選択するために、評価スコアが所定の閾値内の翻訳文を抽出し、抽出した翻訳文の中からランダムにｋ個選択するとしてよい。また、評価スコアが所定の閾値内の翻訳文を抽出し、抽出した翻訳文の中でスコアの低いものから順番にｋ個を選択するとしてもよい。Ｎ−ｋ個の翻訳文を選択する場合は、評価スコアの高い翻訳文を選択したが、ｋ個の翻訳文を選択する場合は、必ずしも評価スコアの高い翻訳文を選択するのではない。特定の評価基準によって機械的に付与される評価スコアの高い翻訳文ばかりを選択すると、選択した翻訳文は全て似通った内容の文章である可能性が高い。情報表示端末１００において複数の逆翻訳文をユーザに提示し、そこからユーザに一の逆翻訳文を選択させることを考慮すると、ある程度異なる観点で選択された複数の逆翻訳文を提示することが好ましい。 In order to select k translations, a translation having an evaluation score within a predetermined threshold may be extracted, and k translations may be randomly selected from the extracted translations. Alternatively, translated sentences having an evaluation score within a predetermined threshold value may be extracted, and among the extracted translated sentences, k sentences may be selected in ascending order of score. When selecting N−k translations, a translation with a high evaluation score is selected. However, when selecting k translations, a translation with a high evaluation score is not necessarily selected. When only translations with a high evaluation score, which are automatically given according to a specific evaluation criterion, are selected, it is highly likely that all the selected translations are sentences having similar contents. In consideration of presenting a plurality of back-translated sentences to the user on the information display terminal 100 and allowing the user to select one back-translated sentence therefrom, it is possible to present a plurality of back-translated sentences selected from a somewhat different viewpoint. preferable.

似通った逆翻訳文ばかりを提示して、その中からユーザに選択させると、後述する機械翻訳システムに対する機械学習処理において高い学習効果を得ることが出来ない懸念が生じる。ユーザによって選択された選択逆翻訳文と、ユーザによって選択されなかった非選択逆翻訳文とを教師データとして用いて機械学習処理を実行させる場合に、機械翻訳システムに対して正例（正解）として学習させた選択逆翻訳文と、負例（不正解）として学習させた非選択逆翻訳文とが似通った文章となるため、機械翻訳システムに対して正例と負例の顕著な差を示すことができず、学習の効果を期待することができない。そのため、本実施の形態で説明するように、ある程度異なる観点で複数の逆翻訳文を選択することが好ましい。 If only similar reverse-translated sentences are presented and the user selects them, there is a concern that a high learning effect cannot be obtained in a machine learning process for a machine translation system described later. As a correct example (correct answer) to the machine translation system, when a machine learning process is performed using the selected backward-translated sentence selected by the user and the unselected backward-translated sentence not selected by the user as teacher data. Because the selected reverse-translated sentence learned and the non-selected reverse-translated sentence learned as a negative example (incorrect answer) are similar sentences, there is a remarkable difference between the positive and negative examples for the machine translation system. I can't do it, and I can't expect the effects of learning. Therefore, as described in the present embodiment, it is preferable to select a plurality of reverse-translated sentences from a somewhat different viewpoint.

また、似通った逆翻訳文ばかりを提示すると、ユーザはそれらの逆翻訳文の細かな差異を考慮して一の逆翻訳文を選択する必要が生じるため、直感的に一の逆翻訳文を選択することができず、逆翻訳文の選択に時間がかかってしまうことも懸念される。本実施の形態のように、ある程度異なる観点で選択された複数の逆翻訳文を提示すれば、ユーザはその中から自身の意図する翻訳内容を直感的に選択可能となる。 Also, if only similar back-translated sentences are presented, the user needs to select one back-translated sentence in consideration of the small differences between the back-translated sentences, and thus intuitively selects one back-translated sentence. There is also a concern that the selection of the back translation may take time. By presenting a plurality of reverse-translated sentences selected from different viewpoints as in the present embodiment, the user can intuitively select the translation content intended by the user from among them.

また、逆翻訳文は順翻訳文から生成されるため、順翻訳文を選択する段階においても、同様にある程度異なる観念で複数の順翻訳文を選択しておくことが好ましいと考えられる。 In addition, since the backward-translated sentence is generated from the forward-translated sentence, it may be preferable to select a plurality of forward-translated sentences with a somewhat different idea in the stage of selecting the forward-translated sentence as well.

また、ｋ個の翻訳文を選択するための異なる手法として、過去のユーザ選択情報に基づいてｋ個の翻訳文を選択するとしてもよい。例えば、翻訳文毎に、ユーザによって過去に選択された回数などを記憶（逆翻訳文については直接選択された回数を記憶し、順翻訳文については対応する逆翻訳文が選択された回数を記憶する）しておいて、翻訳文群の中で記憶された回数が高いものから順番にｋ個の翻訳文を選択するとしてもよい。また、このような直接的な回数に基づいて選択するのではなく、ユーザの過去の機械翻訳システムの利用履歴に基づいて、翻訳対象文に対してユーザが選択しやすい翻訳文の傾向を分析し、分析した傾向に基づいてｋ個の翻訳文を選択するとしてもよい。なお、Ｎ−ｋ個の翻訳文を選択する選択基準と異なる選択基準を用いれば、ｋ個の翻訳文を選択する選択手法はこれらに限らない。 Further, as a different method for selecting k translations, k translations may be selected based on past user selection information. For example, for each translated sentence, the number of times selected by the user in the past is stored (for a backward-translated sentence, the number of direct selections is stored; for a forward-translated sentence, the number of times the corresponding reverse-translated sentence is selected is stored). Then, k translations may be selected in order from the one with the highest number of times stored in the translation sentence group. Also, instead of selecting based on such a direct number of times, based on the user's past use history of the machine translation system, the tendency of the translated sentence that the user can easily select for the translation target sentence is analyzed. Alternatively, k translation sentences may be selected based on the analyzed tendency. Note that if a selection criterion different from the selection criterion for selecting Nk translations is used, the selection method for selecting k translations is not limited to these.

さらに、これらｋ個の翻訳文の選択方法においては、Ｎ−ｋ個の翻訳文と同じ趣旨の翻訳文を除く処理を行なってもよい。また、ｋ個の翻訳文の中でも同じ趣旨の翻訳文を除く処理を行なってもよい。または、Ｎ−ｋ個の翻訳文と異なる趣旨の翻訳文がｋ個の翻訳文に含まれていない、またはｋ個の翻訳文の中に少ない場合には、異なる趣旨の翻訳文を追加する処理を行なってもよい。また、逆翻訳文選択処理に関しては、翻訳対象文と比較して、同じ趣旨を有する逆翻訳文を選択するなどの処理を行なってもよい。 Further, in the method of selecting the k translated sentences, a process of excluding the translated sentences having the same meaning as the Nk translated sentences may be performed. In addition, a process for excluding a translated sentence having the same meaning from among the k translated sentences may be performed. Alternatively, if the k translations do not include a translation having a purpose different from the N−k translations, or if the number of translations is small, the process adds a translation having a different purpose. May be performed. In addition, regarding the reverse translation sentence selection processing, a process such as selecting a reverse translation sentence having the same meaning as that of the translation target sentence may be performed.

例えば、翻訳文群に、疑問文・肯定文・否定文・命令文のそれぞれの形態の翻訳文がどれだけ含まれているか（文の異なり数）をカウントし、閾値以下であれば、再度、順翻訳処理または逆翻訳処理を行う。 For example, the translation sentence group counts the number of translation sentences in the respective forms of question sentence, affirmative sentence, negative sentence, and command sentence (the number of different sentences). Perform forward translation or reverse translation.

また、翻訳文に対して構文解析を行うなどの手法もある。翻訳文群に含まれる複数の翻訳文それぞれに対して構文解析を行い、それぞれ主語を示す単語が何であるかを判断し、主語を示す単語が何種類あったかを示す主語の異なり数が閾値以下であれば、再度、順翻訳処理または逆翻訳処理を行う。このとき、主語ではなく、述語について異なり数を算出するとしてもよい。また、その両方であってもよい。 There is also a method of performing a syntax analysis on a translated sentence. The parsing is performed on each of the plurality of translations included in the translation sentence group to determine what the word indicating the subject is, and the number of different subjects indicating the number of types of the subject is less than or equal to the threshold. If so, the forward translation process or the reverse translation process is performed again. At this time, a different number may be calculated not for the subject but for the predicate. Further, both of them may be used.

ここで、文の異なり数、主語の異なり数、動詞または目的語の異なり数などを評価スコアとして、異なり数が所定の数だけ含まれるように、順翻訳文選択処理および／または逆翻訳文選択処理を行うとしてもよい。 Here, the number of different sentences, the number of different subjects, the number of different verbs or objects, and the like are used as evaluation scores, and the forward translation sentence selection processing and / or the reverse translation sentence selection are performed so that the predetermined number of differences is included. Processing may be performed.

順翻訳文選択処理を実行する場合に、例えば、「文の異なり数を２以上とする」という評価基準を設け、文の異なり数を評価スコアとして順翻訳文を選択することができる。順翻訳文群に含まれる複数の順翻訳文の各々に対して、順番に構文解析および／または意味解析を用いて解析を行い、疑問文・肯定文・否定文・命令文のうち、いくつの種類が出現したかを文の異なり数としてスコア化し、２種類以上が含まれるｌ個の小集合を作成しても良い。 When executing the forward-translated sentence selection processing, for example, an evaluation criterion of “the number of different sentences is set to 2 or more” is provided, and the forward-translated sentence can be selected using the number of different sentences as an evaluation score. For each of the plurality of forward-translated sentences included in the forward-translated sentence group, analysis is sequentially performed using syntactic analysis and / or semantic analysis. It is also possible to score whether or not the type has appeared as a different number of sentences and create l small sets including two or more types.

また例えば、逆翻訳文選択処理を実行する場合に、「主語の異なり数を所定数以下にする」という評価基準を設け、主語の異なり数を評価スコアとして逆翻訳文を選択することもできる。 In addition, for example, when executing the reverse translation sentence selection processing, an evaluation criterion of “the number of different subjects may be set to a predetermined number or less” may be provided, and a reverse translation sentence may be selected using the number of different subjects as an evaluation score.

なお、順翻訳文選択処理または逆翻訳文選択処理は、これらの例に限定するものではなく任意の順翻訳文選択または逆翻訳文選択の手法を用いるとしてよい。 The forward-translated sentence selection process or the backward-translated sentence selection process is not limited to these examples, and an arbitrary forward-translated sentence selection or reverse-translated sentence selection method may be used.

また、逆翻訳文選択処理に関しては、翻訳対象文との比較評価を行い、比較評価によって得られた値を評価スコアとしてもよい。例えば、翻訳対象文、および逆翻訳文群の中の複数の逆翻訳文に対して構文解析を行い、翻訳対象文と複数の逆翻訳文の各々との類似度を判断し、判断した類似度に基づいて、ｋ個の逆翻訳文を選択するとしてもよい。 Further, in the reverse translation sentence selection process, a comparison evaluation with the translation target sentence may be performed, and a value obtained by the comparison evaluation may be used as the evaluation score. For example, a syntax analysis is performed on the translation target sentence and a plurality of back translation sentences in the group of back translation sentences, and the similarity between the translation target sentence and each of the plurality of back translation sentences is determined. May be used to select k reverse-translated sentences.

また、これらの例に限らず、任意の評価スコアを組み合わせて用いても良い。さらに、Ｎを所定の数として予め設定し、更にそこに含まれる異なり数が所望の数となるようにスコアを組み合わせて用いても良い。 The present invention is not limited to these examples, and any evaluation score may be used in combination. Further, N may be set in advance as a predetermined number, and scores may be combined and used so that the number of differences included therein becomes a desired number.

上述の翻訳文の選択方法は一例であり、これらに限らない。 The above-described method of selecting a translation is merely an example, and the present invention is not limited thereto.

なお、ｋ＝０の場合は、Ｎ個すべてが評価スコアの高い翻訳文から順に選択されるものとなる。また、ｋ＝Ｎの場合は、Ｎ個の翻訳文すべてが評価スコアの高い翻訳文から順に選択される以外の選択方法によって選択される。 When k = 0, all N words are selected in order from the translation with the highest evaluation score. When k = N, all of the N translations are selected by a selection method other than the translations having the highest evaluation score.

ここでのシステムの基準とは、ＢＬＥＵなどの評価や人手評価などを用いて順翻訳文のスコアを算出し、例えば、スコアが低い順翻訳文に関しては逆翻訳処理を行わない（ある順翻訳文に関しては生成する逆翻訳文が０個）、スコアが高い順翻訳文は任意の数の逆翻訳文を得る（ある順翻訳文に関しては逆翻訳文が複数個生成される）などである。ユーザが逆翻訳文の数を決定する場合には、一例として、一つの順翻訳文に対して逆翻訳文をいくつ生成するかを設定する、などが考えられるが、これに限らない。 The standard of the system here means that the score of a forward-translated sentence is calculated using an evaluation such as BLEU or manual evaluation, and for example, a forward-translated sentence having a low score is not subjected to a reverse translation process (a certain forward-translated sentence). For example, for a forward-translated sentence having a high score, an arbitrary number of backward-translated sentences are obtained (for a given forward-translated sentence, a plurality of backward-translated sentences are generated). When the user determines the number of backward-translated sentences, for example, the number of backward-translated sentences to be generated for one forward-translated sentence may be set. However, the present invention is not limited to this.

図７は、本実施の形態における、逆翻訳文選択処理の具体的な動作を示すフローチャートである。 FIG. 7 is a flowchart showing a specific operation of the reverse translation sentence selection process in the present embodiment.

図５を用いた逆翻訳処理（ステップＳ４０５）の説明にて、逆翻訳文が生成されない順翻訳文が存在してもよいこと、一つの順翻訳文から複数の逆翻訳文が生成されるとしてもよいことなどを述べた。ここでは、一つの順翻訳文から得られた複数の逆翻訳文の中から、一つの逆翻訳文を選択する処理について説明する。 In the description of the reverse translation process (step S405) with reference to FIG. And good things. Here, a process of selecting one backward-translated sentence from a plurality of backward-translated sentences obtained from one forward-translated sentence will be described.

順翻訳文選択処理によって選択されたＮ個の順翻訳文すべてに対して、以下の処理が行われる。 The following process is performed on all N forward-translated sentences selected by the forward-translated sentence selection process.

ステップＳ６０１において、Ｎ個の順翻訳文の中の順翻訳文Ａに対して生成された逆翻訳文を抽出する。 In step S601, the backward-translated sentence generated for the forward-translated sentence A among the N forward-translated sentences is extracted.

ステップＳ６０２において、抽出した逆翻訳文の数を判断する。順翻訳文Ａに対して、逆翻訳文が生成されていなかったとき、つまり、順翻訳文Ａに対して生成された逆翻訳文が０個であったとき（ステップＳ６０２の０個）、順翻訳文Ａはユーザに提示されることがないため、削除する（ステップＳ６０３）。 In step S602, the number of extracted backward-translated sentences is determined. When no reverse-translated sentence has been generated for the forward-translated sentence A, that is, when no reverse-translated sentence has been generated for the forward-translated sentence A (0 in step S602), Since the translation A is not presented to the user, it is deleted (step S603).

次に、順翻訳文Ａに対して逆翻訳文が一個であるとき（ステップＳ６０２の１個）、その逆翻訳文は順翻訳文Ａに対応する逆翻訳文として決定される（ステップＳ６０４）。 Next, when there is one backward-translated sentence for the forward-translated sentence A (one in step S602), the backward-translated sentence is determined as the backward-translated sentence corresponding to the forward-translated sentence A (step S604).

最後に、順翻訳文Ａに対して逆翻訳文が二個以上生成されているとき（ステップＳ６０２の２個以上）、その中から順翻訳文Ａに対応する逆翻訳文として最適なものを一つ決定する（ステップＳ６０５）。決定方法としては、例えば、自動評価スコアや人手評価によるスコアを参照する方法を用いることによって選ばれる。これらを、順翻訳文Ｎ個すべてに対し繰り返す。 Finally, when two or more backward-translated sentences have been generated for the forward-translated sentence A (two or more at step S602), one of the optimal backward-translated sentences corresponding to the forward-translated sentence A is selected. Are determined (step S605). The determination method is selected, for example, by using a method of referring to an automatic evaluation score or a manual evaluation score. These are repeated for all N forward-translated sentences.

最後に、これらの処理で得られた逆翻訳文のうち、同趣旨の逆翻訳文を除く、または、異なる趣旨の逆翻訳文が少ない場合に異なる趣旨を持つ翻訳文を追加する、などの処理を必要に応じて行ってもよい（ステップＳ６０６）。ここでの処理は、図５の説明にて述べた処理と同じである。 Finally, among the reverse-translated sentences obtained by these processes, a process of removing a reverse-translated sentence of the same meaning, or adding a translated sentence having a different meaning when there are few reverse-translated sentences of different purposes, etc. May be performed as needed (step S606). The processing here is the same as the processing described in the description of FIG.

なお、上述の説明では、図７のステップＳ６０５において、一つの順翻訳文に対して一つの逆翻訳文を選択するものとしていた。しかし、一つの順翻訳文に対して、複数個の逆翻訳文が選択される場合があってもよい。この場合、ユーザに提示される複数の逆翻訳文の中に、対応する順翻訳文が同一の逆翻訳文が存在することになる。 In the above description, one backward-translated sentence is selected for one forward-translated sentence in step S605 of FIG. However, a plurality of backward-translated sentences may be selected for one forward-translated sentence. In this case, among the plurality of backward-translated sentences presented to the user, there exists a backward-translated sentence whose corresponding forward-translated sentence is the same.

図８は、本実施の形態における、フレーズ評価処理の具体的な動作を示すフローチャートである。 FIG. 8 is a flowchart showing a specific operation of the phrase evaluation processing in the present embodiment.

フローの初期状態において、事前にフレーズに分割された、選択逆翻訳文、非選択逆翻訳文、および選択逆翻訳文と非選択逆翻訳文の各々に対応する順翻訳文を取得しているとする。また、これらの逆翻訳文および順翻訳文をフレーズ分割して得られたフレーズに対応するフレーズテーブルも同様に取得しているとする。このフレーズ評価処理は、逆翻訳文および順翻訳文それぞれについて行なわれるが、説明の便宜上、逆翻訳文に対するフレーズ分割処理を例に挙げて説明する。 In the initial state of the flow, it is assumed that a selected reverse translation sentence, a non-selection reverse translation, and a forward translation corresponding to each of the selection reverse translation and the non-selection reverse translation are divided into phrases in advance. I do. It is also assumed that a phrase table corresponding to a phrase obtained by dividing the backward-translated sentence and the forward-translated sentence into phrases is similarly acquired. This phrase evaluation process is performed for each of the backward-translated sentence and the forward-translated sentence. For convenience of explanation, a phrase division process for the backward-translated sentence will be described as an example.

ステップＳ７０１において、選択逆翻訳文、および非選択逆翻訳文に対して、これらの逆翻訳文に含まれるフレーズを逆翻訳文毎に比較し、選択逆翻訳文のみに存在するフレーズの有無を確認する。 In step S701, the phrases included in these selected reverse-translated sentences and non-selected reverse-translated sentences are compared for each reverse-translated sentence, and the presence or absence of a phrase existing only in the selected reverse-translated sentence is confirmed. I do.

選択逆翻訳文のみに存在するフレーズが有る場合（ステップＳ７０１のＹｅｓ）は、選択逆翻訳文のみに存在するフレーズについては、ユーザ選択スコアを加点する（ステップＳ７０２）。このユーザ選択スコアは、選択文のみに現れるフレーズを良いものとするスコアであり、最終的に図９の処理において、順翻訳文（フレーズ評価処理の対象が順翻訳文の場合は、翻訳対象文）の対応するフレーズとともに、フレーズテーブルの日英翻訳確率または英日翻訳確率に反映される。このときのスコアの加点方法は、どのような方法を用いてもよい。例えば、該当するフレーズに対して一律に加点する、該当するフレーズの長さに依存して加点する、などの方法が想定される。スコアの加点が完了すると、フローをステップＳ７０３に進める。選択逆翻訳文のみに存在するフレーズが無い場合（ステップＳ７０１のＮｏ）は、特段の処理をせずに、フローをそのままステップＳ７０３に進める。 If there is a phrase that exists only in the selected reverse-translated sentence (Yes in step S701), the user-selected score is added to the phrase that exists only in the selected reverse-translated sentence (step S702). This user selection score is a score that makes a phrase appearing only in the selected sentence a good one. Finally, in the processing of FIG. 9, the forward translation sentence (if the target of the phrase evaluation processing is a forward translation sentence, the translation target sentence Is reflected in the Japanese-English translation probability or the English-Japanese translation probability of the phrase table together with the corresponding phrase of ()). At this time, any method may be used as a method for adding scores. For example, a method of uniformly adding points to the corresponding phrase, adding points depending on the length of the corresponding phrase, and the like are assumed. When the score addition is completed, the flow proceeds to step S703. If there is no phrase that exists only in the selected reverse translation (No in step S701), the flow proceeds to step S703 without performing any special processing.

同様に、ステップＳ７０３において、選択逆翻訳文、および非選択逆翻訳文に対して、これらの逆翻訳文に含まれるフレーズを逆翻訳文毎に比較し、非選択逆翻訳文のみに存在するフレーズの有無を確認する。 Similarly, in step S703, the phrases included in the selected reverse-translated sentences and the non-selected reverse-translated sentences are compared for each reverse-translated sentence, and the phrases existing only in the non-selected reverse-translated sentences are compared. Check if there is any.

非選択逆翻訳文のみに存在するフレーズが有る場合（ステップＳ７０３のＹｅｓ）、非選択逆翻訳文のみに存在するフレーズについては、ユーザ選択スコアを減点する（ステップＳ７０４）。このときのスコアの減点方法は、どのような方法を用いてもよい。例えば、該当するフレーズに対して一律に減点する、該当するフレーズの長さに依存して減点する、などの方法が想定される。 If there is a phrase that exists only in the unselected reverse-translated sentence (Yes in step S703), the user-selected score is deducted for a phrase that exists only in the unselected reverse-translated sentence (step S704). Any method may be used as the score deduction method at this time. For example, a method of uniformly deducting points for the corresponding phrase, a method of deducting points depending on the length of the corresponding phrase, and the like are assumed.

なお、ステップＳ７０２とＳ７０４において、ユーザ選択スコアの加点、減点は必ずしも必須ではない。つまり、図８のフローチャートにおいて、選択逆翻訳文のみに存在するフレーズに対してスコアの加点も減点も行わず、非選択逆翻訳文のみに存在するフレーズに対してユーザ選択スコアの減点を行うとしてもよい。または、選択逆翻訳文のみに存在するフレーズに対してユーザ選択スコアの加点を行い、非選択逆翻訳文のみに存在するフレーズに対してユーザ選択スコアの加点も減点も行わないとしてもよい。 In steps S702 and S704, adding or subtracting points from the user-selected score is not necessarily essential. That is, in the flowchart of FIG. 8, it is assumed that no score addition or deduction is performed for a phrase existing only in the selected reverse-translated sentence, and a user-selected score is deducted for a phrase existing only in the non-selected reverse-translated sentence. Is also good. Alternatively, the user-selected score may be added to the phrase that exists only in the selected reverse-translated sentence, and the user-selected score may not be added or deducted to the phrase that exists only in the non-selected reverse-translated sentence.

また、選択逆翻訳文と一部の非選択逆翻訳文に存在するフレーズに関して、スコアの加点を行なっても構わない。この場合、例えば、選択逆翻訳文のみに含まれるフレーズのユーザ選択スコアと、非選択逆翻訳文のみに含まれるフレーズのユーザ選択スコアを考慮した値（例えば、選択逆翻訳文のみに含まれるフレーズのユーザ選択スコアと、非選択逆翻訳文のみに含まれるフレーズのユーザ選択スコアの平均値）を加点することが考えられる。これらのスコアの加点方法については一例であり、これらに限らない。 In addition, scores may be added to phrases that are present in the selected reverse-translated sentence and some non-selected reverse-translated sentences. In this case, for example, a value considering the user selection score of the phrase included only in the selected reverse translation and the user selection score of the phrase included only in the non-selected reverse translation (for example, a phrase included only in the selected reverse translation) It is conceivable to add the user selection score of the phrase and the average value of the user selection scores of the phrases included only in the unselected reverse-translated sentences. The method of adding these scores is an example, and the present invention is not limited thereto.

以下、図１２を用いて、具体例を挙げて説明する。例えば、情報表示端末１００に３つの逆翻訳文が提示され、その中の１つの逆翻訳文をユーザが選択した場合について説明する。このとき、情報表示端末１００に提示される３つの逆翻訳文は、逆翻訳文ＲＳ１０、ＲＳ２０、ＲＳ３０として、ユーザによって選択された逆翻訳文は、逆翻訳文ＲＳ１０であるとする。また、説明の便宜上、ユーザによって選択された逆翻訳文を選択逆翻訳文、ユーザによって選択されなかった逆翻訳文を非選択逆翻訳文と定義する。フローの初期状態において、選択逆翻訳文である逆翻訳文ＲＳ１０は、フレーズＰＨ１１、ＰＨ１２、ＰＨ１３に分割されている。また、非選択逆翻訳文についても同様に、逆翻訳文ＲＳ２０は、フレーズＰＨ２１、ＰＨ２２、ＰＨ２３に分割されており、逆翻訳文ＲＳ３０は、フレーズＰＨ３１、ＰＨ３２、ＰＨ３３に分割されている。 Hereinafter, a specific example will be described with reference to FIG. For example, a case where three back-translated sentences are presented to the information display terminal 100 and the user selects one of the back-translated sentences will be described. At this time, it is assumed that the three reverse-translated sentences presented to the information display terminal 100 are reverse-translated sentences RS10, RS20, and RS30, and the reverse-translated sentence selected by the user is the backward-translated sentence RS10. For convenience of description, a reverse-translated sentence selected by the user is defined as a selected reverse-translated sentence, and a reverse-translated sentence not selected by the user is defined as a non-selected reverse-translated sentence. In the initial state of the flow, the backward-translated sentence RS10, which is the selected backward-translated sentence, is divided into phrases PH11, PH12, and PH13. Similarly, for the unselected reverse-translated sentence, the reverse-translated sentence RS20 is divided into phrases PH21, PH22, and PH23, and the reverse-translated sentence RS30 is divided into phrases PH31, PH32, and PH33.

Ｓ７０１において、逆翻訳文ＲＳ１０、ＲＳ２０、ＲＳ３０の中で、選択逆翻訳文である逆翻訳文ＲＳ１０のみに存在するフレーズの有無を確認する。すると、選択逆翻訳文である逆翻訳文ＲＳ１０にのみフレーズＰＨ１２が含まれるため、フレーズＰＨ１２のユーザ選択スコアを加点して、「＋１」とする。 In step S701, it is confirmed whether or not there is a phrase that exists only in the backward-translated sentence RS10 that is the selected backward-translated sentence among the backward-translated sentences RS10, RS20, and RS30. Then, since the phrase PH12 is included only in the reverse-translated sentence RS10, which is the selected reverse-translated sentence, the user selection score of the phrase PH12 is added to “+1”.

同様に、Ｓ７０２において、非選択逆翻訳文である逆翻訳文ＲＳ２０、ＲＳ３０のみに存在するフレーズの有無を確認する。すると、非選択逆翻訳文にのみフレーズＰＨ２２（ＰＨ３２）とフレーズＰＨ３１が含まれるため、フレーズＰＨ２２（ＰＨ３２）とフレーズＰＨ３１のユーザ選択スコアをそれぞれ減点して、「−１」とする。 Similarly, in S702, it is confirmed whether or not there is a phrase that exists only in the backward-translated sentences RS20 and RS30 that are the unselected backward-translated sentences. Then, since the phrase PH22 (PH32) and the phrase PH31 are included only in the unselected reverse-translated sentences, the user-selected scores of the phrases PH22 (PH32) and PH31 are deducted to "-1".

ここで、選択逆翻訳文、非選択逆翻訳文の両方に含まれるフレーズＰＨ１１（ＰＨ２１）、ＰＨ１３（ＰＨ２３、ＰＨ３３）に関しては、ユーザ選択スコアの加点または減点を行なわない。 Here, for the phrases PH11 (PH21) and PH13 (PH23, PH33) included in both the selected reverse-translated sentence and the non-selected reverse-translated sentence, no points are added or subtracted from the user-selected score.

上述の処理によって、フレーズ毎の最終的なユーザ選択スコアの加点量または減点量は次の通りである。フレーズＰＨ１１（ＰＨ２１）に対しては「±０」、フレーズＰＨ３１に対しては「−１」、フレーズＰＨ２２（ＰＨ３２）に対しては「−１」、フレーズＰＨ１２に対しては「＋１」、フレーズＰＨ１３（ＰＨ２３、ＰＨ３３）に対しては「±０」といった加点量または減点量となる。ここでのスコアの加点量および減点量は一例であるため、これよりも大きいオーダーで加点または減点を行なってもよいし、これよりも小さいオーダーで加点または減点を行なってもよい。 By the above-described processing, the final amount of addition or deduction of the user-selected score for each phrase is as follows. "± 0" for phrase PH11 (PH21), "-1" for phrase PH31, "-1" for phrase PH22 (PH32), "+1" for phrase PH12, phrase For PH13 (PH23, PH33), an additional point or a deducted amount such as “± 0” is obtained. Since the amount of points added and the amount of points deducted here are just examples, points may be added or deducted in an order larger than this, or points may be added or deducted in an order smaller than this.

また、目的言語である順翻訳文に対するフレーズ評価処理について、図１２を用いて、以下に具体例を挙げて説明する。前述した逆翻訳文ＲＳ１０、ＲＳ２０、ＲＳ３０には、それぞれ順翻訳文ＴＳ１０、ＴＳ２０、ＴＳ３０が対応している。フローの初期状態において、順翻訳文ＴＳ１０は、フレーズＰＨ１４、ＰＨ１５、ＰＨ１６に分割されている。また、順翻訳文ＴＳ２０は、フレーズＰＨ２４、ＰＨ２５、ＰＨ２６に分割されており、ＴＳ３０は、フレーズＰＨ３４、ＰＨ３５、ＰＨ３６に分割されている。 The phrase evaluation process for the forward-translated sentence that is the target language will be described below using a specific example with reference to FIG. The forward-translated sentences RS10, TS20, and TS30 correspond to the backward-translated sentences RS10, RS20, and RS30, respectively. In the initial state of the flow, the forward-translated sentence TS10 is divided into phrases PH14, PH15, and PH16. The forward-translated sentence TS20 is divided into phrases PH24, PH25 and PH26, and the TS30 is divided into phrases PH34, PH35 and PH36.

Ｓ７０１において、選択逆翻訳文１に対応する順翻訳文１にのみ存在するフレーズの有無を確認する。すると、順翻訳文ＴＳ１０にのみフレーズＰＨ１６が含まれるため、フレーズＰＨ１６のユーザ選択スコアを加点して、「＋１」とする。 In S701, it is confirmed whether or not there is a phrase that exists only in the forward-translated sentence 1 corresponding to the selected reverse-translated sentence 1. Then, since the phrase PH16 is included only in the forward-translated sentence TS10, the user-selected score of the phrase PH16 is added to "+1".

同様に、Ｓ７０２において、非選択逆翻訳文である逆翻訳文ＲＳ２０、ＲＳ３０に対応する順翻訳文ＴＳ２０、ＴＳ３０のいずれかにのみ存在するフレーズの有無を確認する。すると、順翻訳文ＴＳ２０またはＴＳ３０にのみ、フレーズＰＨ２６（ＰＨ３６）とフレーズＰＨ３４が含まれるため、フレーズＰＨ２６（ＰＨ３６）とフレーズＰＨ３４のユーザ選択スコアをそれぞれ減点して、「−１」とする。 Similarly, in S702, it is confirmed whether there is a phrase that exists only in one of the forward-translated sentences TS20 and TS30 corresponding to the backward-translated sentences RS20 and RS30 that are the unselected backward-translated sentences. Then, since the phrases PH26 (PH36) and the phrase PH34 are included only in the forward-translated sentences TS20 or TS30, the user-selected scores of the phrases PH26 (PH36) and PH34 are respectively deducted to "-1".

選択逆翻訳文に対応する順翻訳文ＴＳ１０に含まれ、非選択逆翻訳文に対応する順翻訳文ＴＳ２０または順翻訳文ＴＳ３０のいずれかにも含まれる、フレーズＰＨ２４とフレーズＰＨ１５（ＰＨ２５、ＰＨ３５）に関しては、ユーザ選択スコアの加点または減点を行なわない。 Phrase PH24 and phrase PH15 (PH25, PH35) included in forward-translated sentence TS10 corresponding to the selected reverse-translated sentence and included in either forward-translated sentence TS20 or forward-translated sentence TS30 corresponding to the unselected reverse-translated sentence. Regarding, no point is added or subtracted from the user-selected score.

上述の処理によって、フレーズ毎の最終的なスコアの加点量または減点量は、フレーズＰＨ２４（ＰＨ３４）に対しては「±０」、フレーズＰＨ３４に対しては「−１」、フレーズＰＨ２６（ＰＨ３６）に対しては「−１」、フレーズＰＨ１６に対しては「＋１」、フレーズＰＨ１５（ＰＨ２５、ＰＨ３５）に対しては「±０」となる。ここでのユーザ選択スコアの加点量および減点量は一例であるため、これよりも大きいオーダーで加点または減点を行なってもよいし、これよりも小さいオーダーで加点または減点を行なってもよい。 By the above-described processing, the final score increment or deduction for each phrase is “± 0” for the phrase PH24 (PH34), “−1” for the phrase PH34, and the phrase PH26 (PH36). For the phrase PH16, "+1" for the phrase PH16, and "± 0" for the phrase PH15 (PH25, PH35). Since the point addition amount and the point deduction amount of the user selection score here are examples, the point addition or the deduction may be performed in an order larger than this, or may be performed in the order smaller than this.

図９は、本実施の形態における、学習処理の具体的な動作を示すフローチャートである。 FIG. 9 is a flowchart showing a specific operation of the learning process in the present embodiment.

ステップＳ８０１において、選択逆翻訳文に対応する順翻訳文に含まれるフレーズと、選択逆翻訳文に含まれるフレーズとのフレーズ対、もしくは選択逆翻訳文に対応する翻訳対象文に含まれるフレーズと、選択逆翻訳文に対応する順翻訳文に含まれるフレーズとのフレーズ対を取得する。フレーズ対とは、機械翻訳の際、原言語、目的言語の両言語間でそれぞれ対応が取られた（同じ意味を持つ）２つのフレーズのことである。さらに、図８の処理で得たユーザ選択スコアも同時に取得する。 In step S801, a phrase pair of a phrase included in the forward translation corresponding to the selected reverse translation and a phrase included in the selected reverse translation, or a phrase included in the translation target sentence corresponding to the selected reverse translation, A phrase pair with a phrase included in the forward translation corresponding to the selected reverse translation is acquired. Phrase pairs are two phrases that have correspondence (having the same meaning) between the source language and the target language during machine translation. Further, the user selection score obtained in the processing of FIG. 8 is also obtained at the same time.

このフレーズ対において、原言語から目的言語へ翻訳される場合に参照されるフレーズテーブルの値に対するユーザ選択スコアを、図１２に示す例を用いて定義すると、例えば、フレーズＰＨ３１→フレーズＰＨ３４：−１／フレーズＰＨ２２（ＰＨ３２）→フレーズＰＨ２６（ＰＨ３５）：−１／フレーズＰＨ１１（ＰＨ２１）→フレーズＰＨ１４（ＰＨ２４）：０／フレーズＰＨ１３（ＰＨ２３、ＰＨ３３）→フレーズＰＨ１５（ＰＨ２５、ＰＨ３５）：０／フレーズＰＨ１２→フレーズＰＨ１６：＋１のようになる。 In this phrase pair, if the user selection score for the value of the phrase table referred to when translated from the source language to the target language is defined using the example shown in FIG. 12, for example, the phrase PH31 → phrase PH34: −1 / Phrase PH22 (PH32) → Phrase PH26 (PH35): -1 / Phrase PH11 (PH21) → Phrase PH14 (PH24): 0 / Phrase PH13 (PH23, PH33) → Phrase PH15 (PH25, PH35): 0 / Phrase PH12 → Phrase PH16: It becomes like +1.

また、フレーズ対において、目的言語から原言語へ翻訳される場合に参照されるフレーズテーブルの値に対するユーザ選択スコアを定義すると、例えば、フレーズＰＨ３４→フレーズＰＨ３１：−１／フレーズＰＨ２６（ＰＨ３６）→フレーズＰＨ２２（ＰＨ３２）：−１／フレーズＰＨ１４（ＰＨ２４）→フレーズＰＨ１１（ＰＨ２１）：０／フレーズＰＨ１５（ＰＨ２５、ＰＨ３５）→フレーズＰＨ１３（ＰＨ２３、ＰＨ３３）：０／フレーズＰＨ１６→フレーズＰＨ１２：＋１のようになる。 Also, in the phrase pair, defining a user selection score for the value of the phrase table referred to when the target language is translated from the target language, for example, the phrase PH34 → phrase PH31: −1 / phrase PH26 (PH36) → phrase PH22 (PH32): -1 / Phrase PH14 (PH24) → Phrase PH11 (PH21): 0 / Phrase PH15 (PH25, PH35) → Phrase PH13 (PH23, PH33): 0 / Phrase PH16 → Phrase PH12: +1 Become.

ステップＳ８０２において、記憶部２４０に格納されているフレーズテーブルの英日翻訳確率または日英翻訳確率に、上述のユーザ選択スコアを反映する。またユーザ選択スコアに一定の値を乗算してからフレーズテーブルに反映するなど、ユーザ選択スコアに傾斜や重み付けを与えても構わない。 In step S802, the above-mentioned user-selected score is reflected in the English-Japanese translation probability or the Japanese-English translation probability of the phrase table stored in the storage unit 240. Also, the user selection score may be given a slope or weight, for example, by multiplying the user selection score by a certain value and then reflecting the result in the phrase table.

これらを用いて、機械翻訳部２３０や逆翻訳部２３３において、強化学習や識別学習、ニューラルネット学習などの機械学習を行う。 Using these, the machine translation unit 230 and the reverse translation unit 233 perform machine learning such as reinforcement learning, identification learning, and neural network learning.

これまでの機械翻訳においては、図１１に示すようなフレーズテーブルの確率値を、対訳コーパス（異なる２つの言語間で、お互いが翻訳となっている文の対を集めたデータ）を元にチューニングすることは行われていたが、ユーザ選択文に含まれるものと含まれないものでスコアに差を付けて機械学習を行う手法はこれまでになく、本開示の機械翻訳システムは、よりユーザの評価結果を反映できる。 In conventional machine translation, the probability values in a phrase table as shown in FIG. 11 are tuned based on a bilingual corpus (data obtained by collecting pairs of sentences that are translated into each other between two different languages). However, there has never been a method of performing machine learning with a difference between scores included in and not included in a user-selected sentence, and the machine translation system according to the present disclosure is more user-friendly. Evaluation results can be reflected.

さらに、機械学習によって、予め用意された対訳コーパスから作成された翻訳モデルや言語モデルなどに対して、逐次、フレーズ単位でユーザの選択結果を混ぜ込みながら、翻訳モデルや言語モデルを学習させることができるため、精度を向上させることができる。 Furthermore, by machine learning, a translation model or a language model can be trained on a translation model or a language model created from a bilingual corpus prepared in advance while sequentially mixing user selection results in phrase units. Therefore, accuracy can be improved.

さらに、機械学習を行うことで、データを元に最適なパラメータが選択される。これらにより、人（ユーザ）の選択結果が翻訳システム反映されるため、人が使いやすい翻訳システムを構築することができる。 Furthermore, by performing machine learning, optimal parameters are selected based on the data. As a result, the selection result of a person (user) is reflected on the translation system, so that a translation system that is easy for a person to use can be constructed.

なお、これらのような機械学習を行うだけではなく、得られたフレーズから新規コーパスを生成して、翻訳エンジン学習のための対訳コーパスとして利用することもできる。 In addition to performing such machine learning, a new corpus can be generated from the obtained phrases and used as a bilingual corpus for translation engine learning.

図１０は、本実施の形態における、学習部２３８の具体的な処理を示すフローチャートである。図１２に示す逆翻訳文および順翻訳文を用いて、図１０に示すフローチャートの内容を説明する。 FIG. 10 is a flowchart illustrating a specific process of the learning unit 238 according to the present embodiment. The contents of the flowchart shown in FIG. 10 will be described using the backward-translated sentence and the forward-translated sentence shown in FIG.

ステップＳ９０１において、ユーザ選択文とその順翻訳文のフレーズ対を取得する。 In step S901, a phrase pair of a user-selected sentence and its forward translation is acquired.

例えば、ユーザ提示文表示エリア１１０２に、逆翻訳文ＲＳ１０、ＲＳ２０、ＲＳ３０が表示された状態で、ユーザによって逆翻訳文ＲＳ１０が選択された場合（逆翻訳文ＲＳ１０がユーザ選択文である場合）を説明する。逆翻訳文ＲＳ１０がユーザによって選択されたため、フレーズＰＨ１１とフレーズＰＨ１４、フレーズＰＨ１２とフレーズＰＨ１６、フレーズＰＨ１３とフレーズＰＨ１５といったフレーズ対が取得される。 For example, it is assumed that the user selects the backward-translated sentence RS10 while the backward-translated sentences RS10, RS20, and RS30 are displayed in the user-provided sentence display area 1102 (when the backward-translated sentence RS10 is a user-selected sentence). explain. Since the reverse translation sentence RS10 is selected by the user, phrase pairs such as the phrase PH11 and the phrase PH14, the phrase PH12 and the phrase PH16, and the phrase PH13 and the phrase PH15 are acquired.

ステップＳ９０２において、機械翻訳部２３０での機械翻訳の際に使われた、入力文とその順翻訳文におけるフレーズ対を取得する。 In step S902, a phrase pair in the input sentence and its forward-translated sentence used in the machine translation by the machine translation unit 230 is acquired.

例えば、逆翻訳文ＲＳ３０の示す内容が入力文である場合には、フレーズＰＨ３１とフレーズＰＨ３４、フレーズＰＨ３２とフレーズＰＨ３６、フレーズＰＨ３３とフレーズＰＨ３５といったフレーズ対が取得される。 For example, when the content indicated by the reverse translation sentence RS30 is an input sentence, a phrase pair such as the phrases PH31 and PH34, the phrases PH32 and PH36, and the phrases PH33 and PH35 is acquired.

ステップＳ９０３において、入力文とユーザ選択文で取得されたフレーズに対し、目的言語の文字列が同一であるフレーズを取得する。例えば、ユーザ選択文におけるフレーズ対が次の通りであったとする。ユーザ選択文におけるフレーズ対は、フレーズＰＨ１１とフレーズＰＨ１４、フレーズＰＨ１２とフレーズＰＨ１６、フレーズＰＨ１３とフレーズＰＨ１５である。これに対し、入力文のフレーズ対が次の通りであったとする。入力文のフレーズ対は、フレーズＰＨ３１とフレーズＰＨ３４、フレーズＰＨ３２とＰＨ３６、フレーズＰＨ４１とＰＨ４２である。 In step S903, a phrase having the same target language character string as the phrase acquired in the input sentence and the user-selected sentence is acquired. For example, assume that the phrase pairs in the user selection sentence are as follows. The phrase pairs in the user-selected sentence are the phrases PH11 and PH14, the phrases PH12 and PH16, and the phrases PH13 and PH15. On the other hand, it is assumed that the phrase pair of the input sentence is as follows. The phrase pairs of the input sentence are the phrases PH31 and PH34, the phrases PH32 and PH36, and the phrases PH41 and PH42.

このとき、ユーザ選択文のフレーズと入力文のフレーズにおいて、原言語のフレーズＰＨ３３とフレーズＰＨ４１は、同じ意味を持つが表現の異なるフレーズである。また、フレーズＰＨ３３とフレーズＰＨ４１とは、ユーザ選択文のフレーズと入力文のフレーズの間で、それぞれ対応するフレーズである。 At this time, in the phrase of the user-selected sentence and the phrase of the input sentence, the source language phrase PH33 and the phrase PH41 have the same meaning but have different expressions. Further, the phrases PH33 and PH41 are corresponding phrases between the phrase of the user selection sentence and the phrase of the input sentence.

最後に、目的言語が同一で原言語が異なるかどうかをチェックし、異なるものをパラフレーズ（言い換え）として保持する（Ｓ９０４のＹｅｓ、Ｓ９０５）。つまり、目的言語が同一でも原言語が異なるということから、これらは原言語における言い換えであるとみなすことができる。 Finally, it is checked whether the target language is the same and the source language is different, and the different language is stored as a paraphrase (paraphrase) (Yes in S904, S905). That is, since the source language is different even if the target language is the same, these can be regarded as paraphrasing in the source language.

例えば、フレーズＰＨ３３とフレーズＰＨ４１は言い換えとみなすことができ、これらを原言語同士のパラフレーズとして保持する。 For example, the phrases PH33 and PH41 can be regarded as paraphrases, and these are held as paraphrases between source languages.

このパラフレーズは、機械翻訳部２３０において、機械翻訳をする際に参照する、もしくは翻訳を行う前に、原言語側での言い換えをとして参照することできる。 The paraphrase can be referred to when performing machine translation in the machine translation unit 230, or referred to as a paraphrase in the source language before performing the translation.

図１３は、本実施の形態における、表示画面の一例を示す図である。 FIG. 13 is a diagram illustrating an example of a display screen according to the present embodiment.

例えば、図１３（Ａ）に示すように、ユーザから翻訳対象である原文の入力を受け付けると、入力された原文ＯＳ１内容が入力文表示エリア１１０１に表示される。 For example, as shown in FIG. 13A, when an input of an original sentence to be translated is received from a user, the contents of the input original sentence OS1 are displayed in the input sentence display area 1101.

次に、図１３（Ｂ）に示すように、原文ＯＳ１を翻訳した翻訳文に対する逆翻訳結果がユーザ提示文表示エリア１１０２に表示される。 Next, as shown in FIG. 13B, a reverse translation result for the translated sentence obtained by translating the original sentence OS1 is displayed in the user-presented sentence display area 1102.

一例としてここでは、３つの逆翻訳文を出力する形態について説明する。逆翻訳結果として、例えば逆翻訳文ＲＳ１、逆翻訳文ＲＳ２、逆翻訳文ＲＳ３が出力され、ユーザ提示文表示エリア１１０２に表示される。このとき、図１３（Ｂ）のユーザ提示文表示エリア１１０２に表示されている逆翻訳文ＲＳ１〜ＲＳ３は、原言語において、それぞれ同様の意味を有する類似文である。逆翻訳処理の特性上、これらは同様の意味を有する類似文であることが期待されるが、それぞれ異なる意味を有する文章を出力するよう実装されてもよい。 As an example, a mode in which three back-translated sentences are output will be described here. As the reverse translation result, for example, a reverse translation sentence RS1, a reverse translation sentence RS2, and a reverse translation sentence RS3 are output and displayed in the user presentation sentence display area 1102. At this time, the reverse-translated sentences RS1 to RS3 displayed in the user-provided sentence display area 1102 in FIG. 13B are similar sentences having the same meaning in the source language. Due to the characteristics of the reverse translation process, these are expected to be similar sentences having the same meaning, but may be implemented to output sentences having different meanings.

次に、図１３（Ｃ）に示すように、ユーザは、ユーザ提示文表示エリア１１０２に表示された逆翻訳結果を確認して、自分の意図した入力内容に一番近い逆翻訳文を選択する。ここでは、原文ＯＳ１に対して、例えば、逆翻訳文ＲＳ１が選択される。 Next, as shown in FIG. 13C, the user checks the reverse translation result displayed in the user-provided sentence display area 1102, and selects the reverse-translated sentence closest to the input content intended by the user. . Here, for example, the back-translated sentence RS1 is selected for the original sentence OS1.

ユーザが逆翻訳文を選択すると、選択された逆翻訳文に対応する翻訳文が、翻訳結果表示エリア１１０３に表示される。ここでは、逆翻訳文ＲＳ１に対応する翻訳文である、翻訳文ＴＳ１が表示される。 When the user selects a backward-translated sentence, a translated sentence corresponding to the selected backward-translated sentence is displayed in the translation result display area 1103. Here, a translated sentence TS1, which is a translated sentence corresponding to the reverse-translated sentence RS1, is displayed.

なお、画面表示に関しては、図１３（Ａ）、（Ｂ）、（Ｃ）に示す様なレイアウトに限らない。必要に応じて各種のボタンが配置されてもよく、例えば、翻訳対象の原文を入力した後に、ボタンに対する操作が行なわれると、翻訳処理が実行されるとしてもよい。また、ボタンに対する操作が行なわれることによって、ユーザ提示文表示エリア１１０２に逆翻訳文が表示されるとしてもよい。また、入力文表示エリア１１０１、ユーザ提示文表示エリア１１０２、翻訳結果表示エリア１１０３の配置や表示される内容、向きは上述の内容に限らない。 Note that the screen display is not limited to the layout shown in FIGS. 13A, 13B, and 13C. Various buttons may be arranged as necessary. For example, when an operation on a button is performed after inputting an original text to be translated, a translation process may be performed. The reverse translation may be displayed in the user-provided sentence display area 1102 by performing an operation on the button. Also, the arrangement, displayed contents, and orientation of the input sentence display area 1101, the user presentation sentence display area 1102, and the translation result display area 1103 are not limited to the above-described contents.

図１４は、本実施の形態における、表示画面の一例を示す図である。 FIG. 14 is a diagram illustrating an example of a display screen according to the present embodiment.

図１３（Ｃ）とは表示が一部異なる。ここでは、翻訳結果表示エリア１２０１に表示されている文章の向きと、入力文表示エリア１２０２およびユーザ提示文表示エリア１２０３に表示されている文章の向きとが異なる。これは、２人のユーザ（原言語話者と目的言語話者）が情報表示端末を挟んで向かい合ってコミュニケーションを行なっている場面を想定している。すなわち、入力文表示エリア１２０２およびユーザ提示文表示エリア１２０３に表示されている文章は、原言語話者に合わせた向きで表示され、翻訳結果表示エリア１２０１に表示されている文章は、目的言語話者に合わせた向きで表示されている。これによって、原言語話者は、入力文に対して出力された翻訳文を目的言語話者に対して読み上げたり、目的言語話者が翻訳文を確認しやすいように情報表示端末の向きを変えたりする必要がないので、情報表示端末などを介して異なる言語を話すユーザ間で円滑なコミュニケーションが可能となる。なお、翻訳結果表示エリア１２０１の向きは、ユーザによる任意の操作で変更することが可能である。また、図１３同様、各エリアの配置や表示される内容、向きはこれらに限らない。 The display is partially different from FIG. Here, the direction of the sentence displayed in translation result display area 1201 is different from the direction of the sentence displayed in input sentence display area 1202 and user presented sentence display area 1203. This assumes a situation in which two users (a source language speaker and a target language speaker) communicate with each other across an information display terminal. That is, the sentences displayed in the input sentence display area 1202 and the user presented sentence display area 1203 are displayed in a direction suitable for the source language speaker, and the sentences displayed in the translation result display area 1201 are displayed in the target language speech. Is displayed in an orientation suitable for the person. This allows the source language speaker to read out the translated sentence for the input sentence to the target language speaker or change the orientation of the information display terminal so that the target language speaker can easily check the translated sentence. Since there is no need to perform such communication, smooth communication between users who speak different languages via an information display terminal or the like is possible. The direction of the translation result display area 1201 can be changed by an arbitrary operation by the user. Further, similarly to FIG. 13, the arrangement of each area, the displayed content, and the orientation are not limited to these.

以上、本発明の一態様に係る翻訳方法について、実施の形態に基づいて説明したが、本発明はこれらの実施の形態に限定されるものではない。本発明の主旨を逸脱しない限り、当業者が思いつく各種変形を本実施の形態に施したもの、あるいは異なる実施の形態における構成要素を組み合わせて構築される形態も、本発明の範囲内に含まれる。 As described above, the translation method according to one embodiment of the present invention has been described based on the embodiments, but the present invention is not limited to these embodiments. Unless departing from the gist of the present invention, various modifications conceivable by those skilled in the art are applied to this embodiment, or forms configured by combining components in different embodiments are also included in the scope of the present invention. .

例えば、上述の説明では、情報表示端末１００に提示される複数の逆翻訳文の中からユーザによって一つの逆翻訳文が選択されるものとしたが、複数の逆翻訳文が選択されてもよい。例えば、一つはユーザの選択として選ばれ、順翻訳文が提示されるが、その他のユーザ非選択文に対しても評価を行い、その結果を学習結果としてシステムに反映するものでもよい。ここでの評価の方法として、例えば、ユーザ非選択文に対して、ユーザが良い順、悪い順に順位付けを行う、ユーザ選択文と同程度に許容できる非選択文をユーザが選択する、明らかに許容できない非選択文をユーザが選択する、などといった方法で評価を行う。これらを行うことで、選択されなかった文に対しても評価を行うことができ、これらをシステムに反映することで、システムの学習に繋がる。 For example, in the above description, one backward-translated sentence is selected by the user from a plurality of backward-translated sentences presented to the information display terminal 100, but a plurality of backward-translated sentences may be selected. . For example, one is selected as a user's selection and a forward-translated sentence is presented, but other non-user-selected sentences may be evaluated and the result may be reflected in the system as a learning result. As a method of evaluation here, for example, the user ranks the user non-selected sentences in the order of good or bad, selects a non-selected sentence that is as permissible as the user selected sentence, The evaluation is performed by a method in which the user selects an unacceptable non-selected sentence. By performing these operations, it is possible to evaluate sentences that have not been selected, and by reflecting these in the system, it is possible to learn the system.

また、上記の説明では、図１３のように入力文に対してテキストによりユーザ提示文や結果の翻訳文が出力されるものとしたが、これらに対して、テキストと音声、または音声のみで提示しても構わない。その場合、ユーザはユーザ提示文に対して、マイクロフォンを通じてユーザ提示文から一つを選択する、という方法で選択しても構わない。 Further, in the above description, a user-provided sentence or a translation of a result is output as a text with respect to an input sentence as shown in FIG. 13, but these are presented with text and voice or only voice. It does not matter. In this case, the user may select one of the user-provided sentences from the user-provided sentences via the microphone.

本発明にかかる機械翻訳方法は、言語情報を出力する情報出力装置へ接続し、第１言語と第２言語との間の翻訳処理を行なう機械翻訳システムにおいて有用である。 The machine translation method according to the present invention is useful in a machine translation system that connects to an information output device that outputs linguistic information and performs translation processing between a first language and a second language.

１００情報表示端末
１０１通信部
１０２入力部
１０３出力部
１０４制御部
１０５選択文検出部
１０６記憶部
２００ネットワーク
２１０通信部
２２０制御部
２３０機械翻訳部
２３１順翻訳部
２３２順翻訳文選択部
２３３逆翻訳部
２３４逆翻訳文選択部
２３５選択文判断部
２３６フレーズ分割部
２３７選択結果評価部
２３８学習部
２４０記憶部
３００翻訳サーバ
４００マイク
５００スピーカー
１０００コンピュータ
１００１入力装置
１００２出力装置
１００３ＣＰＵ
１００４ＲＯＭ
１００５ＲＡＭ
１００６記憶装置
１００７読取装置
１００８送受信装置
１００９バス REFERENCE SIGNS LIST 100 Information display terminal 101 Communication unit 102 Input unit 103 Output unit 104 Control unit 105 Selected sentence detection unit 106 Storage unit 200 Network 210 Communication unit 220 Control unit 230 Machine translation unit 231 Forward translation unit 232 Forward translation selection unit 233 Reverse translation unit 234 Reverse translation sentence selection unit 235 Selected sentence judgment unit 236 Phrase division unit 237 Selection result evaluation unit 238 Learning unit 240 Storage unit 300 Translation server 400 Microphone 500 Speaker 1000 Computer 1001 Input device 1002 Output device 1003 CPU
1004 ROM
1005 RAM
1006 storage device 1007 reading device 1008 transmitting / receiving device 1009 bus

Claims

A machine translation method in a machine translation system connected to an information output device that outputs language information and performing a translation process between a first language and a second language,
Receiving the translation target sentence in the first language,
Generating a plurality of different forward-translated sentences obtained by translating the received sentence to be translated into the second language,
For each of the plurality of different forward-translated sentences, generate a plurality of back-translated sentences back-translated into the first language,
If the operation of selecting one reverse-translated sentence from the plurality of backward-translated sentences is received while outputting the plurality of backward-translated sentences in the information output device, the information corresponding to the one backward-translated sentence is received. Output a forward translation,
Generating a forward-translated sentence group that is a set of forward-translated sentences obtained by translating the received sentence to be translated into the second language, and that includes the plurality of different forward-translated sentences;
For each of the forward-translated sentences included in the forward-translated sentence group, it is determined whether the sentence is classified into a question sentence, an affirmative sentence, a negative sentence, or a command sentence,
The plurality of different forward-translated sentences are determined by selecting a forward-translated sentence having the same form as the classified form from the forward-translated sentence group,
Machine translation method.

The machine translation system is further connected to a voice input device that receives a voice input by the user, and a text input device that receives a text input by the user,
The translation target sentence is received in the form of audio information or text information representing the translation,
The form of output of the forward-translated sentence corresponding to the one backward-translated sentence is changed according to which form of the voice information or the text information has been received.
The machine translation method according to claim 1.

The information output device has an audio output device and a display,
If the translation target sentence is received in the form of audio information, the forward translation corresponding to the one reverse translation is output via the audio output device,
If the translation target sentence is received in the form of text information, the forward translation corresponding to the one reverse translation is output via the display.
3. The machine translation method according to claim 2.

The machine translation system is further connected to a text input device that receives text input by a user,
The translation target sentence is received from the text input device as text information indicating the translation target sentence,
Based on the text information, generating a plurality of different forward-translated sentences that translated the sentence to be translated into the second language,
The machine translation method according to claim 1.

The machine translation system is further connected to a voice input device that receives a user's voice input,
The translation target sentence is received from the voice input device as audio information representing the translation target sentence,
Performing voice recognition processing on the received voice information to generate text information indicating the translation target sentence,
Based on the text information, generating a plurality of different forward-translated sentences that translated the sentence to be translated into the second language,
The machine translation method according to claim 1.

The information output device has a display,
The plurality of reverse-translated sentences are displayed in a first area of the display,
The translation target sentence is displayed in a second area different from the first area of the display,
The machine translation method according to claim 1.

The forward-translated sentence corresponding to the one backward-translated sentence is displayed in a third area of the display.
The machine translation method according to claim 6.

The forward-translated sentence corresponding to the one backward-translated sentence is displayed in a different direction from the plurality of backward-translated sentences displayed in the first area.
The machine translation method according to claim 7.

  The machine translation system,
  A set of the backward-translated sentences generated at least one or more for each of the plurality of different forward-translated sentences, and a backward-translated sentence group including the plurality of backward-translated sentences is generated,
  For each of the backward-translated sentences included in the backward-translated group, calculate an evaluation value that evaluates the degree of similarity with the translation target sentence,
  The plurality of back-translated sentences are selected from the group of back-translated sentences based on the evaluation value.
  The machine translation method according to claim 1.

A machine translation method in a machine translation system connected to an information output device that outputs language information and performing a translation process between a first language and a second language,
Receiving the translation target sentence in the first language,
Generating a plurality of different forward-translated sentences obtained by translating the received sentence to be translated into the second language,
For each of the plurality of different forward-translated sentences, generate a plurality of back-translated sentences back-translated into the first language,
If the operation of selecting one reverse-translated sentence from the plurality of backward-translated sentences is received while outputting the plurality of backward-translated sentences in the information output device, the information corresponding to the one backward-translated sentence is received. Output a forward translation,
Generating a forward-translated sentence group that is a set of the forward-translated sentences obtained by translating the received sentence to be translated into the second language;
The forward-translated sentence group includes the plurality of different forward-translated sentences,
Determine the subject or predicate of each forward-translated sentence included in the forward-translated sentence group,
The plurality of different forward-translated sentences are determined by selecting a forward-translated sentence having the same subject or predicate as the determined subject or pre-described word from the forward-translated sentence group ,
The machine translation method.

A machine translation method in a machine translation system connected to an information output device that outputs language information and performing a translation process between a first language and a second language,
Receiving the translation target sentence in the first language,
Generating a plurality of different forward-translated sentences obtained by translating the received sentence to be translated into the second language,
For each of the plurality of different forward-translated sentences, generate a plurality of back-translated sentences back-translated into the first language,
If the operation of selecting one reverse-translated sentence from the plurality of backward-translated sentences is received while outputting the plurality of backward-translated sentences in the information output device, the information corresponding to the one backward-translated sentence is received. Output a forward translation,
A set of the backward-translated sentences generated at least one or more for each of the plurality of different forward-translated sentences, and a backward-translated sentence group including the plurality of backward-translated sentences is generated,
For each of the back-translated sentences included in the back-translated sentence group, determine whether the sentence is classified into a question sentence, a positive sentence, a negative sentence, or a command sentence,
Wherein the plurality of reverse translation, from the reverse translation Bungun, is selected by selecting the inverse translation is the same form as classified above embodiment,
The machine translation method.

A machine translation method in a machine translation system connected to an information output device that outputs language information and performing a translation process between a first language and a second language,
Receiving the translation target sentence in the first language,
Generating a plurality of different forward-translated sentences obtained by translating the received sentence to be translated into the second language,
For each of the plurality of different forward-translated sentences, generate a plurality of back-translated sentences back-translated into the first language,
If the operation of selecting one reverse-translated sentence from the plurality of backward-translated sentences is received while outputting the plurality of backward-translated sentences in the information output device, the information corresponding to the one backward-translated sentence is received. Output a forward translation,
A set of the backward-translated sentences generated at least one or more for each of the plurality of different forward-translated sentences, and a backward-translated sentence group including the plurality of backward-translated sentences is generated,
Determine the subject or predicate of each of the back-translated sentences included in the back-translated sentence group,
The plurality of backward-translated sentences are selected from the backward-translated sentence group by selecting a backward-translated sentence having the same subject or predicate as the determined subject or pre- determined word,
The machine translation method.

The machine translation system manages a probability model referred to in the translation processing,
The probability model includes a weight value given to each word or phrase used in the translation process,
The machine translation system comprises:
A word or a phrase included in the selected-order translated sentence that is a forward-translated sentence corresponding to the one backward-translated sentence, and a non-selected translated sentence that is a forward-translated sentence corresponding to a reverse-translated sentence other than the one backward-translated sentence. Compares with the contained word or phrase,
A word or phrase included only in the selected order translation,
A word or phrase included only in the non-selection order translation,
For the words or phrases included in both the selected-order translated sentence and the non-selected-order translated sentence, the weight values are updated by applying different weight value updating methods, and the updated weight values are updated. Performing the machine learning using the word or the phrase corresponding to the updated weight value as teacher data;
The machine translation method according to claim 12 .

The machine translation system manages a probability model referred to in the translation processing,
The probability model includes a weight value given to each word or phrase used in the translation process,
The machine translation system comprises:
Compare the word or phrase included in the one back-translated sentence and the word or phrase included in a non-selected reverse-translated sentence that is a back-translated sentence other than the one back-translated sentence,
A word or phrase that is included only in said one back translation,
A word or phrase included only in the unselected reverse translation,
For words or phrases included in both the one back translation and the unselected back translation, update the weight value by applying a different update method of the weight value,
Performing the machine learning using the updated weight value and the word or the phrase corresponding to the updated weight value as teacher data,
The machine translation method according to claim 12 .

A machine translation device that performs a translation process between a first language and a second language,
An input unit configured to receive an input of a sentence to be translated in the first language;
A translation unit that generates a forward-translated sentence obtained by translating the translation target sentence into the second language, and a reverse-translated sentence obtained by reverse-translating the forward-translated sentence into the first language;
An output unit that outputs the backward-translated sentence and the forward-translated sentence corresponding to the one backward-translated sentence;
And a user input unit for receiving a user's input. The translation unit generates a plurality of different forward-translated sentences for the translation target sentence, and a plurality of reverse translations corresponding to each of the plurality of different forward-translated sentences. Generate a statement,
The output unit, when outputting the plurality of back-translated sentences, when the user input unit receives an input for selecting one back-translated sentence from the plurality of back-translated sentences, and it outputs the order translation sentence corresponding to the translation,
The translation unit,
Generating a forward-translated sentence group that is a set of forward-translated sentences obtained by translating the input target sentence into the second language, and that includes the plurality of different forward-translated sentences;
For each of the forward-translated sentences included in the forward-translated sentence group, it is determined whether the sentence is classified into a question sentence, an affirmative sentence, a negative sentence, or a command sentence,
The plurality of different forward-translated sentences are determined by selecting a forward-translated sentence having the same form as the classified form from the forward-translated sentence group,
Machine translation device.

A program for controlling an operation of a machine translation device connected to an information output device and performing a translation process between a first language and a second language,
For the computer of the machine translation device,
Receiving the translation target sentence in the first language,
Generating a plurality of different forward-translated sentences by translating the received sentence to be translated into the second language;
Generating a plurality of back-translated sentences back-translated into the first language for each of the plurality of different forward-translated sentences;
In the information output device, when an operation of selecting one backward-translated sentence from the plurality of backward-translated sentences is displayed while displaying the plurality of backward-translated sentences, the one backward-translated sentence is displayed in the second backward-translated sentence. To output the translated text translated into two languages ,
Generating a forward-translated sentence group that is a set of the forward-translated sentences obtained by translating the received translation target sentence into the second language and includes the plurality of different forward-translated sentences;
For each of the forward-translated sentences included in the forward-translated sentence group, a question sentence, an affirmative sentence, a negative sentence, a command sentence is determined,
The plurality of different forward-translated sentences are determined by selecting a forward-translated sentence having the same form as the classified form from the forward-translated sentence group,
program.