JPWO2013014877A1

JPWO2013014877A1 - Reliability calculation device, translation reliability calculation utilization method, and translation engine program

Info

Publication number: JPWO2013014877A1
Application number: JP2013525566A
Authority: JP
Inventors: 健花沢; 長田　誠也; 誠也長田; 岡部　浩司; 浩司岡部
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2011-07-28
Filing date: 2012-07-13
Publication date: 2015-02-23
Also published as: WO2013014877A1

Abstract

機械翻訳として統計的機械翻訳を前提としなくても、適切に翻訳結果に対する信頼度を得ることができ、かつ低信頼度であった場合に入力側ユーザに適切に再入力を促すことができるようにする。入力文に対して、機械翻訳を行う翻訳手段５０１と、その翻訳結果に対して、逆翻訳処理を行う逆翻訳手段５０２と、入力文と逆翻訳結果との距離に基づいて、翻訳信頼度を算出する翻訳信頼度計算手段５０３と、入力文、翻訳結果および逆翻訳結果のうち少なくとも１つ以上を、当該入力文、翻訳結果または逆翻訳結果に対応する信頼度とともにユーザに提示する信頼度提示手段５０４とを備え、信頼度提示手段５０４は、翻訳信頼度が所定の閾値以下であった場合には、少なくとも入力文または逆翻訳結果を、翻訳信頼度とともに第一の言語を入力した側のユーザに提示することを特徴とする。Even if statistical machine translation is not premised on machine translation, the reliability of the translation result can be appropriately obtained, and if the reliability is low, the user on the input side can be prompted to re-input appropriately. To. The translation reliability is determined based on the distance between the translation unit 501 that performs machine translation on the input sentence, the reverse translation unit 502 that performs reverse translation processing on the translation result, and the input sentence and the reverse translation result. Translation reliability calculation means 503 to be calculated and reliability presentation for presenting at least one of the input sentence, translation result and reverse translation result to the user together with the reliability corresponding to the input sentence, translation result or reverse translation result Means 504, and when the translation reliability is less than or equal to a predetermined threshold, the reliability presentation means 504 receives at least the input sentence or the reverse translation result together with the translation reliability and the side on which the first language is input. Presented to the user.

Description

本発明は、機械翻訳や音声翻訳の信頼度を求める信頼度算出装置、信頼度算出利用方法および翻訳エンジン用プログラムに関する。 The present invention relates to a reliability calculation apparatus, a reliability calculation usage method, and a translation engine program for determining the reliability of machine translation and speech translation.

近年、第一の言語で記述されたテキストなどの文字情報を、異なる第二の言語に変換する機械翻訳の技術の実用化が盛んである。また、テキストの代わりに音声を入力とし、音声認識結果を機械翻訳の入力として用いる音声翻訳技術も実用化が始まっている。 In recent years, machine translation technology for converting character information such as text written in a first language into a different second language has been actively used. In addition, speech translation technology that uses speech instead of text as input and uses speech recognition results as input for machine translation has been put into practical use.

このような機械翻訳や音声翻訳システムを利用すれば、異なる言語を持つ者同士がコミュニケーションを行うことが可能になるため、本技術に対する期待は高い。一方で、音声認識や機械翻訳の技術は完全でなく、結果に誤認識や誤翻訳を含むことがある。誤認識・誤翻訳を含む結果を相手に提示してしまうと、コミュニケーションが破綻していく恐れがあるが、最終的な翻訳結果の妥当性を話し手であるユーザ自身が判断するのは一般に困難である。 If such a machine translation or speech translation system is used, people with different languages can communicate with each other, so there is high expectation for this technology. On the other hand, speech recognition and machine translation techniques are not perfect, and results may include misrecognition and mistranslation. If the result including misrecognition and mistranslation is presented to the other party, communication may be disrupted, but it is generally difficult for the user who is the speaker himself to judge the validity of the final translation result. is there.

このため、音声認識結果や翻訳結果にシステム側から確信度や信頼度といった数値を付与する試みがなされている。音声認識結果や翻訳結果に信頼度が付与されると、低信頼度の発話・入力を棄却することでコミュニケーションの破綻を回避したり、低信頼度の部分を話し手側が言い換えることで正しい結果に近づけることが可能となる。 For this reason, attempts have been made to assign numerical values such as confidence and reliability from the system side to the speech recognition results and translation results. When reliability is given to speech recognition results and translation results, communication failures can be avoided by rejecting low-confidence utterances / inputs, or a speaker can rephrase the low-reliability part to bring it closer to the correct result. It becomes possible.

例えば、機械翻訳の出力訳文候補の選択装置において、順位和に基づく信頼度（ＲＳＣＭ）を用いる方法が特許文献１に記載されている。この方法は、統計的機械翻訳（Statistic Machine Translation ：ＳＭＴ）が出力する上位Ｎ個の訳文候補（Ｎ−ｂｅｓｔ）のうち、各訳文候補である仮説の順位に基づいて各Ｎ−ｂｅｓｔの先頭仮説の単語の事後確率を算出し、信頼度を求めるものである。 For example, Patent Document 1 discloses a method of using reliability (RSCM) based on a sum of ranks in a machine translation output candidate selection apparatus. This method is based on the top hypothesis of each N-best based on the rank of the hypothesis that is each translation candidate among the top N translation candidates (N-best) output by Statistical Machine Translation (SMT). The posterior probability of the word is calculated and the reliability is obtained.

また、特許文献２には、第１の言語による参照テキストから第２の言語によるテキストへの機械翻訳を行う場合に、変換後の第２の言語によるテキストを、第１の言語による折り返し翻訳テキストに逆変換して、翻訳前の参照テキストと逆変換後の第１の言語によるデータとの類似度を変換適正値とするデータ変換適正評価方法が記載されている。 Further, in Patent Document 2, when machine translation from a reference text in a first language into a text in a second language is performed, the translated text in the second language is converted into a folded translation text in the first language. A data conversion appropriate evaluation method is described in which the similarity between the reference text before translation and the data in the first language after reverse conversion is converted into an appropriate conversion value.

また、特許文献３には、入力される第１の言語の音声に対して、音声認識および第２の言語への機械翻訳を行う機械翻訳システムにおいて、音声認識モジュールから出力されるＮベストの仮説とそれらの情報と、統計的機械翻訳モジュールから出力されるＮベスト仮説の各々に対するＫ個の訳文候補とそれらに付随する情報とに基づいて、Ｎ×Ｋ個の訳文候補各々を再スコアして、最も高いスコアを達成する訳文をベスト訳文として出力する再スコアリングモジュールを備える構成が記載されている。 Patent Document 3 discloses the N best hypothesis output from a speech recognition module in a machine translation system that performs speech recognition and machine translation into a second language for input speech in a first language. And re-scoring each of N × K translation candidates based on the K translation candidates for each of the N best hypotheses output from the statistical machine translation module and the information associated therewith. A configuration including a rescoring module that outputs a translation that achieves the highest score as the best translation is described.

特開２００６−０３９６１６号公報JP 2006-039616 A 特開２００６−２５２３２３号公報JP 2006-252323 A 特開２００６−０５３６８３号公報JP 2006-053683 A

しかし、特許文献１に記載されている方法は、翻訳処理として統計的機械翻訳を用いることが前提となっている。このため、文法ベース（ルールベース）で翻訳処理を行う文法規則型翻訳を用いる場合には適用できない。文法規則型翻訳においては尤度という概念が明確でないからである。なお、現在、一般には統計的機械翻訳よりも文法規則型翻訳の方がより高精度な翻訳が実現できると考えられている。 However, the method described in Patent Document 1 is based on the premise that statistical machine translation is used as a translation process. For this reason, it is not applicable when using grammatical rule type translation that performs translation processing on a grammar base (rule base). This is because the concept of likelihood is not clear in grammatical rule translation. Currently, it is generally considered that grammatical rule-based translation can achieve more accurate translation than statistical machine translation.

また、特許文献１に記載されている方法は、機械翻訳の処理結果に対してのみ信頼度を付与するものである。従って、機械翻訳において入力側の文章に不明瞭な部分があった場合等に、入力側の言語を持つユーザに適切に再入力を促すことができない。また、音声で入力される文章に対して音声認識を行った上で機械翻訳を行う場合などにおいて、原文の一部の表現を変えれば最終的に正しく翻訳される可能性がある場合に、機械翻訳の処理結果に対する信頼度だけでは、入力する側のユーザが原文を再入力すべきか否かの判断を行うことは困難である。 Further, the method described in Patent Document 1 gives reliability only to the processing result of machine translation. Therefore, when there is an unclear part in the input-side sentence in machine translation, it is not possible to prompt the user who has the input-side language to appropriately input again. In addition, when performing machine translation after performing speech recognition on a sentence that is input in speech, if there is a possibility that it will eventually be translated correctly if part of the original sentence is changed, It is difficult for the user on the input side to determine whether or not to re-input the original text only with the reliability of the translation processing result.

なお、特許文献２に記載されている方法を利用すれば、機械翻訳として統計的機械翻訳を前提としなくても、機械翻訳の信頼度を得ることはできる。しかし、特許文献２に記載されている方法は、機械翻訳の処理結果に対してのみ信頼度を付与するものであるので、特許文献１と同様、入力側の言語を持つユーザに適切に再入力を促すことができないという問題がある。 If the method described in Patent Document 2 is used, the reliability of machine translation can be obtained without assuming statistical machine translation as machine translation. However, since the method described in Patent Document 2 gives reliability only to the processing result of machine translation, as in Patent Document 1, it is appropriately re-input to a user having an input language. There is a problem that cannot be urged.

また、特許文献３に記載されている方法は、音声認識モジュールと機械翻訳モジュールとから得られる特徴量（例えば、それぞれの精度を反映した尤度情報）を組み合わせて、Ｎ×Ｋ個の訳文候補の各々にスコアを割り当てるので、翻訳処理の結果のみならず音声認識処理の結果をも考慮した、より高い信頼度をもった翻訳結果を得ることができるかもしれない。しかし、特許文献３に記載されている方法は、機械翻訳として統計的機械翻訳を前提としている。また、再スコアリングされたスコアを複数の訳文候補からベスト訳文を選定するために用いることしか想定していない。このため、モデリングすることにより一本化して算出される評価はどこに問題点があるのかがわかりにくく、信頼度を元にユーザに適切に再入力を促すことはできない。 In addition, the method described in Patent Document 3 combines N × K translation candidate candidates by combining feature amounts (for example, likelihood information reflecting each accuracy) obtained from the speech recognition module and the machine translation module. Since a score is assigned to each of these, it may be possible to obtain a translation result with higher reliability in consideration of not only the result of translation processing but also the result of speech recognition processing. However, the method described in Patent Document 3 assumes statistical machine translation as machine translation. Further, it is only assumed that the re-scored score is used to select the best translation from a plurality of translation candidates. For this reason, it is difficult to know where the problem is in the evaluation that is calculated by unifying by modeling, and it is not possible to prompt the user to input again appropriately based on the reliability.

そこで、本発明は、機械翻訳として統計的機械翻訳を前提としなくても、適切に翻訳結果に対する信頼度を得ることができ、かつ低信頼度であった場合に入力側ユーザに適切に再入力を促すことができる信頼度算出装置、翻訳信頼度算出利用方法および翻訳エンジン用プログラムを提供することを目的とする。ここで、翻訳結果に対する信頼度は、音声等のテキストデータ以外の方法により入力される場合においては、入力された第一の言語による文または文章データ（例えば、音声データ）に対して第一の言語による認識処理を経て行われる機械翻訳の最終的な処理結果に対する信頼度をいう。すなわち、単純に機械翻訳処理の処理結果だけでなく、その前段階の認識処理の処理結果をも考慮した信頼度をいう。 Therefore, the present invention can appropriately obtain the reliability of the translation result without assuming statistical machine translation as the machine translation, and appropriately re-input to the input side user when the reliability is low. An object of the present invention is to provide a reliability calculation device, a translation reliability calculation usage method, and a translation engine program. Here, in the case where the reliability for the translation result is input by a method other than text data such as speech, the reliability is the first for the sentence or sentence data (for example, speech data) in the input first language. This refers to the reliability of the final processing result of machine translation performed through language recognition processing. That is, it means the reliability considering not only the processing result of the machine translation process but also the processing result of the preceding recognition process.

本発明による信頼度算出装置は、入力文に対して、入力文の言語である第一の言語とは異なる言語である第二の言語への翻訳処理を行う翻訳手段と、翻訳手段による翻訳処理の結果得られる翻訳結果に対して、入力文の言語である第一の言語への逆翻訳処理を行う逆翻訳手段と、入力文と逆翻訳手段による逆翻訳処理の結果得られる逆翻訳結果との距離に基づいて、翻訳信頼度を算出する翻訳信頼度計算手段と、入力文、翻訳結果および逆翻訳結果のうち少なくとも１つ以上を、当該入力文、翻訳結果または逆翻訳結果に対応する信頼度とともにユーザに提示する信頼度提示手段とを備え、信頼度提示手段は、翻訳信頼度が所定の閾値以下であった場合には、少なくとも入力文または逆翻訳結果を、翻訳信頼度とともに第一の言語を入力した側のユーザに提示することを特徴とする。 The reliability calculation apparatus according to the present invention includes a translation unit that performs translation processing on an input sentence into a second language that is different from the first language that is the language of the input sentence, and translation processing that is performed by the translation unit A reverse translation means for performing a reverse translation process to the first language which is the language of the input sentence, and a reverse translation result obtained as a result of the reverse translation process by the input sentence and the reverse translation means. A translation reliability calculation means for calculating a translation reliability based on the distance between the input sentence, the translation result, and the reverse translation result, at least one of the input sentence, the translation result, and the reverse translation result. A reliability presenting means for presenting to the user together with the degree of reliability, and the reliability presenting means first outputs at least the input sentence or the reverse translation result together with the translation reliability when the translation reliability is below a predetermined threshold. Entered the language of Characterized in that it presented to the user.

また、本発明による翻訳信頼度算出利用方法は、入力文に対して、入力文の言語である第一の言語とは異なる言語である第二の言語への翻訳処理を行い、翻訳処理の結果得られる翻訳結果に対して、入力文の言語である第一の言語への逆翻訳処理を行い、入力文と逆翻訳処理の結果得られる逆翻訳結果との距離に基づいて、翻訳信頼度を算出し、入力文、翻訳結果および逆翻訳結果のうち少なくとも１つ以上を、当該入力文、翻訳結果または逆翻訳結果に対応する信頼度とともに第一の言語を入力した側のユーザに提示する際に、翻訳信頼度が所定の閾値以下であった場合には、少なくとも入力文または逆翻訳結果を、翻訳信頼度とともに第一の言語を入力した側のユーザに提示することを特徴とする。 The translation reliability calculation and utilization method according to the present invention performs a translation process on an input sentence into a second language that is different from the first language that is the language of the input sentence, and results of the translation process The translation result is reverse translated into the first language, which is the language of the input sentence, and the translation reliability is determined based on the distance between the input sentence and the reverse translation result obtained as a result of the reverse translation process. When calculating and presenting at least one of the input sentence, translation result and reverse translation result to the user who has input the first language together with the reliability corresponding to the input sentence, translation result or reverse translation result In addition, when the translation reliability is equal to or lower than a predetermined threshold, at least the input sentence or the reverse translation result is presented to the user who has input the first language together with the translation reliability.

また、本発明による翻訳エンジン用プログラムは、コンピュータに、第一の言語で記述された入力文を、第一の言語とは異なる言語である第二の言語に変換する翻訳処理、翻訳処理の結果得られる翻訳結果を、入力文の言語である第一の言語に変換する逆翻訳処理、入力文と逆翻訳処理の結果得られる逆翻訳結果との距離に基づいて、翻訳信頼度を算出する翻訳信頼度算出処理、および入力文、翻訳結果および逆翻訳結果のうち少なくとも１つ以上を、当該入力文、翻訳結果または逆翻訳結果に対応する信頼度とともにユーザに提示する信頼度提示処理を実行させ、信頼度提示処理で、翻訳信頼度が所定の閾値以下であった場合に、少なくとも入力文または逆翻訳結果を、翻訳信頼度とともに第一の言語を入力した側のユーザに提示させることを特徴とする。 Further, the translation engine program according to the present invention includes a computer, a translation process for converting an input sentence described in a first language into a second language that is a language different from the first language, and a result of the translation process. Translation that calculates translation reliability based on the distance between the reverse translation process that converts the obtained translation result into the first language that is the language of the input sentence, and the reverse translation result that is obtained as a result of the reverse translation process A reliability calculation process and a reliability presentation process for presenting at least one of the input sentence, the translation result, and the reverse translation result together with the reliability corresponding to the input sentence, the translation result, or the reverse translation result to the user. In the reliability presentation process, when the translation reliability is below a predetermined threshold, at least the input sentence or the reverse translation result is presented to the user who has input the first language together with the translation reliability. The features.

本発明によれば、機械翻訳として統計的機械翻訳を前提としなくても、適切に翻訳結果に対する信頼度を得ることができ、かつ低信頼度であった場合に入力側ユーザに適切に再入力を促すことができる。 According to the present invention, even if statistical machine translation is not premised on machine translation, the reliability of the translation result can be appropriately obtained, and when the reliability is low, the input user is appropriately re-input Can be encouraged.

第１の実施形態の信頼度算出装置の構成例を示したブロック図である。It is the block diagram which showed the example of a structure of the reliability calculation apparatus of 1st Embodiment. 第１の実施形態の信頼度算出装置における信頼度算出処理の処理フローの一例を示すフローチャートである。It is a flowchart which shows an example of the processing flow of the reliability calculation process in the reliability calculation apparatus of 1st Embodiment. 第１の実施形態の信頼度算出装置の他の構成例を示すブロック図である。It is a block diagram which shows the other structural example of the reliability calculation apparatus of 1st Embodiment. 第１の実施形態の信頼度算出装置の他の構成例を示すブロック図である。It is a block diagram which shows the other structural example of the reliability calculation apparatus of 1st Embodiment. 第１の実施例の機械翻訳装置の構成例を示すブロック図である。It is a block diagram which shows the structural example of the machine translation apparatus of a 1st Example. 第１の実施例における翻訳処理の処理フローの一例を示すフローチャートである。It is a flowchart which shows an example of the processing flow of the translation process in a 1st Example. 信頼度の算出例を示す説明図である。It is explanatory drawing which shows the example of calculation of reliability. 第２の実施形態の信頼度算出装置の構成例を示すブロック図である。It is a block diagram which shows the structural example of the reliability calculation apparatus of 2nd Embodiment. 第２の実施形態の信頼度算出装置における信頼度算出処理の処理フローの一例を示すフローチャートである。It is a flowchart which shows an example of the processing flow of the reliability calculation process in the reliability calculation apparatus of 2nd Embodiment. 第２の実施形態の信頼度算出装置の他の構成例を示すブロック図である。It is a block diagram which shows the other structural example of the reliability calculation apparatus of 2nd Embodiment. 第２の実施例の音声翻訳システムの構成例を示すブロック図である。It is a block diagram which shows the structural example of the speech translation system of a 2nd Example. 第２の実施例における翻訳処理の処理フローの一例を示すフローチャートである。It is a flowchart which shows an example of the processing flow of the translation process in a 2nd Example. 信頼度の算出例を示す説明図である。It is explanatory drawing which shows the example of calculation of reliability. 本発明の概要を示すブロック図である。It is a block diagram which shows the outline | summary of this invention.

以下、本発明の実施形態を図面を参照して説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

実施形態１．
図１は、本発明の第１の実施形態の信頼度算出装置の構成例を示したブロック図である。図１に示す信頼度算出装置１０は、テキストまたは文字列を入力し、機械翻訳を行うとともにその信頼度を算出して出力するものである。Embodiment 1. FIG.
FIG. 1 is a block diagram showing a configuration example of a reliability calculation apparatus according to the first embodiment of the present invention. The reliability calculation apparatus 10 shown in FIG. 1 inputs text or a character string, performs machine translation, calculates the reliability, and outputs it.

信頼度算出装置１０は、機械翻訳部１３と、逆翻訳部１４と、翻訳信頼度計算部１５とを備えている。また、信頼度算出装置１０は、この他に直接または間接的に、入力部１１と出力部１２とを備えているものとする。なお、「間接的に」とは、例えば、ネットワークを介して接続される他の装置が備えていてもよいことを意味している。 The reliability calculation device 10 includes a machine translation unit 13, a reverse translation unit 14, and a translation reliability calculation unit 15. In addition, the reliability calculation apparatus 10 includes an input unit 11 and an output unit 12 directly or indirectly. Note that “indirectly” means, for example, that other devices connected via a network may be provided.

信頼度算出装置１０は、例えば、図示しないＣＰＵ、ＲＡＭ、ＲＯＭ及び不揮発性記憶装置を備えた汎用的なコンピュータシステムであってもよい。すなわち、信頼度算出装置１０は、ＣＰＵ等の情報処理装置が、ＲＡＭ、ＲＯＭまたは不揮発性記憶装置に記憶されたＯＳおよび信頼度算出プログラムを読み込んで、信頼度算出処理を含む所定の処理を実行するよう構成されていてもよい。これにより、翻訳結果の信頼度を出力することができる。なお、信頼度算出装置１０は、一台のコンピュータシステムである必要はなく、複数台のコンピュータで構成されていてもよい。 The reliability calculation device 10 may be, for example, a general-purpose computer system that includes a CPU, a RAM, a ROM, and a nonvolatile storage device (not shown). That is, in the reliability calculation device 10, an information processing device such as a CPU reads an OS and a reliability calculation program stored in a RAM, a ROM, or a non-volatile storage device, and executes predetermined processing including reliability calculation processing. It may be configured to. Thereby, the reliability of a translation result can be output. In addition, the reliability calculation apparatus 10 does not need to be one computer system, and may be configured by a plurality of computers.

入力部１１は、信頼度算出装置１０に対して、翻訳前の第一の言語で記述された文すなわち原文を入力するキーボード等である。なお、入力部１１は、原文をどの言語に翻訳するかを示す指示情報を入力してもよい。また、出力部１２は、信頼度算出装置１０から出力される信頼度をユーザに表示するディスプレイ等である。 The input unit 11 is a keyboard or the like for inputting a sentence described in a first language before translation, that is, an original sentence, to the reliability calculation apparatus 10. Note that the input unit 11 may input instruction information indicating in which language the original sentence is translated. The output unit 12 is a display or the like that displays the reliability output from the reliability calculation device 10 to the user.

機械翻訳部１３は、入力された原文を、翻訳後の第二の言語の文すなわち訳文に翻訳する。なお、翻訳技術として用いる手法は、統計的機械翻訳でもよいし、文法規則型翻訳でもよいし、用例ベース翻訳でもよい。翻訳技術自体は、公知技術としてよく知られているものであるから、ここでは詳細な説明を省略する。 The machine translation unit 13 translates the input original sentence into a translated second language sentence, that is, a translated sentence. The technique used as the translation technique may be statistical machine translation, grammatical rule translation, or example-based translation. Since the translation technique itself is well known as a known technique, a detailed description thereof is omitted here.

逆翻訳部１４は、機械翻訳部１３によって翻訳された訳文を入力とし、入力された訳文を元の第一の言語に翻訳する。ここでの翻訳手法は、機械翻訳部１３における翻訳手法と異なる手法を用いてもよいが、同じ手法を用いる方が一般には平易である。 The reverse translation unit 14 receives the translation translated by the machine translation unit 13 and translates the input translation into the original first language. The translation method here may be a method different from the translation method in the machine translation unit 13, but it is generally easier to use the same method.

翻訳信頼度計算部１５は、翻訳前の第一の言語で記述された原文と、逆翻訳部１４によって再び第一の言語に翻訳された逆翻訳結果とを入力とし、入力された原文と逆翻訳結果との距離を求めて、信頼度とする。以下、翻訳信頼度計算部１５が算出する信頼度（具体的には、機械翻訳部１３の翻訳処理結果に対する信頼度）を翻訳信頼度という。なお、本実施形態では、ここで算出される翻訳信頼度をそのまま翻訳結果に対する信頼度として扱う。 The translation reliability calculation unit 15 receives the original text described in the first language before translation and the reverse translation result translated into the first language again by the reverse translation unit 14, and reverses the input original text. Find the distance from the translation result and use it as the reliability. Hereinafter, the reliability calculated by the translation reliability calculation unit 15 (specifically, the reliability with respect to the translation processing result of the machine translation unit 13) is referred to as translation reliability. In the present embodiment, the translation reliability calculated here is directly used as the reliability for the translation result.

翻訳信頼度計算部１５は、原文と逆翻訳結果との間の距離、すなわち両文の近さを表す所定の尺度による評価値を、例えば共有Ｎ−ｇｒａｍ数を利用したＢＬＥＵスコアによって求めてもよい。なお、ＢＬＥＵスコアは、統計的機械翻訳の評価を行う際に用いられる尺度であり、ある翻訳結果の文が正解文をどれだけ再現できているかを、両者で共有する単語連鎖（Ｎ−ｇｒａｍ）の数を利用して求めて、０〜１の範囲で出力するものである。ＢＬＥＵスコアの算出式の一例を以下に式（１）として示す。 The translation reliability calculation unit 15 may obtain an evaluation value based on a predetermined scale representing the distance between the original text and the reverse translation result, that is, the closeness of both sentences, for example, using a BLEU score using the shared N-gram number. Good. The BLEU score is a scale used when statistical machine translation is evaluated, and a word chain (N-gram) shared by both of how much a sentence of a certain translation result can reproduce a correct sentence. The number is calculated using the number of 0 and output in the range of 0-1. An example of a BLEU score calculation formula is shown below as Formula (1).

式（１）において、ＢＰは長さ方向のペナルティを表し、ＮはＮ−ｇｒａｍの最大長を表し、ｗ_ｎは結合重み（１／Ｎでよい）を表し、Ｐ_ｎは修正Ｎ−ｇｒａｍ精度を表している。なお、修正Ｎ−ｇｒａｍ精度は、共有Ｎ−ｇｒａｍ数の最大値を全体のＮ−ｇｒａｍ数で除算したものである。なお、ＢＬＥＵスコアは、文が短い場合に低精度になるという課題があるが、後述するような音声認識の信頼度など他の尺度と組み合わせることで、精度を上げることができる。また、ここでは距離としてＢＬＥＵスコアを用いる例を示したが、距離の算出方法はこの方法に限らない。In the formula (1), BP represents a penalty longitudinal, N represents represents the maximum length of the N-gram, _{w n} denotes a coupling weight (which may be 1 / N), _{P n} is modified N-gram accuracy Represents. The modified N-gram accuracy is obtained by dividing the maximum number of shared N-grams by the total number of N-grams. The BLEU score has a problem that it becomes low accuracy when the sentence is short, but accuracy can be improved by combining with other measures such as reliability of speech recognition as described later. Moreover, although the example which uses a BLEU score as distance was shown here, the calculation method of distance is not restricted to this method.

また、原文と逆翻訳結果との間の距離を単語または単語列単位に求めることも可能である。そのような場合には、単語または単語列単位での距離を求めた後、文全体で正規化するとよい。このように単語または単語列単位に距離すなわち翻訳信頼度を求めることで、一文内での低信頼度区間の提示といったことが可能になる。翻訳信頼度を文単位に拡張する方法としては、得られる単語単位の信頼度を文全体で正規化してもよいし、文全体の信頼度を各単語単位の信頼度の同時確率と捉えてもよい。 It is also possible to obtain the distance between the original text and the reverse translation result in units of words or word strings. In such a case, after obtaining the distance in units of words or word strings, it is preferable to normalize the whole sentence. Thus, by obtaining the distance, that is, the translation reliability in units of words or word strings, it becomes possible to present a low reliability section in one sentence. As a method of extending the translation reliability to the sentence unit, the obtained word unit reliability may be normalized for the whole sentence, or the whole sentence reliability may be regarded as the simultaneous probability of the reliability of each word unit. Good.

以下の式（２）は、文｛ｗ^ｎ _ｌ｝を構成する各単語ｗ_ｉの翻訳信頼度をＣＭ_ＭＴ（ｗ_ｉ）としたとき、単語事後確率を文全体で正規化して翻訳信頼度ＣＭ_ＭＴを文単位に拡張する場合の計算例である。The following equation (2) is obtained by normalizing the word posterior probabilities for the entire sentence when the translation reliability of each word w _i constituting the sentence {w ⁿ _l } is CM _MT (w _i ), and the translation reliability CM. It is a calculation example in the case of extending _MT in sentence units.

また、以下の式（３）は、各単語の信頼度を確率事象として捉え、文全体の信頼度をその同時確率として翻訳信頼度ＣＭ_ＭＴを文単位に拡張する場合の計算例である。Also, the following equation (3) is a calculation example in the case where the reliability of each word is regarded as a probability event, and the translation reliability CM _MT is expanded in sentence units with the reliability of the entire sentence as the joint probability.

なお、逆翻訳による信頼度は翻訳による再現性を表すとも言えるため、信頼度の付与は逆翻訳結果に付与することも翻訳前の原文に付与することも同等の意味を持つ。すなわち、低信頼度区間の提示は、翻訳結果に対してのみならず、入力された原文や逆翻訳結果に対しても行うことが可能である。 In addition, since it can be said that the reliability by reverse translation represents the reproducibility by translation, assigning the reliability has the same meaning as assigning to the reverse translation result or to the original text before translation. That is, the low confidence interval can be presented not only for the translation result but also for the input original text or reverse translation result.

次に、本実施形態の動作について説明する。以下では、本実施形態の信頼度算出装置１０における信頼度算出処理について図２を参照して説明する。図２は、本実施形態の信頼度算出装置１０における信頼度算出処理の処理フローの一例を示すフローチャートである。図２に示すように、まず信頼度算出装置１０は、翻訳対象の原文となるテキストを入力する（ステップＳ１００）。ここでは、入力部１１を介して、第一の言語で記述された文すなわち原文の入力を受け付け、受け付けた原文を機械翻訳部１３および翻訳信頼度計算部１５に出力する。 Next, the operation of this embodiment will be described. Below, the reliability calculation process in the reliability calculation apparatus 10 of this embodiment is demonstrated with reference to FIG. FIG. 2 is a flowchart illustrating an example of a processing flow of the reliability calculation process in the reliability calculation apparatus 10 of the present embodiment. As shown in FIG. 2, first, the reliability calculation device 10 inputs a text that is an original sentence to be translated (step S <b> 100). Here, the input of the sentence described in the first language, that is, the original sentence is received via the input unit 11, and the received original sentence is output to the machine translation unit 13 and the translation reliability calculation unit 15.

次に、機械翻訳部１３が、入力された原文を翻訳する（ステップＳ１０１）。機械翻訳部１３は、入力された原文に対して、原文の言語である第一の言語から指定された他の言語である第二の言語への翻訳処理を実行し、その翻訳結果を逆翻訳部１４に出力する。 Next, the machine translation unit 13 translates the input original text (step S101). The machine translation unit 13 performs a translation process on the input original sentence from the first language that is the original language to the second language that is the other language specified, and reversely translates the translation result To the unit 14.

次に、逆翻訳部１４が、機械翻訳部１３の翻訳結果である訳文に対して、逆翻訳を行う（ステップＳ１０２）。逆翻訳部１４は、機械翻訳部１３から入力された翻訳結果である訳文に対して、訳文の言語である第二の言語から再び原文の言語である第一の言語への翻訳処理を実行し、その翻訳結果である逆翻訳結果を翻訳信頼度計算部１５に出力する。 Next, the reverse translation unit 14 performs reverse translation on the translation that is the translation result of the machine translation unit 13 (step S102). The reverse translation unit 14 executes a translation process from the second language, which is the translated language, to the first language, which is the original language, again on the translation that is the translation result input from the machine translation unit 13. The reverse translation result, which is the translation result, is output to the translation reliability calculation unit 15.

次に、翻訳信頼度計算部１５が、信頼度を計算する（ステップＳ１０３）。翻訳信頼度計算部１５は、入力部１１を介して入力された翻訳前の第一の言語で記述された原文と、逆翻訳部１４にて再び第一の言語に翻訳された逆翻訳結果とを入力とし、両者の距離を計算して信頼度とする。なお、翻訳信頼度としての距離は、共有Ｎ−ｇｒａｍを利用して求めるのが好適であるが、これに限らない。 Next, the translation reliability calculation unit 15 calculates the reliability (step S103). The translation reliability calculation unit 15 includes the original text described in the first language before translation input via the input unit 11 and the reverse translation result translated into the first language again by the reverse translation unit 14. Is input, and the distance between the two is calculated as the reliability. In addition, although it is suitable to obtain | require the distance as a translation reliability using shared N-gram, it is not restricted to this.

最後に、信頼度算出装置１０は、結果を出力する（ステップＳ１０４）。信頼度算出装置１０は、翻訳信頼度計算部１５によって得られた翻訳信頼度を、例えば、ディスプレイ等の出力部１２を介して出力してユーザに表示する。このとき、信頼度算出装置１０は、翻訳信頼度とともに逆翻訳結果を出力してもよい。 Finally, the reliability calculation device 10 outputs the result (step S104). The reliability calculation device 10 outputs the translation reliability obtained by the translation reliability calculation unit 15 via the output unit 12 such as a display and displays it to the user. At this time, the reliability calculation device 10 may output the reverse translation result together with the translation reliability.

なお、上実施形態では、原文と逆翻訳結果との距離を計算する際に、両者の距離を計算するとしたが、このとき、原文および逆翻訳結果とを同義語展開してもよい。例えば、図３に示す信頼度算出装置１０ｂのように、さらに同義語展開部１６１と、同義語辞典１６２とを備えていてもよい。なお、図３は、本実施形態の信頼度算出装置の他の構成例を示すブロック図である。 In the above embodiment, the distance between the original sentence and the reverse translation result is calculated when calculating the distance between the original sentence and the reverse translation result. At this time, the original sentence and the reverse translation result may be developed synonymously. For example, a synonym expansion unit 161 and a synonym dictionary 162 may be further provided as in the reliability calculation device 10b illustrated in FIG. FIG. 3 is a block diagram illustrating another configuration example of the reliability calculation apparatus according to the present embodiment.

同義語辞書１６２は、発音や表記は異なるが同じ意味をもつ単語または単語列を対応づけて記憶する。同義語展開部１６１は、原文または逆翻訳結果の各単語または単語列を、同義語辞書１６２を用いて複数の異なる表現に展開する。 The synonym dictionary 162 stores words or word strings having different pronunciations and notations but having the same meaning in association with each other. The synonym expansion unit 161 expands each word or word string of the original text or the reverse translation result into a plurality of different expressions using the synonym dictionary 162.

そのような場合には、翻訳信頼度計算部１５は、同義語展開部１６１によって同義語展開された複数の候補に対して距離を計算し、最も距離が近かった候補との距離を最終的な翻訳信頼度として選択してもよい。このような構成とすることで、例えばある単語が逆翻訳の結果、意味は同じであるが表記が異なる同義異表記語になった場合でも、意味が同じであれば距離は近いものと判断でき、不当に距離が大きくなることを防いでより適切な信頼度計算が可能となる。 In such a case, the translation reliability calculation unit 15 calculates distances for a plurality of candidates that have been synonymously expanded by the synonym expansion unit 161, and finally calculates the distance from the closest candidate. You may select as translation reliability. By adopting such a configuration, for example, even if a word has the same meaning but has the same meaning but different notation as a result of reverse translation, it can be determined that the distance is close if the meaning is the same. Therefore, it is possible to more appropriately calculate the reliability by preventing the distance from being unduly increased.

また、例えば、図４に示すように、さらに信頼度提示部１７を備えて、当該信頼度算出装置１０の信頼度提出部１７で、逆翻訳部１４で得られる逆翻訳結果と、翻訳信頼度計算部１５で得られた信頼度とをあわせて提示するようにしてもよい。なお、図４は、本実施形態の信頼度算出装置の他の構成例を示すブロック図である。 Further, for example, as shown in FIG. 4, a reliability presentation unit 17 is further provided, and the reverse translation result obtained by the reverse translation unit 14 and the translation reliability obtained by the reliability submission unit 17 of the reliability calculation device 10. The reliability obtained by the calculation unit 15 may be presented together. FIG. 4 is a block diagram illustrating another configuration example of the reliability calculation apparatus according to the present embodiment.

以上のように、本実施形態によれば、原文と、翻訳処理結果である訳文を再度原文の言語に逆翻訳した逆翻訳結果との距離を翻訳信頼度として算出するので、翻訳処理として文法規則型翻訳など統計的機械翻訳以外の翻訳手法を用いても適切に信頼度を算出することができる。このように、適切な信頼度を得ることで、結果としてコミュニケーションの破綻を低減することが可能となる。 As described above, according to the present embodiment, the distance between the original sentence and the reverse translation result obtained by reversely translating the translated sentence that is the translation processing result into the original language is calculated as the translation reliability. The reliability can be appropriately calculated even by using a translation method other than statistical machine translation such as type translation. Thus, by obtaining an appropriate degree of reliability, it is possible to reduce communication failures as a result.

また、翻訳信頼度とともに逆翻訳結果を出力することで、例えば、出力先において低信頼度であった場合にユーザにその旨と逆翻訳結果を合わせて提示することで、原文を入力したユーザが翻訳対象とした第二の言語を理解できない場合であっても、話し手の言語による結果の確認が可能となり、その結果低信頼度となった原因やその箇所を推定させ、その部分の言い直しや言い換えを行わせることができる。 Also, by outputting the reverse translation result together with the translation reliability, for example, when the output destination is low reliability, the user who has input the original text can present the result together with the reverse translation result to the user. Even if you can't understand the second language you want to translate, you can check the result in the speaker's language, and as a result, you can estimate the cause and location of the low reliability, Paraphrasing can be performed.

次に、より具体的な例を用いて本実施形態を説明する。以下では、本発明による信頼度算出装置の第１の実施例として、図５に示す機械翻訳装置１００の例を用いて説明する。図５に示す機械翻訳装置１００は、入力部１１と、信頼度算出部１０と、結果判定部１１０と、出力部１２とを備えている。 Next, this embodiment will be described using a more specific example. Hereinafter, a first embodiment of the reliability calculation apparatus according to the present invention will be described using the example of the machine translation apparatus 100 shown in FIG. The machine translation apparatus 100 illustrated in FIG. 5 includes an input unit 11, a reliability calculation unit 10, a result determination unit 110, and an output unit 12.

機械翻訳装置１００は、例えば、ＣＰＵ、ＲＡＭ、ＲＯＭおよび不揮発性記憶装置を備える汎用的なコンピュータシステムにより実現される。すなわち、ＣＰＵが信頼度算出部１０や結果判定部１１０の動作をプログラム言語により記述した機械翻訳プログラムを読み込み、読み込んだプログラムに従い機械翻訳処理を実行するよう構成されていてもよい。なお、機械翻訳装置１００は、１台のコンピュータシステムである必要はなく、複数台のコンピュータシステムで構成されていてもよい。そのような構成において、機械翻訳装置１００は、例えば、一つの翻訳エンジンとして動作し、低信頼度の結果を棄却することで結果として精度の高い翻訳結果を提供する。 The machine translation device 100 is realized by, for example, a general-purpose computer system including a CPU, a RAM, a ROM, and a nonvolatile storage device. That is, the CPU may be configured to read a machine translation program in which the operations of the reliability calculation unit 10 and the result determination unit 110 are described in a program language, and execute machine translation processing according to the read program. The machine translation apparatus 100 does not have to be a single computer system, and may be configured by a plurality of computer systems. In such a configuration, the machine translation apparatus 100 operates as, for example, one translation engine, and provides a highly accurate translation result as a result by rejecting a low-reliability result.

入力部１１は、本装置に対して入力される第一の言語のテキストを受け付け、信頼度算出部１０に出力する。入力部１１は、例えば、テキスト入力デバイスとしてのキーボードでもよい。 The input unit 11 receives text in the first language input to the apparatus and outputs the text to the reliability calculation unit 10. The input unit 11 may be a keyboard as a text input device, for example.

信頼度算出部１０は、入力部１１からのテキスト入力を受け付け、翻訳処理と信頼度算出処理とを行う。信頼度算出部１０は、図１の信頼度算出装置１０に相当するものである。そのため、以下では図１と異なる動作についてのみ説明し、図１の構成と同じ動作については説明を省略する。なお、本例の信頼度算出部１０は、図３の信頼度算出装置１０ｂや図４の信頼度算出装置１０ｃに相当するものであってもよい。 The reliability calculation unit 10 receives text input from the input unit 11 and performs a translation process and a reliability calculation process. The reliability calculation unit 10 corresponds to the reliability calculation device 10 of FIG. Therefore, only the operation different from that in FIG. 1 will be described below, and the description of the same operation as the configuration in FIG. 1 will be omitted. The reliability calculation unit 10 in this example may correspond to the reliability calculation device 10b in FIG. 3 or the reliability calculation device 10c in FIG.

機械翻訳部１３は、第一の言語を翻訳して第二の言語とした文すなわち訳文を、逆翻訳部１４に出力するだけではなく、結果判定部１１０にも出力する。翻訳信頼度計算部１５は、計算した信頼度を結果判定部１１０に出力する。 The machine translation unit 13 translates the first language into the second language, that is, the translation, not only to the reverse translation unit 14 but also to the result determination unit 110. The translation reliability calculation unit 15 outputs the calculated reliability to the result determination unit 110.

結果判定部１１０は、機械翻訳部１３から入力される翻訳結果の訳文を出力すべきかどうかを、翻訳信頼度計算部１５から入力される信頼度に応じて判定する。例えば、入力された信頼度が予め定められた閾値以上であれば翻訳結果を出力し、そうでない場合は低信頼度であると判定して翻訳結果を出力せずに棄却する。 The result determination unit 110 determines whether the translation of the translation result input from the machine translation unit 13 should be output according to the reliability input from the translation reliability calculation unit 15. For example, if the input reliability is greater than or equal to a predetermined threshold value, the translation result is output, and if not, it is determined that the reliability is low and the result is rejected without outputting the translation result.

出力部１２は、結果判定部１１０から得られる翻訳結果である訳文を出力する。例えば、訳文すなわち第二言語で表記されたテキストを画面表示してもよいし、同テキストを音声出力するための音声合成装置を組み込んだスピーカを用いて音声にして読み上げてもよい。また、出力部１２は、結果判定部１１０が翻訳結果を棄却した場合には、何も出力しなくてもよいし、棄却された旨を示すメッセージとともに逆翻訳結果と信頼度とを出力してもよい。 The output unit 12 outputs a translation that is a translation result obtained from the result determination unit 110. For example, the translated text, that is, the text written in the second language may be displayed on the screen, or may be read out as a voice using a speaker incorporating a voice synthesizer for outputting the text. Moreover, the output part 12 does not need to output anything when the result determination part 110 rejects a translation result, and outputs a reverse translation result and reliability with the message which shows that it rejected. Also good.

なお、図５に示す例では、信頼度算出部１０が機械翻訳部１３を備える例を示したが、機械翻訳装置１００が備えている既存の機械翻訳部（検索エンジン等）を利用する構成とすることも可能である。そのような場合には、逆翻訳部１４は、機械翻訳装置１００が備える翻訳エンジン等を利用して逆翻訳を行ってもよい。 In the example illustrated in FIG. 5, the reliability calculation unit 10 includes the machine translation unit 13, but the configuration uses an existing machine translation unit (such as a search engine) included in the machine translation apparatus 100. It is also possible to do. In such a case, the reverse translation unit 14 may perform reverse translation using a translation engine provided in the machine translation apparatus 100.

次に、本実施例にかかる翻訳処理の全体の流れを図６に示すフローチャートを参照して説明する。図６は、本発明の第１の実施例における翻訳処理の処理フローの一例を示すフローチャートである。図６に示すように、まず、機械翻訳装置１００は、第一の言語のテキストを入力する（ステップＳ２００）。具体的には、機械翻訳装置１００は、入力部１１を介して入力される第一の言語のテキストを受け付け、信頼度算出部１０に出力する。 Next, the overall flow of the translation processing according to the present embodiment will be described with reference to the flowchart shown in FIG. FIG. 6 is a flowchart showing an example of the processing flow of translation processing in the first embodiment of the present invention. As shown in FIG. 6, first, the machine translation apparatus 100 inputs text in a first language (step S200). Specifically, the machine translation apparatus 100 accepts text in the first language input via the input unit 11 and outputs the text to the reliability calculation unit 10.

次に、機械翻訳装置１００は、入力されたテキストを翻訳する（ステップＳ２０１）。具体的には、機械翻訳装置１００は、入力部１１を介してユーザから入力された第一の言語のテキストに対し、機械翻訳部１３で第二の言語への翻訳処理を行う。当該翻訳結果は、逆翻訳部１４と結果判定部１１０に出力される。 Next, the machine translation device 100 translates the input text (step S201). Specifically, the machine translation device 100 performs translation processing into the second language by the machine translation unit 13 on the text of the first language input from the user via the input unit 11. The translation result is output to the reverse translation unit 14 and the result determination unit 110.

次に、機械翻訳装置１００は、逆翻訳を行う（ステップＳ２０２）。具体的には、機械翻訳装置１００は、機械翻訳部１３にて得られた翻訳結果に対して、逆翻訳部１４で再び第一の言語へ逆翻訳処理を行う。 Next, the machine translation device 100 performs reverse translation (step S202). Specifically, the machine translation apparatus 100 performs reverse translation processing on the translation result obtained by the machine translation unit 13 to the first language again by the reverse translation unit 14.

次に、機械翻訳装置１００は、信頼度を計算する（ステップＳ２０３）。具体的には、機械翻訳装置１００の翻訳信頼度計算部１５において、入力部１１を介して入力された翻訳前の第一の言語で記述された原文と、逆翻訳部１４にて再び第一の言語へ翻訳された逆翻訳結果とを入力とし、両者の距離を計算して信頼度とする。 Next, the machine translation device 100 calculates the reliability (step S203). Specifically, in the translation reliability calculation unit 15 of the machine translation apparatus 100, the original text described in the first language before translation input via the input unit 11 and the first in the reverse translation unit 14 again. The reverse translation result translated into the language is input, and the distance between the two is calculated as the reliability.

図７は、本実施例における信頼度の算出例を示す説明図である。例えば、図７（ａ）に示すように、第一の言語である英語での入力文「what would you like to drink」に対し、第二の言語である日本語での翻訳結果が「何を飲みたいですか（nani wo nomitaidesuka）」であり、その逆翻訳結果が「What would you like to drink? 」であったとする。このとき、逆翻訳結果は入力文を完全に再現しており、入力文と逆翻訳結果の信頼度は、ＢＬＥＵスコアを１−ｇｒａｍにおいて算出すると、１．０となる。 FIG. 7 is an explanatory diagram illustrating an example of calculating reliability in the present embodiment. For example, as shown in FIG. 7A, for an input sentence “what would you like to drink” in English as the first language, the translation result in Japanese as the second language is “what. Suppose you want to drink (nani wo nomitaidesuka) "and the reverse translation result is" What would you like to drink? " At this time, the reverse translation result completely reproduces the input sentence, and the reliability of the input sentence and the reverse translation result is 1.0 when the BLEU score is calculated in 1-gram.

一方、図７（ｂ）の例を参照すると、第一の言語である英語での入力文「do you have chairs for children 」に対し、第二の言語である中国語への翻訳結果が図中のとおりであり、その逆翻訳結果が「Is there a lecture of a child?」であったとする。ここでは誤翻訳が発生しており、同様に信頼度を算出すると、０．０となる。 On the other hand, referring to the example of FIG. 7B, the translation result into the second language Chinese is shown for the input sentence “do you have chairs for children” in the first language English. Suppose that the reverse translation result is “Is there a lecture of a child?”. Here, mistranslation has occurred, and when the reliability is calculated in the same manner, it becomes 0.0.

なお、このとき、図３の構成で示したように原文または逆翻訳結果の各単語または単語列を、同義語辞書１６２を用いて同義語展開部１６１が複数の候補に展開してもよい。そのような場合に、翻訳信頼度計算部１５は、同義語展開された複数の候補のそれぞれの組み合わせに対して距離を計算し、最も距離の近いものを最終的な翻訳信頼度として選択してもよい。例えば、図７（ｂ）の原文側において、「chairs」を「sofas」「stools」などに変換したり、「do you have」を「are there」などに変換して原文を複数の候補に展開してもよい。同様に、逆翻訳結果においても「lecture」を「class」などに変換して複数の候補に展開してもよい。そのようにして展開された複数の候補のすべての組み合わせについて距離を計算し、最も距離の近いものを信頼度として選択すればよい。なお、複数候補への展開はいずれか一方でもよい。同義語展開を行うことで、逆翻訳時に意味は同じだが表記が変わってしまったような場合でも精度の高い信頼度を計算することができる。 At this time, as shown in the configuration of FIG. 3, the synonym expansion unit 161 may expand each word or word string of the original text or the reverse translation result into a plurality of candidates using the synonym dictionary 162. In such a case, the translation reliability calculation unit 15 calculates the distance for each combination of a plurality of synonym expanded candidates, and selects the closest one as the final translation reliability. Also good. For example, on the source side of Fig. 7 (b), "chairs" is converted to "sofas", "stools", etc., or "do you have" is converted to "are there", etc., and the source text is expanded into multiple candidates. May be. Similarly, in the reverse translation result, “lecture” may be converted into “class” or the like and expanded into a plurality of candidates. The distances may be calculated for all combinations of the plurality of candidates developed in this manner, and the closest one may be selected as the reliability. In addition, the expansion to a plurality of candidates may be any one. By performing synonym expansion, it is possible to calculate highly reliable reliability even when the meaning is the same at the time of reverse translation but the notation has changed.

次に、機械翻訳装置１００は、結果判定を行う（ステップＳ２０４）。具体的には、機械翻訳装置１００は、結果判定部１１０において、機械翻訳部１３から通知された翻訳結果の訳文を出力すべきかどうかを、翻訳信頼度計算部１５から通知される信頼度に応じて判定する。例えば、図７（ａ）の場合、信頼度が１．０と高いため、受理すなわち出力可と判定する。また、図７（ｂ）の場合、信頼度が０．０と低いため、棄却すなわち出力不可とする。 Next, the machine translation apparatus 100 performs a result determination (step S204). Specifically, the machine translation apparatus 100 determines whether the result determination unit 110 should output the translation of the translation result notified from the machine translation unit 13 according to the reliability notified from the translation reliability calculation unit 15. Judgment. For example, in the case of FIG. 7A, since the reliability is as high as 1.0, it is determined that acceptance, that is, output is possible. Further, in the case of FIG. 7B, since the reliability is as low as 0.0, rejection, that is, output is impossible.

次に、機械翻訳装置１００は、結果を出力する（ステップＳ２０５）。具体的には、機械翻訳装置１００は、結果判定部１１０での判定の結果得られた翻訳結果を、出力部１２を介して出力することで例えばディスプレイ等に表示する。 Next, the machine translation apparatus 100 outputs the result (step S205). Specifically, the machine translation apparatus 100 outputs the translation result obtained as a result of the determination by the result determination unit 110 via the output unit 12 and displays the result on a display, for example.

このように、低信頼度の結果を棄却することで結果として精度の高い翻訳を実現することができる。なお、いずれの場合も翻訳結果は出力し、同時に信頼度を表示するようにしてもよい。また、図４の構成で示したように低信頼度で棄却された場合は、逆翻訳結果を信頼度と併せて表示し、ユーザに再入力を促してもよい。 Thus, by rejecting the result with low reliability, it is possible to realize translation with high accuracy as a result. In any case, the translation result may be output and the reliability may be displayed at the same time. Moreover, when rejected with low reliability as shown in the configuration of FIG. 4, the reverse translation result may be displayed together with the reliability to prompt the user to input again.

なお、信頼度の計算を単語または単語列単位で求めた場合は、例えば、図７（ｃ）、（ｄ）のようにして信頼度を提示してもよい。例えば図７（ｃ）に示すように、第一の言語である英語での入力文「what would you like to drink」に対し、第二の言語である日本語での翻訳結果が「何を飲みたいですか（nani wo nomitaidesuka）」であり、その逆翻訳結果が「What would you like to drink?」であったとする。このとき、逆翻訳結果は入力文を完全に再現しているため、すべての単語について信頼度は１．０となる。一方、図７（ｄ）の例を参照すると、第一の言語である英語での入力文「which will you take 」に対し、第二の言語である日本語での翻訳結果が「あなたはどちらがかかりますか（anata ha dochira ga kakarimasuka）」であり、その逆翻訳結果が「Which is your take? 」であったとする。ここでは誤翻訳が発生しているが、逆翻訳により再現された単語については信頼度を高く表示することで、それを見たユーザが、再現されなかった低信頼度部分を言い換えるといったことが容易になる。 In addition, when the calculation of reliability is calculated | required per word or word string, you may show reliability as FIG.7 (c) and (d), for example. For example, as shown in FIG. 7 (c), the input sentence “what would you like to drink” in the first language, English, the translation result in Japanese, the second language, is “What drink Nani wo nomitaidesuka ”and the reverse translation result is“ What would you like to drink? ”. At this time, since the reverse translation result completely reproduces the input sentence, the reliability is 1.0 for all words. On the other hand, referring to the example of FIG. 7 (d), the translation result in Japanese, which is the second language, for the input sentence “which will you take” in the first language, “Which is you? ”Do you take (anata ha dochira ga kakarimasuka)” and the reverse translation result is “Which is your take?” Here, mistranslation occurs, but it is easy for the user who sees the word reproduced by reverse translation to rephrase the low reliability part that was not reproduced by displaying it with high reliability. become.

なお、図７（ｃ）および図７（ｄ）の例では、簡単のため表層が一致している１−ｇｒａｍでは信頼度を１．０に、不一致の場合には０．０としているが、信頼度の値としてはこれに限らない。例えば、表層だけでなく品詞や読み情報などとの組み合わせ、活用等の違いを考慮するといったことも可能である。例えば、表層のみ一致している場合には０．８、品詞も一致していれば０．９などとしてもよいし、活用等の違い（図７（ｄ）の例ではyou とyour）だけであれば０．３といったスコア付けも可能である。 In the examples of FIGS. 7C and 7D, for simplicity, the reliability is set to 1.0 for 1-gram where the surface layers match, and 0.0 for non-matching, The reliability value is not limited to this. For example, it is possible to consider not only the surface layer but also the combination with the part of speech and the reading information, the difference in utilization, and the like. For example, 0.8 may be used if only the surface layer is matched, 0.9 may be used if the parts of speech are also matched, and only the difference in utilization (you and your in the example of FIG. 7 (d)). If so, a score of 0.3 is also possible.

実施形態２．
次に、本発明の第２の実施形態について説明する。本実施形態では、音声認識などテキスト以外の入力を想定し、かつその信頼度との統合を行うことで、より適切な信頼度を算出する。図８は、本発明の第２の実施形態の信頼度算出装置の構成例を示すブロック図である。図８に示す信頼度算出装置１０ｄは、音声認識結果の単語または単語列とその信頼度を入力とし、音声認識結果の信頼度と翻訳結果の信頼度とを統合した信頼度を出力する。Embodiment 2. FIG.
Next, a second embodiment of the present invention will be described. In the present embodiment, more appropriate reliability is calculated by assuming input other than text, such as voice recognition, and integrating the reliability. FIG. 8 is a block diagram illustrating a configuration example of the reliability calculation apparatus according to the second embodiment of this invention. The reliability calculation device 10d illustrated in FIG. 8 receives the word or word string of the speech recognition result and the reliability thereof, and outputs a reliability obtained by integrating the reliability of the speech recognition result and the reliability of the translation result.

図８に示す信頼度算出装置１０ｄは、機械翻訳部１３と、逆翻訳部１４と、翻訳信頼度計算部１５と、信頼度統合部２１とを備える。また、信頼度算出装置１０ｄは、この他に直接または間接的に、音声認識結果入力部２１０と、出力部１２とを備えていてもよい。ここで、機械翻訳部１３と逆翻訳部１４と翻訳信頼度計算部１５とは、図１に示す第１の実施形態と同様であるため、説明を省略する。以下では、第１の実施形態と異なる部分についてのみ説明する。 The reliability calculation apparatus 10d illustrated in FIG. 8 includes a machine translation unit 13, a reverse translation unit 14, a translation reliability calculation unit 15, and a reliability integration unit 21. In addition, the reliability calculation device 10d may include a speech recognition result input unit 210 and an output unit 12 directly or indirectly. Here, the machine translation unit 13, the reverse translation unit 14, and the translation reliability calculation unit 15 are the same as those in the first embodiment shown in FIG. Below, only a different part from 1st Embodiment is demonstrated.

音声認識結果入力部２１０は、音声で入力された翻訳対象の文または文章に対する音声認識結果と、その信頼度とを入力する。 The speech recognition result input unit 210 inputs a speech recognition result for a sentence or sentence to be translated input by speech and its reliability.

機械翻訳部１３と翻訳信頼度計算部１５とは、音声認識結果入力部２１０から音声認識結果を、原文テキストとして受け取る。 The machine translation unit 13 and the translation reliability calculation unit 15 receive the speech recognition result from the speech recognition result input unit 210 as the original text.

信頼度統合部２１は、音声認識結果入力部２１０から音声認識結果に対する信頼度である音声認識信頼度を入力し、さらに翻訳信頼度計算部１５から翻訳信頼度を入力し、入力された音声認識信頼度と翻訳信頼度とを統合する。信頼度統合部２１は、音声認識信頼度と翻訳信頼度を、例えば線形補間により統合して統合信頼度としてもよい。 The reliability integration unit 21 inputs the speech recognition reliability, which is the reliability of the speech recognition result, from the speech recognition result input unit 210, and further inputs the translation reliability from the translation reliability calculation unit 15. Integrate reliability and translation reliability. The reliability integration unit 21 may integrate the speech recognition reliability and the translation reliability by linear interpolation, for example, to obtain an integrated reliability.

例えば、統合信頼度をＣＭ、音声認識信頼度をＣＭ_ＡＳＲ、翻訳信頼度をＣＭ_ＭＴとおくと、統合信頼度ＣＭは、線形補間係数αを用いて以下の式（４）のように表される。For example, if the integrated reliability is CM, the speech recognition reliability is CM _ASR , and the translation reliability is CM _MT , the integrated reliability CM is expressed by the following equation (4) using the linear interpolation coefficient α. The

ＣＭ＝（１−α）×ＣＭ_ＡＳＲ＋α×ＣＭ_ＭＴ・・・式（４）CM = (1−α) × CM _ASR + α × CM _MT (4)

音声認識信頼度ＣＭ_ＡＳＲとしては、例えば音声認識装置が単語単位に計算する単語事後確率（Word Posteriori Probability ）を用いる方法が挙げられる。なお、音声認識信頼度の付与方法はこれに限らない。単語事後確率を計算する方法は、公知技術として良く知られているものであるから、ここでは詳細な説明を省略する。As the speech recognition reliability CM _ASR , for example, there is a method using a word posteriori probability (word posteriori probability) calculated by the speech recognition device for each word. Note that the method of providing the voice recognition reliability is not limited to this. Since the method for calculating the word posterior probability is well known as a known technique, a detailed description thereof is omitted here.

ところで、一般に、翻訳信頼度ＣＭ_ＭＴが文単位で付与されるのに対して、音声認識信頼度ＣＭ_ＡＳＲは単語事後確率のように単語単位で付与されることが多い。このような場合には、音声認識信頼度ＣＭ_ＡＳＲを文単位に拡張して用いてもよい。音声認識信頼度ＣＭ_ＡＳＲを文単位に拡張する方法としては、得られる単語事後確率を文全体で正規化してもよいし、文全体の事後確率を各単語事後確率の同時確率と捉えてもよい。By the way, in general, the translation reliability CM _MT is given in sentence units, whereas the speech recognition reliability CM _ASR is often given in word units like word posterior probabilities. In such a case, the speech recognition reliability CM _ASR may be extended in sentence units. As a method of extending the speech recognition reliability CM _ASR in sentence units, the obtained word posterior probabilities may be normalized for the entire sentence, or the posterior probabilities for the entire sentence may be regarded as the simultaneous probabilities for each word posterior probability. .

以下の式（５）は、文｛ｗ^ｎ _ｌ｝を構成する各単語ｗ_ｉの事後確率をＰ（ｗ_ｉ｜Ｘ）としたとき、単語事後確率を文全体で正規化して音声認識信頼度ＣＭ_ＡＳＲを文単位に拡張する場合の計算例である。The following equation (5) is obtained by normalizing the word posterior probabilities for the entire sentence, where P (w _i | X) is the posterior probability of each word w _i constituting the sentence {w ⁿ _l }, and the speech recognition reliability. It is a calculation example in the case of extending CM _ASR in sentence units.

また、以下の式（６）は、文全体の事後確率を各単語事後確率の同時確率と捉えて音声認識信頼度ＣＭ_ＡＳＲを文単位に拡張する場合の計算例である。Further, the following formula (6) is a calculation example in the case where the posterior probability of the whole sentence is regarded as the simultaneous probability of each word posterior probability and the speech recognition reliability CM _ASR is expanded in sentence units.

逆に、機械翻訳信頼度ＣＭ_ＭＴを、単語または単語列単位にするには、共有Ｎ−ｇｒａｍと音声認識のＮ−ｇｒａｍ言語モデルまたは音声認識信頼度ＣＭ_ＡＳＲとを単語または単語列単位で組み合わせることで実現可能である。例えば、簡単には、共有Ｎ−ｇｒａｍでない単語すなわち逆翻訳で再現しなかった単語については、音声認識信頼度ＣＭ_ＡＳＲに一定のペナルティ値を加えて統合信頼度ＣＭを単語単位で求め、その後、文全体で正規化するとよい。このように単語単位に統合信頼度を求めることで、一文内での低信頼度区間の提示といったことが可能になる。Conversely, in order to set the machine translation reliability CM _MT in units of words or word strings, the shared N-gram and the speech recognition N-gram language model or the voice recognition reliability CM _ASR are combined in units of words or word strings. This is possible. For example, for a word that is not a shared N-gram, that is, a word that has not been reproduced by reverse translation, a certain penalty value is added to the speech recognition reliability CM _ASR to obtain an integrated reliability CM in units of words, Normalize the whole sentence. Thus, by obtaining the integrated reliability in units of words, it becomes possible to present a low reliability section within a sentence.

出力部１２は、信頼度算出装置１０ｄから出力される信頼度（ここでは、統合信頼度）をユーザに表示するディスプレイ等である。出力部１２は、翻訳結果とともにその文単位や単語単位に求められた信頼度を表示してもよい。なお、本実施形態の場合、提示する信頼度としては、翻訳信頼度や入力部に音声認識など他のモジュールを想定する場合にその認識結果に対する信頼度（音声認識信頼度等）を用いてもよい。なお、これらを統合信頼度として組み合わせて用いてもよい。 The output unit 12 is a display or the like that displays the reliability (here, integrated reliability) output from the reliability calculation device 10d to the user. The output unit 12 may display the reliability obtained for each sentence or each word along with the translation result. In the case of this embodiment, as the reliability to be presented, the reliability of the recognition result (speech recognition reliability, etc.) may be used when assuming other modules such as translation reliability and speech recognition in the input unit. Good. These may be used in combination as the integrated reliability.

なお、本実施の形態では原文テキストとして音声認識結果を入力することを想定しているが、前段に置く構成としては音声認識に限らない。例えば、ＯＣＲ（Optical Character Reader：文字読取装置）やタッチパネルを用いた文字入力結果（または文字認識結果）とその信頼度とを入力としても同様の構成が可能である。 In this embodiment, it is assumed that the speech recognition result is input as the original text, but the configuration placed in the preceding stage is not limited to the speech recognition. For example, a similar configuration is possible by inputting a character input result (or character recognition result) using an OCR (Optical Character Reader) or a touch panel and its reliability.

また、本実施形態においても、第１の実施形態と同様に、同義語展開部１６１および同義語辞書１６２を備えて同義語展開を行うことで、より適切な信頼度計算を行うことができる。このとき、音声認識に用いる音声認識辞書と、機械翻訳に用いる翻訳辞書とが共通に保有する単語または単語列のみを対象として同義語展開を行うことで、より効率を高めることができる。すなわち、未知な単語への無駄な展開を防ぎ、翻訳可能な範囲でのみ同義語展開することで、処理量を減らすことができる。 Also in the present embodiment, as in the first embodiment, the synonym expansion unit 161 and the synonym dictionary 162 are provided and synonym expansion is performed, so that more appropriate reliability calculation can be performed. At this time, it is possible to further increase efficiency by performing synonym expansion only on words or word strings that are commonly held by the speech recognition dictionary used for speech recognition and the translation dictionary used for machine translation. That is, it is possible to reduce the amount of processing by preventing useless expansion to unknown words and expanding synonyms only within a translatable range.

また、例えば、図１０に示す信頼度算出装置１０ｅのように、さらに信頼度提示部１７を備えて、信頼度提出部１７で逆翻訳部１４で得られる逆翻訳結果と、信頼度統合部２１で得られた信頼度とをあわせて提示するようにしてもよい。なお、図１０は、本実施形態の信頼度算出装置の他の構成例を示すブロック図である。なお、信頼度提示部１７にて提示する信頼度は、文（または発話全体）単位である必要はなく、例えば単語単位に提示してもよい。 Further, for example, as in the reliability calculation device 10e shown in FIG. 10, the reliability presentation unit 17 is further provided, the reverse translation result obtained by the reverse translation unit 14 in the reliability submission unit 17, and the reliability integration unit 21. You may make it show together with the reliability obtained by. FIG. 10 is a block diagram illustrating another configuration example of the reliability calculation apparatus according to the present embodiment. The reliability presented by the reliability presentation unit 17 does not need to be in units of sentences (or the entire utterance), and may be presented in units of words, for example.

次に、本実施形態の動作について説明する。以下では、本実施形態の信頼度算出装置１０ｄにおける信頼度算出処理について図９を参照して説明する。図９は、本実施形態の信頼度算出装置１０ｄにおける信頼度算出処理の処理フローの一例を示すフローチャートである。図９に示すように、まず信頼度算出装置１０ｄは、翻訳対象の原文となる音声認識結果とその信頼度とを入力する（ステップＳ３００）。ここでは、音声認識結果入力部２１０を介して、音声認識結果の入力を、第一の言語で記述された文すなわち原文として受け付け、受け付けた原文を機械翻訳部１３および翻訳信頼度計算部１５に出力する。同時に、その音声認識結果の音声認識信頼度の入力を受け付け、受け付けた音声認識信頼度を信頼度統合部２１に出力する。 Next, the operation of this embodiment will be described. Below, the reliability calculation process in the reliability calculation apparatus 10d of this embodiment is demonstrated with reference to FIG. FIG. 9 is a flowchart illustrating an example of a processing flow of reliability calculation processing in the reliability calculation device 10d of the present embodiment. As shown in FIG. 9, first, the reliability calculation device 10d inputs a speech recognition result as an original sentence to be translated and its reliability (step S300). Here, the input of the speech recognition result is accepted as a sentence written in the first language, that is, the original text via the speech recognition result input section 210, and the received original text is sent to the machine translation section 13 and the translation reliability calculation section 15. Output. At the same time, an input of the voice recognition reliability of the voice recognition result is received, and the received voice recognition reliability is output to the reliability integration unit 21.

次に、機械翻訳部１３が、入力された原文を翻訳する（ステップＳ３０１）。機械翻訳部１３は、入力された原文に対して、原文の言語である第一の言語から指定された他の言語である第二の言語への翻訳処理を実行し、その翻訳結果を逆翻訳部１４に出力する。 Next, the machine translation unit 13 translates the input original text (step S301). The machine translation unit 13 performs a translation process on the input original sentence from the first language that is the original language to the second language that is the other language specified, and reversely translates the translation result To the unit 14.

次に、逆翻訳部１４が、機械翻訳部１３の翻訳結果である訳文に対して、逆翻訳を行う（ステップＳ３０２）。逆翻訳部１４は、機械翻訳部１３から入力された翻訳結果である訳文に対して、訳文の言語である第二の言語から再び原文の言語である第一の言語への翻訳処理を実行し、その翻訳結果である逆翻訳結果を翻訳信頼度計算部１５に出力する。 Next, the reverse translation unit 14 performs reverse translation on the translation that is the translation result of the machine translation unit 13 (step S302). The reverse translation unit 14 executes a translation process from the second language, which is the translated language, to the first language, which is the original language, again on the translation that is the translation result input from the machine translation unit 13. The reverse translation result, which is the translation result, is output to the translation reliability calculation unit 15.

次に、翻訳信頼度計算部１５が、信頼度を計算する（ステップＳ３０３）。翻訳信頼度計算部１５は、音声結果入力部２１０を介して入力された翻訳前の第一の言語で記述された原文と、逆翻訳部１４にて再び第一の言語に翻訳された逆翻訳結果とを入力とし、両者の距離を計算して翻訳信頼度とする。なお、本実施形態では、ここで算出される翻訳信頼度を、単純に翻訳処理結果に対する信頼度として扱う。 Next, the translation reliability calculation unit 15 calculates the reliability (step S303). The translation reliability calculation unit 15 uses the original text described in the first language before translation input via the speech result input unit 210 and the reverse translation translated into the first language again by the reverse translation unit 14. The result is input, and the distance between the two is calculated as the translation reliability. In the present embodiment, the translation reliability calculated here is simply handled as the reliability for the translation processing result.

次に、信頼度統合部２１が、原文の認識処理に対する信頼度である音声認識信頼度と、翻訳信頼度とを統合する（ステップＳ３０４）。信頼度統合部２１は、音声認識結果入力部２１０から入力された音声認識信頼度と、翻訳信頼度計算部１５から入力された翻訳信頼度とを、例えば線形補間により統合する。なお、統合する信頼度は、文単位である場合や単語または単語列単位である場合がある。 Next, the reliability integration unit 21 integrates the speech recognition reliability, which is the reliability of the original text recognition process, and the translation reliability (step S304). The reliability integration unit 21 integrates the speech recognition reliability input from the speech recognition result input unit 210 and the translation reliability input from the translation reliability calculation unit 15 by, for example, linear interpolation. Note that the reliability to be integrated may be a sentence unit or a word or a word string unit.

最後に、信頼度算出装置１０ｄは、結果を出力する（ステップＳ３０５）。信頼度算出装置１０ｄは、信頼度統合部２１によって得られた統合信頼度を、例えば、ディスプレイ等の出力部１２を介して出力してユーザに表示する。このとき、信頼度算出装置１０ｄは、統合信頼度とともに逆翻訳結果を出力してもよい。また、統合信頼度とともにその統合信頼度の算出に用いた音声認識信頼度と翻訳結果信頼度とを付加情報として出力してもよい。 Finally, the reliability calculation device 10d outputs the result (step S305). The reliability calculation device 10d outputs the integrated reliability obtained by the reliability integration unit 21 via the output unit 12 such as a display and displays it to the user. At this time, the reliability calculation apparatus 10d may output the reverse translation result together with the integrated reliability. In addition to the integrated reliability, the speech recognition reliability and the translation result reliability used to calculate the integrated reliability may be output as additional information.

以上のように、本実施形態によれば、音声認識などテキスト以外の入力が行われる場合に、入力された原文に対する第一の言語による認識処理の信頼度との統合を行うことで、より適切な信頼度を算出することができる。 As described above, according to the present embodiment, when input other than text, such as speech recognition, is performed, integration with the reliability of recognition processing in the first language for the input original sentence is more appropriate. Reliable reliability can be calculated.

また、低信頼度であった場合に、逆翻訳結果と信頼度とをあわせて提示することで、原文を入力したユーザが翻訳対象とした第二の言語を理解できない場合であっても、話し手の言語による結果の確認が可能となり、その結果低信頼度となった原因やその箇所を推定させ、その部分の言い直しや言い換えを行わせることができる。さらに、本実施形態では、単語単位での信頼度を付与することができるため、低信頼度となった単語に対応する逆翻訳分の文章表現を強調して提示することにより、話し手の言語による結果の確認がより容易に可能となり、その結果低信頼度となった原因やその箇所の推定をさらに容易にさせ、その部分の言い直しや言い換えを行わせることができる。 In addition, if the reliability is low, presenting the reverse translation result and reliability together, even if the user who entered the original text cannot understand the second language to be translated, As a result, it is possible to check the result and to estimate the cause and location of the low reliability, and to rephrase or rephrase the portion. Furthermore, in the present embodiment, since reliability can be given in units of words, by emphasizing and presenting a sentence expression for the reverse translation corresponding to the word that has become low reliability, The result can be confirmed more easily, and as a result, the cause of the low reliability and the location can be estimated more easily, and the portion can be rephrased and rephrased.

次に、より具体的な例を用いて本実施形態を説明する。以下では、本発明による信頼度算出装置の第２の実施例として、図１１に示す音声翻訳システム２００の例を説明する。図１０に示す音声翻訳システム２００は、音声入力部２１１と、音声認識部２１２と、信頼度算出部１０ｄと、結果判定部２２０と、出力部１２とを備える。 Next, this embodiment will be described using a more specific example. Hereinafter, an example of the speech translation system 200 shown in FIG. 11 will be described as a second embodiment of the reliability calculation apparatus according to the present invention. The speech translation system 200 illustrated in FIG. 10 includes a speech input unit 211, a speech recognition unit 212, a reliability calculation unit 10d, a result determination unit 220, and an output unit 12.

音声翻訳システム２００は、例えば、ＣＰＵ、ＲＡＭ、ＲＯＭおよび不揮発性記憶装置を備える汎用的なコンピュータシステムにより実現される。すなわち、ＣＰＵが音声認識部２１２や信頼度算出部１０ｄ、結果判定部２２０の動作をプログラム言語により記述した音声翻訳プログラムを読み込み、読み込んだプログラムに従い音声翻訳処理を実行するよう構成されていてもよい。なお、音声翻訳システム２００は、１台のコンピュータシステムである必要はなく、複数台のコンピュータシステムで構成されていてもよい。そのような構成において、音声翻訳システム２００は、翻訳対象の原文となる第一の言語の音声を受け付け、その音声が表している文または文章を第二の言語に翻訳して、翻訳結果を信頼度とともに出力する。 The speech translation system 200 is realized by, for example, a general-purpose computer system that includes a CPU, a RAM, a ROM, and a nonvolatile storage device. That is, the CPU may be configured to read a speech translation program in which the operations of the speech recognition unit 212, the reliability calculation unit 10d, and the result determination unit 220 are described in a program language, and execute the speech translation process according to the read program. . Note that the speech translation system 200 does not have to be a single computer system, and may be configured by a plurality of computer systems. In such a configuration, the speech translation system 200 receives the speech of the first language that is the original sentence to be translated, translates the sentence or sentence represented by the speech into the second language, and trusts the translation result. Output with degrees.

音声入力部２１１は、本システムに対して入力される第一の言語の音声を受け付け、音声認識部２１１に出力する。例えば、入力デバイスとしてのマイクロフォンでもよい。 The voice input unit 211 receives a first language voice input to the system and outputs the first language voice to the voice recognition unit 211. For example, a microphone as an input device may be used.

音声認識部２１０は、音声入力部２１１を介して入力された音声を音声認識処理し、音声認識結果とその信頼度である音声認識信頼度を、信頼度算出部１０ｄに出力する。 The speech recognition unit 210 performs speech recognition processing on the speech input via the speech input unit 211, and outputs the speech recognition result and the speech recognition reliability that is the reliability to the reliability calculation unit 10d.

信頼度算出部１０ｄは、音声認識部２１０からの音声認識結果とその信頼度の入力を翻訳対象とする原文と原文に対する信頼度として受け付け、翻訳処理と翻訳信頼度および統合信頼度の算出処理とを行う。信頼度算出部１０ｄは、図８に示す信頼度算出装置１０ｄに相当するものである。そのため、以下では、図８と異なる動作についてのみ説明し、図８の構成と同じ動作については説明を省略する。なお、本例の信頼度算出部１０ｄは、図１０の信頼度算出装置１０ｅに相当するものや、さらに図３に示すような同義語展開部１６１およぼ同義語辞書１６２を備える構成であってもよい。 The reliability calculation unit 10d accepts the speech recognition result from the speech recognition unit 210 and the input of the reliability as the original text to be translated and the reliability of the original text, and the translation process, the calculation process of the translation reliability and the integrated reliability, I do. The reliability calculation unit 10d corresponds to the reliability calculation device 10d shown in FIG. Therefore, only operations different from those in FIG. 8 will be described below, and description of the same operations as those in the configuration in FIG. 8 will be omitted. In addition, the reliability calculation unit 10d of this example has a configuration equivalent to the reliability calculation device 10e of FIG. 10, and further includes a synonym expansion unit 161 and a synonym dictionary 162 as shown in FIG. Also good.

機械翻訳部１３は、第一の言語を翻訳して第二の言語とした文すなわち訳文を、逆翻訳部１４に出力するだけではなく、結果判定部２２０にも出力する。信頼度統合部２１は、統合した信頼度を結果判定部２２０に出力する。結果判定部２２０は、機械翻訳部１３から入力される翻訳結果の訳文を出力すべきかどうかを、信頼度統合部２１から入力される信頼度すなわち統合信頼度に応じて判定する。 The machine translation unit 13 translates the first language into the second language, that is, the translated sentence, not only to the reverse translation unit 14 but also to the result determination unit 220. The reliability integration unit 21 outputs the integrated reliability to the result determination unit 220. The result determination unit 220 determines whether the translation of the translation result input from the machine translation unit 13 should be output according to the reliability input from the reliability integration unit 21, that is, the integrated reliability.

出力部１２は、結果判定部２２０から得られる翻訳結果である訳文を出力する。例えば、訳文すなわち第二言語で表記されたテキストを画面表示してもよいし、同テキストを音声出力するための音声合成装置を組み込んだスピーカを用いて音声にて読み上げてもよい。また、出力部１２は、結果判定部２２０が翻訳結果を棄却した場合には、何も出力しなくてもよいし、棄却された旨を示すメッセージとともに逆翻訳結果と信頼度とを出力してもよい。 The output unit 12 outputs a translation that is a translation result obtained from the result determination unit 220. For example, the translated text, that is, the text written in the second language may be displayed on the screen, or may be read out by voice using a speaker incorporating a voice synthesizer for outputting the text as voice. Further, the output unit 12 does not need to output anything when the result determination unit 220 rejects the translation result, and outputs the reverse translation result and the reliability together with a message indicating that the result is rejected. Also good.

次に、本実施例にかかる翻訳処理の全体の流れを図１２に示すフローチャートを参照して説明する。図１２は、本発明の第２の実施例における翻訳処理の処理フローの一例を示すフローチャートである。図１２に示すように、まず、音声翻訳システム２００は、第一の言語の音声を入力する（ステップＳ４００）。具体的には、音声翻訳システム２００は、音声入力部２１１を介して入力される第一の言語の音声を受け付け、音声認識部２１２に出力する。 Next, the overall flow of the translation processing according to the present embodiment will be described with reference to the flowchart shown in FIG. FIG. 12 is a flowchart showing an example of a processing flow of translation processing in the second embodiment of the present invention. As shown in FIG. 12, first, the speech translation system 200 inputs speech in the first language (step S400). Specifically, the speech translation system 200 accepts the first language speech input via the speech input unit 211 and outputs it to the speech recognition unit 212.

次に、音声翻訳システム２００は、入力された音声を音声認識する（ステップＳ４０１）。具体的には、音声翻訳システム２００は、音声入力部２１１から入力された音声に対して音声認識部２１２が音声認識処理を実施し、音声認識結果とその信頼度である音声認識信頼度を求め、得られた音声認識結果と音声認識信頼度とを信頼度算出部１０ｄに出力する。なお、音声を入力して音声認識結果としてのテキストを出力する音声認識処理の方法、およびその際に音声認識結果の信頼度を計算する方法については、公知技術として良く知られているものであるから、ここでは詳細な説明を省略する。音声信頼度算出部１０ｄでは、少なくとも機械翻訳部１３、翻訳信頼度計算部１５に音声認識結果が出力され、また得られた音声認識結果が出力され、信頼度統合部２１に音声認識結果が出力される。 Next, the speech translation system 200 recognizes the input speech (step S401). Specifically, in the speech translation system 200, the speech recognition unit 212 performs speech recognition processing on the speech input from the speech input unit 211, and obtains speech recognition results and speech recognition reliability that is reliability. The obtained voice recognition result and the voice recognition reliability are output to the reliability calculation unit 10d. Note that a speech recognition processing method for inputting speech and outputting text as a speech recognition result, and a method for calculating the reliability of the speech recognition result at that time are well known in the art. Therefore, detailed description is omitted here. In the speech reliability calculation unit 10d, the speech recognition result is output to at least the machine translation unit 13 and the translation reliability calculation unit 15, the obtained speech recognition result is output, and the speech recognition result is output to the reliability integration unit 21. Is done.

次に、音声翻訳システム２００は、音声認識結果を翻訳する（ステップＳ４０２）。具体的には、音声翻訳システム２００は、音声認識部２１２から入力された音声認識結果のテキストを機械翻訳部１３によって翻訳し、その翻訳結果を逆翻訳部１４と結果判定部２２０に出力する。 Next, the speech translation system 200 translates the speech recognition result (step S402). Specifically, the speech translation system 200 translates the text of the speech recognition result input from the speech recognition unit 212 by the machine translation unit 13, and outputs the translation result to the reverse translation unit 14 and the result determination unit 220.

次に、音声翻訳システム２００は、逆翻訳を行う（ステップＳ４０３）。具体的には、音声翻訳システム２００は、機械翻訳部１３によって得られた翻訳結果に対して、逆翻訳部１４で再び第一の言語に逆翻訳処理を行う。 Next, the speech translation system 200 performs reverse translation (step S403). Specifically, the speech translation system 200 performs reverse translation processing on the first language again by the reverse translation unit 14 on the translation result obtained by the machine translation unit 13.

次に、音声翻訳システム２００は、翻訳信頼度を計算する（ステップＳ４０４）。具体的には、音声翻訳システム２００の翻訳信頼度計算部１５において、音声認識部２１２による音声認識結果である翻訳前の第一の言語で記述された原文と、逆翻訳部１４にて再び第一の言語へ翻訳された逆翻訳結果とを入力とし、両者の距離を計算して翻訳信頼度とする。なお、ここで算出される翻訳信頼度は、翻訳処理に対する信頼度である。 Next, the speech translation system 200 calculates translation reliability (step S404). Specifically, in the translation reliability calculation unit 15 of the speech translation system 200, the original sentence described in the first language before translation, which is the speech recognition result by the speech recognition unit 212, and the reverse translation unit 14 again The reverse translation result translated into one language is input, and the distance between the two is calculated as the translation reliability. The translation reliability calculated here is the reliability for the translation process.

次に、音声翻訳システム２００は、信頼度を統合する（ステップＳ４０５）。具体的には、音声翻訳システム２００は、信頼度統合部２１にて、入力された音声認識信頼度と翻訳信頼度とを、例えば線形補間により統合して翻訳結果に対する最終的な信頼度とする。 Next, the speech translation system 200 integrates reliability (step S405). Specifically, in the speech translation system 200, the reliability integration unit 21 integrates the input speech recognition reliability and the translation reliability by, for example, linear interpolation to obtain a final reliability for the translation result. .

図１３は、本実施例における信頼度の算出例を示す説明図である。例えば、図１３（ａ）に示すように、第一の言語である英語での入力文「what would you like to drink」に対し、音声認識結果が「what would you like to drink」であり、その音声認識信頼度は完全正解であるので文（発話全体）単位で０．９９であったとする。また、音声認識結果を原文として第二の言語である日本語への翻訳結果が「何を飲みたいですか（nani wo nomitaidesuka）」であり、その逆翻訳結果が「What would you like to drink?」であったとする。このとき、翻訳信頼度は、逆翻訳結果が音声認識結果である原文を完全に再現されているため１．０となった。そのような場合に、これら音声認識信頼度と翻訳信頼度を一定の重みにおいて線形補間すると、統合した信頼度は０．９９となる。 FIG. 13 is an explanatory diagram illustrating a calculation example of reliability in the present embodiment. For example, as shown in FIG. 13A, for an input sentence “what would you like to drink” in English, which is the first language, the speech recognition result is “what would you like to drink”. Since the speech recognition reliability is completely correct, it is assumed that it is 0.99 in units of sentences (entire utterances). Moreover, the translation result into Japanese, which is the second language, using the speech recognition result as the original text is “What do you want to drink (nani wo nomitaidesuka)” and the reverse translation result is “What would you like to drink? ”. At this time, the translation reliability was 1.0 because the original text whose reverse translation result was the speech recognition result was completely reproduced. In such a case, when these speech recognition reliability and translation reliability are linearly interpolated with a constant weight, the integrated reliability is 0.99.

一方、図１３（ｂ）の例を参照すると、第一の言語である英語での入力文「you can check in two hours before the departure time」に対し、音声認識結果が「you can check into wars before the departure time 」であり、その音声認識信頼度は誤りを含んでいるので０．７９であったとする。また、音声認識結果を原文として第二の言語である日本語への翻訳結果が「出発時刻の前に戦争をチェックできます（shuppatsu jikoku no mae ni sensou wo check dekimasu）」であり、その逆翻訳結果が「I can check a war before the departure time.」であったとする。このとき、翻訳信頼度は、逆翻訳結果が音声認識結果である原文とは異なり、誤りを含んでいるため０．６７となった。そのような場合に、これら音声認識信頼度と翻訳信頼度を一定の重みにおいて線形補間すると、統合した信頼度は０．７７となる。 On the other hand, referring to the example of FIG. 13B, the speech recognition result is “you can check into wars before” for the input sentence “you can check in two hours before the departure time” in the first language, English. It is assumed that the speech recognition reliability is 0.79 because it includes an error. In addition, the translation result into Japanese, which is the second language, using the speech recognition result as the original text is “can check war before departure (shuppatsu jikoku no mae ni sensou wo check dekimasu)” and its reverse translation Assume that the result is "I can check a war before the departure time." At this time, the translation reliability is 0.67 because the reverse translation result includes an error, unlike the original text in which the speech recognition result is obtained. In such a case, when these speech recognition reliability and translation reliability are linearly interpolated with a constant weight, the integrated reliability is 0.77.

また、図１３（ｃ）の例を参照すると、第一の言語である英語での入力文「I'll watch over your bags 」に対し、音声認識結果が「I'll walk to the bank 」であり、その音声認識信頼度は０．６４であったとする。また、音声認識結果を原文として第二の言語である中国語への翻訳結果が図中のとおりであり、その逆翻訳結果が「I'll walk to the bank 」であったとする。このとき、翻訳信頼度は、逆翻訳結果が音声認識結果である原文を完全に再現しており、１．０となった。そのような場合に、これら音声認識信頼度と翻訳信頼度を一定の重みにおいて線形補間すると、統合した信頼度は０．７１となる。 Further, referring to the example of FIG. 13C, the speech recognition result is “I'll walk to the bank” for the input sentence “I'll watch over your bags” in the first language, English. It is assumed that the voice recognition reliability is 0.64. Further, it is assumed that the translation result into the second language, Chinese, using the speech recognition result as the original text is as shown in the figure, and the reverse translation result is “I'll walk to the bank”. At this time, the translation reliability was 1.0, which is a complete reproduction of the original text whose reverse translation result is the speech recognition result. In such a case, if these speech recognition reliability and translation reliability are linearly interpolated with a constant weight, the integrated reliability is 0.71.

また、図１３（ｄ）の例を参照すると、第一の言語である英語での入力文「um do you have chairs for children」に対し、音声認識結果が「do you have chairs for children 」であり、その音声認識信頼度は０．９４であったとする。また、音声認識結果を原文として第二の言語である中国語への翻訳結果が図中のとおりであり、その逆翻訳結果が「Is there a lecture of a child?」であったとする。このとき、翻訳信頼度は、逆翻訳結果が音声認識結果である原文とは異なり、誤りを含んでいるため0．０となった。そのような場合に、これら音声認識信頼度と翻訳信頼度を一定の重みにおいて線形補間すると、統合した信頼度は０．７５となる。 Further, referring to the example of FIG. 13D, the speech recognition result is “do you have chairs for children” for the input sentence “um do you have chairs for children” in the first language, English. Suppose that the voice recognition reliability is 0.94. Also, assume that the result of translation into Chinese, which is the second language, with the speech recognition result as the original text is as shown in the figure, and the reverse translation result is “Is there a lecture of a child?”. At this time, the translation reliability was set to 0.0 because the reverse translation result was different from the original sentence that was the speech recognition result and contained an error. In such a case, when these speech recognition reliability and translation reliability are linearly interpolated with a constant weight, the integrated reliability is 0.75.

次に、音声翻訳システム２００は、結果判定を行う（ステップＳ４０６）。具体的には、音声翻訳システム２００は、結果判定部２２０において、機械翻訳部１３から入力される翻訳結果の訳文を出力すべきかどうかを、信頼度統合部２１から入力される統合信頼度に応じて判定する。 Next, the speech translation system 200 performs a result determination (step S406). Specifically, the speech translation system 200 determines whether the result determination unit 220 should output the translation of the translation result input from the machine translation unit 13 according to the integrated reliability input from the reliability integration unit 21. Judgment.

例えば、図１３（ａ）の例では、音声認識結果と機械翻訳結果とが共に正しく、統合信頼度（図中では音声翻訳信頼度と表記）は０．９９と高いため、受理すなわち出力可として判定する。一方、図１３（ｂ）の例では、音声認識結果と機械翻訳結果とが共に誤りを含み、統合信頼度は０．７７と十分に高くないため（具体的には、所定の閾値より低いため）、棄却すなわち出力不可と判定する。 For example, in the example of FIG. 13 (a), both the speech recognition result and the machine translation result are correct, and the integrated reliability (denoted as speech translation reliability in the figure) is as high as 0.99. judge. On the other hand, in the example of FIG. 13B, both the speech recognition result and the machine translation result include an error, and the integrated reliability is not sufficiently high as 0.77 (specifically, it is lower than a predetermined threshold). ), It is determined that it is rejected, that is, output is impossible.

また、図１３（ｃ）の例では、機械翻訳結果は正しいが音声認識結果に誤りを含み、また図１３（ｄ）の例では、音声認識はほぼ正しいが翻訳結果に誤りを含み、いずれの例においても統合信頼度が０．７７と十分に高くないため、棄却すなわち出力不可と判定する。 In the example of FIG. 13C, the machine translation result is correct but the speech recognition result includes an error. In the example of FIG. 13D, the speech recognition is almost correct but the translation result includes an error. Also in the example, since the integrated reliability is not sufficiently high as 0.77, it is determined that rejection, that is, output is impossible.

次に、音声翻訳システム２００は、結果を出力する（ステップＳ４０７）。具体的には、音声翻訳システム２００は、結果判定部２２０での判定の結果得られた翻訳結果を、出力部１２を介して出力することで、例えばディスプレイ等に表示する。 Next, the speech translation system 200 outputs the result (step S407). Specifically, the speech translation system 200 outputs the translation result obtained as a result of the determination by the result determination unit 220 via the output unit 12, thereby displaying the result on a display or the like, for example.

例えば、図１３（ｄ）の例を参照すると、入力音声が「um do you have chairs for children」であったにも関わらず逆翻訳結果として「Is there a lecture of a child?」となり、低信頼度となっている。このとき、例えば図１０に示すような信頼度提示部１７を備えていた場合、信頼度提出部１７が、逆翻訳部１４で得られる逆翻訳結果と、信頼度統合部２１で得られた信頼度とをあわせて提示することで、話し手側すなわち英語側のユーザに、低信頼度となった原因やその箇所を推定させることができる。またその結果、その部分の言い直しや言い換えを行わせることができる。すなわち、話し手であるユーザが逆翻訳結果を原文の言語で確認することで、翻訳対象とした中国語の翻訳結果を理解できなくても、低信頼度になった原因として「chairs」が「lecture 」となってしまったことが容易に推測できる。このため、この部分を言い換えることによって高い信頼度の獲得すなわちコミュニケーションの破綻を低減できる可能性が高まる。 For example, referring to the example of FIG. 13D, although the input speech is “um do you have chairs for children”, the reverse translation result is “Is there a lecture of a child?” It is a degree. At this time, for example, when the reliability presentation unit 17 as shown in FIG. 10 is provided, the reliability submission unit 17 receives the reverse translation result obtained by the reverse translation unit 14 and the reliability obtained by the reliability integration unit 21. By presenting together with the degree, it is possible to cause the speaker side, that is, the English side user to estimate the cause of the low reliability and its location. As a result, the part can be reworded or rephrased. In other words, even if the user who is the speaker does not understand the translation result of the target Chinese by confirming the reverse translation result in the original language, “chairs” is “lecture” Can be easily guessed. For this reason, in other words, there is a high possibility that high reliability, that is, communication failure can be reduced by paraphrasing this part.

なお、このとき信頼度提示部１７にて提示する信頼度は、文（または発話全体）単位である必要はなく、例えば単語単位に提示してもよい。その方が言い換え箇所の同定が容易になることもある。その場合、信頼度統合部２１における音声認識信頼度と翻訳信頼度の統合を、まず単語単位に行った後に文単位に拡張すればよい。 At this time, the reliability presented by the reliability presentation unit 17 does not need to be in units of sentences (or entire utterances), and may be presented in units of words, for example. This may make it easier to identify the paraphrase. In that case, the integration of the speech recognition reliability and the translation reliability in the reliability integration unit 21 may be first performed for each word and then expanded for each sentence.

例えば、図１３（ｄ）の例においては、「chairs」または「do you have 」の部分の信頼度が低いことを提示すれば、その部分に問題があることを話し手側のユーザが確認でき、その箇所を別の単語や表現に言い換えるといった対策が容易になる。 For example, in the example of FIG. 13D, if the reliability of the “chairs” or “do you have” part is presented, the speaker side user can confirm that there is a problem in the part, It is easy to take measures such as rephrasing the part with another word or expression.

このように、本実施例では、音声を入力とする音声翻訳システムにおいて、音声認識と機械翻訳の信頼度を適切に統合して提示することで、高い信頼度の翻訳結果を出力することができる。また、ユーザにとって再入力の必要性や再入力箇所を明確にすることができる。その結果コミュニケーションの破綻を低減することができる。 As described above, in this embodiment, in a speech translation system using speech as an input, it is possible to output a translation result with high reliability by appropriately integrating and presenting the reliability of speech recognition and machine translation. . Further, it is possible for the user to clarify the necessity of re-input and the re-input part. As a result, communication failures can be reduced.

次に、本発明の概要を説明する。図１４は、本発明の概要を示すブロック図である。図１４に示すように、本発明による信頼度算出装置は、翻訳手段５０１と、逆翻訳手段５０２と、翻訳信頼度計算手段５０３と、信頼度提示手段５０４とを備えている。 Next, the outline of the present invention will be described. FIG. 14 is a block diagram showing an outline of the present invention. As shown in FIG. 14, the reliability calculation apparatus according to the present invention includes a translation unit 501, a reverse translation unit 502, a translation reliability calculation unit 503, and a reliability presentation unit 504.

翻訳手段５０１（例えば、機械翻訳部１３）は、入力文に対して、入力文の言語である第一の言語とは異なる言語である第二の言語への翻訳処理を行う。 The translation unit 501 (for example, the machine translation unit 13) performs a translation process on the input sentence into a second language that is different from the first language that is the language of the input sentence.

逆翻訳手段５０２（例えば、逆翻訳部１４）は、翻訳手段５０１による翻訳処理の結果得られる翻訳結果に対して、入力文の言語である第一の言語への逆翻訳処理を行う。 The reverse translation unit 502 (for example, the reverse translation unit 14) performs reverse translation processing on the translation result obtained as a result of translation processing by the translation unit 501 into the first language that is the language of the input sentence.

翻訳信頼度計算手段５０３（例えば、翻訳信頼度計算部１５）は、入力文と逆翻訳手段５０２による逆翻訳処理の結果得られる逆翻訳結果との距離に基づいて、翻訳信頼度を算出する。 The translation reliability calculation unit 503 (for example, the translation reliability calculation unit 15) calculates the translation reliability based on the distance between the input sentence and the reverse translation result obtained as a result of the reverse translation processing by the reverse translation unit 502.

信頼度提示手段５０４（例えば、信頼度提示部１７および出力部１２）は、入力文、翻訳結果および逆翻訳結果のうち少なくとも１つ以上を、その入力文、翻訳結果または逆翻訳結果に対応する信頼度とともにユーザに提示する。その際、翻訳信頼度が所定の閾値以下であった場合には、少なくとも入力文または逆翻訳結果を、翻訳信頼度とともに第一の言語を入力した側のユーザに提示する。 The reliability presentation unit 504 (for example, the reliability presentation unit 17 and the output unit 12) corresponds to at least one of the input sentence, the translation result, and the reverse translation result to the input sentence, the translation result, or the reverse translation result. Present to the user along with the reliability. At that time, if the translation reliability is equal to or less than a predetermined threshold, at least the input sentence or the reverse translation result is presented to the user who has input the first language together with the translation reliability.

なお、「当該（提示する）単語または単語列に対応する信頼度」とは、例えば、原文や逆翻訳結果を提示する場合には、原文と逆翻訳結果との比較によりその単語または単語列に対して算出された翻訳信頼度やその翻訳信頼度と認識信頼度とを組み合わせた統合信頼度である。また例えば翻訳結果を提示する場合には、その単語または単語列を翻訳する際に元となった原文の単語または単語列に対して算出された翻訳信頼度や統合信頼度である。 The “reliability corresponding to the (presented) word or word string” refers to, for example, when the original text or the reverse translation result is presented, the word or word string is compared with the original text and the reverse translation result. It is the integrated reliability that combines the calculated translation reliability and the translation reliability and the recognition reliability. Further, for example, when presenting a translation result, the translation reliability or the integrated reliability calculated for the original sentence word or word string that was used when the word or word string was translated.

また、本発明による信頼度算出装置が、音声入力等のテキスト入力以外の方法で入力された文または文章データに対して、認識処理を行った結果であるテキストを翻訳手段５０１への入力文とする信頼度算出装置である場合には、入力文に対する認識処理の信頼度と、翻訳信頼度計算手段５０３により算出された翻訳信頼度とを用いて、統合信頼度を計算する統合信頼度計算手段（例えば、信頼度統合部２１）を備えていてもよい。そのような構成において、信頼度提示手段５０４は、統合信頼度が所定の閾値以下であった場合には、少なくとも入力文または逆翻訳結果を、統合信頼度または翻訳信頼度とともに第一の言語を入力した側のユーザに提示してもよい。 In addition, the reliability calculation apparatus according to the present invention converts a text, which is a result of performing recognition processing on a sentence or sentence data input by a method other than text input such as voice input, as an input sentence to the translation unit 501. In the case of the reliability calculation device, the integrated reliability calculation means for calculating the integrated reliability using the reliability of the recognition process for the input sentence and the translation reliability calculated by the translation reliability calculation means 503. (For example, the reliability integration unit 21) may be provided. In such a configuration, when the integrated reliability is equal to or lower than a predetermined threshold, the reliability presentation unit 504 outputs at least the input sentence or the reverse translation result together with the integrated reliability or the translation reliability as the first language. You may show to the user of the input side.

また、翻訳信頼度計算手段５０３は、入力文と逆翻訳手段による逆翻訳結果との距離を、共有Ｎ−ｇｒａｍを用いて求めてもよい。 Moreover, the translation reliability calculation means 503 may obtain | require the distance of an input sentence and the reverse translation result by a reverse translation means using shared N-gram.

また、翻訳信頼度計算手段５０３は、翻訳信頼度を単語または単語列単位で求めるとともに、文単位の翻訳信頼度を正規化により求めてもよい。そのような構成において、信頼度提示手段５０４は、統合信頼度または翻訳信頼度が所定の閾値以下であった場合には、少なくとも入力文または逆翻訳結果を、当該入力文または逆翻訳結果の各単語または単語列に対応する信頼度とともに第一の言語を入力した側のユーザに提示してもよい。さらに、信頼度提示手段５０４は、入力文、翻訳結果および逆翻訳結果のうち少なくとも１つ以上をユーザに提示する際に、当該入力文、翻訳結果または逆翻訳結果の各単語または単語列に対応する信頼度に基づき一文内での低信頼度区間を強調して提示してもよい。 Moreover, the translation reliability calculation means 503 may obtain the translation reliability in units of words or word strings and may obtain the translation reliability in units of sentences by normalization. In such a configuration, when the integrated reliability or the translation reliability is equal to or lower than a predetermined threshold, the reliability presenting means 504 outputs at least the input sentence or the reverse translation result for each of the input sentence or the reverse translation result. You may show to the user who input the 1st language with the reliability corresponding to a word or a word string. Further, the reliability degree presentation unit 504 corresponds to each word or word string of the input sentence, the translation result, or the reverse translation result when presenting at least one of the input sentence, the translation result, and the reverse translation result to the user. The low confidence interval in one sentence may be emphasized and presented based on the confidence level.

また、本発明による信頼度算出装置は、入力文および逆翻訳結果のうち少なくともいずれかに対して、同義語展開を行う同義語展開手段（例えば、同義語展開部１６１および同義語辞書１６２）を備えていてもよい。そのような構成において、翻訳信頼度計算手段５０３は、同義語展開後の入力文と逆翻訳結果の組み合わせについて、それぞれ距離を求め、求めた距離に基づき翻訳信頼度を算出してもよい。さらに、同義語展開手段は、認識処理に用いられる認識辞書と、翻訳処理および逆翻訳処理に用いられる翻訳辞書とが共通に保有する単語または単語列のみを対象として同義語展開を行ってもよい。 The reliability calculation device according to the present invention further includes synonym expansion means (for example, synonym expansion unit 161 and synonym dictionary 162) that performs synonym expansion on at least one of the input sentence and the reverse translation result. You may have. In such a configuration, the translation reliability calculation unit 503 may calculate the distance for each combination of the input sentence after the synonym expansion and the reverse translation result, and calculate the translation reliability based on the calculated distance. Further, the synonym expansion means may perform synonym expansion only for words or word strings that are commonly held in the recognition dictionary used in the recognition process and the translation dictionary used in the translation process and the reverse translation process. .

上記の実施形態およびその変形例の一部または全部は、以下の付記のようにも記載され得るが、以下に限定されるわけではない。 Although part or all of said embodiment and its modification example may be described also as the following additional remarks, it is not necessarily limited to the following.

（付記１）入力文に対して、入力文の言語である第一の言語とは異なる言語である第二の言語への翻訳処理を行う翻訳部と、前記翻訳処理の結果得られる翻訳結果に対して、入力文の言語である第一の言語への逆翻訳処理を行う逆翻訳部と、前記入力文と前記逆翻訳処理の結果得られる逆翻訳結果との距離に基づいて、翻訳信頼度を算出する翻訳信頼度計算部と、前記入力文、翻訳結果および逆翻訳結果のうち少なくとも１つ以上を、当該入力文、翻訳結果または逆翻訳結果に対応する信頼度とともにユーザに提示する信頼度提示部とを備え、前記信頼度提示部は、前記翻訳信頼度が所定の閾値以下であった場合には、少なくとも入力文または逆翻訳結果を、翻訳信頼度とともに第一の言語を入力した側のユーザに提示することを特徴とする信頼度算出装置。 (Supplementary note 1) For the input sentence, a translation unit that performs translation processing into a second language that is different from the first language that is the language of the input sentence, and a translation result obtained as a result of the translation process On the other hand, based on the distance between the reverse translation unit that performs the reverse translation process to the first language that is the language of the input sentence, and the reverse translation result obtained as a result of the reverse translation process, the translation reliability And a reliability of presenting at least one or more of the input sentence, the translation result and the reverse translation result together with a reliability corresponding to the input sentence, the translation result or the reverse translation result to the user. A presentation unit, and when the translation reliability is less than or equal to a predetermined threshold, the reliability presentation unit inputs at least the input sentence or the reverse translation result together with the translation reliability to the first language. Presented to users of Yoriyukido calculation device.

（付記２）テキスト入力以外の方法で入力された第一の言語による文または文章データに対して、認識処理を行った結果であるテキストを翻訳部への入力文とする信頼度算出装置であって、前記入力文に対する認識処理の信頼度と、前記翻訳信頼度計算部により算出された翻訳信頼度とを用いて、統合信頼度を計算する統合信頼度計算部を備え、信頼度提示部は、前記統合信頼度が所定の閾値以下であった場合には、少なくとも入力文または逆翻訳結果を、前記統合信頼度または翻訳信頼度とともに第一の言語を入力した側のユーザに提示する付記１に記載の信頼度算出装置。 (Supplementary Note 2) A reliability calculation device that uses a text that is a result of recognition processing for sentence or sentence data in a first language input by a method other than text input, as an input sentence to the translation unit. Using the reliability of the recognition process for the input sentence and the translation reliability calculated by the translation reliability calculation unit, the integrated reliability calculation unit for calculating the integrated reliability, the reliability presentation unit is When the integrated reliability is equal to or lower than a predetermined threshold, at least the input sentence or the reverse translation result is presented to the user who has input the first language together with the integrated reliability or the translation reliability. The reliability calculation apparatus described in 1.

（付記３）翻訳信頼度計算部は、入力文と逆翻訳部による逆翻訳結果との距離を、共有Ｎ−ｇｒａｍを用いて求める付記１または付記２に記載の信頼度算出装置。 (Additional remark 3) The reliability calculation apparatus of Additional remark 1 or Additional remark 2 which a translation reliability calculation part calculates | requires the distance of an input sentence and the reverse translation result by a reverse translation part using shared N-gram.

（付記４）翻訳信頼度計算部は、翻訳信頼度を単語または単語列単位で求めるとともに、文単位の翻訳信頼度を正規化により求め、信頼度提示部は、統合信頼度または翻訳信頼度が所定の閾値以下であった場合には、少なくとも入力文または逆翻訳結果を、当該入力文または逆翻訳結果の各単語または単語列に対応する信頼度とともに第一の言語を入力した側のユーザに提示する付記１から付記３のうちのいずれかに記載の信頼度算出装置。 (Supplementary Note 4) The translation reliability calculation unit obtains the translation reliability in units of words or word strings and obtains the translation reliability in units of sentences by normalization, and the reliability presentation unit has the integrated reliability or the translation reliability. If it is less than or equal to a predetermined threshold, at least the input sentence or reverse translation result is sent to the user who has input the first language together with the reliability corresponding to each word or word string of the input sentence or reverse translation result. The reliability calculation apparatus according to any one of Supplementary Note 1 to Supplementary Note 3 to be presented.

（付記５）信頼度提示部は、入力文、翻訳結果および逆翻訳結果のうち少なくとも１つ以上を第一の言語をユーザに提示する際に、当該入力文、翻訳結果または逆翻訳結果の各単語または単語列に対応する信頼度に基づき一文内での低信頼度区間を強調して提示する付記４に記載の信頼度算出装置。 (Additional remark 5) When a reliability presentation part shows a 1st language to a user at least 1 or more among an input sentence, a translation result, and a back translation result, each of the said input sentence, a translation result, or a back translation result The reliability calculation apparatus according to appendix 4, wherein the low reliability interval in one sentence is emphasized and presented based on the reliability corresponding to the word or the word string.

（付記６）入力文および逆翻訳結果のうち少なくともいずれかに対して、同義語展開を行う同義語展開部を備え、翻訳信頼度計算部は、同義語展開後の入力文と逆翻訳結果の組み合わせについて、それぞれ距離を求め、求めた距離に基づき翻訳信頼度を算出する付記１から付記５のうちのいずれかに記載の信頼度算出装置。 (Supplementary Note 6) A synonym expansion unit that performs synonym expansion on at least one of the input sentence and the reverse translation result is provided, and the translation reliability calculation unit is configured to display the input sentence after the synonym expansion and the reverse translation result The reliability calculation apparatus according to any one of appendix 1 to appendix 5, which calculates a distance for each combination and calculates a translation reliability based on the calculated distance.

（付記７）テキスト入力以外の方法で入力された文に対して、認識処理を行った結果であるテキストを翻訳部への入力文とする信頼度算出装置であって、同義語展開部は、認識処理に用いられる認識辞書と、翻訳処理および逆翻訳処理に用いられる翻訳辞書とが共通に保有する単語または単語列のみを対象として同義語展開を行う付記６に記載の信頼度算出装置。 (Additional remark 7) It is the reliability calculation apparatus which makes the text which is the result of having performed the recognition process with respect to the sentence input by methods other than text input as an input sentence to a translation part, Comprising: The reliability calculation apparatus according to appendix 6, wherein synonym expansion is performed only for words or word strings that are commonly held in a recognition dictionary used for recognition processing and a translation dictionary used for translation processing and reverse translation processing.

この出願は、２０１１年７月２８日に出願された日本特許出願２０１１−１６５７０８を基礎とする優先権を主張し、その開示の全てをここに取り込む。 This application claims the priority on the basis of the JP Patent application 2011-165708 for which it applied on July 28, 2011, and takes in those the indications of all here.

以上、実施形態を参照して本願発明を説明したが、本願発明は上記の実施形態に限定されるものではない。本願発明の構成や詳細には、本願発明のスコープ内で当業者が理解し得る様々な変更をすることができる。 Although the present invention has been described with reference to the embodiments, the present invention is not limited to the above-described embodiments. Various changes that can be understood by those skilled in the art can be made to the configuration and details of the present invention within the scope of the present invention.

Industrial applicability

本発明は、テキストデータ形式で入力された文または文章に対して翻訳処理を行う翻訳装置だけでなく、音声入力されて、またはＯＣＲやタッチパネルを用いて文または文章データが入力されて、認識処理を経た後に翻訳処理を行うような翻訳装置においても、好適に適用可能である。 The present invention is not limited to a translation device that performs a translation process on a sentence or sentence input in a text data format, but also a speech input or a sentence or sentence data input using an OCR or a touch panel, and a recognition process. The present invention can also be suitably applied to a translation apparatus that performs translation processing after passing through.

１０、１０ｂ、１０ｃ、１０ｄ、１０ｅ、１０ｆ信頼度算出装置、信頼度算出部
１１入力部
１２出力部
１３機械翻訳部
１４逆翻訳部
１５翻訳信頼度計算部
１６１同義語展開部
１６２同義語辞書
１７信頼度提示部
１００機械翻訳装置
１１０、２２０結果判定部
２００音声翻訳システム
２１０音声認識結果入力部
２１１音声入力部
２１２音声認識部
２１信頼度統合部
５０１翻訳手段
５０２逆翻訳手段
５０３翻訳信頼度計算手段
５０４信頼度提示手段10, 10b, 10c, 10d, 10e, 10f Reliability calculation device, reliability calculation unit 11 input unit 12 output unit 13 machine translation unit 14 reverse translation unit 15 translation reliability calculation unit 161 synonym expansion unit 162 synonym dictionary 17 Reliability presentation unit 100 machine translation device 110, 220 result determination unit 200 speech translation system 210 speech recognition result input unit 211 speech input unit 212 speech recognition unit 21 reliability integration unit 501 translation unit 502 reverse translation unit 503 translation reliability calculation unit 504 Reliability presentation means

Claims

Translation means for performing a translation process on the input sentence to a second language that is different from the first language that is the language of the input sentence;
Reverse translation means for performing reverse translation processing to the first language that is the language of the input sentence, with respect to the translation result obtained as a result of the translation processing;
A translation reliability calculation means for calculating a translation reliability based on the distance between the input sentence and the reverse translation result obtained as a result of the reverse translation process;
A reliability degree presentation unit that presents at least one or more of the input sentence, the translation result, and the reverse translation result together with a reliability corresponding to the input sentence, the translation result, or the reverse translation result;
When the translation reliability is equal to or lower than a predetermined threshold, the reliability presentation means presents at least an input sentence or a reverse translation result to the user who has input the first language together with the translation reliability. The reliability calculation apparatus characterized by this.

A reliability calculation device that uses a text that is a result of recognition processing for a sentence or sentence data in a first language that is input by a method other than text input, as an input sentence to the translation means,
Using the reliability of the recognition process for the input sentence and the translation reliability calculated by the translation reliability calculation means, comprising integrated reliability calculation means for calculating the integrated reliability;
When the integrated reliability is equal to or less than a predetermined threshold, the reliability presenting means provides at least an input sentence or a reverse translation result as a user who has input the first language together with the integrated reliability or the translation reliability. The reliability calculation device according to claim 1.

The reliability calculation device according to claim 1, wherein the translation reliability calculation unit obtains a distance between the input sentence and the reverse translation result by the reverse translation unit using a shared N-gram.

The translation reliability calculation means calculates the translation reliability in units of words or word strings, and calculates the translation reliability in units of sentences by normalization,
When the integrated reliability or the translation reliability is equal to or less than a predetermined threshold, the reliability presenting means corresponds at least the input sentence or the reverse translation result to each word or word string of the input sentence or the reverse translation result. The reliability calculation apparatus according to any one of claims 1 to 3, wherein the reliability is presented to a user who inputs the first language together with the reliability.

When presenting at least one or more of the input sentence, the translation result, and the reverse translation result to the user in the first language, the reliability degree presentation means each word or word string of the input sentence, the translation result, or the reverse translation result. The reliability calculation apparatus according to claim 4, wherein a low reliability section in one sentence is emphasized and presented based on the reliability corresponding to.

Synonym expansion means for performing synonym expansion on at least one of the input sentence and the reverse translation result,
The translation reliability calculation means calculates a distance for each combination of the input sentence after the synonym expansion and the reverse translation result, and calculates the translation reliability based on the calculated distance. The reliability calculation apparatus according to item 1.

A reliability calculation apparatus that uses a text that is a result of recognition processing for a sentence input by a method other than text input as an input sentence to the translation means,
The synonym expansion means expands synonyms only for words or word strings that are commonly held in a recognition dictionary used for recognition processing and a translation dictionary used for translation processing and reverse translation processing. Reliability calculation device.

The input sentence is translated into a second language that is different from the first language that is the language of the input sentence,
For the translation result obtained as a result of the translation process, perform a reverse translation process to the first language that is the language of the input sentence,
Based on the distance between the input sentence and the reverse translation result obtained as a result of the reverse translation process, the translation reliability is calculated,
When presenting to the user at least one or more of the input sentence, translation result and reverse translation result together with the reliability corresponding to the input sentence, translation result or reverse translation result, the translation reliability is below a predetermined threshold A translation reliability calculation and utilization method, wherein at least the input sentence or the reverse translation result is presented to the user who has input the first language together with the translation reliability.

On the computer,
A translation process for converting an input sentence written in the first language into a second language, which is a language different from the first language,
A reverse translation process for converting a translation result obtained as a result of the translation process into a first language that is a language of an input sentence;
A translation reliability calculation process for calculating a translation reliability based on a distance between the input sentence and a reverse translation result obtained as a result of the reverse translation process; and at least one of the input sentence, the translation result, and the reverse translation result The above process is executed to provide the user with the reliability corresponding to the input sentence, the translation result, or the reliability corresponding to the reverse translation result,
In the reliability presentation process, when the translation reliability is equal to or lower than a predetermined threshold, at least an input sentence or a reverse translation result is presented to a user who inputs the first language together with the translation reliability. Translation engine program.