JP2001222531A

JP2001222531A - Voice translation device and computer readable recording medium with recorded voice translation processing program with feedback function

Info

Publication number: JP2001222531A
Application number: JP2000030810A
Authority: JP
Inventors: Atsushi Nishino; 敦士西野; Singer Harald; ハラルド・シンガー; Gruhn Rainer; ライナー・グルーン; Masaki Naito; 正樹内藤
Original assignee: ATR Interpreting Telecommunications Research Laboratories
Current assignee: ATR Interpreting Telecommunications Research Laboratories
Priority date: 2000-02-08
Filing date: 2000-02-08
Publication date: 2001-08-17

Abstract

PROBLEM TO BE SOLVED: To provide a voice translation device, with which a source language input person can easily confirm whether contents transmitted to a party are right or not. SOLUTION: This device is provided with a first voice translation processing means for converting a voice in the source language inputted by the source language input person to a synthetic voice in a target language and a second voice translation processing means for converting the synthetic voice in the target language provided by the first voice translation processing means to a synthetic voice in the source language again and feeding it back to the source language input person.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、音声翻訳装置お
よびフィードバック機能付き音声翻訳処理プログラムを
記録したコンピュータ読み取り可能な記録媒体に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech translation apparatus and a computer-readable recording medium recording a speech translation processing program with a feedback function.

【０００２】[0002]

【従来の技術】ある言語の音声を他の言語の音声に変換
するための装置として、たとえばATR-MATRIXのような音
声翻訳装置が既に開発されている。ATR-MATRIXについて
は、T.Takezawa, T.Morimoto, Y.Sagisaka, N.Campbel
l, H.Iida, F.Sugaya, A.Yokoo,and S.Yamamoto. A Jap
anese-to-English speech translation system: ATR-MA
TRIX. In Proc. ICSLP, pages 957-960,1998を参照のこ
と。2. Description of the Related Art A speech translator such as ATR-MATRIX has already been developed as a device for converting speech in one language into speech in another language. About ATR-MATRIX, T. Takezawa, T. Morimoto, Y. Sagisaka, N. Campbel
l, H. Iida, F. Sugaya, A. Yokoo, and S. Yamamoto. A Jap
anese-to-English speech translation system: ATR-MA
See TRIX. In Proc. ICSLP, pages 957-960,1998.

【０００３】音声翻訳装置は、原言語入力者（発声者）
によって入力された原言語音声をテキストに変換する音
声認識処理部、音声認識処理部によって得られた原言語
のテキストを目的言語のテキストに変換する言語翻訳処
理部および言語翻訳処理部によって得られた目的言語の
テキストを目的言語音声に変換して相手に伝える音声合
成処理部を備えている。[0003] A speech translator is a source language input person (speaker).
A speech recognition processing unit that converts the source language speech input into text into text, a language translation processing unit that converts the source language text obtained by the speech recognition processing unit into text in the target language, and a language translation processing unit A speech synthesis processing unit is provided which converts text in the target language into target language voice and transmits it to the other party.

【０００４】この明細書においては、音声翻訳装置に
は、原言語入力者によって入力された原言語音声をテキ
ストを目的言語のテキストに変換する言語翻訳処理部お
よび言語翻訳処理部によって得られた目的言語のテキス
トを目的言語音声に変換して相手に伝える音声合成処理
部を備えたものも含まれる。[0004] In this specification, a speech translation apparatus includes a language translation processing unit for converting a source language speech input by a source language input person into text in a target language, and a target obtained by the language translation processing unit. Some include a voice synthesis processing unit that converts a text in a language into a target language voice and transmits it to the other party.

【０００５】このような音声翻訳装置において、相手に
伝えられた内容が正しいか否かを確認する方法として、
音声認識処理部によって得られた原言語のテキストを原
言語入力者にフィードバックする方法が考えられる。ま
た、音声合成処理部によって得られた目的言語音声を原
言語入力者にフィードバックする方法が考えられる。[0005] In such a speech translation apparatus, as a method for confirming whether or not the content transmitted to the other party is correct,
A method of feeding back the source language text obtained by the speech recognition processing unit to the source language input person is considered. Further, a method of feeding back the target language voice obtained by the voice synthesis processing unit to the source language input person is conceivable.

【０００６】しかしながら、音声認識処理部によって得
られた原言語のテキストを原言語入力者にフィードバッ
クする方法では、原言語入力者は、言語翻訳処理部に正
しい情報が入力されたか否かを確認することができる
が、原言語のテキストが正しく翻訳されたか否かを確認
することができないという問題がある。However, in the method of feeding back the source language text obtained by the speech recognition processing unit to the source language input unit, the source language input unit checks whether correct information has been input to the language translation processing unit. However, there is a problem that it is not possible to confirm whether the text in the source language has been correctly translated.

【０００７】また、音声合成処理部によって得られた目
的言語音声を原言語入力者にフィードバックする方法で
は、原言語入力者が目的言語を理解できない場合には、
相手に伝えられた内容が正しいか否かを認識することが
できないという問題がある。In the method of feeding back the target language speech obtained by the speech synthesis processing section to the source language input person, if the source language input person cannot understand the target language,
There is a problem that it is not possible to recognize whether the content transmitted to the other party is correct.

【０００８】[0008]

【発明が解決しようとする課題】この発明は、原言語入
力者が入力した原言語の内容が正しく翻訳されたか否か
を原言語入力者が容易に確認できるようになる音声翻訳
装置およびフィードバック機能付き音声翻訳処理プログ
ラムを記録したコンピュータ読み取り可能な記録媒体を
提供することを目的とする。SUMMARY OF THE INVENTION The present invention relates to a speech translation apparatus and a feedback function which enable a source language input user to easily confirm whether or not the contents of the source language input by the source language input user have been correctly translated. It is an object of the present invention to provide a computer-readable recording medium on which an attached speech translation processing program is recorded.

【０００９】[0009]

【課題を解決するための手段】この発明による第１の音
声翻訳装置は、原言語入力者によって入力される原言語
の音声を目的言語の合成音声に変換する第１の音声翻訳
処理手段、および第１の音声翻訳処理手段によって得ら
れた目的言語の合成音声を再び原言語の合成音声に変換
して原言語入力者にフィードバックさせるための第２の
音声翻訳処理手段を備えていることを特徴とする。A first speech translation apparatus according to the present invention comprises: first speech translation processing means for converting a source language speech input by a source language input person into a synthesized speech of a target language; and A second speech translation processing means is provided for converting the synthesized speech of the target language obtained by the first speech translation processing means into the synthesized speech of the original language again and feeding it back to the source language input person. And

【００１０】この発明による第１のフィードバック機能
付き音声翻訳処理プログラムを記録したコンピュータ読
み取り可能な記録媒体は、原言語入力者によって入力さ
れる原言語の音声を目的言語の合成音声に変換する第１
の音声翻訳処理手段、および第１の音声翻訳処理手段に
よって得られた目的言語の合成音声を再び原言語の合成
音声に変換して原言語入力者にフィードバックさせるた
めの第２の音声翻訳処理手段としてコンピュータを機能
させるためのフィードバック機能付き音声翻訳処理プロ
グラムを記録していることを特徴とする。A computer-readable recording medium storing a first speech translation processing program with a feedback function according to the present invention converts a source language voice input by a source language input person into a synthesized voice of a target language.
And the second speech translation processing means for converting the synthesized speech of the target language obtained by the first speech translation processing means into the synthesized speech of the original language again and feeding it back to the inputting person of the original language And a voice translation processing program with a feedback function for causing a computer to function.

【００１１】この発明による第１の音声翻訳装置または
第１の記録媒体において、第１の音声翻訳処理手段とし
ては、たとえば、原言語入力者によって入力される原言
語音声を原言語のテキストに変換する第１の音声認識処
理手段、第１の音声認識処理手段によって得られた原言
語のテキストを目的言語のテキストに変換する第１の言
語翻訳処理手段および第１の言語翻訳処理手段によって
得られた目的言語のテキストを目的言語の合成音声に変
換する第１の音声合成処理手段を備えているものが用い
られる。In the first speech translation device or the first recording medium according to the present invention, the first speech translation processing means converts, for example, a source language speech input by a source language input person into a source language text. First speech recognition processing means, a first language translation processing means for converting a source language text obtained by the first speech recognition processing means into a target language text, and a first language translation processing means. What is provided with the first speech synthesis processing means for converting the text of the target language into the synthesized speech of the target language.

【００１２】第２の音声翻訳処理手段としては、たとえ
ば、第１の音声翻訳処理手段内の第１の音声合成処理手
段によって得られた目的言語の合成音声を目的言語のテ
キストに変換する第２の音声認識処理手段、第２の音声
認識処理手段によって得られた目的言語のテキストを原
言語のテキストに変換する第２の言語翻訳処理手段およ
び第２の言語翻訳処理手段によって得られた原言語のテ
キストを原言語の合成音声に変換して原言語入力者にフ
ィードバックする第２の音声合成処理手段を備えている
ものが用いられる。As the second speech translation processing means, for example, the second speech translation processing means converts the synthesized speech of the target language obtained by the first speech synthesis processing means in the first speech translation processing means into a text of the target language. Second language translation processing means for converting the target language text obtained by the second speech recognition processing means into the original language text, and the source language obtained by the second language translation processing means And a second speech synthesis unit that converts the text into a synthesized speech in the source language and feeds it back to the input user of the source language.

【００１３】この発明による第２の音声翻訳装置は、原
言語入力者によって入力される原言語の音声を目的言語
の合成音声に変換する第１の音声翻訳処理手段、第１の
音声翻訳処理手段によって得られた目的言語の合成音声
を原言語のテキストに変換する第２の音声翻訳処理手
段、および第２の音声翻訳処理手段によって得られた原
言語のテキストを表示器に表示させて原言語入力者にフ
ィードバックさせるための手段を備えていることを特徴
とする。A second speech translation device according to the present invention comprises: first speech translation processing means for converting source language speech input by a source language input person into synthesized speech of a target language; first speech translation processing means Second speech translation processing means for converting the synthesized speech of the target language obtained by the above into text of the source language, and displaying the source language text obtained by the second speech translation processing means on a display device to display the source language It is characterized by having means for feeding back to the input person.

【００１４】この発明による第２のフィードバック機能
付き音声翻訳処理プログラムを記録したコンピュータ読
み取り可能な記録媒体は、原言語入力者によって入力さ
れる原言語の音声を目的言語の合成音声に変換する第１
の音声翻訳処理手段、第１の音声翻訳処理手段によって
得られた目的言語の合成音声を原言語のテキストに変換
する第２の音声翻訳処理手段、および第２の音声翻訳処
理手段によって得られた原言語のテキストを表示器に表
示させて原言語入力者にフィードバックさせるための手
段としてコンピュータを機能させるためのフィードバッ
ク機能付き音声翻訳処理プログラムを記録していること
を特徴とする。A computer-readable recording medium having recorded thereon a second speech translation processing program with a feedback function according to the present invention converts a source language speech input by a source language input user into a synthesized speech of a target language.
, The second speech translation processing means for converting the synthesized speech of the target language obtained by the first speech translation processing means into the text of the source language, and the second speech translation processing means A speech translation processing program with a feedback function for causing a computer to function as a means for displaying a source language text on a display and feeding back the source language input person is recorded.

【００１５】この発明による第２の音声翻訳装置または
第２の記録媒体において、第１の音声翻訳処理手段とし
ては、たとえば、原言語入力者によって入力される原言
語音声を原言語のテキストに変換する第１の音声認識処
理手段、第１の音声認識処理手段によって得られた原言
語のテキストを目的言語のテキストに変換する第１の言
語翻訳処理手段および第１の言語翻訳処理手段によって
得られた目的言語のテキストを目的言語の合成音声に変
換する音声合成処理手段を備えているものが用いられ
る。[0015] In the second speech translation device or the second recording medium according to the present invention, the first speech translation processing means converts, for example, a source language speech input by a source language input user into a source language text. First speech recognition processing means, a first language translation processing means for converting a source language text obtained by the first speech recognition processing means into a target language text, and a first language translation processing means. Further, a device having speech synthesis processing means for converting the text of the target language into a synthesized speech of the target language is used.

【００１６】第２の音声翻訳処理手段としては、たとえ
ば、第１の音声翻訳処理手段内の音声合成処理手段によ
って得られた目的言語の合成音声を目的言語のテキスト
に変換する第２の音声認識処理手段および第２の音声認
識処理手段によって得られた目的言語のテキストを原言
語のテキストに変換する第２の言語翻訳処理手段を備え
ているものが用いられる。As the second speech translation processing means, for example, a second speech recognition for converting a synthesized speech of the target language obtained by the speech synthesis processing means in the first speech translation processing means into a text of the target language. The one having a second language translation processing means for converting the target language text obtained by the processing means and the second speech recognition processing means into the original language text is used.

【００１７】この発明による第３の音声翻訳装置は、原
言語入力者によって入力される原言語のテキストを目的
言語の合成音声に変換する第１の音声翻訳処理手段、お
よび第１の音声翻訳処理手段によって得られた目的言語
の合成音声を原言語の合成音声に変換して原言語入力者
にフィードバックさせるための第２の音声翻訳処理手段
を備えていることを特徴とする。A third speech translation apparatus according to the present invention comprises: first speech translation processing means for converting a source language text input by a source language input person into a synthesized speech of a target language; and a first speech translation process. And a second speech translation processing means for converting the synthesized speech of the target language obtained by the means into the synthesized speech of the original language and feeding it back to the original language input user.

【００１８】この発明による第３のフィードバック機能
付き音声翻訳処理プログラムを記録したコンピュータ読
み取り可能な記録媒体は、原言語入力者によって入力さ
れる原言語のテキストを目的言語の合成音声に変換する
第１の音声翻訳処理手段、および第１の音声翻訳処理手
段によって得られた目的言語の合成音声を原言語の合成
音声に変換して原言語入力者にフィードバックさせるた
めの第２の音声翻訳処理手段としてコンピュータを機能
させるためのフィードバック機能付き音声翻訳処理プロ
グラムを記録していることを特徴とする。A computer-readable recording medium storing a third speech translation processing program with a feedback function according to the present invention converts a source language text input by a source language input user into a synthesized speech of a target language. And the second speech translation processing means for converting the synthesized speech of the target language obtained by the first speech translation processing means into the synthesized speech of the original language and feeding back to the original language input person A speech translation processing program with a feedback function for causing a computer to function is recorded.

【００１９】この発明による第３の音声翻訳装置または
第３の記録媒体において、第１の音声翻訳処理手段とし
ては、たとえば、原言語入力者によって入力される原言
語のテキストを目的言語のテキストに変換する第１の言
語翻訳処理手段および第１の言語翻訳処理手段によって
得られた目的言語のテキストを目的言語の合成音声に変
換する第１の音声合成処理手段を備えているものが用い
られる。In the third speech translation apparatus or the third recording medium according to the present invention, the first speech translation processing means converts, for example, a source language text input by a source language input person into a target language text. The first language translation processing means for conversion and the first language translation processing means for converting the text of the target language obtained by the first language translation processing means into the synthesized speech of the target language are used.

【００２０】第２の音声翻訳処理手段としては、たとえ
ば、第１の音声翻訳処理手段内の第１の音声合成処理手
段によって得られた目的言語の合成音声を目的言語のテ
キストに変換する音声認識処理手段、音声認識処理手段
によって得られた目的言語のテキストを原言語のテキス
トに変換する第２の言語翻訳処理手段および第２の言語
翻訳処理手段によって得られた原言語のテキストを原言
語の合成音声に変換して原言語入力者にフィードバック
する第２の音声合成処理手段を備えているものが用いら
れる。As the second speech translation processing means, for example, speech recognition for converting a synthesized speech of the target language obtained by the first speech synthesis processing means in the first speech translation processing means into a text of the target language. Processing means, second language translation processing means for converting the text in the target language obtained by the speech recognition processing means into text in the source language, and the source language text obtained by the second language translation processing means in the source language. The one provided with the second speech synthesis processing means for converting the speech into synthesized speech and feeding it back to the source language input user is used.

【００２１】この発明による第４の音声翻訳装置は、原
言語入力者によって入力される原言語の音声を目的言語
の合成音声に変換する第１の音声翻訳処理手段、および
第１の音声翻訳処理手段による音声翻訳処理の中間段階
で得られる目的言語のテキストを原言語の合成音声に変
換して原言語入力者にフィードバックさせるための第２
の音声翻訳処理手段を備えていることを特徴とする。A fourth speech translation apparatus according to the present invention comprises: first speech translation processing means for converting source language speech input by a source language input person into synthesized speech of a target language; and first speech translation processing. Means for converting the text of the target language obtained in the intermediate stage of the speech translation processing by the means into synthesized speech of the source language and feeding it back to the source language input person
Is provided.

【００２２】この発明による第４のフィードバック機能
付き音声翻訳処理プログラムを記録したコンピュータ読
み取り可能な記録媒体は、原言語入力者によって入力さ
れる原言語の音声を目的言語の合成音声に変換する第１
の音声翻訳処理手段、および第１の音声翻訳処理手段に
よる音声翻訳処理の中間段階で得られる目的言語のテキ
ストを原言語の合成音声に変換して原言語入力者にフィ
ードバックさせるための第２の音声翻訳処理手段として
コンピュータを機能させるためのフィードバック機能付
き音声翻訳処理プログラムを記録していることを特徴と
する。A computer-readable recording medium storing a fourth speech translation processing program with a feedback function according to the present invention converts a source language speech input by a source language input person into a synthesized speech of a target language.
And a second language for converting the text of the target language obtained in the intermediate stage of the speech translation processing by the first speech translation processing means into the synthesized speech of the source language and feeding it back to the source language input user. A speech translation processing program with a feedback function for causing a computer to function as speech translation processing means is recorded.

【００２３】この発明による第４の音声翻訳装置または
第４の記録媒体において、第１の音声翻訳処理手段とし
ては、たとえば、原言語入力者によって入力される原言
語音声を原言語のテキストに変換する音声認識処理手
段、音声認識処理手段によって得られた原言語のテキス
トを目的言語のテキストに変換する第１の言語翻訳処理
手段および第１の言語翻訳処理手段によって得られた目
的言語のテキストを目的言語の合成音声に変換する第１
の音声合成処理手段を備えているものが用いられる。[0023] In the fourth speech translation apparatus or the fourth recording medium according to the present invention, the first speech translation processing means converts, for example, a source language speech input by a source language input person into a source language text. Speech recognition processing means, first language translation processing means for converting source language text obtained by the speech recognition processing means into text of the target language, and text in the target language obtained by the first language translation processing means. First to convert to synthesized speech in target language
Is used.

【００２４】第２の音声翻訳処理手段としては、たとえ
ば、第１の音声翻訳処理手段内の第１の言語翻訳処理手
段によって得られた目的言語のテキストを原言語のテキ
ストに変換する第２の言語翻訳処理手段および第２の言
語翻訳処理手段によって得られた原言語のテキストを原
言語の合成音声に変換して原言語入力者にフィードバッ
クする第２の音声合成処理手段を備えているものが用い
られる。As the second speech translation processing means, for example, the second speech translation processing means converts the target language text obtained by the first language translation processing means into the source language text. One having a second speech synthesis processing means for converting the source language text obtained by the language translation processing means and the second language translation processing means into a synthesized speech of the source language and feeding back to the source language input user. Used.

【００２５】この発明による第５の音声翻訳装置は、原
言語入力者によって入力される原言語の音声を目的言語
の合成音声に変換する第１の音声翻訳処理手段、第１の
音声翻訳処理手段による音声翻訳処理の中間段階で得ら
れる目的言語のテキストを原言語のテキストに変換する
第２の音声翻訳処理手段、および第２の音声翻訳処理手
段によって得られた原言語のテキストを表示器に表示さ
せて原言語入力者にフィードバックさせるための手段を
備えていることを特徴とする。A fifth speech translation apparatus according to the present invention comprises: first speech translation processing means for converting source language speech input by a source language input user into synthesized speech of a target language; Speech translation processing means for converting the target language text obtained in the intermediate stage of the speech translation processing by the first language into text in the source language, and displaying the source language text obtained by the second speech translation processing means on the display It is characterized by having means for displaying and providing feedback to the source language input user.

【００２６】この発明による第５のフィードバック機能
付き音声翻訳処理プログラムを記録したコンピュータ読
み取り可能な記録媒体は、原言語入力者によって入力さ
れる原言語の音声を目的言語の合成音声に変換する第１
の音声翻訳処理手段、第１の音声翻訳処理手段による音
声翻訳処理の中間段階で得られる目的言語のテキストを
原言語のテキストに変換する第２の音声翻訳処理手段、
および第２の音声翻訳処理手段によって得られた原言語
のテキストを表示器に表示させて原言語入力者にフィー
ドバックさせるための手段としてコンピュータを機能さ
せるためのフィードバック機能付き音声翻訳処理プログ
ラムを記録していることを特徴とする。A computer readable recording medium storing a fifth speech translation processing program with a feedback function according to the present invention converts a source language speech input by a source language input user into a synthesized speech of a target language.
A second speech translation processing means for converting a target language text obtained at an intermediate stage of the speech translation processing by the first speech translation processing means into a source language text;
And a speech translation processing program with a feedback function for causing a computer to function as a means for displaying the text in the source language obtained by the second speech translation processing means on a display device and feeding back the text to the input user of the source language. It is characterized by having.

【００２７】この発明による第５の音声翻訳装置または
第５の記録媒体において、第１の音声翻訳処理手段とし
ては、たとえば、原言語入力者によって入力される原言
語音声を原言語のテキストに変換する音声認識処理手
段、音声認識処理手段によって得られた原言語のテキス
トを目的言語のテキストに変換する第１の言語翻訳処理
手段および第１の言語翻訳処理手段によって得られた目
的言語のテキストを目的言語の合成音声に変換する音声
合成処理手段を備えているものが用いられる。In the fifth speech translation apparatus or the fifth recording medium according to the present invention, the first speech translation processing means converts, for example, a source language speech input by a source language input person into a source language text. Speech recognition processing means, first language translation processing means for converting source language text obtained by the speech recognition processing means into text of the target language, and text in the target language obtained by the first language translation processing means. The one having speech synthesis processing means for converting into a synthesized speech of the target language is used.

【００２８】第２の音声翻訳処理手段としては、たとえ
ば、第１の音声翻訳処理手段内の第１の言語翻訳処理手
段によって得られた目的言語のテキストを原言語のテキ
ストに変換する第２の言語翻訳処理手段を備えているも
のが用いられる。As the second speech translation processing means, for example, the second speech translation processing means converts the target language text obtained by the first language translation processing means into the source language text. The one provided with the language translation processing means is used.

【００２９】この発明による第６の音声翻訳装置は、原
言語入力者によって入力される原言語のテキストを目的
言語の合成音声に変換する第１の音声翻訳処理手段、お
よび第１の音声翻訳処理手段による音声翻訳処理の中間
段階で得られる目的言語のテキストを原言語の合成音声
に変換して原言語入力者にフィードバックさせるための
第２の音声翻訳処理手段を備えていることを特徴とす
る。A sixth speech translation apparatus according to the present invention comprises: first speech translation processing means for converting a source language text input by a source language input person into a synthesized speech of a target language; and a first speech translation process. And a second speech translation processing means for converting the text of the target language obtained in the intermediate stage of the speech translation processing by the means into the synthesized speech of the source language and feeding it back to the source language input person. .

【００３０】この発明による第６のフィードバック機能
付き音声翻訳処理プログラムを記録したコンピュータ読
み取り可能な記録媒体は、原言語入力者によって入力さ
れる原言語のテキストを目的言語の合成音声に変換する
第１の音声翻訳処理手段、および第１の音声翻訳処理手
段による音声翻訳処理の中間段階で得られる目的言語の
テキストを原言語の合成音声に変換して原言語入力者に
フィードバックさせるための第２の音声翻訳処理手段と
してコンピュータを機能させるためのフィードバック機
能付き音声翻訳処理プログラムを記録していることを特
徴とする。A computer-readable recording medium storing a sixth speech translation processing program with a feedback function according to the present invention converts a source language text input by a source language input user into a synthesized speech of a target language. And a second language for converting the text of the target language obtained in the intermediate stage of the speech translation processing by the first speech translation processing means into the synthesized speech of the source language and feeding it back to the source language input user. A speech translation processing program with a feedback function for causing a computer to function as speech translation processing means is recorded.

【００３１】この発明による第６の音声翻訳装置または
第６の記録媒体において、第１の音声翻訳処理手段とし
ては、たとえば、原言語入力者によって入力される原言
語のテキストを目的言語のテキストに変換する第１の言
語翻訳処理手段および第１の言語翻訳処理手段によって
得られた目的言語のテキストを目的言語の合成音声に変
換する第１の音声合成処理手段を備えているものが用い
られる。In the sixth speech translating apparatus or the sixth recording medium according to the present invention, the first speech translation processing means converts, for example, a source language text input by a source language input person into a target language text. The first language translation processing means for conversion and the first language translation processing means for converting the text of the target language obtained by the first language translation processing means into the synthesized speech of the target language are used.

【００３２】第２の音声翻訳処理手段としては、たとえ
ば、第１の音声翻訳処理手段内の第１の言語翻訳処理手
段によって得られた目的言語のテキストを原言語のテキ
ストに変換する第２の言語翻訳処理手段および第２の言
語翻訳処理手段によって得られた原言語のテキストを原
言語の合成音声に変換して原言語入力者にフィードバッ
クする第２の音声合成処理手段を備えているものが用い
られる。As the second speech translation processing means, for example, the second speech translation processing means converts the target language text obtained by the first language translation processing means into the source language text. One having a second speech synthesis processing means for converting the source language text obtained by the language translation processing means and the second language translation processing means into a synthesized speech of the source language and feeding back to the source language input user. Used.

【００３３】[0033]

【発明の実施の形態】以下、図面を参照して、この発明
の実施の形態について説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００３４】図１は、音声翻訳装置の構成を示してい
る。FIG. 1 shows the configuration of the speech translation apparatus.

【００３５】音声翻訳装置は、原言語入力者１００から
発声される原言語の音声を目的言語の音声に変換して相
手２００に伝えるための第１の音声翻訳処理部１０と、
第１の音声翻訳処理部１０によって得られた目的言語の
音声を原言語の音声に変換して原言語入力者１００にフ
ィードバックするための第２の音声翻訳処理部２０とを
備えている。ここでは、説明の便宜上、原言語が日本語
であり、目的言語が英語であるとして、説明する。音声
翻訳装置は、コンピュータとフィードバック機能付き音
声翻訳処理プログラムとによって実現される。フィード
バック機能付き音声翻訳処理プログラムは、たとえば、
ＣＤ等の記憶媒体に格納されて提供される。The speech translation apparatus includes a first speech translation processing unit 10 for converting a speech in the source language uttered from the source language input person 100 into a speech in the target language and transmitting the speech to the other party 200;
A second speech translation processing unit 20 is provided for converting the speech of the target language obtained by the first speech translation processing unit 10 into the speech of the source language and feeding it back to the source language input user 100. Here, for convenience of explanation, the description will be made assuming that the source language is Japanese and the target language is English. The speech translation device is realized by a computer and a speech translation processing program with a feedback function. A speech translation processing program with a feedback function is, for example,
It is provided by being stored in a storage medium such as a CD.

【００３６】第１の音声翻訳処理部１０としては、日英
音声翻訳ＡＴＲ−ＭＡＴＲＩＸ等の日英音声翻訳装置が
用いられ、第２の音声翻訳処理部２０としては、英日音
声翻訳ＡＴＲ−ＭＡＴＲＩＸ等の英日音声翻訳装置が用
いられる。ATR-MATRIXについては、T.Takezawa, T.Mori
moto, Y.Sagisaka, N.Campbell, H.Iida, F.Sugaya,A.Y
okoo, and S.Yamamoto. A Japanese-to-English speech
translation system: ATR-MATRIX. In Proc. ICSLP, p
ages 957-960,1998を参照のこと。As the first speech translation processing unit 10, a Japanese-English speech translation device such as ATR-MATRIX is used, and as the second speech translation processing unit 20, an English-Japanese speech translation ATR-MATRIX is used. And the like. About ATR-MATRIX, T.Takezawa, T.Mori
moto, Y.Sagisaka, N.Campbell, H.Iida, F.Sugaya, AY
okoo, and S. Yamamoto.A Japanese-to-English speech
translation system: ATR-MATRIX. In Proc. ICSLP, p
See ages 957-960,1998.

【００３７】第１の音声翻訳処理部１０は、原言語入力
者１００から発声される日本語発声音声を日本語のテキ
ストに変換する音声認識処理部１１、音声認識処理部１
１によって得られた日本語のテキストを英語のテキスト
に変換する言語翻訳処理部１２および言語翻訳処理部１
２によって得られた英語のテキストを英語合成音声に変
換して相手２００に伝える音声合成処理部１３を備えて
いる。The first speech translation processing unit 10 includes a speech recognition processing unit 11 that converts a Japanese uttered speech uttered by the source language input user 100 into a Japanese text, and a speech recognition processing unit 1.
Translation processing unit 12 and language translation processing unit 1 for converting the Japanese text obtained in step 1 into English text
2 is provided with a speech synthesis processing unit 13 that converts the English text obtained in Step 2 into English synthesized speech and transmits it to the other party 200.

【００３８】第２の音声翻訳処理部２０は、第１の音声
翻訳処理部１０によって得られた英語合成音声を英語の
テキストに変換する音声認識処理部２１、音声認識処理
部２１によって得られた英語のテキストを日本語のテキ
ストに変換する言語翻訳処理部２２および言語翻訳処理
部２２によって得られた日本語のテキストを日本語合成
音声に変換して原言語入力者１００にフィードバックす
る音声合成処理部２３を備えている。The second speech translation processing section 20 is obtained by the speech recognition processing section 21 for converting the English synthesized speech obtained by the first speech translation processing section 10 into English text, and obtained by the speech recognition processing section 21. A language translation processing unit 22 that converts an English text into a Japanese text, and a speech synthesis process that converts the Japanese text obtained by the language translation processing unit 22 into a Japanese synthesized speech and feeds it back to the source language input unit 100 A part 23 is provided.

【００３９】原言語入力者１００から発声された日本語
発声音声は、第１の音声翻訳処理部１０によって英語合
成音声に変換されて相手２００に伝えられる。また、第
１の音声翻訳処理部１０によって得られた英語合成音声
は、第２の音声翻訳処理部２０によって日本語合成音声
に変換されて原言語入力者１００にフィードバックされ
る。このため、原言語入力者１００は、相手に伝えられ
た内容が正しいか否かを第２の音声翻訳処理部２０によ
ってフィードバックされた日本語合成音声に基づいて確
認することができるようになる。The Japanese uttered voice uttered by the source language input user 100 is converted into an English synthesized voice by the first voice translation processor 10 and transmitted to the other party 200. Further, the English synthesized speech obtained by the first speech translation processing unit 10 is converted into a Japanese synthesized speech by the second speech translation processing unit 20 and fed back to the source language input user 100. For this reason, the source language input user 100 can confirm whether the content transmitted to the other party is correct based on the Japanese synthesized speech fed back by the second speech translation processing unit 20.

【００４０】なお、図１に破線Ａで示すように、第１の
音声翻訳処理部１０内の言語翻訳処理部１２によって得
られた英語のテキストを、直接、第２の音声翻訳処理部
２０内の言語翻訳処理部２２に入力させるようにしても
よい。この場合には、第２の音声翻訳処理部２０内の言
語翻訳処理部２２は、第１の音声翻訳処理部１０内の言
語翻訳処理部１２によって得られた英語のテキストを、
日本語のテキストに変換する。言語翻訳処理部２２によ
って得られた日本語のテキストは、第２の音声翻訳処理
部２０内の音声合成処理部２３によって日本語合成音声
に変換されて原言語入力者１００にフィードバックされ
る。この場合には、第２の音声翻訳処理部２０は、音声
認識処理部２１を備えていなくてもよい。As shown by a broken line A in FIG. 1, the English text obtained by the language translation processing unit 12 in the first speech translation processing unit 10 is directly transmitted to the second speech translation processing unit 20. May be input to the language translation processing unit 22. In this case, the language translation processing unit 22 in the second speech translation processing unit 20 converts the English text obtained by the language translation processing unit 12 in the first speech translation processing unit 10 into
Convert to Japanese text. The Japanese text obtained by the language translation processing unit 22 is converted into Japanese synthesized speech by the speech synthesis processing unit 23 in the second speech translation processing unit 20 and fed back to the source language input user 100. In this case, the second speech translation processing unit 20 may not include the speech recognition processing unit 21.

【００４１】上記実施の形態によれば、原言語入力者１
００は、相手に伝えようとする内容を含んだ日本語発声
音声を音声認識処理部１１に入力した後、音声合成処理
部２３からフィードバックされる日本語合成音声を聞く
ことにより、自分が伝えようとした内容が相手に正しく
伝わっているかどうかを容易に確認することができる。According to the above embodiment, source language input person 1
00, after inputting the Japanese uttered voice including the content to be transmitted to the other party to the voice recognition processing unit 11 and listening to the synthesized Japanese voice fed back from the voice synthesis processing unit 23, It is possible to easily confirm whether or not the contents described are correctly transmitted to the other party.

【００４２】上記実施の形態では、原言語が日本語であ
り、目的言語が英語の場合について説明したが、原言語
と目的言語とのいかなる組み合わせにもこの発明を適用
することができる。In the above embodiment, the case where the source language is Japanese and the target language is English has been described. However, the present invention can be applied to any combination of the source language and the target language.

【００４３】なお、第２の音声翻訳処理部２０内の言語
翻訳処理部２２によって得られた原言語のテキストを、
図１に破線Ｂで示すように表示処理部３１を介して原言
語入力者１００側に設けられた表示器３２に表示するよ
うにしてもよい。この場合には、第２の音声翻訳処理部
２０内の音声合成処理部２３は無くてもよい。Note that the source language text obtained by the language translation processing unit 22 in the second speech translation processing unit 20 is
As shown by a dashed line B in FIG. 1, the information may be displayed on a display 32 provided on the source language input user 100 side via a display processing unit 31. In this case, the speech synthesis processing unit 23 in the second speech translation processing unit 20 may be omitted.

【００４４】また、この発明は、原言語入力者１００が
音声ではなく原言語のテキストをキーボード等から入力
し、入力された原言語のテキストを言語翻訳処理部で目
的言語のテキストに翻訳し、得られた目的言語のテキス
トを音声合成処理部で合成音声に変換して相手２００に
伝えるような音声翻訳装置において、相手に伝えられた
内容が正しいか否かを、原言語入力者１００が確認する
場合にも適用することができる。Also, according to the present invention, the source language input person 100 inputs text of the source language instead of voice from a keyboard or the like, and translates the input text of the source language into text of the target language by the language translation processing unit. In a speech translating apparatus that converts the obtained target language text into synthesized speech in the speech synthesis processing unit and transmits the synthesized speech to the partner 200, the source language input person 100 checks whether the content transmitted to the partner is correct. It can also be applied to the case.

【００４５】この場合には、図１において、第１の音声
翻訳処理部１０内の音声認識処理１１は不要であり、原
言語入力者１００は、第１の音声翻訳処理部１０内の言
語翻処理部１２に原言語のテキストを直接入力すること
になる。この場合にも、第２の音声翻訳処理部２０内の
言語翻訳処理部２２によって得られた原言語のテキスト
を、図１に破線Ｂで示すように表示処理部３１を介して
原言語入力者１００側に設けられた表示器３２に表示す
るようにしてもよい。この場合には、第２の音声翻訳処
理部２０内の音声合成処理部２３は無くてもよい。In this case, the speech recognition processing 11 in the first speech translation processing unit 10 is unnecessary in FIG. The source language text is directly input to the processing unit 12. In this case as well, the source language text obtained by the language translation processing unit 22 in the second speech translation processing unit 20 is input to the source language input user via the display processing unit 31 as shown by a broken line B in FIG. The information may be displayed on the display 32 provided on the 100 side. In this case, the speech synthesis processing unit 23 in the second speech translation processing unit 20 may be omitted.

【００４６】[0046]

【発明の効果】この発明によれば、原言語入力者が入力
した原言語の内容が正しく翻訳されたか否かを原言語入
力者が容易に確認できるようになる。According to the present invention, the source language input user can easily confirm whether or not the contents of the source language input by the source language input user have been correctly translated.

[Brief description of the drawings]

【図１】音声翻訳装置の構成を示すブロック図である。FIG. 1 is a block diagram illustrating a configuration of a speech translation device.

[Explanation of symbols]

１０、２０音声翻訳処理部（音声翻訳処理手段）１１、２１音声認識処理部（音声認識処理手段）１２、２２言語翻訳処理部（言語翻訳処理手段）１３、２３音声合成処理部（音声合成処理手段）３１表示処理部３２表示器 10, 20 speech translation processing section (speech translation processing means) 11, 21 speech recognition processing section (speech recognition processing means) 12, 22 language translation processing section (language translation processing means) 13, 23 speech synthesis processing section (speech synthesis processing) Means) 31 display processing unit 32 display

───────────────────────────────────────────────────── フロントページの続き (72)発明者ハラルド・シンガー京都府相楽郡精華町大字乾谷小字三平谷５番地株式会社エイ・ティ・アール音声翻訳通信研究所内 (72)発明者ライナー・グルーン京都府相楽郡精華町大字乾谷小字三平谷５番地株式会社エイ・ティ・アール音声翻訳通信研究所内 (72)発明者内藤正樹京都府相楽郡精華町大字乾谷小字三平谷５番地株式会社エイ・ティ・アール音声翻訳通信研究所内Ｆターム(参考） 5B091 AA05 AA06 BA03 CB12 CB32 CD11 EA14 5D015 AA01 KK03 KK04 5D045 AB01 AB03 ──────────────────────────────────────────────────の Continuing on the front page (72) Inventor Harald Singer 5 Shiraya, Inaya, Koika-cho, Soraku-cho, Kyoto Prefecture AT / IR Speech Translation Research Institute, Inc. (72) Inventor Rainer-Green Kyoto 5, Shiragaya, Sayaka-cho, Soraku-gun, Fukuoka, Japan 5 Shiroya, Aitani Speech Translation Research Institute, Inc.・ F-term in Earl Speech Translation Communication Laboratory (reference) 5B091 AA05 AA06 BA03 CB12 CB32 CD11 EA14 5D015 AA01 KK03 KK04 5D045 AB01 AB03

Claims

[Claims]

1. A first speech translation processing means for converting a speech of a source language inputted by a source language input person into a synthesized speech of a target language, and a synthesis of a target language obtained by the first speech translation processing means A speech translation device comprising second speech translation processing means for converting speech again into synthesized speech in the source language and feeding it back to the source language input user.

2. The first speech translation processing means, comprising: first speech recognition processing means for converting a source language speech input by a source language input person into text in a source language; First language translation processing means for converting the obtained source language text into text in the target language, and first speech for converting the text in the target language obtained by the first language translation processing means into a synthesized speech in the target language The second speech translation processing means converts the synthesized speech of the target language obtained by the first speech synthesis processing means in the first speech translation processing means into text of the target language. A second speech recognition processing means for converting the target language text obtained by the second speech recognition processing means into a source language text;
And a second speech synthesis processing means for converting the text of the source language obtained by the language translation processing means and the second language translation processing means into a synthesized speech of the source language and feeding back the synthesized speech to the input user of the source language. Item 2. The speech translation device according to item 1.

3. A first speech translation processing means for converting a speech of a source language inputted by a source language input person into a synthesized speech of a target language, a synthesized speech of the target language obtained by the first speech translation processing means A second speech translation processing means for converting the source language text into a source language text, and means for displaying the source language text obtained by the second speech translation processing means on a display device to feed back to the source language input person; Speech translation device provided with.

4. The first speech translation processing means converts the source language speech input by the source language input person into text in the source language, and obtains the speech by the first speech recognition processing means. First language translation processing means for converting the obtained source language text into target language text, and speech synthesis processing means for converting the target language text obtained by the first language translation processing means into synthesized speech of the target language The second speech translation processing means converts the synthesized speech of the target language obtained by the speech synthesis processing means in the first speech translation processing means into text of the target language. 4. A speech translation apparatus according to claim 3, further comprising a second language translation processing means for converting the target language text obtained by the means and the second speech recognition processing means into a source language text.

5. A first speech translation processing means for converting a source language text input by a source language input person into a synthesized speech of a target language, and a synthesis of a target language obtained by the first speech translation processing means. Second speech translation processing means for converting speech into synthesized speech in the source language and feeding it back to the source language input user.

6. The first speech translation processing means comprising: a first language translation processing means for converting a source language text input by a source language input person into a target language text; First speech synthesis processing means for converting the obtained target language text into synthesized speech in the target language, and the second speech translation processing means comprises a first speech translation processing means in the first speech translation processing means. Speech recognition processing means for converting synthesized speech of the target language obtained by the synthesis processing means into text of the target language, and second language translation for converting text of the target language obtained by the speech recognition processing means into text of the original language Processing means and second speech synthesis processing means for converting the text in the source language obtained by the second language translation processing means into synthesized speech in the source language and feeding it back to the input user of the source language. For example it has speech translation apparatus according to claim 5.

7. A first speech translation processing means for converting a speech of a source language inputted by a source language input person into a synthesized speech of a target language, and an intermediate stage of speech translation processing by the first speech translation processing means. A speech translation device comprising second speech translation processing means for converting the obtained target language text into synthesized speech in the source language and feeding it back to the source language input user.

8. A first speech translation processing means, a speech recognition processing means for converting a source language speech input by a source language input person into a source language text, a source language text obtained by the speech recognition processing means. First language translation processing means for converting the target language into text in the target language, and first speech synthesis processing means for converting the target language text obtained by the first language translation processing means into synthesized speech in the target language. The second speech translation processing means converts the target language text obtained by the first language translation processing means in the first speech translation processing means into a source language text. And a second speech synthesis processing means for converting the text in the source language obtained by the second language translation processing means into synthesized speech in the source language and feeding back to the source language input user. The speech translation device according to claim 7.

9. A first speech translation processing means for converting a speech of a source language inputted by a source language input person into a synthesized speech of a target language, obtained in an intermediate stage of speech translation processing by the first speech translation processing means. Second speech translation processing means for converting the target language text into the source language text, and displaying the source language text obtained by the second speech translation processing means on a display device and feeding back the text to the source language input user Means for causing the speech to be translated.

10. A first speech translation processing means, a speech recognition processing means for converting a source language speech input by a source language input person into a source language text, and a source language text obtained by the speech recognition processing means. First language translation processing means for converting the target language into text in the target language, and speech synthesis processing means for converting the text in the target language obtained by the first language translation processing means into synthesized speech in the target language. The second speech translation processing means includes a second language translation processing means for converting the target language text obtained by the first language translation processing means in the first speech translation processing means into a source language text. The speech translation device according to claim 9.

11. A first speech translation processing means for converting a source language text input by a source language input person into a synthesized speech of a target language, and an intermediate stage of speech translation processing by the first speech translation processing means. A speech translation device comprising second speech translation processing means for converting the obtained target language text into synthesized speech in the source language and feeding it back to the source language input user.

12. The first speech translation processing means, comprising: a first language translation processing means for converting a source language text input by a source language input person into a target language text;
First speech synthesis processing means for converting the text in the target language obtained by the language translation processing means into synthesized speech in the target language, and the second speech translation processing means comprises: a first speech translation processing means. The second language translation processing means for converting the target language text obtained by the first language translation processing means into the source language text and the source language text obtained by the second language translation processing means The speech translation apparatus according to claim 11, further comprising a second speech synthesis processing unit that converts the speech into a synthesized speech of a language and feeds back the speech to a source language input user.

13. A computer-readable recording medium on which a speech translation processing program with a feedback function is recorded, wherein the first speech converts a speech of a source language input by a source language input user into a synthesized speech of a target language. Translation processing means, and second speech translation processing means for converting the synthesized speech of the target language obtained by the first speech translation processing means into the synthesized speech of the original language again and feeding it back to the source language input person, A computer-readable recording medium on which a speech translation processing program with a feedback function for causing a computer to function is recorded.

14. A first speech translation processing means, comprising: a first speech recognition processing means for converting a source language speech inputted by a source language input person into a text in a source language; First language translation processing means for converting the obtained source language text into the target language text, and
First speech synthesis processing means for converting the text in the target language obtained by the language translation processing means into synthesized speech in the target language, and the second speech translation processing means comprises: a first speech translation processing means. The second speech recognition processing means for converting the synthesized speech of the target language obtained by the first speech synthesis processing means into the text of the target language, and the text of the target language obtained by the second speech recognition processing means Second to convert to source language text
And a second speech synthesis processing means for converting the text of the source language obtained by the language translation processing means and the second language translation processing means into a synthesized speech of the source language and feeding back the synthesized speech to the input user of the source language. Item 14. A computer-readable recording medium recording the speech translation processing program with a feedback function according to Item 13.

15. A computer-readable recording medium on which a speech translation processing program with a feedback function is recorded, wherein a first speech for converting a speech of a source language input by a source language input person into a synthesized speech of a target language. Translation processing means, second speech translation processing means for converting the synthesized speech of the target language obtained by the first speech translation processing means into text in the source language, and source language obtained by the second speech translation processing means A computer-readable recording medium storing a speech translation processing program with a feedback function for causing a computer to function as a means for causing the text to be displayed on a display device and providing feedback to a source language input user.

16. The first speech translation processing means, comprising: first speech recognition processing means for converting a source language speech input by a source language input person into text in a source language; and first speech translation processing means. First language translation processing means for converting the obtained source language text into the target language text, and
And a speech synthesis processing means for converting the text of the target language obtained by the language translation processing means into a synthesized speech of the target language. The second speech translation processing means comprises: Second speech recognition processing means for converting the synthesized speech of the target language obtained by the synthesis processing means into text of the target language, and converting the text of the target language obtained by the second speech recognition processing means into the text of the original language 16. A computer-readable recording medium storing a speech translation processing program with a feedback function according to claim 15, further comprising a second language translation processing means.

17. A computer-readable recording medium on which a speech translation processing program with a feedback function is recorded, wherein a first speech for converting a source language text input by a source language input user into a synthesized speech of a target language. Translation processing means,
And a second speech translation processing means for converting the synthesized speech of the target language obtained by the first speech translation processing means into the synthesized speech of the original language and feeding it back to the source language input person. A computer-readable recording medium on which a speech translation processing program with a feedback function is recorded.

18. A first speech translation processing means for converting a source language text inputted by a source language input person into a text in a target language, and a first speech translation processing means.
First speech synthesis processing means for converting the text in the target language obtained by the language translation processing means into synthesized speech in the target language, and the second speech translation processing means comprises: a first speech translation processing means. Speech recognition processing means for converting the synthesized speech of the target language obtained by the first speech synthesis processing means into text of the target language, and converting the text of the target language obtained by the speech recognition processing means into the text of the source language Second language translation processing means, and second speech synthesis processing means for converting the source language text obtained by the second language translation processing means into a synthesized speech of the source language and feeding back the synthesized speech to the input user of the source language. A computer-readable recording medium on which the speech translation processing program with a feedback function according to claim 17 is recorded.

19. A computer-readable recording medium on which a speech translation processing program with a feedback function is recorded, wherein a first speech for converting a source language speech input by a source language input person into a synthesized speech of a target language. A second speech translation for converting the target language text obtained in the intermediate stage of the speech translation processing by the translation processing means and the first speech translation processing means into the synthesized speech of the source language and feeding back the text to the source language input person; A computer-readable recording medium on which a speech translation processing program with a feedback function for causing a computer to function as a processing means.

20. First speech translation processing means, speech recognition processing means for converting source language speech input by the source language input person into text in the source language, text in the source language obtained by the speech recognition processing means First language translation processing means for converting the target language into text in the target language, and first speech synthesis processing means for converting the target language text obtained by the first language translation processing means into synthesized speech in the target language. The second speech translation processing means converts the target language text obtained by the first language translation processing means in the first speech translation processing means into a source language text; And second speech synthesis processing means for converting the text in the source language obtained by the second language translation processing means into synthesized speech in the source language and feeding it back to the input user of the source language. Computer readable recording medium recording a feedback function speech translation processing program according to claim 19.

21. A computer-readable recording medium on which a speech translation processing program with a feedback function is recorded, wherein a first speech for converting a speech of a source language input by a source language input user into a synthesized speech of a target language. Translation processing means, second speech translation processing means for converting target language text obtained in an intermediate stage of speech translation processing by the first speech translation processing means into source language text, and second speech translation processing means Means for displaying the obtained source language text on a display and feeding back the source language input person; and a computer-readable recording medium storing a speech translation processing program with a feedback function for causing a computer to function.

22. A first speech translation processing means, a speech recognition processing means for converting a source language speech inputted by a source language input person into a text in the source language, a text in the source language obtained by the speech recognition processing means. First language translation processing means for converting the target language into text in the target language, and speech synthesis processing means for converting the text in the target language obtained by the first language translation processing means into synthesized speech in the target language. The second speech translation processing means includes a second language translation processing means for converting the target language text obtained by the first language translation processing means in the first speech translation processing means into a source language text. A computer-readable recording medium recording the speech translation processing program with a feedback function according to claim 21.

23. A computer-readable recording medium on which a speech translation processing program with a feedback function is recorded, wherein the first speech converts a source language text input by a source language input user into a synthesized speech of a target language. Translation processing means,
And second speech translation processing means for converting the text of the target language obtained at an intermediate stage of the speech translation processing by the first speech translation processing means into the synthesized speech of the source language and feeding back to the source language input person; A computer-readable recording medium on which a speech translation processing program with a feedback function for causing a computer to function is recorded.

24. The first speech translation processing means, comprising: a first language translation processing means for converting a source language text input by a source language input person into a target language text;
First speech synthesis processing means for converting the text in the target language obtained by the language translation processing means into synthesized speech in the target language, and the second speech translation processing means comprises: a first speech translation processing means. The second language translation processing means for converting the target language text obtained by the first language translation processing means into the source language text and the source language text obtained by the second language translation processing means 24. A computer-readable recording medium storing a speech translation processing program with a feedback function according to claim 23, further comprising second speech synthesis processing means for converting the speech into a synthesized speech of a language and feeding it back to a source language input user.