JP2022152805A

JP2022152805A - Simultaneous translation system and method

Info

Publication number: JP2022152805A
Application number: JP2021055726A
Authority: JP
Inventors: 将夫内山; Masao Uchiyama
Original assignee: National Institute of Information and Communications Technology
Current assignee: National Institute of Information and Communications Technology
Priority date: 2021-03-29
Filing date: 2021-03-29
Publication date: 2022-10-12
Also published as: WO2022209448A1

Abstract

To provide a simultaneous translation system and method which can realize simultaneous translation with high reliability.SOLUTION: A simultaneous translation system 50 for translating voice of a first language into a second language, includes: devices 64, 66, 68, and 70 for carrying out voice recognition of a voice signal of the first language, dividing the recognized voice into chunks, and outputting a chunk sequence formed of chunks of the first language; an automatic translation device 72 for automatically translating each of the chunks of the first language output by the devices 64, 66, 68, and 70 into the second language to output chunks of the second language; and a translation result editing device 74 having a display device for performing display so that the chunks of the second language output from the automatic translation device 72 and the chunks of the first language corresponding to the chunks of the second language can be contrasted with each other, and editing by a first editor can be performed with respect to the chunks of the second language, and for outputting the edited text of the second language in response to reception of completion of the edition.SELECTED DRAWING: Figure 1

Description

この技術は同時翻訳システムに関し、特に、翻訳後の訳文を効率的に編集できる同時翻訳システム及び方法に関する。 This technology relates to a simultaneous translation system, and more particularly to a simultaneous translation system and method capable of efficiently editing translated sentences.

最近では、様々な国の映画、テレビドラマ、ニュース等を視聴する機会が爆発的に増えている。これはインターネットを初めとする、通信技術の発達による。また、ウェブ会議に代表されるように、異なる国の人達が通信により会話又は会議をしたり、講演を視聴したりするためのシステムも非常な勢いで広がっている。 Recently, opportunities to watch movies, TV dramas, news, etc. from various countries have increased explosively. This is due to the development of communication technology including the Internet. In addition, as typified by web conferencing, systems for people from different countries to have conversations or conferences by communication, and to listen to lectures are spreading at a very rapid pace.

こうしたシステムで問題になるのは言葉の問題である。同じ言語の人だけが参加する会話のような場合には問題はないが、異なる国の人々が参加する会議、特に３種類以上の言語が使用される会議等では、同時翻訳がない限り有益な結果を得ることは難しい。 The problem with these systems is language. There is no problem in conversations where only people of the same language participate, but in meetings where people from different countries participate, especially meetings where three or more languages are used, it is useful unless there is simultaneous translation. Results are hard to come by.

従来は、こうした会議では、ある言語と別のある言語との双方に堪能な同時通訳者を言語の組合わせの数だけ準備し、それら同時通訳者による通訳のうちのいずれかを各参加者が利用していた。 Conventionally, in such meetings, simultaneous interpreters who are fluent in both one language and another are provided for the number of language combinations, and each participant receives one of the interpretations by the simultaneous interpreters. was using.

しかし、国際的な会議の機会が増加したり、様々な国のニュース等のリアルタイム性のある番組を視聴したりする機会が増加することにより、このような同時通訳者を必要な数だけ準備することは不可能ではないにしても極めて困難になっている。 However, due to the increase in opportunities for international conferences and viewing of real-time programs such as news from various countries, it is necessary to prepare the necessary number of simultaneous interpreters. It has become extremely difficult, if not impossible.

そこで、このような同時翻訳に、最近になって著しい進歩を見せている音声認識と自動翻訳とを利用することが考えられる。最終的な出力をテキストではなく音声で行う場合には、自動翻訳の結果に対して音声合成を行えばよい。最近の音声認識及び自動翻訳は速度も早く、その精度も以前と比較してはるかに高くなっている。したがって、同時翻訳にこれらの技術を利用することは非常に有効である。 Therefore, for such simultaneous translation, it is conceivable to use speech recognition and automatic translation, which have recently made remarkable progress. If the final output is not text but speech, speech synthesis may be performed on the result of automatic translation. Recent speech recognition and automatic translation are faster and more accurate than ever before. Therefore, it is very effective to use these techniques for simultaneous translation.

自動翻訳を利用した同時翻訳装置として、後掲の特許文献１に開示されたものがある。特許文献１に開示された同時翻訳装置は、第１言語の音声の入力を受けて第２言語に自動翻訳し、その結果を第２言語の音声で出力する、というものである。特許文献１では、第１言語から第２言語へ、及び第２言語から第１言語への２方向の同時翻訳を同一の自動翻訳装置により行っている。 As a simultaneous translation device using automatic translation, there is one disclosed in Patent Document 1 listed below. The simultaneous translation apparatus disclosed in Patent Document 1 receives input of speech in a first language, automatically translates it into a second language, and outputs the result in speech of the second language. In Patent Literature 1, the same automatic translation device performs two-way simultaneous translation from a first language to a second language and from the second language to a first language.

特開２０１８－１９５２７６号公報JP 2018-195276 A

しかし、特許文献１に開示された自動翻訳装置を含め、自動翻訳装置を使用する場合、特に会議等、その場で相手の発言を聞き直すことが難しい場合には、最終的な翻訳については人手で編集すること、すなわちポストエディットが必要である。というのは、途中で誤訳があったり解釈が難しい表現があったりしたときに、利用者がその後の話を正しく理解できなくなったり、議事の進行が軌道をはずれてしまったりする可能性が高くなるためである。 However, when using an automatic translation device, including the automatic translation device disclosed in Patent Document 1, especially when it is difficult to listen to the other party's remarks on the spot, such as in a meeting, the final translation is done manually. , i.e. post-editing is required. This is because, if there is a mistranslation or an expression that is difficult to interpret in the middle, there is a high possibility that the user will not be able to understand the rest of the story correctly, or that the proceedings will go off track. It's for.

特許文献１に開示された翻訳装置では、ポストエディットの必要性については全く開示がない。仮に特許文献１に開示された翻訳装置でポストエディットを行う場合には、翻訳前の音声又はテキストと翻訳後のテキストとを見比べて訳文の正確性を判断し、必要な修正を行う編集者が必要となる。 The translation device disclosed in Patent Document 1 does not disclose the need for post-editing at all. If post-editing is performed by the translation device disclosed in Patent Document 1, an editor who compares the pre-translation speech or text with the post-translation text to determine the accuracy of the translation and makes necessary corrections necessary.

しかし、そのように２つの言語の双方に堪能な通訳者の数は少ない。したがって、特許文献１に開示された翻訳装置では、信頼性の高い同時翻訳を実現するのは難しいという問題がある。 However, the number of such interpreters who are fluent in both languages is small. Therefore, the translation device disclosed in Patent Document 1 has the problem that it is difficult to achieve highly reliable simultaneous translation.

したがって、この発明の目的は、信頼性の高い同時翻訳を実現できる同時翻訳システム及び方法を提供することである。 SUMMARY OF THE INVENTION Accordingly, an object of the present invention is to provide a simultaneous translation system and method capable of realizing highly reliable simultaneous translation.

本発明の第１の局面に係る同時翻訳システムは、第１言語の音声を第２言語に翻訳する同時翻訳システムであって、第１言語の音声信号を音声認識し、チャンクに分割して第１言語のチャンクからなるチャンク列を出力するチャンク列出力手段と、チャンク列出力手段の出力する第１言語のチャンクの各々について、第２言語への自動翻訳を行い、第２言語のチャンクを出力する自動翻訳手段と、自動翻訳手段の出力する第２言語のチャンクと、当該第２言語のチャンクに対応する第１言語のチャンクとが対照できるように、かつ第２言語のチャンクに対する第１の編集者による編集ができるように表示する表示装置を持ち、編集終了の指示を受けたことに応答して、当該編集された第２言語のテキストを出力するポストエディット手段とを含む。 A simultaneous translation system according to a first aspect of the present invention is a simultaneous translation system for translating speech in a first language into a second language, wherein a speech signal in the first language is speech-recognized, divided into chunks, and A chunk string output means for outputting a chunk string composed of chunks in one language, and each chunk in the first language output by the chunk string output means is automatically translated into a second language, and a chunk in the second language is output. so that the automatic translation means, the second language chunk output by the automatic translation means, and the first language chunk corresponding to the second language chunk can be compared, and the first language chunk for the second language chunk and post-editing means for outputting the edited text in the second language in response to receiving an instruction to end editing, the display device having a display for editing by an editor.

好ましくは、同時翻訳システムは、第２言語のチャンクを第１言語に翻訳する逆翻訳手段をさらに含み、表示装置はさらに、逆翻訳手段による逆翻訳結果を表示する。 Preferably, the simultaneous translation system further includes back-translation means for translating chunks of the second language into the first language, and the display device further displays the back-translation result by the back-translation means.

より好ましくは、ポストエディット手段は、逆翻訳手段による翻訳結果と第２言語のチャンクの原文である第１言語のチャンクとの一致度を算出する一致度算出手段をさらに含み、表示装置はさらに、一致度を第２言語のチャンクの翻訳の信頼度として表示する。 More preferably, the post-editing means further includes matching degree calculation means for calculating a degree of matching between the result of translation by the reverse translation means and the chunk in the first language, which is the original text of the chunk in the second language, and the display device further includes: Display the degree of agreement as confidence in the translation of the chunk in the second language.

さらに好ましくは、同時翻訳システムは、第１及び第２のポストエディット手段を含み、同時翻訳システムはさらに、自動翻訳手段の出力を第１又は第２のポストエディット手段に振り分けて入力し、当該入力に対する第１及び第２のポストエディット手段の出力を正しい順番で統合する第１の統合手段を含む。 Further preferably, the simultaneous translation system includes first and second post-editing means, the simultaneous translation system further distributes the output of the automatic translation means to the first or second post-editing means and inputs the input and first merging means for merging in proper order the outputs of the first and second post-editing means for .

好ましくは、チャンク列出力手段は、第１言語の音声信号を音声認識し第１言語のテキストを出力する音声認識テキスト出力手段と、音声認識テキスト出力装置の出力する第１言語のテキストを自動的にチャンクに分割して第１言語のチャンク列を出力するテキスト自動分割手段と、テキスト自動分割手段が出力する第１言語のチャンク列を、第２の編集者によるチャンク編集ができるように表示する表示装置を持ち、第２の編集者による編集終了の指示に応答して、当該表示されている第１言語のチャンク列を出力するチャンク列編集手段とを含む。 Preferably, the chunk sequence output means includes speech recognition text output means for recognizing speech signals in the first language and outputting text in the first language, and automatic text output in the first language output by the speech recognition text output device. an automatic text dividing means for dividing into chunks and outputting a chunk string of the first language, and a chunk string of the first language outputted by the automatic text dividing means are displayed so that a second editor can edit the chunks. and chunk string editing means for outputting the displayed chunk string in the first language in response to an instruction to finish editing by the second editor.

より好ましくは、チャンク列出力手段は、第１及び第２のチャンク列編集手段を含み、同時翻訳システムはさらに、テキスト自動分割手段の出力を、第１又は第２のチャンク列編集手段に振り分けて入力し、当該入力に対する第１及び第２のチャンク列編集手段の出力を正しい順番で統合する第２の統合手段を含む。 More preferably, the chunk sequence output means includes first and second chunk sequence editing means, and the simultaneous translation system further distributes the output of the automatic text segmentation means to the first or second chunk sequence editing means. A second integration means for receiving the input and integrating the outputs of the first and second chunk sequence editing means for the input in the correct order.

さらに好ましくは、音声認識テキスト出力手段は、第１言語の音声信号を音声認識し第１言語のテキストを出力する音声認識装置と、音声認識装置の出力する第１言語のテキストを、第３の編集者による編集が可能なように表示する表示装置を持ち、編集終了の指示に応答して、表示されている第１言語のテキストをテキスト自動分割手段に入力するテキスト編集手段とを含む。 More preferably, the speech recognition text output means includes a speech recognition device for recognizing a speech signal in a first language and outputting a text in the first language; Text editing means for inputting the displayed text in the first language to the automatic text division means in response to an instruction to end editing.

好ましくは、音声認識テキスト出力手段は、第１及び第２のテキスト編集手段を含み、同時翻訳システムはさらに、音声認識装置の出力を、第１又は第２のテキスト編集手段に振り分けて入力し、当該入力に対する第１及び第２のテキスト編集手段の出力を正しい順番で統合する第３の統合手段を含む。 Preferably, the speech recognition text output means includes first and second text editing means, and the simultaneous translation system further distributes and inputs the output of the speech recognition device to the first or second text editing means, It includes third merging means for merging, in correct order, the outputs of the first and second text editing means for said input.

より好ましくは、チャンク列出力手段は、第１言語の音声信号を音声認識し第１言語のテキストを出力する音声認識手段と、音声認識装置の出力する第１言語のテキストを、第３の編集者による編集が可能なように表示する表示装置を持ち、編集終了の指示に応答して、表示されている第１言語のテキストを出力するテキスト編集手段と、テキスト編集手段の出力する第１言語のテキストをチャンクに分割し第１言語のチャンク列を出力するテキスト分割手段とを含む。 More preferably, the chunk string output means includes speech recognition means for recognizing speech signals in the first language and outputting text in the first language, and text in the first language output by the speech recognition device for performing third editing. text editing means for outputting the displayed text in the first language in response to an instruction to end editing; and the first language output by the text editing means. text segmentation means for segmenting the text of the language into chunks and outputting a sequence of chunks in the first language.

さらに好ましくは、チャンク列出力手段は、第１及び第２のテキスト編集手段を含み、同時翻訳システムはさらに、音声認識手段の出力を、第１又は第２のテキスト編集手段に振り分けて入力し、当該入力に対する第１及び第２のテキスト編集手段の出力を正しい順番で統合する第２の統合手段を含む。 More preferably, the chunk string output means includes first and second text editing means, and the simultaneous translation system further distributes the output of the speech recognition means to the first or second text editing means and inputs it, It includes second merging means for merging the outputs of the first and second text editing means for the input in the correct order.

本発明の第２の局面に係る同時翻訳方法は、第１言語の音声を第２言語に翻訳する同時翻訳方法であって、コンピュータが、第１言語の音声信号を音声認識し、チャンクに分割して第１言語のチャンクからなるチャンク列を出力するステップと、コンピュータが、第１言語のチャンクの各々について、第２言語への自動翻訳を行い、第２言語のチャンクを出力するステップと、コンピュータが、第２言語のチャンクと、当該第２言語のチャンクに対応する第１言語のチャンクとが対照できるように、かつ第２言語のチャンクに対する第１の編集者による編集ができるように表示装置に表示するステップと、コンピュータが第１の編集者による第２言語のチャンクに対する編集を受け付けるステップと、コンピュータが、編集終了の指示を受けたことに応答して、編集を受け付けるステップにおいて編集された第２言語のテキストを出力するステップとを含む。 A simultaneous translation method according to a second aspect of the present invention is a simultaneous translation method for translating speech in a first language into a second language, wherein a computer recognizes a speech signal in the first language and divides it into chunks. a step of automatically translating each of the first language chunks into a second language by a computer and outputting the second language chunks; A computer displays second language chunks and first language chunks corresponding to the second language chunks for comparison and for editing by a first editor on the second language chunks. a computer accepting edits to the chunk in the second language by the first editor; and a computer accepting edits in response to receiving an instruction to end editing. and outputting the second language text.

この発明の上記及び他の目的、特徴、局面及び利点は、添付の図面と関連して理解されるこの発明に関する次の詳細な説明から明らかとなるであろう。 The above and other objects, features, aspects and advantages of the present invention will become apparent from the following detailed description of the invention taken in conjunction with the accompanying drawings.

図１は、この発明の第１実施形態に係る同時翻訳システムの概略構成を示すブロック図である。FIG. 1 is a block diagram showing a schematic configuration of a simultaneous translation system according to the first embodiment of the invention. 図２は、図１に示すシステムにおける処理ステップを示す模式図である。FIG. 2 is a schematic diagram showing processing steps in the system shown in FIG. 図３は、音声認識結果の編集前の編集画面を示す図である。FIG. 3 is a diagram showing an editing screen before editing the speech recognition result. 図４は、音声認識結果の編集後の編集画面を示す図である。FIG. 4 is a diagram showing an editing screen after editing the speech recognition result. 図５は、自動翻訳の単位であるチャンクの編集前の編集画面を示す図である。FIG. 5 is a diagram showing an editing screen before editing a chunk, which is a unit of automatic translation. 図６は、自動翻訳の単位であるチャンクの編集後の編集画面を示す図である。FIG. 6 is a diagram showing an editing screen after editing a chunk, which is a unit of automatic translation. 図７は、翻訳結果の編集前の編集画面を示す図である。FIG. 7 is a diagram showing an editing screen before editing the translation result. 図８は、翻訳結果の編集後の編集画面を示す図である。FIG. 8 is a diagram showing an editing screen after editing the translation result. 図９は、図１に示す同時翻訳システムを実現するプログラムの制御構造を示すフローチャートである。FIG. 9 is a flow chart showing the control structure of a program that implements the simultaneous translation system shown in FIG. 図１０は、図１に示す音声認識結果編集装置を実現するプログラムの制御構造を示すフローチャートである。FIG. 10 is a flow chart showing the control structure of a program that implements the speech recognition result editing apparatus shown in FIG. 図１１は、図１に示す翻訳単位編集装置を実現するプログラムの制御構造を示すフローチャートである。FIG. 11 is a flow chart showing the control structure of a program that implements the translation unit editing device shown in FIG. 図１２は、図１に示す翻訳結果編集装置を実現するプログラムの制御構造を示すフローチャートである。FIG. 12 is a flow chart showing the control structure of a program that implements the translation result editing apparatus shown in FIG. 図１３は、図８に示す翻訳結果詳細処理を実現するプログラムの制御構造を示すフローチャートである。FIG. 13 is a flow chart showing the control structure of a program that implements the translation result detail processing shown in FIG. 図１４は、この発明の第２実施形態に係る統括サーバを実現するプログラムの制御構造を示すフローチャートである。FIG. 14 is a flow chart showing the control structure of a program that implements the central server according to the second embodiment of this invention. 図１５は、第２実施形態の第１変形例に係る翻訳結果の編集画面を示す図である。FIG. 15 is a diagram showing a translation result editing screen according to the first modification of the second embodiment. 図１６は、第２実施形態の第１変形例に係る翻訳結果編集装置を実現するプログラムの制御構造を示すフローチャートである。FIG. 16 is a flow chart showing the control structure of a program that implements the translation result editing device according to the first modified example of the second embodiment. 図１７は、第２実施形態の第２変形例に係る翻訳結果編集装置における翻訳結果の編集画面を示す図である。FIG. 17 is a diagram showing a translation result editing screen in the translation result editing device according to the second modification of the second embodiment. 図１８は、第２実施形態の第２変形例に係る翻訳結果編集装置を実現するプログラムの制御構造を示すフローチャートである。FIG. 18 is a flow chart showing the control structure of a program that implements the translation result editing device according to the second modified example of the second embodiment. 図１９は、第２実施形態の第３変形例に係る翻訳結果編集装置を実現するプログラムの制御構造を示すフローチャートである。FIG. 19 is a flow chart showing the control structure of a program that implements the translation result editing device according to the third modified example of the second embodiment.

以下の説明及び図面では、同一の部品には同一の参照番号を付してある。したがって、それらについての詳細な説明は繰返さない。 In the following description and drawings, identical parts are provided with identical reference numerals. Therefore, detailed description thereof will not be repeated.

第１第１実施形態
１構成
（１）全体構成
図１は、この発明の第１実施形態に係る同時翻訳システム５０の全体構成を示すブロック図である。図１を参照して、同時翻訳システム５０は、ネットワーク６２に接続され、ネットワーク６２を通じた通信により同時翻訳システム５０の各機能部を統括して制御する統括サーバ６０と、いずれもネットワーク６２に接続された音声認識装置６４、音声認識結果編集装置６６、発話分割装置６８、翻訳単位編集装置７０、自動翻訳装置７２、翻訳結果編集装置７４、音声合成装置７６及び翻訳結果編集装置７４による翻訳結果を逆翻訳するための自動翻訳装置７８とを含む。 1st Embodiment 1 Configuration (1) Overall Configuration FIG. 1 is a block diagram showing the overall configuration of a simultaneous translation system 50 according to the first embodiment of the present invention. Referring to FIG. 1, a simultaneous translation system 50 is connected to a network 62, and a central server 60 that centrally controls each functional unit of the simultaneous translation system 50 by communication through the network 62. Both are connected to the network 62. The translation result by the speech recognition device 64, the speech recognition result editing device 66, the speech segmentation device 68, the translation unit editing device 70, the automatic translation device 72, the translation result editing device 74, the speech synthesis device 76, and the translation result editing device 74 is and an automatic translation device 78 for back-translating.

音声認識装置６４、自動翻訳装置７２及び７８は、用途（分野及び言語の種類）に応じて利用可能なものであればどのようなものでもよい。自動翻訳装置７８は自動翻訳装置７２と原言語及び訳言語の関係を逆にしたものである必要がある。この実施形態では自動翻訳装置７８を自動翻訳装置７２とは別に設けている。しかしそのような実施形態に限定されず、自動翻訳装置７２を用いて訳言語から原言語への翻訳を行うようにしてもよい。 The speech recognition device 64 and the automatic translation devices 72 and 78 may be of any type as long as they can be used according to the application (field and type of language). The automatic translation device 78 should have the relationship between the original language and the translation language reversed from that of the automatic translation device 72 . In this embodiment, an automatic translation device 78 is provided separately from the automatic translation device 72 . However, it is not limited to such an embodiment, and the automatic translation device 72 may be used to translate from the target language to the original language.

音声認識結果編集装置６６は、音声認識装置６４が出力する音声認識結果のテキストをリアルタイムで受け、第１の編集者が編集して編集が完了したら直ちに発話分割装置６８に向けて出力するためのものである。音声認識装置６４の出力するテキストは音声認識結果なので、音声認識の誤りを含むことが多い。そのため原文を高い精度で自動翻訳するためには、このテキストに含まれる誤りを修正する必要がある。この修正を自動処理によって行ってもよいが、この実施形態では精度を重んじて人手で行う。 The speech recognition result editing device 66 receives the text of the speech recognition result output by the speech recognition device 64 in real time, edits it by the first editor, and immediately outputs it to the speech division device 68 when the editing is completed. It is. Since the text output by the speech recognition device 64 is the result of speech recognition, it often contains speech recognition errors. Therefore, in order to automatically translate the original text with high accuracy, it is necessary to correct errors contained in this text. Although this correction may be performed by automatic processing, in this embodiment, accuracy is emphasized and it is performed manually.

音声認識結果編集装置６６が行う処理は単純なテキストの編集なので、音声認識結果編集装置６６は通常のコンピュータで十分であり、特に複雑な構成は不要である。 Since the processing performed by the speech recognition result editing device 66 is simple text editing, the speech recognition result editing device 66 is sufficient with a normal computer and does not require a particularly complicated configuration.

発話分割装置６８は、音声認識結果編集装置６６が出力する発話単位に分割されたテキストを、翻訳が精度高く行えるよう、翻訳に適した、「チャンク」と呼ばれる単位に分割する機能を持つ。テキストをどこで分割してチャンクとするかは、言語により、また分野により異なる。発話の場合、一文が長くなることが多い。文が長いと一般に自動翻訳の精度が低くなることが知られている。そこでこのようにテキストをチャンクに分割しチャンク列として各チャンクについて翻訳することで自動翻訳の精度を高くできる。いわゆる「節」等がチャンクに相当する。 The utterance dividing device 68 has a function of dividing the text divided into utterance units output by the speech recognition result editing device 66 into units called "chunks" suitable for translation so that the translation can be performed with high accuracy. Where the text is divided into chunks depends on the language and domain. In the case of speech, one sentence is often long. It is known that the longer the sentence, the lower the accuracy of automatic translation. Therefore, by dividing the text into chunks in this way and translating each chunk as a chunk sequence, the accuracy of automatic translation can be improved. A so-called "section" or the like corresponds to a chunk.

発話分割装置６８は、テキストをどのようにチャンクに分割すればよいかをいわゆる機械学習による学習で判定するものとする。この学習には、多数の文について、各トークンで文をチャンクに分割するか否かを示すラベルを各トークンに付したものを訓練データとして準備する。この訓練データを用いてＳＶＭ（ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅ）を訓練することで発話分割装置６８を実現できる。また、深層学習により学習したニューラルネットワーク（ＤＮＮ）を用いることもできる。 The speech division device 68 determines how to divide the text into chunks by so-called machine learning. For this learning, a large number of sentences are prepared as training data in which each token is labeled to indicate whether or not the sentence is to be divided into chunks. The speech segmentation device 68 can be realized by training an SVM (Support Vector Machine) using this training data. A neural network (DNN) trained by deep learning can also be used.

翻訳単位編集装置７０は、発話分割装置６８が出力するチャンク列について、第２の編集者が手作業によりリアルタイムで編集し出力するためのものである。発話分割装置６８を訓練することはできるが、その判定が常に人間の判断と完全に一致することはあり得ない。翻訳の原文を誤って分割すれば、当然に翻訳も誤ってしまう。そこでこの実施形態では、発話分割装置６８による分割結果を第２の編集者が目視で確認し、必要であれば編集することとした。 The translation unit editing device 70 is for the second editor to manually edit and output the chunk sequence output by the speech segmentation device 68 in real time. Speech segmenter 68 can be trained, but its decisions cannot always match perfectly with human judgment. If the original text for translation is incorrectly divided, the translation will naturally be incorrect. Therefore, in this embodiment, the second editor visually confirms the result of division by the speech division device 68, and edits it if necessary.

なお発話分割装置６８が行う処理は単純なテキストの編集（チャンクの分割及び統合、これを便宜的に「チャンク編集」と呼ぶものとする。）だけなので、発話分割装置６８は通常のコンピュータで十分であり特に特に複雑な構成は不要である。 Since the processing performed by the speech segmentation device 68 is only simple text editing (segmentation and integration of chunks, which is referred to as "chunk editing" for convenience), the speech segmentation device 68 can be implemented by a normal computer. and a particularly complicated configuration is not required.

翻訳結果編集装置７４は、自動翻訳装置７２による翻訳結果を第３の編集者が目視で確認し、必要であれば修正するためである。この実施形態では、後述するように翻訳原文のテキストと自動翻訳装置７２による訳文とを対照して表示する。したがって編集者は両者を対比して訳文の適否を容易に判定し必要であれば修正できる。またこの実施形態ではさらに、後述するように、編集者の要求に応じて翻訳結果の詳細を表示できる。詳細画面では、原文と訳文とに加え、訳文を原言語に逆翻訳した結果と、そのスコア（信頼度）とを表示する。したがって編集者はこれらを全て考慮した上で訳文の適否の判定と修正とを行える。 The translation result editing device 74 is for the third editor to visually check the translation result by the automatic translation device 72 and correct it if necessary. In this embodiment, as will be described later, the original text to be translated and the translated text by the automatic translation device 72 are displayed in comparison. Therefore, the editor can easily judge whether the translation is appropriate or not by comparing the two, and make corrections if necessary. Further, in this embodiment, as will be described later, the details of the translation result can be displayed at the request of the editor. In addition to the original text and the translated text, the detailed screen displays the result of back-translating the translated text into the original language and its score (reliability). Therefore, the editor can judge whether the translation is appropriate or not and correct the translation after taking all of these into consideration.

（２）全体の処理構成
図２に、この同時翻訳システム５０の全体の処理構成を示す。図２を参照して、原言語音声信号１１０が音声認識装置６４の音声認識エンジン１１２に入力され、原言語テキスト１１４が出力される。原言語テキスト１１４は音声認識結果編集装置６６を用いた第１の編集者による音声認識結果編集１１６を受けて音声認識の修正後の原言語テキスト１１８となる。修正後の原言語テキスト１１８は発話分割装置６８の翻訳結果分割エンジン１２０に入力されチャンク列に分割された原言語テキスト１２２となり翻訳単位編集装置７０に入力される。原言語テキスト１２２は翻訳単位編集装置７０を用いた第２の編集者による翻訳単位分割編集１２４を受け、翻訳に適した正しいチャンク列に修正された原言語テキスト１２６となり自動翻訳エンジン１２８に入力される。 (2) Overall Processing Configuration FIG. 2 shows the overall processing configuration of this simultaneous translation system 50 . Referring to FIG. 2, source language speech signal 110 is input to speech recognition engine 112 of speech recognizer 64 and source language text 114 is output. The source language text 114 is subjected to speech recognition result editing 116 by a first editor using a speech recognition result editing device 66 to become a source language text 118 after speech recognition correction. The corrected source language text 118 is input to the translation result segmentation engine 120 of the speech segmentation device 68 , becomes the source language text 122 segmented into chunk strings, and is input to the translation unit editing device 70 . The source language text 122 undergoes translation unit segmentation editing 124 by a second editor using a translation unit editor 70, resulting in source language text 126 modified into the correct sequence of chunks suitable for translation and input to an automatic translation engine 128. be.

自動翻訳エンジン１２８は、このように翻訳に適した正しいチャンク列に修正された原言語テキスト１２６に対して自動翻訳を行い、訳言語テキスト１３０を出力する。訳言語テキスト１３０にどの程度の誤りが存在するかは自動翻訳エンジン１２８の性能に依存する。しかし、入力が翻訳に適した正しいチャンク列に修正されているため、少なくともその精度はそうした修正がされていない場合と比較して高くなる。訳言語テキスト１３０は図１に示す翻訳結果編集装置７４を用いた第３の編集者によるポストエディット１３２を受ける。この結果、翻訳結果に誤りが含まれていても第３の編集者により訂正されて訳言語テキスト１３４として出力される。 The automatic translation engine 128 automatically translates the source language text 126 corrected into a correct chunk sequence suitable for translation in this way, and outputs a target language text 130 . How many errors exist in the target language text 130 depends on the performance of the automatic translation engine 128 . However, since the input is corrected to the correct sequence of chunks suitable for translation, the accuracy is at least higher than without such corrections. The target language text 130 is post-edited 132 by a third editor using the translation result editing device 74 shown in FIG. As a result, even if the translation result contains an error, it is corrected by the third editor and output as the translation language text 134 .

この実施形態では、このようにしてポストエディットされた訳言語テキスト１３４は音声合成装置７６の音声合成エンジン１３６により訳言語音声信号１３８に変換され出力される。 In this embodiment, the translated language text 134 post-edited in this manner is converted into a translated language speech signal 138 by a speech synthesis engine 136 of the speech synthesizer 76 and output.

以上のようにこの実施形態に係る同時翻訳処理１００では、原言語音声信号１１０の音声認識、音声認識結果の修正、修正後のテキストのチャンクへの分割、チャンク列の修正、自動翻訳、自動翻訳の修正、及び音声合成が、音声認識エンジン１１２による音声認識を単位としてパイプライン式に行われる。その結果、同時翻訳を高い精度で行える。 As described above, in the simultaneous translation processing 100 according to this embodiment, speech recognition of the source language speech signal 110, correction of the speech recognition result, division of the corrected text into chunks, correction of the chunk sequence, automatic translation, automatic translation and speech synthesis are pipelined in units of speech recognition by the speech recognition engine 112 . As a result, simultaneous translation can be performed with high accuracy.

（３）各編集画面
ア音声認識結果のエディタ画面
図３に、音声認識結果の修正で用いられるエディタ画面２００を示す。この実施形態では、エディタ画面２００は、編集フィールド２１０と、音声信号表示部２１２と、再生音声を少しだけ戻す戻しボタン２１４、ポーズボタン２１６、及び再生音声を少しだけ先に進める進みボタン２１８と、「次へ」ボタン２２０とを含む。 (3) Edit Screens a. Editor screen for speech recognition results FIG. 3 shows an editor screen 200 used for correcting speech recognition results. In this embodiment, the editor screen 200 includes an edit field 210, an audio signal display portion 212, a reverse button 214 that slightly reverses the played audio, a pause button 216, and a forward button 218 that slightly advances the played audio, and a “Next” button 220 .

編集フィールド２１０には音声認識結果のテキストが表示される。図３に示すように、音声認識結果のテキストは漢字かな混じりではあるものの、句読点が含まれず、全体がつながった一つの文字列である。またこの文字列の中には、文字・文字列２３０、２３２、及び２３４のように発話中に挿入された単なる音を示す文字列、文字・文字列２３６及び２３８のように間投詞的に挿入された意味のない文字・文字列が含まれている。 Edit field 210 displays the text of the speech recognition result. As shown in FIG. 3, although the text of the speech recognition result is a mixture of kanji and kana, it does not contain punctuation marks and is a single character string that is connected as a whole. In addition, among these character strings, there are character strings indicating mere sounds inserted during speech, such as characters/character strings 230, 232, and 234, and characters/character strings 236 and 238, which are inserted interjectively. It contains meaningless characters/strings.

編集フィールド２１０に対しては第１の編集者による文字列編集が可能である。例えば図４に示す編集箇所２５０、２５２及び２５４のように単なる音を削除し、編集箇所２５６及び２５８に示すように意味のない文字・文字列を削除することで音声認識結果の文字列は正しい文字列となる。 Edit field 210 allows text editing by the first editor. For example, by deleting mere sounds as shown in edited portions 250, 252 and 254 shown in FIG. be a string.

音声信号表示部２１２には、編集フィールド２１０に対応する音声信号を示す画像が表示されている。この画像の中のある点にポインタを移動してクリックすると、その点に対応する位置から音声の再生が開始される。戻しボタン２１４をクリックすれば短い時間（例えば１秒）だけ再生位置を戻すことができる。進みボタン２１８をクリックすれば短い時間（例えば３秒）だけ再生位置を進めることができる。ポーズボタン２１６をクリックすれば再生を一時停止し、ポーズボタン２１６ボタンに代わって再生開始ボタン（図示せず）が表示される。したがって必要であれば第１の編集者が音声認識の対象の音声を容易に確認でき、音声認識結果を正しく修正できる。 An image showing the audio signal corresponding to the edit field 210 is displayed in the audio signal display section 212 . If you move the pointer to a point in this image and click on it, the sound will start playing from the position corresponding to that point. By clicking the return button 214, the playback position can be returned for a short period of time (for example, 1 second). By clicking the advance button 218, the playback position can be advanced by a short period of time (for example, 3 seconds). If the pause button 216 is clicked, reproduction is paused, and a reproduction start button (not shown) is displayed instead of the pause button 216 button. Therefore, if necessary, the first editor can easily confirm the target speech for speech recognition and correct the speech recognition result.

なお、エディタ画面２００による編集は、単なる文字列の削除、挿入のみである。また音声信号に関して行われる処理は、音声信号の表示、クリック位置の検出、及びクリックされた位置に応じた音声信号の再生、停止、再生位置の変更のみである。したがってこの編集処理は周知のプログラムを用いて実現できる。 Editing by the editor screen 200 is only simple deletion and insertion of character strings. Further, the processing performed on the audio signal is only the display of the audio signal, the detection of the clicked position, and the reproduction, stop, and change of the reproduction position of the audio signal according to the clicked position. Therefore, this editing process can be realized using a well-known program.

「次へ」ボタン２２０を編集者がクリックすると、編集フィールド２１０に表示されている編集後のテキストが後続する処理に向けて出力される。 When the editor clicks on the "Next" button 220, the edited text displayed in the edit field 210 is output for subsequent processing.

イ翻訳単位のエディタ画面
図５に、翻訳単位（チャンク）のエディタ画面３００を示す。図５を参照して、エディタ画面３００は、修正された音声認識結果のテキストであって図１に示す発話分割装置６８により分割されたテキストが、編集可能に表示される編集フィールド３０２と、「次へ」ボタン３０４及び３０６とを含む。 B. Translation Unit Editor Screen FIG. 5 shows the translation unit (chunk) editor screen 300 . Referring to FIG. 5, an editor screen 300 includes an edit field 302 in which the text of the corrected speech recognition result, which is divided by the utterance dividing device 68 shown in FIG. and Next buttons 304 and 306 .

発話分割装置６８の出力は、チャンクの終端（分割位置）と判定された箇所各々の直後に挿入された改行コードを含む。発話分割装置６８による分割は、訓練によりある程度の精度を実現できるが完全に誤りを排除するわけではない。例えば図５において、第１文３１０と第２文３１２とは本来は一つの文である。しかし発話分割装置６８はこの文を第１文３１０と第２文３１２とに分割している。 The output of the speech splitter 68 includes line feed codes inserted immediately after each location determined to be the end of the chunk (split position). The segmentation by the speech segmentation device 68 can achieve some degree of accuracy through training, but does not completely eliminate errors. For example, in FIG. 5, the first sentence 310 and the second sentence 312 are originally one sentence. However, the speech splitter 68 has split this sentence into a first sentence 310 and a second sentence 312 .

第１文３１０と第２文３１２とをまとめるべきと判断した第２の編集者は、第１文３１０の末尾に付されている改行コードを削除する。その結果、図６に示す編集箇所３２０のように、第１文３１０と第２文３１２とが正しく１文にまとめられる。 The second editor, who has determined that the first sentence 310 and the second sentence 312 should be put together, deletes the line feed code attached to the end of the first sentence 310 . As a result, the first sentence 310 and the second sentence 312 are correctly grouped into one sentence, like the edited part 320 shown in FIG.

「次へ」ボタン３０４又は「次へ」ボタン３０６がクリックされると、編集フィールド３０２に表示されているテキストが保存され、後続する処理に向けて出力される。 When "Next" button 304 or "Next" button 306 is clicked, the text displayed in edit field 302 is saved and output for subsequent processing.

ウ翻訳結果のエディタ画面
図１に示す翻訳結果編集装置７４のエディタ画面３５０を示す。図７を参照して、エディタ画面３５０は、自動翻訳の原文が表示される原文フィールド３５２と、その翻訳結果が表示される訳文編集フィールド３５４と、「次へ」ボタン３５６とを含む。 C Editor Screen of Translation Result Shown is the editor screen 350 of the translation result editing device 74 shown in FIG. Referring to FIG. 7, editor screen 350 includes an original text field 352 displaying the original text of automatic translation, a translated text editing field 354 displaying the translation result, and a “next” button 356 .

第３の編集者が原文フィールド３５２の表示内容と訳文編集フィールド３５４の表示内容とを比較する。もしも訳文に誤りがあれば第３の編集者は訳文編集フィールド３５４に表示されている訳文のうち、その誤りの箇所を修正する。 The third editor compares the displayed contents of the original text field 352 and the displayed contents of the translated text editing field 354 . If there is an error in the translation, the third editor corrects the error in the translation displayed in the translation edit field 354 .

図７の場合、例えば原文フィールド３５２に表示されている原文３５８に対応して訳文編集フィールド３５４に表示されている訳文３６０が、誤った、又は分かりにくい訳であると判断した場合、第３の編集者は訳文３６０の任意の箇所をクリックする。すると図８に示す、翻訳結果詳細のエディタ画面４００が表示され、後述するように訳文３６０を修正でき、修正後にこのエディタ画面３５０が表示される。 In the case of FIG. 7, for example, when it is determined that the translation 360 displayed in the translation edit field 354 corresponding to the original sentence 358 displayed in the original text field 352 is incorrect or difficult to understand, the third The editor clicks any part of the translation 360 . Then, an editor screen 400 for translation result details shown in FIG. 8 is displayed, and the translation 360 can be corrected as described later. After correction, this editor screen 350 is displayed.

「次へ」ボタン３５６がクリックされると、訳文編集フィールド３５４に表示されている訳文が正しい訳文として後続する処理に向けて出力される。 When the "Next" button 356 is clicked, the translated text displayed in the translated text edit field 354 is output as the correct translated text for subsequent processing.

エ翻訳結果詳細のエディタ画面
図８に、図７の訳文３６０を第３の編集者がクリックしたときに表示される翻訳結果詳細のエディタ画面４００を示す。図８を参照して、エディタ画面４００は、原文を表示する原文４１０と、その訳文を編集可能に表示する訳文４１４と、訳文４１４を原言語に逆翻訳した結果である逆翻訳４１２をその信頼度４２０とともに表示する逆翻訳４１２と、「修正を反映して閉じる」ボタン４１６及びキャンセルボタン４１８とを含む。 D. Editor screen for translation result details FIG. 8 shows an editor screen 400 for translation result details that is displayed when the third editor clicks on the translation 360 in FIG. Referring to FIG. 8, editor screen 400 displays original text 410 displaying the original text, translated text 414 displaying the translated text in an editable manner, and reverse translation 412 resulting from back-translating the translated text 414 into the original language. It includes a back-translation 412 displayed with degrees 420 , an “apply modifications and close” button 416 and a cancel button 418 .

逆翻訳４１２は、この実施形態では図１に示す自動翻訳装置７８により訳言語の訳文４１４を原言語に逆翻訳した結果である。原文４１０と逆翻訳４１２とが全く同じ、又は実質的に同じ意味を表しているなら、訳文４１４の信頼性は高い。逆に両者が異なっている場合には訳文４１４が誤っている可能性がある。そこでこのように原文４１０と逆翻訳４１２とを対照して表示することで第３の編集者が訳文４１４の妥当性を判定しやすくしている。 The reverse translation 412 is the result of reverse translation of the translated sentence 414 in the target language into the original language by the automatic translation device 78 shown in FIG. 1 in this embodiment. If the original sentence 410 and the reverse translation 412 express exactly the same or substantially the same meaning, then the translation 414 is highly reliable. Conversely, if the two are different, there is a possibility that the translation 414 is incorrect. By contrasting and displaying the original text 410 and the reverse translation 412 in this manner, the third editor can easily judge the validity of the translated text 414 .

なお信頼度４２０として、この実施形態では、原文４１０と逆翻訳４１２の両者に出現する単語の一致度を算出している。単語の一致度だけでは万全ではないが第３の編集者は原言語と訳言語との双方を解することが想定されているため、この値を参考にしながら適切に訳文４１４の適否及びその修正方法を判定できるはずである。 As the reliability 420, in this embodiment, the degree of matching between words appearing in both the original sentence 410 and the reverse translation 412 is calculated. It is assumed that the third editor understands both the original language and the translated language, although the degree of coincidence of words alone is not sufficient. You should be able to determine how.

「修正を反映して閉じる」ボタン４１６をクリックすると、訳文４１４に表示された訳文が正しいとして図７に示す訳文３６０が修正された形で図７に示すエディタ画面３５０が再表示される。キャンセルボタン４１８をクリックすると、訳文４１４に修正がされていたとしてもその修正は破棄され、訳文に対して何も変更がない形で図７のエディタ画面３５０が表示される。 When the "reflect corrections and close" button 416 is clicked, the translation displayed in the translation 414 is correct, and the editor screen 350 shown in FIG. 7 is redisplayed with the translation 360 shown in FIG. 7 corrected. When the cancel button 418 is clicked, even if the translation 414 has been corrected, the correction is discarded, and the editor screen 350 of FIG. 7 is displayed without changing the translation.

（４）プログラム構成
以上に説明した構造及び各機能部の機能は、いずれもコンピュータハードウェアと、コンピュータハードウェアにより実行されるコンピュータプログラムとにより実現される。以下、各機能部を実現するプログラムの制御構造、特に編集作業を実現するプログラムの制御構造について以下に説明する。 (4) Program Configuration The structure and functions of each functional unit described above are realized by computer hardware and a computer program executed by the computer hardware. The control structure of a program that implements each functional unit, particularly the control structure of a program that implements editing work, will be described below.

ア統括サーバ６０
図９を参照して、図１に示す統括サーバ６０を実現するプログラムは、起動後、必要とするメモリ領域及び通信資源を確保し、必要な初期化を実行するステップ４５０と、図１に示す同時翻訳システム５０の各要素とネットワーク６２を介して通信を開始するステップ４５２と、同時翻訳システム５０の各要素との通信に使用するために、先入れ先出しバッファとして使用するために、記憶装置に出力バッファ用の領域を確保し、上記要素のためのバッファの各々について、次にデータを書き込むアドレスを示す書込みポインタＰ_Ｗ及び次にデータを読み出すアドレスを示す読出しポインタＰ_Ｒを０に、それぞれ初期化するステップ４５４と、ステップ４５４の完了後、動作開始指示を待ち受けて次の処理に進むステップ４５６とを含む。なお、出力バッファは音声認識装置６４、音声認識結果編集装置６６、発話分割装置６８、翻訳単位編集装置７０、自動翻訳装置７２及び翻訳結果編集装置７４の各々について設けられ、書込みポインタＰ_Ｗ及び読出しポインタＰ_Ｒもバッファの数だけ設けられることに注意する必要がある。 A Central server 60
Referring to FIG. 9, the program that implements central server 60 shown in FIG. Step 452 initiates communication with each element of simultaneous translation system 50 over network 62, and an output buffer in storage for use as a first-in-first-out buffer for use in communicating with each element of simultaneous translation system 50. and _initializes to 0 a write pointer _PW indicating the address to which data is to be written next and a read pointer PR indicating the address to which data is to be read next for each of the buffers for the above elements. It includes step 454 and step 456 for waiting for an operation start instruction after completion of step 454 and proceeding to the next process. An output buffer is provided for each of the speech recognition device 64, the speech recognition result editing device 66, the speech _segmentation device 68, the translation unit editing device 70, the automatic translation device 72, and the translation result editing device 74. Note that pointers _PR are also provided for the number of buffers.

このプログラムはさらに、ステップ４５６で動作開始指示を受けたことに応答して、音声認識装置６４からネットワーク６２を介して音声認識結果のテキストを受信するまで待機するステップ４５８と、ステップ４５８で音声認識結果のテキストを受信したことに応答して実行され、受信した音声認識結果を音声認識結果用のバッファに書き込むステップ４６０と、音声認識結果のためのバッファの書込みポインタＰ_Ｗの値を１だけインクリメントするステップ４６２と、同時翻訳処理の開始を音声認識結果編集装置６６、発話分割装置６８、翻訳単位編集装置７０、自動翻訳装置７２、翻訳結果編集装置７４、音声合成装置７６及び自動翻訳装置７８に通知するステップ４６４とを含む。 The program further waits in step 458 for receiving the text of the speech recognition result from the speech recognition device 64 via the network 62 in response to receiving the operation start instruction in step 456, and performs speech recognition in step 458. A step 460, executed in response to receiving the resulting text, for writing the received speech recognition results into a buffer for speech recognition results and incrementing the value of the buffer write pointer _PW for speech recognition results by one. step 462, and the start of simultaneous translation processing is sent to the speech recognition result editing device 66, the speech segmentation device 68, the translation unit editing device 70, the automatic translation device 72, the translation result editing device 74, the speech synthesis device 76, and the automatic translation device 78. and step 464 of notifying.

このプログラムはさらに、ネットワーク６２上の各装置からデータの送信要求又は書込み要求を待機し、受信した要求の種類に応じて制御の流れを分岐させるステップ４６６を含む。なおステップ４６６での分岐は、図９には３つしか示していない。しかし、この３つの分岐のうち書込み及び読出しの分岐は、ネットワーク６２上の各装置の種類だけ存在することに注意すべきである。 The program further includes a step 466 that waits for a request to send or write data from each device on network 62 and branches the control flow depending on the type of request received. Note that only three branches at step 466 are shown in FIG. However, it should be noted that of these three branches, write and read branches exist for each type of device on network 62 .

このプログラムはさらに、受信した要求がデータの書込み要求であることに応答して、そのデータの種類に応じた出力バッファの書込みポインタＰ_Ｗの値と読出しポインタの値とが一致するか否かを判定し制御の流れを分岐させるステップ４６８と、ステップ４６８の判定が肯定であることに応答して、データの書込み位置を示すポインタが読出し位置を示すポインタを追い越すことを示す警告を統括サーバ６０の画面に表示し、又は所定のログファイルに書き出すステップ４７０と、書込みポインタＰ_Ｗが示すアドレスに、ステップ４６６で受信したデータを書き込むステップ４７２と、書込みポインタＰ_Ｗの値を１インクリメントして制御をステップ４６６に進めるステップ４７４とを含む。 Further, in response to the fact that the received request is a data write request, the program checks whether the value of the write pointer _PW of the output buffer and the value of the read pointer match according to the type of data. Step 468 for judging and branching the flow of control; Step 470 for displaying on the screen or writing to a predetermined log file; Step 472 for writing the data received in step 466 to the address indicated by the write _{pointer PW} _; and step 474 which proceeds to step 466 .

ステップ４６８の判定が必要な理由は以下のとおりである。この実施形態では、バッファとしてリングバッファを想定している。リングバッファの場合、ポインタがバッファの末端まで到達するとポインタは先頭に戻る。普通は、バッファにデータが書き込まれるとその位置からデータが読み出される。したがって書込み位置の方が読出し位置より常に先行する。書き込まれたデータが順調に読み出され処理されれば特に問題はない、しかし、仮に書込みが続いたのに対して何らかの理由で読出しが遅くなると、書込みポインタがバッファを１周して読出しポインタに追いついてしまう可能性がある。もちろんこの可能性は小さいが、ないとはいえない。そうした場合には、バッファに書き込まれたデータが読み出されないうちに上書きされてしまうというケースが発生する。そのようなトラブルが生じたときには、処理を停止させないためにこのように警告を表示又はログを記録して処理を先に進めることにしている。 The reason why the determination of step 468 is necessary is as follows. In this embodiment, a ring buffer is assumed as the buffer. For ring buffers, the pointer wraps around when it reaches the end of the buffer. Normally, when data is written to a buffer, data is read from that location. Therefore, the write position always precedes the read position. If the written data is read and processed smoothly, there is no particular problem. It may catch up. Of course, this possibility is small, but not impossible. In such a case, the data written in the buffer may be overwritten before it is read out. When such trouble occurs, a warning is displayed or a log is recorded in this way so as not to stop the process, and the process proceeds.

このプログラムはさらに、ステップ４６６で受信した要求がデータの読出し要求であることに応答して、そのバッファの読出しポインタＰ_Ｒが書込みポインタＰ_Ｗの値を等しいか否かを判定し、判定結果に従って制御の流れを分岐させるステップ４７６と、ステップ４７６の判定が肯定であることに応答して、読み出すべきデータが存在しないことを読出し要求の送信元に通知して制御をステップ４６６に戻すステップ４７８と、ステップ４７６の判定が否定であることに応答して、読出しポインタＰ_Ｒの示すアドレスからデータを読み出し、要求の送信元に送信するステップ４８０と、読出しポインタＰ_Ｒの値を１インクリメントして制御をステップ４６６に戻すステップ４８２とを含む。 In response to the request received at step 466 being a request to read data, the program further determines whether the read pointer _{PR of the buffer is equal to the value of the write pointer PW} _, and according to the result of the determination. step 476 to branch control flow, and step 478, in response to an affirmative determination of step 476, notifying the source of the read request that there is no data to read and returning control to step 466; , in response to a negative determination in step 476, step 480 of reading data from the address indicated by the read _pointer _PR and transmitting it to the source of the request; to step 466;

ステップ４７６の判定が肯定になるのは、典型的には処理の開始時である。処理の開始時にはデータの書込みがされていない。したがって書込みポインタＰ_Ｗの値は０である。このとき、読出しポインタＰ_Ｒの値も０である。したがって読出し要求を受信しても送信するデータはない。これは書込みと読出しとが同期して（同じ速度で）進んだときも同様である。 An affirmative determination at step 476 is typically at the beginning of the process. No data has been written at the start of processing. Therefore, the value of the write pointer _PW is zero. At this time, the value of the read pointer _PR is also zero. Therefore, there is no data to send when a read request is received. This is also the case when writing and reading proceed synchronously (at the same speed).

なお、統括サーバ６０の場合、操作者によりシステム全体の停止が指示されることがある。この実施形態では、ステップ４６６でそのような停止要求を受信したときにはステップ４６６の判定で、同時翻訳システム５０の各要素に動作の終了を指示する等の必要な処理を行った後、このプログラムの実行を終了する。 In the case of the central server 60, the operator may give an instruction to stop the entire system. In this embodiment, when such a stop request is received at step 466, at the determination of step 466, necessary processing such as instructing each element of the simultaneous translation system 50 to end the operation is performed. End execution.

イ音声認識装置６４
音声認識装置６４については前記したとおり、どのようなものを用いてもよい。現在ではいわゆるニューラルネットワークを用いたものが好ましい。音声認識装置を実現するプログラムについては既に多数存在しており、それらに関する教科書及び特許文献も多数存在する。したがってここでは記載を簡潔にすることを目的に音声認識装置６４を実現するプログラムの詳細については繰り返さない。 B Voice recognition device 64
Any speech recognition device 64 may be used as described above. At present, one using a so-called neural network is preferred. There are already many programs that implement speech recognition devices, and there are many textbooks and patent documents relating to them. Therefore, for the sake of brevity, the details of the program that implements speech recognizer 64 will not be repeated here.

ウ音声認識結果編集装置６６
図１０を参照して、音声認識結果編集装置６６を実現するプログラムは、起動されると初期化処理を実行するステップ５５０と、統括サーバ６０との通信を開始するステップ５５２と、所定の初期画面を表示するステップ５５４と、統括サーバ６０から同時翻訳システム５０の動作開始を示す指示を待機するステップ５５６と、動作開始を示す指示を受けたことに応答して、音声認識結果の送信要求を統括サーバ６０に対して送信するステップ５５８と、統括サーバ６０からのデータを待機するステップ５６０とを含む。ステップ５６０において、統括サーバ６０からの返信を受信しない場合、及び音声認識結果がまだ存在していない旨の返信を受信したときには、制御はステップ５５８に戻る。この部分の処理は、同時翻訳システム５０のネットワーク６２以外の各部で共通である。 C Speech recognition result editing device 66
Referring to FIG. 10, the program that implements speech recognition result editing device 66 executes initialization processing when activated, step 552 that initiates communication with central server 60, and execution of a predetermined initial screen. step 554 to display , step 556 to wait for an instruction to start operation of the simultaneous translation system 50 from the central server 60, and in response to receiving the instruction to start the operation, supervise a transmission request for speech recognition results. It includes step 558 of sending to server 60 and step 560 of waiting for data from central server 60 . In step 560 , if no reply is received from the central server 60 and if a reply is received stating that the speech recognition result does not yet exist, control returns to step 558 . The processing of this part is common to each part of the simultaneous translation system 50 other than the network 62 .

このプログラムはさらに、ステップ５６０で統括サーバ６０から音声認識結果のテキストデータを受信したことに応答して、図３に示すようなエディタ画面２００を生成し表示するステップ５６２と、第１の編集者による指示を待機するステップ５６４と、ステップ５６４で指示を受信するとその指示の内容に従って制御の流れを分岐させるステップ５６６と、指示の内容が図３に示す「次へ」ボタン２２０のクリックであることに応答して、編集フィールド２１０に表示されているテキストを統括サーバ６０に編集結果として送信して制御の流れをステップ５５８に戻すステップ５６８とを含む。 Further, in response to receiving the text data of the speech recognition result from the central server 60 in step 560, this program further generates and displays an editor screen 200 as shown in FIG. step 564 to wait for an instruction by step 564; step 566 to branch the flow of control according to the contents of the instruction when the instruction is received in step 564; in response to sending the text displayed in edit field 210 to central server 60 as an edit result and returning control flow to step 558 .

このプログラムはさらに、ステップ５６６の判定が否定であることに応答して、ステップ５６４で受信した指示が、動作の終了を指示するものか否かにより制御の流れを分岐させるステップ５７０と、ステップ５７０の判定が肯定であることに応答して、後処理を行いプログラムの実行を終了するステップ５７４と、ステップ５７０の判定が否定であることに応答して、指示に応じた処理を実行し制御をステップ５６４に戻すステップ５７２とを含む。ステップ５７０の判定が否定のときとは、編集フィールド２１０に表示されたテキストの編集、音声信号表示部２１２のクリック、戻しボタン２１４、ポーズボタン２１６、進みボタン２１８及び再生開始ボタン（図示せず）のクリック等である。 The program further responds to a negative determination at step 566 by branching the flow of control to step 570 and step 570 depending on whether the indication received in step 564 indicates an end of operation. Step 574 performs post-processing and terminates execution of the program in response to an affirmative determination of step 574, and executes a process according to the instruction and controls in response to a negative determination in step 570. and step 572 returning to step 564 . When the determination in step 570 is negative, the text displayed in the edit field 210 is edited, the voice signal display section 212 is clicked, the return button 214, the pause button 216, the forward button 218 and the playback start button (not shown) , and so on.

エ発話分割装置６８
発話分割装置６８については、前記したとおり、訓練済のＳＶＭ又はＤＮＮにより実現される。ＳＶＭ及びＤＮＮを実現するプログラムについては既に多数存在しており、ＳＶＭ及びＤＮＮの詳細に関する教科書、論文及び特許文献も多数存在している。したがってここでは説明を簡潔にするために発話分割装置６８と実現するプログラムの詳細については繰り返さない。 D Speech division device 68
Speech segmentation device 68 is implemented by a trained SVM or DNN, as described above. There are already many programs that implement SVMs and DNNs, and there are also many textbooks, papers and patent documents on the details of SVMs and DNNs. Therefore, the details of the speech splitter 68 and the program that implements it will not be repeated here for the sake of brevity.

オ翻訳単位編集装置７０
図１１を参照して、翻訳単位編集装置７０を実現するプログラムは、初期化処理を行うステップ６００と、統括サーバ６０との通信を開始するステップ６０２と、動作開始時の初期画面を表示するステップ６０４と、統括サーバ６０から動作開始の指示を待機するステップ６０６と、統括サーバ６０から動作開始時の指示を受信したことに応答して、統括サーバ６０に対して翻訳単位分割結果（発話分割装置６８の処理結果）の送信要求を送信するステップ６０８と、統括サーバ６０からデータを受信するまで待機するステップ６１０とを含む。 E translation unit editing device 70
Referring to FIG. 11, the program that implements translation unit editing device 70 includes step 600 for performing initialization processing, step 602 for starting communication with central server 60, and step 602 for displaying an initial screen at the start of operation. 604, a step 606 of waiting for an operation start instruction from the central server 60, and in response to receiving an operation start instruction from the central server 60, a translation unit segmentation result (speech segmentation device 68 processing result) and a step 610 of waiting until data is received from the central server 60 .

このプログラムはさらに、ステップ６１０で統括サーバ６０から処理対象のデータを受信したことに応答して、図５に示すエディタ画面３００を生成し表示装置に表示するステップ６１２と、第２の編集者による指示を待機するステップ６１４と、ステップ６１４で第２の編集者から何らかの指示を受信したことに応答して、その指示が「次へ」ボタン３０６か否かにより制御の流れを分岐させるステップ６１６と、ステップ６１６の判定が肯定であることに応答して、統括サーバ６０に対して編集フィールド３０２に表示されているテキストを翻訳単位（チャンク）結果のチャンク列として送信し制御をステップ６０８に戻すステップ６１８とを含む。 Further, in response to receiving data to be processed from the central server 60 in step 610, this program further generates an editor screen 300 shown in FIG. a step 614 that awaits an instruction, and a step 616 that, in response to receiving any instruction from the second editor at step 614, branches control flow depending on whether the instruction is the "next" button 306; , in response to an affirmative determination in step 616, sending the text displayed in edit field 302 to central server 60 as a chunk string of translation unit (chunk) results, and returning control to step 608; 618.

このプログラムはさらに、ステップ６１６の判定が否定であることに応答して、指示が処理の終了を示すものか否かに従って制御の流れを分岐させるステップ６２０と、ステップ６２０の判定が肯定であることに応答して、必要な後処理を実行してプログラムの実行を終了するステップ６２４と、ステップ６２０の判定が否定であることに応答して、受信した指示に対応した処理を実行し制御をステップ６１４に戻すステップ６２２とを含む。ステップ６２２での処理とは、図５において編集フィールド３０２に表示されたテキストを編集する処理である。 The program further responds to a negative determination at step 616 by branching control flow at step 620 according to whether the indication indicates the end of processing; In response to step 624, necessary post-processing is performed to end program execution, and in response to a negative determination in step 620, processing corresponding to the received instruction is performed and control is stepped. and step 622 returning to 614 . The processing in step 622 is processing for editing the text displayed in the edit field 302 in FIG.

カ自動翻訳装置７２
自動翻訳装置７２については、前記したとおり既存のものを利用できる。自動翻訳装置を実現するプログラムについては既に多数存在しており、自動翻訳装置の詳細に関する教科書、論文及び特許文献も多数存在している。したがってここでは説明を簡潔にするために自動翻訳装置７２を実現するプログラムの詳細については繰り返さない。ただし、現在のところ自動翻訳装置７２としてはいわゆるニューラル機械翻訳がその精度及び処理速度の点で最も望ましい。 F Automatic translation device 72
As for the automatic translation device 72, an existing one can be used as described above. There are already many programs that implement automatic translation devices, and there are many textbooks, papers, and patent documents related to the details of automatic translation devices. Therefore, for the sake of brevity, the details of the program that implements the automatic translation device 72 will not be repeated here. However, at present, so-called neural machine translation is most desirable as the automatic translation device 72 in terms of accuracy and processing speed.

キ翻訳結果編集装置７４
図１２を参照して、翻訳結果編集装置７４を実現するプログラムは、初期化処理を実行するステップ６５０と、統括サーバ６０との通信を開始するステップ６５２と、処理開始を示す初期画面を表示するステップ６５４と、統括サーバ６０からの動作開始の指示を待機するステップ６５６と、統括サーバ６０から動作開始の指示を受信したことに応答して、自動翻訳装置７２による自動翻訳結果の送信要求を翻訳結果編集装置７４に対して送信するステップ６５８と、統括サーバ６０からデータを受信するまで待機するステップ６６０とを含む。 G translation result editing device 74
Referring to FIG. 12, the program that implements translation result editing device 74 performs initialization processing at step 650, starts communication with central server 60 at step 652, and displays an initial screen indicating the start of processing. Step 654; Step 656 for waiting for an operation start instruction from the central server 60; It includes a step 658 of sending to the results editor 74 and a step 660 of waiting until data is received from the central server 60 .

このプログラムはさらに、ステップ６６０で統括サーバ６０からデータを受信したことに応答して、図７に示すエディタ画面３５０を表示し、第３の編集者による何らかの指示を待機するステップ６６４と、ステップ６６４で何らかの指示を受信したことに応答して、その指示が「次へ」ボタン３５６か否かにより制御の流れを分岐させるステップ６６６と、ステップ６６６の判定が肯定であることに応答して、訳文編集フィールド３５４に表示されている訳文のテキストを編集結果として統括サーバ６０に送信し、制御をステップ６５８に戻すステップ６６８とを含む。 Further, in response to receiving data from the central server 60 in step 660, the program displays the editor screen 350 shown in FIG. step 666 for branching the control flow depending on whether the instruction is the "Next" button 356 in response to receiving any instruction at , and in response to an affirmative determination at step 666, and a step 668 of transmitting the translated text displayed in the edit field 354 to the central server 60 as an edited result and returning control to step 658 .

このプログラムはさらに、ステップ６６６の判定が否定であることに応答して、訳文編集フィールド３５４のいずれかの訳文がクリックされたか否かにより制御の流れを分岐させるステップ６７０と、ステップ６７０の判定が肯定であることに応答して、クリックされた部分の原文及び訳文を引数として図８に示す翻訳詳細のエディタ画面４００を表示するプログラムを起動しエディタ画面４００から訳文の編集結果を受信するまで待機するステップ６７２と、ステップ６７２でエディタ画面４００から編集結果を受信したことに応答して、編集後のデータで元の訳文を更新するステップ６７４と、ステップ６７４で更新した訳文に従ってエディタ画面３５０の表示を更新し制御をステップ６６４に戻すステップ６７６とを含む。ステップ６７２の詳細については図１３を参照して後述する。 The program further responds to a negative determination at step 666 by branching control flow at step 670 depending on whether any of the translations in translation edit field 354 have been clicked; In response to the affirmative response, start a program that displays the translation details editor screen 400 shown in FIG. a step 672 for updating the original translation with the edited data in response to receiving the editing result from the editor screen 400 in step 672; and a display of the editor screen 350 according to the updated translation in step 674. and step 676 which updates and returns control to step 664 . Details of step 672 will be described later with reference to FIG.

このプログラムはさらに、ステップ６７０の判定が否定であることに応答して、受信した指示が終了指示か否かにより制御の流れを分岐させるステップ６７８と、ステップ６７８の判定が肯定であることに応答して、必要な後処理を実行しプログラムの実行を終了するステップ６８２と、ステップ６７８の判定が否定であることに応答して、操作に応じた処理を実行し制御をステップ６６４に戻すステップ６８０とを含む。ステップ６８０で行う処理として特に重要なものはないが、例えば操作の取り消し、操作の再実行、ヘルプの表示、ポインタの移動等がある。 The program further responds to a negative determination at step 670 by branching control flow at step 678 depending on whether the received indication is a termination indication, and responsive to a positive determination at step 678. Then, step 682 executes necessary post-processing and terminates execution of the program, and step 680 executes processing according to the operation and returns control to step 664 in response to a negative determination in step 678. including. Although there is nothing particularly important as the processing performed in step 680, there are, for example, operation cancellation, operation re-execution, help display, pointer movement, and the like.

図１３を参照して、図１２のステップ６７２を実現するプログラムは、必要な初期化処理を実行するステップ７００と、図１２に示すプログラムから受信した訳文を記憶装置に一時保存するステップ７０２と、受信した訳文を図１に示す自動翻訳装置７８に与えて訳言語から原言語への翻訳を実行させるステップ７０４と、図１２に示すプログラムから受信した原文（図８の原文４１０）及びその訳文（図８の訳文４１４）、ステップ７０４で得られたその逆翻訳（図８の逆翻訳４１２）、並びに原文４１０と逆翻訳４１２との一致度を示す信頼度を算出し、これらを用いて図８に示すエディタ画面４００を生成して表示するステップ７０６と、第３の編集者による指示を待機するステップ７０８とを含む。 Referring to FIG. 13, the program that implements step 672 of FIG. 12 includes step 700 of executing necessary initialization processing, step 702 of temporarily storing the translation received from the program shown in FIG. Step 704 for providing the received translation to the automatic translation device 78 shown in FIG. Translated sentence 414 in FIG. 8), its back-translation obtained in step 704 (back-translation 412 in FIG. 8), and reliability indicating the degree of matching between the original sentence 410 and the back-translation 412 are calculated. and a step 708 of waiting for an instruction by a third editor.

このプログラムはさらに、ステップ７０８で受けた指示が訳文の編集指示であることに応答して、一時記憶装置に保存した訳文を、編集後の訳文で更新するステップ７１０と、編集後の訳文を自動翻訳装置７８により逆翻訳するステップ７１２と、編集後の訳文とその新たな逆翻訳とによりエディタ画面４００の表示を更新し制御をステップ７０８に戻すステップ７１４と、ステップ７０８で受けた指示が「修正を反映して閉じる」ボタン４１６であることに応答して、一時記憶装置に保存した訳文のテキストを戻り値に設定してプログラムの実行を終了するステップ７１６とを含む。なおステップ７０８で受信した指示がキャンセルボタン４１８であるときには、一時保存した訳文を破棄してプログラムの実行を終了する。図１２のステップ６７２では、通常は図１３に示すプログラムにより渡される訳文を新たな訳文とし、キャンセルされたときには図１３に示すプログラムの出力は無視する。 Further, in response to the instruction received in step 708 being an instruction to edit the translated text, the program updates the translated text stored in the temporary storage device with the edited translated text in step 710, and automatically updates the edited translated text in step 710. Step 712 for back-translating by the translation device 78; Step 714 for updating the display of the editor screen 400 with the edited translation and the new back-translation and returning the control to Step 708; and a step 716 of, in response to the button 416, setting the translation text saved in the temporary storage as the return value and ending the execution of the program. When the instruction received at step 708 is the cancel button 418, the temporarily saved translation is discarded and the execution of the program ends. At step 672 in FIG. 12, the translated sentence normally delivered by the program shown in FIG. 13 is used as the new translated sentence, and when canceled, the output of the program shown in FIG. 13 is ignored.

ク音声合成装置７６
音声合成装置７６については、前記したとおり既存のものを利用できる。音声合成装置を実現するプログラムについては既に多数存在しており、音声合成装置の詳細に関する教科書、論文及び特許文献も多数存在している。したがってここでは説明を簡潔にするために音声合成装置７６を実現するプログラムの詳細については繰り返さない。 H Speech synthesizer 76
As for the speech synthesizer 76, an existing one can be used as described above. There are already many programs that implement speech synthesizers, and there are many textbooks, papers, and patent documents on the details of speech synthesizers. Therefore, the details of the program that implements the speech synthesizer 76 will not be repeated here for the sake of brevity.

２動作
（１）全体動作の開始
図１を参照して、同時翻訳システム５０が起動されると、統括サーバ６０、音声認識装置６４、音声認識結果編集装置６６、発話分割装置６８、翻訳単位編集装置７０、自動翻訳装置７２、翻訳結果編集装置７４、音声合成装置７６、自動翻訳装置７８がそれぞれ初期化され、統括サーバ６０との通信を開始する。そしてこれら各装置のうち、音声認識装置６４以外は、統括サーバ６０から処理開始の指示を待ち受ける。 2. Operation (1) Start of Overall Operation Referring to FIG. Device 70 , automatic translation device 72 , translation result editing device 74 , speech synthesizer 76 , and automatic translation device 78 are initialized and start communicating with central server 60 . Among these devices, the devices other than the voice recognition device 64 wait for an instruction to start processing from the central server 60 .

音声認識装置６４は、入力される音声信号から音声部分を検出し、音声部分の開始から終了までを１つのまとまりとして音声認識を行う。音声認識装置６４は音声認識の結果をテキストとして統括サーバ６０にデータの書込み要求として送信した後、次の音声部分の検出待ちとなる。 The speech recognition device 64 detects a speech portion from an input speech signal, and recognizes the speech from the start to the end of the speech portion as one unit. The speech recognition device 64 transmits the speech recognition result as a text to the central server 60 as a data write request, and then waits for detection of the next speech part.

統括サーバ６０はこの音声認識結果を受信すると（図９、ステップ４５８でＹＥＳ）、音声認識装置６４の出力バッファとしての記憶領域の、書込みポインタＰ_Ｗが示すアドレスに書き込み（ステップ４６０）、書込みポインタＰ_Ｗに１を加算する（ステップ４６２）。統括サーバ６０は各部に処理開始の指示を送信する（ステップ４６４）。これに応答して、音声認識装置６４以外の各装置はいずれも処理を開始して統括サーバ６０に対してデータの送信要求を送信し（図１０のステップ５５８、図１１のステップ６０８、図１２のステップ６５８）、統括サーバ６０からのデータ待ち状態となる（ステップ５６０,
６１０、及び６６０）。 When the central server 60 receives this speech recognition result (FIG. 9, YES at step 458), it writes it to the address indicated by the write pointer _PW in the storage area as the output buffer of the speech recognition device 64 (step 460). Add 1 to _PW (step 462). The central server 60 transmits an instruction to start processing to each unit (step 464). In response to this, each device other than the speech recognition device 64 starts processing and transmits a data transmission request to the central server 60 (step 558 in FIG. 10, step 608 in FIG. 11, step 608 in FIG. 12). step 658), and waits for data from the central server 60 (step 560,
610, and 660).

動作の開始時には音声認識装置６４の出力バッファのみにデータが格納されている（図９のステップ４７６でＹＥＳ）。統括サーバ６０は、音声認識装置６４の出力バッファの、読出しポインタが指し示すアドレスから音声認識結果のテキストを読み出し（図９のステップ４７８）、音声認識結果編集装置６６に送信する（ステップ４８０）。さらに統括サーバ６０は、音声認識装置６４の出力バッファの読出しポインタＰ_Ｒに１を加算し、制御をステップ６５８に戻す。 At the start of operation, data is stored only in the output buffer of the speech recognition device 64 (YES at step 476 in FIG. 9). The central server 60 reads the speech recognition result text from the address indicated by the read pointer in the output buffer of the speech recognition device 64 (step 478 in FIG. 9), and transmits it to the speech recognition result editing device 66 (step 480). Further, the central server 60 adds 1 to the read pointer _PR of the output buffer of the speech recognition device 64 and returns the control to step 658 .

以下、各部は以下のように動作する。 Below, each part operates as follows.

（２）音声認識装置６４の動作
音声認識装置６４は、入力される音声信号の音声部分を検出し、その音声部分の音声認識を行ってひとまとまりのテキストとして統括サーバ６０に送信することを繰り返す。統括サーバはこのデータを音声認識装置６４の出力バッファに書き込み、音声認識結果編集装置６６から読出し要求を受信すると先入れ先出し方式でデータを読み出して音声認識結果編集装置６６に送信する。いずれの出力バッファについてもこの基本的動作は共通である。 (2) Operation of Speech Recognition Device 64 The speech recognition device 64 detects the speech portion of the input speech signal, performs speech recognition of the speech portion, and transmits the text as a block of text to the central server 60 repeatedly. . The central server writes this data to the output buffer of the speech recognition device 64 , and upon receiving a read request from the speech recognition result editing device 66 , reads the data in a first-in, first-out manner and transmits it to the speech recognition result editing device 66 . This basic operation is common to any output buffer.

（３）音声認識結果編集装置６６の動作
音声認識結果編集装置６６は統括サーバ６０に対して音声認識結果の送信要求を送信し、音声認識結果が送信されるまで待機する（図１０のステップ５６０）。 (3) Operation of voice recognition result editing device 66 The voice recognition result editing device 66 transmits a request for transmission of the voice recognition result to the central server 60 and waits until the voice recognition result is transmitted (step 560 in FIG. 10). ).

音声認識結果編集装置６６は、統括サーバ６０からテキストを受信すると図３に示すエディタ画面２００を生成し表示する（ステップ５６２）。第１の編集者がテキストに対して何らかの編集をすると、図１０のステップ５６４⇒ステップ５６６⇒ステップ５７０⇒ステップ５７２を経てその操作に対応する編集処理がされ、必要ならテキストが更新される（ステップ５７２）。制御はステップ５６４に戻り、第１の編集者による次の指示待ち状態となる。 Upon receiving the text from the central server 60, the speech recognition result editing device 66 generates and displays the editor screen 200 shown in FIG. 3 (step 562). When the first editor makes some edits to the text, the corresponding editing process is performed through steps 564, 566, 570, 572 in FIG. 10, and the text is updated if necessary (step 570). 572). Control returns to step 564 to await the next instruction by the first editor.

第１の編集者が図３に示す「次へ」ボタン２２０をクリックすると編集後の（何ら編集されなかった場合には編集前の）テキストが統括サーバ６０に送信される。統括サーバ６０は音声認識結果編集装置６６用の出力バッファにそのデータを格納する。 When the first editor clicks the "next" button 220 shown in FIG. 3, the edited text (or the pre-edited text if no edits have been made) is sent to the central server 60. FIG. The central server 60 stores the data in the output buffer for the voice recognition result editing device 66 .

（４）発話分割装置６８の動作
発話分割装置６８は、統括サーバ６０に対し音声認識結果編集装置６６の編集結果のテキストの送信要求を送り、統括サーバ６０からテキストが送信されるまで待機する。統括サーバ６０は、音声認識結果編集装置６６のための出力バッファからテキストを読み出し、発話分割装置６８に送信する。発話分割装置６８はこのテキストに対してＳＶＭ又はＤＮＮによるチャンク末の検出を行い、チャンク末の文字の直後に改行コードを挿入して統括サーバ６０に送信する。統括サーバ６０は発話分割装置６８のための出力バッファにこのテキストを格納する。 (4) Operation of Speech Splitting Device 68 The speech splitting device 68 sends a transmission request for the text edited by the speech recognition result editing device 66 to the central server 60 and waits until the text is transmitted from the central server 60 . The central server 60 reads the text from the output buffer for the speech recognition result editor 66 and sends it to the speech segmentation device 68 . The speech segmentation device 68 detects the end of the chunk by SVM or DNN for this text, inserts a line feed code immediately after the character at the end of the chunk, and transmits it to the central server 60 . The central server 60 stores this text in an output buffer for the speech splitter 68 .

（５）翻訳単位編集装置７０の動作
翻訳単位編集装置７０は、統括サーバ６０に対して発話分割装置６８による分割結果のテキストの送信要求を送り（図１１のステップ６０８）、統括サーバ６０からテキストが送信されるまで待機する（ステップ６１０）。統括サーバ６０は、発話分割装置６８のための出力バッファからテキストを読み出し、翻訳単位編集装置７０に送信する。統括サーバ６０から発話分割装置６８による分割結果を受信すると（図１１のステップ６１０でＹＥＳ）翻訳単位編集装置７０は、図５に示すエディタ画面３００を生成し表示装置に表示する（ステップ６１２）。翻訳単位編集装置７０は第２の編集者による指示を待機し（ステップ６１４）、指示があるとその指示が何かを判定する（ステップ６１６及び６２０）。 (5) Operation of translation unit editing device 70 The translation unit editing device 70 sends a transmission request for the text resulting from the segmentation by the speech segmentation device 68 to the central server 60 (step 608 in FIG. 11). is sent (step 610). The central server 60 reads the text from the output buffer for the speech splitter 68 and sends it to the translation unit editor 70 . When the division result by the speech division device 68 is received from the central server 60 (YES at step 610 in FIG. 11), the translation unit editing device 70 generates the editor screen 300 shown in FIG. 5 and displays it on the display device (step 612). Translation unit editor 70 waits for instructions from the second editor (step 614), and if so, determines what the instructions are (steps 616 and 620).

指示が図５に示す「次へ」ボタン３０６なら翻訳単位編集装置７０は編集結果のテキストを統括サーバ６０に送信し（ステップ６１８）、次のデータ待ちとなる（ステップ６０８及び６１０）。統括サーバ６０はこのデータを翻訳単位編集装置７０のための出力バッファに格納する。指示が動作終了の指示でなければ、何らかの編集の指示である。したがって、その指示に従って翻訳単位分割のテキストの編集を行ってテキストを更新する等の処理を行い（ステップ６２２）、次のデータ待ちとなる（ステップ６０８及び６１０）。指示が動作終了の指示であれば必要な後処理を実行し（ステップ６２４）、プログラムの実行を終了する。指示が動作終了の指示でなければ、指定された編集処理を実行して翻訳単位分割のテキストの更新等を行い（ステップ６２２）、次の指示を待機する（ステップ６１４）。 If the instruction is the "next" button 306 shown in FIG. 5, the translation unit editing device 70 transmits the edited text to the central server 60 (step 618), and waits for the next data (steps 608 and 610). The central server 60 stores this data in an output buffer for the translation unit editing device 70 . If the instruction is not an instruction to end the operation, it is an instruction for some kind of editing. Therefore, according to the instruction, the text of the translation unit segmentation is edited, and processing such as updating the text is performed (step 622), and the next data is awaited (steps 608 and 610). If the instruction is to end the operation, necessary post-processing is executed (step 624), and execution of the program ends. If the instruction is not an instruction to end the operation, the designated editing process is executed to update the text of the translation unit division (step 622), and wait for the next instruction (step 614).

（６）自動翻訳装置７２の動作
自動翻訳装置７２は、統括サーバ６０に翻訳対象のテキストの送信要求を送信する。統括サーバ６０から翻訳対象のテキストを受信すると、自動翻訳装置７２はそのテキストに対して自動翻訳を行い、訳言語のテキストを出力する。自動翻訳装置７２はこのテキストを原文とともに統括サーバ６０に送信する。統括サーバ６０はこれらのテキストを自動翻訳装置７２のための出力バッファに格納する。 (6) Operation of Automatic Translation Device 72 The automatic translation device 72 transmits a transmission request for the text to be translated to the central server 60 . When the text to be translated is received from the central server 60, the automatic translation device 72 automatically translates the text and outputs the text in the translated language. The automatic translation device 72 transmits this text to the central server 60 together with the original text. Central server 60 stores these texts in an output buffer for automatic translation device 72 .

（７）翻訳結果編集装置７４の動作
翻訳結果編集装置７４は、統括サーバ６０に対して自動翻訳装置７２による出力のテキスト（原文と翻訳結果）の送信要求を送信し（図１２のステップ６５８）、統括サーバ６０からのデータ送信を待機する（ステップ６６０）。統括サーバ６０からテキストを受信すると、翻訳結果編集装置７４は、このテキストのうちの原言語の原文と訳言語の訳文とを対照する形で図７に示すエディタ画面３５０を生成し（ステップ６６２）、表示する。その後、翻訳結果編集装置７４は第３の編集者による指示待ちとなる（ステップ６６４）
ア概略画面での編集
編集者から何らかの指示があると、翻訳結果編集装置７４は図１２のステップ６６６でその指示が「次へ」ボタン３５６か否かを判定する。判定が肯定なら（ステップ６６６でＹＥＳ）翻訳結果編集装置７４は図７の訳文編集フィールド３５４に表示されている訳文を編集結果として統括サーバ６０に送信し（ステップ６６８）、統括サーバ６０からの次のデータ待ちとなる（ステップ６５８及び６６０）。統括サーバ６０は、このデータを翻訳結果編集装置７４のための出力バッファに格納する。 (7) Operation of Translation Result Editing Device 74 The translation result editing device 74 sends a transmission request for the text output by the automatic translation device 72 (original text and translation result) to the central server 60 (step 658 in FIG. 12). , waits for data transmission from the central server 60 (step 660). Upon receiving the text from the central server 60, the translation result editing device 74 generates an editor screen 350 shown in FIG. 7 by comparing the original text in the source language and the translated text in the target language (step 662). ,indicate. Thereafter, the translation result editing device 74 waits for instructions from the third editor (step 664).
a. Editing on the Overview Screen When the editor gives some instruction, the translation result editing device 74 determines whether or not the instruction is the "next" button 356 at step 666 in FIG. If the determination is affirmative (YES at step 666), the translation result editing device 74 transmits the translation displayed in the translation editing field 354 of FIG. data is awaited (steps 658 and 660). The central server 60 stores this data in the output buffer for the translation result editing device 74 .

ステップ６６６の判定が否定であれば、翻訳結果編集装置７４はステップ６７０で図７に示す訳文編集フィールド３５４に表示されている訳文のいずれかの一部がクリックされたか否かを判定し、その訳文と対応する原文とを引き数として図１３に制御構造を示すプログラムを起動する（ステップ６７２）。この結果、図８に示すエディタ画面４００が表示されてその結果がステップ６７２に戻ってくる。図８での翻訳結果編集装置７４の動作については後述する。 If the determination in step 666 is negative, the translation result editing device 74 determines in step 670 whether any part of the translated text displayed in the translated text editing field 354 shown in FIG. A program whose control structure is shown in FIG. 13 is started with the translation and the corresponding original as arguments (step 672). As a result, the editor screen 400 shown in FIG. 8 is displayed and the result returns to step 672 . The operation of the translation result editing device 74 in FIG. 8 will be described later.

翻訳結果編集装置７４は、図１２のステップ６７２で受信した編集後のデータで統括サーバ６０から受信したテキストのうちの訳文を更新し（ステップ６７４）、更新後のデータでエディタ画面３５０の表示を更新し、制御をステップ６６４に戻す（ステップ６７６）。 The translation result editing device 74 updates the translation of the text received from the central server 60 with the edited data received in step 672 of FIG. 12 (step 674), and displays the editor screen 350 with the updated data. update and return control to step 664 (step 676).

ステップ６７０の判定が否定であれば翻訳結果編集装置７４は、ステップ６７８で指示が処理の終了指示か否かを判定する。指示が終了指示であれば（ステップ６７８でＹＥＳ）、必要な後処理を行ってプログラムの実行を終了する（ステップ６８２）。指示が終了指示でなければ（ステップ６７８でＮＯ）、ステップ６８０で指示に応じた処理を実行し、制御をステップ６６４に戻して第３の編集者による次の指示を待機する。 If the determination in step 670 is negative, the translation result editing device 74 determines in step 678 whether or not the instruction is an instruction to end processing. If the instruction is an end instruction (YES at step 678), necessary post-processing is performed and execution of the program is terminated (step 682). If the instruction is not an end instruction (NO at step 678), the process according to the instruction is executed at step 680, and control is returned to step 664 to wait for the next instruction from the third editor.

イ詳細画面での編集
翻訳結果編集装置７４は、図１２のステップ６７２で呼び出されると以下のように動作する。図１３を参照して、まずステップ７００が行われ、ステップ６７２で引き数として渡された訳文と原文とを記憶装置に一時保存する。ステップ７０４で訳文を自動翻訳装置７８に入力して、原言語の文に逆翻訳する。そして、原文と、逆翻訳と、訳文とを用いて図８に示すエディタ画面４００を生成し表示する（ステップ７０６）。 B. Editing on the detailed screen When the translation result editing device 74 is called at step 672 in FIG. 12, it operates as follows. Referring to FIG. 13, step 700 is performed first, and the translated text and the original text passed as arguments in step 672 are temporarily stored in the storage device. At step 704, the translated sentence is input to the automatic translation device 78 and back-translated into the sentence in the original language. Then, an editor screen 400 shown in FIG. 8 is generated and displayed using the original text, the reverse translation, and the translated text (step 706).

第３の編集者の指示待ちとなり（ステップ７０８）、指示があるとその指示が訳文の編集指示か、「修正を反映して閉じる」ボタン４１６か、キャンセルボタン４１８かを判定して制御の流れを判定結果により分岐させる。 The process waits for an instruction from the third editor (step 708), and if there is an instruction, it is determined whether the instruction is an instruction to edit the translation, the "reflect corrections and close" button 416, or the cancel button 418, and the flow of control proceeds. is branched according to the judgment result.

訳文の編集指示であれば、記憶装置に一時保存した訳文を編集後の訳文で更新し（ステップ７１０）、編集後の訳文を自動翻訳装置７８に入力して逆翻訳させる（ステップ７１２）。こうして得られた編集後の訳文及びその逆翻訳、並びにもとの原文を用いて図８に示すエディタ画面４００を更新し再表示して（ステップ７１４）第３の編集者の指示待ちに戻る（ステップ７０８）。 If it is an instruction to edit the translated text, the translated text temporarily stored in the storage device is updated with the edited translated text (step 710), and the edited translated text is input to the automatic translation device 78 for reverse translation (step 712). The editor screen 400 shown in FIG. 8 is updated and re-displayed (step 714) using the edited translation and its reverse translation obtained in this way, and the original text (step 714), and the process returns to waiting for the third editor's instruction ( step 708).

「修正を反映して閉じる」ボタン４１６がクリックされた場合には、ステップ７１６において、一時保存した訳文を戻り値に設定して制御を図１２のステップ６７２に戻す（ステップ７１６）。 If the "reflect corrections and close" button 416 is clicked, the temporarily saved translation is set as the return value in step 716, and the control returns to step 672 in FIG. 12 (step 716).

キャンセルボタン４１８がクリックされた場合にはその旨を示す値を戻り値に設定して制御を図１２のステップ６７２に戻す。 If the cancel button 418 has been clicked, a value indicating that fact is set as the return value, and the control returns to step 672 in FIG.

（８）音声合成装置７６の動作
音声合成装置７６は統括サーバ６０に対して翻訳結果編集装置７４による編集結果の送信を要求する。統括サーバ６０から、翻訳結果編集装置７４による編集結果である訳言語のテキストが送信されてくると、音声合成装置７６は、公知の音声合成方法のいずれかを用いてそのテキストから音声信号を生成し、しかるべき出力装置に向けて出力する。 (8) Operation of speech synthesizer 76 The speech synthesizer 76 requests the central server 60 to transmit the result of editing by the translation result editing device 74 . When text in the target language, which is the result of editing by the translation result editing device 74, is transmitted from the central server 60, the speech synthesizer 76 generates a speech signal from the text using any known speech synthesis method. and output to an appropriate output device.

同時翻訳システム５０は、統括サーバ６０による統括制御のもと、音声認識装置６４が音声認識した結果得られる文字列の各々について、上記した処理をパイプライン式に繰り返す。その結果、音声認識装置６４に入力された原言語の音声信号が、次々に訳言語に翻訳され、テキスト化され、音声信号に変換される。 Under the overall control of the central server 60, the simultaneous translation system 50 repeats the above-described processing in a pipeline manner for each character string obtained as a result of speech recognition by the speech recognition device 64. FIG. As a result, the speech signal in the original language input to the speech recognition device 64 is translated into the target language one after another, converted into text, and converted into a speech signal.

３効果
以上に述べた第１実施形態によれば、音声認識、音声認識結果の修正、音声認識結果のチャンク列への分割、チャンク列の分割の修正、自動翻訳、自動翻訳結果の修正、及び音声合成が、パイプライン式に並列に実行される。統括サーバ６０が各装置の間の出力バッファが先入れ先出し方式で動作するように各データの格納先を振り分けて制御するため、音声認識の結果が正しい順序で訳言語に翻訳され音声信号化される。また音声認識結果の修正、認識結果のチャンク列への分割の修正、及び翻訳結果に対する修正が編集者により行われる。その結果、同時翻訳が、高い精度で実現できる。 3 Effect According to the first embodiment described above, speech recognition, correction of speech recognition results, division of speech recognition results into chunk strings, correction of division of chunk strings, automatic translation, correction of automatic translation results, and Speech synthesis is performed in parallel in a pipelined manner. Since the supervising server 60 sorts and controls the storage destinations of each data so that the output buffers between the devices operate in a first-in, first-out manner, the results of speech recognition are translated into translation languages in the correct order and converted into speech signals. Also, the editor corrects the speech recognition result, corrects the division of the recognition result into chunk strings, and corrects the translation result. As a result, simultaneous translation can be achieved with high accuracy.

また、音声認識結果の修正及び認識結果のチャンク列への分割の修正を行う第１の編集者及び第２の編集者は、いずれも原言語に関する知識を持っていればよく、訳言語の知識は必要とされない。また第３の編集者は原言語と訳言語の双方の知識を持つ必要があるが、原言語のテキスト化及び自動翻訳に適したチャンク化については既に行われているため、原言語に関する処理の負担は非常に少なくなる。また上記実施形態のように原文と訳文とを対照して表示するため、訳文の誤りに関する判定及びその修正が容易に行える。さらに図８に示すように訳文を逆翻訳したものを表示することで、訳文の妥当性が容易に判定できる。さらに逆翻訳と訳文との一致度が訳文の信頼度として表示されるため、さらに第３の編集者の負荷は軽減される。 In addition, the first editor and the second editor who correct the speech recognition result and the division of the recognition result into chunk sequences need only have knowledge of the original language, and knowledge of the target language. is not required. Also, the third editor needs to have knowledge of both the source language and the target language, but since the source language has already been converted into text and chunked suitable for automatic translation, processing related to the source language burden will be much less. Moreover, since the original text and the translated text are displayed in comparison with each other as in the above embodiment, it is possible to easily judge and correct errors in the translated text. Furthermore, by displaying a back-translated version of the translated text as shown in FIG. 8, the validity of the translated text can be easily determined. Furthermore, since the degree of matching between the reverse translation and the translated text is displayed as the reliability of the translated text, the load on the third editor is further reduced.

特許文献１にはポストエディットに関する記述はないが、仮にポストエディットを行う場合には、ポストエディットを行う編集者は第１言語及び第２言語の双方に関してネイティブとほぼ同様の理解力及び表現力がないと十分な編集を行うことはできないと考えられる。上記実施形態に係る同時翻訳システム５０を用いれば、原言語に関して十分な知識を持っていればよい編集者（第１の編集者及び第２の編集者）と、双方の言語に関しての知識を持つ編集者（第３の編集者）とが必要とされるだけであり、しかも第３の編集者については、原言語についてネイティブなみのスキルは要求されない。したがって、上記実施形態に係る同時翻訳システム５０により、同時翻訳に必要な人材の不足を補うことができる。 Although there is no description of post-editing in Patent Document 1, if post-editing is performed, the post-editing editor must have substantially the same comprehension and expressive power as a native speaker in both the first and second languages. Without it, it is considered that sufficient editing cannot be performed. If the simultaneous translation system 50 according to the above embodiment is used, editors (first editor and second editor) who need to have sufficient knowledge about the source language and editors who have knowledge about both languages Only an editor (third editor) is required, and the third editor does not require native-level skills in the source language. Therefore, the simultaneous translation system 50 according to the above embodiment can make up for the lack of personnel necessary for simultaneous translation.

なお、この第１実施形態では、統括サーバ６０が各装置の間の出力バッファとして機能している。しかしこの発明はそのような実施形態には限定されない。各装置の間に別々の出力バッファを設けてもよい。この場合、それら出力バッファは出力側の装置に設けてもよいし、入力側の装置に設けてもよい。また、出力側の装置及び入力側の装置のいずれからも独立した記憶装置を両者の間に設けてもよい。 Note that in the first embodiment, the central server 60 functions as an output buffer between each device. However, the invention is not limited to such embodiments. Separate output buffers may be provided between each device. In this case, these output buffers may be provided in the device on the output side or in the device on the input side. Also, a storage device independent of both the output-side device and the input-side device may be provided between the two.

第２第２実施形態
１概略
上記実施形態では、第１の編集者、第２の編集者及び第３の編集者はいずれも１人が想定されている。しかし、これら編集者を複数利用できるようにすれば、同時翻訳をより高速にしかも精度高く行うことができる。第２実施形態はそうした実施形態である。 2nd Embodiment 1 Outline In the above embodiment, it is assumed that there is one editor for each of the first editor, the second editor, and the third editor. However, if multiple editors are available, simultaneous translation can be performed faster and more accurately. The second embodiment is such an embodiment.

２ハードウェア構成
ハードウェアとしては、この第２実施形態に係る同時翻訳システムは、図１に示す第１実施形態の同時翻訳システム５０と同じものを利用できる。したがってここではその説明は繰り返さない。 2. Hardware Configuration As hardware, the simultaneous translation system according to the second embodiment can use the same hardware as the simultaneous translation system 50 of the first embodiment shown in FIG. Therefore, the description will not be repeated here.

３プログラム構成
（１）第１実施形態との相違点
第１実施形態と異なるのは、図１に示す音声認識結果編集装置６６、翻訳単位編集装置７０及び翻訳結果編集装置７４のいずれかが複数となること、及び第２実施形態で統括サーバ６０に相当する統括サーバが実行するプログラムが図９に示す制御構造とは異なる制御構造を持つことだけである。したがって以下では統括サーバ６０が実行するプログラムの制御構造のみについて述べる。なお、音声認識結果編集装置６６、翻訳単位編集装置７０及び翻訳結果編集装置７４の中で複数となるのはいずれか１種類に限定されない。これらのうちの任意の組合せの装置が複数となってもよい。 3 Program configuration (1) Differences from the first embodiment The difference from the first embodiment is that any of the speech recognition result editing device 66, the translation unit editing device 70, and the translation result editing device 74 shown in FIG. and that the program executed by the central server corresponding to the central server 60 in the second embodiment has a control structure different from the control structure shown in FIG. Therefore, only the control structure of the program executed by the central server 60 will be described below. It should be noted that among the speech recognition result editing device 66, the translation unit editing device 70, and the translation result editing device 74, the plural types are not limited to any one type. Any combination of these devices may be plural.

（２）統括サーバのプログラム
図１４に第２実施形態に係る統括サーバが実行するプログラムの制御構造をフローチャート形式で示す。図１４のフローチャートが図９に示すものと異なるのは、図９のステップ４６４に代えて、複数の音声認識結果編集装置６６、翻訳単位編集装置７０又は翻訳結果編集装置７４を含む、同時翻訳システムの全ての構成要素に対して処理開始を通知するステップ７５４を含むことと、ステップ４６６において要求が書込み要求であると判定されたときに、ステップ４６８の前に、各装置（特に音声認識結果編集装置６６、翻訳単位編集装置７０及び翻訳結果編集装置７４）から受信したデータについて、その順序が各装置にデータを送信した順序と整合するよう、受信したデータを一時保持してデータの整序を行うステップ７５６と、ステップ７５６の処理の結果、受信したデータが全て整序されており出力バッファに書き込めるときには制御をステップ４６８に進め、まだ整序されていないときにはデータを一時保存したまま制御をステップ４６６に戻すステップ７５８とを含む点である。図１４に示すフローチャートはまた、図９のステップ４８０に代えて、読出しポインタＰ_Ｒの位置からデータを読み出し、読出しポインタＰ_Ｒの値をデータに付して送信要求の送信元の装置に送信するステップ７６０を含む点でも図９と異なっている。なお、この実施形態では、各装置は自分が受信したデータに付されていた、読出し順序を示すトークンを、処理結果のデータに付して統括サーバ６０に送信するものとする。 (2) Program of Central Server FIG. 14 shows the control structure of the program executed by the central server according to the second embodiment in the form of a flowchart. 14 differs from that shown in FIG. 9 in that the simultaneous translation system includes a plurality of speech recognition result editors 66, translation unit editors 70 or translation result editors 74 instead of step 464 in FIG. , and if the request is determined to be a write request in step 466, before step 468, each device (especially the voice recognition result editing Regarding the data received from the device 66, the translation unit editing device 70, and the translation result editing device 74), the received data is temporarily stored and arranged in order so that the order matches the order in which the data was sent to each device. As a result of step 756 and the processing of step 756, if the received data is all ordered and can be written to the output buffer, the control proceeds to step 468, and if not yet ordered, the data is temporarily stored and the control proceeds to step 468. and step 758 to return to 466. 14, instead of step 480 in FIG. 9, data is read from the position of the read pointer _PR , the value of the read pointer _PR is attached to the data, and the data is transmitted to the device that sent the transmission request. It also differs from FIG. 9 in that step 760 is included. In this embodiment, each device attaches the token indicating the reading order attached to the data received by itself to the data of the processing result and transmits it to the central server 60 .

（３）動作の相違点
同じ処理を行う装置が複数個あるとき、例えば複数の翻訳結果編集装置７４がシステム中に存在しているときには、図１４のステップ７６０で翻訳結果をそれら複数の翻訳結果編集装置７４に振り分けて送信し、各編集者により編集させる。すなわち、ステップ７６０を実行するとき、最初にはある編集者の装置にデータを送信し、次の実行時には、別の編集者の装置にデータを送信する。この場合、編集者が違うと処理速度にも違いがあったり、何らかの原因で通信遅延が発生したりした場合、送信したときの順序と異なる順序で複数の翻訳結果編集装置７４から編集結果が返されてくることがある。この第２実施形態では、同じ処理を実行する複数の装置は、同じ出力バッファを共有するので、それらの出力を受信した順序に従って出力バッファに保存すると、次の処理に渡すときにデータの順序が前後してしまう。 (3) Differences in Operation When there are a plurality of devices that perform the same processing, for example, when there are a plurality of translation result editing devices 74 in the system, the translation results are translated into the translation results of the plurality of translation results at step 760 in FIG. They are sorted and transmitted to the editing device 74 and edited by each editor. That is, the first time step 760 is executed, the data is sent to one editor's device, and the next time the data is sent to another editor's device. In this case, if different editors have different processing speeds, or if communication delays occur for some reason, the editing results are returned from the plurality of translation result editing devices 74 in an order different from the order in which they were sent. Sometimes it comes. In this second embodiment, multiple devices performing the same process share the same output buffer, so storing their outputs in the output buffer in the order they were received ensures that the data is in order when passed to the next process. Back and forth.

そこで、この実施形態のステップ７５６では、それまでに受信したデータに付されていたトークンと、新たに受信したデータのトークンとを比較し、両者が連続していれば新たに受信したデータを出力バッファに保存し、そうでなければ、その不連続な部分のトークンを持つデータを受信するまで、出力バッファとは別の記憶領域に一時保存する。そして不連続な部分を補充するトークンを持つデータを受信したときに、そのデータを出力バッファに保存し一時保存したデータをその後に保存する。もちろん、一時保存するデータが複数になる場合もあり得る。 Therefore, in step 756 of this embodiment, the token attached to the data received so far is compared with the token of the newly received data, and if the two are consecutive, the newly received data is output. Store in a buffer, otherwise temporarily store in a separate storage area from the output buffer until data with tokens for that discontinuous portion is received. Then, when data with tokens to fill the discontinuous portion is received, the data is stored in the output buffer and the temporarily stored data is subsequently stored. Of course, there may be cases where there are multiple pieces of data to be temporarily saved.

こうすることにより、同じ処理をする複数の編集者にある順序で振り分けて送信したときに、送信時と異なる順序で編集後のデータが戻ってきたときに、送信時と正しい順番に編集後のデータの順序を並べ替えてバッファに保存できる。そのため、後の処理に悪影響を及ぼすことはない。また複数の編集者により編集処理を行うので、個々の編集者の処理するデータ量は少なくなり、全体として処理速度が向上し、また各編集者の負担も軽減されるという効果がある。 By doing this, when the edited data is sent back in a certain order to multiple editors who perform the same processing, and the edited data is returned in a different order than when it was sent, the edited data will be sent in the correct order. You can rearrange the order of the data and store it in the buffer. Therefore, there is no adverse effect on subsequent processing. In addition, since the editing process is performed by a plurality of editors, the amount of data processed by each editor is reduced, the processing speed is improved as a whole, and the burden on each editor is reduced.

なお、この場合、不連続な部分を補充するトークンが所定の時間内に送信されて来ない場合もあり得る。そうした場合には処理の速度を保つために、データが全て揃わなくても一時保存していたデータを出力バッファに保存することも可能である。この場合、後からその不足したデータを受信したときには、そのデータを無視すればよい。 In this case, it is possible that the tokens for replenishing the discontinuous portion may not be sent within the predetermined time. In such a case, in order to keep the processing speed, it is possible to store the temporarily stored data in the output buffer even if all the data is not complete. In this case, when the missing data is received later, the data can be ignored.

なおこの第２実施形態では、統括サーバ６０が各装置の出力バッファを、統括サーバ６０のメモリ領域を用いたリングバッファの形式で実現している。しかしこの発明はそのような実施形態には限定されない。統括サーバ６０が各装置の出力バッファをデータベースの形で実現してもよい。 Note that in the second embodiment, the central server 60 implements the output buffer of each device in the form of a ring buffer using the memory area of the central server 60 . However, the invention is not limited to such embodiments. The central server 60 may implement the output buffer of each device in the form of a database.

第３変形例
１第１変形例
図１５は、上記第１実施形態に係る翻訳結果編集装置７４において図７及び図８に示す翻訳結果のエディタ画面３５０及び４００に変わる翻訳結果編集装置のエディタ画面８００を示す。この例では、エディタ画面３５０と翻訳結果の詳細に関するエディタ画面４００のように処理を分割することなく、処理対象となっている全ての翻訳結果を一つの画面で処理する。 Third Modification 1 First Modification FIG. 15 shows an editor screen of the translation result editing device that changes from the translation result editor screens 350 and 400 shown in FIGS. 7 and 8 in the translation result editing device 74 according to the first embodiment. 800 is shown. In this example, all the translation results to be processed are processed on one screen without dividing the processing like the editor screen 350 and the editor screen 400 for details of the translation results.

ア編集画面
図１５を参照して、エディタ画面８００は、編集対象を表示する表示フィールド８１４と、終了ボタン８１０と、「次へ」ボタン８１２とを含む。 A. Edit Screen Referring to FIG. 15, editor screen 800 includes a display field 814 for displaying an object to be edited, an end button 810 and a “next” button 812 .

表示フィールド８１４には、各々が原文と訳文と訳文の逆翻訳とを含む３文を単位とする文表示部８２０、８２２、８２４及び８２６等とが表示される。これら文表示部が表示フィールド８１４に表示しきれないときには、表示フィールド８１４の右端にスクロールボタン８２８が表示され、スクロールボタン８２８を操作することで処理対象となっている全ての訳文を表示し処理できる。 The display field 814 displays sentence display portions 820, 822, 824, 826, etc. each having three sentence units each including an original sentence, a translated sentence, and a reverse translation of the translated sentence. When these sentence display portions cannot be displayed in the display field 814, a scroll button 828 is displayed at the right end of the display field 814, and by operating the scroll button 828, all translated sentences to be processed can be displayed and processed. .

文表示部８２０等において、訳文部分（訳文編集フィールド）は編集可能である。また各訳文編集フィールドの間は例えばタブキー、矢印キー、マウス操作等により相互に移動可能である。したがって、必要であれば全ての訳文を一度に修正してから「次へ」ボタン８１２をクリックすることで全ての訳文を一度に更新できる。 In the sentence display section 820 and the like, the translated sentence portion (translated sentence edit field) can be edited. In addition, it is possible to move between the respective translation edit fields by, for example, tab keys, arrow keys, mouse operation, or the like. Therefore, if necessary, all translated sentences can be updated at once by correcting all translated sentences at once and then clicking the “Next” button 812 .

イプログラム
図１６を参照して、この変形例を実現するためのプログラムの制御構造をフローチャート形式で示す。図１６に示すフローチャートは、図１２に示すものを変形したものである。図１６が図１２と異なるのは、ステップ６５０においてデータ（原文とその訳文との対）を受信したときに、受信したデータに含まれる各訳文の逆翻訳を例えば図１に示す自動翻訳装置７８により生成するステップ８５０と、これら逆翻訳と、その原文及び訳文の対とからなる全ての組を用いて図１５に示すエディタ画面８００を生成し表示するステップ８５２と、ステップ８５２に続いて編集者による指示を待機し、指示があったときに制御をステップ６６６に進めるステップ８５４を含むことと、図１２のステップ６８０に代えて、ステップ８５４で受信した指示に応じた処理（図１２と異なり、文表示部８２４の全体の編集を含む処理）を実行し制御をステップ８５４に戻すステップ８５６を含むこととである。 B. Program Referring to FIG. 16, the control structure of the program for realizing this modified example is shown in the form of a flow chart. The flowchart shown in FIG. 16 is a modification of that shown in FIG. FIG. 16 differs from FIG. 12 in that when data (a pair of an original text and its translated text) is received in step 650, each translated text contained in the received data is back-translated, for example, by the automatic translation device 78 shown in FIG. a step 850 for generating and displaying an editor screen 800 shown in FIG. including step 854 which waits for an instruction by and advances the control to step 666 when the instruction is received; and step 856 for executing the processing including editing of the entire sentence display portion 824 and returning control to step 854 .

この変形例では、各文についての修正とエディタ画面８００への反映とを一度に行うので、必要な修正を一度に行えるという利点がある。 In this modified example, each sentence is corrected and reflected on the editor screen 800 at once, so there is an advantage that necessary corrections can be made at once.

２第２変形例
第２変形例も第１変形例と同様、図１７及び図１８の変形であり、翻訳結果編集装置において全ての原文、訳文及びその逆翻訳を一つの画面に表示する例である。 2 Second Modification The second modification is a modification of FIGS. 17 and 18, similar to the first modification, and is an example in which all the original sentences, translated sentences, and their reverse translations are displayed on one screen in the translation result editing device. be.

ア編集画面
図１７を参照して、この第２変形例に係る翻訳結果編集装置のエディタ画面９００は、図１５と同様、編集対象が表示される表示フィールド９１０と、終了ボタン８１０及び「次へ」ボタン８１２とが表示される。 A. Edit Screen Referring to FIG. 17, an editor screen 900 of the translation result editing apparatus according to the second modified example includes, as in FIG. ' button 812 is displayed.

表示フィールド９１０にはこれも第１変形例と同様、各々が原文と訳文と訳文の逆翻訳とを含む３文を単位とする文表示部９２０、９２２、９２４、及び９２６等が表示される。これらの各々には、上から原文、訳文、そして訳文の逆翻訳がこの順番で表示される。また、表示フィールド９１０に全ての文表示部が表示できないときに、表示フィールド９１０の右端にスクロールボタン８２８が表示される点も図１５と同様である。 In the display field 910, sentence display portions 920, 922, 924, 926, etc. are displayed, each having three sentences each including an original sentence, a translated sentence, and a reverse translation of the translated sentence, as in the first modification. In each of these, the original text, the translated text, and the reverse translation of the translated text are displayed in this order from the top. 15 is also the same as FIG.

ただしこの例では、文表示部の左端にはＯＫボタンと呼ぶボタンが表示される。例えば文表示部９２０の左端には、ＯＫボタン９３０が表示される。このＯＫボタン９３０は、文表示部９２０に表示されている訳文の編集が完了したことを示すためのものである。 However, in this example, a button called an OK button is displayed at the left end of the sentence display area. For example, an OK button 930 is displayed at the left end of the sentence display portion 920 . This OK button 930 is for indicating that the translation text displayed in the text display section 920 has been edited.

この実施形態では、最初に文表示部９２０の訳文編集フィールドのみが編集可能であり、他の訳文編集フィールドは編集不可となっている。編集者が訳文を編集しＯＫボタン９３０をクリックすることにより、ＯＫボタン９３０がハイライトされ、訳文編集フィールドが編集不可となる。そして次の文表示部９２２の訳文編集フィールドが編集可能になる。 In this embodiment, only the translation edit field in the sentence display section 920 is editable, and the other translation edit fields are not editable. When the editor edits the translation and clicks the OK button 930, the OK button 930 is highlighted and the translation edit field is disabled. Then, the translation editing field of the next sentence display section 922 becomes editable.

図１７に示す例では、文表示部９２０及び９２２の編集が終了し、文表示部９２４が現在の編集対象となっている。 In the example shown in FIG. 17, editing of the sentence display portions 920 and 922 has been completed, and the sentence display portion 924 is currently being edited.

こうした処理を全ての訳文に対して行い、全ての訳文についてＯＫボタンをクリックしてハイライトさせた後に「次へ」ボタン８１２をクリックすると編集が終了する。したがってこの実施形態では、全てのＯＫボタンがハイライトされないと「次へ」ボタン８１２がクリック可能とならないようにしてもよい。 This process is performed for all the translated texts, and after all the translated texts are highlighted by clicking the OK button, the editing is completed by clicking the "Next" button 812 . Therefore, in this embodiment, the "Next" button 812 may not be clickable until all OK buttons are highlighted.

イプログラム
図１８を参照して、この変形例に係る翻訳結果編集装置を実現するためのプログラムは図１６に示すものと異なるのは、図１６のステップ８５０の後に、図１７に示す表示画面を生成し、文表示部９２０の編集を可能とし他は入力不可にするステップ９５０と、各訳文に対してＯＫフラグの配列を作成し全ての値を０（承認前）に初期化するステップ９５２とを含むこと、ステップ６６６の判定がＮＯのときに、指示がＯＫボタンであるか否かにより制御の流れを分岐させるステップ９５４と、ステップ９５４の判定が肯定のときに、編集対象となっている訳文に対応するＯＫフラグ配列の要素に「１」（承認後）を代入するステップ９５６と、ステップ９５６での処理の結果、全ての文表示部に対応するＯＫフラグ配列の要素の値が「１」となったか否かに従って制御の流れを分岐させるステップ９５８と、ステップ９５８の判定が否定のときに、処理中の文表示部の編集を不可にし、次の文表示部の編集を可能にするようエディタ画面９００を編集して再表示して制御をステップ８５４に戻すステップ９６０とを含むことである。ステップ９５４の判定が否定のときには制御はステップ６７８に進み、後は図１６と同様である。ステップ９５８の判定が肯定のときには、全ての訳文に対する編集が完了したということなので制御はステップ６６８に進み、編集結果を統括サーバ６０に送信して制御はステップ６５８に戻る。 B. Program Referring to FIG. 18, the program for realizing the translation result editing apparatus according to this modification differs from that shown in FIG. 16 in that the display screen shown in FIG. a step 950 of creating an OK flag array for each translated text and initializing all values to 0 (before approval); a step 954 for branching the flow of control depending on whether or not the instruction is the OK button when the determination in step 666 is NO; and when the determination in step 954 is affirmative, the Step 956 of substituting "1" (after approval) into the element of the OK flag array corresponding to the translation, and as a result of the processing in step 956, the value of the element of the OK flag array corresponding to all the sentence display parts is "1". '', and when the determination in step 958 is negative, the editing of the sentence display portion being processed is disabled and the editing of the next sentence display portion is enabled. and step 960 which edits and redisplays editor screen 900 and returns control to step 854 . If the determination in step 954 is negative, control proceeds to step 678, and the rest is the same as in FIG. If the determination in step 958 is affirmative, it means that editing for all translations has been completed, so control proceeds to step 668 to transmit the editing results to central server 60 and control returns to step 658 .

ウ動作
以上の説明から明らかなように、この第２変形例では、第１変形例と同様、一つの画面に全ての原文、訳文及びその逆翻訳を表示する。ただし第１変形例と異なり、編集完了は訳文の一つずつについて行い、先行する全ての訳文の編集が完了しないと次の訳文の編集を完了できない。そして全ての訳文の編集が完了した時点で編集結果を統括サーバ６０に送信する。 C. Operation As is clear from the above description, in this second modification, as in the first modification, all original texts, translated texts, and their reverse translations are displayed on one screen. However, unlike the first modified example, editing is completed for each translated text, and editing of the next translated text cannot be completed until all preceding translated texts have been edited. Then, when the editing of all translated sentences is completed, the edited result is transmitted to the central server 60 .

この第２変形例によれば、第１変形例と同様、訳文全体について原文と逆翻訳とを対照しながら編集者が訳文の編集を行える。さらにこの第２変形例では、第１変形例とは異なり一つずつ訳文の編集を行う。したがって、編集者が処理対象の一つの訳文に注意を集中させることができるという効果もある。またこの第２変形例では、一つの訳文に対する編集を完了するとその訳文の編集に後戻りできない。したがって、翻訳の流れが停滞するおそれが小さくなるという効果もある。 According to the second modified example, as in the first modified example, the editor can edit the translated text while comparing the original text and the reverse translation for the entire translated text. Furthermore, in this second modification, translations are edited one by one, unlike the first modification. Therefore, there is also the effect that the editor can concentrate his attention on one translation to be processed. In addition, in this second modification, once editing of one translated sentence is completed, it is not possible to go back to editing that translated sentence. Therefore, there is also the effect of reducing the possibility that the flow of translation will stagnate.

３第３変形例
ア編集画面
この第３変形例では、編集画面は第２変形例と同じく図１７に示すものを使用する。 3 Third Modification a Edit Screen In this third modification, the edit screen shown in FIG. 17 is used as in the second modification.

イプログラム
図１９にこの第３変形例に係る翻訳結果編集装置を実現するプログラムの制御構造をフローチャート形式で示す。図１９を参照して、このプログラムが図１８に示す第２変形例と異なるのは、ステップ８５０の後に、図１７に示す表示画面を生成するステップ１０００と、ステップ１０００に表示されている編集対象の訳文について、１行目から最後の行まで順番にステップ１００４を実行するステップ１００２を含むこととである。図１８と異なり、この実施形態ではステップ１０００では１行目を入力可能にして表示する処理は行わない。代わりに、ステップ１００４の内部で、処理対象の各行の入力を可能にしてから画面を表示する。 B. Program FIG. 19 is a flow chart showing the control structure of a program that implements the translation result editing apparatus according to the third modification. 19, this program differs from the second modification shown in FIG. 18 in that after step 850, step 1000 for generating the display screen shown in FIG. is included in step 1002 for executing step 1004 in order from the first line to the last line for the translated text. Unlike FIG. 18, in this embodiment, at step 1000, the process of displaying the first row so that it can be entered is not performed. Instead, within step 1004, the screen is displayed after enabling input for each line to be processed.

ステップ１００４は、処理対象のＯＫボタンと訳文編集フィールドとをアクティベート（入力可能に）し、他の行についてはデアクティベート（入力不可に）して表示画面を更新するステップ１０２０と、指示の入力があるまで待機し、指示を受信すると指示に応じて制御を分岐させるステップ１０２２と、受信した指示がＯＫボタンであることに応答して、該当行（処理中の行）の編集結果の訳文を統括サーバ６０に送信するステップ１０２４と、処理が完了した行を編集画面から削除するステップ１０２６と、編集画面を再表示して変更を反映させてステップ１００４の処理を終了するステップ１０２８とを含む。 Step 1004 activates (makes input possible) the OK button and the translation edit field to be processed, and deactivates (makes input impossible) the other lines to update the display screen. A step 1022 that waits until an instruction is received and branches the control according to the instruction, and in response to the received instruction being an OK button, the translation of the edited result of the corresponding line (the line being processed) is integrated. Step 1026 of deleting the processed line from the editing screen, and Step 1028 of redisplaying the editing screen to reflect the changes and ending the processing of step 1004.

ステップ１００４はさらにステップ１０２２で受けた指示が編集のときにその操作に応じた処理を実行するステップ１０３０と、ステップ１０３０の後に画面表示を更新することで編集結果を画面に反映させて制御をステップ１０２２に戻すステップ１０３２と、ステップ１０２２で受けた指示が処理の終了であるときに、必要な後処理を行ってこのプログラムの実行を終了するステップ１０３４とを含む。 Step 1004 further includes step 1030 of executing processing according to the operation when the instruction received in step 1022 is editing, and step 1030 of updating the screen display after step 1030 so that the editing result is reflected on the screen and control is executed. Step 1032 for returning to 1022, and Step 1034 for performing necessary post-processing and terminating the execution of this program when the instruction received in step 1022 is to terminate processing.

ウ動作
第２変形例では、編集を１行ずつ行うが、統括サーバ６０への編集結果の送信は、全ての編集結果を最後にまとめて行っている。この第３変形例では、最初の画面表示は第２変形例と同様である。しかし、先頭の訳文から順番に編集可能にし、ある訳文の編集が完了してＯＫボタンが押されると、それに応答して、その訳文のデータのみを編集後のデータとして直ちに統括サーバ６０に送信する点で第２変形例と異なる。またこの後にその訳文に関する表示は画面から消去する。したがって、第２変形例よりもさらに編集者の注意を処理対象の訳文に集中させることができる。編集者が既に編集の終わった訳文について考え直すことができないので、同時翻訳の流れが滞るという危険性を第２変形例よりさらに小さくできるという効果がある。 C. Operation In the second modified example, editing is performed line by line, but all the editing results are collectively sent to the central server 60 at the end. In this third modification, the initial screen display is the same as in the second modification. However, the translated text is made editable in order from the top, and when the editing of a certain translated text is completed and the OK button is pressed, only the data of the translated text is immediately transmitted to the central server 60 as the data after editing in response. This is different from the second modified example in that respect. After that, the display related to the translation is erased from the screen. Therefore, the editor's attention can be focused on the translation to be processed more than in the second modification. Since the editor cannot reconsider the translation that has already been edited, there is an effect that the risk of the flow of simultaneous translation becoming stagnant can be made even smaller than in the second modification.

なお、上記各実施形態で使用した画面例は単なる例にすぎない。上記した画面例と異なる編集画面を採用してもよい。 Note that the screen examples used in the above embodiments are merely examples. An editing screen different from the screen examples described above may be adopted.

また、上記実施形態では、第１から第３まで、編集者を少なくとも３人用いている。しかしこの発明はそのような実施形態に限定されるわけではない。仮に音声認識結果の信頼性が非常に高いときには第１の編集者を省略できる可能性がある。また音声認識結果のチャンクへの分割の精度が非常に高いときには第２の編集者を省略できる可能性がある。もちろん、そのように各処理の精度が高い場合も編集者を配置する方が信頼性の点では好ましい。さらに、上記実施形態では、第１の編集者及び第２の編集者は、少なくとも第１言語に関する知識を持っていること、第３の編集者は、第１言語と第２言語の双方の知識を持っていることとしたが、この発明によれば第３の編集者も同時通訳者としての高い能力を持っている必要はない。そのため、第１言語と第２言語の双方の知識を持っている者であれば、第１編集者から第３編集者までのいずれの役割でも担うことができる。したがって、これらの編集者の間で作業分担を切り替えるようにしてもよい。 Also, in the above embodiment, at least three editors are used from the first to the third. However, the invention is not limited to such embodiments. If the reliability of the speech recognition result is very high, there is a possibility that the first editor can be omitted. Also, when the accuracy of dividing the speech recognition result into chunks is very high, there is a possibility that the second editor can be omitted. Of course, even when the accuracy of each process is high, it is preferable from the standpoint of reliability to assign an editor. Further, in the above embodiment, the first editor and the second editor have knowledge of at least the first language, and the third editor has knowledge of both the first language and the second language. However, according to the present invention, the third editor does not need to have a high ability as a simultaneous interpreter. Therefore, a person who has knowledge of both the first language and the second language can play any role from the first editor to the third editor. Therefore, the work sharing may be switched among these editors.

今回開示された実施形態は単に例示であって、本発明が上記した実施形態のみに制限されるわけではない。本発明の範囲は、発明の詳細な説明の記載を参酌した上で、特許請求の範囲の各請求項によって示され、そこに記載された文言と均等の意味及び範囲内での全ての変更を含む。 The embodiments disclosed this time are merely examples, and the present invention is not limited to the embodiments described above. The scope of the present invention is indicated by each claim in the scope of claims after taking into consideration the description of the detailed description of the invention, and all changes within the meaning and range of equivalents to the wording described therein include.

５０同時翻訳システム
６０統括サーバ
６２ネットワーク
６４音声認識装置
６６音声認識結果編集装置
６８発話分割装置
７０翻訳単位編集装置
７２、７８自動翻訳装置
７４翻訳結果編集装置
７６音声合成装置
１００同時翻訳処理
２１０、３０２編集フィールド
２１２音声信号表示部
３５２原文フィールド
３５４訳文編集フィールド
３５８、４１０原文
３６０、４１４訳文
４１２逆翻訳
８１４、９１０表示フィールド
８２０、８２２、８２４、８２６、９２０、９２２、９２４、９２６文表示部
50 Simultaneous translation system 60 Central server 62 Network 64 Speech recognition device 66 Speech recognition result editing device 68 Speech segmentation device 70 Translation unit editing device 72, 78 Automatic translation device 74 Translation result editing device 76 Speech synthesis device 100 Simultaneous translation processing 210, 302 Edit field 212 Audio signal display section 352 Original text field 354 Translation edit fields 358, 410 Original text 360, 414 Translation text 412 Reverse translation 814, 910 Display fields 820, 822, 824, 826, 920, 922, 924, 926 Text display section

Claims

A simultaneous translation system for translating speech in a first language into a second language, comprising:
chunk string output means for recognizing the speech signal in the first language, dividing it into chunks, and outputting a chunk string consisting of chunks in the first language;
automatic translation means for automatically translating each of the chunks in the first language output by the chunk sequence output means into the second language and outputting the chunks in the second language;
a first editing of the chunk in the second language so that the chunk in the second language output by the automatic translation means can be compared with the chunk in the first language corresponding to the chunk in the second language; a post-editing means for outputting the edited text in the second language in response to receiving an instruction to end editing; .

The simultaneous translation system further includes reverse translation means for translating the chunks of the second language into the first language;
2. The simultaneous translation system according to claim 1, wherein said display device further displays a back-translation result by said back-translation means.

The post-editing means further includes matching degree calculation means for calculating a degree of matching between the translation result by the reverse translation means and the chunk in the first language, which is the original text of the chunk in the second language,
3. The simultaneous translation system of claim 2, wherein the display device further displays the degree of matching as a degree of confidence in the translation of the second language chunk.

The simultaneous translation system includes first and second post-editing means,
The simultaneous translation system further sorts and inputs the output of the automatic translation means to the first or second post-editing means, and outputs the outputs of the first and second post-editing means for the input in the correct order. 4. The simultaneous translation system according to any one of claims 1 to 3, comprising a first integration means for integrating with.

The chunk string output means is
speech recognition text output means for recognizing speech signals in the first language and outputting text in the first language;
automatic text division means for automatically dividing the text in the first language output from the speech recognition text output device into the chunks and outputting the chunk string in the first language;
a display device for displaying the chunk string in the first language output by the automatic text division means so that a second editor can edit the chunks, and responds to an instruction to finish editing by the second editor; 5. The simultaneous translation system according to any one of claims 1 to 4, further comprising chunk string editing means for outputting the displayed chunk string in the first language.

A simultaneous translation method for translating speech in a first language into a second language, comprising:
a computer performing voice recognition on the first language audio signal, dividing it into chunks and outputting a chunk sequence of chunks in the first language;
a computer performing an automatic translation into the second language for each of the chunks in the first language and outputting chunks in the second language;
so that the computer can compare the chunks in the second language with the chunks in the first language corresponding to the chunks in the second language, and the editing by the first editor to the chunks in the second language displaying on a display device such that
a computer accepting edits to the chunk in the second language by the first editor;
and a computer, in response to receiving an instruction to end editing, outputting said second language text edited in said step of accepting edits.