JP3999771B2

JP3999771B2 - Translation support program, translation support apparatus, and translation support method

Info

Publication number: JP3999771B2
Application number: JP2004199606A
Authority: JP
Inventors: 晶佐々木; 裕美子吉村
Original assignee: Toshiba Corp; Toshiba Solutions Corp
Current assignee: Toshiba Corp; Toshiba Digital Solutions Corp
Priority date: 2004-07-06
Filing date: 2004-07-06
Publication date: 2007-10-31
Anticipated expiration: 2024-07-06
Also published as: JP2006023844A

Description

本発明は、例えばある言語の文章を他の言語の文章に翻訳する翻訳支援プログラム、翻訳支援装置、翻訳支援方法に関する。 The present invention relates to a translation support program, a translation support apparatus, and a translation support method for translating sentences in one language into sentences in another language, for example.

ある言語（原言語）の文書を他の言語（目的言語）の文書に自動的に翻訳する機械翻訳装置がある。 There is a machine translation device that automatically translates a document in one language (source language) into a document in another language (target language).

従来の機械翻訳装置としては、ある言語で書かれている文（原文）と、その文を他の言語で表した文（訳文）とを対で格納したデータベースである翻訳メモリを参照して、翻訳作業を支援する翻訳支援装置が知られている（例えば特許文献１参照）。 As a conventional machine translation device, referring to a translation memory which is a database storing a sentence (original sentence) written in a certain language and a sentence (translation) expressing the sentence in another language, A translation support apparatus that supports translation work is known (see, for example, Patent Document 1).

従来の翻訳支援装置における翻訳処理の流れは以下のようになっている。
１．過去に翻訳済みの対訳文を翻訳メモリに予め登録しておき、翻訳対象文と類似の文が翻訳メモリ中にあれば、それを参照して翻訳に利用する。
２．翻訳対象文と類似の文が翻訳メモリ中にない場合は、機械翻訳を実行させて翻訳文（下訳）を作成し、下訳に人手で適宜修正を加えて訳文を完成する。
３．翻訳対象文と完成した訳文を翻訳メモリに登録して、新たな翻訳対象文書を翻訳する際に再利用する。
特開平６−６８１４２号公報 The flow of translation processing in a conventional translation support apparatus is as follows.
1. A previously translated bilingual sentence is registered in the translation memory in advance, and if there is a sentence similar to the translation target sentence in the translation memory, it is referred to and used for translation.
2. If there is no sentence similar to the translation target sentence in the translation memory, machine translation is executed to create a translation sentence (subordinate translation), and the translation is completed by appropriately modifying the subtranslation manually.
3. The translation target sentence and the completed translation are registered in the translation memory and reused when a new translation target document is translated.
JP-A-6-68142

従来の翻訳支援装置では、機械翻訳の翻訳結果を人手で修正して翻訳メモリに登録する場合、訳文の修正（後編集）は、簡単に行えても原文の修正（前編集）は簡単には行えないという問題があった。 In the conventional translation support device, when the translation result of machine translation is manually corrected and registered in the translation memory, the correction of the original sentence (pre-editing) is easy even if the correction of the translation (post-editing) can be easily performed. There was a problem that it could not be done.

これは、翻訳メモリに登録する原文がオリジナルの原文から変更されていると、別の文書の翻訳時に再利用され難くなるためである。 This is because if the original text registered in the translation memory is changed from the original original text, it is difficult to reuse it when translating another document.

例えば、ある日本語テキスト文書から、同じ内容のｈｔｍｌ文書、リッチテキスト文書を作ってあったものとする。そして、日本語テキスト文書を基にして英語の翻訳文書を作成し、これらを翻訳メモリに登録したものとする。その際に、翻訳メモリに登録した原文はオリジナルの原文から変更されているものとする。 For example, it is assumed that an html document and a rich text document having the same contents are created from a certain Japanese text document. Then, it is assumed that English translation documents are created based on the Japanese text documents and these are registered in the translation memory. At that time, it is assumed that the original text registered in the translation memory has been changed from the original text.

この翻訳メモリの内容を使って、ｈｔｍｌ文書、リッチテキスト文書を翻訳し、英語版のｈｔｍｌ文書、リッチテキスト文書を作成するものとする。 It is assumed that the contents of the translation memory are used to translate an html document and a rich text document to create an English version html document and a rich text document.

しかし、翻訳メモリには、オリジナルの原文とは異なる原文が登録されているため、翻訳メモリのデータとの類似率は低くなる。また、機械翻訳装置が翻訳し易いように原文を修正すると、実際には存在しない人工的な文になる場合もあり、このような場合は実際の文とマッチしない可能性がさらに高くなる。 However, since the original text different from the original text is registered in the translation memory, the similarity rate with the data in the translation memory is low. Further, when the original sentence is corrected so that it can be easily translated by the machine translation device, it may become an artificial sentence that does not actually exist. In such a case, the possibility of not matching with the actual sentence is further increased.

そこで、原文の修正をあえて行う場合には、修正前の原文を別途保存しておき、翻訳メモリ登録時に修正後の原文と置き換える必要がある。 Therefore, when the original text is corrected, it is necessary to save the original text before correction separately and replace it with the corrected text when registering the translation memory.

しかし、この場合にも、特に原文の分割・結合を行うと、原文との置き換え作業は非常に繁雑になるという問題がある。その上、原文の分割・結合が必要になるケースはまれではない。例えば原文である日本語の１文が「が、」でつながる複数の文からなり、文を分割した方が翻訳結果が良くなる場合や、ｈｔｍｌファイルなどの書式情報付き文書においてレイアウトの都合で１文が分割されており、正しく翻訳するには文を結合する必要がある場合など、枚挙にいとまがない。 However, even in this case, there is a problem that the replacement work with the original text becomes very complicated especially when the original text is divided and combined. In addition, it is not uncommon to need to split and combine text. For example, if the original Japanese sentence is composed of a plurality of sentences connected by “ga”, the result of translation is better if the sentence is divided, or in the case of layout information in a document with format information such as an html file. If the sentences are divided and need to be combined for correct translation, there is no limit.

このように、翻訳メモリに登録することを考えると、原文の修正は簡単にはできないため、現実的には機械翻訳結果の修正は、訳文の修正に頼る場合が大半であった。 In this way, considering the registration in the translation memory, the correction of the original text cannot be easily performed. Therefore, in reality, the correction of the machine translation result is mostly dependent on the correction of the translation.

最初から自分で英文を書き起こせるユーザは、機械翻訳の機能を十分に活用せずに、訳文の上書き編集をして多大な労力を費やし、最初から英文を書き起こすことの難しいユーザは、訳文に問題があることは分かっていても、それをどう直せばよいかを推敲できずに機械翻訳された訳文を容認せざるを得ない場合が多くあった。 Users who can transcribe English by themselves from the beginning do not make full use of the machine translation function, and do a great deal of effort by overwriting and editing the translation. Even though we knew that there was a problem, there were many cases where we had to accept a machine-translated translation without being able to figure out how to fix it.

本発明はこのような課題を解決するためになされたもので、原文を修正しても編集前の状態に戻せるような編集処理が行え、機械翻訳の機能および翻訳メモリ中の翻訳資産を有効に活用できる翻訳支援プログラム、翻訳支援装置、翻訳支援方法を提供することを目的としている。 The present invention has been made to solve such a problem, and can perform editing processing so that even if the original text is corrected, it can be restored to the state before editing, and the machine translation function and the translation assets in the translation memory are effectively used. The purpose is to provide a translation support program, a translation support apparatus, and a translation support method that can be utilized.

上記した目的を達成するために、本発明の翻訳支援プログラムは、翻訳辞書、翻訳結果を記憶する翻訳メモリ、翻訳文及び原文に対する編集履歴を記憶する編集履歴記憶部を備えたコンピュータによって、ある言語の原文を他の言語に翻訳処理する翻訳支援プログラムにおいて、前記コンピュータを、前記原文を、前記翻訳辞書に基づいて機械翻訳することで翻訳文を生成する翻訳手段と、前記翻訳手段により前記原文と前記翻訳文を対応付けて前記翻訳メモリに保存する手段と、前記翻訳メモリに記憶された原文および前記翻訳文のうち少なくとも一つの文にある文字列が編集された場合、その位置に、前記原文の文字列に対して編集部分を示す特殊記号を付加して特殊記号付きの原文を生成する編集手段と、
前記編集手段により生成された特殊記号付きの原文を、元の原文の編集履歴として前記編集履歴記憶部に記憶する手段として機能させることを特徴とする。
上記翻訳支援プログラムにおいて、前記コンピュータを、前記特殊記号付きの原文を、特殊記号に従って再度翻訳して新たな翻訳文を生成する再翻訳手段と、前記再翻訳手段により生成された新たな翻訳文を翻訳結果の文書として前記特殊記号付きの原文に対応付けて前記翻訳メモリに記憶する手段として機能させるようにしても良い。 In order to achieve the above-described object, a translation support program according to the present invention includes a translation dictionary, a translation memory for storing a translation result, a computer having an editing history storage unit for storing a translation sentence and an editing history for the original sentence. in the translation support program which processes a textual other language translation, the computer, the original text, the translation means for generating a translation by machine translation on the basis of the translation dictionary, and the original text by said translation means said means in association translations be stored in the translation memory, when said string in at least one sentence of the translation memory stored original and the translated sentence has been edited, in that position, the An editing means for generating a text with a special symbol by adding a special symbol indicating an editing part to the text string of the text,
The original with the generated special symbols by the editing means, characterized in that to function as a means for storing in said edit history storing section as an editing history of the original textual.
In the above-mentioned translation support program, the computer translates the original text with the special symbol again according to the special symbol to generate a new translated text, and a new translated text generated by the re-translating means. May be associated with the original text with the special symbol as a translation result document and stored in the translation memory.

本発明の翻訳支援装置は、ある言語の原文を、予め記憶されている翻訳辞書に基づいて機械翻訳することで他の言語の翻訳文を生成する翻訳手段と、前記翻訳手段による翻訳結果を前記原文と前記翻訳文とを対応付けて保存する翻訳メモリと、前記翻訳手段により翻訳された翻訳文および原文の少なくとも一方を編集した編集履歴が記憶される編集履歴記憶部と、前記原文および前記翻訳文のうち少なくとも一つの文にある文字列が編集された場合、その位置に、前記原文の文字列に対して編集部分を示す特殊記号を付加して特殊記号付きの原文を生成する編集手段と、前記編集手段により生成された特殊記号付きの原文を、元の原文の編集履歴として前記編集履歴記憶部に記憶する手段とを具備したことを特徴とする。
上記翻訳支援装置において、前記特殊記号付きの原文を、特殊記号に従って再度翻訳して新たな翻訳文を生成する再翻訳手段と、前記再翻訳手段により生成された新たな翻訳文を翻訳結果の文書として前記特殊記号付きの原文に対応付けて前記翻訳メモリに記憶する手段とを備えても良い。 Translation supporting apparatus of the present invention, the textual one language, a translation means for generating a translation of the other language by machine translation based on the translation dictionary stored in advance, the translation results by the translation means a translation memory that stores in association with the translation and the original text, and editing history storage unit for editing history editing at least one of the translated translation and original text by said translation means is stored, the original text and the translation Editing means for generating an original sentence with a special symbol by adding a special symbol indicating an editing portion to the original character string at that position when a character string in at least one sentence of the sentence is edited And means for storing the original text with the special symbol generated by the editing means in the editing history storage section as an editing history of the original text.
In the translation support apparatus, a retranslation means for re-translating the original sentence with the special symbol in accordance with the special symbol to generate a new translation sentence, and a new translation sentence generated by the re-translation means as a document of the translation result And means for storing in the translation memory in association with the original text with the special symbol.

本発明の翻訳支援方法は、翻訳手段、翻訳辞書、翻訳メモリ、編集手段、編集履歴記憶部を備えたコンピュータによって、ある言語の原文を他の言語に翻訳処理する翻訳支援方法において、前記翻訳手段が、ある言語の原文を、予め記憶されている前期翻訳辞書に基づいて機械翻訳することで他の言語の翻訳文を生成するステップと、前記翻訳手段が、前記原文と前記翻訳文とを対応付けて翻訳メモリに保存するステップと、前記原文および前記翻訳文のうち少なくとも一つの文にある文字列が編集された場合、その位置に、前記原文の文字列に対して編集部分を示す特殊記号を前記編集手段が付加して特殊記号付きの原文を生成するステップと、生成した特殊記号付きの原文を、前記編集手段が元の原文の編集履歴として編集履歴記憶部に記憶するステップとを有することを特徴とする。なお、上記翻訳支援方法において、前記特殊記号付きの原文を再翻訳手段が特殊記号に従って再度翻訳して新たな翻訳文を生成するステップと、再度翻訳して生成した新たな翻訳文を前記編集手段が翻訳結果の文書として前記特殊記号付きの原文に対応付けて翻訳メモリに保存するステップとを有してもよい。 The translation support method of the present invention is a translation support method for translating an original sentence of a certain language into another language by a computer having a translation means, a translation dictionary, a translation memory, an editing means, and an edit history storage unit. but the original text of a language, and Luz step to generate a translation in other languages by machine translation based on year translation dictionary stored in advance, said translation means, said translation and the original text and storing in the translation memory in association with, if the string in the at least one sentence of said original and said translation is edited, in that position, the editing portion for strings of the original generating a textual with special symbols added is the editing means of special symbols indicating the original text with the generated special symbols, the edit history storing unit said editing means as the edit history of the original textual Characterized by a step of 憶. In the translation support method, the step of re-translating the original sentence with the special symbol by the re-translation means according to the special symbol to generate a new translation sentence, and the new translation sentence generated by re-translation and the editing means There may have a step of storing the translation memory in association with the original text with the special symbols as a document of the translation result.

本発明では、選択された文字例の分割位置、結合位置、および一文編集のうちの少なくとも一つを行う編集位置に特殊記号を付加した特殊記号付きの原文を生成し、それを原文の編集履歴として編集履歴記憶部に記憶するので、編集履歴記憶部を参照して特殊記号付きの原文から特殊記号を検出することで、特殊記号付きの原文を編集前の原文の形態に戻すことができる。 In the present invention, an original sentence with a special symbol is generated by adding a special symbol to an editing position for performing at least one of a division position, a combining position, and a single sentence editing of the selected character example, and the original editing history is generated. Is stored in the editing history storage unit, and by detecting the special symbol from the original text with the special symbol with reference to the editing history storage unit, the original text with the special symbol can be returned to the original text before editing.

また、特殊記号付きの原文を再度翻訳した新たな翻訳文を翻訳結果の文書として特殊記号付きの原文に対応付けて翻訳メモリに保存するので、高い精度の翻訳結果を再利用できるようになる。 In addition, since a new translation sentence obtained by re-translating the original sentence with the special symbol is stored in the translation memory in association with the original sentence with the special symbol as a translation result document, the translation result with high accuracy can be reused.

以上説明したように本発明によれば、原文を修正しても編集前の状態に戻せるような編集処理が行え、機械翻訳の機能および翻訳メモリ中の翻訳資産を有効に活用できる。 As described above, according to the present invention, it is possible to perform an editing process so that even if the original text is corrected, it is possible to return to the state before editing, and the function of machine translation and the translation assets in the translation memory can be effectively utilized.

以下、本発明の実施の形態を図面を参照して詳細に説明する。図１は本発明に係る一実施形態の翻訳支援装置全体の構成を示すブロック図である。
この実施形態の翻訳支援装置は、大別して、文分割処理部１、翻訳手段としての翻訳処理部２、編集手段としての原文・訳文編集部３、文書出力部４等の４つの部分からなる。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. FIG. 1 is a block diagram showing the overall configuration of a translation support apparatus according to an embodiment of the present invention.
The translation support apparatus according to this embodiment is roughly divided into four parts: a sentence division processing unit 1, a translation processing unit 2 as a translation unit, an original / translation editing unit 3 as an editing unit, and a document output unit 4.

文分割処理部１は、翻訳対象の原言語で書かれた原文文書入力部５と、原文文書を所定の文分割規則に従って１文単位に自動的に分割する原文文書自動文分割部６と、自動文分割の際に参照される所定の文分割規則が記憶された文分割規則テーブル７とを有している。原文文書入力部５は、例えばキーボード、マウス、グラフィックユーザインタフェース画面等で構成され、原文および訳文のうち少なくとも一つに対して文字列の編集操作を行う操作手段として機能する。文分割規則テーブル７に記憶されている所定の文分割規則は、例えばメモリにファイルの形態で記憶されていても良く、また、文分割処理部１の処理プログラムの中に予め記述、つまり設定されていても良い。 The sentence division processing unit 1 includes an original document input unit 5 written in a source language to be translated, an original document automatic sentence division unit 6 that automatically divides the original document into one sentence according to a predetermined sentence division rule, And a sentence division rule table 7 in which predetermined sentence division rules referred to at the time of automatic sentence division are stored. The original document input unit 5 is configured by, for example, a keyboard, a mouse, a graphic user interface screen, and the like, and functions as an operation unit that performs a character string editing operation on at least one of the original sentence and the translated sentence. The predetermined sentence division rule stored in the sentence division rule table 7 may be stored in the form of a file in the memory, for example, and is described in advance, that is, set in the processing program of the sentence division processing unit 1. May be.

翻訳処理部２は、翻訳処理を制御する翻訳処理制御部８と、翻訳対象のある原言語（例えば日本語等）で書かれた文（翻訳対象文:第１の文書）と、翻訳後の他の言語、つまり目的言語（英語等）で書かれた文（翻訳結果の文書：第２の文書（以下訳文と称す））とが一対（一組）に対応付けられて保存されている翻訳メモリ９と、この翻訳メモリ９の中に、１文単位に分割された原言語からなる原文と類似する文（類似文）があるか否かを検索する翻訳メモリ検索部１０と、原文を自動的に目的言語の文に翻訳する機械翻訳処理部１１と、この機械翻訳処理部１１により参照される翻訳辞書としての翻訳用辞書１２を有している。 The translation processing unit 2 includes a translation processing control unit 8 that controls the translation processing, a sentence (translation target sentence: first document) written in a source language (for example, Japanese language) to be translated, Translation in which a sentence (translation result document: second document (hereinafter referred to as a translated sentence)) written in another language, that is, a target language (such as English) is stored in association with a pair (one set) A memory 9, a translation memory search unit 10 for searching whether or not there is a sentence (similar sentence) similar to the original sentence composed of the original language divided into one sentence in the translation memory 9, and the original sentence automatically In particular, a machine translation processing unit 11 that translates into a sentence in a target language and a translation dictionary 12 as a translation dictionary that is referred to by the machine translation processing unit 11 are provided.

翻訳処理部２は、ある原文を予め記憶されている翻訳用辞書１２に基づいて機械翻訳することで目的言語文書である訳語を生成する翻訳手段として機能する。
翻訳用辞書１２には、言語翻訳用の辞書情報（日→英辞書、英→日辞書等の辞書データ）と、この他、形態素解析、構文解析、意味解析、言語変換用の解析ルール、変換規則等が記憶されている。 The translation processing unit 2 functions as a translation unit that generates a translation that is a target language document by machine-translating a certain original text based on a translation dictionary 12 stored in advance.
The translation dictionary 12 includes dictionary information for language translation (dictionary data such as Japanese → English dictionary, English → Japanese dictionary), morphological analysis, syntax analysis, semantic analysis, analysis rules for language conversion, conversion Rules are stored.

翻訳メモリ９は、翻訳処理部２による翻訳結果の再利用を目的として翻訳前の文書とこの翻訳前の文書を翻訳した翻訳結果の文書とを対応付けて保存するものである。この翻訳メモリ９は、過去の翻訳実績を再利用するためにデータベースの形態で蓄積し翻訳作業の効率アップを図る機能の一部として用いられる。 The translation memory 9 stores a pre-translation document and a translation result document obtained by translating the pre-translation document in association with each other for the purpose of reusing the translation result by the translation processing unit 2. The translation memory 9 is used as part of a function for accumulating in the form of a database and improving the efficiency of translation work in order to reuse past translation results.

機械翻訳処理部１１は、形態素解析、構文解析、意味解析、言語変換処理部１３、訳文生成部１４を有している。
原文・訳文編集部３は、操作手段により原文に対して行われた文字列の編集操作に応じた位置に特殊記号を付加して第３の文書（図６参照）を生成する手段として機能する。 The machine translation processing unit 11 includes a morphological analysis, a syntax analysis, a semantic analysis, a language conversion processing unit 13 and a translation generation unit 14.
The original / translation editing unit 3 functions as a unit that generates a third document (see FIG. 6) by adding a special symbol to a position corresponding to a character string editing operation performed on the original by the operation unit. .

原文・訳文編集部３は、原文、訳文を編集した編集履歴が記憶される編集履歴記憶部１５を有している。編集履歴記憶部１５は、メモリ、ハードディスク装置に設けられた記憶領域等で実現される。原文・訳文編集部３は、生成した第３の文書を第１の文書の編集履歴として編集履歴記憶部１５に記憶する手段として機能する。つまり、編集履歴記憶部１５には、原文および訳文のうち、少なくとも一つを編集した編集履歴が記憶される。
原文・訳文編集部３は、操作手段である対訳編集画面によって、原文の中から選択された文字列に対して、編集履歴記憶部１５に記憶された特殊記号付きの訳文を基に、分割、結合、および一文編集のうちの少なくとも一つを行う位置を推定し、その位置に特殊記号を付加して特殊記号付きの原文を生成する手段として機能する。原文・訳文編集部３は、生成された特殊記号付きの原文を原文の編集履歴として編集履歴記憶部１５に記憶する手段として機能する。 The original sentence / translation editing section 3 has an editing history storage section 15 in which an original history and an editing history of editing the translation are stored. The editing history storage unit 15 is realized by a memory, a storage area provided in the hard disk device, or the like. The original / translation editing unit 3 functions as means for storing the generated third document in the editing history storage unit 15 as the editing history of the first document. That is, the edit history storage unit 15 stores an edit history in which at least one of the original sentence and the translated sentence is edited.
The original / translation editing unit 3 divides the character string selected from the original using the parallel translation editing screen, which is an operation unit, based on the translation with special symbols stored in the editing history storage unit 15. It functions as means for estimating a position where at least one of combination and single sentence editing is performed, and adding a special symbol to the position to generate an original sentence with a special symbol. The original / translation editing unit 3 functions as a unit that stores the generated original with special symbols in the editing history storage unit 15 as an editing history of the original.

訳文生成部１４は、形態素解析、構文解析、意味解析、言語変換処理部１３により形態素解析、構文解析、意味解析、言語変換されて意味をなす語句（文字列）となったものを文の形態に並べる処理を行う。 The translation generation unit 14 converts the morpheme analysis, syntax analysis, semantic analysis, and language conversion processing unit 13 into morpheme analysis, syntax analysis, semantic analysis, and language conversion into meaningful phrases (character strings). Process to arrange in.

文書出力部４は、文の形態の原文・訳文などを文書の形態で出力するものであり、原文、訳文、それぞれの修正文のうち少なくとも一つを出力するプリントドライバ、表示ドライバ等を含むプログラムと、プリンタ、表示装置等のハードウェア等である。 The document output unit 4 outputs an original sentence / translation sentence in the form of a sentence in the form of a document, and includes a print driver, a display driver, etc. that output at least one of the original sentence, the translated sentence, and each corrected sentence. And hardware such as a printer and a display device.

この翻訳支援装置のハードウェアは、ＣＰＵ、メモリ、ハードディスク装置等を備えたコンピュータと、このコンピュータに接続された表示装置および印刷装置等である。ハードディスク装置にはコンピュータシステム全体を動作させるオペレーティングシステム（以下ＯＳと称す）と、機械翻訳を実行する制御プログラム（以下翻訳支援プログラムと称す）がインストールされており、これら翻訳支援プログラム、ＯＳ、ＣＰＵ、メモリ等が協働して、文分割処理部１、翻訳処理部２、原文・訳文編集部３、文書出力部４等の処理動作を実現する。 The hardware of this translation support apparatus is a computer including a CPU, a memory, a hard disk device, and the like, and a display device and a printing device connected to the computer. An operating system (hereinafter referred to as OS) for operating the entire computer system and a control program (hereinafter referred to as translation support program) for executing machine translation are installed in the hard disk device. These translation support program, OS, CPU, The memory and the like cooperate to realize processing operations of the sentence division processing unit 1, the translation processing unit 2, the original / translation editing unit 3, the document output unit 4, and the like.

以下、図２を参照してこの実施形態の翻訳支援装置の動作を説明する。図２はこの実施形態の翻訳支援装置の処理全体を示すフローチャートである。 The operation of the translation support apparatus according to this embodiment will be described below with reference to FIG. FIG. 2 is a flowchart showing the entire processing of the translation support apparatus of this embodiment.

この翻訳支援装置では、原文文書入力部５に原文文書が入力されると（図２のステップＳ２０１）、原文文書自動文分割部６は、原文文書を１文単位に分割する（ステップＳ２０２）。この原文文書自動文分割部６では、日本語の文章の場合、１つの文は、読点を区切りとして分割される。また、英語の文章の場合、１つの文は、ピリオド、コロン、セミコロンなどを区切りとして分割される。この他、例えば括弧、改行記号、ｈｔｍｌのような書式情報付き文書では、１つの文は、改行記号、カッコなどを区切りとして分割される。また、英文の場合、“Dr.”などのようにピリオドが付く語があるため、個別に考慮すべき単語を辞書にまとめ、適宜参照する。上記区切り情報は予めメモリあるいはプログラム上に記憶（設定）されている。 In this translation support apparatus, when an original document document is input to the original document input unit 5 (step S201 in FIG. 2), the original document automatic sentence dividing unit 6 divides the original document into units of one sentence (step S202). In the original document automatic sentence dividing unit 6, in the case of a Japanese sentence, one sentence is divided with a punctuation mark as a delimiter. In the case of English sentences, one sentence is divided with a period, a colon, a semicolon, etc. as a delimiter. In addition, for example, in a document with format information such as parentheses, line feed symbols, and html, one sentence is divided with a line feed symbol, parentheses, etc. as a delimiter. In English, there are words with a period, such as “Dr.”, so the words that should be considered individually are compiled into a dictionary and referred to as appropriate. The delimiter information is stored (set) in advance in a memory or program.

原文文書自動文分割部６により１文単位に分割された原文は、１文ずつ、最後の文になるまで（ステップＳ２０３）、翻訳処理部２へ送られて、翻訳処理部２によって翻訳処理が行われる（ステップＳ２０４）。なお、ステップＳ２０４の翻訳処理の詳細な内容は後述する。 The original sentence divided into one sentence unit by the original document automatic sentence dividing unit 6 is sent to the translation processing unit 2 one sentence at a time until it becomes the last sentence (step S203), and the translation processing unit 2 performs the translation processing. Performed (step S204). The detailed contents of the translation process in step S204 will be described later.

翻訳処理が行われた訳文は、原文とともに原文・訳文編集部３へ送られる。原文・訳文編集部３では、訳文および／または原文の編集処理が行われる（ステップＳ２０５）。このステップＳ２０５の編集処理の詳細な内容については後述する。 The translated text that has undergone translation processing is sent to the original text / translated text editing section 3 together with the original text. The original / translation editing unit 3 performs a translation and / or original text editing process (step S205). Details of the editing process in step S205 will be described later.

ステップＳ２０５の編集処理が行われた後、編集済みの原文に対する翻訳処理が再度必要な場合、原文・訳文編集部３より翻訳処理部２へ翻訳対象の文が戻される。そして、翻訳処理部２により文書の有無が判定されて（ステップＳ２０３）、翻訳処理が行われる（ステップＳ２０４）。
なお、翻訳対象の文が複数存在したとしても、ステップＳ２０３の判定処理を経ることで、すべての翻訳対象文に対して１文ずつ翻訳処理が実効される。 After the editing process of step S205 is performed, when the edited original sentence needs to be translated again, the original sentence / translation editing part 3 returns the sentence to be translated to the translation processing part 2. Then, the translation processing unit 2 determines the presence / absence of a document (step S203), and translation processing is performed (step S204).
Even if there are a plurality of sentences to be translated, the translation process is executed one sentence at a time for every sentence to be translated through the determination process in step S203.

翻訳処理が再度必要でない場合（ステップＳ２０６のＮｏ）、原文・訳文編集部３は、原文と翻訳結果を文書出力部４へ送り（ステップＳ２０７）、原文・訳文編集部３としての処理動作を終了する。 If the translation process is not necessary again (No in step S206), the original / translation editing unit 3 sends the original and the translation result to the document output unit 4 (step S207), and ends the processing operation as the original / translation editing unit 3. To do.

＜翻訳処理部２の動作＞
ここで、図３を参照して図２のステップ２０４で示した翻訳処理について説明する。図３は図２のステップ２０４で示した翻訳処理を示すフローチャートである。 <Operation of translation processing unit 2>
Here, the translation process shown in step 204 of FIG. 2 will be described with reference to FIG. FIG. 3 is a flowchart showing the translation processing shown at step 204 in FIG.

文分割処理部１によって１文単位に分割された翻訳対象文が翻訳処理部２に入力されると（ステップＳ３０１）、翻訳処理部２では、翻訳メモリ検索部１０が、翻訳処理制御部８からの命令により、翻訳対象文をキーにして翻訳メモリ９を検索することで（ステップＳ３０２）、翻訳対象文と類似した文が翻訳メモリ９内に存在するか否かを判定する（ステップＳ３０３）。なおステップ３０２の翻訳メモリ検索部１０による翻訳メモリ検索処理の詳細な内容について後述する。 When the translation target sentence divided into sentence units by the sentence division processing unit 1 is input to the translation processing unit 2 (step S301), in the translation processing unit 2, the translation memory search unit 10 receives from the translation processing control unit 8. By searching the translation memory 9 using the translation target sentence as a key (step S302), it is determined whether a sentence similar to the translation target sentence exists in the translation memory 9 (step S303). The detailed contents of the translation memory search process by the translation memory search unit 10 in step 302 will be described later.

検索の結果、翻訳メモリ９内に、翻訳対象文と類似した文が存在した場合（ステップＳ３０３のＹｅｓ）、翻訳メモリ検索部１０は、その類似文を翻訳処理による訳文と判定して、翻訳結果を表示装置の表示画面へ出力し（ステップＳ３０４）、この翻訳対象文に対する翻訳処理を終了する。
また、翻訳対象文と類似した文が存在しない場合（ステップＳ３０３のＮｏ）、翻訳メモリ検索部１０は、翻訳対象文を機械翻訳処理部１１へ送り、機械翻訳処理を実行させる。
機械翻訳処理部１１は、翻訳メモリ検索部１０より受けた取った翻訳対象文に対して形態素解析、構文解析、意味解析、言語変換の各種処理からなる原文解析処理（ステップＳ３０５）と、訳文生成処理（ステップＳ３０６）とを行うことで機械翻訳処理を行う。
機械翻訳処理部１１は、機械翻訳処理が終了すると、翻訳結果を原文・訳文編集部３へ出力する（ステップＳ３０４）。 As a result of the search, if there is a sentence similar to the translation target sentence in the translation memory 9 (Yes in step S303), the translation memory search unit 10 determines that the similar sentence is a translation sentence by translation processing, and the translation result Is output to the display screen of the display device (step S304), and the translation processing for this translation target sentence is terminated.
When there is no sentence similar to the translation target sentence (No in step S303), the translation memory search unit 10 sends the translation target sentence to the machine translation processing unit 11 to execute the machine translation process.
The machine translation processing unit 11 performs source sentence analysis processing (step S305) including various processes such as morphological analysis, syntax analysis, semantic analysis, and language conversion on the translation target sentence received from the translation memory search unit 10, and translation generation. The machine translation process is performed by performing the process (step S306).
When the machine translation process ends, the machine translation processing unit 11 outputs the translation result to the original / translation editing unit 3 (step S304).

＜原文・訳文編集部３の動作＞
続いて、図４のフローチャートを参照して、上記図２のステップ２０５で示した原文・訳文編集部３の処理の詳細について説明する。 <Operation of Original / Translation Editor 3>
Next, the details of the processing of the original / translation editing unit 3 shown in step 205 of FIG. 2 will be described with reference to the flowchart of FIG.

原文・訳文編集部３に、翻訳処理部２から原文および翻訳処理の結果が送られると、原文・訳文編集部３は、原文・訳文の同じ内容を初期値として編集履歴記憶部１５に記憶するとともに、その原文・訳文を表示装置の表示画面に表示する。原文・訳文の編集処理は、表示画面に表示された原文・訳文のうち、ユーザが選択した文に対して実行される。この表示画面は、原文および訳文の対訳編集画面であり、原文および訳文の少なくとも一つに対して編集対象の文字列の選択操作を行う操作手段として機能する。 When the original text and the translation processing result are sent from the translation processing section 2 to the original text / translation text editing section 3, the original text / translation text editing section 3 stores the same contents of the original text / translation text in the editing history storage section 15 as initial values. At the same time, the original text / translation text is displayed on the display screen of the display device. The original sentence / translation editing process is executed on a sentence selected by the user from among the original sentences / translation sentences displayed on the display screen. This display screen is a parallel translation editing screen of the original sentence and the translated sentence, and functions as an operating means for performing an operation of selecting a character string to be edited with respect to at least one of the original sentence and the translated sentence.

図４には、ユーザが編集したい文を選択してから１回の編集作業が終了するまでの流れを示すものであり、すべての編集作業が終了するまで、必要に応じて図４の処理が繰り返される。 FIG. 4 shows a flow from when a user selects a sentence to be edited to when one editing operation is completed. The processing of FIG. 4 is performed as necessary until all editing operations are completed. Repeated.

ユーザにより選択された編集対象の文字列、つまり編集対象文が原文・訳文編集部３に入力されると（図４のステップＳ４０１）、原文・訳文編集部３は、編集対象文が訳文か原文かを判定する（ステップＳ４０２）。この判定の仕方としては、入力元（訳文は機械翻訳処理部１１から入力、原文は翻訳処理制御部８から入力）がどこであるか、つまりどこから送られてきたかで判定する方法と、ユーザが操作した画面あるいは文自体（原文と訳文を２分割画面に別個に表示しているため）で判定する方法がある。 When the editing target character string selected by the user, that is, the editing target sentence is input to the original / translation editing unit 3 (step S401 in FIG. 4), the original / translation editing unit 3 determines whether the editing target sentence is a translation or an original sentence. Is determined (step S402). As a method of this determination, there is a method of determining where the input source (the translated text is input from the machine translation processing unit 11 and the original text is input from the translation processing control unit 8), that is, from where the input source is sent, and a user operation There is a method of judging on the screen or the sentence itself (because the original sentence and the translated sentence are separately displayed on the two-divided screen).

判定の結果、編集対象文が訳文の場合（ステップＳ４０２のＹｅｓ）、原文・訳文編集部３は、訳文編集を行い（ステップＳ４０３）、編集結果を文書出力部４へ出力して（ステップＳ４１２）、編集処理を終了する。
また、編集対象文が原文の場合（ステップＳ４０２のＮｏ）、原文・訳文編集部３は、編集内容に応じて１文の分割を行うか否か（ステップＳ４０４）、複数文の結合を行うか否か（ステップＳ４０７）を判定し、この判定結果に応じて処理を行う。 As a result of the determination, if the edit target sentence is a translated sentence (Yes in step S402), the original / translated sentence editing unit 3 performs translation editing (step S403), and outputs the edited result to the document output unit 4 (step S412). The editing process is terminated.
If the edit target sentence is an original sentence (No in step S402), the original sentence / translation sentence editing unit 3 determines whether or not to divide one sentence according to the editing content (step S404) and whether to combine a plurality of sentences. It is determined whether or not (step S407), and processing is performed according to the determination result.

例えば１文の分割を行う場合（ステップＳ４０４のＹｅｓ）、原文・訳文編集部３は、文分割処理を実行し（ステップＳ４０５）、複数文の結合を行う場合には、文結合処理を実行し（ステップＳ４０８）、これら２つ以外の場合、つまり、編集内容が１文内での変更のみに留まる場合（ステップＳ４０４のＹｅｓ）、原文・訳文編集部３は、１文編集処理を実行する（ステップＳ４０９）。これら文分割、文結合、一文編集等の各処理によって、編集文に特殊記号が付される。各処理の内容については、後で図５の例を用いて詳細に説明する。なお、選択入力された文が長い場合、分割と結合を同時に行うことも有り得る。この場合、ステップＳ４０５、Ｓ４０８の処理が同時に実行されることになる。また、他の一文編集処理との組み合わせも考えられる。 For example, when dividing a sentence (Yes in step S404), the original / translation editing unit 3 executes a sentence dividing process (step S405), and when combining a plurality of sentences, executes a sentence combining process. (Step S408) In cases other than these two cases, that is, when the editing content is only changed within one sentence (Yes in Step S404), the original / translation editing unit 3 executes single sentence editing processing ( Step S409). A special symbol is attached to the edited sentence by each processing such as sentence division, sentence combination, and single sentence editing. The contents of each process will be described in detail later using the example of FIG. If the selected sentence is long, splitting and combining may be performed at the same time. In this case, the processes of steps S405 and S408 are executed simultaneously. A combination with other single sentence editing processing is also conceivable.

これらの処理の後、原文・訳文編集部３は、編集履歴記憶部１５に対して原文編集履歴内容の更新処理を行い（ステップＳ４０６）、編集履歴記憶部１５に、処理内容が時系列で記憶される。 After these processes, the original / translation editing unit 3 performs an update process of the content of the original text editing history in the editing history storage unit 15 (step S406), and the processing content is stored in the editing history storage unit 15 in time series. Is done.

原文の編集作業を終了した後、ユーザは再翻訳が必要か否かを判断する。再翻訳が必要と判断したユーザは、表示画面上の翻訳ボタンを操作し、再翻訳が不要と判断したユーザは、表示画面上の翻訳ボタン以外のボタン、あるいはキー操作を行うので、原文・訳文編集部３は、原文編集後のユーザの操作に応じて処理内容を変える（ステップＳ４１０）。 After completing the editing of the original text, the user determines whether retranslation is necessary. The user who determines that retranslation is necessary operates the translation button on the display screen, and the user who determines that retranslation is not necessary operates buttons or key operations other than the translation button on the display screen. The editing unit 3 changes the processing content according to the user's operation after editing the original text (step S410).

例えばユーザにより表示画面上の翻訳ボタンが操作された場合、原文・訳文編集部３は、再翻訳が必要と判定し（ステップＳ４１０のＹｅｓ）、この場合は、編集された原文を翻訳処理部２に渡し、再度の翻訳処理を実行させる（ステップＳ４１１）。 For example, when the translation button on the display screen is operated by the user, the original / translation editing unit 3 determines that retranslation is necessary (Yes in step S410). In this case, the edited original is converted into the translation processing unit 2. And a second translation process is executed (step S411).

また、ユーザにより他の操作が行われた場合、原文・訳文編集部３は、再翻訳を不要と判定し（ステップＳ４１０のＮｏ）、翻訳処理（ステップＳ４１１）をスキップし、訳文編集が必要か否かの判定処理を行う（ステップＳ４１２）。 If the user performs another operation, the original / translation editing unit 3 determines that re-translation is unnecessary (No in step S410), skips the translation process (step S411), and does the translation need to be edited? A determination process of whether or not is performed (step S412).

このステップＳ４１２の判定処理では、ユーザにより次の文（原文あるいは訳文）が編集対象として指定された場合に、原文・訳文編集部３は、訳文編集を必要と判定し（ステップＳ４１２のＹｅｓ）、指定された編集対象文に対して訳文編集を実行し（ステップＳ４０３）、その後、翻訳結果の文を翻訳メモリ９と文書出力部４へ出力し（ステップＳ４１３）、編集処理を終了する。原文・訳文編集部３より出力された翻訳結果の文は、特殊記号が付加された編集後の原文に対応付けられて翻訳メモリ９に保存（登録）される。 In the determination processing in step S412, when the next sentence (original sentence or translation) is designated as an editing target by the user, the original sentence / translation editing section 3 determines that translation editing is necessary (Yes in step S412), The translation editing is executed for the designated editing target sentence (step S403), and then the translation result sentence is output to the translation memory 9 and the document output unit 4 (step S413), and the editing process is terminated. The translation result sentence output from the original sentence / translation sentence editing unit 3 is stored (registered) in the translation memory 9 in association with the edited original sentence to which the special symbol is added.

また、ユーザにより次の文（原文あるいは訳文）が編集対象として指定されず、訳文編集が不要な場合（ステップＳ４１２のＮｏ）、原文・訳文編集部３は、訳文編集処理（ステップＳ４０３）をスキップして、編集文を出力し（ステップＳ４１３）、編集処理を終了する。 If the user does not specify the next sentence (original sentence or translated sentence) as an editing target and the translated sentence is unnecessary (No in step S412), the original sentence / translated sentence editing unit 3 skips the translated sentence editing process (step S403). Then, the edited sentence is output (step S413), and the editing process is terminated.

＜原文・訳文編集部３の動作の実例と翻訳メモリ９への登録例＞
図５は翻訳処理部２から原文・訳文編集部３に送られてきた、編集処理前の原文・訳文および編集履歴記憶部１５の内容を示したものである。これらの例を用いて、具体的な原文・訳文編集部３の動作と翻訳メモリ９への登録内容について説明する。原文・訳文未編集の状態では、編集履歴記憶部１５の内容（原文・訳文）は、それぞれの原文・訳文と全く同じ内容になっている。 <Example of operation of original / translation editing unit 3 and registration to translation memory 9>
FIG. 5 shows the contents of the original text / translation text and the editing history storage section 15 before the editing process sent from the translation processing section 2 to the original text / translation text editing section 3. Using these examples, the specific operation of the original / translation editing unit 3 and the contents registered in the translation memory 9 will be described. When the original text / translated text is not edited, the contents (original text / translated text) in the editing history storage unit 15 are exactly the same as the original text / translated text.

これらの例では、翻訳処理から出力された翻訳結果は原文の内容を十分に反映したものとは言えず、修正が必要である。
例えば文５１は、ひらがな表記になっているため、「たなか」が人名と認識されていない例である。文５２では、「田中ですが、」の「が、」が、日本語では軽い接続の意味で使われているが、翻訳結果では逆接の意味と解釈され、逆接の接続詞”although”が出力されている。文５３および文５４は、レイアウトの都合で１文が２つに分割されたため、正しく翻訳されていない例である。 In these examples, the translation result output from the translation process cannot be said to sufficiently reflect the contents of the original text and needs to be corrected.
For example, sentence 51 is an example in which “Tanaka” is not recognized as a person's name because it is written in hiragana. In sentence 52, “I am Tanaka,” but “ga,” is used in the meaning of light connection in Japanese, but in the translation result, it is interpreted as the meaning of reverse connection, and the reverse connection conjunction “although” is output. ing. Sentence 53 and sentence 54 are examples in which one sentence is divided into two parts for convenience of layout and is not correctly translated.

上記の点を考慮して、ユーザが文の選択操作を行い、この選択操作に応じて原文・訳文編集部３が文分割処理、文結合処理、１文編集処理を行い、原文を修正した結果を図６に示す。 In consideration of the above points, the user performs a sentence selection operation, and the original / translation editing unit 3 performs sentence division processing, sentence combination processing, and one sentence editing processing in accordance with the selection operation, and results of correcting the original sentence Is shown in FIG.

文６１は、ひらがな表記を漢字表記に修正したものである。漢字表記にすることで「田中」が人名と解釈され、正しい翻訳結果が出力されている。このように意味が一意に決まるように表記や表現を変更することで、正しい翻訳結果が得られる場合が多い。
文６２および文６３は、「が、」で接続された原文を２文に分割し、それぞれ文として完結するようにしたものである。これらの文を再度翻訳すると、当然だが、逆接の接続詞althoughは訳文に現れなくなる。このような文の分割処理を行うと、編集履歴記憶部１５には、分割した前半文字列の末尾と、後半文字列の先頭に特殊記号”＠数字＠”が挿入された編集履歴が記憶される。”＠”と”＠”の間の数字は、この位置に特殊記号を挿入したことを識別するための特殊記号のＩＤ番号であり、同じＩＤ番号が付いている原文同士は、編集前は繋がっていたことを示す。数字は、通常、文の文節や文の結合部が検出された際に原文・訳文編集部３により連続番号で付与される。 Sentence 61 is obtained by correcting hiragana notation to kanji notation. By using Kanji notation, “Tanaka” is interpreted as a personal name, and the correct translation result is output. In many cases, correct translation results can be obtained by changing the notation and expression so that the meaning is uniquely determined.
A sentence 62 and a sentence 63 are obtained by dividing an original sentence connected by “ga” into two sentences and completing each sentence as a sentence. If these sentences are translated again, of course, the conjunctive conjunction else does not appear in the translation. When such sentence division processing is performed, the editing history storage unit 15 stores the editing history in which the special symbol “@ number @” is inserted at the end of the divided first half character string and at the beginning of the second half character string. The The number between “@” and “@” is the ID number of the special symbol for identifying the insertion of the special symbol at this position. The originals with the same ID number are connected before editing. Indicates that it was. The numbers are normally given as serial numbers by the original / translation editing unit 3 when a sentence clause or sentence combination is detected.

文６４は、文５３および文５４の２文を結合したものである。結合した文の再翻訳結果は、意味の通るものとなっている。このような文の結合処理を行うと、編集履歴記憶部１５には文同士の結合部に”＠数字＠”が挿入された編集履歴が記憶される。 The sentence 64 is a combination of the sentences 53 and 54. The retranslation result of the combined sentence is meaningful. When such a sentence merging process is performed, the editing history storage unit 15 stores an editing history in which “@ number @” is inserted in the coupling part between sentences.

結合処理の場合は同じＩＤ番号を持つ複数の原文は存在しないが、同じＩＤを持つ原文と訳文は存在する。機械翻訳処理部１１は、結合した原文を再翻訳するので、訳文中の結合部は、本来は存在しないはずであるが、編集履歴記憶部１５（訳文）に保存されている結合前の翻訳結果と再翻訳結果とを比較することで、原文の結合部に対応する訳文の結合部を推定（特定）し翻訳を行う。例えば文６４は、”This processing”の後で切れると推定（特定）される。これは、編集履歴記憶部１５に記憶された＠前の文字列”This processing”と再翻訳結果の”This processing”が完全一致しているので、訳文の切れ目はprocessingの後ろであると推定すると、２つの翻訳結果の一致度がもっとも高くなるためである。
すなわち、翻訳処理制御部８は、編集履歴記憶部１５に記憶された訳文（最翻訳結果）に対して、編集履歴記憶部１５に記憶された編集記号付き訳文を基にして、文の切れ目を推定する。 In the case of the combining process, there are not a plurality of original sentences having the same ID number, but there are an original sentence and a translated sentence having the same ID. Since the machine translation processing unit 11 re-translates the combined original sentence, the combined part in the translated sentence should not originally exist, but the translation result before combining stored in the editing history storage unit 15 (translated sentence) And the retranslation result are compared to estimate (identify) the translation portion corresponding to the original portion, and perform translation. For example, the sentence 64 is estimated (specified) to be cut after “This processing”. This is because the previous character string “This processing” stored in the editing history storage unit 15 and the re-translation result “This processing” are completely matched, so it is assumed that the break of the translated sentence is after processing. This is because the degree of coincidence between the two translation results is the highest.
That is, the translation processing control unit 8 applies a sentence break to the translation (the most translated result) stored in the editing history storage unit 15 based on the translation with edit symbols stored in the editing history storage unit 15. presume.

図７は原文を修正して機械翻訳処理部１１が再翻訳した訳文に、更に修正を加えた例である。
この例は、文７４の「処理」に対する訳語「processing」を「process」に変更する修正を加えた例である。このように、原文の修正を行うことで構文的に正しい翻訳結果が得られると、後は翻訳結果の一部を修正するだけで一定水準の訳文を得ることができる。 FIG. 7 shows an example in which the original sentence is corrected and the translation is re-translated by the machine translation processing unit 11 and further corrected.
This example is an example in which a modification that changes the translated word “processing” to “process” for “processing” in the sentence 74 is added. As described above, when a translation result that is syntactically correct is obtained by correcting the original sentence, a translation at a certain level can be obtained by only correcting a part of the translation result.

したがって、最初から自分で英文を書き起こせるユーザは、訳文を最初から自分で作成する必要が無くなり、翻訳の労力が大幅に軽減できる。
また、最初から英文を自分で作成することが難しいユーザにとっては、わずかな修正で正しい英文が得られるため、機械翻訳の翻訳結果が間違っていても、比較的容易に修正が可能となる。 Therefore, a user who can transcribe himself / herself from the beginning does not need to create a translation from the beginning, and the translation effort can be greatly reduced.
For users who have difficulty in creating English sentences from the beginning, correct English sentences can be obtained with a slight correction, so that even if the translation result of machine translation is wrong, the correction can be made relatively easily.

図８は上記の翻訳結果から、翻訳メモリ９に登録された内容を示した図である。
翻訳メモリ９に登録された原文は、修正を加えた原文ではなく、編集履歴記憶部１５の内容であり、オリジナルの原文の文字列に対して文分割部分や文結合部を示す特殊記号を加えたもの（第３の文書の形態）になっている。 FIG. 8 is a diagram showing the contents registered in the translation memory 9 from the above translation result.
The original text registered in the translation memory 9 is not the corrected original text but the contents of the editing history storage section 15, and a special symbol indicating a sentence division part or a sentence combining part is added to the original text string. (A third document form).

この実施形態の翻訳支援装置では、このような翻訳メモリ９への登録方法を採っているが、原文を分割した場合に限り、分割前の原文に復元したものを翻訳メモリ９に登録する、という方法も可能である。この方法によれば、翻訳メモリ検索部１０に特別な機能が無くても、翻訳対象の文が編集前のオリジナルの原文と一致する場合、翻訳メモリ９に登録されたオリジナルの原文と対になっている訳文を出力することができる。 In the translation support apparatus of this embodiment, such a registration method to the translation memory 9 is adopted. However, only when the original text is divided, the restored original text is registered in the translation memory 9. A method is also possible. According to this method, even if the translation memory search unit 10 does not have a special function, if the sentence to be translated matches the original original text before editing, it is paired with the original original text registered in the translation memory 9. The translated text can be output.

以下、より汎用的な図８に示した内容で翻訳メモリ９に登録する場合について説明する。
＜新たな文書翻訳時の翻訳メモリ検索動作＞
図８に示した翻訳メモリ９を使って、図９に示す新たな文書を翻訳する場合の、翻訳メモリ検索の動作について説明する。翻訳メモリ検索は、図３のステップＳ３０２に示した処理である。 Hereinafter, the case of registering in the translation memory 9 with the more general content shown in FIG. 8 will be described.
<Translation memory search operation during new document translation>
The translation memory search operation when the new document shown in FIG. 9 is translated using the translation memory 9 shown in FIG. 8 will be described. The translation memory search is the process shown in step S302 of FIG.

この場合、翻訳メモリ検索部１０は、翻訳対象の１つの文（翻訳文０１と呼ぶとする）に対して、翻訳メモリ９内の１つのデータ（メモリデータ０１と呼ぶ）と比較処理を行い、比較処理終了後、翻訳メモリ９の次のデータとの比較処理を行う。これを繰り返し翻訳文０１と翻訳メモリ９のすべてのデータとの比較処理が終了すると、次の翻訳文０２の検索処理を開始する。 In this case, the translation memory search unit 10 performs a comparison process with one data (referred to as memory data 01) in the translation memory 9 for one sentence (referred to as translation sentence 01) to be translated, After the comparison process is completed, a comparison process with the next data in the translation memory 9 is performed. When this is repeated and the comparison process between the translated sentence 01 and all the data in the translation memory 9 is completed, the search process for the next translated sentence 02 is started.

このような比較方法をとったのは、本実施例の動作を分かりやすく説明するためであり、検査高速化のために、インデックス作成などの他の検索方法をとったとしても、本特許の範囲を逸脱するものではない。なお、比較処理の内容によっては、複数の翻訳文、複数のメモリデータをまとめて比較する場合もある。 The reason why such a comparison method is used is to explain the operation of the present embodiment in an easy-to-understand manner. Even if another search method such as index creation is used for speeding up the inspection, the scope of this patent It does not deviate from. Depending on the contents of the comparison process, a plurality of translated sentences and a plurality of memory data may be compared together.

図１０および図１１は、翻訳文０１と、翻訳メモリデータ１件（メモリデータ０１と呼ぶことにする）との比較処理を示すフローチャートであり、図１０は完全一致検索処理を示し、図１１は類似文検索処理を示す。 10 and 11 are flowcharts showing a comparison process between the translation sentence 01 and one translation memory data item (referred to as memory data 01). FIG. 10 shows an exact match search process, and FIG. A similar sentence search process is shown.

翻訳メモリ検索部１０は、まず、図１０の完全一致検索処理を実行した後、完全一致するメモリデータ０１が検出されなかった翻訳文０１に対して、引き続き、図１１の類似文検索処理を実行する。 The translation memory search unit 10 first executes the complete sentence search process of FIG. 10, and then executes the similar sentence search process of FIG. 11 for the translation sentence 01 for which no memory data 01 having a complete match has been detected. To do.

以下では、まず、図１０のフローチャートを用いて検索処理の動作を説明し、図９の例文が当てはまるケースに対して具体例を使った説明を加える。
＜翻訳メモリ検索動作−完全一致検索＞
翻訳メモリ検索部１０に翻訳対象文（翻訳文０１）が入力されると（ステップＳ５０１）、翻訳メモリ検索部１０は、まず、翻訳文０１とメモリデータ０１とを比較して互いが完全一致するか否かを判定する（ステップＳ５０２）。ここで、完全一致とは、翻訳文０１とメモリデータ０１、つまり翻訳メモリ９に記憶されている翻訳文とが一語一句違わないことを指す。 In the following, first, the operation of the search process will be described using the flowchart of FIG. 10, and a description using a specific example will be added to the case where the example sentence of FIG. 9 applies.
<Translation memory search operation-exact search>
When a translation target sentence (translation sentence 01) is input to the translation memory search unit 10 (step S501), the translation memory search unit 10 first compares the translation sentence 01 with the memory data 01 to completely match each other. Whether or not (step S502). Here, the complete match means that the translated sentence 01 and the memory data 01, that is, the translated sentence stored in the translation memory 9 are not different one by one.

比較の結果、翻訳文０１とメモリデータ０１とが完全一致した場合、翻訳メモリ検索部１０は、翻訳メモリ９に一致する文が存在するものと判定し（ステップＳ５０２のＹｅｓ）、一致した文をメモリデータ０１とする（ステップＳ５０３）。 As a result of the comparison, when the translation sentence 01 and the memory data 01 completely match, the translation memory search unit 10 determines that a matching sentence exists in the translation memory 9 (Yes in step S502), and selects the matching sentence. The memory data is 01 (step S503).

このステップＳ５０３のような結果になるケースは、翻訳文０１が図９の文９０の場合に相当する。つまり最初に行った翻訳時に、漢字表記になるよう原文を編集していても、翻訳メモリ９にはオリジナルのひらがな表記の原文である図８の対原文８１「わたしはたなかです。」が登録されていたため、オリジナルと同じひらがな表記の翻訳文０１にメモリデータ０１がマッチして、正しい英文”I am Tanaka.”が翻訳結果とされる。以上は、１文編集処理を行って翻訳メモリ９のデータがマッチするケースの場合である。 The case where the result of step S503 is obtained corresponds to the case where the translated sentence 01 is the sentence 90 in FIG. In other words, even when the original text is edited so that it is written in kanji at the time of the first translation, the translation memory 9 stores the original hiragana text as shown in FIG. Therefore, the memory data 01 matches the translated sentence 01 in the same hiragana notation as the original, and the correct English sentence “I am Tanaka.” Is taken as the translation result. The above is the case where the data in the translation memory 9 is matched by performing a single sentence editing process.

ステップＳ５０２の判定処理において、翻訳文０１とメモリデータ０１が完全一致しない場合（ステップＳ５０２のＮｏ）、翻訳メモリ検索部１０は、翻訳文０１とメモリデータ０１の原文中の“＠”の前にある文字列とを比較して完全一致するか否かを判定する（ステップＳ５０４）。 In the determination process of step S502, when the translation sentence 01 and the memory data 01 do not completely match (No in step S502), the translation memory search unit 10 precedes “@” in the original sentence of the translation sentence 01 and the memory data 01. A certain character string is compared to determine whether or not they completely match (step S504).

この比較判定の結果、翻訳文０１とメモリデータ０１の原文中の“＠”の前にある文字列とが完全一致しない場合（ステップＳ５０４のＮｏ）、翻訳メモリ検索部１０は、翻訳文０１と一致する文は無しと判定し、検索処理を終了する。 As a result of this comparison and determination, if the translated text 01 and the character string preceding “@” in the original text of the memory data 01 do not completely match (No in step S504), the translation memory search unit 10 determines that the translated text 01 and It is determined that there is no matching sentence, and the search process is terminated.

一方、翻訳文０１とメモリデータ０１の原文中の“＠”の前にある文字列とが完全に一致した場合（ステップＳ５０４のＹｅｓ）、翻訳メモリ検索部１０は、“＠”がメモリデータ０１の文末に存在するか否かを判定する（ステップＳ５０５）。 On the other hand, when the translated text 01 and the character string before “@” in the original text of the memory data 01 completely match (Yes in step S504), the translation memory search unit 10 indicates that “@” is the memory data 01. Is determined at the end of the sentence (step S505).

この判定の結果、“＠”がメモリデータ０１の文末に存在した場合（ステップＳ５０５のＹｅｓ）、翻訳メモリ検索部１０は、さらに一致しなかった翻訳文中の文字列（この部分を未一致部と呼び、一致している部分を一致部と呼ぶ）が、先頭部分に同じＩＤ番号が付いた＠記号を持った翻訳メモリ９のデータ（これをメモリデータ０２と呼ぶ）と一致するか否かを判定する（ステップＳ５０６）。 As a result of this determination, if “@” is present at the end of the sentence of the memory data 01 (Yes in step S505), the translation memory search unit 10 further determines a character string in the translated sentence that does not match (this part is regarded as an unmatched part). Whether or not the matching part is called the matching part) matches the data in the translation memory 9 (referred to as memory data 02) having the @ symbol with the same ID number at the head part. Determination is made (step S506).

この判定の結果、未一致部とメモリデータ０２とが完全一致した場合、翻訳メモリ検索部１０は、翻訳メモリ９に一致する文が有るものと判定し（ステップＳ５０６のＹｅｓ）、一致した文については、特殊記号“＠”を削除すると共にメモリデータ０１の後にメモリデータ０２を結合し、結合文（メモリデータ０１＋０２）を生成する（Ｓ５０７）。
このＳ５０７の結果になるケースは、翻訳文０１が図９の文９１の場合に相当する。 As a result of this determination, if the unmatched part and the memory data 02 are completely matched, the translation memory searching part 10 determines that there is a matching sentence in the translation memory 9 (Yes in step S506), and about the matching sentence Deletes the special symbol “@” and combines the memory data 02 after the memory data 01 to generate a combined statement (memory data 01 + 02) (S507).
The case resulting in S507 corresponds to the case where the translated sentence 01 is the sentence 91 in FIG.

翻訳メモリ検索部１０は、翻訳メモリ９を検索した場合、まず「私は田中ですが、」の部分がメモリデータの対原文８２「私は田中ですが、＠１＠」の”＠１＠”の前の部分と一致する（ステップＳ５０４）。なお＠１＠の数字の「１」は新たな文節や結合位置が検出されたときに、機械的に順に付加されている連続番号である。
次のステップＳ５０６の調査処理において、「彼は中田です。」の部分がメモリデータの対原文８３「＠１＠彼は中田です。」の＠１＠より後ろの文字列と一致する。
この調査結果に基づいて、Ｓ５０７の処理では、”I am Tanaka. He is Nakada.”という、逆接の接続詞の入らない英文の翻訳結果が生成される。 When the translation memory search unit 10 searches the translation memory 9, first, “I am Tanaka,” is the original text 82 of the memory data “I am Tanaka, @ 1 @” “@ 1 @” Matches the previous part (step S504). Note that the number “1” of @ 1 @ is a serial number that is mechanically added in order when a new phrase or coupling position is detected.
In the investigation processing in the next step S506, the part “He is Nakata” matches the character string after @ 1 @ of the memory data against the original text 83 “@ 1 @ He is Nakata.”
Based on the result of the investigation, in the process of S507, an English translation result “I am Tanaka. He is Nakada.” That does not include the reverse conjunctive conjunction is generated.

このように、翻訳メモリ９の内容を作成するときに、原文を分割して翻訳メモリ９に登録したときに、分割されたデータに、互いに連結していることを示す特殊記号（＠１＠等）が付加されているので、オリジナルと同じく分割されていない翻訳文にメモリデータがマッチする。以上は、文分割処理を行って登録した翻訳メモリデータがマッチするケースである。 Thus, when creating the contents of the translation memory 9, when the original text is divided and registered in the translation memory 9, special symbols (@ 1 @, etc.) indicating that the divided data are linked to each other. ) Is added, so that the memory data matches a translation that is not divided as in the original. The above is a case where translation memory data registered by performing sentence division processing matches.

ステップＳ５０６の判定処理で「未一致部」がメモリデータ０２と完全一致しない場合（ステップＳ５０６のＹｅｓ）、翻訳メモリ検索部１０は、予め設定された数式（関数式）を用いてあいまい一致の一致率を計算し、その計算結果の数値と予め設定されていた基準値とを比較して、計算結果の数値が基準値を超えているか否かを判定する（ステップＳ５０８）。
すなわち、類似度がある一定の値以上の場合（ステップＳ５０８のＹｅｓ）、翻訳メモリ検索部１０は、翻訳文０１の「一致部」とメモリデータ０１とがマッチ（一致）したものと見なし、翻訳結果にはメモリデータ０１の訳文を出力すると共に、「未一致部」についてはメモリデータとマッチしなかったものと判定する（ステップＳ５０９）。
類似度がある一定の値に満たない場合（ステップＳ５０８のＮｏ）、翻訳メモリ検索部１０は、「一致部」、「未一致部」共にメモリデータとマッチしなかったものと判定し、完全一致の検索処理を終了する。
これは、文の一部はマッチしていても、残りの部分の一致度が非常に低い場合は、文全体としての意味を考え直した方が良い場合があるからである。ただしユーザの意志によって、ステップＳ５０８の判定処理をスキップする設定に変更し、「一致部」はメモリデータとマッチしたものと判定して、メモリデータの訳文を表示するようなモードを導入するようにしてもよい。 When the “unmatched part” does not completely match the memory data 02 in the determination process of step S506 (Yes in step S506), the translation memory search unit 10 uses a preset mathematical expression (function formula) to match the fuzzy match The rate is calculated, and the numerical value of the calculation result is compared with a preset reference value to determine whether or not the numerical value of the calculation result exceeds the reference value (step S508).
That is, when the similarity is equal to or greater than a certain value (Yes in step S508), the translation memory search unit 10 regards that the “matching part” of the translation sentence 01 matches the memory data 01, and translates As a result, a translation of the memory data 01 is output, and it is determined that the “unmatched portion” does not match the memory data (step S509).
If the similarity is less than a certain value (No in step S508), the translation memory search unit 10 determines that both the “matching part” and the “unmatching part” do not match the memory data, and is a perfect match. The search process is terminated.
This is because it may be better to reconsider the meaning of the sentence as a whole when the sentence part matches but the remaining part has a very low degree of coincidence. However, according to the user's will, the setting is changed to skip the determination processing in step S508, and the “matching part” is determined to match the memory data, and a mode for displaying the translation of the memory data is introduced. May be.

ステップＳ５０９のような結果になるケースは、翻訳文が図９の文９２の場合に相当する。文９２の前半「私は田中ですが、」は、メモリデータの対原文８２”＠”前の部分「私は田中ですが、」と一致するが、後半部の「生まれは千葉です」がメモリデータの対原文８３と一致しているのは助詞「は」と助動詞「です」のみで、一致度は非常に低い（８語中３語）。このため、「一致部」は、「未一致部」とともにメモリデータと不一致と判定される。
ステップＳ５０５の判定処理において、“＠”がメモリデータの文末にない場合（ステップＳ５０５のＮｏ）、翻訳メモリ検索部１０は、ステップＳ５０４の処理での翻訳メモリ９の未一致部が、次の翻訳対象文（翻訳文０２と呼ぶ）と完全一致するか否かを判定する（ステップＳ５１０）。
未一致部と翻訳文０２が一致した場合（ステップＳ５１０のＹｅｓ）、翻訳メモリ検索部１０は、一致文ありと判定し、一致文を、翻訳文０１に対してはメモリデータ０１の”＠”前の部分、翻訳文０２に対してはメモリデータ０１の”＠”の後の部分とする結合処理を行い（ステップＳ５１１）、訳文を生成する。 The case where the result as in step S509 is obtained corresponds to the case where the translated sentence is the sentence 92 in FIG. The first half of sentence 92 “I am Tanaka,” is the same as “I am Tanaka, but” the previous part of memory data vs. original sentence 82 “@”, but the second half “Born in Chiba” is memory Only the particle “ha” and the auxiliary verb “is” coincide with the original text 83 of the data, and the degree of coincidence is very low (3 out of 8 words). For this reason, the “matching part” is determined not to match the memory data together with the “unmatching part”.
In the determination process of step S505, when “@” is not at the end of the memory data (No in step S505), the translation memory search unit 10 determines that the unmatched part of the translation memory 9 in the process of step S504 is the next translation. It is determined whether or not it completely matches the target sentence (referred to as translated sentence 02) (step S510).
If the unmatched part matches the translated sentence 02 (Yes in step S510), the translation memory search unit 10 determines that there is a matched sentence, and the matched sentence is “@” in the memory data 01 for the translated sentence 01. For the previous part and the translated sentence 02, a joining process is performed with the part after "@" in the memory data 01 (step S511), and a translated sentence is generated.

このステップＳ５１１のような結果になるケースは、翻訳文が図９の文９３および文９４の場合に相当する。 The case where the result as in step S511 is obtained corresponds to the case where the translated sentences are the sentence 93 and the sentence 94 in FIG.

すなわち、翻訳メモリ検索部１０が翻訳メモリ９を検索したところ、まず図９の文９３「この処理は」の部分がメモリデータの対原文８４の「この処理は＠２＠」の”＠２＠”の前の部分と一致する（ステップＳ５０４）。 That is, when the translation memory search unit 10 searches the translation memory 9, first, the sentence 93 “This process” in FIG. 9 is “@ 2 @” of “This process is @ 2 @” of the original text 84 of the memory data. Matches the part before "" (step S504).

次に、図９の文９４「以下のように行います。」の部分が”＠２＠”の後の部分と一致する（ステップＳ５１０）。ステップＳ５１１では、この検索結果に基づいて、”This process＠２＠ is performed as follows．”という訳文（翻訳結果）が生成される。 Next, the portion of the sentence 94 “do as follows” in FIG. 9 matches the portion after “@ 2 @” (step S510). In step S511, a translation (translation result) “This process @ 2 @ is performed as follows” is generated based on the search result.

このように、翻訳メモリ９のデータを作成したときに、原文を結合して翻訳メモリ９に登録しても、結合部に結合の履歴を示す特殊記号（＠２＠）が挿入されるので、オリジナルと同じく結合されていない翻訳文にメモリデータがマッチする。 As described above, when the data of the translation memory 9 is created, even if the original text is joined and registered in the translation memory 9, a special symbol (@ 2 @) indicating the joining history is inserted into the joining portion. As with the original, the memory data matches the translated text that is not combined.

また、原文の結合部と対応する結合部が訳文においても特殊記号（＠２＠）が挿入されているので、結合されていない原文の各部分に対して、対応する訳文を出力することができる。以上は、文結合処理を行って登録した翻訳メモリデータがマッチするケースである。 In addition, since a special symbol (@ 2 @) is inserted even in the translation part of the joint part corresponding to the joint part of the original sentence, the corresponding translation sentence can be output for each part of the original sentence that is not joined. . The above is a case where the translation memory data registered by performing sentence combination processing matches.

ステップＳ５１０の処理において、翻訳メモリ９の「未一致部」が翻訳文０２と完全一致しない場合（ステップＳ５１０のＮｏ）、翻訳メモリ検索部１０は、あいまい一致の一致率を計算し、ステップＳ５０８の判定処理と同様に、類似度がある一定の値を超えているか否かを判定する（ステップＳ５１２）。 In the process of step S510, when the “unmatched part” in the translation memory 9 does not completely match the translated sentence 02 (No in step S510), the translation memory search part 10 calculates the matching rate of fuzzy matching, and in step S508 Similar to the determination process, it is determined whether or not the similarity exceeds a certain value (step S512).

判定の結果、類似度がある一定の値以上の場合（ステップＳ５１２のＹｅｓ）、翻訳メモリ検索部１０は、翻訳メモリ９の「一致部」を翻訳文０１とマッチしたものと判定し、翻訳結果としてメモリデータの訳文の“＠”前の部分を出力し、「未一致部」についてはメモリデータとマッチしなかったものと判定する（ステップＳ５１３）。 As a result of the determination, if the similarity is equal to or higher than a certain value (Yes in step S512), the translation memory search unit 10 determines that the “matching part” in the translation memory 9 matches the translation sentence 01, and the translation result The portion before the “@” in the translation of the memory data is output, and it is determined that the “unmatched portion” did not match the memory data (step S513).

また、類似度がある一定の値に満たない場合（ステップＳ５１２のＮｏ）、翻訳メモリ検索部１０は、翻訳文０１、０２ともにメモリデータとマッチしなかったものと判定し、完全一致の検索処理を終了となる。なお、この他、ステップＳ５０８の基準値との比較判定処理の場合と同様に、ステップＳ５１２の比較判定を行わずに、「一致部」のメモリデータの訳文を表示するモードを設定してもよい。
ステップＳ５１３のような結果になるケースは、翻訳文が図９の文９５、文９６の場合に相当する。
すなわち、文９５は、翻訳メモリデータの対原文８４“＠”前の文字列と一致するが、文９６が“＠”後の文字列と一致しているのは動詞「行う」のみである上、活用形が異なり、一致度は非常に低い。このため、翻訳文０１は、翻訳文０２と共にメモリデータと不一致と判定される。 If the degree of similarity is less than a certain value (No in step S512), the translation memory search unit 10 determines that neither the translation sentence 01 or 02 matches the memory data, and a complete match search process. Ends. In addition, as in the case of the comparison determination process with the reference value in step S508, a mode for displaying the translation of the memory data of the “matching part” without performing the comparison determination in step S512 may be set. .
The case where the result as in step S513 is obtained corresponds to the case where the translated sentences are the sentence 95 and the sentence 96 in FIG.
That is, the sentence 95 matches the character string before “@” in the translation memory data 84, but the sentence 96 matches only the character string after “@” only in the verb “do”. The utilization is different and the degree of agreement is very low. Therefore, the translated sentence 01 is determined to be inconsistent with the memory data together with the translated sentence 02.

＜翻訳メモリ検索動作−類似文一致検索＞
図１０の完全一致検索で一致文なしと判定された翻訳対象文に対して、図１１の類似文検索を行う。以下では、図１１のフローチャートを参照して類似文検索処理について説明する。文分割処理、文結合処理が行われたメモリデータが類似文としてヒットする場合については、図９の具体例を用いて説明する。
翻訳メモリ検索部１０は、図１０に示した完全一致検索処理で翻訳対象文と一致する文が存在しないものと判定すると、類似文検索処理を行う。 <Translation Memory Search Operation-Similar Sentence Match Search>
The similar sentence search in FIG. 11 is performed on the translation target sentence determined as having no matching sentence in the complete match search in FIG. Hereinafter, the similar sentence search process will be described with reference to the flowchart of FIG. A case where the memory data subjected to sentence division processing and sentence combination processing hit as a similar sentence will be described with reference to a specific example of FIG.
If the translation memory search unit 10 determines in the complete match search process shown in FIG. 10 that there is no sentence matching the translation target sentence, the translation memory search unit 10 performs a similar sentence search process.

この場合、図１１に示すように、一致文なしの翻訳対象文が翻訳メモリ検索部１０に入力されると（ステップＳ６０１）、翻訳メモリ検索部１０は、まず、翻訳メモリ９を検索し、翻訳メモリ９に類似文はあるか否かを判定する（ステップＳ６０２）。 In this case, as shown in FIG. 11, when a translation target sentence without a matching sentence is input to the translation memory search unit 10 (step S601), the translation memory search unit 10 first searches the translation memory 9 to translate It is determined whether there is a similar sentence in the memory 9 (step S602).

ここでは、翻訳対象文とメモリデータ０１との類似度が一定以上の値か否かで類似文の有無を判定する。
類似文検索での一致度は、日本語の場合、文字列の中で一致した文字数がどのくらいあるかといった文字数割合を計算で求める。また、英語の場合は文字列の中で一致した単語数がどのくらいあるかの単語数割合を計算で求める。 Here, the presence or absence of a similar sentence is determined based on whether or not the similarity between the translation target sentence and the memory data 01 is a certain value or more.
In the case of Japanese, the degree of coincidence in the similar sentence search is obtained by calculating the ratio of the number of characters such as how many characters are matched in the character string. In the case of English, the ratio of the number of words indicating how many words are matched in the character string is obtained by calculation.

活用形が異なる単語は、一定の係数をかけた上で「一致」と判定する。計算の結果として得られた類似度が、ユーザが予め設定しておいた値以上の場合（ステップＳ６０２のＹｅｓ）、翻訳メモリ検索部１０は、翻訳メモリ９に類似文が存在したものと判定し、類似文をメモリデータ０１とする（ステップＳ６０３）。 Words with different utilization forms are determined to be “match” after being multiplied by a certain coefficient. If the similarity obtained as a result of the calculation is greater than or equal to the value set in advance by the user (Yes in step S602), the translation memory search unit 10 determines that a similar sentence exists in the translation memory 9. The similar sentence is set as memory data 01 (step S603).

また、類似度が、ユーザが予め設定しておいた値未満の場合（ステップＳ６０２のＮｏ）、翻訳メモリ検索部１０は、翻訳メモリ９に類似文が存在しないものと判定し、次に翻訳対象文とメモリデータ０１の原文中の“＠”の前にある文字列との類似度を判定する（ステップＳ６０４）。
翻訳対象文と“＠”の前にある文字列との類似度を判定した結果でも、類似度が一定の値に満たない場合（ステップＳ６０４のＮｏ）、翻訳メモリ検索部１０は、類似文なしと判定し、類似文検索処理を終了する。 If the similarity is less than the value preset by the user (No in step S602), the translation memory search unit 10 determines that there is no similar sentence in the translation memory 9, and then translates The degree of similarity between the sentence and the character string preceding “@” in the original text of the memory data 01 is determined (step S604).
Even when the similarity between the sentence to be translated and the character string preceding “@” is determined, the similarity is less than a certain value (No in step S604), the translation memory search unit 10 has no similar sentence. And the similar sentence search process is terminated.

一方、類似文ありと判定した場合（ステップＳ６０４のＹｅｓ）、翻訳メモリ検索部１０は、“＠”がメモリデータ０１の文末にあるか否かを判定する（ステップＳＳ６０５）。 On the other hand, if it is determined that there is a similar sentence (Yes in step S604), the translation memory search unit 10 determines whether “@” is at the end of the sentence of the memory data 01 (step SS605).

この判定の結果、“＠”がメモリデータ０１の文末にある場合（ステップＳ６０５のＹｅｓ）、翻訳メモリ検索部１０は、さらにステップＳ６０４の比較判定処理で一致しなかった翻訳文中の未一致部と、メモリデータ０２との類似度を判定する（ステップＳ６０６）。 As a result of this determination, if “@” is at the end of the sentence of the memory data 01 (Yes in step S605), the translation memory search unit 10 further matches the unmatched part in the translated sentence that did not match in the comparison determination process in step S604. The degree of similarity with the memory data 02 is determined (step S606).

この判定の結果、類似度がある一定以上の値に満たない場合（ステップＳ６０６のＮｏ）、翻訳メモリ検索部１０は、類似文なしと判定し、処理を終了する。
また、類似度がある一定の値以上の場合（ステップＳ６０６のＹｅｓ）、翻訳メモリ検索部１０は、類似文ありと判定し、検索された類似文の特殊記号＠数字＠を削除し、メモリデータ０１の後にメモリデータ０２を結合した文を生成する（ステップＳ６０７）。 As a result of this determination, if the similarity is less than a certain value (No in step S606), the translation memory search unit 10 determines that there is no similar sentence, and ends the process.
If the degree of similarity is greater than or equal to a certain value (Yes in step S606), the translation memory search unit 10 determines that there is a similar sentence, deletes the special symbol @ number @ in the searched similar sentence, and stores the memory data. A statement is generated by combining the memory data 02 after 01 (step S607).

ステップＳ６０７の結果となるケースは、翻訳文が図９の文９７の場合に相当する。
つまり、図９の文９７の「わたしは田中ですが、」および翻訳メモリ９の対原文８２「私は田中ですが、」と、文９８の「かれは中田です。」および翻訳メモリ９の対原文８２「彼は中田です。」とは、「私」および「彼」の表記が異なる以外は一致している。 The case resulting in step S607 corresponds to the case where the translated sentence is the sentence 97 in FIG.
That is, sentence 97 in FIG. 9 “I am Tanaka,” and translation memory 9 vs. original sentence 82 “I am Tanaka,” and sentence 98 “Hare is Nakata.” The original text 82 “He is Nakata” is identical except that “I” and “he” are different.

ステップＳ６０５の比較判定処理において、“＠”がメモリデータ０１の文末にない場合（ステップＳ６０５のＮｏ）、翻訳メモリ検索部１０は、さらにステップＳ６０４の比較判定処理での翻訳メモリ９の未一致部と、翻訳文０２との類似度を判定する（ステップＳ６０８）。 In the comparison determination process in step S605, when “@” is not at the end of the sentence of the memory data 01 (No in step S605), the translation memory search unit 10 further performs an unmatched part of the translation memory 9 in the comparison determination process in step S604. And the degree of similarity with the translated sentence 02 is determined (step S608).

類似度がある一定以上の値に満たない場合（ステップＳ６０８のＮｏ）、翻訳メモリ検索部１０は、類似文なしと判定して処理を終了する。
一方、類似度がある一定の値以上の場合（ステップＳ６０８のＹｅｓ）、翻訳メモリ検索部１０は、類似文ありと判定し、類似文は、翻訳文０１に対してはメモリデータ０１の“＠”前の部分、翻訳文０２に対してはメモリデータ０１の“＠”の後の部分とした文を生成する（ステップＳ６０９）。
ステップＳ６０９の結果になるケースは、翻訳文が図９の文９８および文９９の場合に相当する。 If the similarity is less than a certain value (No in step S608), the translation memory search unit 10 determines that there is no similar sentence and ends the process.
On the other hand, if the degree of similarity is equal to or greater than a certain value (Yes in step S608), the translation memory search unit 10 determines that there is a similar sentence, and the similar sentence is “@” in the memory data 01 for the translated sentence 01. For the "previous part, translated sentence 02", a sentence with the part after "@" in the memory data 01 is generated (step S609).
The case resulting from step S609 corresponds to the case where the translated sentences are the sentence 98 and the sentence 99 in FIG.

つまり図９の文９８の「あの処理は」は、メモリデータ対原文８４の“＠”前部分と指示語が異なるだけであり（「この」と「あの」）、文９９はメモリデータの対原文８４の“＠”後部分と完全一致している。 That is, “that process” in the sentence 98 of FIG. 9 is different from the memory data vs. the previous part of “@” in the original sentence 84 (“this” and “that”), and the sentence 99 is a pair of memory data. This completely matches the part after the “@” in the original text 84.

このようにこの実施形態の翻訳支援装置によれば、翻訳メモリ９に原文とその翻訳結果を登録する際、原文に対して編集が行われていた場合、編集前の原文の状態を再現できるよう、原文を編集した文字例の位置に特殊記号（＠数字＠）を付加した文書（第３の文書）を生成し、それを第１の文書の編集履歴として編集履歴記憶部１５に記憶する。
また、生成した第３の文書を再翻訳した翻訳結果の文書（第４の文書）を第３の文書と対応付けて翻訳メモリ９に保存（登録）するので、機械翻訳の精度を向上できると共に、編集履歴記憶部１５を参照することで機械翻訳が翻訳し易くなる。
さらに、第３の文書に付加されている特殊記号を付加時の規則で削除することで原文の状態に戻せるので、原文自体に対して編集を気軽に行うことができる。さらに原文に対して文分割、文結合、用語の置換、情報の追加などの編集を気軽に行うことができる。 As described above, according to the translation support device of this embodiment, when the original text and the translation result are registered in the translation memory 9, if the original text is edited, the state of the original text before editing can be reproduced. Then, a document (third document) in which a special symbol (@ number @) is added to the position of the character example obtained by editing the original text is generated and stored in the editing history storage unit 15 as the editing history of the first document.
In addition, since the translation result document (fourth document) obtained by retranslating the generated third document is stored (registered) in the translation memory 9 in association with the third document, the accuracy of machine translation can be improved. Referring to the editing history storage unit 15 makes machine translation easier to translate.
Furthermore, the special text added to the third document can be restored to the original text state by deleting it according to the rules at the time of addition, so that the original text itself can be easily edited. Furthermore, editing such as sentence division, sentence combination, term replacement, and information addition can be easily performed on the original sentence.

この結果、機械翻訳の能力を十分活用し、翻訳作業の効率を高めることができる。 As a result, the ability of machine translation can be fully utilized to improve the efficiency of translation work.

本発明に係る一実施形態の翻訳支援装置の全体構成を示すブロック図。1 is a block diagram showing the overall configuration of a translation support apparatus according to an embodiment of the present invention. 図１の翻訳支援装置の処理全体の流れを表すフローチャート。The flowchart showing the flow of the whole process of the translation assistance apparatus of FIG. 図１の翻訳支援装置の翻訳処理の流れを表すフローチャート。The flowchart showing the flow of the translation process of the translation assistance apparatus of FIG. 図１の翻訳支援装置の編集処理の流れを表すフローチャート。The flowchart showing the flow of the edit process of the translation assistance apparatus of FIG. 編集処理前の原文・訳文および編集履歴記憶部の内容を示す図。The figure which shows the content of the original sentence and translation before an edit process, and an edit history memory | storage part. 原文編集後の原文、再翻訳結果および編集履歴記憶部の内容を示す図。The figure which shows the content of the original text after an original text edit, the retranslation result, and an edit history memory | storage part. 訳文編集後の原文、再翻訳結果および編集履歴記憶部の内容を示す図。The figure which shows the content of the original sentence after translation edit, a retranslation result, and an edit history memory | storage part. 翻訳メモリの登録内容を示す図。The figure which shows the registration content of a translation memory. 新たな文書での翻訳対象文。Sentence to be translated in a new document. 図１の翻訳支援装置において、完全一致検索の際の翻訳メモリ検索処理を示すフローチャート。3 is a flowchart showing translation memory search processing at the time of an exact match search in the translation support apparatus of FIG. 1. 図１の翻訳支援装置において、類似文検索の際の翻訳メモリ検索処理を示すフローチャート。The flowchart which shows the translation memory search process in the case of a similar sentence search in the translation assistance apparatus of FIG.

Explanation of symbols

１…文分割処理部、２…翻訳処理部、３…原文・訳文編集部、４…文書出力部、５…原文文書入力部、６…原文文書自動分割部、７…文分割規則テーブル、８…翻訳処理制御部、９…翻訳メモリ、１０…翻訳メモリ検索部、１１…機械翻訳処理部、１２…翻訳用辞書、１３…形態素解析、構文解析、意味解析、言語変換処理部、１４…訳文生成部、１５…編集履歴記憶部。 DESCRIPTION OF SYMBOLS 1 ... Sentence division process part, 2 ... Translation process part, 3 ... Original sentence / translation edit part, 4 ... Document output part, 5 ... Original text document input part, 6 ... Original text automatic division part, 7 ... Sentence division rule table, 8 ... translation processing control unit, 9 ... translation memory, 10 ... translation memory search unit, 11 ... machine translation processing unit, 12 ... translation dictionary, 13 ... morphological analysis, syntax analysis, semantic analysis, language conversion processing unit, 14 ... translation Generation unit, 15... Editing history storage unit.

Claims

In a translation support program for translating an original sentence in one language into another language by a computer having a translation dictionary, a translation memory for storing a translation result, an edit history storage section for storing an edit history for the translated sentence and the original sentence ,
The computer,
The original text, the translation means for generating a translation by the machine translation on the basis of the translation dictionary,
Means for storing in said translation memories in association with the translation and the original text by said translation means,
If the string in at least one sentence of said translation memory stored original and the translated sentence has been edited, in that position, by adding a special symbol indicating the edit portion on a string of the original Editing means for generating original text with special symbols,
A translation support program for causing an original text with a special symbol generated by the editing means to function as means for storing the original text as an editing history in the editing history storage unit.

The translation support program according to claim 1,
The computer,
Re-translation means for re-translating the original sentence with the special symbol according to the special symbol to generate a new translation sentence;
A translation support program that functions as means for storing a new translated sentence generated by the retranslating means in a translation result document in association with the original sentence with the special symbol in the translation memory.

The translation support program according to claim 1,
The special symbol is composed of a number between @ and @, and the number is an ID number of a special symbol for identifying that the special symbol is inserted at this position. Support program.

In the translation support program according to claim 3,
A translation support program characterized by showing that originals with the same ID number among originals with special symbols are connected before editing.

In the translation support program according to claim 3 or 4,
The translation support program according to claim 1, wherein the number is given by a serial number by the editing means when a sentence clause or sentence combination is detected.

The original of a language, a translation means for generating a translation of the other language by machine translation based on the translation dictionary stored in advance,
A translation memory to store the translation results by the translation means in association with the translation and the original text,
An edit history storage unit for storing an edit history of editing at least one of a translated sentence and an original sentence translated by the translation unit;
If the string in at least one sentence of said original and said translation is edited, in that position, textual with special symbols by adding special symbol indicating the edit portion on a string of the original Editing means for generating
A translation support apparatus, comprising: an original history with a special symbol generated by the editing means; and means for storing the original history as an editing history of the original text in the editing history storage unit.

The translation support apparatus according to claim 6,
Re-translation means for re-translating the original sentence with the special symbol according to the special symbol to generate a new translation sentence;
A translation support apparatus, comprising: means for storing a new translated sentence generated by the retranslating means in the translation memory in association with the original sentence with the special symbol as a translation result document.

In a translation support method for translating an original text in one language into another language by a computer having a translation means, a translation dictionary, a translation memory, an editing means, and an editing history storage unit ,
It said translation means, and away step to generate a translation in other languages by machine translation based textual one language, the year translation dictionary stored in advance,
The translation means associates the original sentence with the translated sentence and stores them in a translation memory;
If the string in at least one sentence of said original and said translation is edited, in that position, a special symbol indicating an editing portion for strings of the original special added said editing means Generating a text with a symbol;
A translation support method comprising: a step of storing the generated original text with a special symbol in an editing history storage unit as an editing history of the original text.

9. The translation support method according to claim 8, wherein said computer further comprises re-translation means,
Re-translation means re-translates the original sentence with the special symbol according to the special symbol, and generates a new translation sentence;
A translation support method, comprising: a step of storing, in a translation memory, a new translated sentence generated by re-translating the editing means in association with the original sentence with the special symbol as a translation result document.