JP4248549B2

JP4248549B2 - Machine translation editing apparatus, machine translation editing program, and machine translation editing method

Info

Publication number: JP4248549B2
Application number: JP2006027209A
Authority: JP
Inventors: 貴志澁谷; 遠航蔡; 正樹新藤
Original assignee: Toshiba Corp; Toshiba Solutions Corp
Current assignee: Toshiba Corp; Toshiba Digital Solutions Corp
Priority date: 2006-02-03
Filing date: 2006-02-03
Publication date: 2009-04-02
Anticipated expiration: 2026-02-03
Also published as: JP2007207098A

Description

本発明は、マークアップ記号を含む第１言語のテキスト原文を第２の言語に翻訳して訳文を生成する機械翻訳編集装置に係り、特に訳文に対する編集に好適な機械翻訳装置、機械翻訳編集プログラム及び機械翻訳編集方法に関する。 The present invention relates to a machine translation editing apparatus that generates a translation by translating a text original text in a first language including a markup symbol into a second language, and in particular, a machine translation apparatus and a machine translation editing program suitable for editing a translation. And a machine translation editing method.

一般に機械翻訳装置は、翻訳モジュールを有している。翻訳モジュールは、入力されたソース言語の文書（テキスト）を、基本辞書及びユーザ辞書並びに文法規則を用いて解析し、構造解析木を生成する。翻訳モジュールはまた、木構造変換規則を利用して、上記ソース言語のテキストの構造解析木をターゲット言語の構造解析木に変換した上、各単語に適切な訳語を割り当て、さらに必要な部分を補ってターゲット言語のテキストを生成する。 In general, a machine translation apparatus has a translation module. The translation module analyzes the input source language document (text) using the basic dictionary, the user dictionary, and the grammar rules to generate a structural analysis tree. The translation module also uses the tree structure conversion rules to convert the structure parse tree of the source language text to the structure parse tree of the target language, assigns an appropriate translation to each word, and supplements the necessary parts. To generate text in the target language.

一方、最近では、マークアップ言語により記述されたソース言語の文書が翻訳の対象となることが増えてきている。この種のソース言語の文書では、当該文書の本文中にマークアップ記号を含ませておくことにより、そのマークアップ記号に従った体裁で表示などを行うことができる。マークアップ言語としては、ＷＷＷの普及により急速に広まったＨＴＭＬ（HyperText Markup Language）やＳＧＭＬ（Standard Generalized Markup Language）、ＸＭＬ（Extensible Markup Language）などが良く知られている。 On the other hand, recently, source language documents described in a markup language are increasingly subject to translation. In this type of source language document, by including a markup symbol in the text of the document, it is possible to display it in a format according to the markup symbol. Well-known markup languages include HTML (HyperText Markup Language), SGML (Standard Generalized Markup Language), and XML (Extensible Markup Language), which have spread rapidly with the spread of the WWW.

例えばＨＴＭＬ文書に含まれるマークアップ記号は、レイアウトや特定の文字の表示方法、更にはリンク機能などを提供する。このリンク機能によれば、マークアップ記号によって囲まれた部分をユーザがクリックすることで、リンク先の文書を呼び出すことができる。ソース言語中に含まれるマークアップ言語を翻訳した場合、当該マークアップ言語に含まれているマークアップ記号を訳文の適切な位置に反映することが求められる。 For example, a markup symbol included in an HTML document provides a layout, a specific character display method, a link function, and the like. According to this link function, a user can call a linked document by clicking a portion surrounded by markup symbols. When the markup language included in the source language is translated, it is required to reflect the markup symbol included in the markup language at an appropriate position in the translated sentence.

マークアップ記号を訳文に反映する機械翻訳技術は、例えば特許文献１に記載されている。この特許文献１に記載された機械翻訳技術（以下、先行技術と称する）によれば、次のような手順で機械翻訳が実行される。 A machine translation technique for reflecting a markup symbol in a translation is described in, for example, Patent Document 1. According to the machine translation technique described in Patent Document 1 (hereinafter referred to as the prior art), machine translation is executed in the following procedure.

まず、テキスト原文からマークアップ記号の開始タグが検出される。すると、このマークアップ記号が付されている単語の位置と当該マークアップ記号の開始タグと終了タグとによって囲まれる単語の数とが当該マークアップ記号に関連付けてバッファに格納される。これにより、テキスト本体が自動的にマークアップ記号と分離され、その分離されたテキスト本体に基づいて第１の言語の構造解析木が作成される。第１の言語の構造解析木は第２の言語の構造解析木に変換される。そして第２の言語の構造解析木から第２の言語のテキストが生成される。 First, a start tag of a markup symbol is detected from the original text. Then, the position of the word to which the markup symbol is attached and the number of words surrounded by the start tag and end tag of the markup symbol are stored in the buffer in association with the markup symbol. As a result, the text body is automatically separated from the markup symbol, and a structure analysis tree of the first language is created based on the separated text body. The structural analysis tree of the first language is converted into the structural analysis tree of the second language. Then, the text of the second language is generated from the structural analysis tree of the second language.

ここで、テキスト原文に含まれている各マークアップ記号は、そのマークアップ記号が付された単語の位置と、当該マークアップ記号の開始タグと終了タグとによって囲まれる単語の数と関連付けてバッファに記憶されている。そこで上記先行技術では、第２の言語のテキストが生成されるときに、テキスト原文においてマークアップ記号の開始タグと終了タグとによって囲まれていた単語に対応する訳語に対して、当該マークアップ記号と等価なマークアップ記号の開始タグ及び終了タグが適切な位置に付加される。
特許第３２２０５６０号公報（段落００１１及び００１２） Here, each markup symbol included in the original text is buffered in association with the position of the word with the markup symbol and the number of words surrounded by the start tag and end tag of the markup symbol. Is remembered. Therefore, in the above prior art, when the text in the second language is generated, the markup symbol corresponding to the translation word corresponding to the word surrounded by the markup symbol start tag and end tag in the original text is generated. The start and end tags of markup symbols equivalent to are added at the appropriate positions.
Japanese Patent No. 3220560 (paragraphs 0011 and 0012)

現在の機械翻訳技術では、複数の解釈が可能な文の翻訳などの場合、完全な翻訳は不可能である。このような場合、翻訳結果に対する後編集作業が必要となる。特に、翻訳結果において、語順が変化したり、１語が複数の単語から構成されたりする場合などには、マークアップ記号の位置によっては、当該マークアップ記号と等価なマークアップ記号を適切な位置に付加することが困難となる可能性がある。このような場合にも、翻訳結果に対する後編集作業が必要となる。 With the current machine translation technology, complete translation is impossible when translating sentences that can be interpreted in multiple ways. In such a case, post-editing work on the translation result is required. In particular, when the word order changes in the translation result, or when one word is composed of a plurality of words, depending on the position of the markup symbol, the markup symbol equivalent to the markup symbol is appropriately positioned. May be difficult to add to. Even in such a case, post-editing work on the translation result is required.

マークアップ記号が反映された翻訳結果のファイル（訳文ファイル）を編集するには、テキストエディタやＨＴＭＬエディタなどを用いるのが一般的である。ところが、例えばテキストエディタを用いて編集作業を行う場合、編集文字列中にマークアップ記号が混在するため、編集対象部分を特定するのもユーザにとって困難となる。 In order to edit a translation result file (translation file) reflecting a markup symbol, it is common to use a text editor or an HTML editor. However, for example, when an editing operation is performed using a text editor, it is difficult for the user to specify the editing target portion because markup symbols are mixed in the edited character string.

マークアップ記号を適切な位置に反映させるための編集作業では、ユーザは当該マークアップ記号全体を選択して、カット＆ペーストを行う必要がある。このような編集作業は、マークアップ記号の一部が誤って削除されるなどの危険性を秘めている。 In an editing operation for reflecting the markup symbol at an appropriate position, the user needs to select the entire markup symbol and perform cut and paste. Such an editing operation has a risk that a part of the markup symbol is accidentally deleted.

これに対し、ＨＴＭＬエディタを使用すればテキストエディタで編集する場合の上述の危険性は少なくなる。しかし、ＨＴＭＬエディタを使用してユーザが訳文を編集するには、多くの場合対応する原文を参照したり、辞書を参照したりする必要がある。ここでは、同時に原文と訳文の２つのファイルが開かれるのが一般的である。この状態でユーザは、編集対象の訳文に対応する原文を参照するために、画面を切り替えてスクロールさせたり、辞書を引くために別途起動した辞書参照ツールに画面を切り替えるなど、煩雑な操作を強いられる。 On the other hand, if the HTML editor is used, the above-mentioned danger when editing with a text editor is reduced. However, in order for a user to edit a translation using an HTML editor, it is often necessary to refer to a corresponding original sentence or a dictionary. Here, it is common that two files of the original text and the translated text are opened simultaneously. In this state, the user is forced to perform complicated operations such as switching the screen and scrolling to refer to the original text corresponding to the translation to be edited, or switching the screen to a dictionary reference tool activated separately to draw a dictionary. It is done.

本発明は上記事情を考慮してなされたものでその目的は、テキスト原文に含まれるマークアップ記号を文中で使用されるマークアップ記号と文外で使用されるマークアップ記号とに区分して、訳文内に反映されるマークアップ記号を文内マークアップ記号に制限することにより、マークアップ記号とテキスト部分とが混在する文書データの編集が効率的に行える機械翻訳編集装置、機械翻訳編集プログラム及び機械翻訳編集方法を提供することにある。 The present invention has been made in consideration of the above circumstances, and its purpose is to divide markup symbols included in a text original into markup symbols used in a sentence and markup symbols used outside a sentence, A machine translation editing device, a machine translation editing program, and a machine translation editing program capable of efficiently editing document data in which markup symbols and text parts are mixed by limiting markup symbols reflected in the translation to markup symbols in the sentence, and It is to provide a machine translation editing method.

本発明の１つの観点によれば、マークアップ記号を含む第１言語のテキスト原文を第２の言語に翻訳して訳文を生成する際に、前記テキスト原文においてマークアップ記号が付加されている単語に対応する訳語に、当該単語に付加されていたマークアップ記号を付加する機械翻訳編集装置が提供される。この機械翻訳編集装置は、記憶手段と、前記テキスト原文をマークアップ記号としての単語とマークアップ記号を含まないテキスト部分としての単語とに分離する分離手段と、前記分離手段によって分離された単語がマークアップ記号の場合、当該マークアップ記号が文中で使用される文中マークアップ記号であるか或いは文外で使用される文外マークアップ記号であるかのマークアップ記号種別を判定するマークアップ記号種別判定手段と、前記分離手段によって分離された単語の各々を、当該単語がマークアップ記号であるか或いはテキスト部分であるかを示すと共に、マークアップ記号の場合には前記マークアップ記号種別判定手段によって判定されたマークアップ記号種別を示す種別情報と関連付けて前記記憶手段に順次格納する関連付け手段と、前記テキスト原文の訳文の生成時に、当該訳文内に反映されるマークアップ記号を文内マークアップ記号に制限し、前記記憶手段に格納されている単語の群のうち前記文外マークアップ記号であることを示す種別情報と関連付けられた単語を除く単語の群を含む原文単語リスト、及び前記記憶手段に格納されている単語の群のうち前記テキスト部分であることを示す種別情報と関連付けられた全ての単語の訳語と当該訳語のうち前記文内マークアップ記号であることを示す種別情報と関連付けられたマークアップ記号が付加された単語の訳語に付加されるマークアップ記号とを含む訳文単語リストを生成して前記記憶手段に格納する原文／訳文単語リスト生成手段と、前記原文単語リスト及び前記訳文単語リストに基づいてそれぞれ原文及び訳文を編集可能に表示する表示制御手段とを具備する。 According to one aspect of the present invention, when a text source text in a first language including a markup symbol is translated into a second language to generate a translation, a word to which a markup symbol is added in the text source text A machine translation editing apparatus is provided that adds the markup symbol added to the word to the translated word corresponding to. The machine translation editing apparatus includes: a storage unit; a separation unit that separates the text source into a word as a markup symbol and a word as a text part that does not include the markup symbol; and the word separated by the separation unit In the case of a markup symbol, a markup symbol type for determining whether the markup symbol is an in-sentence markup symbol used in a sentence or an out-of-sentence markup symbol used outside a sentence Each of the words separated by the judging means and the separating means indicates whether the word is a markup symbol or a text part, and in the case of a markup symbol, the markup symbol type judging means A function for sequentially storing in the storage means in association with type information indicating the determined markup symbol type. A markup symbol that is reflected in the translation at the time of generation of the translation of the original text, and the out-of-sentence mark in the group of words stored in the storage means A source word list including a group of words excluding a word associated with the type information indicating that it is an up symbol, and type information indicating the text portion of the group of words stored in the storage unit; A translation of all the associated words, a type information indicating that the translation is a markup symbol in the sentence, and a markup symbol added to the translation of the word to which the associated markup symbol is added Source / translation word list generation means for generating a translation word list and storing it in the storage means; and based on the source word list and the translation word list Each and a display control means for editable displaying the original text and translation.

本発明によれば、テキスト原文に含まれるマークアップ記号を文中で使用されるマークアップ記号と文外で使用されるマークアップ記号とに区分して、マークアップ記号が付加される訳語を、文中で使用されるマークアップ記号が付加される単語の訳語に制限することにより、マークアップ記号とテキスト部分とが混在する文書データを編集する場合に、ユーザは文外で使用されるマークアップ記号に影響されずに、且つ翻訳されるべき単語または訳語と関連する文中で使用されるマークアップ記号を視認しながら編集できるため、効率的な編集が可能となる。 According to the present invention, a markup symbol included in a text original is divided into a markup symbol used in the sentence and a markup symbol used outside the sentence, and the translated word to which the markup symbol is added is When editing document data that includes both markup symbols and text parts, the user can select markup symbols used outside the sentence. Since the markup symbol used in the sentence related to the word to be translated or the translation to be translated can be edited while being visually recognized, efficient editing is possible.

以下、本発明の実施の形態につき図面を参照して説明する。
図１は本発明の一実施形態に係る機械翻訳編集装置のハードウェア構成を示すブロック図である。図１において、コンピュータ、例えばパーソナルコンピュータ（ＰＣ）１１は、ハードディスクドライブのような外部記憶装置１２と接続されている。この外部記憶装置１２には、機械翻訳及び翻訳結果の編集等のための機械翻訳編集プログラム１２０が格納されている。パーソナルコンピュータ１１は主メモリ１１０を含む。コンピュータ１１は、外部記憶装置１２に格納されている機械翻訳編集プログラム１２０を主メモリ１１０に読み込んで実行することにより、機械翻訳編集装置に必要な機械翻訳及び編集の機能を実現する。なお、機械翻訳編集プログラム１２０がネットワークを介して主メモリ１１０に読み込まれる構成であっても構わない。 Embodiments of the present invention will be described below with reference to the drawings.
FIG. 1 is a block diagram showing a hardware configuration of a machine translation editing apparatus according to an embodiment of the present invention. In FIG. 1, a computer such as a personal computer (PC) 11 is connected to an external storage device 12 such as a hard disk drive. The external storage device 12 stores a machine translation editing program 120 for machine translation and translation result editing. The personal computer 11 includes a main memory 110. The computer 11 reads the machine translation editing program 120 stored in the external storage device 12 into the main memory 110 and executes it, thereby realizing machine translation and editing functions necessary for the machine translation editing apparatus. The machine translation editing program 120 may be read into the main memory 110 via a network.

図２は、図１の機械翻訳編集装置の機能構成を示すブロック図である。この機械翻訳編集装置は、機械翻訳編集部２０、入力部３１、表示部３２、翻訳辞書部３３及びバッファ部３４から構成される。機械翻訳編集部２０は、図１中のパーソナルコンピュータ１１が外部記憶装置１２に格納されている機械翻訳編集プログラム１２０を主メモリ１１０に読み込んで実行することにより実現される。機械翻訳編集部２０は、システム制御部２１、翻訳部２２及び編集部２３から構成される。 FIG. 2 is a block diagram showing a functional configuration of the machine translation editing apparatus of FIG. The machine translation editing apparatus includes a machine translation editing unit 20, an input unit 31, a display unit 32, a translation dictionary unit 33, and a buffer unit 34. The machine translation editing unit 20 is realized when the personal computer 11 in FIG. 1 reads the machine translation editing program 120 stored in the external storage device 12 into the main memory 110 and executes it. The machine translation editing unit 20 includes a system control unit 21, a translation unit 22, and an editing unit 23.

入力部３１及び表示部３２は、図１中のパーソナルコンピュータ１１に設けられている。入力部３１はキーボード（ＫＢ）３１１及びマウス３１２を含む。入力部３１は、翻訳の対象となる原文ファイルのデータ、つまり原文データ（第１の言語の原文データ）や各種の指示（コマンド）を、例えばインターネットなどの通信手段やキーボード３１１等を通じて入力する。また、原文データを図１の外部記憶装置１２から入力することも可能である。表示部３２は、入力部３１によって入力された原文、つまり翻訳されるべき第１言語の文章（原文）や、翻訳部２２によって訳出された第２言語の文章（訳文）等を表示する。 The input unit 31 and the display unit 32 are provided in the personal computer 11 in FIG. The input unit 31 includes a keyboard (KB) 311 and a mouse 312. The input unit 31 inputs data of an original text file to be translated, that is, original text data (original text data of the first language) and various instructions (commands) through a communication means such as the Internet, a keyboard 311 and the like. It is also possible to input the original text data from the external storage device 12 of FIG. The display unit 32 displays the original text input by the input unit 31, that is, the first language text to be translated (original text), the second language text translated by the translation section 22 (translated text), and the like.

翻訳辞書部３３は、図１中の外部記憶装置１２に確保されている。翻訳辞書部３３は、翻訳処理で使用される図示せぬ各種の辞書（辞書データ）を格納する。翻訳辞書部３３はまた、タグ種別テーブル３３０を格納する。 The translation dictionary unit 33 is secured in the external storage device 12 in FIG. The translation dictionary unit 33 stores various dictionaries (dictionary data) (not shown) used in the translation process. The translation dictionary unit 33 also stores a tag type table 330.

バッファ部３４は、図１中の主メモリ１１０内に確保される。バッファ部３４は、第１言語の原文データとその翻訳結果としての第２言語のデータとを対応付けて格納すると共に、第１言語及び第２言語を構成する各単語の属性を格納する記憶手段として用いられる。バッファ部３４は、原文バッファ３４１、単語リストバッファ３４２、文リストバッファ３４３、原文単語リストバッファ３４４及び訳文単語リストバッファ３４５を含む。 The buffer unit 34 is secured in the main memory 110 in FIG. The buffer unit 34 stores the first language original text data and the second language data as a translation result thereof in association with each other, and stores the attributes of the words constituting the first language and the second language. Used as The buffer unit 34 includes an original sentence buffer 341, a word list buffer 342, a sentence list buffer 343, an original sentence word list buffer 344, and a translated word list buffer 345.

システム制御部２１は、図２の機械翻訳編集装置全体の制御を司る。ここでは、システム制御部２１は、入力部３１によって入力された例えば翻訳されるべき原文データを翻訳部２２に送出する。またシステム制御部２１は、入力された指示を、当該指示を実行すべき翻訳部２２、編集部２３または表示部３２などに送出する。 The system control unit 21 controls the entire machine translation editing apparatus shown in FIG. Here, the system control unit 21 sends, for example, original text data to be translated input by the input unit 31 to the translation unit 22. Further, the system control unit 21 sends the input instruction to the translation unit 22, the editing unit 23, the display unit 32, or the like that should execute the instruction.

翻訳部２２は、システム制御部２１によって送出された第１言語による原文データ（文章データ）に対して、翻訳辞書部３３に格納されている辞書群を使用して翻訳処理を行う。編集部２３は入力部３１からシステム制御部２１を介して送られる編集のための指示に応じての編集処理を行う。編集のための指示は、例えばＢａｃｋＳｐａｃｅキーの押下に応じて入力部３１から送られる文字の削除命令（を表すキーコード）などによって与えられる。編集部２３は、カーソルが存在する文字位置を特定し、その文字位置と編集のための指示とに基づいて、該当する文字を編集する。また編集部２３は、選択されたテキストのコピー、切り取り、移動などの制御を行う。 The translation unit 22 performs translation processing on the original text data (sentence data) in the first language sent by the system control unit 21 using the dictionary group stored in the translation dictionary unit 33. The editing unit 23 performs editing processing according to an editing instruction sent from the input unit 31 via the system control unit 21. An instruction for editing is given by, for example, a character deletion command (representing a key code) sent from the input unit 31 in response to pressing of the BackSpace key. The editing unit 23 identifies the character position where the cursor exists, and edits the corresponding character based on the character position and an instruction for editing. The editing unit 23 performs control such as copying, cutting, and moving of the selected text.

図３は翻訳部２２の機能構成を示すブロック図である。翻訳部２２は、翻訳文生成部２２０、分離部２２１、タグ種別判定部２２２、単語リスト生成部２２３、文リスト生成部２２４、原文／訳文単語リスト生成部２２５及びタグ種別指定部２２６を含む。 FIG. 3 is a block diagram showing a functional configuration of the translation unit 22. The translation unit 22 includes a translated sentence generation unit 220, a separation unit 221, a tag type determination unit 222, a word list generation unit 223, a sentence list generation unit 224, an original / translated sentence word list generation unit 225, and a tag type designation unit 226.

翻訳文生成部２２０は、タグ（マークアップ記号）を含む第１言語のテキスト原文を第２の言語に翻訳して訳文を生成する。翻訳文生成部２２０は、上記訳文の生成時に、テキスト原文においてタグが付加されている単語に対応する訳語に、当該単語に付加されていたタグを付加する。本実施形態では、このように訳文に反映されるタグは、後述する文内タグに制限される。 The translated sentence generation unit 220 generates a translated sentence by translating the original text of the first language including the tag (markup symbol) into the second language. When the translated sentence is generated, the translated sentence generation unit 220 adds the tag added to the word to the translated word corresponding to the word to which the tag is added in the original text. In the present embodiment, the tag reflected in the translation in this way is limited to a tag in a sentence to be described later.

分離部２２１は、上記テキスト原文をタグとしての単語とタグを含まないテキスト部分としての単語とに分離する。タグ種別判定部２２２は、分離された単語がタグの場合、当該タグが文中で使用される文中タグであるか或いは文外で使用される文外タグであるかのタグ種別を判定する。 The separation unit 221 separates the text original sentence into a word as a tag and a word as a text part not including the tag. When the separated word is a tag, the tag type determination unit 222 determines the tag type of whether the tag is an in-sentence tag used in a sentence or an out-sentence tag used outside a sentence.

単語リスト生成部２２３は、分離された単語の各々を、当該単語がタグであるか或いはテキスト部分であるかのデータ種別を示すデータ種別情報及びタグの場合にはタグ種別を示すタグ種別情報と関連付けて単語リストバッファ３４２に格納する関連付け手段として機能する。文リスト生成部２２４は、バッファ部３４内の単語リストバッファ３４２の内容（単語リスト）に基づき文のリスト（文リスト）を生成する。この文リストに含まれる文は、文外タグとしての単語の列からなり、データ種別がタグとして扱われる文（タグ文）と、文外タグ以外の単語の列からなり、データ種別がテキストとして扱われる文（テキスト文）とに分類される。文外タグ以外の単語には、テキスト部分としての単語と、文内タグとしての単語とがある。本実施形態では、テキスト文だけが翻訳対象文として処理される。 The word list generation unit 223 includes, for each separated word, data type information indicating a data type indicating whether the word is a tag or a text part, and tag type information indicating a tag type in the case of a tag, It functions as an association means for associating and storing in the word list buffer 342. The sentence list generation unit 224 generates a list of sentences (sentence list) based on the contents (word list) of the word list buffer 342 in the buffer unit 34. Sentences included in this sentence list consist of a sequence of words as out-of-sentence tags, consisting of a sentence whose data type is treated as a tag (tag sentence) and a sequence of words other than out-of-sentence tags, and the data type as text It is classified as a sentence (text sentence) to be handled. The words other than the out-of-sentence tags include a word as a text part and a word as an in-sentence tag. In the present embodiment, only text sentences are processed as translation target sentences.

原文／訳文単語リスト生成部２２５は、テキスト原文の訳文の生成時に、当該訳文に反映されるタグの種別を文内タグに制限する。原文／訳文単語リスト生成部２２５は、原文単語リストを及び訳文単語リストを生成して、それぞれバッファ部３４内の原文単語リストバッファ３４４及び訳文単語リストバッファ３４５に格納する。原文単語リストは、文リストバッファ３４３に格納されている文のうち、テキストであることを示すデータ識別情報と関連付けられているテキスト文を構成する単語の群（つまり、文外タグであることを示すタグ種別情報と関連付けられた単語を除く単語の群）を含む。訳文単語リストは、上記テキスト文を構成する単語の群のうち、テキスト部分であることを示すデータ種別情報と関連付けられた全ての単語の訳語と、当該訳語のうち文外タグであることを示すタグ種別情報と関連付けられたタグが付加された単語の訳語に付加されるタグとを含む。システム制御部２１は、原文／訳文単語リスト生成部２２５によって生成された原文単語リスト及び訳文単語リストに基づいてそれぞれ原文及び訳文を表示部３２に表示してユーザの編集操作に供する表示制御手段として機能する。 The original / translated sentence word list generation unit 225 restricts the tag type reflected in the translated sentence to the intra-sentence tag when the translated sentence of the original text is generated. The original / translated word list generating unit 225 generates the original word list and the translated word list, and stores them in the original word list buffer 344 and the translated word list buffer 345 in the buffer unit 34, respectively. The original word list is a group of words constituting a text sentence (that is, an out-of-sentence tag) associated with data identification information indicating that it is text among sentences stored in the sentence list buffer 343. A group of words excluding words associated with the tag type information shown). The translated word list indicates the translated words of all the words associated with the data type information indicating the text portion in the group of words constituting the text sentence, and the out-of-sentence tag among the translated words. A tag added to the translation of the word to which the tag associated with the tag type information is added. The system control unit 21 is a display control unit that displays the original sentence and the translated sentence on the display unit 32 based on the original word list and the translated word list generated by the original / translated word list generation unit 225 and provides them to the user's editing operation. Function.

図４は編集部２３の機能構成を示すブロック図である。編集部２３は、編集処理を行う編集処理部２３０に加えて、タグ表示指定部２３１、検出部２３２、タグ判定部２３３及び禁止部２３４を含む。 FIG. 4 is a block diagram showing a functional configuration of the editing unit 23. The editing unit 23 includes a tag display designation unit 231, a detection unit 232, a tag determination unit 233, and a prohibition unit 234 in addition to the editing processing unit 230 that performs editing processing.

タグ表示指定部２３１は、原文単語リスト及び前記訳文単語リストに含まれているマークアップ記号を表示するか否かをユーザに指定させる。検出部２３２は、原文及び訳文が表示されている状態で、当該原文または訳文中の任意の文字がユーザの操作によって選択されたことを検出する。タグ判定部２３３は、検出された文字を含む単語がタグであるかを判定する。禁止部２３４は、検出された文字を含む単語がタグである場合、当該タグ内の文字に対するユーザによる編集操作を無視する。つまりタグ内の文字が編集されるのを禁止する。 The tag display designation unit 231 allows the user to designate whether or not to display markup symbols included in the original word list and the translated word list. The detection unit 232 detects that any character in the original sentence or the translated sentence is selected by a user operation while the original sentence and the translated sentence are displayed. The tag determination unit 233 determines whether a word including the detected character is a tag. When the word including the detected character is a tag, the prohibition unit 234 ignores the editing operation performed by the user on the character in the tag. In other words, editing the characters in the tag is prohibited.

次に本実施形態における翻訳処理について、英語（第１言語）の原文を、日本語（第２言語）の訳文に翻訳する場合を例に、図５Ａ及び図５Ｂ並びに図６を参照して説明する。図５Ａ及び図５Ｂは翻訳処理の手順を示すフローチャート、図６は原文及び訳文を表示するための表示処理の手順を示すフローチャートである。 Next, the translation processing according to the present embodiment will be described with reference to FIGS. 5A, 5B, and 6, taking as an example the case of translating an original text in English (first language) into a translated text in Japanese (second language). To do. 5A and 5B are flowcharts showing the procedure of the translation process, and FIG. 6 is a flowchart showing the procedure of the display process for displaying the original sentence and the translated sentence.

まず、入力部３１によりＨＴＭＬで記述された英語の原文データに対する翻訳の指示が入力されたものとする。この指示は入力部３１からシステム制御部２１に渡される。また、この指示に従って翻訳されるべき英語の原文データが入力部３１によって入力される。この英語の原文データの一例を図７に示す。 First, it is assumed that an instruction for translation of original English text data described in HTML is input by the input unit 31. This instruction is passed from the input unit 31 to the system control unit 21. Further, the original text data to be translated according to this instruction is input by the input unit 31. An example of the original English text data is shown in FIG.

システム制御部２１は、入力部３１から翻訳の指示を受け取ると、当該入力部３１によって入力される翻訳されるべき原文データを翻訳部２２に渡す。翻訳部２２は、この原文データをバッファ部３４内の原文バッファ３４１に格納する（ステップＳ１００）。 When receiving a translation instruction from the input unit 31, the system control unit 21 passes the original text data to be translated input by the input unit 31 to the translation unit 22. The translation unit 22 stores the original text data in the original text buffer 341 in the buffer unit 34 (step S100).

次に翻訳部２２は、文字位置ポインタを初期値０にセットする（ステップＳ１１０）。この文字位置ポインタは、原文バッファ３４１に格納された原文データの次に参照されるべき（取り出されるべき）文字の位置を指し示す。ここでは、文字位置ポインタにより原文データの０番目（先頭）の文字、つまり“＜”が指し示される。また上記ステップＳ１１０において、翻訳部２２は単語番号Ｎ及び文番号Ｍをいずれも初期値１にセットする。単語番号Ｎ及び文番号Ｍは、それぞれ単語リストバッファ３４２及び文リストバッファ３４３のＮ番目及びＭ番目のエントリを指し示す。 Next, the translation unit 22 sets the character position pointer to the initial value 0 (step S110). This character position pointer indicates the position of the character to be referred to (to be extracted) next to the original text data stored in the original text buffer 341. Here, the character position pointer indicates the 0th (first) character of the original text data, that is, “<”. In step S110, the translation unit 22 sets both the word number N and the sentence number M to the initial value 1. Word number N and sentence number M indicate the Nth and Mth entries in word list buffer 342 and sentence list buffer 343, respectively.

翻訳処理において単語リストバッファ３４２及び文リストバッファ３４３の内容が変化する様子を、それぞれ図８及び図１１に示す。図８から明らかなように、単語リストバッファ３４２の各エントリは、単語番号が設定されるフィールドと、データとしての単語が設定されるデータフィールドと、当該データ（単語）がタグ（マークアップ記号）であるか、或いはタグ以外（つまりテキスト部分）であるかのデータ種別を示すデータ種別情報が設定されるデータ種別フィールドと、当該データ（単語）がタグの場合に、そのタグ（マークアップ記号）が文中で使用される文中タグ（文中マークアップ記号）であるか、或いは文中で使用されないタグ、つまり文外で使用される文外タグ（文外マークアップ記号）であるかのタグ種別（マークアップ記号種別）を示すタグ種別情報が設定されるタグ種別フィールドとを含む。ここで、データ種別フィールド及びタグ種別フィールドを１つのフィールドにまとめて、データフィールドに設定されるデータ（単語）が、テキスト部分であるか、文内タグであるか、或いは文外タグであるかを示す種別情報が設定されるフィールドとしても良い。 FIGS. 8 and 11 show how the contents of the word list buffer 342 and the sentence list buffer 343 change during the translation process, respectively. As is apparent from FIG. 8, each entry in the word list buffer 342 includes a field in which a word number is set, a data field in which a word as data is set, and the data (word) is a tag (markup symbol). Or a data type field in which data type information indicating a data type indicating a data type other than a tag (that is, a text part) is set, and the tag (markup symbol) when the data (word) is a tag The tag type (mark) that is a tag in a sentence (markup symbol in a sentence) used in a sentence or a tag that is not used in a sentence, that is, a tag outside a sentence (markout symbol outside a sentence) Tag type field in which tag type information indicating up symbol type) is set. Here, if the data type field and the tag type field are combined into one field, whether the data (word) set in the data field is a text part, an in-sentence tag, or an out-sentence tag It may be a field in which type information indicating is set.

一方、文リストバッファ３４３の各エントリは、図１１から明らかなように、文番号が設定されるフィールドと、データとしての文（単語列）が設定されるデータフィールドと、当該データ（文）がタグ（マークアップ記号）であるか、或いはテキストであるかのデータ種別を示すデータ種別情報が設定されるデータ種別フィールドとを含む。このように文リストバッファ３４３に格納される文は、タグ文とテキスト文とに分類される。 On the other hand, as is apparent from FIG. 11, each entry in the sentence list buffer 343 includes a field in which a sentence number is set, a data field in which a sentence (word string) as data is set, and the data (sentence). And a data type field in which data type information indicating a data type indicating whether it is a tag (markup symbol) or text is set. As described above, the sentences stored in the sentence list buffer 343 are classified into a tag sentence and a text sentence.

翻訳部２２はステップＳ１１０の後、文字位置ポインタが指し示す文字がタグ（マークアップ記号）の開始を示すか否かを判定する（ステップＳ１２０）。ここでは、“＜”であればタグの開始を示すと判定され、ステップＳ１３０の処理に進む。“＜”でない場合はステップＳ１４０に進む。 After step S110, the translation unit 22 determines whether the character pointed to by the character position pointer indicates the start of a tag (markup symbol) (step S120). Here, if “<”, it is determined that the start of the tag is indicated, and the process proceeds to step S130. If it is not “<”, the process proceeds to step S140.

ステップＳ１３０において翻訳部２２は、単語リストバッファ３４２の単語番号Ｎが１のエントリのデータフィールドに文字“＜”を、データ種別フィールドに“タグ”というデータ種別情報を、それぞれ格納する。図８（ａ）は、このときの単語リストバッファ３４２の状態を示す。なお、上記単語番号Ｎ（＝１）のエントリに既にデータが格納されている場合には、単語番号Ｎが１インクリメントされて、次の単語番号のエントリに文字“＜”が格納される。 In step S <b> 130, the translation unit 22 stores the character type “<” in the data field of the entry whose word number N is 1 in the word list buffer 342 and the data type information “tag” in the data type field. FIG. 8A shows the state of the word list buffer 342 at this time. When data is already stored in the entry having the word number N (= 1), the word number N is incremented by 1, and the character “<” is stored in the next word number entry.

一方、ステップＳ１４０において翻訳部２２は、文字位置ポインタが指し示す文字がタグの終了を示すか否かを判定する。このとき“＞”であれば、翻訳部２２はタグの終了を判定して、ステップＳ２２０の処理に進む。“＞”でない場合は、翻訳部２２はＳ１５０に進む。 On the other hand, in step S140, the translation unit 22 determines whether the character indicated by the character position pointer indicates the end of the tag. If “>” at this time, the translation unit 22 determines the end of the tag and proceeds to the process of step S220. If it is not “>”, the translation unit 22 proceeds to S150.

ステップＳ１５０において翻訳部２２の分離部２２１は、文字位置ポインタが指し示す文字が文末に使用される文字であるか否かを判定する。英語の場合、“．”，“！”，“？”等がこれに相当する。ここでは、“Ｍｒ．”等のように、“．”が単語の一部として使用される場合を考慮して、翻訳部２２の分離部２２１が辞書を参照することにより文末か否かが判定される。翻訳部２２は、文末と判定された場合はステップＳ２９０に進み、文末でないと判定された場合はステップＳ１６０に進む。 In step S150, the separation unit 221 of the translation unit 22 determines whether or not the character indicated by the character position pointer is a character used at the end of the sentence. In English, “.”, “!”, “?”, Etc. correspond to this. Here, in consideration of the case where “.” Is used as part of a word, such as “Mr.”, it is determined whether or not the separation unit 221 of the translation unit 22 is the end of the sentence by referring to the dictionary. Is done. The translation unit 22 proceeds to step S290 when it is determined that the sentence ends, and proceeds to step S160 when it is determined that the sentence does not end.

ステップＳ１６０において翻訳部２２の分離部２２１は、文字位置ポインタが指し示す文字が、単語リストバッファ３４２の単語番号がＮのエントリの文字列フィールドの先頭に格納されるべき文字（先頭文字）か否かを判定する。先頭文字ではない場合、翻訳部２２はステップＳ１９０に進む。先頭文字の場合、翻訳部２２のタグ種別判定部２２２は、単語リストバッファ３４２内の単語番号が１乃至Ｎ−１のエントリのデータが全て文外タグであるか否かを判定する（ステップＳ１７０）。 In step S160, the separation unit 221 of the translation unit 22 determines whether the character indicated by the character position pointer is a character (first character) to be stored at the beginning of the character string field of the entry having the word number N in the word list buffer 342. Determine. If it is not the first character, the translation unit 22 proceeds to step S190. In the case of the first character, the tag type determination unit 222 of the translation unit 22 determines whether or not all the data of entries with word numbers 1 to N−1 in the word list buffer 342 are out-of-sentence tags (step S170). ).

全て文外タグの場合、翻訳部２２の文リスト生成部２２４は、文リストバッファ３４３の文番号がＭのエントリに、単語リストバッファ３４２の全データ（単語番号が１乃至Ｎ−１のエントリのデータ）を格納する（ステップＳ１８０）。このとき、翻訳部２２は単語リストバッファ３４２の内容をクリアすると共に単語番号Ｎを初期値１に戻し、且つ文番号Ｍを１インクリメントする。次に翻訳部２２の単語リスト生成部２２３は、単語リストバッファ３４２の単語番号Ｎが１のエントリに、文字位置ポインタが指し示す文字を先頭文字として格納する（ステップＳ１９０）。このとき単語リスト生成部２２３は、上記単語番号Ｎが１のエントリにデータ種別がテキストであるという属性を併せて格納する。一方、上記ステップＳ１６０で、先頭文字ではないと判定された場合、単語リスト生成部２２３は単語リストバッファ３４２の単語番号がＮのエントリに文字位置ポインタが指し示す文字を追加する（ステップＳ１９０）。 In the case of all out-of-sentence tags, the sentence list generation unit 224 of the translation unit 22 adds all the data in the word list buffer 342 (entries with the word numbers 1 to N−1) to the entry with the sentence number M in the sentence list buffer 343. Data) is stored (step S180). At this time, the translation unit 22 clears the contents of the word list buffer 342, returns the word number N to the initial value 1, and increments the sentence number M by 1. Next, the word list generation unit 223 of the translation unit 22 stores the character indicated by the character position pointer as the first character in the entry having the word number N of 1 in the word list buffer 342 (step S190). At this time, the word list generation unit 223 stores the attribute that the data type is text in the entry having the word number N of 1. On the other hand, if it is determined in step S160 that the character is not the first character, the word list generation unit 223 adds the character indicated by the character position pointer to the entry having the word number N in the word list buffer 342 (step S190).

次に翻訳部２２の分離部２２１は、原文バッファ３４１に未処理のデータが残っているかを調べ（ステップＳ２００）、残っていない場合は処理を終了する。残っている場合、分離部２２１は文字位置ポインタを１つだけインクリメントし（ステップＳ２１０）、インクリメント後の文字位置ポインタの示す次の文字の評価を行う（ステップＳ１２０）。 Next, the separation unit 221 of the translation unit 22 checks whether or not unprocessed data remains in the original text buffer 341 (step S200), and ends the process if there is no remaining data. If it remains, the separating unit 221 increments the character position pointer by one (step S210), and evaluates the next character indicated by the character position pointer after the increment (step S120).

このように分離部２２１は、原文バッファ３４１に格納されている原文データを構成する文字列に対して処理を繰り返すことにより、当該文字列をタグ（マークアップ記号）とタグ以外の文字（テキスト）とに分離する。 In this manner, the separation unit 221 repeats the process on the character string that constitutes the original text data stored in the original text buffer 341, thereby converting the character string into a tag (markup symbol) and a character (text) other than the tag. And to separate.

次に、上記ステップＳ１４０においてタグの終了と判定された場合について説明する。この場合、翻訳部２２の単語リスト生成部２２３は単語リストバッファ３４２の単語番号がＮのエントリに“＞”を追加する（ステップＳ２２０）。これにより、単語リストバッファ３４２は図８（ｂ）に示すようにタグ＜ＨＴＭＬ＞が格納されている状態となる。次に単語リスト生成部２２３は、単語番号Ｎを１インクリメントする（ステップＳ２２１）。 Next, the case where it is determined in step S140 that the tag has ended will be described. In this case, the word list generation unit 223 of the translation unit 22 adds “>” to the entry whose word number is N in the word list buffer 342 (step S220). As a result, the word list buffer 342 is in a state where the tag <HTML> is stored as shown in FIG. Next, the word list generation unit 223 increments the word number N by 1 (step S221).

続いて翻訳部２２のタグ種別判定部２２２は、単語リストバッファ３４２の単語番号がＮ−１のエントリに格納されているタグの種類が文外タグであるか否かを判定する（ステップＳ２３０）。ＨＴＭＬでは使用可能なタグとその役割が定められている。このため、タグとその役割の情報に基づき、そのタグが文中で使用される文中タグであるか否か（つまり文外タグであるか）を簡単に判定することができる。この判定のために本実施形態では、タグ種別テーブル３３０が用いられる。このタグ種別テーブル３３０のデータ構造例を図１０に示す。図１０から明らかなように、タグ種別テーブル３３０には、ＨＴＭＬで使用可能なタグ毎に、そのタグの種別、つまり文内タグであるか或いは文外タグであるかを示す情報（タグ種別情報）が登録されている。タグ種別テーブル３３０は、前記したように翻訳辞書部３３内に予め用意されている。 Subsequently, the tag type determination unit 222 of the translation unit 22 determines whether or not the tag type stored in the entry having the word number N−1 in the word list buffer 342 is an out-of-sentence tag (step S230). . In HTML, usable tags and their roles are defined. For this reason, it is possible to easily determine whether or not the tag is an in-sentence tag used in a sentence (that is, an out-of-sentence tag) based on information on the tag and its role. In this embodiment, the tag type table 330 is used for this determination. An example of the data structure of the tag type table 330 is shown in FIG. As is clear from FIG. 10, the tag type table 330 includes, for each tag that can be used in HTML, information indicating the tag type, that is, an in-sentence tag or an out-of-sentence tag (tag type information). ) Is registered. The tag type table 330 is prepared in advance in the translation dictionary unit 33 as described above.

これに対して、ＸＭＬやＳＧＭＬの場合はタグの役割（種別）を特定するのが困難である。そこで、ＸＭＬやＳＧＭＬで記述された原文を翻訳する場合には、以下に述べるようにタグの種別をユーザに指定させると良い。 On the other hand, in the case of XML or SGML, it is difficult to specify the role (type) of the tag. Therefore, when translating the original text described in XML or SGML, it is preferable to let the user specify the tag type as described below.

まず翻訳部２２のタグ種別指定部２２６は、原文バッファ３４１に格納されている原文データ（原文ファイルのデータ）から、当該原文データ中に存在するタグを抽出する。タグ種別指定部２２６は、抽出されたタグのリスト（タグ一覧）をシステム制御部２１の制御により表示部３２の表示画面に表示させる。このときの表示画面例を図９に示す。図９に示す表示画面には、タグ一覧７１に加えて、文内タグ指定ボタン７２及び文外タグ指定ボタン７３が表示される。また、上記表示画面には、文内タグ表示領域７４及び文外タグ表示領域７５が確保されている。ユーザは例えば入力部３１のマウス３１２を操作することで、タグ一覧７１から任意のタグを選択する。またユーザは、選択されたタグが文内タグであるか或いは文外タグであるかを指定するために、文内タグ指定ボタン７２及び文外タグ指定ボタン７３のいずれか一方を選択する。するとシステム制御部２１は、文内タグが指定されたならば選択されたタグを文内タグ表示領域７４に表示し、文外タグが指定されたならば選択されたタグを文外タグ表示領域７５に表示する。タグ種別指定部２２６は、文内タグ表示領域７４及び文外タグ表示領域７５に表示されたタグから、図１０に示すタグ種別テーブル３３０と同様の形式の、翻訳されるべき原文データに固有のタグ種別テーブルを生成する。生成されたテーブルは、この原文データの翻訳処理のみに使用される。 First, the tag type designation unit 226 of the translation unit 22 extracts tags existing in the original text data from the original text data (original text file data) stored in the original text buffer 341. The tag type designation unit 226 displays the extracted tag list (tag list) on the display screen of the display unit 32 under the control of the system control unit 21. An example of the display screen at this time is shown in FIG. In addition to the tag list 71, an in-sentence tag designation button 72 and an out-sentence tag designation button 73 are displayed on the display screen shown in FIG. In addition, an in-sentence tag display area 74 and an out-of-sentence tag display area 75 are secured on the display screen. For example, the user operates the mouse 312 of the input unit 31 to select an arbitrary tag from the tag list 71. The user selects either the in-sentence tag designation button 72 or the out-of-sentence tag designation button 73 in order to designate whether the selected tag is an in-sentence tag or an out-sentence tag. Then, the system control unit 21 displays the selected tag in the in-sentence tag display area 74 if the in-sentence tag is specified, and displays the selected tag in the out-of-sentence tag display area if the out-of-sentence tag is specified. 75. The tag type designation unit 226 is specific to the original text data to be translated in the same format as the tag type table 330 shown in FIG. Generate a tag type table. The generated table is used only for translation processing of the original text data.

さて、翻訳されるべき原文が図７に示されるようにＨＴＭＬで記述されている本実施形態では、タグの種類の判定には予め用意されているタグ種別テーブル３３０（図１０参照）が用いられる。ここでは、原文バッファ３４１内のタグと一致するタグの種別（属性）を表す情報が、当該タグと関連付けて単語リストバッファ３４２に格納される。原文バッファ３４１に格納されているタグ＜ＨＴＭＬ＞の例では、当該タグは図１０に示されるタグ種別テーブル３３０内の＜ＨＴＭＬ＞と一致する。この場合、翻訳部２２のタグ種別判定部２２２は＜ＨＴＭＬ＞の属性として、タグ種別テーブル３３０から“文外”という情報を取得する。つまりタグ種別判定部２２２は、＜ＨＴＭＬ＞が文外タグであると判定する（ステップＳ２３０）。すると翻訳部２２の単語リスト生成部２２３は、図８（ｃ）に示すように、単語リストバッファ３４２の単語番号１（単語番号Ｎ−１）のエントリに、＜ＨＴＭＬ＞の属性として“文外”を示すタグ種別情報を格納する。 Now, in this embodiment in which the original text to be translated is described in HTML as shown in FIG. 7, a tag type table 330 (see FIG. 10) prepared in advance is used to determine the tag type. . Here, information indicating the type (attribute) of the tag that matches the tag in the original sentence buffer 341 is stored in the word list buffer 342 in association with the tag. In the example of the tag <HTML> stored in the original text buffer 341, the tag matches <HTML> in the tag type table 330 shown in FIG. In this case, the tag type determination unit 222 of the translation unit 22 acquires information “out of sentence” from the tag type table 330 as an attribute of <HTML>. That is, the tag type determination unit 222 determines that <HTML> is an out-of-sentence tag (step S230). Then, as shown in FIG. 8C, the word list generation unit 223 of the translation unit 22 adds “HTML” to the entry of word number 1 (word number N−1) in the word list buffer 342 as the attribute of <HTML>. The tag type information indicating “” is stored.

このように、ステップＳ２３０において文外タグと判定された場合、単語リスト生成部２２３は当該タグが格納された単語リストバッファ３４２の単語番号Ｎ−１のエントリに、“文外”を示すタグ種別情報を格納する。この場合、タグ種別判定部２２２は、単語リストバッファ３４２内の単語番号が１乃至Ｎ−２のエントリのデータが全て文外タグであるか否かを判定する（ステップＳ２４０）。全て文外タグの場合、分離部２２１はステップＳ２００を経てステップＳ２１０に進んで、原文バッファ３４１から次のデータを取得する。 As described above, when the tag is determined to be an out-of-sentence tag in step S230, the word list generation unit 223 uses the tag type indicating “out-of-sentence” in the entry of the word number N−1 in the word list buffer 342 in which the tag is stored. Store information. In this case, the tag type determination unit 222 determines whether or not all data of entries having word numbers 1 to N-2 in the word list buffer 342 are out-of-sentence tags (step S240). If all the tags are out-of-text tags, the separation unit 221 proceeds to step S210 through step S200, and acquires the next data from the original text buffer 341.

これに対し、単語番号が１乃至Ｎ−２のエントリのデータが全て文外タグであると判定されない場合、つまり単語番号が１乃至Ｎ−２のエントリのデータが文外タグ以外である場合（ステップＳ２４０）、タグ種別判定部２２２は、当該単語番号が１乃至Ｎ−２のエントリにテキストまたは文内タグが格納されていると判断する。この場合、翻訳文生成部２２０は、最後に単語リストバッファ３４２に格納された文外タグを除いた当該バッファ３４２のデータ（つまり単語番号が１乃至Ｎ−２のエントリのデータ）を翻訳対象文とする翻訳を行う（ステップＳ２５０）。この翻訳の手順については、上記特許文献１などに記載されているように従来からよく知られているため省略する。文リスト生成部２２４は、ステップＳ２５０での翻訳に用いられた翻訳対象文を文リストバッファ３４３の文番号がＭのエントリに格納して、文番号Ｍを１インクリメントする（ステップＳ２６０）。このとき単語リスト生成部２２３は、単語リストバッファ３４２の単語番号がＮ−１のエントリのデータを、当該バッファ３４２の単語番号１のエントリに移動して、単語番号Ｎを２に設定する。 On the other hand, when it is not determined that all the data of entries having the word numbers 1 to N-2 are out-of-sentence tags, that is, when the data of entries having word numbers 1 to N-2 are other than out-of-sentence tags In step S240, the tag type determination unit 222 determines that the text or in-sentence tag is stored in the entry having the word number 1 to N-2. In this case, the translated sentence generation unit 220 uses the data in the buffer 342 excluding the extra-sentence tag stored in the word list buffer 342 at the end (that is, data of entries having word numbers 1 to N-2) as the translation target sentence. Is translated (step S250). Since this translation procedure is well known from the past as described in Patent Document 1 and the like, a description thereof will be omitted. The sentence list generation unit 224 stores the translation target sentence used for the translation in step S250 in an entry whose sentence number is M in the sentence list buffer 343, and increments the sentence number M by 1 (step S260). At this time, the word list generation unit 223 moves the data of the entry whose word number is N−1 in the word list buffer 342 to the entry of word number 1 in the buffer 342 and sets the word number N to 2.

一方、ステップＳ２３０において文内タグと判定された場合、タグ種別判定部２２２は上記ステップＳ２４０と同様に、単語リストバッファ３４２内の単語番号が１乃至Ｎ−２のエントリのデータが全て文外タグであるか否かを判定する（ステップＳ２７０）。全て文外タグの場合、文リスト生成部２２４は、単語番号が１乃至Ｎ−２のエントリのデータ（つまり文外タグ群）を１つの文として、文リストバッファ３４３の文番号がＭのエントリに格納し、文番号Ｍを１インクリメントする（ステップＳ２８０）。このとき単語リスト生成部２２３は、最後に単語リストバッファ３４２に格納されたデータ、つまり単語番号がＮ−１のエントリのデータを、当該バッファ３４２の単語番号１のエントリに移動して、単語番号Ｎを２に設定する。。これに対し、単語番号が１乃至Ｎ−２のエントリのデータが全て文外タグ以外である場合、つまりテキストまたは文内タグの場合（ステップＳ２７０）、分離部２２１はステップＳ２００を経てステップＳ２１０に進んで、原文バッファ３４１から次のデータを取得する。 On the other hand, if it is determined in step S230 that the tag is an in-sentence tag, the tag type determination unit 222 determines that all the data of the entries having the word numbers 1 to N-2 in the word list buffer 342 are the out-of-sentence tags, as in step S240. It is determined whether or not (step S270). In the case of all out-of-sentence tags, the sentence list generation unit 224 takes the entry data (that is, out-of-sentence tag group) with the word numbers 1 to N-2 as one sentence, and the entry with the sentence number M in the sentence list buffer 343. And the sentence number M is incremented by 1 (step S280). At this time, the word list generation unit 223 moves the data stored last in the word list buffer 342, that is, the data of the entry having the word number N-1 to the entry of the word number 1 in the buffer 342, and the word number Set N to 2. . On the other hand, when all the data of the entries having the word numbers 1 to N-2 are other than the out-of-sentence tags, that is, in the case of text or in-sentence tags (step S270), the separation unit 221 goes to step S210 through step S200. Then, the next data is acquired from the original text buffer 341.

次に、上記ステップＳ１５０において文末と判定された場合について説明する。この場合、単語リスト生成部２２３は文末と判定された文字（文末文字）を単語リストバッファ３４２の単語番号がＮのエントリに格納する（ステップＳ２９０）。すると翻訳文生成部２２０は、現在単語リストバッファ３４２に格納されている全データ（つまり単語番号が１乃至Ｎのエントリに格納されているデータ）を翻訳対象文とする翻訳を行う（ステップＳ３００）。文リスト生成部２２４は、この翻訳対象文を文リストバッファ３４３の文番号がＭのエントリに格納し、文番号Ｍを１インクリメントする（ステップＳ３１０）。 Next, the case where it is determined that the sentence ends in step S150 will be described. In this case, the word list generation unit 223 stores the character determined to be the end of the sentence (end of sentence character) in an entry whose word number is N in the word list buffer 342 (step S290). Then, the translated sentence generation unit 220 performs translation using all data currently stored in the word list buffer 342 (that is, data stored in entries having word numbers 1 to N) as translation target sentences (step S300). . The sentence list generation unit 224 stores the sentence to be translated in an entry whose sentence number is M in the sentence list buffer 343, and increments the sentence number M by 1 (step S310).

次に、図７に示す英語の原文データに対する翻訳が指示された場合の上述の翻訳処理をより具体的に説明する。まず、図７に示す原文における文字列（タグ）“＜ＨＴＭＬ＞”の先頭文字“＜”がタグの開始と判定される（ステップＳ１２０）。文字“＜”は、図８（ａ）に示すように、単語リストバッファ３４２の単語番号１のエントリに格納される（ステップＳ１３０）。 Next, the above-described translation processing in the case where translation for the original English text data shown in FIG. 7 is instructed will be described more specifically. First, it is determined that the first character “<” of the character string (tag) “<HTML>” in the original text shown in FIG. 7 is the start of the tag (step S120). As shown in FIG. 8A, the character “<” is stored in the entry of word number 1 in the word list buffer 342 (step S130).

続いて文字列“＜ＨＴＭＬ＞”中の“Ｈ”，“Ｔ”，“Ｍ”，“Ｌ”が上記単語番号１のエントリに順次格納される（ステップＳ１９０）。次に文字列“＜ＨＴＭＬ＞”中の“＞”がタグの終了と判定され（ステップＳ１４０）、図８（ｂ）に示すように当該“＞”が上記単語番号１のエントリに格納される（ステップＳ２２０）。次に図１０に示すタグ種別テーブル３３０が参照されて、上記単語番号１のエントリに格納された文字列“＜ＨＴＭＬ＞”が文外タグであると判定される（ステップＳ２３０）。これにより、上記単語番号１のエントリには、図８（ｃ）に示すように、“＜ＨＴＭＬ＞”のタグ種別が文外タグであることを示す情報が格納される。このとき単語番号Ｎが１から２にインクリメントされる。 Subsequently, “H”, “T”, “M”, “L” in the character string “<HTML>” are sequentially stored in the entry of the word number 1 (step S190). Next, “>” in the character string “<HTML>” is determined to be the end of the tag (step S140), and the “>” is stored in the entry of the word number 1 as shown in FIG. 8B. (Step S220). Next, the tag type table 330 shown in FIG. 10 is referred to, and it is determined that the character string “<HTML>” stored in the entry of the word number 1 is an out-of-sentence tag (step S230). As a result, the entry of the word number 1 stores information indicating that the tag type of “<HTML>” is a non-sentence tag, as shown in FIG. 8C. At this time, the word number N is incremented from 1 to 2.

次に、図７に示す原文における文字列（タグ）“＜ＢＯＤＹ＞”の部分の処理が１文字ずつ同様に行われる。“＜ＢＯＤＹ＞”も“＜ＨＴＭＬ＞”と同様に文外タグと判定されるため、単語リストバッファ３４２は図８（ｄ）に示す状態となる。このとき単語番号Ｎが２から３にインクリメントされる。 Next, the processing of the character string (tag) “<BODY>” portion in the original text shown in FIG. 7 is similarly performed character by character. Similarly to “<HTML>”, “<BODY>” is also determined to be an out-of-sentence tag, so the word list buffer 342 is in the state shown in FIG. At this time, the word number N is incremented from 2 to 3.

続いて、図７に示す原文における文字列“Ｃｌｉｃｋ”の部分の処理が次のように行われる。まず文字“Ｃ”が取得された場合に、当該“Ｃ”が単語リストバッファ３４２の単語番号Ｎ（つまり単語番号３）のエントリにおける先頭文字と判定される（ステップＳ１６０）。このとき図８（ｄ）から明らかなように、単語リストバッファ３４２の単語番号１乃至Ｎ−１（つまり単語番号１及び２）のエントリに格納されているデータ“＜ＨＴＭＬ＞”及び“＜ＢＯＤＹ＞”は全て文外タグである（ステップＳ１７０）。この場合、“＜ＨＴＭＬ＞”及び“＜ＢＯＤＹ＞”は１つのデータとして、文リストバッファ３４３の文番号Ｍが１のエントリに格納される（ステップＳ１８０）。このときの文リストバッファ３４３の状態を図１１（ａ）に示す。上記ステップＳ１８０では、単語リストバッファ３４２の内容がクリアされると共に、単語番号Ｎ（＝３）が１に初期化される。 Subsequently, the processing of the portion of the character string “Click” in the original text shown in FIG. 7 is performed as follows. First, when the character “C” is acquired, it is determined that the “C” is the first character in the entry of the word number N (that is, the word number 3) in the word list buffer 342 (step S160). At this time, as apparent from FIG. 8D, the data “<HTML>” and “<BODY” stored in the entries of the word numbers 1 to N−1 (that is, the word numbers 1 and 2) in the word list buffer 342. > "Are all out-of-sentence tags (step S170). In this case, “<HTML>” and “<BODY>” are stored as one data in the entry having the sentence number M of 1 in the sentence list buffer 343 (step S180). The state of the sentence list buffer 343 at this time is shown in FIG. In step S180, the contents of the word list buffer 342 are cleared and the word number N (= 3) is initialized to 1.

次に文字“Ｃ”が、図８（ｅ）に示すように、単語リストバッファ３４２の単語番号１のエントリに格納されると共に“テキスト”という属性が格納される（ステップＳ１９０）。残りの“ｌｉｃｋ”の部分は、上記単語番号１のエントリの“Ｃ”に後続する位置に追加される（ステップＳ１９０）。 Next, the character “C” is stored in the entry of word number 1 in the word list buffer 342 and the attribute “text” is stored as shown in FIG. 8E (step S190). The remaining “lick” part is added to the position following “C” in the entry of word number 1 (step S190).

続いて、図７に示す原文における文字列“＜Ａｈｒｅｆ＝”ｈｔｔｐ：／／ｗｗｗ．ａａａ．ｃｏｍ＞”の部分の処理が次のように行われる。まず、“＜”が取得されると、当該“＜”が単語リストバッファ３４２の単語番号２のエントリに格納される（ステップＳ１３０）。次に、残りの“Ａｈｒｅｆ＝”ｈｔｔｐ：／／ｗｗｗ．ａａａ．ｃｏｍ＞”のうち“＞”を除く文字列が、上記単語番号２のエントリの“＜”に後続する位置に順次追加される（ステップＳ１９０）。そして最後の“＞”が上記単語番号２のエントリに格納されると（ステップＳ２２０）、当該エントリに格納されている文字列“＜Ａｈｒｅｆ＝”ｈｔｔｐ：／／ｗｗｗ．ａａａ．ｃｏｍ＞”のタグの種別が、図１０に示すタグ種別テーブル３３０を参照して行われる（ステップＳ２３０）。 Subsequently, the character string “<A href =” http: // www. aaa. The process of the part “com>” is performed as follows: First, when “<” is acquired, the “<” is stored in the entry of the word number 2 in the word list buffer 342 (step S130). Next, of the remaining “A href =” http://www.aaa.com> ”, a character string excluding“> ”is sequentially added to a position subsequent to“ <”in the entry of the word number 2. (Step S190). When the last “>” is stored in the entry of the word number 2 (step S220), the character string “<A href =” http: // www. aaa. The tag type of “com>” is performed with reference to the tag type table 330 shown in FIG. 10 (step S230).

文字列“＜Ａｈｒｅｆ＝”ｈｔｔｐ：／／ｗｗｗ．ａａａ．ｃｏｍ＞”は、文字列“＜Ａ”の次がスペースの場合、最後の文字“＞”と合わせてタグ“＜Ａ＞”として認識される。この場合、図１０に示すタグ種別テーブル３３０から明らかなように、タグ“＜Ａ＞”は文内タグと判定される（ステップＳ２３０）。すると、単語リストバッファ３４２の単語番号２より小さな単語番号のエントリ、つまり単語番号１のエントリのデータのタグが文外タグであるか否かが判定される（ステップＳ２７０）。このとき、単語番号１のエントリには文字列“Ｃｌｉｃｋ”が格納されており、当該文字列“Ｃｌｉｃｋ”は文外タグでないと判定される。 The character string “<A href =” http: // www. aaa. com> ”is recognized as a tag“ <A> ”together with the last character“> ”when the character string“ <A ”is followed by a space. In this case, the tag type table 330 shown in FIG. As is apparent, the tag “<A>” is determined to be an in-sentence tag (step S230), and the data of the entry of the word number smaller than the word number 2 of the word list buffer 342, that is, the entry of the word number 1 is stored. It is determined whether or not the tag is an out-of-sentence tag (step S270) At this time, the character string “Click” is stored in the entry of word number 1, and the character string “Click” is included in the out-of-sentence tag. It is determined that it is not.

続いて、図７に示す原文における文字列“Ｈｅｒｅ”及び“＜／Ａ＞”に対しても同様の処理が行われ、当該“Ｈｅｒｅ”及び“＜／Ａ＞”は単語リストバッファ３４２のそれぞれ単語番号３及び４のエントリに格納される。この結果、単語リストバッファ３４２は図８（ｆ）に示す状態となる。 Subsequently, the same processing is performed on the character strings “Here” and “</A>” in the original text shown in FIG. 7, and the “Here” and “</A>” are respectively stored in the word list buffer 342. It is stored in the entries of word numbers 3 and 4. As a result, the word list buffer 342 is in the state shown in FIG.

更に“＜／Ａ＞”に後続する“．”が文末文字と判定され（ステップＳ１５０）、図８（ｇ）に示すように、当該“．”が単語リストバッファ３４２の単語番号５のエントリに格納される（ステップＳ２９０）。すると、その時点において単語リストバッファ３４２に格納されている全データ、即ち単語番号が１乃至５のエントリに格納されているデータ）を翻訳対象文とする翻訳を行う（ステップＳ３００）。この翻訳においては、上記特許文献１に記載されているように、原文（翻訳対象原文）に含まれているタグが訳文に反映される。ここで、翻訳対象原文に文外タグが含まれていないことに注意されたい。このとき翻訳部２２の原文／訳文単語リスト生成部２２５は、単語リストバッファ３４２の内容に対応する原文単語リスト及びタグ情報が反映された訳文を構成する単語のリスト（訳文単語リスト）を生成する。原文単語リスト及び訳文単語リストは、原文と訳文との間の単語（テキスト）の対応関係及び原文に付されているタグと訳文に反映されているタグとの対応関係を示す情報を含む。 Further, “.” Following “</A>” is determined as the end-of-sentence character (step S150), and as shown in FIG. 8G, the “.” Is added to the entry of word number 5 in the word list buffer 342. Stored (step S290). Then, all the data stored in the word list buffer 342 at that time, that is, the data stored in the entries having the word numbers 1 to 5) are translated as translation target sentences (step S300). In this translation, as described in Patent Document 1, the tag included in the original sentence (translation target original sentence) is reflected in the translated sentence. It should be noted here that the translation target text does not include extra-text tags. At this time, the original sentence / translated word list generating section 225 of the translating section 22 generates an original sentence word list corresponding to the contents of the word list buffer 342 and a list of words constituting the translated sentence reflecting the tag information (translated word list). . The original word list and the translated word list include information indicating the correspondence between words (text) between the original sentence and the translated sentence, and the correspondence between the tag attached to the original sentence and the tag reflected in the translated sentence.

文リスト生成部２２４は、単語リストバッファ３４２に格納されている翻訳対象原文を、図１１（ｂ）に示すように、文リストバッファ３４３の文番号２のエントリに格納する（ステップＳ３１０）。このとき原文／訳文単語リスト生成部２２５は、翻訳対象原文に対応する上記原文単語リスト及び訳文単語リストを、それぞれ原文単語リストバッファ３４４及び訳文単語リストバッファ３４５に格納する。図１２及び図１３は、それぞれ、この時点における原文単語リストバッファ３４４及び訳文単語リストバッファ３４５の内容（原文単語リスト及び訳文単語リスト）を示す。単語リストバッファ３４４及び３４５の各エントリは、単語リストバッファ３４２の各エントリにおけるタグ種別フィールドに代えて、対応関係フィールドを含む。 The sentence list generation unit 224 stores the original text to be translated stored in the word list buffer 342 in the entry of sentence number 2 in the sentence list buffer 343 as shown in FIG. 11B (step S310). At this time, the original / translated word list generating unit 225 stores the original word list and the translated word list corresponding to the original sentence to be translated in the original word list buffer 344 and the translated word list buffer 345, respectively. FIGS. 12 and 13 show the contents (original word list and translated word list) of the original word list buffer 344 and the translated word list buffer 345 at this time point, respectively. Each entry in the word list buffers 344 and 345 includes a correspondence field instead of the tag type field in each entry in the word list buffer 342.

図１２及び図１３において、それぞれ同一の固有の数値（対応関係情報）が対応関係フィールドに設定されたエントリの単語（テキストまたはタグ）は対応していることを示す。例えば、原文単語リストバッファ３４４においてテキスト“Ｃｌｉｃｋ”が格納されているエントリ及び訳文単語リストバッファ３４５においてテキスト“クリックしなさい”が格納されているエントリの対応関係フィールドには、同一の数値１が設定されている。このことは、“Ｃｌｉｃｋ”と“クリックしなさい”とが対応することを示す。なお、原文単語リスト及び訳文リスト間で対応関係のない単語が格納されているエントリの対応関係フィールドには、その旨を示す特別の数値、例えば−１が設定されている。 12 and 13, the same unique numerical value (correspondence information) indicates that the word (text or tag) of the entry set in the correspondence field corresponds. For example, the same numerical value 1 is set in the correspondence field of the entry storing the text “Click” in the source word list buffer 344 and the entry storing the text “click” in the translated word list buffer 345. Has been. This indicates that “Click” corresponds to “Click”. Note that a special numerical value indicating this, for example, -1, is set in the correspondence field of the entry in which words having no correspondence between the original word list and the translation list are stored.

続いて、図７に示す原文における文字列“＜／ＢＯＤＹ＞”及び“＜／ＨＴＭＬ＞”に関しても同様に処理が行われて、当該“＜／ＢＯＤＹ＞”及び“＜／ＨＴＭＬ＞”はいずれも文外タグとして判定される。これにより、文リストバッファ３４３は最終的に図１１（ｃ）に示す状態となる。 Subsequently, the same processing is performed on the character strings “</ BODY>” and “</ HTML>” in the original text shown in FIG. 7, and the “</ BODY>” and “</ HTML>” Are also determined as out-of-sentence tags. As a result, the sentence list buffer 343 finally becomes in the state shown in FIG.

翻訳部２２による翻訳処理が完了すると、システム制御部２１は表示部３２を用いた表示処理を実行する。以下、この表示処理について、図６のフローチャート並びに図１４の原文及び訳文の表示例を参照して説明する。 When the translation process by the translation unit 22 is completed, the system control unit 21 executes a display process using the display unit 32. Hereinafter, this display process will be described with reference to the flowchart of FIG. 6 and the display example of the original text and the translated text of FIG.

まずシステム制御部２１は、文リストバッファ３４３のＭ番目のエントリを指定する文番号Ｍを初期値１にセットする（ステップＳ５００）。システム制御部２１は、文リストバッファ３４３の文番号Ｍ（＝１）のエントリを参照することにより（ステップＳ５０１）、当該エントリに文が存在するか否かを判定する（ステップＳ５１０）。もし、文番号Ｍのエントリに文が存在しない場合、システム制御部２１は表示処理を終了する。文番号Ｍのエントリに文が存在する場合、システム制御部２１は当該文がタグであるか否かを、エントリのデータ種別フィールドに基づいて判定する（ステップＳ５２０）。 First, the system control unit 21 sets the sentence number M designating the Mth entry in the sentence list buffer 343 to an initial value 1 (step S500). The system control unit 21 refers to the entry of the sentence number M (= 1) in the sentence list buffer 343 (step S501), and determines whether or not a sentence exists in the entry (step S510). If there is no sentence in the entry of sentence number M, the system control unit 21 ends the display process. If there is a sentence in the entry with the sentence number M, the system control unit 21 determines whether or not the sentence is a tag based on the data type field of the entry (step S520).

文番号Ｍのエントリに存在する文の種別がタグの場合（ステップＳ５２０）、システム制御部２１は文番号Ｍを１インクリメントし（ステップＳ５２１）、このインクリメント後の文番号Ｍで指定される文リストバッファ３４３のエントリを参照する（ステップＳ５０１）。 When the type of sentence existing in the entry of sentence number M is a tag (step S520), the system control unit 21 increments the sentence number M by 1 (step S521), and the sentence list specified by the sentence number M after this increment The entry in the buffer 343 is referred to (step S501).

一方、文番号Ｍのエントリに存在する文（つまり文番号Ｍの原文）の種別がテキストの場合（ステップＳ５２０）、システム制御部２１は、当該文番号Ｍの原文に関連付けられている原文単語リスト及び訳文単語リストを、それぞれ原文単語リストバッファ３４４及び訳文単語リストバッファ３４５から取得する（ステップＳ５３０）。システム制御部２１は、原文単語リスト及び訳文単語リストの各々に含まれている単語について以下の処理を繰り返す。 On the other hand, when the type of the sentence existing in the entry with the sentence number M (that is, the original sentence with the sentence number M) is text (step S520), the system control unit 21 selects the original sentence word list associated with the original sentence with the sentence number M. And the translated word list are obtained from the original word list buffer 344 and the translated word list buffer 345, respectively (step S530). The system control unit 21 repeats the following processing for the words included in each of the original word list and the translated word list.

まずシステム制御部２１は、原文単語リストまたは訳文単語リストから未処理の単語を取り出す処理を行う（ステップＳ５４０）。もし、未処理の単語が存在するために、当該未処理の単語を取り出すことができたならば（ステップＳ５５０）、システム制御部２１は当該取り出された単語に関連付けられているデータ種別を判定する（ステップＳ５６０）。 First, the system control unit 21 performs processing for extracting unprocessed words from the original word list or the translated word list (step S540). If there is an unprocessed word and the unprocessed word can be extracted (step S550), the system control unit 21 determines the data type associated with the extracted word. (Step S560).

取り出された単語のデータ種別がテキストの場合、つまりタグでない場合、システム制御部２１は表示部３２を用いて当該単語を表示する（ステップＳ５７０）。これに対し、取り出された単語のデータ種別がタグの場合、システム制御部２１は原文単語リストまたは訳文単語リストから次の未処理の単語を取り出す処理を行う（ステップＳ５４０）。ここでは、原文単語リストに含まれている単語についての処理が先に実行されるものとするが、この逆でも構わない。また、単語番号の例えば昇順に原文単語リスト及び訳文単語リストから交互に単語が取り出される構成であっても構わない。 If the data type of the extracted word is text, that is, if it is not a tag, the system control unit 21 displays the word using the display unit 32 (step S570). On the other hand, when the data type of the extracted word is a tag, the system control unit 21 performs a process of extracting the next unprocessed word from the original word list or the translated word list (step S540). Here, it is assumed that the processing for the words included in the original word list is executed first, but this may be reversed. In addition, for example, the word numbers may be alternately extracted from the source word list and the translated word list in ascending order.

原文単語リスト及び訳文単語リスト中の全ての単語に対して処理が行われると（ステップＳ５５０）、システム制御部２１は文番号Ｍを１インクリメントし（ステップＳ５２１）、このインクリメント後の文番号Ｍで指定される文リストバッファ３４３のエントリを参照する（ステップＳ５０１）。このようにしてシステム制御部２１は、最後の文まで処理を行う。その結果、図１１（ｃ）に示す文リスト、当該文リスト中の文番号２の文に対応するそれぞれ図１２及び図１３に示す原文単語リスト及び訳文単語リストの場合、図１４に示す原文及び訳文が表示される。この原文及び訳文はタグを含まないため、ユーザはタグの存在を意識せずに原文及び訳文の編集操作を行うことができる。なお、図１４では、速図の都合上、原文及び訳文が上限配置されているが、左右に配置されても構わない。 When all the words in the original word list and the translated word list are processed (step S550), the system control unit 21 increments the sentence number M by 1 (step S521), and the sentence number M after the increment is used. The entry of the designated sentence list buffer 343 is referred to (step S501). In this way, the system control unit 21 performs processing up to the last sentence. As a result, in the case of the sentence list shown in FIG. 11C, the original word list and the translated word list shown in FIGS. 12 and 13 respectively corresponding to the sentence with sentence number 2 in the sentence list, the original sentence shown in FIG. The translation is displayed. Since the original sentence and the translated sentence do not include a tag, the user can edit the original sentence and the translated sentence without being aware of the presence of the tag. In FIG. 14, the original sentence and the translated sentence are arranged at the upper limit for convenience of the speed diagram, but they may be arranged on the left and right.

なお、上記実施形態では、原文バッファ３４１、単語リストバッファ３４２、文リストバッファ３４３、原文単語リストバッファ３４４及び訳文単語リストバッファ３４５が予めバッファ部３４に確保されている場合を前提としている。しかし、これらのバッファが必要に応じて確保されるものであっても構わない。また、例えば単語リスト及び文リストの各エントリが、逐次生成される構成であっても構わない。 In the above embodiment, it is assumed that the original sentence buffer 341, the word list buffer 342, the sentence list buffer 343, the original sentence word list buffer 344, and the translated word list buffer 345 are secured in the buffer unit 34 in advance. However, these buffers may be secured as necessary. Further, for example, each entry of the word list and sentence list may be generated sequentially.

［変形例］
次に上記実施形態の変形例について説明する。この変形例の第１の特徴は、上記実施形態と異なってタグも表示される点にある。この変形例の第２の特徴は、タグとテキストとを表示画面上でユーザが容易に識別可能な表示を行う点にある。以下、上記実施形態の変形例における表示処理について、図１５の原文及び訳文の表示例並びに図１６のフローチャートを参照して、上記実施形態における表示処理と相違する部分のみ説明する。なお、図１６において、図６と同様の部分には同一符号を付してある。 [Modification]
Next, a modification of the above embodiment will be described. The first feature of this modification is that, unlike the above embodiment, tags are also displayed. The second feature of this modification is that the tag and text are displayed on the display screen so that the user can easily identify them. Hereinafter, the display processing in the modification of the above embodiment will be described with reference to the display example of the original text and the translation in FIG. 15 and the flowchart in FIG. In FIG. 16, the same parts as those in FIG. 6 are denoted by the same reference numerals.

まず、原文単語リストまたは訳文単語リストから取り出された単語のデータ種別がテキストであるものとする（ステップＳ５６０）。この場合、システム制御部２１は、上記取り出された単語の表示フォントの表示属性としての例えば強調表示を解除して（ステップＳ５６１）、当該単語を表示する（ステップＳ５７０）。一方、上記取り出された単語のデータ種別がタグの場合（ステップＳ５６０）、システム制御部２１は当該単語の表示フォントの表示属性として強調表示を設定して（ステップＳ５６２）、当該単語を表示する（ステップＳ５７０）。 First, it is assumed that the data type of the word extracted from the original word list or the translated word list is text (step S560). In this case, the system control unit 21 cancels, for example, highlighting as the display attribute of the display font of the extracted word (step S561), and displays the word (step S570). On the other hand, when the data type of the extracted word is a tag (step S560), the system control unit 21 sets highlighting as a display attribute of the display font of the word (step S562) and displays the word ( Step S570).

以上により、本変形例においては、図１１（ｃ）に示す文リスト、当該文リスト中の文番号２の文に対応するそれぞれ図１２及び図１３に示す原文単語リスト及び訳文単語リストの場合、図１５に示す原文及び訳文が表示される。この図１５では、原文と訳文にテキストとタグが混在する形で表示される。しかし、タグの部分が図１５においてハッチングで示されるように強調表示されている。このためユーザは、タグの部分（つまりマークアップ記号）とテキスト部分とを一目で識別することができ、且つタグが原文と訳文のどの位置に存在するかを確認することができる。これによりユーザは、タグ部分に誤った修正を行うことなく、例えば訳文に対する編集作業を効率的に行うことが可能となる。図１５では、速図の都合上、原文及び訳文が上限配置されているが、左右に配置されても構わない。 As described above, in this modification, in the case of the sentence list shown in FIG. 11C, the original word list and the translated word list shown in FIG. 12 and FIG. 13 corresponding to the sentence with sentence number 2 in the sentence list, The original sentence and the translated sentence shown in FIG. 15 are displayed. In FIG. 15, text and tags are mixedly displayed in the original sentence and the translated sentence. However, the tag portion is highlighted as shown by hatching in FIG. For this reason, the user can identify the tag portion (that is, the markup symbol) and the text portion at a glance, and can confirm the position of the tag in the original sentence and the translated sentence. As a result, the user can efficiently edit, for example, the translated sentence without erroneously correcting the tag portion. In FIG. 15, the original sentence and the translated sentence are arranged at the upper limit for convenience of the speed diagram, but they may be arranged on the left and right.

なお、上記実施形態のようにタグを表示しないか、本変形例のようにタグを表示するかは、編集部２３のタグ表示指定部２３１によりユーザに指定させることができる。このタグ表示指定部２３１が、システム制御部２１に含まれる構成であっても構わない。 It should be noted that whether the tag is not displayed as in the above embodiment or whether the tag is displayed as in this modification can be designated by the user by the tag display designation unit 231 of the editing unit 23. The tag display designation unit 231 may be included in the system control unit 21.

以下、本変形例で適用される、ユーザの編集操作に従う編集処理について、図１７のフローチャートを参照して説明する。この編集処理では、表示部３２に表示されている原文または訳文に対してデータの種別により編集が制御される。 Hereinafter, editing processing according to a user's editing operation, which is applied in the present modification, will be described with reference to the flowchart of FIG. In this editing process, editing of the original sentence or the translated sentence displayed on the display unit 32 is controlled according to the type of data.

まず、図１３の訳文に含まれている文字列“＜Ａｈｒｅｆ＝”ｈｔｔｐ：／／ｗｗｗ．ａａａ．ｃｏｍ＞”内にカーソルが存在する状態で、入力部３１に含まれているキーボード３１１上のＢａｃｋＳｐａｃｅキーが押されたものとする。すると、ＢａｃｋＳｐａｃｅキーに固有のキーコードが入力部３１により入力される。システム制御部２１は、この入力キーコードを編集部２３に送る。 First, the character string “<A href =” http: // www. aaa. It is assumed that the BackSpace key on the keyboard 311 included in the input unit 31 is pressed in a state where the cursor is present in “com>”. Then, a key code unique to the BackSpace key is input by the input unit 31. The system control unit 21 sends this input key code to the editing unit 23.

編集部２３の編集処理部２３０はシステム制御部２１によって送られたキーコードを受け取ると、当該キーコードに基づき編集キーが押されたか否かを判定する（ステップＳ７００）。ここで編集キーとは、ＢａｃｋＳｐａｃｅ（後退）キー、Ｄｅｌｅｔｅ（削除）キー、文字キー、数字キーなどを指す。ここでは、システム制御部２１によってＢａｃｋＳｐａｃｅキーのキーコードが送られている。この場合、編集処理部２３０は編集キーが押されたものと判定する（ステップＳ７００）。 Upon receiving the key code sent by the system control unit 21, the editing processing unit 230 of the editing unit 23 determines whether or not an editing key has been pressed based on the key code (step S700). Here, the edit key refers to a BackSpace key, a Delete key, a character key, a numeric key, and the like. Here, the key code of the BackSpace key is sent by the system control unit 21. In this case, the edit processing unit 230 determines that the edit key has been pressed (step S700).

すると編集部２３の検出部２３２は、現在のカーソル位置を取得して、当該カーソル位置に存在する文中の文字がユーザによって選択されたことを検出する（ステップＳ７１０）。検出部２３２は、カーソル位置に存在する文に関連付けられている単語リストを取得する（ステップＳ７２０）。ここでは、原文単語リストバッファ３４４に格納されている原文単語リスト及び訳文単語リストバッファ３４５に格納されている訳文単語リストのうち、訳文単語リスト（図１３参照）が取得される。 Then, the detection unit 232 of the editing unit 23 acquires the current cursor position, and detects that the character in the sentence existing at the cursor position has been selected by the user (step S710). The detecting unit 232 acquires a word list associated with the sentence existing at the cursor position (step S720). Here, the translated word list (see FIG. 13) is acquired from the original word list stored in the original word list buffer 344 and the translated word list stored in the translated word list buffer 345.

次に検出部２３２は、単語番号Ｎを初期値１に設定し（ステップＳ７２１）、ステップＳ７２０で取得された単語リスト中の単語番号Ｎの単語を参照する（ステップＳ７３０）。検出部２３２は、単語リストの先頭から単語番号Ｎの単語までの文字列の長さ（文字列長）ＬＮを取得する（ステップＳ７４０）。Ｎ＝１の場合、単語番号１の単語、つまり先頭の単語の文字列長が取得される。 Next, the detection unit 232 sets the word number N to an initial value 1 (step S721), and refers to the word with the word number N in the word list acquired in step S720 (step S730). The detection unit 232 acquires the length (character string length) LN of the character string from the beginning of the word list to the word with the word number N (step S740). When N = 1, the character string length of the word with the word number 1, that is, the first word is acquired.

次に検出部２３２は、単語リストの先頭から現在のカーソル位置までの文字列長ＬＣが、当該単語リストの先頭から単語番号Ｎの単語までの文字列長ＬＮよりも長いか否かを判定する（ステップＳ７５０）。もし、ＬＣ＞ＬＮであり（ステップＳ７５０）、且つ単語番号Ｎの単語が単語リストの最後の単語でないならば（ステップＳ７８０）、検出部２３２は単語番号Ｎを１インクリメントする（ステップＳ７６１）。そして検出部２３２は、インクリメント後の単語番号Ｎの単語を参照する（ステップＳ７３０）。検出部２３２は、単語リストの先頭からインクリメント後の単語番号Ｎの単語までの文字列長ＬＮを取得する（ステップＳ７４０）。この文字列長ＬＮは、前回のステップＳ７４０で取得されている単語リストの先頭から単語番号Ｎ−１の単語までの文字列長に、単語番号Ｎの単語の文字列長を加算することにより取得される。つまり検出部２３２は、単語リストの先頭の単語（単語番号１の単語）から単語番号Ｎの単語までの各単語の文字列長を順次加算することにより、文字列長ＬＮを取得する。 Next, the detection unit 232 determines whether or not the character string length LC from the top of the word list to the current cursor position is longer than the character string length LN from the top of the word list to the word with the word number N. (Step S750). If LC> LN (step S750) and the word number N is not the last word in the word list (step S780), the detection unit 232 increments the word number N by 1 (step S761). Then, the detecting unit 232 refers to the word with the incremented word number N (step S730). The detecting unit 232 acquires the character string length LN from the top of the word list to the word with the incremented word number N (step S740). This character string length LN is acquired by adding the character string length of the word with the word number N to the character string length from the top of the word list acquired at the previous step S740 to the word with the word number N-1. Is done. That is, the detection unit 232 acquires the character string length LN by sequentially adding the character string lengths of the words from the first word (word with word number 1) to the word with word number N in the word list.

やがて、ステップＳ７４０で取得された文字列長ＬＮが文字列長ＬＣ以上となったものとする（ステップＳ７５０）。つまり、単語リストの先頭から単語番号Ｎの単語までの文字列長ＬＮが単語リストの先頭からカーソル位置までの文字列長ＬＣ以上となったものとする。この場合、検出部２３２は現在の単語番号Ｎの単語を、現在カーソルが位置する単語として特定する。 Eventually, it is assumed that the character string length LN acquired in step S740 becomes equal to or longer than the character string length LC (step S750). That is, it is assumed that the character string length LN from the beginning of the word list to the word with the word number N is equal to or longer than the character string length LC from the beginning of the word list to the cursor position. In this case, the detection unit 232 specifies the word with the current word number N as the word where the current cursor is located.

すると編集部２３のタグ判定部２３３は、特定されたカーソル位置の単語、つまり単語番号Ｎの単語のデータ種別がタグであるか否かを判定する（ステップＳ７７０）。カーソルが“＜Ａｈｒｅｆ＝”ｈｔｔｐ：／／ｗｗｗ．ａａａ．ｃｏｍ＞”内に存在する本変形例では、カーソル位置の単語はタグと判定される。このように、カーソル位置の単語がタグと判定された場合（ステップＳ７７０）、禁止部２３４は編集処理部２３０に対して、システム制御部２１から送られたＢａｃｋＳｐａｃｅキー（編集キー）のキーコードを無視するように制御する。これにより、訳文に付されているタグに対するユーザによる編集操作が禁止される。原文に含まれているタグに対するユーザによる編集操作についても、同様に禁止される。 Then, the tag determination unit 233 of the editing unit 23 determines whether the data type of the word at the specified cursor position, that is, the word with the word number N is a tag (step S770). The cursor is “<A href =” http: // www. aaa. In the present modification example that exists in “com>”, the word at the cursor position is determined as a tag. In this way, when the word at the cursor position is determined as a tag (step S770), the prohibition unit 234 is an editing processing unit. 230 is controlled so as to ignore the key code of the BackSpace key (edit key) sent from the system control unit 21. As a result, the editing operation by the user on the tag attached to the translation is prohibited. Similarly, user editing operations on tags included in the original text are also prohibited.

さて、カーソルで位置指定された原文または訳文中の文字（ここでは訳文中の文字）を含む単語がタグと判定された場合、システム制御部２１は、当該判定されたタグ（単語）が選択されたとして、当該判定（選択）されたタグと原文単語リストまたは訳文単語リスト中の対応関係情報によって関連付けられている訳文または原文中のタグが、当該判定されたタグと対応関係があるとユーザが視認可能なように、両方のタグを特別の表示属性で表示する。 When the word including the original text or the text in the translated text (here, the text in the translated text) determined by the cursor is determined as a tag, the system control unit 21 selects the determined tag (word). For example, if the determined (selected) tag is associated with the tag in the original sentence list or the translated word list by the correspondence information in the original sentence list or the translated sentence word list, the user has a corresponding relationship with the determined tag. Both tags are displayed with special display attributes so that they can be seen.

一方、カーソル位置の単語、つまり単語番号Ｎの単語がテキストであると判定された場合には（ステップＳ７７０）、編集処理部２３０は単語番号Ｎの単語内の文字に対してシステム制御部２１から送られたキーコードに固有の編集を行う（ステップＳ７８０）。例えば、ＢａｃｋＳｐａｃｅキーのキーコードが送られた場合であれば、単語番号Ｎの単語内の（編集キー）カーソル位置の直前の文字が削除され、単語リスト（ここでは訳文単語リスト）が更新される。 On the other hand, when it is determined that the word at the cursor position, that is, the word with the word number N is text (step S770), the editing processing unit 230 applies the character in the word with the word number N from the system control unit 21. Editing specific to the sent key code is performed (step S780). For example, if the key code of the BackSpace key is sent, the character immediately before the cursor position in the word of the word number N (edit key) is deleted, and the word list (here, the translated word list) is updated. .

一方、システム制御部２１によって送られたキーコードが編集キーのキーコードではなくて（ステップＳ７００）、カーソル移動キーのキーコードであるならば（ステップＳ７９０）、編集処理部２３０は当該カーソル移動キーのキーコードに基づいてカーソルを移動する（ステップＳ８００）。 On the other hand, if the key code sent by the system control unit 21 is not the key code of the editing key (step S700) but the key code of the cursor movement key (step S790), the editing processing unit 230 will execute the cursor movement key. The cursor is moved based on the key code (step S800).

なお、本発明は、上記実施形態またはその変形例そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、上記実施形態またはその変形例に開示されている複数の構成要素の適宜な組み合せにより種々の発明を形成できる。例えば、実施形態またはその変形例に示される全構成要素から幾つかの構成要素を削除してもよい。 In addition, this invention is not limited to the said embodiment or its modification example as it is, A component can be deform | transformed and embodied in the range which does not deviate from the summary in an implementation stage. In addition, various inventions can be formed by appropriately combining a plurality of constituent elements disclosed in the embodiment or the modification thereof. For example, you may delete a some component from all the components shown by embodiment or its modification.

本発明の一実施形態に係る機械翻訳編集装置のハードウェア構成を示すブロック図。The block diagram which shows the hardware constitutions of the machine translation editing apparatus which concerns on one Embodiment of this invention. 図１の機械翻訳編集装置の機能構成を示すブロック図。The block diagram which shows the function structure of the machine translation editing apparatus of FIG. 図２中の翻訳部２２の構成を示すブロック図。The block diagram which shows the structure of the translation part 22 in FIG. 図２中の編集部２３の構成を示すブロック図。The block diagram which shows the structure of the edit part 23 in FIG. 同実施形態における翻訳処理の手順を示すフローチャート。The flowchart which shows the procedure of the translation process in the embodiment. 同実施形態における翻訳処理の手順を示すフローチャート。The flowchart which shows the procedure of the translation process in the embodiment. 同実施形態における表示処理の手順を示すフローチャート。The flowchart which shows the procedure of the display process in the embodiment. 同実施形態において翻訳されるべき原文データの例を示す図。The figure which shows the example of the original text data which should be translated in the same embodiment. 同実施形態の翻訳処理において単語リストバッファ３４２の内容が変化する様子を示す図。The figure which shows a mode that the content of the word list buffer 342 changes in the translation process of the embodiment. 同実施形態においてタグの種別をユーザに指定させるための表示画面例を示す。The example of a display screen for making a user specify the classification of a tag in the embodiment is shown. 同実施形態で適用されるタグ種別テーブル３３０のデータ構造例を示す図。The figure which shows the example of a data structure of the tag classification table 330 applied in the embodiment. 同実施形態の翻訳処理において文リストバッファ３４３の内容が変化する様子を示す図。The figure which shows a mode that the content of the sentence list buffer 343 changes in the translation process of the embodiment. 同実施形態の翻訳処理において生成される原文単語リストが原文単語リストバッファ３４４に格納されている状態を示す。The source word list generated in the translation process of the embodiment is stored in the source word list buffer 344. 同実施形態の翻訳処理において生成される訳文単語リストが訳文単語リストバッファ３４５に格納されている状態を示す。A translated word list generated in the translation process of the embodiment is stored in the translated word list buffer 345. 同実施形態の表示処理における原文及び訳文を含む表示画面例を示す図。The figure which shows the example of a display screen containing the original sentence and translation in the display process of the embodiment. 同実施形態の変形例の表示処理における原文及び訳文を含む表示画面例を示す図。The figure which shows the example of a display screen containing the original sentence and translation in the display process of the modification of the embodiment. 上記変形例における表示処理の手順を示すフローチャート。The flowchart which shows the procedure of the display process in the said modification. 上記変形例における編集処理の手順を示すフローチャート。The flowchart which shows the procedure of the edit process in the said modification.

Explanation of symbols

１１…パーソナルコンピュータ、１２…外部記憶装置、２０…機械翻訳編集部、２１…システム制御部、２２…翻訳部、２３…編集部、３１…入力部、３２…表示部、３３…翻訳辞書部、３４…バッファ部、１１０…主メモリ、１２０…機械翻訳編集プログラム、２２０…翻訳文生成部、２２１…分離部、２２２…タグ種別判定部、２２３…単語リスト生成部、２２４…文リスト生成部、２２５…原文／訳文単語リスト生成部、２２６…タグ種別指定部、２３０…タグ表示指定部、２３１…タグ表示指定部、２３２…検出部、２３３…タグ判定部、２３４…禁止部、３４１…原文バッファ、３４２…単語リストバッファ、３４３…文リストバッファ、３４４…原文単語リストバッファ、３４５…訳文単語リストバッファ。 DESCRIPTION OF SYMBOLS 11 ... Personal computer, 12 ... External storage device, 20 ... Machine translation editing part, 21 ... System control part, 22 ... Translation part, 23 ... Editing part, 31 ... Input part, 32 ... Display part, 33 ... Translation dictionary part, 34 ... Buffer unit, 110 ... Main memory, 120 ... Machine translation editing program, 220 ... Translation sentence generation unit, 221 ... Separation unit, 222 ... Tag type determination unit, 223 ... Word list generation unit, 224 ... Sentence list generation unit, 225 ... Original / translated word list generation unit, 226 ... Tag type designation unit, 230 ... Tag display designation unit, 231 ... Tag display designation unit, 232 ... Detection unit, 233 ... Tag determination unit, 234 ... Inhibition unit, 341 ... Original text Buffer, 342 ... Word list buffer, 343 ... Sentence list buffer, 344 ... Original word list buffer, 345 ... Translated word list buffer.

Claims

When a text source text in a first language including a markup symbol is translated into a second language to generate a translation, it is added to the translation word corresponding to the word to which the markup symbol is added in the text source text. In the machine translation editing device that adds the markup symbols that have been
Separating means for separating the storage means and the original text into a word as a markup symbol and a word as a text part not including the markup symbol;
When the word separated by the separating means is a markup symbol, the markup symbol indicates whether the markup symbol is an in-sentence markup symbol used in a sentence or an out-of-sentence markup symbol used outside a sentence. Markup symbol type determination means for determining a symbol type;
Each of the words separated by the separating means indicates whether the word is a markup symbol or a text part, and in the case of a markup symbol, the mark determined by the markup symbol type determining means Association means for sequentially storing in the storage means in association with type information indicating an up symbol type;
When generating the translation of the original text, the markup symbol reflected in the translation is limited to the in-sentence markup symbol, and is the extrasentence markup symbol of the group of words stored in the storage means All of the original word list including a group of words excluding the word associated with the type information indicating that, and all of the group information associated with the type information indicating the text portion of the group of words stored in the storage unit A translated word list including a translation of the word and a markup symbol added to the translation of the word to which the markup symbol associated with the type information indicating that it is the in-sentence markup symbol of the translation is added Source / translation word list generation means for generating and storing in the storage means;
A machine translation editing apparatus comprising: display control means for displaying an original sentence and a translated sentence in an editable manner based on the original sentence word list and the translated word list, respectively.

A markup symbol type table in which type information indicating a markup symbol type is registered in association with each of a plurality of different markup symbols;
The markup symbol type determination means, when the word separated by the separation means is a markup symbol, based on the type information registered in the markup symbol type table in association with the markup symbol. The machine translation editing apparatus according to claim 1, wherein a markup symbol type of the up symbol is determined.

A markup symbol type specifying means for allowing a user to specify a markup symbol type of a markup symbol included in the original text,
The machine translation editing apparatus according to claim 1, wherein the markup symbol type determining unit determines the markup symbol type based on a user designation according to the markup symbol type specifying unit.

The markup symbol type designation means displays a list of markup symbols included in the text source text, and allows the user to specify the markup symbol type of each markup symbol in the list, thereby allowing the text source text to be specified. Generate a markup symbol type table in which type information indicating the markup symbol type is registered in association with each of the included markup symbols,
The markup symbol type determining means, when the word separated by the separating means is a markup symbol, based on the type information registered in the markup symbol type table in association with the markup symbol, The machine translation editing apparatus according to claim 3, wherein a markup symbol type of the markup symbol is determined.

The original / translated word list generating means sets data type information indicating whether the word is a markup symbol or a text part for each word in the original word list, and the translated word in the translated word list And for each markup symbol, data type information indicating that it is a text part and data type information indicating that it is a markup symbol are set, and the display control means includes the source word list and the translated word list. 2. The machine translation editing apparatus according to claim 1, wherein the markup symbol and other than the markup symbol are displayed with different attributes based on the included data type information.

Markup symbol display designating means for allowing the user to designate whether or not to display the markup symbols included in the original word list and the translated word list,
When it is specified that the markup symbol is not displayed, the display control means, based on the data type information included in the original word list and the translated word list, only the text part as the original sentence and the translated sentence. The machine translation editing apparatus according to claim 5, wherein the machine translation editing apparatus is displayed.

Detecting means for detecting that an original character or an arbitrary character in the translated text is selected by a user operation while the original text and the translated text are displayed by the display control means;
Markup symbol determination means for determining whether a word including the character detected by the detection means is a markup symbol;
2. The machine translation according to claim 1, further comprising: a prohibiting unit that prohibits editing of a character in the markup symbol when a word including the character detected by the detecting unit is a markup symbol. Editing device.

Detecting means for detecting that an original character or an arbitrary character in the translated text is selected by a user operation while the original text and the translated text are displayed by the display control means;
Markup symbol determination means for determining whether a word including the character detected by the detection means is a markup symbol;
The original sentence / translation word list generating means sets correspondence information for associating each word in the original word list with a translation or a markup symbol in the corresponding translation word list corresponding to the word, and in the original word list For each translated word and markup symbol, set correspondence information for associating the translated word and markup symbol with the corresponding word in the original word list,
When it is determined that a word including an arbitrary character in the original sentence or the translated sentence is the markup symbol, the display control means determines the determined markup symbol and the original word list or the translated word. The markup symbol in the translation or the original sentence associated with the correspondence information in the list is displayed so as to be visible when there is a correspondence with the determined markup symbol. The machine translation editing apparatus according to claim 1.

When a text source text in a first language including a markup symbol is translated into a second language to generate a translation, it is added to the translation word corresponding to the word to which the markup symbol is added in the text source text. A machine translation editing program for adding a markup symbol,
In a computer comprising input means, storage means and display means ,
Storing the original text input by the input means in the storage means;
Separating the original text stored in the storage means into a word as a markup symbol and a word as a text part not including the markup symbol in order from the top;
When the separated word is a markup symbol, a markup symbol type indicating whether the markup symbol is an in-sentence markup symbol used in a sentence or an out-of-sentence markup symbol used outside a sentence A determining step;
For each word separated from the original text stored in the storage means, the word indicates whether the word is a markup symbol or a text part, and in the case of a markup symbol, the determination and storing in the storage means in association with the type information indicating the mark-up symbols type is,
When generating the translation of the original text stored in the storage means, the markup symbols reflected in the translation are limited to the in-sentence markup symbols, and the group of words stored in the storage means A source word list including a group of words excluding a word associated with type information indicating an extra-sentence markup symbol, and a text part of the group of words stored in the storage means A translation of all words associated with the type information, and a markup symbol added to the translation of the word to which the markup symbol associated with the type information indicating the markup symbol in the sentence is added. Generating a translated word list including and storing the translated word list in the storage means;
Machine translation editing program for executing a step of causing the display by the display means editable original and translation, respectively, based on the original word list and the translation word list stored in the storage means.

The computer further includes a storage device that stores a markup symbol type table in which type information indicating a markup symbol type is associated with each of a plurality of different markup symbols.
In the determining step , the markup symbol type of the separated markup symbol is determined based on the type information registered in the markup symbol type table in association with the separated markup symbol. The machine translation editing program according to claim 9.

A markup symbol is added to the original text when the text source of the first language including the markup symbol is translated into the second language by the computer having the input means, the storage means, and the display means to generate the translation. A machine translation editing method for adding a markup symbol added to a word to a translation corresponding to a word,
The computer storing the original text input by the input means in the storage means;
A step wherein the computer, to separate the text original text stored in said storage means from the top and words as words and text parts without markup symbol as a mark-up symbols in order,
If the separated word is a markup symbol, the computer marks whether the markup symbol is an in-sentence markup symbol used in a sentence or an out-of-sentence markup symbol Determining the up symbol type;
Each time a word is separated from the original text stored in the storage means, the computer indicates whether the word is a markup symbol or a text part, and in the case of a markup symbol, the computer and storing in the storage means in association with the type information indicating the determination markup symbols type,
When generating the translation of the original text stored in the storage means, the computer restricts the markup symbols reflected in the translation to in-sentence markup symbols, and the words stored in the storage means A source word list including a group of words excluding a word associated with the type information indicating the extra-sentence markup symbol in the group, and the text portion of the group of words stored in the storage unit A translation of all the words associated with the type information indicating that it is present and a translation of the word to which the markup symbol associated with the type information indicating that it is a markup symbol in the sentence is added. Generating a translated word list including a markup symbol and storing it in the storage means;
The computer comprises the step of causing the display means to display the original sentence and the translated sentence based on the original word list and the translated word list stored in the storage means, respectively. Editing method.