JPH05334348A

JPH05334348A - Translation system

Info

Publication number: JPH05334348A
Application number: JP4139123A
Authority: JP
Inventors: Youji Fukumochi; 陽士福持; Hitoshi Suzuki; 等鈴木
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1992-05-29
Filing date: 1992-05-29
Publication date: 1993-12-17

Abstract

PURPOSE:To provide the translation system which can be improved in translation throughput by improving the efficiency of a process for editing an original sentence before its translation to assist the translating process. CONSTITUTION:On this translation system, whether or not the pre-editing which is performed before the translation is learnt in word units or word string units is optionally selected through a keyboard 4. A translation module 5 learns a word or word string containing a pre-editing symbol in the original sentence whose learning is specified when the translation is indicated, and the learnt pre-editing symbol is stored in a file; when translation is performed next, the past learning contents stored in the file are utilized to perform the translation.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は翻訳の際に利用者が予
め原文に前編集処理を施すことによって翻訳効率や精度
を高めることができるような翻訳装置に関し、特に、そ
の改良に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a translation apparatus capable of improving translation efficiency and accuracy by pre-editing an original sentence during translation by a user, and particularly to an improvement thereof. ..

【０００２】[0002]

【従来の技術および発明が解決しようとする課題】従
来、機械翻訳装置としては次のようなものがある。この
翻訳装置はキーボード、ＣＰＵ（中央処理装置）および
翻訳処理するための翻訳モジュールおよびＣＲＴ（陰極
線管）などを備える。キーボードから入力されたソース
言語からなる原文は、ＣＰＵの制御により翻訳モジュー
ルに与えられる。翻訳モジュールは辞書データ、文法規
則および木構造変換規則を備えており、与えられるソー
ス言語をまず辞書を用いて形態素解析し、その品詞が付
された単語の系列に解析した後、得られた単語の品詞か
ら辞書と文法規則とを用いて構文解析して、ターゲット
言語からなる翻訳文を得ていた。翻訳して得られたター
ゲット言語からなる翻訳文は、原文と対比してＣＲＴな
どに表示される。利用者は、この画面を見て正しい翻訳
文が得られておらず、翻訳モジュールにおいて原文中の
単語の品詞の認識が誤っていると判断すると、原文にキ
ーボードなどを介して正しい翻訳文が得られるように前
編集処理を加えて、再度翻訳処理を行なわせることがあ
る。この前編集処理は、原文中のある単語の品詞を直接
に指定するなどして翻訳モジュールによる翻訳を支援す
ることを目的として行なわれる。2. Description of the Related Art Conventionally, the following machine translation devices are available. This translation apparatus includes a keyboard, a CPU (central processing unit), a translation module for translation processing, a CRT (cathode ray tube), and the like. The original text in the source language input from the keyboard is given to the translation module under the control of the CPU. The translation module has dictionary data, grammatical rules, and tree structure conversion rules. The given source language is first morphologically analyzed using a dictionary and then analyzed into a series of words with parts of speech A syntactic analysis was performed from the part-of-speech of the target language using a dictionary and grammatical rules to obtain a translated sentence in the target language. The translated sentence in the target language obtained by the translation is displayed on the CRT or the like in contrast to the original sentence. If the user does not get the correct translated sentence by looking at this screen and determines that the part of speech of the word in the original sentence is incorrect in the translation module, the correct translated sentence is obtained from the original sentence via the keyboard. As described above, the pre-editing process may be added and the translation process may be performed again. This pre-editing process is performed for the purpose of supporting the translation by the translation module by directly specifying the part of speech of a word in the original sentence.

【０００３】前編集処理の例について次に説明する。た
とえば、 “The invention uses the said components for the d
evice.” という原文が入力されたとき、従来の翻訳装置において
は、「発明は、装置のために言われた構成要素を使う。」というように、原文中の単語“said”を動詞と解釈して
翻訳する。これを修正するために、たとえば “The invention uses the a said components for th
e device. ” というように、原文に前編集処理を施す、即ち単語“sa
id”に前編集記号“ａ ”を付して、該単語の品詞を形容
詞（「前記の」）と翻訳時に指定することが可能であ
る。特許分野の文章においては、単語“said”が使用さ
れる頻度は高く、「動詞」（「言われた」）ではなく
「形容詞」（「前記の」）として使用される可能性もま
た高い。この類として、たとえば特許分野の文章におけ
る単語“means ”は、「動詞」（「意味する」）ではな
く「名詞」（「手段」）として使用される頻度が高い。
また、金融関係の文章においては単語“close ”は「動
詞」（「閉じる」）ではなく、「名詞」（「終値」）と
して使用される頻度が高い。An example of the pre-editing process will be described below. Was
For example, “The invention uses the said components for the d
When the original text "evice." is entered, the conventional translation device
Interprets the word "said" in the original sentence as a verb, such as "the invention uses the components said for the device."
translate. To fix this, for example, “The invention uses the a said components for th
e device. ”, the source text is preedited, that is, the word“ sa
Pre-edit symbol "a" in "id" ", To describe the part of speech of the word
It is possible to specify the words (“sai”) and translation
It The word "said" is used in patent texts.
Frequently, and not "verbs" ("said")
May also be used as an "adjective" ("above")
High For example, in the text of the patent field,
The word "means" is not a "verb" ("means").
Often used as a "noun" ("means").
In addition, the word "close" in a financial text is "moving".
"Noun" ("close price"), not "word" ("close")
Often used.

【０００４】上述したように特定の分野における文章に
関しては、繰り返し前編集が行なわれる可能性のある単
語はかなり特定されるという傾向がある。このため、従
来の翻訳装置においては、正しい翻訳文を得るために上
述のような原文中の特定単語の品詞を指定するような前
編集処理を行なう場合には、その前編集を繰り返し行な
う必要があるという点で翻訳作業の効率が悪いという問
題があった。As described above, with respect to sentences in a specific field, there is a tendency that a word for which repeated preediting may be performed is considerably specified. Therefore, in the conventional translation device, when performing the pre-editing process such as specifying the part of speech of the specific word in the original sentence as described above in order to obtain a correct translated sentence, it is necessary to repeat the pre-editing. There was a problem that the translation work was inefficient in that there was.

【０００５】また、前編集処理には前述の品詞指定のほ
かに、原文中の単語間の係り受けを指定する、原文中の
単語の翻訳時のまとまりを指定する、単語／句を削除す
るおよび単語／句を追加するなどの処理がある。このよ
うな前編集を繰り返し行なうために引き起こされる翻訳
作業効率の低下を防止するために、原文中の前編集処理
が所望される文字列の検索／置換を一括して行なう方法
が考えられる。このようにすれば同じ前編集処理を繰返
す必要はなくなるので、翻訳作業の効率が向上するが、
一括の文字列の検索／置換では、極端な場合、原文が大
幅に書換えられてしまい、もともとの原文がどのような
内容であったか知ることができない場合がある。たとえ
ば、原文中の単語／句を削除するという処理を一括して
行なった場合、原文が大幅に書換えられてしまい、原文
／翻訳文を対比しながら表示しても、もともとの原文が
どういう内容であったか知ることが不可能になり、その
翻訳文が正しいのかどうか判定が困難であるという問題
があった。Further, in the pre-editing process, in addition to the above-mentioned part-of-speech designation, the dependency between the words in the original sentence is designated, the group at the time of translation of the words in the original sentence is designated, the word / phrase is deleted, and There are processes such as adding words / phrases. In order to prevent a decrease in translation work efficiency caused by repeatedly performing such pre-editing, a method of collectively performing search / replacement of a character string in the original sentence for which pre-editing processing is desired can be considered. By doing this, it is not necessary to repeat the same pre-editing process, which improves the efficiency of translation work.
In an extreme case, the search / replacement of a batch of character strings may significantly rewrite the original text, and it may not be possible to know what the original text was like. For example, if the process of deleting words / phrases in the original sentence is performed collectively, the original sentence will be rewritten to a large extent. There was a problem that it was impossible to know if there was any, and it was difficult to judge whether or not the translation was correct.

【０００６】また、上述した一括して同じ前編集処理を
原文に施すような場合、前編集処理の効果を対話的に確
認しながら行なうことができないという問題もあった。Further, in the case where the same pre-editing process is collectively applied to the original sentence as described above, there is a problem that the effect of the pre-editing process cannot be interactively confirmed.

【０００７】それゆえにこの発明の目的は、翻訳処理に
先立って、翻訳処理を支援するための前編集記号を用い
て原文を編集する際に、その前編集処理の効率を向上さ
せることにより、翻訳処理自体の効率を向上させること
が可能な、翻訳装置を提供することである。Therefore, an object of the present invention is to improve the efficiency of the pre-editing process when the original sentence is edited using the pre-editing symbols for supporting the translation process before the translation process. An object of the present invention is to provide a translation device capable of improving the efficiency of processing itself.

【０００８】[0008]

【課題を解決するための手段】この発明に係る翻訳装置
は、与えられる原文を辞書データを含む所定の規則に基
づいて翻訳処理し、原文に対する翻訳文を出力する装置
であり、学習指定手段と、学習手段と、編集手段とさら
に制御手段とを備えて構成される。A translation device according to the present invention is a device for translating a given original sentence based on a predetermined rule including dictionary data, and outputting a translated sentence for the original sentence. , Learning means, editing means, and control means.

【０００９】学習指定手段は翻訳処理に先立って、原文
に含まれ、かつ翻訳処理を支援するための前編集記号を
それが付される文字列ごとに学習するか否か指定するよ
う構成される。Prior to the translation process, the learning designating means is configured to designate whether or not to learn, for each character string to which the preedit symbol included in the original sentence and supporting the translation process is attached. ..

【００１０】学習手段は、学習指定手段により指定され
た前編集記号をそれが付される文字列と関連付けて学習
するよう構成される。The learning means is configured to learn by associating the preedit symbol designated by the learning designating means with the character string to which it is attached.

【００１１】編集手段は、原文中の文字列のうち、学習
手段により学習された学習文字列に該当する文字列に、
該学習文字列に関連の前編集記号を付して編集するよう
構成される。The editing means selects, from among the character strings in the original sentence, a character string corresponding to the learned character string learned by the learning means.
The learning character string is configured to be edited with an associated preedit symbol.

【００１２】制御手段は、翻訳処理時、編集手段により
編集された原文を入力し、前編集記号に対応して予め規
定された翻訳処理支援のための情報および前述の規則を
適用して翻訳処理を実行するよう構成される。The control means inputs the original text edited by the editing means at the time of translation processing, applies the information for translation processing support defined in advance corresponding to the preedit symbol and the above-mentioned rule, and performs the translation processing. Is configured to perform.

【００１３】また、前述の学習手段は、予め複数のファ
イルを有し、選択されたファイルに、学習指定手段によ
り指定された前編集記号をそれが付される学習文字列と
関連付けて記憶するファイル記憶状態をさらに有し、前
編集手段は、原文中の文字列のうち、前述の選択された
ファイルに記憶される学習文字列に該当する文字列に、
該学習文字列に関連の前編集記号を付して編集する第１
編集手段をさらに備えて構成されてもよい。Further, the above-mentioned learning means has a plurality of files in advance, and stores the pre-edit symbol designated by the learning designating means in the selected file in association with the learning character string to which it is attached. The pre-editing means further has a storage state, and among the character strings in the original sentence, a character string corresponding to the learned character string stored in the selected file,
Editing by adding a related preedit symbol to the learning character string
It may be configured to further include an editing unit.

【００１４】前述の複数ファイルは、原文の記述内容が
属する分野ごと、または該翻訳装置の利用者ごとに準備
され、ファイルの選択は、翻訳すべき原文の記述内容が
属する分野または該翻訳装置の利用者に応じて選択され
てもよい。The above-mentioned plurality of files are prepared for each field to which the description content of the original sentence belongs, or for each user of the translation device, and the selection of files is performed by the field to which the description content of the original sentence to be translated belongs or the translation device of the translation device. It may be selected according to the user.

【００１５】さらに、学習手段による学習内容、または
ファイル記憶手段により選択されたファイルに記憶され
た内容は、任意に変更することが可能なように構成され
てもよい。Further, the learning content by the learning means or the content stored in the file selected by the file storage means may be configured to be arbitrarily changeable.

【００１６】[0016]

【作用】この発明は上述のように構成されて、学習指定
手段を介して原文中の前編集記号を学習することを指定
するだけで、学習手段によりこの前編集記号がそれが付
される原文中の文字列と関連付けて学習され、さらに編
集手段は学習手段により学習された内容に従って原文中
の学習文字列に該当する文字列に学習された関連の前編
集記号を付して該原文を編集し直すので、原文中に学習
文字列に該当の文字列が複数箇所あっても、繰り返し同
じ前編集記号を付して編集するという手間が省ける。The present invention is configured as described above, and the learning means specifies the learning of the preedit symbol in the original sentence, and the learning means attaches the preedit symbol to the original sentence. The editing means edits the original text by adding the relevant pre-editing symbol to the learned text string in the original text according to the content learned by the learning means Since it is redone, even if there are a plurality of relevant character strings in the learning character string in the original sentence, it is possible to save the trouble of repeatedly adding the same pre-editing symbol to edit.

【００１７】また、学習手段は複数のファイルを有した
ファイル記憶手段を有して、複数ファイルの中から選択
されたファイルに、学習指定手段により指定された前編
集記号を、それが付される文字列と関連付けて記憶する
ので、学習指定された前編集記号とそれに対する文字列
とがファイルごとに蓄積されて記憶されるとともに、フ
ァイルの選択は原文の記述内容が属する分野ごと、また
は該翻訳装置の利用者ごとに行なわれるので、各ファイ
ルにおいて前述の分野特有、または利用者固有の翻訳処
理を支援するための情報が蓄積される。Further, the learning means has a file storage means having a plurality of files, and a pre-editing symbol designated by the learning designating means is attached to a file selected from the plurality of files. Since it is stored in association with the character string, the pre-editing symbol designated for learning and the character string corresponding thereto are accumulated and stored for each file, and the file is selected for each field to which the description content of the original text belongs, or the translation. Since it is performed for each user of the apparatus, information for supporting the above-mentioned field-specific or user-specific translation processing is stored in each file.

【００１８】さらに学習手段により学習された内容、ま
たはファイル記憶手段によりファイルに記憶された内容
は、学習指定手段および学習手段を介さなくとも容易に
変更されて、編集手段における編集内容に多様性をもた
らすことが容易に可能となる。Further, the content learned by the learning means or the content stored in the file by the file storage means can be easily changed without the intervention of the learning designating means and the learning means, and the editing content in the editing means can be varied. It is possible to bring easily.

【００１９】[0019]

【実施例】以下、この発明の実施例について図面を参照
して詳細に説明する。Embodiments of the present invention will be described below in detail with reference to the drawings.

【００２０】図１（ａ）および（ｂ）は、本発明の実施
例による翻訳装置の構成図である。1 (a) and 1 (b) are block diagrams of a translation apparatus according to an embodiment of the present invention.

【００２１】図１（ａ）において翻訳装置は、該装置を
集中的に管理および制御するためのＣＰＵ（中央処理装
置）１、翻訳処理に必要とされるプログラムを含むデー
タを格納するためのメインメモリ２、ＣＲＴまたはＬＣ
Ｄ（液晶ディスプレイ）を含む表示部３、外部からのデ
ータ入力のためのキーボード４、与えられるソース言語
をターゲット言語に翻訳する翻訳モジュール５、翻訳モ
ジュール５によりそこにストアされるデータがアクセス
させるメモリ６、および翻訳結果などが印字出力される
ためのプリンタ７を含む。これらの各部は、バスを介し
て相互に通信接続される。In FIG. 1A, a translation device is a CPU (central processing unit) 1 for centrally managing and controlling the device, and a main unit for storing data including programs required for translation processing. Memory 2, CRT or LC
A display unit 3 including a D (liquid crystal display), a keyboard 4 for inputting data from the outside, a translation module 5 for translating a given source language into a target language, and a memory accessed by the data stored therein by the translation module 5. 6 and a printer 7 for printing out translation results and the like. Each of these units is communicatively connected to each other via a bus.

【００２２】メモリ６は領域Ｍ１〜Ｍ４を含む。領域Ｍ
１は、翻訳処理時の作業領域に用いられる。領域Ｍ２に
は辞書Ｄ１、領域Ｍ３には文法規則Ｄ２、領域Ｍ４には
木構造変換規則Ｄ３がそれぞれ予めストアされている。The memory 6 includes areas M1 to M4. Area M
1 is used as a work area during translation processing. A dictionary D1 is stored in the area M2, a grammar rule D2 is stored in the area M3, and a tree structure conversion rule D3 is stored in the area M4.

【００２３】翻訳モジュール５の構成が図１（ｂ）に示
される。翻訳モジュール５は辞書引き形態素解析部１
１、構文解析部１２、変換部１３および翻訳文生成部１
４を含む。翻訳モジュール５は与えられるソース言語か
らなる原文を辞書引き形態素解析部１１において辞書Ｄ
１を参照しながら形態素解析処理して、形態素の系列に
分割する。形態素とは原文を構成する最小の言語単位で
あり、単語、接頭語および接尾語からなる。解析部１１
において得られた形態素列に含まれる各単語のそれぞれ
については品詞などの文法情報および訳語などが得ら
れ、さらに時制／人称／数なども解析される。構文解析
部１２では前段の解析部１１において得られた形態素列
を入力し、メモリ６の辞書Ｄ１と文法規則Ｄ２とに従っ
て、各単語間の係り受けの関係を示す構造解析木が決定
される。変換部１３では、前段の解析部１２で得られた
構造解析木について、木構造変換規則Ｄ３を用いて原文
に対する構造解析木が翻訳文に対する構造解析木に変換
される。そして翻訳文生成部１４においてターゲット言
語による翻訳文として適切な助詞や助動詞が付加され
て、得られた翻訳文は翻訳モジュール５から出力され
る。モジュール５から出力された翻訳文は、原文と対に
なってプリンタ７または表示部３に出力される。The structure of the translation module 5 is shown in FIG. The translation module 5 is a dictionary lookup morpheme analysis unit 1
1, syntax analysis unit 12, conversion unit 13 and translation sentence generation unit 1
Including 4. The translation module 5 uses the given source language in the dictionary lookup morpheme analysis unit 11 to extract the original sentence in the dictionary D.
A morpheme analysis process is performed with reference to 1 to divide into a series of morphemes. A morpheme is the smallest linguistic unit that constitutes the original sentence, and consists of words, prefixes, and suffixes. Analysis unit 11
For each of the words included in the morpheme string obtained in (3), grammatical information such as a part of speech and translated words are obtained, and the tense / person / number is also analyzed. The syntactic analysis unit 12 inputs the morpheme string obtained by the analysis unit 11 of the previous stage, and determines a structural analysis tree indicating the dependency relation between each word according to the dictionary D1 and the grammatical rule D2 of the memory 6. The converting unit 13 converts the structure analysis tree for the original sentence into the structure analysis tree for the translated sentence using the tree structure conversion rule D3 for the structure analysis tree obtained by the analysis unit 12 in the previous stage. Then, the translated sentence generation unit 14 adds appropriate particles and auxiliary verbs as translated sentences in the target language, and the obtained translated sentences are output from the translation module 5. The translated sentence output from the module 5 is output to the printer 7 or the display unit 3 in pairs with the original sentence.

【００２４】図２は、図１（ｂ）の辞書引き形態素解析
部１１における処理のフロー図である。図３は、本発明
の一実施例において使用される前編集記号の一例を示す
図である。FIG. 2 is a flow chart of processing in the dictionary lookup morpheme analysis unit 11 of FIG. 1 (b). FIG. 3 is a diagram showing an example of a pre-edit symbol used in an embodiment of the present invention.

【００２５】図４は、本発明の一実施例による翻訳結果
の表示画面の一例を示す図である。FIG. 4 is a diagram showing an example of a translation result display screen according to an embodiment of the present invention.

【００２６】図５は、図２の処理フロー実行時に参照さ
れるバッファＡの内容例を示す図である。FIG. 5 is a diagram showing an example of the contents of the buffer A referred to when the processing flow of FIG. 2 is executed.

【００２７】図６は、図２の処理フロー実行時に参照さ
れるバッファＢの内容例を示す図である。FIG. 6 is a diagram showing an example of contents of the buffer B referred to when the processing flow of FIG. 2 is executed.

【００２８】図７は、図２の処理フロー実行時に参照さ
れるバッファＣの内容例を示す図である。FIG. 7 is a diagram showing an example of the contents of the buffer C referred to when the processing flow of FIG. 2 is executed.

【００２９】図８は、図２の処理フロー実行時に参照さ
れるバッファＤの内容例を示す図である。FIG. 8 is a diagram showing an example of the contents of the buffer D referred to when the processing flow of FIG. 2 is executed.

【００３０】図９は、図２の処理フロー実行時に参照さ
れるバッファＥの内容例を示す図である。FIG. 9 is a diagram showing an example of the contents of the buffer E referred to when the processing flow of FIG. 2 is executed.

【００３１】図１０は、図２の処理フロー実行時に参照
されるバッファＦの内容例を示す図である。FIG. 10 is a diagram showing an example of the contents of the buffer F referred to when the processing flow of FIG. 2 is executed.

【００３２】図１１は、データ更新した後の図１０のバ
ッファＦの内容例を示す図である。FIG. 11 is a diagram showing an example of the contents of the buffer F of FIG. 10 after the data is updated.

【００３３】図１２（ａ）ないし（ｃ）は、本発明の一
実施例による前編集学習ファイルの内容例を示す図であ
る。FIGS. 12A to 12C are views showing examples of the contents of the preedit learning file according to the embodiment of the present invention.

【００３４】図３を参照して、本実施例で使用される前
編集記号とその記号の意味を説明する。The preedit symbol used in this embodiment and the meaning of the symbol will be described with reference to FIG.

【００３５】前編集記号ｇ［］は、原文中で記号ｇ
［と記号］で囲まれた文字列の前編集を学習すること
を意味する。この前編集記号の学習についての詳細は後
述する。前編集記号ａは、原文中で該記号が付された
単語の品詞を「形容詞」と指定することを意味する。前
編集記号ｎは、原文中で該記号が付された単語の品詞
を「名詞」と指定することを意味する。前編集記号ｖ
は、原文中で該記号が付された単語の品詞を「動詞」と
指定することを意味する。Pre-edit symbol g [] Is the symbol g in the original text
This means learning the pre-editing of the character string enclosed by [and symbols]. Details of learning the pre-edit symbol will be described later. Previous edit symbol a Means that the part of speech of the word to which the symbol is attached in the original sentence is designated as "adjective". Previous edit symbol n Means that the part of speech of the word to which the symbol is attached in the original sentence is designated as "noun". Previous edit symbol v
Means that the part of speech of the word to which the symbol is attached in the original sentence is designated as "verb".

【００３６】次に、図３〜図１２を参照して、図１の翻
訳装置における原文に付された前編集記号を学習する
処理、および過去に学習された前編集記号の情報を用
いた翻訳処理を順に説明する。Next, referring to FIGS. 3 to 12, the process of learning the pre-editing symbol attached to the original sentence in the translation apparatus of FIG. 1 and the translation using the information of the pre-editing symbol learned in the past. The processing will be described in order.

【００３７】原文に付された前編集記号を学習する処理以下の説明では、前編集記号の学習機能に関する説明で
あるので、後述する利用者が所望する前編集学習ファイ
ルが事前に選択されていると想定する。利用者が所望す
る前編集学習ファイルは、利用者の目的に応じて特許分
野、法律分野、経済分野、時事分野など予め個別に作成
されて準備されており、以降の翻訳処理では、利用者が
翻訳しようとする原文の分野に応じて学習ファイルを選
択することを想定する。Processing for Learning Pre-Editing Symbols Attached to Original Text In the following description, since a pre-editing symbol learning function is described, a pre-editing learning file desired by a user to be described later is selected in advance. Suppose. The pre-editing learning file desired by the user is prepared and prepared individually in advance in the patent field, legal field, economic field, current affairs field, etc. according to the purpose of the user. It is assumed that the learning file is selected according to the field of the source text to be translated.

【００３８】前述したように利用者が目的に応じて個別
に作成して準備する複数の学習ファイル、後述するバッ
ファＡ〜Ｆおよび翻訳文はメモリ６に格納されると想定
する。As described above, it is assumed that the memory 6 stores a plurality of learning files which the user individually creates and prepares according to the purpose, buffers A to F described later, and translated sentences.

【００３９】さらに、図１（ａ）の翻訳装置において、
翻訳が所望される原文は、利用者がキーボード４から１
センテンスごとに入力する場合もあれば、メモリ６に予
め格納しておき、ＣＰＵ１が必要に応じてメモリ６から
読出す場合もある。Further, in the translation device of FIG. 1 (a),
The original text that the user wants to translate is the keyboard 4 to 1
It may be input for each sentence, or it may be stored in the memory 6 in advance and read from the memory 6 by the CPU 1 as needed.

【００４０】図４は、図１の翻訳装置において得られた
翻訳結果の表示部３における画面表示の一例を示したも
のであるが、図示される英文（原文に相当する）を翻訳
処理して、図示される訳文（翻訳文に相当する）を得た
場合の処理について説明する。FIG. 4 shows an example of a screen display on the display unit 3 of the translation result obtained by the translation apparatus of FIG. 1. The English sentence (corresponding to the original sentence) shown in the figure is translated. A process for obtaining the illustrated translated text (corresponding to the translated text) will be described.

【００４１】まず、図４のＮｏ１の英文“The inventio
n uses the said components for the device.”が、た
とえばキーボード４から入力されると、ＣＰＵ１の制御
により翻訳モジュール５に与えられる。翻訳モジュール
５はＮｏ１の英文を翻訳処理し、図４のＮｏ５の訳文が
得られる。この翻訳結果では、単語「said」の品詞が本
来、形容詞（「前記の」）であるべきところが、動詞
（「言われた」）と解釈されたために、誤訳が生じてい
る。これに対して、図４のＮｏ２の英文“The inventio
n uses g ［ the a said］ components for the devi
ce. ”のように、予め原文に前編集が施されて翻訳装置
に入力されたと想定する。前編集されたＮｏ２の英文を
翻訳すると、正しい翻訳結果であるＮｏ６の訳文が得ら
れることになる。この詳細について以下に説明する。First, the No. 1 English sentence "The inventio" in FIG.
"uses the said components for the device." is input from the keyboard 4, for example, it is given to the translation module 5 by the control of the CPU 1. The translation module 5 translates the English sentence of No1 and the translated sentence of No5 of FIG. In this translation result, the part of speech of the word "said" should have been an adjective ("said"), but it was interpreted as a verb ("said"), resulting in a mistranslation. In contrast, No. 2 English sentence “The inventio” in FIG.
n uses g [The a said] components for the devi
It is assumed that the original text is pre-edited and input to the translation device in advance, such as “ce.”. When the pre-edited No. 2 English text is translated, the correct translation result No. 6 is obtained. The details will be described below.

【００４２】図２の処理フローを参照して、まずステッ
プＳ１（図中、Ｓ１と略す）において原文（図４のＮｏ
２の英文）の文字列がバッファＡにセットされる（図５
参照）。Referring to the processing flow of FIG. 2, first, in step S1 (abbreviated as S1 in the figure), the original text (No in FIG. 4) is entered.
2) is set in buffer A (Fig. 5).
reference).

【００４３】次に、ステップＳ２の処理において、バッ
ファＡの文字列中に、前編集記号「ｇ［］」がある
か否か検出する。もし記号ｇ［］が文字列中にあれ
ば、ｇ［と］で囲まれた文字列を変換後単語／単語列
Ｂ２としてバッファＢにセットする。それに並行して、
単語列Ｂ２の文字列から前編集記号（この場合、記号ａ
）を取除いた文字列を変換前単語／単語列Ｂ１として
バッファＢにセットする。この結果、単語列Ｂ１は“th
e said”となり、単語列Ｂ２は“ the a said”となっ
てバッファＢにセットされる（図６参照）。なお、この
例では、バッファＡにストアされた原文には学習が所望
される前編集記号は１個なので、バッファＢには単語列
Ｂ１およびＢ２のペアが１個だけ格納されることになる
が、バッファＡに格納された原文中に学習が所望される
異なる前編集記号が複数箇所に挿入されていれば、単語
列Ｂ１およびＢ２のペアがバッファＢに複数個収容され
ることになる。Next, in the processing of step S2,
In the character string of file A, the pre-editing symbol "g There is []
It is detected whether or not. If g [] Is in the string
If g Word / word string after converting the character string enclosed in [and]
Set it in buffer B as B2. In parallel with that,
From the character string of the word string B2, the pre-edit symbol (in this case, the symbol a
) Is used as the pre-conversion word / word string B1
Set in buffer B. As a result, the word string B1 is "th
e said "and the word string B2 becomes" the a became said
Are set in the buffer B (see FIG. 6). In addition, this
In the example, the original sentence stored in buffer A is desired to be learned.
Since there is only one pre-edit symbol, the word string is stored in buffer B.
Only one pair of B1 and B2 will be stored
However, learning is desired in the original text stored in buffer A.
If different preedit symbols are inserted in multiple places, the word
Buffer B contains a plurality of pairs of rows B1 and B2.
Will be.

【００４４】次のステップＳ３の処理において、バッフ
ァＡに格納されている文字列から学習指示の前編集記号
ｇ［］を取除いた文字列をバッファＣにセットする
（図７参照）。In the processing of the next step S3, the pre-editing symbol g of the learning instruction is extracted from the character string stored in the buffer A. The character string from which [] is removed is set in the buffer C (see FIG. 7).

【００４５】次に、ステップＳ４の処理に移行し、図７
のバッファＣに格納されている文字列とバッファＢに変
換前単語／単語列Ｂ１として格納されている文字列“th
e said”とのマッチングを行なう。もし、バッファＣに
格納されている文字列中に単語列Ｂ１とマッチするもの
があれば、マッチした原文中の文字列をバッファＢの対
応の変換後単語／単語列Ｂ２の文字列“ the a said”
に置換して、バッファＤにセットする。これは、バッフ
ァＡに格納された原文中に同じ前編集記号の付加が必要
とされる箇所が複数個あったとしても、その中の１ヵ所
だけに学習指示を表わす前編集記号ｇ［］を用いて
前編集を行なえば、その他の箇所も同じ前編集記号の付
加が行なえるようにするためである。したがって、ステ
ップＳ４の処理結果、バッファＤには図８に示されるよ
うな文字列がストアされることになる。Next, the process proceeds to step S4, as shown in FIG.
Character string stored in the buffer C and the character string “th stored in the buffer B as the pre-conversion word / word string B1
e said ”is performed. If there is a character string stored in the buffer C that matches the word string B1, the matched character string in the original sentence is converted to the corresponding word / word in the buffer B. Character string of word string B2 "the a said ”
To buffer D. This is because even if the original sentence stored in the buffer A has a plurality of places where the same pre-editing symbol needs to be added, the pre-editing symbol g indicating the learning instruction is present only at one of the places. This is because if the pre-editing is performed using [], the same pre-editing symbol can be added to other portions. Therefore, as a result of the processing in step S4, the character string as shown in FIG. 8 is stored in the buffer D.

【００４６】ここで、利用者ごとに選択可能な前編集学
習ファイルについて図１２を参照して説明する。今、翻
訳しようとする原文は、図４に示されるように特許分野
特有の文章である。したがって、利用者は、これに応じ
て特許分野の学習ファイル２０を選択する。前編集学習
ファイル２０は、図示されるように項目Ｉ１およびＩ２
を含む。項目Ｉ１には変換前単語／単語列Ｂ１が複数個
格納され、項目Ｉ２には、項目Ｉ１に格納された単語列
Ｂ１のそれぞれに対応して単語列Ｂ２がストアされる。Here, the pre-editing learning file that can be selected for each user will be described with reference to FIG. The original text to be translated is a text peculiar to the patent field as shown in FIG. Therefore, the user selects the learning file 20 in the patent field accordingly. The preedit learning file 20 includes items I1 and I2 as shown.
including. The item I1 stores a plurality of pre-conversion words / word strings B1, and the item I2 stores the word string B2 corresponding to each of the word strings B1 stored in the item I1.

【００４７】図２の処理フローに戻る。次に、ステップ
Ｓ５の処理において、利用者により前編集学習ファイル
が既に選択されているか否かがチェックされる。未選択
の場合、処理は後述するステップＳ７以降へ移行する
が、現在は、前編集学習ファイル２０が選択済であるの
で、ステップＳ５の判断に応じて処理はステップＳ６へ
移行する。Returning to the processing flow of FIG. Next, in the process of step S5, it is checked whether or not the user has selected the pre-editing learning file. If not selected, the process proceeds to step S7 and subsequent steps, which will be described later, but since the pre-editing learning file 20 is currently selected, the process proceeds to step S6 according to the determination in step S5.

【００４８】ステップＳ６に処理においては、バッファ
Ｄに格納されている原文中の文字列と前編集学習ファイ
ル２０の項目Ｉ１に格納されている変換前単語／単語列
Ｂ１のそれぞれとがマッチングされる。マッチした場
合、バッファＤ中の該当の文字列は、学習ファイル２０
の対応の変換後単語／単語列Ｂ２を用いて置換され、そ
の結果がバッファＥにセットされる。逆に、マッチしな
かった場合は、バッファＤの内容をそのままバッファＥ
に格納する（図９参照）。以上で入力原文の前編集記号
ｇ［］および前編集学習ファイル２０を用いた文字
列の一括した置換作業を終了する。In the processing in step S6, the character string in the original sentence stored in the buffer D is matched with each of the pre-conversion word / word string B1 stored in the item I1 of the preediting learning file 20. .. If there is a match, the corresponding character string in buffer D is the learning file 20.
The corresponding converted word / word string B2 is replaced and the result is set in the buffer E. On the other hand, if no match is found, the contents of buffer D are unchanged.
(See FIG. 9). This is the preedit symbol g of the input source text. [] And the batch replacement work of the character string using the pre-editing learning file 20 are completed.

【００４９】その後、ステップＳ８の処理に移行し、バ
ッファＢに何らかの値がセットされていると判断される
とステップＳ９の処理において、バッファＢの内容が前
編集学習ファイル２０に追加されて、前編集学習ファイ
ル２０がデータ更新される。この動作が完了した時点の
前編集学習ファイル２０の状態は、図１２（ａ）の状態
から図１２（ｂ）の状態に更新される。Thereafter, the process proceeds to step S8, and if it is determined that some value is set in the buffer B, the content of the buffer B is added to the preedit learning file 20 in the process of step S9, The data of the edit learning file 20 is updated. The state of the pre-editing learning file 20 at the time when this operation is completed is updated from the state of FIG. 12 (a) to the state of FIG. 12 (b).

【００５０】その後処理は、ステップＳ１０の辞書引き
処理へ移行し、バッファＥにセットれた文字列を辞書Ｄ
１を引きながら単語単位に分割して、かつ前編集記号ａ
を切離してバッファＦにセットする（図１０参照）。
図１０に示されるようにバッファＦは単語位置、単語文
字列、３つの品詞および前編集記号を含む。単語文字列
は、バッファＥにセットされた文字列を単語単位に分割
して得られた各単語の文字列である。単語位置は、対応
の単語文字列のバッファＥの文字列中における位置情報
を表わしている。３つの品詞は、対応の単語文字列を辞
書Ｄ１を引いたことによって得られる品詞情報および対
応の前編集記号によって指定される品詞情報を格納す
る。前編集記号は、対応の単語文字列に付加されていた
前編集記号である。バッファＥにセットされた文字列を
単語単位に分割して、かつ前編集記号を切離して、図１
０のバッファＦにセットした後、辞書Ｄ１を引いて各単
語文字列の品詞や形態属性などの情報を得て、バッファ
Ｆの「品詞」の欄にストアする。ここでは、説明を容易
にするため品詞以外の形態属性については説明を省略す
る。辞書Ｄ１を引いたことにより得られた情報がバッフ
ァＦにセットされた状態が図１１に示される。Thereafter, the processing shifts to the dictionary lookup processing of step S10, and the character string set in the buffer E is converted into the dictionary D.
It is divided into words while subtracting 1, and the preedit symbol a
Is separated and set in the buffer F (see FIG. 10).
As shown in FIG. 10, the buffer F includes a word position, a word character string, three parts of speech, and a preedit symbol. The word character string is a character string of each word obtained by dividing the character string set in the buffer E into word units. The word position represents the position information of the corresponding word character string in the character string of the buffer E. The three part-of-speech stores the part-of-speech information obtained by subtracting the corresponding word character string from the dictionary D1 and the part-of-speech information specified by the corresponding preedit symbol. The pre-edit symbol is the pre-edit symbol added to the corresponding word character string. The character string set in the buffer E is divided into word units, and the pre-edit symbol is cut off.
After setting in the buffer F of 0, the dictionary D1 is drawn to obtain information such as the part of speech and the form attribute of each word character string, and the information is stored in the "part of speech" column of the buffer F. Here, for ease of explanation, description of the morphological attributes other than the part of speech is omitted. FIG. 11 shows a state in which the information obtained by drawing the dictionary D1 is set in the buffer F.

【００５１】その後、処理はステップＳ１１に移行し、
図１１のバッファＦに格納される情報に対して前編集記
号（この例の場合、前編集記号「ａ」）の指示に従っ
た処理が施される。つまり、図１１を参照して、前編集
記号「ａ」が付された単語位置「４」の単語文字列
「said」の品詞の候補の中から、前編集記号「ａ」が
指定する「形容詞」以外の品詞（この場合、「動詞」）
をバッファＦより削除する。この削除によって得られる
バッファＦの状態は図示されないが、図１１に示された
バッファＦの状態から、単語位置「４」に該当する品詞
「動詞」が削除された状態となる。これにより、図１の
辞書引き形態素解析部１１における辞書Ｄ１および学習
ファイル２０を用いた形態素解析が終了し、解析して得
られた形態素列は次段の構文解析部１２に与えられる。Thereafter, the processing shifts to step S11,
For the information stored in the buffer F of FIG. 11, the pre-edit symbol (in this example, the pre-edit symbol "a )) Instruction is performed. That is, referring to FIG. 11, the pre-edit symbol “a From among the candidates for the part of speech of the word character string "said" at the word position "4" Part of speech other than "adjective" specified by "" (in this case, "verb")
Is deleted from the buffer F. Although the state of the buffer F obtained by this deletion is not shown, the part of speech “verb” corresponding to the word position “4” is deleted from the state of the buffer F shown in FIG. As a result, the morpheme analysis using the dictionary D1 and the learning file 20 in the dictionary lookup morpheme analysis unit 11 in FIG. 1 is completed, and the morpheme string obtained by the analysis is given to the syntax analysis unit 12 in the next stage.

【００５２】前述のステップＳ５の処理に戻り、利用者
により前編集学習ファイルが選択されていないと判断さ
れれば、ステップＳ７の処理において、図８に示された
バッファＤの内容を単語単位に分割し、図１０のように
バッファＦに格納する。その後、前述したステップＳ１
０以降の処理が、同様にして実行される。Returning to the processing of step S5, if it is determined that the user does not select the pre-editing learning file, the contents of the buffer D shown in FIG. It is divided and stored in the buffer F as shown in FIG. Then, the above-mentioned step S1
The processing from 0 onward is executed in the same manner.

【００５３】以上のように、図２の処理フローに従って
辞書Ｄ１および学習ファイル２０を用いた原文の前編集
処理が行なわれ、前編集処理された原文について形態素
解析される。そして得られた形態素列は、構文解析部１
２に与えられる。構文解析部１２は、図１１のバッファ
Ｆの単語位置「４」の品詞「動詞」の情報が削除されて
いるので、該当する単語文字列「said」を「形容詞」と
解釈して構文解析木を作成する。得られた構文解析木は
変換部１３および生成部１４を経て、表示部３などに図
４のＮｏ６の訳文がＮｏ２の英文とともに表示される。
以上で、この原文の翻訳処理と前編集記号ｇ［］を
用いた学習（学習ファイル２０の更新）が終了したこと
になる。As described above, the pre-editing process of the original sentence using the dictionary D1 and the learning file 20 is performed according to the process flow of FIG. 2, and the morpheme analysis is performed on the pre-edited original sentence. Then, the obtained morpheme string is used by the syntax analysis unit 1
Given to 2. Since the information of the part-of-speech “verb” at the word position “4” in the buffer F of FIG. 11 has been deleted, the syntactic analysis unit 12 interprets the corresponding word character string “said” as an “adjective” and parses the syntactic analysis tree. To create. The obtained syntactic analysis tree passes through the conversion unit 13 and the generation unit 14, and the translated text No. 6 of FIG. 4 is displayed together with the English text No. 2 on the display unit 3 and the like.
This is the translation processing of this original text and the preedit symbol g Learning using [] (update of the learning file 20) is completed.

【００５４】過去に学習された前編集記号の情報を利
用して翻訳する処理前述のように前編集記号の学習処理が完了した後、原文
として図４のＮｏ３の英文が翻訳モジュール５に入力さ
れると、再び、辞書引き形態素解析部１１において図３
の処理フローが実行される。Process of translating using information of pre-edited symbols learned in the past After the learning process of pre-edited symbols is completed as described above, the English sentence No. 3 in FIG. 4 is input to the translation module 5 as the original sentence. Then, again in the dictionary lookup morpheme analysis unit 11, FIG.
The processing flow of is executed.

【００５５】なお、このとき利用者により前編集学習フ
ァイル２０（図１２（ｂ）参照）が選択されていたと想
定する。また、バッファＡ〜Ｆは、１センテンスごとの
原文が入力されるごとにクリアされて用いられると想定
する。It is assumed that the user has selected the pre-editing learning file 20 (see FIG. 12B) at this time. It is also assumed that the buffers A to F are cleared and used each time an original sentence for each sentence is input.

【００５６】図４のＮｏ３の英文が入力されると、解析
部１１は前述と同様にして図２のステップＳ１ないしＳ
５の処理を実行する。このとき、入力された原文には前
編集記号の学習を指示する前編集記号ｇ［］が含ま
れていないので、バッファＤにはＮｏ３の英文がそのま
まストアされる。また、バッファＢには値は何らストア
されない。When the No. 3 English sentence in FIG. 4 is input, the analyzing unit 11 performs steps S1 to S in FIG. 2 in the same manner as described above.
Process 5 is executed. At this time, a pre-edit symbol g for instructing learning of the pre-edit symbol is input to the input original text. Since [] is not included, the English sentence of No3 is stored in the buffer D as it is. Further, no value is stored in the buffer B.

【００５７】ステップＳ５の判定結果、前編集学習ファ
イル２０が選択されているので、次のステップＳ６の処
理に移行する。Since the pre-edit learning file 20 is selected as a result of the determination in step S5, the process proceeds to the next step S6.

【００５８】ステップＳ６の処理において、バッファＤ
に格納された文字列は図１２（ｂ）に示された前編集学
習ファイル２０に基づいて検索／置換される。つまり、
図１２（ｂ）のファイル２０の変換前単語／単語列Ｂ１
がバッファＤに格納されたＮｏ３の英文中に検出される
ので、検出されたこの文字列を、ファイル２０の対応の
変換後単語／単語列Ｂ２「 the a said」を用いて置換
された後、バッファＤの文字列はバッファＥにセットさ
れる。この場合、バッファＥの状態は前述した図９のそ
れと同様となる。In the process of step S6, the buffer D
The character string stored in is searched / replaced based on the pre-editing learning file 20 shown in FIG. That is,
Unconverted word / word string B1 in the file 20 of FIG. 12 (b)
Is detected in the No. 3 English sentence stored in the buffer D, the detected character string is converted to the corresponding converted word / word string B2 “the a of the file 20. After being replaced with "said", the string in buffer D is set in buffer E. In this case, the state of the buffer E is the same as that of FIG. 9 described above.

【００５９】次にステップＳ８の処理において、バッフ
ァＢに値があるか否かが判定される。前述したようにＮ
ｏ３の英文に関してバッファＢに値はセットされないの
で、処理はステップＳ１０に移行する。ステップＳ１０
の処理においてはバッファＥ（図９参照）の内容がバッ
ファＦ（図１０）にストアされ、バッファＦ（図１０）
の内容に対して辞書引き処理が実行され、その結果が改
めてバッファＦにセットされる（図１１参照）。これに
より、Ｎｏ３の英文に関して辞書Ｄ１および過去の前編
集記号学習情報を利用した前編集処理が終了する。Next, in the process of step S8, it is determined whether or not there is a value in the buffer B. As mentioned above, N
Since no value is set in the buffer B for the English sentence of o3, the process proceeds to step S10. Step S10
Process, the contents of the buffer E (see FIG. 9) are stored in the buffer F (FIG. 10), and the buffer F (FIG. 10) is stored.
The dictionary lookup processing is executed for the contents of the above, and the result is set again in the buffer F (see FIG. 11). As a result, the pre-editing process using the dictionary D1 and the past pre-editing symbol learning information for the No3 English sentence ends.

【００６０】以下同様にして、構文解析部１２、変換部
１３および翻訳文生成部１４を経て、Ｎｏ３の英文につ
いて図４示されるようにＮｏ７の訳文が得られ、その結
果が原文および翻訳文が対となって表示部３などに画面
表示される。Similarly, the No. 7 translated sentence is obtained for the No. 3 English sentence through the syntax analysis unit 12, the conversion unit 13 and the translated sentence generation unit 14 as shown in FIG. 4, and the result is converted into the original sentence and the translated sentence. The images are displayed on the display unit 3 or the like as a pair.

【００６１】さらに、次に該翻訳装置に図４に示される
Ｎｏ４の英文が入力されると、辞書引き形態素解析部１
１は、前述と同様にして図２のステップＳ１〜ステップ
Ｓ４の処理を実行するので、バッファＤの内容は、“It
v means that the component is good. ”また、Ｎｏ
４の英文には前編集の学習を指示する記号ｇ［］が
含まれないので、バッファＢに値は格納されない。Further, when the No. 4 English sentence shown in FIG. 4 is next input to the translation device, the dictionary lookup morphological analysis unit 1
1 executes the processing of steps S1 to S4 of FIG. 2 in the same manner as described above, the content of the buffer D is "It
v means that the component is good. ”Also, No
In the English sentence of 4, the symbol g that indicates the learning of preediting Since [] is not included, the value is not stored in the buffer B.

【００６２】ステップＳ５の処理において、利用者によ
り前編集学習ファイル２０（図１２（ｂ）参照）が選択
されていると判定されるので、応じて処理は次のステッ
プＳ６に移行する。In the process of step S5, it is determined that the user has selected the pre-editing learning file 20 (see FIG. 12B), and accordingly the process proceeds to the next step S6.

【００６３】ステップＳ６の処理においてバッファＤに
格納された文字列と前編集学習ファイル２０の変換前単
語／単語列Ｂ１とのマッチングが図られるが、バッファ
Ｄ中の文字列“ v means ”と単語列Ｂ１の“means ”
がマッチしないため、変換後単語／単語列Ｂ２の“ n
means ”を用いた文字列置換は行なわれない。したがっ
てバッファＥには“It v means that the component i
s good. ”がセットされることになる。In the process of step S6, the character string stored in the buffer D and the pre-conversion word / word string B1 of the pre-editing learning file 20 are matched, but the character string "v" in the buffer D is matched. means "and" means "of the word string B1
Does not match, so the converted word / word string B2 "n
String replacement using "means" is not performed. Therefore, "It v means that the component i
s good. ”will be set.

【００６４】次にステップＳ８の処理において、バッフ
ァＢに値がないと判定されるので、次のステップＳ１０
およびＳ１１の処理を経て、図４に示されるＮｏ８の訳
文が得られることになる。すなわちＮｏ４の英文におけ
る文字列“ v means ”は、図３に示されたように前編
集記号“ｖ ”が付された単語“means ”を「動詞」と
指定するように翻訳されることになる。このように、前
編集記号の処理の優先の順序としては（１）文字列中に
ある前編集記号、（２）同一原文の他の箇所で学習が指
定された前編集記号（すなわち、バッファＢにストアさ
れた前編集記号）、（３）前編集学習ファイル２０に予
め学習された前編集記号、となる。Next, in the processing of step S8, since it is determined that there is no value in the buffer B, the next step S10
And through the process of S11, the translated sentence of No8 shown in FIG. 4 is obtained. That is, the character string "v means "is a pre-editing symbol" v "as shown in FIG. The word "means" with "" will be translated to specify it as a "verb". In this way, the prioritized processing order of the pre-editing symbols is (1) the pre-editing symbol in the character string, (2) the pre-editing symbol designated to be learned in another part of the same original sentence (that is, the buffer B (Pre-edit symbol stored in), (3) pre-edit symbol learned in advance in the pre-edit learning file 20.

【００６５】以上から明らかなように、本実施例では利
用者が任意に選択する前編集学習ファイルごとに、利用
者が行なう前編集処理の内容を選択的に学習させること
によって、特定の分野に頻出する前編集処理の必要性を
極力少なくすることができる。これは、学習が進行する
に従ってより効果的に行なわれる。また、前編集学習フ
ァイルが選択されている状態において、原文中の特定の
単語に関して過去に学習した前編集の情報を適用したく
ない場合には、原文中の当該単語に独自の前編集処理を
施すことにより前編集学習ファイルの適用を回避させる
ことができる。このように、利用者による前編集作業の
効率、ひいては該翻訳装置を用いた翻訳作業全体の作業
効率を向上させることができる。As is clear from the above, in the present embodiment, by selectively learning the contents of the pre-editing processing performed by the user for each pre-editing learning file arbitrarily selected by the user, a specific field can be obtained. The need for frequent pre-editing processing can be minimized. This is done more effectively as the learning progresses. Also, if you do not want to apply the pre-editing information learned in the past for a specific word in the original sentence when the pre-editing learning file is selected, perform the original pre-editing process for the word in the original sentence. By applying it, it is possible to avoid applying the pre-editing learning file. In this way, the efficiency of the pre-editing work by the user, and thus the work efficiency of the entire translation work using the translation device can be improved.

【００６６】また、前編集学習ファイル２０に登録され
た前編集学習情報を取消すためには、たとえば前編集記
号“ｃ［］”のような学習情報を取消すことを指示
するような前編集記号を予め設けておき、原文中におい
てこの記号で囲まれた文字列があれば、該文字列から前
編集記号を取除いた文字列（変換前単語／単語列Ｂ１）
に関するデータを前編集学習ファイル２０から削除する
ことによって実現可能である。Further, in order to cancel the pre-editing learning information registered in the pre-editing learning file 20, for example, the pre-editing symbol "c" is used. A pre-edit symbol, such as [], which indicates to cancel learning information is provided in advance, and if there is a character string enclosed by this symbol in the original sentence, the pre-edit symbol is removed from the character string. Character string (word before conversion / word string B1)
This can be realized by deleting the data related to the pre-editing learning file 20.

【００６７】また、原文にバッファＢの内容または選択
される前編集学習ファイル２０の内容を適用したことに
より得られた結果（バッファＥの内容）を表示部３など
に表示して、該装置に入力された原文が装置内部でどの
ように前編集処理されたか所望に応じて表示することも
可能である。Further, the result obtained by applying the contents of the buffer B or the contents of the preediting learning file 20 to be selected to the original sentence (contents of the buffer E) is displayed on the display unit 3 or the like, and is displayed on the device. It is also possible to display, as desired, how the input original text was preedited inside the device.

【００６８】また、過去にどのような前編集処理が学習
されているかを知るために、所望に応じて最新の前編集
学習ファイル２０（図１２（ｃ）参照）を表示部３に表
示、あるいはプリンタ７に印字して確認することも可能
である。Further, in order to know what kind of pre-editing process has been learned in the past, the latest pre-editing learning file 20 (see FIG. 12C) is displayed on the display unit 3 as desired, or It is also possible to print on the printer 7 for confirmation.

【００６９】さらに、本実施例では図１２（ａ）〜
（ｃ）に示された前編集学習ファイル２０の検索の効率
化については言及していないが、翻訳作業全体の効率を
上昇させるために前編集学習ファイル２０の内部を変換
前単語／単語列Ｂ１をキーにしてソートしながら格納
し、ソートされたキーに対するインデックスを作成し、
前編集学習ファイル２０検索時にはこのインデックスを
用いて高速検索を図ることも可能である。Further, in this embodiment, FIG.
Although there is no mention of improving the efficiency of the search of the pre-editing learning file 20 shown in (c), the inside of the pre-editing learning file 20 is converted to the pre-conversion word / word string B1 in order to increase the efficiency of the whole translation work. Store while sorting by using as a key, create an index for the sorted key,
At the time of searching the pre-editing learning file 20, it is possible to use this index for high-speed searching.

【００７０】図１３（ａ）〜（ｃ）および図１４は、本
発明の一実施例において使用される前編集記号のその他
の例を示す図である。図１３（ａ）は単語の品詞を指定
するための前編集記号である。１３（ｂ）は、英文など
における第１文型〜第５文型のいずれかを指定するため
の前編集記号である。１３（ｃ）は、翻訳モジュール５
における翻訳処理時に参照される辞書Ｄ１について、ユ
ーザ辞書、専門用語辞書および基本語辞書のいずれかを
特に指定するための前編集記号である。図１４に示され
る前編集記号は、原文を分割するために用いられる前編
集記号である。たとえば、いくつかの要件をまとめて１
つの長い文とした原文である場合、カンマやコロンの位
置で図１４に示されるような記号を用いて原文を分割す
るように前編集処理すれば、効率の良い翻訳が実行され
る。13 (a) to 13 (c) and FIG. 14 are diagrams showing other examples of pre-edit symbols used in one embodiment of the present invention. FIG. 13A shows a pre-editing symbol for designating the part of speech of a word. Reference numeral 13 (b) is a pre-editing symbol for designating any of the first sentence pattern to the fifth sentence pattern in an English sentence or the like. 13 (c) is the translation module 5
It is a pre-editing symbol for specifically designating any one of the user dictionary, the technical term dictionary, and the basic word dictionary for the dictionary D1 referred to in the translation process in. The pre-editing symbol shown in FIG. 14 is a pre-editing symbol used for dividing the original sentence. For example, some requirements are summarized as 1
If the original sentence is one long sentence, efficient translation can be executed by preediting the original sentence by dividing the original sentence by using a symbol as shown in FIG. 14 at the position of comma or colon.

【００７１】また、本実施例による翻訳装置では、特定
の前編集記号を学習するか否かが任意に選択できるの
で、無条件に前編集記号を学習するのにくらべて副作用
の少ない前編集処理が得られる。ここで言う副作用と
は、前編集記号の学習内容を、別の原文に適用した場合
に悪影響を及ぼすことを指す。詳細には、学習するか否
かの選択操作を備えていないと想定すると、単語“mean
s ”が常に前編集記号が付された文字列“ n means ”
（「意味する」ではなく「手段」）であるという学習を
行なってしまい、翻訳しようとする原文が特許関係以外
の分野に該当する原文であった場合、誤った解釈を頻繁
に生じてしまう。たとえば、翻訳しようとする原文が特
許関係の原文である場合には、たいていの場合、単語
“means ”を「手段」と名詞に特定して翻訳して問題な
いが、他の分野、たとえば経済関係に関する原文であっ
た場合、単語“means ”を「意味する」の動詞として翻
訳した方がよいということになる。それゆえに後者の経
済関係の原文において、単語“means ”を学習しないよ
うに選択すればよい。これに対して、特許関係の原文を
翻訳する場合は、周知のとおり頻繁に単語“means ”が
「手段」の名詞として使用されるので、むしろ学習する
ように選択した方がよいと言える。Further, in the translation apparatus according to the present embodiment, it is possible to arbitrarily select whether or not to learn a specific preedit symbol, so that the preedit processing with less side effects than unconditionally learning the preedit symbol is possible. Is obtained. The side effect mentioned here means that the learning content of the preedit symbol is adversely affected when it is applied to another original text. In detail, assuming that there is no selection operation for learning or not, the word “mean
"s" is always the string "n" with a preedit symbol means ”
If the original text to be translated is an original text applicable to a field other than patent-related, it will be misinterpreted frequently. For example, if the original sentence to be translated is a patent-related original sentence, in most cases, the word "means" can be specified as a "means" and a noun and translated, but there is no problem, but in other fields such as economic relations. If it is a source sentence related to, it means that the word "means" should be translated as a verb of "meaning". Therefore, in the latter text of economic relations, one may choose not to learn the word "means". On the other hand, when translating original texts related to patents, the word "means" is frequently used as a noun of "means" as is well known, so it can be said that it is better to choose to learn.

【００７２】[0072]

【発明の効果】以上のようにこの発明によれば、学習指
定手段を介して、原文中の前編集記号を学習することを
指定するだけで、以降学習手段によりこの前編集記号が
それが付される文字列と関連付けて学習される。さらに
編集手段は学習手段により学習された内容に従って学習
文字列に該当する原文中の文字列に、学習された関連の
前編集記号を付してこの原文を編集し直すので、原文中
に学習文字列に該当の文字列が複数箇所存在したとして
も、繰り返し前編集記号を付して編集する手間が省け
る。これにより、翻訳処理を支援するために前編集記号
を付して編集処理する作業が省力化され、ひいては翻訳
処理自体の処理効率が向上するという効果がある。As described above, according to the present invention, it is only necessary to specify, through the learning designating means, that the preediting symbol in the original sentence is to be learned. Learned by associating with the character string. Further, the editing means re-edits the original text by adding the relevant pre-editing symbol learned to the character string in the original text corresponding to the learning character string according to the content learned by the learning means. Even if there are a plurality of corresponding character strings in a column, it is possible to save the trouble of adding the pre-edit symbol and editing it. As a result, there is an effect that the work of adding a pre-editing symbol to support the translation process and performing the editing process is labor-saving, and the processing efficiency of the translation process itself is improved.

【００７３】さらに、学習手段は複数のファイルを有し
たファイル記憶状態を有して、各ファイルには原文が記
述する内容が属する分野ごと、または利用者ごとの翻訳
処理を支援するための情報が蓄積されるので、これら情
報が蓄積されたファイルを用いて原文を編集する第１編
集手段における編集効率は、選択されたファイルに蓄積
される情報量に伴って向上する。Further, the learning means has a file storage state having a plurality of files, and each file has information for supporting the translation process for each field to which the content described in the original text belongs or for each user. Since the information is stored, the editing efficiency in the first editing means for editing the original text using the file in which the information is stored improves with the amount of information stored in the selected file.

【００７４】また、学習手段による学習内容、またはフ
ァイル記憶手段によりファイルに記憶された内容を任意
に変更することが可能なので、これらの学習内容を用い
て原文を編集する処理は多様化し、前述の前編集に係る
作業もその内容が多様化されるという効果もある。Further, since the learning contents by the learning means or the contents stored in the file by the file storing means can be arbitrarily changed, the processing for editing the original sentence using these learning contents is diversified. The work relating to pre-editing also has the effect of diversifying its contents.

[Brief description of drawings]

【図１】（ａ）および（ｂ）は、本発明の一実施例によ
る翻訳装置の構成図である。1A and 1B are configuration diagrams of a translation device according to an embodiment of the present invention.

【図２】図１（ｂ）の辞書引き形態素解析部における処
理のフロー図である。FIG. 2 is a flow chart of processing in a dictionary lookup morpheme analysis unit of FIG. 1 (b).

【図３】本発明の一実施例において使用される前編集記
号の一例を示す図である。FIG. 3 is a diagram showing an example of a pre-editing symbol used in an embodiment of the present invention.

【図４】本発明の一実施例による翻訳結果の表示画面の
一例を示す図である。FIG. 4 is a diagram showing an example of a translation result display screen according to an embodiment of the present invention.

【図５】図２の処理フロー実行時に参照されるバッファ
Ａの内容例を示す図である。5 is a diagram showing an example of contents of a buffer A referred to when the processing flow of FIG. 2 is executed.

【図６】図２の処理フロー実行時に参照されるバッファ
Ｂの内容例を示す図である。FIG. 6 is a diagram showing an example of contents of a buffer B referred to when the processing flow of FIG. 2 is executed.

【図７】図２の処理フロー実行時に参照されるバッファ
Ｃの内容例を示す図である。7 is a diagram showing an example of contents of a buffer C that is referred to when the processing flow of FIG. 2 is executed.

【図８】図２の処理フロー実行時に参照されるバッファ
Ｄの内容例を示す図である。FIG. 8 is a diagram showing an example of contents of a buffer D referred to when the processing flow of FIG. 2 is executed.

【図９】図２の処理フロー実行時に参照されるバッファ
Ｅの内容例を示す図である。9 is a diagram showing an example of contents of a buffer E referred to when the processing flow of FIG. 2 is executed.

【図１０】図２の処理フロー実行時に参照されるバッフ
ァＦの内容例を示す図である。10 is a diagram showing an example of contents of a buffer F referred to when the processing flow of FIG. 2 is executed.

【図１１】データ更新された後の図１０のバッファＦの
内容例を示す図である。11 is a diagram showing an example of the contents of the buffer F of FIG. 10 after the data is updated.

【図１２】（ａ）ないし（ｃ）は、本発明の一実施例に
よる前編集学習ファイルの内容例を示す図である。12A to 12C are diagrams showing examples of contents of a preediting learning file according to an embodiment of the present invention.

【図１３】（ａ）ないし（ｃ）は、本発明の一実施例に
おいて使用される前編集記号のその他の例を示す図であ
る。13A to 13C are diagrams showing other examples of pre-edit symbols used in an embodiment of the present invention.

【図１４】本発明の一実施例において使用される前編集
記号のその他の例を示す図である。FIG. 14 is a diagram showing another example of the pre-editing symbol used in the embodiment of the present invention.

[Explanation of symbols]

５翻訳モジュールＤ１辞書Ｄ２文法規則Ｄ３木構造変換規則なお、各図中、同一符号は同一または相当部分を示す。 5 Translation Module D1 Dictionary D2 Grammar Rule D3 Tree Structure Conversion Rule In each figure, the same reference numerals indicate the same or corresponding parts.

Claims

[Claims]

1. A translation device for translating a given original sentence based on a predetermined rule including dictionary data, and outputting a translated sentence for the original sentence, which is included in the original sentence before the translation process, And a learning designating means for designating whether or not to learn the pre-editing symbol for supporting the translation processing for each character string to which it is attached, and the pre-editing symbol designated by the learning designating means,
A learning unit for learning by associating with the character string to which it is attached; a character string corresponding to the learning character string learned by the learning unit among the character strings of the original sentence; Editing means for adding a pre-editing symbol, and inputting the original sentence edited by the editing means at the time of the translation processing, and supporting the translation processing defined in advance corresponding to the pre-editing symbol. And control means for executing the translation process by applying the information and the rules.
Translation device.

2. The learning means has a plurality of files in advance, and associates the preedit symbol designated by the learning designating means with the learning character string to which the preedit symbol designated by the learning designating means is associated with the selected file. The editing means further includes a file storage means for storing, of the character strings in the original sentence, a character string corresponding to the learned character string stored in the selected file, The translation device according to claim 1, further comprising first editing means for editing with the pre-edit symbol.

3. The plurality of files are prepared for each field to which the description content of the original sentence belongs, or for each user of the translation device, and the selection of the file is performed in the field to which the description content of the original sentence to be translated belongs or The translation device according to claim 2, wherein the translation device is selected according to a user of the translation device.

4. The translation device according to claim 1, wherein the contents learned by the learning means can be arbitrarily changed.

5. The translation device according to claim 2, wherein contents stored in the selected file can be arbitrarily changed.