JPH0628397A

JPH0628397A - Machine translation apparatus

Info

Publication number: JPH0628397A
Application number: JP4179883A
Authority: JP
Inventors: Takeshi Kutsumi; 毅九津見; Noriyuki Hirai; 徳行平井
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1992-07-07
Filing date: 1992-07-07
Publication date: 1994-02-04
Anticipated expiration: 2014-03-31
Also published as: JP2875927B2

Abstract

PURPOSE:To easily use a sentence including the pre-edit symbol of the pre-edit processing prior to the machine translation as a sentence by deleting prescribed symbols in a batch from an input original sentence after the pre-processing. CONSTITUTION:The pre-edit symbol is roughly classified into two types according to its form. All the pre-symbols of the 1st type have engings ''''. The places just after the symbols are not blank. On the other hand, the places just before these symbols are always blank. The 2nd type of the pre-edit symbol is used to punctuate sentences before and after the symbol in consideration by some meanings. Namely, these symbols are inserted between words. In the case of the 2nd type of the pre-edit symbol, the places just before and after these symbols inserted into the text are blank as a rule. With the use of these features, the processing of deleting the pre-edit symbols in a batch is performed by the program to be executed by main CPU.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、翻訳の際にユーザが予
め原文に前編集処理を施すことによって翻訳の効率や精
度を高めることができる機械翻訳装置に関し、特に、そ
の翻訳後の処理の改良に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a machine translation apparatus capable of improving the efficiency and accuracy of translation by pre-editing the original text by the user during translation, and more particularly to a machine translation apparatus Regarding improvement.

【０００２】[0002]

【従来の技術】従来の一般的機械翻訳装置は、ソース言
語のテキストを入力するためのキーボードと、入力され
たソース言語のテキストをターゲット言語に翻訳するた
めの翻訳モジュールと、翻訳モジュールによる翻訳の際
に用いられるメイン辞書やユーザ辞書からなる辞書、文
法規則および木構造変換規則などを格納するためのメモ
リと、これら各部分の動作の制御を行なうためのメイン
ＣＰＵ（中央演算処理装置）とを含む。2. Description of the Related Art A conventional general machine translation apparatus includes a keyboard for inputting source language text, a translation module for translating the input source language text into a target language, and a translation module. A main dictionary (main processing unit), a memory for storing a dictionary composed of a main dictionary and a user dictionary, a grammar rule, a tree structure conversion rule, and the like, and a main CPU (central processing unit) for controlling the operation of each of these parts. Including.

【０００３】キーボードから入力されたソース言語のテ
キストは、メインＣＰＵの制御により翻訳モジュールに
入力される。入力されたソース言語のテキストは、メイ
ン辞書やユーザ辞書からなる辞書、文法規則および木構
造変換規則を利用することにより、翻訳モジュールによ
ってターゲット言語に翻訳される。The source language text input from the keyboard is input to the translation module under the control of the main CPU. The input source language text is translated into the target language by the translation module by using the dictionary including the main dictionary and the user dictionary, the grammar rules, and the tree structure conversion rules.

【０００４】このような機械翻訳装置では、形態素解析
の結果得られた品詞から、辞書と文法規則とを用いて構
文解析を行なう場合、あらゆる可能な構文構造を組立て
るようになっている。したがって複数の構文構造に応じ
て複数の翻訳結果が出力される。そのため、この複数の
翻訳結果のうちどれが正しい翻訳結果であるかをユーザ
が選択する必要があった、また、考えられるあらゆる構
文構造を組立てるために、構文解析に長時間を要すると
いう問題点があった。これらの問題点は、原文の構造が
単純なものではなく、複雑で解釈に曖昧性があるために
生ずる。In such a machine translation device, when parsing a part of speech obtained as a result of morphological analysis using a dictionary and grammatical rules, all possible syntactic structures are assembled. Therefore, a plurality of translation results are output according to a plurality of syntactic structures. Therefore, it is necessary for the user to select which of the plurality of translation results is the correct translation result, and it takes a long time for parsing to assemble all possible syntactic structures. there were. These problems arise because the structure of the original text is not simple, but is complicated and vague in interpretation.

【０００５】このような問題を減少させるために、翻訳
に先立って入力原文に前編集処理を施すことが提案され
ている。前編集処理には、「前編集記号」が利用され
る。前編集記号とは、通常の文章には用いられないよう
な特殊な文字・記号列である。何種類か用意されている
前編集記号を原文の文中に挿入することにより、挿入さ
れた位置および前編集記号の種類に従って、特定の単語
の品詞を指定したり、何種類かの構文解析が可能な構文
構造を特定の構造に指定したりすることができる。In order to reduce such problems, it has been proposed to subject the input source text to pre-editing processing prior to translation. The "pre-editing symbol" is used for the pre-editing process. The pre-edit symbol is a special character / symbol string that is not used in ordinary text. By inserting several types of pre-edit symbols into the original sentence, you can specify the part of speech of a specific word according to the inserted position and the type of pre-edit symbol, and perform some types of syntactic analysis. You can specify a specific syntactic structure as a specific structure.

【０００６】このように前編集処理を施し、機械翻訳装
置ではこの前編集処理によって与えられる情報に基づき
翻訳を行なう。その結果、構文構造の候補が限定された
り、単語の品詞が明確に特定されたりするために、構文
解析の速度を速めることができる。また、翻訳結果の精
度も向上する。The pre-editing process is performed as described above, and the machine translation device translates based on the information given by the pre-editing process. As a result, the syntactic structure candidates are limited and the part of speech of the word is clearly specified, so that the parsing speed can be increased. Also, the accuracy of the translation result is improved.

【０００７】[0007]

【発明が解決しようとする課題】前編集記号は、機械翻
訳のみに用いられる記号である。したがって翻訳が終れ
ば、原文に挿入された前編集記号は不要である。しか
し、原文中にはこの前編集記号が翻訳後にも残ってい
る。したがってたとえば翻訳後の原文を文として利用し
たい場合には、この前編集記号を文書編集機能（たとえ
ばエディタ）などを用いて１つ１つ見つけて手作業で削
除しなければならない。The pre-editing symbol is a symbol used only for machine translation. Therefore, once the translation is complete, the preedit symbol inserted in the original text is not needed. However, this pre-edit symbol remains in the original text after translation. Therefore, for example, when the translated original sentence is desired to be used as a sentence, the pre-edit symbol must be found one by one using a document editing function (for example, an editor) and manually deleted.

【０００８】それゆえにこの発明の目的は、機械翻訳に
先立って行なわれた前編集処理の前編集記号を含む文章
を、容易に文として利用することを可能にできる機械翻
訳装置を提供することである。Therefore, an object of the present invention is to provide a machine translation device which makes it possible to easily use a sentence including a preedit symbol of a preedit process performed prior to machine translation as a sentence. is there.

【０００９】[0009]

【課題を解決するための手段】本発明に係る機械翻訳装
置は、自動翻訳の補助のための所定の記号を前処理とし
て挿入された前処理後入力原文を格納するための前処理
後入力原文格納手段と、前処理後入力原文を所定の記号
を翻訳の補助として用いながら他の言語に翻訳するため
の自動翻訳手段と、前処理後入力原文格納手段に格納さ
れた前処理後入力原文から所定の記号を除去した文を生
成するための前処理記号除去手段とを含む。A machine translation apparatus according to the present invention includes a preprocessed input source text for storing a preprocessed input source text in which a predetermined symbol for assisting automatic translation is inserted as a preprocessing. A storage means, an automatic translation means for translating the preprocessed input source text into another language while using a predetermined symbol as a translation aid, and a preprocessed input source text stored in the preprocessed input source text storage means. Preprocessing symbol removing means for generating a sentence from which a predetermined symbol is removed.

【００１０】[0010]

【作用】本発明に係る機械翻訳装置では、前処理のため
の所定の記号を含む入力原文が自動翻訳手段によって他
の言語に翻訳される。そして翻訳処理後の前処理後入力
原文からは、前処理記号除去手段により、前処理のため
の所定の記号が除去された文が生成される。前処理後の
入力原文を、従来のように文書編集機能を用いて手作業
で訂正して前処理のための記号を除去する必要はない。In the machine translation device according to the present invention, the input original sentence containing a predetermined symbol for preprocessing is translated into another language by the automatic translation means. Then, from the preprocessed input original sentence after the translation process, the preprocessed symbol removing means generates a sentence from which a predetermined symbol for preprocessing is removed. It is not necessary to manually correct the input original text after preprocessing by using the document editing function to remove the symbols for preprocessing as in the conventional case.

【００１１】[0011]

【実施例】図１は、本発明の機械翻訳装置の一実施例の
ブロック図である。図１を参照して機械翻訳装置は、メ
インＣＰＵ１と、メインＣＰＵ１が接続されたバス８と
を含む。さらに、以下の各装置もバス８に接続されてい
る。すなわち、この機械翻訳装置はさらに、メインメモ
リ２と、ＣＲＴ（カソード・レイ・チューブ）などから
なる表示装置３と、キーボード４と、翻訳処理を行なう
ための翻訳モジュール５と、翻訳結果などを必要であれ
ば出力するためのプリント７とを含む。翻訳モジュール
５には、翻訳の際に用いられる辞書、文法規則、木構造
変換規則などを格納したメモリ６が接続されている。1 is a block diagram of an embodiment of a machine translation apparatus of the present invention. Referring to FIG. 1, the machine translation device includes a main CPU 1 and a bus 8 to which the main CPU 1 is connected. Furthermore, the following devices are also connected to the bus 8. That is, the machine translation device further requires a main memory 2, a display device 3 including a CRT (cathode ray tube), a keyboard 4, a translation module 5 for performing translation processing, a translation result, and the like. If so, a print 7 for outputting is included. The translation module 5 is connected to a memory 6 which stores a dictionary used for translation, grammar rules, tree structure conversion rules, and the like.

【００１２】以下で説明する前編集記号の一括削除の処
理は、メインＣＰＵ１で実行されるプログラムにより行
なわれる。メインメモリ２には、このプログラムの実行
のために必要なバッファが割当てられる。The process of collectively deleting the pre-edit symbols described below is performed by a program executed by the main CPU 1. A buffer required for executing this program is allocated to the main memory 2.

【００１３】キーボード４から入力されたソース言語の
テキストはメインＣＰＵ１の制御により翻訳モジュール
５に送られる。翻訳モジュール５は、メモリ６に記憶さ
れている辞書、文法規則および木構造変換規則などを用
いて、入力されたソース言語のテキストをターゲット言
語に翻訳する。その結果はメインメモリ２に一旦記憶さ
れる。翻訳結果はまた表示装置３にも表示される。必要
であれば翻訳結果はプリンタ７によって紙片に印字され
る。メインメモリ２は、各種プログラム実行中のレジス
タメモリおよび表示バッファなどにも利用される。The source language text input from the keyboard 4 is sent to the translation module 5 under the control of the main CPU 1. The translation module 5 translates the input text in the source language into the target language using the dictionary, the grammar rules, the tree structure conversion rules, and the like stored in the memory 6. The result is temporarily stored in the main memory 2. The translation result is also displayed on the display device 3. If necessary, the translation result is printed on a piece of paper by the printer 7. The main memory 2 is also used as a register memory and a display buffer during execution of various programs.

【００１４】この場合、前編集処理を行なうために、入
力されたソース言語のテキストはメインメモリ２に一旦
記憶される。そしてメインＣＰＵ１によって制御される
エディタプログラムなどの編集処理機能を用い、テキス
ト中に前編集記号が挿入される。この挿入はキーボード
４および表示装置３を用いて行なわれる。このように前
編集処理を行なったテキストを翻訳モジュール５で翻訳
する場合、前述したように翻訳の効率が向上し、その精
度も向上する。In this case, the input source language text is temporarily stored in the main memory 2 in order to perform the pre-editing process. Then, a pre-edit symbol is inserted into the text by using an editing processing function such as an editor program controlled by the main CPU 1. This insertion is performed using the keyboard 4 and the display device 3. When the translation module 5 translates the pre-edited text as described above, the translation efficiency is improved and the accuracy thereof is improved as described above.

【００１５】以下、説明のためにテキスト原文を英語の
文章とし、これを日本語に翻訳する際を想定して、この
発明による前編集後の入力原文からの前編集記号の一括
削除の動作を説明する。なおこの処理は、メインＣＰＵ
１で実行されるプログラムにより行なわれる。Hereinafter, for the purpose of explanation, assuming that the text original sentence is an English sentence and translating it into Japanese, the operation of batch deletion of preedit symbols from the input original sentence after preediting according to the present invention will be described. explain. Note that this processing is performed by the main CPU
1 is performed by the program executed.

【００１６】図２〜図４は、前編集記号一括削除処理の
フローチャートである。このフローチャートについて詳
細に説明する前に、図１に示される機械翻訳装置で用い
られる前編集記号の例について説明する。前編集記号
は、その形態上２種に大別される。第１の種類の前編集
記号は図５に示される。第２の種類の前編集記号は図６
に示される。2 to 4 are flowcharts of the pre-editing symbol batch deletion process. Before describing this flowchart in detail, an example of pre-edit symbols used in the machine translation device shown in FIG. 1 will be described. Pre-edit symbols are roughly classified into two types according to their forms. The first type of preedit symbol is shown in FIG. The second type of preedit symbol is shown in FIG.
Shown in.

【００１７】図５に示される前編集記号は、単語の先頭
に付けることにより、その単語を何らかの意味で修飾す
るためのものである。修飾の種別としては、図５に示さ
れるように品詞指定記号と、文型指定記号と、辞書指定
記号などがある。品詞指定記号とは、この記号に続く単
語の品詞を指定するためのものである。文型指定記号と
は、この記号の後に続く文の文型を指定するためのもの
である。辞書指定記号とは、この記号の後に続く単語の
辞書引き処理を行なう際の辞書を指定するためのもので
ある。The pre-editing symbol shown in FIG. 5 is provided at the beginning of a word to modify the word in some sense. As the types of modification, there are a part-of-speech designation symbol, a sentence pattern designation symbol, a dictionary designation symbol, etc., as shown in FIG. The part-of-speech designation symbol is for designating the part-of-speech of the word following this symbol. The sentence pattern designating symbol is for designating the sentence pattern of the sentence following this symbol. The dictionary designation symbol is for designating a dictionary when performing a dictionary lookup process for the word following this symbol.

【００１８】図５に示される記号は、いずれも末尾が
“ ”であるという特徴を持つ。またこの記号の直後に
は単語が続く。すなわち、これらの記号の直後は空白で
はない。一方、これらの記号の直前は必ず空白である。The symbols shown in FIG. 5 all have the suffix "". The symbol is followed by a word, which is immediately followed by a word, that is, there is no blank immediately after these symbols, whereas there is always a blank immediately before these symbols.

【００１９】図６を参照して、第２の種類の前編集記号
は、その記号の前後を何らかの意味で区切るために用い
られるものである。すなわちこれらの記号は単語と単語
との間に挿入される。本実施例の場合にはこの第２番目
の種類の前編集記号は、括弧類を表わす同一の記号を２
個重ねたものとなっている。なお、図６に示される例で
は“／／”も含まれているが、これも一括して括弧類と
呼ぶものとする。Referring to FIG. 6, the second type of pre-edit symbol is used to separate the symbol before and after the symbol in some sense. That is, these symbols are inserted between words. In the case of the present embodiment, the second type of pre-edit symbol is the same symbol that represents parentheses.
It is a stack of pieces. Although "//" is also included in the example shown in FIG. 6, these are also collectively referred to as parentheses.

【００２０】この第２の種類の前編集記号の場合、テキ
スト中に挿入されたこれらの記号の直前および直後は原
則として空白である。しかし場合によってはそれらの空
白が設けられない場合もある。すなわち、前編集記号の
直前・直後に単語を構成する文字が存在する場合もあり
得る。In the case of this second type of pre-editing symbols, there is in principle a blank just before and immediately after these symbols inserted in the text. However, in some cases, these blanks may not be provided. That is, there may be cases where there are characters that form a word immediately before and after the preedit symbol.

【００２１】以上ような前編集記号を用いて前編集処理
を行なった入力原文を翻訳する処理につき、その概略を
説明する。最初に、翻訳すべき原文が読込まれる。読込
まれた英語の文章は、図７に示されるようにバッファＳ
１に格納される。このバッファＳ１には、図１に示され
るメインメモリ２の一部が割当てられる。An outline of the process of translating the input original text which has been subjected to the pre-editing process using the above-mentioned pre-editing symbols will be described. First, the original text to be translated is read. The read English sentences are stored in the buffer S as shown in FIG.
It is stored in 1. A part of the main memory 2 shown in FIG. 1 is assigned to this buffer S1.

【００２２】バッファＳ１に格納された英語の文には、
前編集処理が施される。前編集処理はユーザにより行な
われる。もちろんこの前編集処理は、必要な場合のみ行
なわれる。In the English sentence stored in the buffer S1,
Pre-editing processing is performed. The pre-editing process is performed by the user. Of course, this pre-editing process is performed only when necessary.

【００２３】前編集処理が施された英文の例が図７に示
される。図７を参照して、第１文に用いられた前編集記
号“ｖ３ ”は、図５の文型指定記号の第３番目に相当
する。すなわちこの記号は、図７に示される第１文の動
詞“ｃａｌｌｅｄ”によって統括されるこの第１文全体
が第３文型であることを指定している。FIG. 7 shows an example of English text which has been preedited. With reference to FIG. 7, the pre-editing symbol “v3” used in the first sentence 5 corresponds to the third of the sentence pattern designating symbols in FIG. 5. That is, this symbol is the third sentence pattern in which the entire first sentence is governed by the verb “called” of the first sentence shown in FIG. 7. Has been specified.

【００２４】また第２文に用いられた前編集記号“＜
＜”および“＞＞”は、それぞれ図６に示されるように
フレーズ指定開始記号、フレーズ指定終了記号である。
この２種類の記号よって挟まれた部分は、翻訳の際の構
文解析において一まとまりのものとして取扱われる。The pre-editing symbol "<used in the second sentence
<”And“ >> ”are a phrase designation start symbol and a phrase designation end symbol, respectively, as shown in FIG.
The part sandwiched between these two types of symbols is handled as a group in the syntactic analysis during translation.

【００２５】このように入力原文に前編集記号を挿入す
ることにより翻訳処理の効率が向上し精度も向上するこ
とについては前に述べたとおりである。本発明は、翻訳
後の、前編集処理された入力テキストから前編集記号を
簡単に削除できる点に特徴がある。以下、前編集記号の
削除処理について説明する。As described above, the efficiency of translation processing is improved and the accuracy is improved by inserting the pre-editing symbol in the input original sentence. The present invention is characterized in that the preedit symbol can be easily deleted from the input text that has been preedited after translation. The process of deleting the pre-edit symbol will be described below.

【００２６】図２を参照して、まずステップＳ００１
で、前編集記号の挿入された文がバッファＳ１に格納さ
れる。格納した結果は図７に示されるとおりである。な
お、バッファＳ１に格納された入力テキストは、現在の
説明では翻訳直後のものと想定されている。しかし、翻
訳直後である必要はなく、過去に翻訳した文章の原文が
別途保存されていた場合に、その保存されていた文章を
バッファＳ１に格納して以後の前編集記号の削除処理を
行なうことも可能である。また翻訳が行なわれていない
文章を一旦バッファＳ１に格納し、前編集処理後に翻訳
文を生成し、残っている原文に対して翻訳処理直後に前
編集記号の削除を行なうことも可能である。Referring to FIG. 2, first, step S001.
Then, the sentence in which the preedit symbol is inserted is stored in the buffer S1. The stored result is as shown in FIG. 7. The input text stored in the buffer S1 is assumed to be immediately after translation in the present description. However, it does not have to be immediately after the translation, and when the original sentence of the sentence translated in the past is separately stored, the stored sentence is stored in the buffer S1 and the subsequent pre-edit symbol deletion processing is performed. Is also possible. It is also possible to temporarily store the untranslated sentence in the buffer S1, generate a translated sentence after the pre-editing process, and delete the pre-editing symbol for the remaining original sentence immediately after the translation process.

【００２７】続いてステップＳ００２で、バッファＳ１
の、現在の処理対象の文字位置を示すポインタが０にセ
ットされる。これによりポインタの指す文字はバッファ
Ｓ１に格納された文章の先頭の文字「Ｈ」となる。Then, in step S002, the buffer S1
The pointer indicating the current character position to be processed is set to 0. As a result, the character pointed to by the pointer becomes the first character "H" of the sentence stored in the buffer S1.

【００２８】さらにステップＳ００３では、バッファＳ
２の内容がクリアされる。バッファＳ２は、バッファＳ
１に格納された入力原文から前編集記号を除去した文を
一旦格納するのに用いられるものである。このバッファ
Ｓ２にも、図１に示されるメインメモリ２が割当てられ
る。Further, in step S003, the buffer S
The contents of 2 are cleared. The buffer S2 is the buffer S
It is used to temporarily store a sentence in which the preedit symbol is removed from the input original sentence stored in 1. The main memory 2 shown in FIG. 1 is also allocated to this buffer S2.

【００２９】ステップＳ００４では、バッファＳ２の、
文字書込位置を示すポインタがバッファＳ２の先頭を指
すように０にセットされる。At step S004, in the buffer S2,
The pointer indicating the character writing position is set to 0 so as to point to the beginning of the buffer S2.

【００３０】ステップＳ００５では、バッファＷがクリ
アされる。バッファＷは、原文から切出した文字列を単
語単位で一時的に格納するのにあてられる作業バッファ
である。バッファＷにも、図１に示されるメインメモリ
２が割当てられる。In step S005, the buffer W is cleared. The buffer W is a work buffer that is used to temporarily store a character string cut out from the original sentence in units of words. The main memory 2 shown in FIG. 1 is also allocated to the buffer W.

【００３１】ステップＳ００６では、バッファＷの文字
位置ポインタが０にセットされる。これによりバッファ
Ｗへの文字の書込位置はその先頭となる。In step S006, the character position pointer of the buffer W is set to 0. As a result, the writing position of the character in the buffer W becomes the beginning thereof.

【００３２】以上で、原文の文字列を検査するための準
備が完了する。Thus, the preparation for inspecting the original character string is completed.

【００３３】ステップＳ００７から実際の文字列検査が
開始される。まずステップＳ００７で、バッファＳ１の
文字位置ポインタが指している文字が検査される。その
文字が空白か括弧類であれば処理はステップＳ０２１に
進む。さもなければ処理はステップＳ００８に進む。The actual character string check is started from step S007. First, in step S007, the character pointed to by the character position pointer in buffer S1 is examined. If the character is blank or brackets, the process proceeds to step S021. Otherwise, the process proceeds to step S008.

【００３４】図７に示される文章を処理する場合には、
文字位置ポインタが当初指している位置０の文字は
「Ｈ」である。したがって、この場合処理はステップＳ
００８に進むことになる。When processing the sentence shown in FIG.
The character at position 0 that the character position pointer initially points to is “H”. Therefore, in this case, the processing is step S
It will proceed to 008.

【００３５】なお、本実施例では前述したように「括弧
類」という語は、図８に示される９種類の記号文字を指
すものとする。この場合記号「／」も含まれていること
に注意すべきである。In this embodiment, as described above, the term "brackets" refers to the nine types of symbol characters shown in FIG. Note that the symbol "/" is also included in this case.

【００３６】ステップＳ０２１〜Ｓ０２４の処理につい
ては後述する。The processing of steps S021 to S024 will be described later.

【００３７】ステップＳ００８では、バッファＳ１の文
字位置ポインタが指す位置の文字が、バッファＷの、文
字位置ポインタの指している位置にコピーされる。In step S008, the character at the position indicated by the character position pointer in buffer S1 is copied to the position indicated by the character position pointer in buffer W.

【００３８】図７に示される例の場合では、バッファＳ
１の文字「Ｈ」が、バッファＷの先頭位置にコピーされ
る。In the case of the example shown in FIG. 7, the buffer S
The character "H" of 1 is copied to the beginning position of the buffer W.

【００３９】続いてステップＳ００９で、バッファＷの
文字位置ポインタが１加算される。Subsequently, in step S009, 1 is added to the character position pointer of the buffer W.

【００４０】図３を参照して、ステップＳ０１０で、バ
ッファＳ１の文字位置ポインタが指している文字が
「」であるか否かが判定される。すなわちこの場合、
ステップＳ００８でバッファＳ１からバッファＷにコピ
ーした文字が「」であるか否かが判定される。判定の
結果がＹＥＳであれば処理はステップＳ０２０に、さも
なければ処理はステップＳ０１１に進む。ステップＳ０
２０以下の処理については後述する。文字「」は、図
５に示される第１の種類の前編集記号の末尾に必ず用い
られている文字である。したがってバッファＳ１の文字
列中にこの文字が検出された場合には、その位置に前編
集記号が存在する可能性が高いと考えられる。Referring to FIG. 3, in step S010, the character pointed by the character position pointer in buffer S1 is " It is determined whether or not That is, in this case,
The character copied from buffer S1 to buffer W in step S008 is " It is determined whether or not If the determination result is YES, the process proceeds to step S020; otherwise, the process proceeds to step S011. Step S0
The processing of 20 or less will be described later. letter" "Is a character always used at the end of the first type of pre-edit symbol shown in FIG. Therefore, when this character is detected in the character string of the buffer S1, it is highly likely that the pre-edit symbol exists at that position.

【００４１】図７に示される例の場合にはバッファＳ１
の現在の文字位置ポインタが指している文字は「Ｈ」で
あるので、処理はステップＳ０１１に進む。In the case of the example shown in FIG. 7, the buffer S1
Since the character pointed by the current character position pointer of is “H”, the process proceeds to step S011.

【００４２】ステップＳ０１１では、バッファＳ１の文
字位置ポインタが１進められる。その結果、バッファＳ
１の文字位置ポインタが指している文字は「ｅ」とな
る。In step S011, the character position pointer in the buffer S1 is advanced by 1. As a result, the buffer S
The character pointed by the character position pointer 1 is “e”.

【００４３】ステップＳ０１２では、バッファＳ１の文
字位置ポインタ以降に文字が存在するかどうかについて
の判断が行なわれる。文字が存在しないと判断されれば
前編集記号削除機能はこれで終了する。すなわち処理は
ステップＳ０２７に進み、バッファＷ内の文字列を、バ
ッファＳ２内の、現在のポインタ１以降にコピーする処
理が行なわれ、全体の処理が終了する。バッファＳ１に
次の文字がある場合には処理はステップＳ０１３に進
む。In step S012, it is determined whether or not a character exists after the character position pointer in the buffer S1. If it is determined that the character does not exist, the pre-edit symbol deletion function ends. That is, the process proceeds to step S027, the process of copying the character string in the buffer W to the current pointer 1 or later in the buffer S2 is performed, and the entire process ends. If there is the next character in the buffer S1, the process proceeds to step S013.

【００４４】図７に示される英文の場合、バッファＳ１
の文字位置ポインタが指している位置には文字「ｅ」が
存在する。したがって、ステップＳ０１２における判断
の結果はＹＥＳとなり、処理はステップＳ０１３に進
む。In the case of the English sentence shown in FIG. 7, the buffer S1
The character "e" is present at the position pointed to by the character position pointer. Therefore, the result of the determination in step S012 is YES, and the process proceeds to step S013.

【００４５】ステップＳ０１３では、バッファＳ１の文
字位置ポインタが指している文字が空白か括弧類である
かどうかについての判断が行なわれる。空白または括弧
類である場合には処理はステップＳ０１４に、さもなけ
れば処理は図２のステップＳ００８に戻る。In step S013, it is determined whether the character pointed by the character position pointer in the buffer S1 is blank or parenthesized. If it is blank or brackets, the process returns to step S014, otherwise the process returns to step S008 in FIG.

【００４６】バッファＳ１の文字位置ポインタが指して
いる文字がもし空白であればその箇所は単語の切目であ
ると考えられる。文字が括弧類であれば図６に示される
前編集記号がその箇所に存在する可能性がある。もしそ
うであればその箇所は単語の切目でもある。If the character pointed to by the character position pointer in buffer S1 is blank, then that part is considered to be a word break. If the characters are parentheses, the pre-edit symbol shown in FIG. 6 may be present at that location. If so, that part is also a word break.

【００４７】図７に示される例文の場合には、文字位置
ポインタが指している文字は「ｅ」である。したがって
ステップＳ０１３における判断の答えはＮＯとなり処理
は図２のステップＳ００８に戻る。In the case of the example sentence shown in FIG. 7, the character pointed by the character position pointer is "e". Therefore, the answer to the determination in step S013 is NO, and the process returns to step S008 in FIG.

【００４８】以下同様に、バッファＳ１の文字位置ポイ
ンタが指している文字「ｅ」につき、図２のステップＳ
００８、Ｓ００９、図３のステップＳ０１０、Ｓ０１
１、Ｓ０１２が順に実行される。その結果文字「ｅ」が
バッファＷに追加される。Similarly, for the character "e" pointed to by the character position pointer in the buffer S1, step S in FIG.
008, S009, steps S010, S01 of FIG.
1 and S012 are sequentially executed. As a result, the character "e" is added to the buffer W.

【００４９】処理がステップＳ０１３に進んだ場合を考
える。この時点ではバッファＷの内部には図９に示され
るように文字列「Ｈｅ」が格納されている。バッファＳ
１の文字位置ポインタが指している文字は、図７を参照
して、「ｅ」の次の空白位置である。したがって、ステ
ップＳ０１３での判定の結果、処理はステップＳ０１４
に進む。Consider a case where the process proceeds to step S013. At this point, the character string "He" is stored in the buffer W as shown in FIG. Buffer S
The character pointed to by the character position pointer 1 is the blank position next to "e" with reference to FIG. Therefore, as a result of the determination in step S013, the processing is step S014.
Proceed to.

【００５０】ステップＳ０１４では、バッファＳ１の文
字位置ポインタが指している文字が空白であるかどうか
についての判断が行なわれる。空白であれば処理は図４
のステップＳ０１５に進み、それ以外の場合（すなわち
括弧類である場合）には処理はステップＳ０２５に進
む。In step S014, a determination is made as to whether the character pointed to by the character position pointer in buffer S1 is blank. If it is blank, the process is shown in FIG.
Processing proceeds to step S015, and in other cases (that is, parentheses), the processing proceeds to step S025.

【００５１】図７に示される例文の場合にはバッファＳ
１の文字位置ポインタによって指し示される文字は空白
である。したがって処理はステップＳ０１５（図４参
照）に進む。In the case of the example sentence shown in FIG. 7, the buffer S
The character pointed to by the 1 character position pointer is blank. Therefore, the process proceeds to step S015 (see FIG. 4).

【００５２】ステップＳ０１５まで処理が進んだ場合、
バッファＳ１の文字位置ポインタが１つの単語を完全に
通過したことになる。この間単語の各文字はバッファＷ
に順次コピーされている。したがってバッファＷに格納
されている文字列は１つの完全な単語である。ステップ
Ｓ０１５では、このバッファＷに格納されている文字列
をバッファＳ２の文字位置ポインタの指す位置以降にコ
ピーする処理が行なわれる。この結果、バッファＳ２の
内部の先頭には図１０に示されるように単語「Ｈｅ」が
コピーされる。When the process proceeds to step S015,
The character position pointer in buffer S1 has completely passed one word. During this time, each character of the word is buffer W
Have been sequentially copied to. Therefore, the character string stored in the buffer W is one complete word. In step S015, a process of copying the character string stored in this buffer W to a position after the character position pointer of buffer S2 is performed. As a result, the word "He" is copied to the top of the inside of the buffer S2 as shown in FIG.

【００５３】次のステップＳ０１６では、バッファＳ２
の文字位置ポインタをバッファＷの文字数分だけ進める
処理が行なわれる。これにより文字位置ポインタはコピ
ーされた文字の次の位置を指し示すことになる。In the next step S016, the buffer S2
A process of advancing the character position pointer by the number of characters in the buffer W is performed. This causes the character position pointer to point to the next position of the copied character.

【００５４】図７に示される例文の処理の場合、現在処
理対象となっている単語「Ｈｅ」は２文字の単語であ
る。したがってＳ０１６の処理によりバッファＳ２の文
字位置ポインタが２進められる。この結果、図１０に示
されるバッファＳ２の文字位置ポインタは文字「ｅ」の
次の位置を指し示す。In the case of the processing of the example sentence shown in FIG. 7, the word "He" currently being processed is a two-character word. Therefore, the character position pointer of the buffer S2 is advanced by 2 by the processing of S016. As a result, the character position pointer of the buffer S2 shown in FIG. 10 points to the next position of the character "e".

【００５５】次のステップＳ０１７では、バッファＳ２
の文字位置ポインタが指している位置に空白をセットす
る処理が行なわれる。これは、バッファＳ２のおいて単
語の切目を生成するための処理である。In the next step S017, the buffer S2
The process of setting a blank at the position pointed to by the character position pointer is performed. This is a process for generating a word break in the buffer S2.

【００５６】ステップＳ０１８では、挿入された空白の
分だけバッファＳ２のポインタを進める処理が行なわれ
る。すなわちバッファＳ２の文字位置ポインタが１加算
される。同様にしてステップＳ０１９ではバッファＳ１
の文字位置ポインタも１加算される。ステップＳ０１９
の後処理は図２のステップＳ００５に戻る。In step S018, a process of advancing the pointer of buffer S2 by the amount of the inserted blank is performed. That is, the character position pointer of the buffer S2 is incremented by 1. Similarly, in step S019, the buffer S1
The character position pointer of is also incremented by 1. Step S019
The post-processing returns to step S005 in FIG.

【００５７】ステップＳ００５では、バッファＷがクリ
アされ、ステップＳ００６ではバッファＷの文字位置ポ
インタが０にセットされる。このようにバッファＷをク
リアするのは、処理中の単語「Ｈｅ」をバッファＳ２に
コピーする処理が完了し、バッファＷに格納されている
単語はもはや不要となったためである。The buffer W is cleared in step S005, and the character position pointer of the buffer W is set to 0 in step S006. The reason why the buffer W is cleared in this way is that the process of copying the word “He” being processed into the buffer S2 is completed, and the word stored in the buffer W is no longer needed.

【００５８】以上、バッファＳ１に格納されている文中
の単語「Ｈｅ」を例にとり、前編集記号の付いていない
単語についてバッファＳ１からバッファＳ２にコピーを
行なう手順を示した。上述の説明から明らかなように、
前編集記号の付いていない単語は、図１０に示されるよ
うにバッファＳ１からバッファＳ２にそのままの形でコ
ピーされる。In the above, the procedure of performing copying from the buffer S1 to the buffer S2 for a word without a preedit symbol has been described by taking the word "He" in the sentence stored in the buffer S1 as an example. As is clear from the above description,
Words without a preedit symbol are copied as is from buffer S1 to buffer S2 as shown in FIG.

【００５９】図５に示される第１の種類の前編集記号の
ある箇所は、次のようにして処理される。図７に示され
る例文の「Ｈｅ」の次の単語「ｖ３ｃａｌｌｅｄ」に
は、図５に示される第１の種類の前編集記号「ｖ３」
が付いている。以下、「ｖ３ｃａｌｌｅｄ」に対する処
理を例にとり、図５に示される第１の種類の前編集記号
が付加された単語についての処理を説明する。In the first type of pre-edit symbol shown in FIG.
Some parts are processed as follows. Shown in Figure 7
Word "v3" next to "He" in the example sentence "called"
Is the first type of preedit symbol "v3 "
Is attached. Hereinafter, "v3 for "called"
As an example, the first type of pre-edit symbol shown in FIG.
The process for a word added with will be described.

【００６０】「Ｈｅ」のコピーが終了した時点で、処理
はステップＳ００７にある。バッファＳ１の文字位置ポ
インタは「ｖ３ｃａｌｌｅｄ」の「ｖ」を指してい
る。バッファＷの内部には文字は存在しない。バッファ
Ｗの文字位置ポインタはバッファＷの先頭を指してい
る。バッファＳ２の内容は図１０に示されるとおりであ
る。なお、単語「Ｈｅ」の次には空白文字が１個存在す
る。そしてバッファＳ２の文字位置ポインタは、その空
白の次の位置を指している。When the copying of "He" is completed, the processing is in step S007. The character position pointer of the buffer S1 is "v3 "v" of "called". No character exists inside the buffer W. The character position pointer of the buffer W points to the beginning of the buffer W. The contents of the buffer S2 are as shown in FIG. There is one blank character after the word “He”. The character position pointer in the buffer S2 points to the position next to the blank.

【００６１】図２を参照して、ステップＳ００７で、バ
ッファＳ１の現在の文字位置ポインタが指している文字
は空白ではない。したがって処理はステップＳ００８に
進む。以下、ステップＳ００８、Ｓ００９、図３のステ
ップＳ０１０、Ｓ０１１、Ｓ０１２、Ｓ０１３、図２の
ステップＳ００８のループを２回回ることにより、文字
列「ｖ３」がバッファＷにコピーされる。バッファＳ１
の文字位置ポインタは「ｖ３」の次の「」を指してい
る。Referring to FIG. 2, in step S007, the character pointed to by the current character position pointer in buffer S1 is not blank. Therefore, the process proceeds to step S008. Hereinafter, the character string “v3” is copied to the buffer W by repeating the loop of steps S008, S009, steps S010, S011, S012, S013 of FIG. 3, and step S008 of FIG. 2 twice. Buffer S1
The character position pointer of is "v3" Is pointing to.

【００６２】そしてステップＳ００８、Ｓ００９の処理
により、文字「」がバッファＷに追加される。この結
果、バッファＷの格納内容は図１１に示されるとおりと
なる。図３を参照して、ステップＳ０１０では、バッフ
ァＳ１の文字位置ポインタが指している文字が「」で
あるので判断の答えはＹＥＳとなり処理はＳ０２０に進
む。Then, by the processing of steps S008 and S009, the character " Is added to the buffer W. As a result, the contents stored in the buffer W are as shown in FIG. Referring to FIG. 3, in step S010, the character pointed by the character position pointer in buffer S1 is " Since the answer is YES, the process proceeds to S020.

【００６３】Ｓ０２０ではバッファＷの内容が図５に示
される前編集記号のいずれかと一致しているか否かにつ
いての判断が行なわれる。いずれとも一致していない場
合には処理はステップＳ０１１に進む。この場合の文字
「」は前編集記号とは関係ない単語の一部であると判
断されるためである。またバッファＷ内の文字列が前編
集記号のいずれかと一致する場合には処理はステップＳ
０２８に進む。At S020, a determination is made as to whether the contents of buffer W match any of the pre-edit symbols shown in FIG. If none of them match, the process proceeds to step S011. In this case, the character " This is because "" is determined to be a part of a word that is unrelated to the preedit symbol. If the character string in the buffer W matches any of the pre-edit symbols, the process proceeds to step S.
Proceed to 028.

【００６４】図７に示される例文の場合には、バッファ
Ｗの内容は図１１に示されるように文字列「ｖ３」で
ある。図５からわかるようにこれは前編集記号の一種で
ある。したがって、ステップＳ０２０における判断の結
果はＹＥＳとなり処理はステップＳ０２８に進む。In the case of the example sentence shown in FIG. 7, the content of the buffer W is the character string "v3" as shown in FIG. It is. As can be seen from FIG. 5, this is a kind of preedit symbol. Therefore, the result of the determination in step S020 is YES, and the process proceeds to step S028.

【００６５】ステップＳ０２８では、バッファＳ１の文
字位置ポインタが１進められる。これによりバッファＳ
１の文字位置ポインタによって示される文字は「ｃ」と
なる。ステップＳ０２８の後処理はステップＳ００５に
戻る。In step S028, the character position pointer in the buffer S1 is advanced by 1. This makes the buffer S
The character indicated by the character position pointer of 1 is "c". The post-processing of step S028 returns to step S005.

【００６６】再びステップＳ００５、Ｓ００６によって
バッファＷがクリアされる。バッファＷに格納されてい
た単語は前編集記号であるために、バッファＳ２にコピ
ーする必要はなく、さらに処理をこの前編集記号の次の
単語に進める必要があるからである。The buffer W is cleared again in steps S005 and S006. Since the word stored in the buffer W is the pre-edit symbol, it is not necessary to copy it to the buffer S2, and it is necessary to further advance the process to the word next to this pre-edit symbol.

【００６７】以下、再び図７を参照して、バッファＳ１
の現在の文字位置ポインタが指している文字は「ｃ」で
ある。したがって図２を参照して、処理はステップＳ０
０７からＳ００８に進む。さらにステップＳ００８〜Ｓ
０１３→Ｓ００８のループを６回繰り返して実行するこ
とにより、バッファＳ１内の文字列「ｃａｌｌｅｄ」が
バッファＷにコピーされる。このときバッファＳ１の文
字位置ポインタは「ｃａｌｌｅｄ」の次の空白を指す。Hereinafter, referring again to FIG. 7, the buffer S1
The character pointed to by the current character position pointer of is "c". Therefore, referring to FIG. 2, the process proceeds to step S0.
It progresses from 07 to S008. Further steps S008-S
By repeating the loop of 013 → S008 six times, the character string “called” in the buffer S1 is copied to the buffer W. At this time, the character position pointer of the buffer S1 points to the space next to "called".

【００６８】さらに、ステップＳ００８〜Ｓ０１５と処
理が進む。これにより、図１２に示されるようにバッフ
ァＷに格納されている単語「ｃａｌｌｅｄ」がバッファ
Ｓ２に追加される。Further, the processing proceeds to steps S008 to S015. As a result, the word "called" stored in the buffer W is added to the buffer S2 as shown in FIG.

【００６９】さらにステップＳ０１６〜Ｓ０１８の処理
によりバッファＳ２にさらに空白文字が追加される。こ
れによりバッファＳ２の内容は図１３に示されるように
なる。Further, a blank character is added to the buffer S2 by the processing of steps S016 to S018. As a result, the contents of the buffer S2 become as shown in FIG.

【００７０】さらに図４に示されるステップＳ０１９の
処理によって、バッファＳ１の文字位置ポインタは「ｃ
ａｌｌｅｄ」の次の空白の次の文字「ｈ」を指するよう
になる。さらに図２に示されるステップＳ００５、Ｓ０
０６に処理が戻り、バッファＷがクリアされる。Further, by the processing of step S019 shown in FIG. 4, the character position pointer of the buffer S1 is "c".
It comes to point to the character "h" next to the blank after "alled". Further, steps S005 and S0 shown in FIG.
The processing returns to 06, and the buffer W is cleared.

【００７１】以上の処理によりバッファＳに格納されて
いる前編集記号付きの単語「ｖ３ｃａｌｌｅｄ」は、前
編集記号の除去された単語「ｃａｌｌｅｄ」となってバ
ッファＳ２にコピーされる。バッファＳ２の内容は図１
３に示されている。By the above processing, the data is stored in the buffer S.
The word with a pre-edit symbol "v3 Called "is before
The word "called" with the editing symbol removed is displayed as a bar.
Is copied to the Saffa S2. The contents of the buffer S2 are shown in FIG.
3 is shown.

【００７２】図７に示される例文の場合、「ｂｏｕｇｈ
ｔ」までの部分には前編集記号が存在しない。したがっ
て、この間のそれぞれの単語の処理は前述した「Ｈｅ」
の単語の処理と同様に行なわれる。単語「ｂｏｕｇｈ
ｔ」までの処理が終了した段階では、バッファＳ２の格
納内容は図１４に示されるようになっている。In the case of the example sentence shown in FIG. 7, "bough
There is no preedit symbol in the part up to t. Therefore, the processing of each word during this period is performed by the above-mentioned "He".
It is performed in the same manner as the processing of the word. The word "bough
The contents stored in the buffer S2 are as shown in FIG.

【００７３】図７を参照して、単語「ｂｏｕｇｈｔ」と
その次の単語「ｎｅｃｋｌａｃｅｓ」との間には前編集
記号「＜＜」が存在する。以下、この前編集記号「＜
＜」の処理を例にとり、図６に示される第２の種類の前
編集記号に対する処理を説明する。With reference to FIG. 7, there is a preedit symbol "<<" between the word "boot" and the next word "necklaces". Hereafter, the previous editing symbol "<
The process for the second type of pre-edit symbol shown in FIG. 6 will be described by taking the process of "<" as an example.

【００７４】図７に示される例文の単語「ｂｏｕｇｈ
ｔ」までの処理が終了した段階で、処理ステップは図２
のステップＳ００７となっている。バッファＳ１の文字
位置ポインタは前編集記号「＜＜」のうちの左側の
「＜」を指す。バッファＷの内容はクリアされており、
その文字位置ポインタはバッファＷの先頭を指してい
る。バッファＳ２の内容は図１４に示されるとおりであ
る。図１４において、単語「ｂｏｕｇｈｔ」の次には空
白文字が１個存在する。文字位置ポインタはその空白の
次の位置を指している。The word "bough" in the example sentence shown in FIG.
When the processing up to t ”is completed, the processing steps are as shown in FIG.
Is step S007. The character position pointer of the buffer S1 points to "<" on the left side of the preedit symbol "<<". The contents of buffer W have been cleared,
The character position pointer points to the beginning of the buffer W. The contents of the buffer S2 are as shown in FIG. In FIG. 14, there is one blank character after the word "boot". The character position pointer points to the position after the blank.

【００７５】前編集記号「＜＜」の処理において、ステ
ップＳ００７（図２参照）で、判断の結果はＹＥＳとな
る。したがって処理はステップＳ０２１に進む。In the processing of the pre-edit symbol "<<", the result of the determination is YES in step S007 (see FIG. 2). Therefore, the process proceeds to step S021.

【００７６】ステップＳ０２１では、バッファＳ１のポ
インタが指している文字が空白であるか否かについての
判断が行なわれる。空白である場合には処理はステップ
Ｓ０２２に進む。空白でない場合、すなわち括弧類であ
る場合には処理はステップＳ０２３に進む。In step S021, it is determined whether or not the character pointed by the pointer of the buffer S1 is blank. If it is blank, the process proceeds to step S022. If it is not blank, that is, if it is in parentheses, the process proceeds to step S023.

【００７７】現在の状態ではポインタの指す文字は
「＜」であるため処理はステップＳ０２３に進む。In the current state, the character pointed to by the pointer is "<", and the process proceeds to step S023.

【００７８】ステップＳ０２３では、次の文字が同じ文
字、すなわち現在の場合「＜」であるかどうかについて
の判断が行なわれる。同じでない場合には処理はステッ
プＳ００８に、同じ場合には処理はステップＳ０２４に
進むるステップＳ０２３の処理は次の理由により行なわ
れる。同じ括弧類であっても前編集記号として使われる
場合には同じ括弧類が２個使用される。もし括弧類が単
独で存在するならばそれは文章中の記号としての単なる
括弧であると考えられる。ステップＳ０２３では、した
がって同じ括弧類が２つ続いて発生する場合のみ処理を
ステップＳ０２４に進ませ、それ以外の場合には通常の
場合と同様にステップＳ００８に進ませる。In step S023, it is determined whether or not the next character is the same character, that is, "<" in the present case. If not the same, the process proceeds to step S008, and if the same, the process proceeds to step S024. The process of step S023 is performed for the following reason. If the same parenthesis is used as a preedit symbol, the same two parentheses are used. If brackets are present alone, they are considered to be just brackets as symbols in the text. Therefore, in step S023, therefore, the process proceeds to step S024 only when the same pair of parentheses occurs consecutively, and otherwise proceeds to step S008 as in the normal case.

【００７９】ステップＳ０２４では、バッファＳ１の文
字位置ポインタが２進められる。これは、括弧類を用い
た前編集記号の場合には、２つの括弧類で１組として使
用されているからである。ステップＳ０２４の処理の結
果、バッファＳ１の文字位置ポインタは「＜」の次の空
白を指すことになる。ステップＳ０２４の後処理はステ
ップＳ００７に戻る。In step S024, the character position pointer in the buffer S1 is advanced by 2. This is because, in the case of the pre-edit symbol using parentheses, the two parentheses are used as one set. As a result of the processing in step S024, the character position pointer of the buffer S1 points to the blank space next to “<”. The post-processing of step S024 returns to step S007.

【００８０】ステップＳ００７では、バッファＳ１の１
ポインタが指す文字が再び検査される。そして空白、括
弧類であれば処理はステップＳ０２１、それ以外の場合
には処理はステップＳ００８に進む。At step S007, 1 of the buffer S1
The character pointed to by the pointer is checked again. If it is blank or brackets, the process proceeds to step S021, and otherwise, the process proceeds to step S008.

【００８１】図７に示される例文の場合にはポインタの
指す文字は空白である。したがって、この場合処理はス
テップＳ０２１に、さらにステップＳ０２２に進む。そ
してステップＳ０２２ではバッファＳ１のポインタが１
進められる。その結果、バッファＳ１の文字位置ポイン
タは単語「ｎｅｃｋｌａｃｅｓ」の先頭文字「ｎ」を指
す。In the case of the example sentence shown in FIG. 7, the character pointed to by the pointer is blank. Therefore, in this case, the process proceeds to step S021, and further proceeds to step S022. Then, in step S022, the pointer of the buffer S1 is 1
Can be advanced. As a result, the character position pointer of buffer S1 points to the first character "n" of the word "necklaces".

【００８２】以上の処理により、バッファＳ１に存在す
る前編集記号「＜＜」をバッファＷはバッファＳ２にコ
ピーすることなく、バッファＳ１の文字位置ポインタが
この前編集記号「＜＜」を通過する。前編集記号「＜
＜」がバッファＳ２にコピーされることはない。By the above processing, the character position pointer of the buffer S1 passes through the pre-edit symbol "<<" without copying the pre-edit symbol "<<" existing in the buffer S1 to the buffer S2. . Previous edit symbol "<
<”Is never copied to the buffer S2.

【００８３】図７に示される例文の場合、次の単語「ｎ
ｅｃｋｌａｃｅｓ」の処理は単語「Ｈｅ」の処理と同様
に行なわれる。この結果、バッファＳ２の格納内容は図
１５に示されるとおりとなる。バッファＳ１の単語「ｂ
ｏｕｇｈｔ」と「ｎｅｃｋｌａｃｅｓ」との間に存在す
る前編集記号「＜＜」は、バッファＳ２の文中には存在
しない。In the case of the example sentence shown in FIG. 7, the next word "n"
The processing of “ecklaces” is performed in the same manner as the processing of the word “He”. As a result, the contents stored in the buffer S2 are as shown in FIG. The word "b" in buffer S1
The pre-edit symbol "<<" existing between "out" and "necklaces" does not exist in the sentence of the buffer S2.

【００８４】以下、図７に示される単語「ｏｆ」以後の
処理は、以上の方法を順次適用することによって可能で
ある。Hereinafter, the processing after the word "of" shown in FIG. 7 can be performed by sequentially applying the above method.

【００８５】最後の単語「ｓｉｌｖｅｒ．」の処理が終
了した時点を考える。バッファＳ１の文字位置ポインタ
は「ｓｉｌｖｅｒ．」の末尾の「．」を指している。バ
ッファＷに文字列「ｓｉｌｖｅｒ．」が格納されてい
る。この時点では、処理は図２のステップＳ００９であ
る。ステップＳ００９から図３のステップＳ０１０、Ｓ
０１１、Ｓ０１２と処理が進む。ステップＳ０１２にお
ける判断では、「．」の次の文字はもはや存在しないた
めに、処理はステップＳ０２７に進むことになる。Consider the time when the processing of the last word "silver." Is completed. The character position pointer of the buffer S1 points to “.” At the end of “silver.”. The character string “silver.” Is stored in the buffer W. At this point, the process is step S009 in FIG. Steps S009 to S010 and S in FIG.
The processing proceeds from 011 and S012. In the determination in step S012, since the character next to "." Is no longer present, the process proceeds to step S027.

【００８６】ステップＳ０２７では、既に説明したよう
にバッファＷの内容「ｓｉｌｖｅｒ．」をバッファＳ２
に追加する。これによりバッファＳ２の内容が完成し、
全体の処理は終了する。In step S027, the content "silver." Of the buffer W is added to the buffer S2 as described above.
Add to. This completes the contents of buffer S2,
The whole process ends.

【００８７】以上の処理が終了した時点でバッファＳ２
の格納内容は図１６に示されるとおりとなる。図７に示
される例文のうち、図５、図６に示される前編集記号は
完全に除去されている。When the above processing is completed, the buffer S2
The contents stored in are as shown in FIG. Of the example sentences shown in FIG. 7, the pre-edit symbols shown in FIGS. 5 and 6 have been completely removed.

【００８８】なお、図３に示されるフローチャートのう
ち、ステップＳ０１４、Ｓ０２５、Ｓ０２６の処理は、
図２に示されるステップＳ０２１、Ｓ０２３、Ｓ０２４
の処理と全く同じである。このようにステップＳ０１４
以下の処理を設けているのは次のような理由による。既
に説明したように図６に示される第１の種類の前編集記
号の場合、その前後に空白が存在しない場合があり得
る。そのような場合、ステップＳ０２１、Ｓ０２３、Ｓ
０２４の処理では対応することができない。そのため
に、ステップＳ０１３の時点から、図６に示される第２
の種類の前編集記号に対する処理をする手順としてこの
ステップＳ０１４、Ｓ０２５、Ｓ０２６のルーチンが設
けられている。In the flow chart shown in FIG. 3, the processes of steps S014, S025 and S026 are as follows.
Steps S021, S023, S024 shown in FIG.
It is exactly the same as the processing of. In this way, step S014
The following processing is provided for the following reason. As described above, in the case of the first type of pre-edit symbol shown in FIG. 6, there may be no blank before and after it. In such a case, steps S021, S023, S
The processing of 024 cannot handle this. Therefore, from the time of step S013, the second time shown in FIG.
The routines of steps S014, S025, and S026 are provided as a procedure for processing the pre-edit symbols of types.

【００８９】以上のように本発明の機械翻訳装置では、
前編集処理がされた入力テキストのうち、前編集記号の
みを取除いた原文が自動的に生成される。従来のように
ユーザが文書編集機能を用いて前編集記号を１つずつ探
索して手作業で削除していく必要はない。そのため、翻
訳後の文書を効率よく利用することができる。As described above, in the machine translation device of the present invention,
Of the input text that has been preedited, the original text from which only the preedit symbols have been removed is automatically generated. There is no need for the user to manually search for the pre-edit symbols one by one using the document editing function and delete them manually as in the prior art. Therefore, the translated document can be efficiently used.

【００９０】[0090]

【発明の効果】以上のように本発明の機械翻訳装置で
は、前処理後入力原文から、前編集のための所定の記号
を除去した分が前編集記号除去手段によって生成され
る。前編集記号を手作業によって１つずつ探索し、除去
していく必要はない。As described above, in the machine translation apparatus of the present invention, the pre-editing symbol removing means generates the pre-edited input original sentence by removing a predetermined symbol for pre-editing. It is not necessary to manually search and remove the preedit symbols one by one.

【００９１】その結果、翻訳のための前処理がされた入
力原文の文章を、容易に文として利用することができ
る。As a result, the sentence of the input original sentence which has been preprocessed for translation can be easily used as a sentence.

[Brief description of drawings]

【図１】本発明の一実施例の機械翻訳装置のブロック図
である。FIG. 1 is a block diagram of a machine translation device according to an embodiment of the present invention.

【図２】前編集記号除去処理の前半のフローチャートで
ある。FIG. 2 is a first half flowchart of a pre-editing symbol removal process.

【図３】前編集記号除去処理の中間のフローチャートで
ある。FIG. 3 is an intermediate flowchart of a pre-editing symbol removal process.

【図４】前編集記号除去処理の後半のフローチャートで
ある。FIG. 4 is a second half flowchart of the pre-editing symbol removal processing.

【図５】第１の種類の前編集記号を表形式で示す図であ
る。FIG. 5 is a diagram showing a first type of pre-edit symbol in a table format.

【図６】第２の種類の前編集記号を表形式で示す図であ
る。FIG. 6 is a diagram showing a second type of pre-edit symbol in a table format.

【図７】バッファＳ１の格納内容を示す模式図である。FIG. 7 is a schematic diagram showing stored contents of a buffer S1.

【図８】括弧類を示す模式図である。FIG. 8 is a schematic diagram showing brackets.

【図９】バッファＷの内容を示す模式図である。FIG. 9 is a schematic diagram showing the contents of a buffer W.

【図１０】バッファＳ２の模式図である。FIG. 10 is a schematic diagram of a buffer S2.

【図１１】バッファＷの模式図である。FIG. 11 is a schematic diagram of a buffer W.

【図１２】バッファＷの模式図である。FIG. 12 is a schematic diagram of a buffer W.

【図１３】バッファＳ２の模式図である。FIG. 13 is a schematic diagram of a buffer S2.

【図１４】バッファＳ２の模式図である。FIG. 14 is a schematic diagram of a buffer S2.

【図１５】バッファＳ２の模式図である。FIG. 15 is a schematic diagram of a buffer S2.

【図１６】バッファＳ２の模式図である。FIG. 16 is a schematic diagram of a buffer S2.

[Explanation of symbols]

１メインＣＰＵ２メインメモリ３表示装置４キーボード５翻訳モジュール６メモリ７プリンタ 1 main CPU 2 main memory 3 display device 4 keyboard 5 translation module 6 memory 7 printer

Claims

[Claims]

1. A preprocessed input source sentence storage unit for storing a preprocessed input source sentence in which a predetermined symbol for assisting automatic translation is inserted as a preprocess, and the preprocessed input source sentence is stored in the storage unit. Automatic translation means for translating into a different language while using a predetermined symbol as an aid of translation, and a sentence obtained by removing the predetermined symbol from the preprocessed input source text stored in the preprocessed input source text storage means. And a preprocessing symbol removing means for generating the.