JPH0434644A

JPH0434644A - Machine translation system

Info

Publication number: JPH0434644A
Application number: JP2139986A
Authority: JP
Inventors: Mariko Kimura; 真理子木村; Yumiko Yoshimura; 裕美子吉村; Yoshimi Asahioka; 旭岡　佳美; Katsumi Tanaka; 克己田中
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1990-05-31
Filing date: 1990-05-31
Publication date: 1992-02-05

Abstract

PURPOSE:To efficiently execute translation processing by deleting a blank part in an unknown word which can not be retrieved in an original sentence or substitute a code for the blank part and executing word retrieval again. CONSTITUTION:The machine translation system is constituted of an input part 1, an original sentence storing part 2, a translated sentence storing part 3, an edition control part 4, a translation part 5, a translation dictionary 6, a display control part 7, a display part 8, and a printing part 9. The translation part 5 deletes a blank part in an unknown word which can not be retrieved at the time of executing word retrieval for retrieving words corresponding to an original sentence while referring the dictionary 6 or substiture a code for the blank part and then executes word retrieval again. Even when a word including a space exists in an inputted original sentence, the word is surely retrieved and translation processing is efficiently executed.

Description

【発明の詳細な説明】［発明の目的コ（産業上の利用分野）この発明は入力原文の単語から翻訳辞書の見出し語の検
索を行なう単語検索を改良した機械翻訳システムに関す
る。DETAILED DESCRIPTION OF THE INVENTION [Purpose of the Invention (Industrial Application Field) The present invention relates to a machine translation system that improves word search for searching for headwords in a translation dictionary from words in an input original text.

（従来の技術）近年、コンピュータを利用して入力原文を自動的に機械
翻訳するシステムが注目されている。(Prior Art) In recent years, systems that automatically machine translate input original texts using computers have been attracting attention.

従来の機械翻訳システムは、基本的には入力原文を辞書
引き、形態解析、構文解析するなどして語（句）等の所
定の処理単位に区分し、その処理単位ごとに翻訳辞書を
検索して対応する訳語（訳語句）等を見出し、これを所
定の訳文規則にしたがって結合する等してその訳文を得
るように構成されている。辞書引き処理、形態素処理で
は、通常、入力文字列に対して、語用変化規則、話尾変
化規則などの文法的な変形情報を用いて辞書項目とのマ
ツチングを行つており、「話さ」、「話し」、「話す」
などが翻訳辞書の辞書項目にマツチした場合には検索で
きたことになる。すると、後の構文解析等を行なうこと
ができる。また、辞書引き、形態素処理で単語として認
識されない文字列は未知語として処理されることになる
。Conventional machine translation systems basically divide the input source text into predetermined processing units such as words (phrases) by looking them up in a dictionary, morphologically analyzing them, and parsing their syntax, and then search a translation dictionary for each processing unit. The system is configured to find corresponding translated words (translated phrases), etc., and combine them according to predetermined translation rules to obtain the translated text. In dictionary lookup processing and morphological processing, input character strings are usually matched with dictionary entries using grammatical transformation information such as pragmatic inflection rules and suffix inflection rules. "talk", "talk"
If it matches a dictionary entry in the translation dictionary, it means that the search was successful. Then, later syntax analysis etc. can be performed. Further, a character string that is not recognized as a word by dictionary lookup or morphological processing will be processed as an unknown word.

ところが実際の文書では、前記文法的な処理で対応でき
ない場合が“ある。例えば、「ディスク」という単語を
入力する際にオペレータの入力ミス等により、「デイ　
スフ」　（以下、空白を表わす記号としてａ　′を用い
る）と入力されてしまう場合がある。また、文章の表題
のようにレイアウトのためｒＷ　　ＯＲＬ　　ＤＪと単
語中に故意に空白（以下、スペースという）を挿入して
いる場合がある。この他にも、例えば「ディフェクト・
スキッピング」というような複合語をつなぐ際に、オペ
レータによっては「ディフェクト　スキッピング」のよ
うにスペースを挿入する場合もある。これらの単語を含
んだ文が該機械翻訳システムに入力されると、本来は前
記翻訳辞書の中に単語が格納されているにもかかわらず
、未知語となってしまう。このため、翻訳処理を効率的
に行なうことができないという問題点があった。However, in actual documents, there are cases where the grammatical processing described above cannot be used.
In some cases, "a' is used as a symbol to represent a blank space". Furthermore, as in the title of a text, a blank space (hereinafter referred to as a space) may be intentionally inserted between the words rW ORL DJ for layout purposes. In addition to this, for example, “Defect
When connecting compound words such as ``skipping,'' some operators may insert spaces, such as ``defect skipping.'' When a sentence containing these words is input into the machine translation system, the words become unknown words even though the words are originally stored in the translation dictionary. Therefore, there was a problem that translation processing could not be performed efficiently.

（発明が解決しようとする課題）上述したように、従来の機械翻訳システムでは、入力原
文中の単語と翻訳辞書の辞書項目とのマツチングを行な
うとき、該原文中にスペースを含んだ単語があると、単
語検索が行なえないため、翻訳処理を効率良く行なえな
いという問題点があった。(Problems to be Solved by the Invention) As described above, in conventional machine translation systems, when matching words in an input source text with dictionary entries in a translation dictionary, it is possible to detect words that include spaces in the source text. However, since word searches cannot be performed, there is a problem in that translation processing cannot be carried out efficiently.

本発明は上述した問題点を解決するためになされたもの
であり、入力原文中にスペースを含んだ単語があっても
確実に単語検索を行ない、よって翻訳処理を効率良く行
なうことができる機械翻訳システムを提供することを目
的とする。The present invention has been made in order to solve the above-mentioned problems, and is a machine translation system that can reliably perform word searches even if there are words that include spaces in the input original text, and can therefore perform translation processing efficiently. The purpose is to provide a system.

［発明の構成］（課題を解決するための手段）上述した目的を達成するために本発明の機械翻訳システ
ムでは、翻訳対象となる原文の入力を行なう入力部と、翻訳処理
に使用する知識情報を格納している翻訳辞書部と、この翻訳辞書部を用いて前記原文を翻訳処理して訳文を
得る翻訳部と、前記入力部による原文及び前記翻訳部による訳文を表示
する表示部とを備える。そして前記翻訳部は前記原文に
対し翻訳辞書部を参照して対応する単語を検索する単語
検索の際、検索できない未知語に対して、該未知語中の
空白部を削除或いは記号へ置換した上で再度単語検索を
行なうものである。[Structure of the Invention] (Means for Solving the Problems) In order to achieve the above-mentioned purpose, the machine translation system of the present invention includes an input section for inputting an original text to be translated, and knowledge information used for translation processing. a translation dictionary section that stores the translation dictionary section; a translation section that uses the translation dictionary section to translate the original text to obtain a translated text; and a display section that displays the original text from the input section and the translated text from the translation section. . Then, when performing a word search for the original text by referring to the translation dictionary section to search for a corresponding word, the translation section deletes blank spaces in the unknown words or replaces them with symbols for unknown words that cannot be searched. The word search is performed again.

また前記入力部は、前記未知語中の空白部に対して前記
翻訳部が該空白部の削除若しくは記号への置換を行なう
か否かの指示を行なうためのものである。Further, the input section is for instructing whether the translation section should delete the blank section in the unknown word or replace it with a symbol.

（作　　用）上述したように構成された本発明の機械翻訳システムに
よれば、前記翻訳部が前記入力部からの原文に対して前記翻訳辞
書を用いて単語検索する際に、未知語となった単語中の
スペースがある場合には、該スペースに対して削除若し
くは記号への置換を行なう。(Function) According to the machine translation system of the present invention configured as described above, when the translation unit uses the translation dictionary to search for words in the original text from the input unit, unknown words are detected. If there is a space in the word, the space is deleted or replaced with a symbol.

この結果、前記翻訳辞書は原文の単語を修正することな
く正しく検索することができる。したがって、効率よく
翻訳処理を行なうことができる。As a result, the translation dictionary can correctly search for words in the original text without modifying them. Therefore, translation processing can be performed efficiently.

（実施例）以下、図面を参照しながら本発明の一実施例について詳
述する。(Example) Hereinafter, an example of the present invention will be described in detail with reference to the drawings.

第１図は本発明の一実施例としての日英機械翻訳システ
ムの全体ブロック図である。該システムは、キーボード
からなる入力部１、原文としてキー人力された日本語文
を記憶する原文記憶部２、訳文としての入力日本語文に
対応する英文を記憶する訳文記憶部３、本システム全体
を制御する編集制御部４、翻訳処理を実行する翻訳部５
、翻訳処理に利用する知識情報を収容した翻訳辞書６、
原文や訳文などの表示を制御する表示制御部７、表示部
８及び原文や訳文などを清書して７）−トコビーとして
出力するための印刷部９とにより構成されている。FIG. 1 is an overall block diagram of a Japanese-English machine translation system as an embodiment of the present invention. The system includes an input section 1 consisting of a keyboard, an original text storage section 2 that stores the human-keyed Japanese text as the original text, a translation storage section 3 that stores the English text corresponding to the input Japanese text as the translated text, and controls the entire system. an editing control section 4 that executes the translation process; a translation section 5 that executes the translation process;
, a translation dictionary 6 containing knowledge information used in translation processing;
It is composed of a display control section 7 for controlling the display of the original text, translated text, etc., a display section 8, and a printing section 9 for transcribing the original text, translated text, etc. and outputting it as 7)-tocoby.

第２図は、入力部１のキー配列の一例を示す図である。FIG. 2 is a diagram showing an example of the key layout of the input unit 1.

人力部１は日本語文、英文人力用の文字キーに加えて以
下の各種キーを備えている。The human resources section 1 is equipped with the following various keys in addition to character keys for Japanese and English human resources.

翻訳指示キー　　　　　＝１００編集キー　　　：１０１〜１０６機能キー　　　：１１０〜１１７カーソルキー　　　　　　：１３０〜１３３スペース削
除モードキー：１４０スペース−中点変換−８１４１モードキスベース−ハイフン変換、１４２モードキー第３図は表示部８の画面のレイアウトの一例を示す図で
ある。入力原文は画面左側の原文表示領域に表示される
。翻訳処理の結果得られた訳文は画面右側の訳文表示領
域にその原文と対応する位置に表示される。また画面上
部の編集領域は各種編集に必要な情報をを表示するため
に用いることができる。Translation instruction key = 100 Edit key: 101-106 Function key: 110-117 Cursor key: 130-133 Space deletion mode key: 140 Space - midpoint conversion - 8141 Mode kiss base - hyphen conversion, 142 Mode key Figure 3 is 3 is a diagram showing an example of a layout of a screen of a display unit 8. FIG. The input original text is displayed in the original text display area on the left side of the screen. The translated text obtained as a result of the translation process is displayed in the translated text display area on the right side of the screen at a position corresponding to the original text. Additionally, the editing area at the top of the screen can be used to display information necessary for various types of editing.

第４図は編集制御部４による対話翻訳処理の流れの一例
を示すフローチャートである。対話翻訳処理では原文の
入力と対応する訳文の編集作業をオペレータが適宜に行
うことのできる。編集制御部４は、ステップ８１〜Ｓ６
において、入力部１からなんらかのキー人力があるかあ
るいは翻訳部５から翻訳完了信号を受けているかを監視
しており、なんらかのキー人力があった場合にそれに対
応する処理を行う。FIG. 4 is a flowchart showing an example of the flow of dialog translation processing by the editing control unit 4. In interactive translation processing, the operator can input the original text and edit the corresponding translated text as appropriate. The editing control unit 4 performs steps 81 to S6.
In this step, it is monitored whether there is any key input from the input unit 1 or whether a translation completion signal is received from the translation unit 5, and if there is any key input, a corresponding process is performed.

オペレータが機能キーのいずれかを操作した場合、ステ
ップＳ５によりこれを検知し、ステップＳ１２でその機
能キーに対応した処理を行う。If the operator operates any of the function keys, this is detected in step S5, and the process corresponding to that function key is performed in step S12.

オペレータが編集キーのいずれかを操作した場合、ステ
ップＳ４によりこれを検知し、ステップＳｌｌでその編
集キーに対応した処理を行う。If the operator operates any of the editing keys, this is detected in step S4, and the process corresponding to the editing key is performed in step Sll.

オペレータがカーソルキー１３０あるいはその他の制御
キーを操作した場合、ステップ８１〜Ｓ６を介してステ
ップＳ１３でキー操作に対応したカーソルの移動のため
の処理やその他の処理を行う。When the operator operates the cursor key 130 or other control keys, processing for moving the cursor corresponding to the key operation and other processing are performed in step S13 via steps 81 to S6.

オペレータが文字キーを操作して原文を入力すると、ス
テップＳ３で各文字キーが検知された後、対応する文字
コードが順次編集制御部４内の（図示しない）入力バッ
ファにセットされる（ステップＳ８）。この入力原文は
表示制御部７を介して表示部８の原文表示領域に表示さ
れる（ステップＳ９）。オペ−レータが任意の時点で例
えば−文の入力が終了したとき、翻訳指示キー１００を
打鍵すると、ステップＳ２でこのキーが検知され、編集
制御部４は翻訳部５に対し入力バッファ内の原文を供給
し、翻訳処理の開始を指示する（ステップ５１０）。な
お、原文人力の途中で訂正・挿入・削除等の入力編集が
必要な場合には、カーソルキーにより所望の編集箇所へ
カーソルを移動した後、挿入キー１０１、削除キー１０
２等の編集キーを用いてこれを行うことができる。When the operator inputs the original text by operating the character keys, each character key is detected in step S3, and then the corresponding character code is sequentially set in the input buffer (not shown) in the editing control unit 4 (step S8). ). This input original text is displayed in the original text display area of the display unit 8 via the display control unit 7 (step S9). When the operator presses the translation instruction key 100 at any time, for example, when he has finished inputting a - sentence, this key is detected in step S2, and the editing control unit 4 sends the translation unit 5 to the original text in the input buffer. is supplied to instruct the start of translation processing (step 510). In addition, if input editing such as correction, insertion, deletion, etc. is required in the middle of editing the original text, move the cursor to the desired editing location using the cursor keys, and then press the insert key 101 and the delete key 10.
This can be done using the 2nd etc. edit key.

第５図は翻訳部５における翻訳処理の流れを示すフロー
チャートである。FIG. 5 is a flowchart showing the flow of translation processing in the translation section 5.

形態素解析部（Ｓ　５１）では、入力原文に対し、活用
変化・辞書６ａを用いて語尾等に変化がある単語を、そ
の原形（基本形）に変換する。The morphological analysis unit (S51) uses the conjugation change/dictionary 6a to convert words with changes in endings, etc. into their original forms (basic forms) in the input original text.

辞書検索部（Ｓ　５２）では、入力原文を構成する単語
毎に単語・熟語辞書６ｂを検索し、その品詞や訳語等の
情報を集める。The dictionary search section (S52) searches the word/idiom dictionary 6b for each word constituting the input original text, and collects information such as its part of speech and translated words.

解析制御部（Ｓ　５３）は、訳文の候補を構文解析部に
送る処理を行う。The analysis control unit (S53) performs a process of sending translation candidates to the syntactic analysis unit.

構文解析部（Ｓ　５４）は、解析文法６ｃを使って訳文
候補の構文を解析し、日本語文の構造を生成する。解析
に失敗した場合は解析制御部に戻る。The syntax analysis unit (S54) analyzes the syntax of the translation candidate using the analysis grammar 6c, and generates the structure of a Japanese sentence. If the analysis fails, the process returns to the analysis control section.

構造変換部（Ｓ　５５）は、変換文法６ｄを使って日本
語の構造を英語の構造に変換する。棄却された場合は、
構文解析部（Ｓ　５４）に戻る。The structure conversion unit (S55) converts the Japanese structure into the English structure using the conversion grammar 6d. If rejected,
The process returns to the syntax analysis section (S54).

構文生成部（３５Ｂ）は、生成文法６ｅに従って英語の
構造から英語の語順を決定し単語列に変換する。The syntax generation unit (35B) determines the English word order from the English structure according to the generative grammar 6e and converts it into a word string.

形態素生成部（Ｓ　５７）は、形態素生成文法６ｆによ
り単語の語形等を変化させて翻訳文を完成させる。The morpheme generation unit (S57) completes the translated sentence by changing the word form etc. using the morpheme generation grammar 6f.

以上のそれぞれの処理部は、処理を始めた時点では信号
を編集制御部４に出力する。Each of the above-mentioned processing units outputs a signal to the editing control unit 4 when starting processing.

再び第４図において、編集制御部４はステップＳ１で上
記翻訳完了信号を検知すると、ステップＳ７に進み、翻
訳部５から得られた訳文候補及び補助情報を表示制御部
７に送り訳文を表示させる（ステップＳ７）。表示制御
部７は上記訳文候補を訳文表示領域内において入力原文
（画面左側）と対応する位置に表示するとともに、補助
情報がある場合には、その訳語を高輝度または反転表示
することによりオペレータがその旨識別するようにして
いる。Referring again to FIG. 4, when the editing control section 4 detects the translation completion signal in step S1, the process proceeds to step S7, where it sends the translation candidate and auxiliary information obtained from the translation section 5 to the display control section 7 and causes the translation to be displayed. (Step S7). The display control unit 7 displays the translation candidate in the translation display area at a position corresponding to the input original text (on the left side of the screen), and if there is auxiliary information, displays the translation with high brightness or in reverse video so that the operator can I'm trying to identify that.

この状態から、編集キーの打鍵があると、編集制御部４
はステップＳ４でこれを検知し、このときカーソルで指
示されている語または句に対して各キーに対応した編集
処理を行う（ステップ５１１）。例えば、挿入キー１０
１の操作により、カーソル位置の前に文字を挿入する。From this state, when the edit key is pressed, the edit control unit 4
detects this in step S4, and performs editing processing corresponding to each key on the word or phrase indicated by the cursor at this time (step 511). For example, insert key 10
Operation 1 inserts a character before the cursor position.

削除キー１０２の操作により、カーソルが指示している
範囲の文字列を削除する。移動キー１０３の操作により
、カーソルが指示している範囲を移動する。By operating the delete key 102, the character string in the range indicated by the cursor is deleted. By operating the movement key 103, the range indicated by the cursor is moved.

取消キー１０４の操作により、上記１０１゜１０２．１
０３の効果を無効にする。係り受はキー１０５の操作に
より、補助情報を用いて、カーソルが指示している語句
の他の係り受は候補を表示させる。By operating the cancel key 104, the above 101°102.1
Nullifies the effect of 03. Modifications are made by operating the key 105, using auxiliary information to display other modification candidates for the word or phrase indicated by the cursor.

なお、機能キーにより以下の機能を実現できる。Note that the following functions can be realized using the function keys.

訳語表示キー１１０の操作により、訳文中の語に対して
その訳語を表示できる。辞書表示キー１１１の操作によ
り、原文中の語に対して辞書を表示できる。辞書登録キ
ー１１２の操作により、単語・熟語の登録ができる。辞
書削除キー１１３の操作により、辞書登録により登録さ
れた単語・熟語の削除を行うことができる。部分訳キー
１１４の操作により、翻訳が失敗した時にその部分訳を
表示させる。By operating the translated word display key 110, the translated word can be displayed for each word in the translated sentence. By operating the dictionary display key 111, a dictionary can be displayed for a word in the original text. By operating the dictionary registration key 112, words and phrases can be registered. By operating the dictionary deletion key 113, words and phrases registered through dictionary registration can be deleted. By operating the partial translation key 114, when translation fails, the partial translation is displayed.

また、カーソルキーとしては、カーソルをそれぞれ各方
向に移動させるカーソル移動キー１３０、カーソルが移
動する単位を切換える単位切換キー１３１、各表示領域
間にカーソルを移動させるための領域間移動キー１３２
、カーソルのサイズを文字単位に縮小または語単位に拡
大するための拡大縮小キーなどで構成されている。そし
て、スペース削除モードキー１４０は、前記辞書引き、
形態素処理において、入力文中のスペースを削除した文
字列に対しても単語検索を行うかを０Ｎ１０ＦＦで指定
する。スペース←→中点変換モードキー１４１は、前記
辞書引き、形態素処理において、入力文中のスペースを
中点に置換した上で単語検索を行うかを０Ｎ１０ＦＦで
指定する。スペース□ハイフン変換モードキー１４２は
、前記辞書、引き、形態素処理において、入力文中のス
ペースをハイフンに置換した上で単語検索を行うかを０
Ｎ１０ＦＦで指定する。Further, the cursor keys include cursor movement keys 130 for moving the cursor in each direction, a unit switching key 131 for switching the unit in which the cursor moves, and an inter-area movement key 132 for moving the cursor between display areas.
, and zoom keys to reduce the cursor size character by character or enlarge the cursor size word by word. Then, the space deletion mode key 140 is used for the dictionary lookup,
In morpheme processing, 0N10FF specifies whether word search should also be performed for character strings with spaces removed from the input sentence. The space←→midpoint conversion mode key 141 specifies with 0N10FF whether to perform a word search after replacing spaces in the input sentence with midpoints in the dictionary lookup and morpheme processing. The space□hyphen conversion mode key 142 selects whether to replace spaces in the input sentence with hyphens and then perform a word search in the dictionary, lookup, and morpheme processing.
Specify with N10FF.

したがって、オペレータは原文の入力と、その訳文の編
集処理とを適宜行いながら対話的に翻訳文を得ることが
できる。Therefore, the operator can interactively obtain a translated text while inputting the original text and editing the translated text as appropriate.

以上のように構成されたシステムにおいて、更に辞書引
きにおける単語検索について以下に詳細に説明する。In the system configured as described above, a word search in a dictionary will be further explained in detail below.

第６図は、単語検索の処理の流れを示す図である。Ｓ８
１では、前記原文中の各文単位で翻訳辞書の見出し語に
該当する語があるかないかの単語検索を行ない、未知語
の抽出を行なう。この結果、マツチした見出し語は、辞
書単語リストに単語の候補として登録され、単語検索処
理を終了する。FIG. 6 is a diagram showing the flow of word search processing. S8
In step 1, a word search is performed for each sentence in the original text to see if there is a word that corresponds to the headword in the translation dictionary, and unknown words are extracted. As a result, the matched headword is registered as a word candidate in the dictionary word list, and the word search process ends.

Ｓ６１で辞書単語リストにセットされなかった語は未知
語とみなして次のステップを行なう。次に８６２におい
て、前記スペース削除モードキー１４０により設定され
るスペース削除モードがＯＮかＯＦＦか判定され、ＯＦ
Ｆの場合はＳＢ４へ行く。Words that are not set in the dictionary word list in S61 are regarded as unknown words and the next step is performed. Next, in 862, it is determined whether the space deletion mode set by the space deletion mode key 140 is ON or OFF;
If F, go to SB4.

ＯＮの場合は、Ｓ６３にて入力文字列からスペースを除
いた文字列と辞書内の見出し語とのマツチングが行われ
る。この結果、マツチした見出し語は辞書単語リストに
単語の候補として登録される。If it is ON, matching is performed in S63 between the character string obtained by removing spaces from the input character string and the headword in the dictionary. As a result, the matched headword is registered as a word candidate in the dictionary word list.

次に５６４において、前記スペース−中点変換モードキ
ー１４１により設定されるスペース−中点変換モードが
ＯＮかＯＦＦか判定され、ＯＦＦの場合はＳＢ５へ行く
。ＯＮの場合はＳ６５にて入力文字列のスペースを中点
に置換した文字列と辞書内の見出し語とのマツチングが
行われる。この結果、マツチした見出し語は辞書単語リ
ストに単語の候補として登録される。次にＳＢＢにおい
て、前記スペース←→ハイフン変換モードキー１４２に
より設定されるスペース←→ハイフン変換モードがＯＮ
かＯＦＦか判定され、ＯＦＦの場合は処理を終了する。Next, in 564, it is determined whether the space-midpoint conversion mode set by the space-midpoint conversion mode key 141 is ON or OFF, and if it is OFF, the process goes to SB5. If it is ON, matching is performed in S65 between the character string in which spaces in the input character string are replaced with midpoints and the headword in the dictionary. As a result, the matched headword is registered as a word candidate in the dictionary word list. Next, in SBB, the space←→hyphen conversion mode set by the space←→hyphen conversion mode key 142 is turned on.
It is determined whether or not it is OFF, and if it is OFF, the process ends.

ＯＮの場合は８６６にて入力文字列のスペースをハイフ
ンに置換した文字列と辞書内の見出し語とのマツチング
が行われる。この結果、マツチした見出し語は辞書単語
リストに単語の候補として登録される。If it is ON, matching is performed in step 866 between the character string in which spaces in the input character string are replaced with hyphens and the headword in the dictionary. As a result, the matched headword is registered as a word candidate in the dictionary word list.

以下では、具体例を用いて上記処理を説明する。The above processing will be explained below using a specific example.

第７図は翻訳辞書の見出し語及び訳語を示した一例であ
る。FIG. 7 is an example of entry words and translated words in a translation dictionary.

第８図は、入力文字列として不要なスペースを含んだ文
字列として、「デイ　スフ」が与えられた時の処理を説
明するための図である。今、スペース削除モードがＯＮ
であり、スペース←−中点変換モード及びスペースーー
ハイフン変換モードがＯＦＦであるとする。なお、入力
文字列「ディスク」は「ディスク」の誤りである。まず
、第６図の８６０で入力文字列と第７図の辞書とのマツ
チングが行われるが、辞書には「デイ　スフ」「デイ」
　「スフ」などにマツチする見出し語がないため、第８
図の辞書単語リストには何もセットされない（８ａ）。FIG. 8 is a diagram for explaining the processing when "disfu" is given as an input character string containing unnecessary spaces. Space deletion mode is now ON
Assume that the space←-midpoint conversion mode and the space-hyphen conversion mode are OFF. Note that the input character string "disk" is incorrect. First, at 860 in FIG. 6, the input character string is matched with the dictionary in FIG.
Since there is no entry word that matches ``sufu'' etc.
Nothing is set in the dictionary word list in the figure (8a).

Ｓ６１では辞書単語リストに何もセットされないことか
ら、該単語を未知語と判断し、ＳＢ２に進む。次に８８
２でスペース削除モードがＯＮであることにより、５１
３３が行われる。ここでは、入力文字列「デイ　スフ」
からスペースが削除された文字列「ディスク」が新たに
辞書マツチング対象文字列になる。この文字列に対して
辞書とのマツチングが行われると、第７図により「ディ
スク」がマツチし、この単語が辞書単語リストに登録さ
れる（８ｂ）。ここでは、「ディスク」の後ろのカッコ
内の数字は該当する辞書番号である。処理はＳ６４．Ｓ
８Ｂを通って終了し、辞書引き結果として「デイ　スフ
」から「ディスク」を得る。Since nothing is set in the dictionary word list in S61, the word is determined to be an unknown word and the process proceeds to SB2. Next 88
51 because the space deletion mode is ON in 2.
33 will be carried out. Here, the input string "day sufu"
The character string "disc" with spaces removed from it becomes the new dictionary matching target character string. When this character string is matched with the dictionary, "disc" is matched as shown in FIG. 7, and this word is registered in the dictionary word list (8b). Here, the number in parentheses after "disk" is the corresponding dictionary number. The process is S64. S
After passing through 8B, we get "disc" from "disk" as the dictionary lookup result.

第９図は「ディフェクト　スキッピング」という文字列
に対する処理を説明するための図である。FIG. 9 is a diagram for explaining processing for the character string "defect skipping."

今、スペース削除モード及びスペース−ハイフン変換モ
ードがＯＦＦであり、スペース−中点変換モードがＯＮ
であるとする。まず、第６図の８８０で入力文字列と第
７図の辞書とのマツチングが行われるが、辞書には「デ
ィフェクト　スキッピング」　「ディフェクト」　「ス
キッピング」などにマツチする見出し語がないため、辞
書単語リストには何もセットされない（９ａ）。そして
Ｓ６１で未知語と判断し、Ｓ６２でスペース削除変換モ
トを確認する。これによりＳ８４に進む。ＳＢ４ではス
ペース−中点変換モードがＯＮであることにより、Ｓ６
５が行われる。ここでは、入力文字列「ディフェクト　
スキッピング」のスペースが中点に変換された文字列「
ディフェクト・スキッピング」が新たに辞書マツチング
対象文字列になる。Currently, space deletion mode and space-hyphen conversion mode are OFF, and space-midpoint conversion mode is ON.
Suppose that First, at 880 in Figure 6, the input character string is matched with the dictionary in Figure 7, but since the dictionary does not have headwords that match "defect skipping,""defect,""skipping," etc., the dictionary word Nothing is set in the list (9a). Then, in S61, it is determined that it is an unknown word, and in S62, a space deletion conversion is confirmed. This causes the process to proceed to S84. In SB4, since the space-midpoint conversion mode is ON, S6
5 will be performed. Here, the input string "defect
The string "skipping" with spaces converted to midpoints "
"Defect Skipping" is a new string to be dictionary matched.

この文字列に対して辞書とのマツチングが行われると、
第７図により「ディフェクト・スキッピング」がマツチ
し、この単語が辞書単語リストに登録される（９ｂ）。When this string is matched with a dictionary,
According to FIG. 7, "defect skipping" is found, and this word is registered in the dictionary word list (9b).

処理は８６６を通って終了し、辞書引き結果として「デ
ィフェクト　スキッピング」から「ディフェクト・スキ
ッピング」を得る。The process ends through 866, and "defect skipping" is obtained from "defect skipping" as a dictionary lookup result.

第１０図は「デイレイド　ピリング」という文字列に対
する処理を説明するための図である。今、スペース削除
モード及びスペース←中点変換モトかＯＦＦであり、ス
ペース−ハイフン変換モードがＯＮであるとする。まず
、第６図の８６０で入力文字列と第７図の辞書とのマツ
チングが行われるが、辞書には「デイレイド　ピリング
」「デイレイド」　「ピリング」などにマツチする見出
し語がないたぬ、辞書単語リストには何もセットされな
い（ｌｏｇ）。そしてＳ８１で未知語と判断され、ＳＢ
２．Ｓ６４でモードをチエツクし、これによりＳＢＢに
進む。そして５６１１１ではスペース←→ハイフン変換
モードがＯＮであることにより、Ｓ８７が行われる。こ
こでは入力文字列「デイレイトビリング」のスペースが
ハイフンに変換された文字列「デイレイド−ピリング」
が新たに辞書マツチング対象文字列になる。この文字列
に対して辞書とのマツチングが行われると、第７図によ
り「デイレイド−ピリング」がマツチし１この単語が辞
書単語リストに登録され（１０ｂ）、処理は終了する。FIG. 10 is a diagram for explaining the processing for the character string "delayed pilling". Assume that the space deletion mode and the space←midpoint conversion mode are OFF, and the space-hyphen conversion mode is ON. First, at 860 in FIG. 6, the input character string is matched with the dictionary in FIG. Nothing is set in the word list (log). Then, in S81, it is determined that it is an unknown word, and SB
2. The mode is checked in S64, and the process proceeds to SBB. Then, in 56111, since the space←→hyphen conversion mode is ON, S87 is performed. Here, spaces in the input string "Day-Late Billing" are converted to hyphens, resulting in the string "Day-Late-Pilling".
becomes a new string for dictionary matching. When this character string is matched with the dictionary, "delayed pilling" is found as shown in FIG. 7, and this word is registered in the dictionary word list (10b), and the process ends.

辞書引き結果としては「デイレイド　ピリング」から「
デイレイド−ピリング」を得る。As a result of dictionary lookup, "day-laid pilling" to "
Obtain "Delayed Pilling".

なお、上記実施例においては、入力文字列中のスペース
について、削除、中点との変１ｍ、／％イ７ンとの変換
のいずれかを行うものとしたが、スペースを上記以外の
記号に置き換える、複数の変換モードを組み合わせて行
うなどの拡張が考えられる。In the above embodiment, spaces in the input character string are either deleted, converted from the midpoint to 1m, or converted to /% in, but spaces can also be converted to symbols other than the above. Possible expansions include replacement, combination of multiple conversion modes, etc.

また上述した実施例では、日英機械翻訳システムについ
て述べたが、前記翻訳辞書６に収容する知識情報を変更
することにより英日機械翻訳システムとして用いること
ができる。英日機械翻訳を行なう場合、原文としてキー
人力された英文は、単語ごとにスペースが含まれている
。このとき、単語間のスペースで単語検索処理が停止す
ることはない。なぜなら、例えば第１１図に示すように
表示部８の原文表示領域に”Ｔｈ１ｓ　ｉｓ　ａ　ｐｅ
ｎ　、“なる英文が入力部１を介して英日機械翻訳シス
テムに入力される。すると、該英文をスペースごとに分
類して各単語、つまり”Ｔｈ１ｓ”、”ｉｓ”、”ａ“
。Further, in the above embodiment, a Japanese-English machine translation system has been described, but by changing the knowledge information stored in the translation dictionary 6, it can be used as an English-Japanese machine translation system. When performing English-Japanese machine translation, the original English text that has been manually translated contains spaces between each word. At this time, the word search process does not stop due to spaces between words. This is because, for example, as shown in FIG.
n, "is input into the English-Japanese machine translation system through the input unit 1. Then, the English sentence is classified by space and each word, that is, "Th1s", "is", "a"
.

”ｐｅｎ”、“、゛ごとに文頭から順次単語検索し、翻
訳して訳文を得る。そしてたとえ、第１１図に示した英
文を入力しようとして入力ミスにより、第１２図に示す
ように表示部８の原文表示領域に入力してしまった場合
を考える。なお、スペース１２０１は入力ミスによるス
ペースである。The words ``pen'', ``, and ゛ are searched sequentially from the beginning of the sentence and translated to obtain the translated text.Even if you try to input the English sentence shown in Figure 11 and make a typing error, the display screen as shown in Figure 12. Consider the case where an input is made in the original text display area No. 8. Note that the space 1201 is a space due to an input error.

この場合、第１２図の英文は”Ｔｈ”、１ｓ”、”ＩＳ
”、”ａ”。In this case, the English text in Figure 12 is “Th”, “1s”, “IS”.
”, “a”.

“ｐｅｎ“、”、゛にスペースごとに分類して辞書引き
が行なわれる。この辞書引きに用いられる辞書としては
例えば、第１３図に示すような該辞書の見出し語及び訳
訳を格納している。Dictionary lookup is performed by classifying each space into "pen", ",".The dictionary used for this dictionary lookup may, for example, store headwords and translations of the dictionary as shown in Figure 13. There is.

”Ｔｈ”に対して第１３図の辞書とのマツチングが行な
われる。このマツチングの際には”Ｔｈ”は”ｔｈ”、
つまり大文字は小文字に変換した上で行なう。Matching of "Th" with the dictionary shown in FIG. 13 is performed. In this matching, "Th" is "th",
In other words, uppercase letters are converted to lowercase letters.

第１３図の辞書には“ｔｈ”にマツチする見出し語がな
いので第１４図の辞書単語リストには何もセットされな
い（１４ｇ）。このため、第６図のＳ６１により該単語
は未知語であると判断される。次に“Ｔｈ　ｉｓ“のス
ペース１２０１を削除する。これはスペース削除モード
がＯＮであることにより実行される。この結果、”Ｔ旧
Ｓ”が新たに辞書マツチング対象文字列になる。この文
字列に対して辞書とのマツチングが行なわれると、第１
３図の辞書より”Ｔｈ１ｓ”がマツチし、この単語が辞
書単語リストに登録される（１４ｂ）。そして”Ｔｈ１
ｓ”の次に動詞の“ＩＳ”、”ａ“の順に単語検索して
いく。なお、“Ｔｈ１ｓ“の後ろ半分の”１Ｓ”を動詞
の”ＩＳ゛と誤認識することはない。なぜならば、単語
検索は文の最初から順次検索していく。よってｔｈ“と
“ｉｓ”間のスペースが削除された上で”ｔｈｉｓ”と
検索されて、“ｔｈｉｓ“の次の単語である動詞の”Ｉ
Ｓ“は単独の単語として辞書にマツチする見出し語があ
るかないかのマツチングを行なうためである。Since there is no headword that matches "th" in the dictionary of FIG. 13, nothing is set in the dictionary word list of FIG. 14 (14g). Therefore, the word is determined to be an unknown word at S61 in FIG. Next, the space 1201 for "This is" is deleted. This is executed by turning on the space deletion mode. As a result, "T old S" becomes a new character string for dictionary matching. When this character string is matched with the dictionary, the first
"Th1s" is found in the dictionary shown in FIG. 3, and this word is registered in the dictionary word list (14b). And “Th1
Next, the word search is performed in the order of the verb "IS" and "a". Note that the last half of "Th1s", "1S", will not be mistakenly recognized as the verb "IS". This is because word searches sequentially search from the beginning of the sentence. Therefore, the space between “th” and “is” is deleted and “this” is searched, and the verb “I” which is the next word after “this” is searched.
S'' is used to perform matching as a single word to see if there is a matching headword in the dictionary.

この他にも、タイトルなどのように入力文字列中にスペ
ースを複数含むもの（例えば、「は　じめ　に」や、”
Ｗ　　ＯＲＬ　　Ｄ”）ｌ；１ｍ対しても上述した実施
例で説明した処理により、単語検索を確実に行なうこと
ができる。よって、オペレータが原文の修正や、あらゆ
る単語を辞書に登録するといった手間を省くことができ
る。これは翻訳処理が終了したのちの編集作業の著しい
簡略化がはかられたことになり、簡易に適切な翻訳結果
を得ることができる。また。前記スペースの削除や記号
への置換は未知語に対してのみ行なうものであるから、
翻訳処理時間が増えることはない。In addition, there are also titles that contain multiple spaces in the input character string (for example, "Introduction", "
W ORL D”)l;1m, word searches can be performed reliably by the process explained in the above embodiment.Therefore, the operator does not have to take the trouble of correcting the original text or registering all the words in the dictionary. This greatly simplifies the editing work after the translation process is complete, and allows you to easily obtain an appropriate translation result.Also, it is possible to delete the spaces and replace them with symbols. Since the substitution is performed only for unknown words,
Translation processing time will not increase.

［発明の効果］本発明によれば、単語中に不要なスペースが入っている
場合でも確実に単語検索を行うことができる。また、通
常中点やハイフンを含んで表示される語がそれらの記号
の代わりにスペースを含んでいるなど表記にゆれがある
場合でも翻訳辞書中の見出し語とのマツチングを行うこ
とが可能となり、確実に単語検索を行うことができる。[Effects of the Invention] According to the present invention, a word search can be performed reliably even when an unnecessary space is included in a word. In addition, even if there is a discrepancy in the notation, such as a word that is normally displayed with a midpoint or hyphen, but contains a space instead of those symbols, it is now possible to match it with the headword in the translation dictionary. Word searches can be performed reliably.

よって、翻訳処理を効率良く行うことができる。Therefore, translation processing can be performed efficiently.

[Brief explanation of the drawing]

第１図は本発明の一実施例の全体ブロック図、第２図は
本発明の一実施例で用いる入力部のキー配列を示す図、
第３図は本発明の一実施例で用いる表示部の画面のレイ
アウトの一例を示す図、第４図は本発明の一実施例で用
いる編集制御部の処理の流れの一例を示すフローチャー
ト、第５図は本発明の一実施例で用いる翻訳処理の流れ
の一例を示すフローチャート、第６図は本発明の一実施
例で用いる辞書引き処理における単語検索の処理の流れ
を示すフローチャート、第７図、第１３図は本発明の一
実施例で用いる翻訳辞書の見出し及び訳語を示した図、
第８図、第９図、第１０図。第１１図、第１２図、第１４図は本発明の一実施例の動
作を説明するための図である。１・・・入力部、２・・・原文記憶部、３・・・訳文記
憶部、４・・・編集制御部、５・・・翻訳部、６・・・
翻訳辞書、８・・・表示部、９・・・印刷部FIG. 1 is an overall block diagram of an embodiment of the present invention, FIG. 2 is a diagram showing a key arrangement of an input section used in an embodiment of the present invention,
FIG. 3 is a diagram showing an example of the screen layout of the display unit used in an embodiment of the present invention, FIG. FIG. 5 is a flowchart showing an example of the flow of translation processing used in one embodiment of the present invention, FIG. 6 is a flowchart showing the flow of word search processing in dictionary lookup processing used in one embodiment of the present invention, and FIG. , FIG. 13 is a diagram showing the headings and translated words of the translation dictionary used in one embodiment of the present invention,
Figures 8, 9, and 10. FIG. 11, FIG. 12, and FIG. 14 are diagrams for explaining the operation of one embodiment of the present invention. DESCRIPTION OF SYMBOLS 1... Input part, 2... Original text storage part, 3... Translation text storage part, 4... Edit control part, 5... Translation part, 6...
Translation dictionary, 8...display section, 9...printing section

Claims

[Claims]

(1) An input unit for inputting the original text to be translated; a translation dictionary unit that stores knowledge information used in translation processing; and a translation process for the original text using the contents of this translation dictionary unit to produce a translated text. and a display unit that displays the original text input from the input unit and the translated text obtained from the translation unit, and the translation unit refers to the translation dictionary unit for the original text to determine the corresponding word. 1. A machine translation system characterized in that when a word search is performed to search for an unknown word in the original text that cannot be searched, a blank space in the unknown word is deleted or replaced with a symbol and the word search is performed again.

(2) The machine translation system according to claim 1, wherein the input unit is for instructing whether or not to delete or replace blanks in the unknown word with symbols.

(3) The translation unit searches for a set of character strings obtained by removing a blank space between an unknown word that cannot be searched and a character string adjacent thereto in the original text. Machine translation system.