JPH06295311A

JPH06295311A - Machine translation system

Info

Publication number: JPH06295311A
Application number: JP5106051A
Authority: JP
Inventors: Sakiko Honma; 咲子本間
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1993-04-08
Filing date: 1993-04-08
Publication date: 1994-10-21

Abstract

PURPOSE:To improve the translation accuracy of a sentence including an unregistered composite word by referring to a dictionary for the meaning of the unregistered composite word in each its constitutional element and applying proper translation information in accordance with the internal structure of the composite word obtained from the reference of the dictionary. CONSTITUTION:A composite word detecting part 15 detects a composite word combined through a specific character such as a hyphen and a composite word processing part 16 divides the detected composite word into individual constitutional elements by using the specific character as a delimiter and applies translation information such as a part of speech and a translated word to the whole composite word in accordance with the internal structure obtained from the reference of the dictionary. When the composite word detected by the detecting part 15 has no entry as a whole composite word and morphem processing such as affix processing and numeral processing for the composite word is not executed by a translation processing part 17, the processing part 16 gives proper translation information in accordance with the number of constitutional words of the composite word, speech part information obtained by the reference of the dictionary for the meaning of respective constitutional elements and the combination of the number and the information to utilize the information for the translation processing of the original sentence.

Description

Detailed Description of the Invention

【０００１】[0001]

【技術分野】本発明は、機械翻訳装置に関し、より詳細
には、未登録複合語を構成要素毎に辞書引きし、その結
果から得られる複合語内部の構造に応じて適切な翻訳情
報を付与することで、未登録複合語を含む文の翻訳精度
を上げるようにした機械翻訳装置に関する。例えば、未
登録複合語の処理機能を有する機械翻訳装置に適用され
るものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a machine translation device, and more specifically, it extracts a dictionary of unregistered compound words for each constituent element and adds appropriate translation information according to the internal structure of the compound word obtained from the result. By doing so, the present invention relates to a machine translation device that improves the translation accuracy of a sentence including an unregistered compound word. For example, it is applied to a machine translation device having a function of processing unregistered compound words.

【０００２】[0002]

【従来技術】機械翻訳などの自然言語処理において、シ
ステム辞書に登録されていない未知語の処理は重要な課
題である。特に複合語は、新造力（productivity）に富
んでいるため、その全てをシステム辞書に追加すること
は、辞書の容量を膨大にする上、実際上不可能である。
複合語全体が未登録であっても、構成要素の中に特に新
造力が強く、接辞辞書に登録されているものがあれば、
形態素処理で訳語や品詞などが付与される。また、数詞
など形態上の特徴が強いものは、やはり形態素処理でま
とめ上げることが可能である。辞書に未登録で、かつ形
態素処理を受けるパターンにも該当しない複合語の処理
法としては、複合語全体を未知語として処理し、幾つか
の品詞候補を与えて原文のまま訳出する方法がある。し
かし、ハイフンなどの特殊文字で結合された複合語は、
結合のパターンや構成要素間の意味関係にある程度の法
則性があり、辞書に未登録の場合でも、その法則に応じ
て訳語や品詞などの推定処理を行うことが望ましい。2. Description of the Related Art In natural language processing such as machine translation, processing of unknown words not registered in a system dictionary is an important issue. In particular, since compound words are rich in productivity, it is practically impossible to add all of them to the system dictionary, because the capacity of the dictionary is enormous.
Even if the entire compound word is not registered, if there is something that has a particularly strong new creativity and is registered in the affix dictionary,
Morphological processing adds translations and parts of speech. Also, those with strong morphological characteristics such as numbers can be put together by morphological processing. As a processing method of a compound word that is not registered in the dictionary and does not correspond to a pattern that is subjected to morphological processing, there is a method of processing the entire compound word as an unknown word, giving some part-of-speech candidates, and translating the original text as it is. . However, compound words joined by special characters such as hyphens are
There is a certain degree of rule in the connection pattern and the semantic relationship between the constituents, and even if it is not registered in the dictionary, it is desirable to perform the process of estimating the translated word and the part of speech according to the rule.

【０００３】このような問題を解決しようとする従来技
術としては、下記の３つがある。まず、特開平３−４８
３６６号公報のものは、複合語内部を入力されたままの
語順で構文解析したり、ハイフンをスペースに置き換え
て原文全体をそのまま構文解析して処理しようとするも
のである。しかし、複合語の語順は通常文の語順と同じ
であるとは限らない。例えば、”money-saving”は通常
の語順とは逆で、目的語が動詞の前に来ており、これ
を”money saving”として解析しても正しい結果は得ら
れない。また、ハイフンをスペースに置き換えて、その
他の部分と区別せずに構文解析を行った場合、複合語の
構成要素が他の部分と紛れてしまうため、誤った解析を
行う可能性が高く、生成された訳文の理解容易性が非常
に低くなってしまう。There are the following three conventional techniques for solving such a problem. First, JP-A-3-48
Japanese Patent No. 366 discloses an attempt to parse the inside of a compound word in the word order as it is input, or replace hyphens with spaces to parse and process the entire original sentence as it is. However, the word order of compound words is not always the same as the word order of ordinary sentences. For example, "money-saving" is the reverse of the normal word order, the object comes before the verb, and even if this is analyzed as "money saving", the correct result cannot be obtained. Also, if you replace the hyphen with a space and perform syntactic analysis without distinguishing it from other parts, the constituent elements of the compound word will be confused with other parts, so there is a high possibility that incorrect parsing will occur, and The intelligibility of the translated text is very low.

【０００４】また、特開平４−２６２４５９号公報、及
び特開平２−２８１３７４号公報のものは、「名詞−分
詞」などの限られたパターンに対し、一意に訳出パター
ンを決めて、個々の構成要素の辞書引き結果から複合語
全体の訳語を生成しようとするものである。しかし、入
力パターンに対して一意の訳出パターンのみを設定して
いるため、どちらの手法を用いても次の例のような問題
が生じる。例） college-trained people （ａ）「大学で訓練された人々」（ｂ）「大学が訓練した人々」（ｃ）「大学を訓練した人々」（ａ）は正解釈、（ｂ）は特開平４−２６２４５９号公
報での手法による翻訳例、（ｃ）は特開平２−２８１３
７４号公報での手法による翻訳例である。正解はハイフ
ン左側の名詞に場所格の格ラベル「で」を付与しなけれ
ばならないが、（ｂ）においては名詞を一律右側の動詞
の主語として扱い、（ｃ）においては、一律目的語とし
て扱うため、それぞれ誤った格ラベルを付与してしま
う。「名詞−過去分詞」における各構成要素の格関係は
一律ではなく、場所の他に原因、内容、行為者など様々
な関係があり得るが、従来技術では極く限られた場合に
しか対応できない。In Japanese Patent Laid-Open No. 4-262459 and Japanese Patent Laid-Open No. 2-281374, a translation pattern is uniquely determined for a limited pattern such as "noun-participle", and individual configurations are provided. It is intended to generate the translated word of the entire compound word from the result of the dictionary lookup of the elements. However, since only a unique translation pattern is set for the input pattern, whichever method is used, the following problem occurs. Example) college-trained people (a) "People trained at college" (b) "People trained at college" (c) "People trained at college" (a) is a correct interpretation, (b) is JP-A An example of translation by the method disclosed in Japanese Patent Laid-Open No. 4-262459, (c) is Japanese Patent Laid-Open No. 2813/1990.
This is an example of translation by the method disclosed in Japanese Patent Publication No. 74. In the correct answer, the noun on the left side of the hyphen must be given the case label "de" of the place case, but in (b) the noun is treated as the subject of the verb on the right side, and in (c) it is treated as the uniform object. Therefore, the wrong case label is given to each. The case relation of each component in the "noun-past participle" is not uniform, and there may be various relations such as cause, content, actor, etc. in addition to the place, but the conventional technology can handle only limited cases. .

【０００５】更に上記の従来技術では、対応している入
力パターンも少なく、右側が分詞でも左側が形容詞であ
る場合などには対応できず、構成要素が３語以上の場合
にも対応していない。特に特開平２−２８１３７４号公
報で対応しているのは「名詞−分詞」のみであり、比較
的出現頻度の高い、左側が副詞であったり、右側が名詞
であったりするパターンには全く対応していない。Further, the above-mentioned conventional technique does not support a small number of input patterns, and cannot handle cases where the right side is a participle and the left side is an adjective, and the case where the constituent elements are three words or more is not supported. . In particular, Japanese Unexamined Patent Publication No. 2-281374 only deals with "noun-participle", and corresponds to patterns with relatively high frequency of appearance, where the left side is an adverb or the right side is a noun. I haven't.

【０００６】[0006]

【目的】本発明は、上述のごとき実情に鑑みてなされた
もので、未登録複合語を構成要素毎に辞書引きし、その
結果から得られる複合語内部の構造に応じて適切な翻訳
情報を付与することで、未登録複合語を含む文の翻訳精
度を上げること、特に、複合語の構成要素数や、構成要
素中のキーとなる語の品詞、およびその有無によって処
理を分け、精度の向上を図ること、さらに、キーとなる
語を持たない複合語は、解析失敗や誤解析を避けるため
に全体を一つの未知語として扱うようにした機械翻訳装
置を提供することを目的としてなされたものである。[Purpose] The present invention has been made in view of the above-mentioned circumstances, and searches an unregistered compound word into a dictionary for each constituent element, and obtains appropriate translation information according to the internal structure of the compound word obtained from the result. By adding it, the translation accuracy of sentences including unregistered compound words can be improved, and in particular, the processing can be divided according to the number of constituent elements of the compound word, the part of speech of the key word in the constituent elements, and the presence or absence of the The purpose was to improve and to provide a machine translation device that treats compound words that do not have a key word as one unknown word in order to avoid parsing failures and erroneous analysis. It is a thing.

【０００７】[0007]

【構成】本発明は、上記目的を達成するために、（１）
少なくとも、原文を入力するための入力部と、該入力部
により入力された原文を記憶する原文記憶部と、前記原
文の翻訳処理に用いる知識情報を格納した翻訳辞書部
と、前記知識情報を用いて、前記原文記憶部に記憶され
た原文を翻訳処理する翻訳処理部と、該翻訳処理部によ
る翻訳処理によって得られた訳文を、原文に対応させて
記憶する訳文記憶部と、少なくとも前記原文または訳文
を表示する表示部および表示制御部と、該表示部に表示
された原文および訳文に対する編集処理を制御する編集
制御部と、入力された原文からハイフンなどの特殊文字
をはさんで結合された複合語を検出する複合語検出部
と、該複合語検出部で検出された複合語を、特殊文字を
デリミタとして個々の構成要素に分けて辞書引きし、そ
の結果から得られる内部構造に応じて複合語全体に対し
て品詞や訳語等の翻訳情報を与える複合語処理部とから
成り、前記複合語検出部で検出された複合語が、前記翻
訳処理部に全体としてのエントリを持たず、また翻訳処
理部において接辞処理、数詞処理などの形態素処理を受
けなかった場合には、前記複合語処理部において、複合
語の構成要素の数や、各構成要素を辞書引きして得られ
る品詞情報とその組合せなどに応じて適切な翻訳情報を
与え、原文の翻訳処理に利用すること、更には（２）前
記複合語処理部において、複合語の構成要素数が２語
で、かつハイフンの左側構成要素に品詞が副詞である場
合は、右側構成要素を副詞の係り先として複合語内部を
翻訳処理し、その結果から訳語を生成し、品詞候補と併
せて複合語全体に付与し、原文の翻訳処理に利用する第
１の翻訳処理手段を有すること、更には（３）前記複合
語処理部において、複合語の構成要素数が２語で、前記
第１の翻訳処理に該当せず、かつハイフンの右側構成要
素の品詞が動詞の現在分詞の場合は、左右の要素の語順
を入れ替えて複合語内部を翻訳処理し、その結果から訳
語を生成し、品詞候補と併せて複合語全体に付与し、原
文の翻訳処理に利用する第２の翻訳処理手段を有するこ
と、更には（４）前記複合語処理部において、複合語の
構成要素数が２語で、前記第１の翻訳処理に該当せず、
かつハイフンの右側構成要素の品詞が動詞の過去分詞の
場合は、前記翻訳辞書部の情報に基づいて、該動詞の格
フレームと左側構成要素の素性とのマッチングを行い、
適当な前置詞を補って複合語内部を翻訳処理し、その結
果から訳語を生成し、品詞候補と併せて複合語全体に付
与し、原文の翻訳処理に利用する第３の翻訳処理手段を
有すること、更には（５）前記複合語処理部において、
複合語の構成要素数が２語で、前記第１の翻訳処理に該
当せず、かつハイフンの右側構成要素が名詞の場合は、
複合語内部を翻訳処理し、その結果から訳語を生成し、
品詞候補と併せて複合語全体に付与し、原文の翻訳処理
に利用する第４の翻訳処理手段を有すること、更には
（６）前記複合語処理部において、構成要素数が２語
で、かつ前記第１の翻訳処理から第４の翻訳処理のいず
れにも該当しない場合は、複合語全体を一つのまとまり
として未知語処理する第１の未知語処理手段を有するこ
と、更には（７）前記複合語処理部において、構成要素
数が３語以上で、かつ構成要素のうち１つの品詞が等位
接続詞である場合は、複合語内部を並列句として翻訳処
理し、その結果から訳語を生成し、品詞候補と併せて複
合語全体に付与し、原文の翻訳処理に利用する第５の翻
訳処理手段を有すること、更には（８）前記複合語処理
部において、構成要素数が３語以上で、かつ前記第５の
翻訳処理にマッチしない場合は、構成要素の左から２語
を取り出して、前記第１の翻訳処理から第４の翻訳処理
の処理を行い、いずれかにマッチした場合は、該部分と
更に右側の１語との組合せにより再度前記第１の翻訳処
理から第４の翻訳処理の処理を行い、これを最後の構成
要素まで繰り返し、その結果から訳語を生成し、品詞候
補と併せて複合語全体に付与し、原文の翻訳処理に利用
する第６の翻訳処理手段を有すること、更には（９）前
記複合語処理部において、構成要素数が３語以上で、か
つ第５の翻訳処理又は第６の翻訳処理のいずれにもマッ
チしない場合には、複合語全体を一つのまとまりとして
未知語処理する第２の未知語処理手段を有することを特
徴としたものである。以下、本発明の実施例に基づいて
説明する。In order to achieve the above object, the present invention provides (1)
At least an input unit for inputting an original sentence, an original sentence storage unit for storing the original sentence input by the input unit, a translation dictionary unit for storing knowledge information used for translation processing of the original sentence, and the knowledge information are used. A translation processing unit that translates the original sentence stored in the original sentence storage unit, a translation storage unit that stores the translated sentence obtained by the translation processing by the translation processing unit in association with the original sentence, and at least the original sentence or The display unit and the display control unit that display the translated text, the edit control unit that controls the editing process for the original sentence and the translated text displayed on the display unit, and the input source sentence are connected by a special character such as a hyphen. A compound word detection unit that detects a compound word, and a compound word detected by the compound word detection unit is divided into individual constituent elements using special characters as delimiters, and a dictionary is created. A compound word processing unit that gives translation information such as a part of speech or a translated word to the entire compound word according to the structure, and the compound word detected by the compound word detection unit is registered in the translation processing unit as a whole. If it does not have it and it has not undergone morpheme processing such as affix processing or number processing in the translation processing unit, the compound word processing unit obtains the number of constituent elements of the compound word and a dictionary lookup for each constituent element. Appropriate translation information is provided according to the part-of-speech information and the combination thereof, which is used for the translation process of the original sentence. Furthermore, (2) the compound word processing unit has two compound elements, and When the part of speech is an adverb in the left component of the hyphen, the inside of the compound word is translated using the right component as the adjunct of the adverb, a translated word is generated from the result, and it is given to the entire compound word together with the part of speech candidate. , Original text translation process And (3) the compound word processing unit has two compound elements, does not correspond to the first translation process, and is on the right side of the hyphen. When the part-of-speech of a constituent element is the present participle of a verb, the word order of the left and right elements is exchanged, and the inside of the compound word is translated, and a translated word is generated from the result, which is given to the entire compound word together with the part-of-speech candidate. Second translation processing means used for the translation processing of, and (4) in the compound word processing unit, the number of constituent elements of the compound word is two, which does not correspond to the first translation processing,
And when the part of speech of the right component of the hyphen is a past participle of the verb, based on the information of the translation dictionary part, the case frame of the verb and the feature of the left component are matched,
Having a third translation processing means for translating the inside of the compound word by supplementing an appropriate preposition, generating a translated word from the result, adding it to the entire compound word together with a part-of-speech candidate, and utilizing it for the translation processing of the original sentence. , And (5) In the compound word processing section,
When the number of constituent elements of the compound word is two, does not correspond to the first translation process, and the right-side constituent element of the hyphen is a noun,
Translate the inside of the compound word, generate a translated word from the result,
It has a fourth translation processing means that is given to the entire compound word together with the part-of-speech candidate and is used for the translation processing of the original sentence. Furthermore, (6) the compound word processing section has two constituent elements, and If the first translation process to the fourth translation process do not apply, a first unknown word processing unit that processes the entire compound word as an unknown word is further provided, and further (7) the above In the compound word processing unit, when the number of constituent elements is three or more and one of the constituent elements is a coordinate conjunction, translation processing is performed inside the compound word and a translated word is generated from the result. And a fifth translation processing unit which is added to the entire compound word together with the part-of-speech candidate and is used for the translation processing of the original sentence. Furthermore, (8) the compound word processing unit has three or more constituent elements. , And does not match the fifth translation process In the case, two words from the left of the constituent element are taken out, the processing from the first translation processing to the fourth translation processing is performed, and if any of them is matched, the portion is combined with one word on the further right side. Then, the processes from the first translation process to the fourth translation process are performed again, and this process is repeated up to the last component, and a translated word is generated from the result, and is added to the entire compound word together with the part-of-speech candidate. Having a sixth translation processing means used for the translation processing, and further (9) the compound word processing section having three or more constituent elements, whichever of the fifth translation processing or the sixth translation processing. If there is no match, the second unknown word processing means for processing the unknown word as a unit of the entire compound word is featured. Hereinafter, description will be given based on examples of the present invention.

【０００８】図１は、本発明による機械翻訳装置の一実
施例を説明するための構成図で、図中、１１は入力部、
１２は原文記憶部、１３は訳文記憶部、１４は編集制御
部、１５は複合語検出部、１６は複合語処理部、１７は
翻訳部（翻訳処理部）、１８は表示制御部、１９は表示
部である。入力部１１から入力された原文は原文記憶部
１２に送られる。翻訳部１７は、編集制御部１４の制御
下で記憶部１２の原文を翻訳処理する。FIG. 1 is a block diagram for explaining an embodiment of a machine translation apparatus according to the present invention, in which 11 is an input unit,
Reference numeral 12 is an original sentence storage unit, 13 is a translated sentence storage unit, 14 is an edit control unit, 15 is a compound word detection unit, 16 is a compound word processing unit, 17 is a translation unit (translation processing unit), 18 is a display control unit, and 19 is It is a display unit. The original text input from the input unit 11 is sent to the original text storage unit 12. The translation unit 17 translates the original sentence in the storage unit 12 under the control of the editing control unit 14.

【０００９】すなわち、入力部１１により入力された原
文は、原文記憶部１２に記憶される。翻訳辞書部には前
記原文の翻訳処理に用いられる知識情報が格納されてい
る。翻訳処理部１７は、前記翻訳辞書部の知識情報を用
いて、前記原文記憶部１２に記憶された原文を翻訳処理
する。訳文記憶部１３は、翻訳処理によって得られた訳
文を原文に対応させて記憶する。表示制御部１８及び表
示部９は、少なくとも前記原文または訳文を表示する。
編集制御部１４は、表示された原文および訳文に対する
編集処理を制御する。複合語検出部１５は、入力された
原文からハイフンなどの特殊文字をはさんで結合された
複合語を検出する。複合語処理部１６は、前記複合語検
出部１５で検出された複合語を、特殊文字をデリミタと
して個々の構成要素に分けて辞書引きし、その結果から
得られた内部構造に応じて複合語全体に対して品詞や訳
語等の翻訳情報を与える。このような構成により、前記
複合語検出部１５で検出された複合語が、翻訳辞書部に
全体としてのエントリを持たず、また翻訳処理部１７に
おいて接辞処理、数詞処理などの形態素処理を受けなか
った場合には、前記複合語処理部１６において、複合語
の構成要素の数や、各構成要素を辞書引きして得られる
品詞情報とその組合せなどに応じて適切な翻訳情報を与
え、原文の翻訳処理に利用する。That is, the original sentence input by the input unit 11 is stored in the original sentence storage unit 12. Knowledge information used for the translation process of the original sentence is stored in the translation dictionary unit. The translation processing unit 17 uses the knowledge information of the translation dictionary unit to translate the original sentence stored in the original sentence storage unit 12. The translated text storage unit 13 stores the translated text obtained by the translation processing in association with the original text. The display control unit 18 and the display unit 9 display at least the original sentence or the translated sentence.
The editing control unit 14 controls the editing process for the displayed original sentence and translated sentence. The compound word detection unit 15 detects a compound word combined with a special character such as a hyphen from the input original sentence. The compound word processing unit 16 divides the compound word detected by the compound word detecting unit 15 into individual constituent elements by using special characters as delimiters and looks up the dictionary, and according to the internal structure obtained from the result, the compound word is extracted. Give translation information such as parts of speech and translations to the whole. With such a configuration, the compound word detected by the compound word detecting unit 15 does not have an entry in the translation dictionary unit as a whole, and the translation processing unit 17 is not subjected to morpheme processing such as affix processing or number processing. In such a case, the compound word processing unit 16 provides appropriate translation information according to the number of constituent elements of the compound word, the part-of-speech information obtained by looking up each constituent element in the dictionary, and the combination thereof, and Used for translation processing.

【００１０】図２は、図１における翻訳部の処理を英日
機械翻訳を例にして示した構成図で、図中、２１は形態
素解析部、２２は構文解析部、２３は変換部、２４は生
成部、２５は辞書、２６は文法規則である。まず、入力
文である英文は、形態素解析部２１で、辞書２５の情報
をもとに個々の単語に認定される。また接辞辞書などを
参照して、システム辞書に未登録である語の一部に対
し、品詞や訳語などの情報を与える。次に構文解析部２
２で文法規則２６を適用して文構造が決定され、係り受
け関係などを明らかにした木構造が作られる。該木構造
は、変換部２３で出力言語である日本語の構造に即した
木に変換され、生成部２４でノード毎に訳出される。FIG. 2 is a block diagram showing the processing of the translation unit in FIG. 1 by taking English-Japanese machine translation as an example. In the figure, 21 is a morphological analysis unit, 22 is a syntax analysis unit, 23 is a conversion unit, and 24 is a conversion unit. Is a generation unit, 25 is a dictionary, and 26 is a grammar rule. First, the English sentence which is the input sentence is recognized as an individual word by the morphological analysis unit 21 based on the information in the dictionary 25. Further, by referring to an affix dictionary or the like, information such as a part of speech or a translated word is given to some of the words that are not registered in the system dictionary. Next, the parser 2
In step 2, the grammar rule 26 is applied to determine the sentence structure, and a tree structure that defines the dependency relations is created. The tree structure is converted by the conversion unit 23 into a tree conforming to the structure of the output language Japanese, and the generation unit 24 translates the tree for each node.

【００１１】複合語検出部１５は原文中からハイフンな
どの特殊文字によって結合された複合語を切り出す。複
合語処理部１６では、形態素解析部２１での処理結果を
受け、切り出された複合語が辞書引き情報や接辞処理情
報を持たない場合は、特殊文字をデリミタとして構成要
素単位で辞書引きし、その結果に応じて品詞や訳語など
の翻訳情報を与え、構文解析部２２以降の処理に渡す。
得られた訳文は順次訳文記憶部１３に格納される。編集
制御部１４は表示制御部１８を駆動し、原文記憶部１２
に格納された英文と、訳文記憶部１３に格納された日本
語文を対応付けて表示部１９において表示する。ユーザ
はこの表示を見ながら後編集処理を行う。後編集処理
は、入力部１１から入力される制御情報に従い、編集制
御部１４で実行される。The compound word detecting unit 15 cuts out compound words combined by special characters such as hyphens from the original sentence. The compound word processing unit 16 receives the processing result of the morpheme analysis unit 21, and when the cut out compound word does not have the dictionary lookup information or the affix processing information, the special character is used as a delimiter in the dictionary to construct the dictionary. According to the result, translation information such as a part-of-speech and a translated word is given and passed to the processing of the syntax analysis unit 22 and thereafter.
The obtained translated texts are sequentially stored in the translated text storage unit 13. The edit control unit 14 drives the display control unit 18, and the original sentence storage unit 12
The English sentence stored in the and the Japanese sentence stored in the translated sentence storage unit 13 are displayed on the display unit 19 in association with each other. The user performs post-editing processing while looking at this display. The post-editing process is executed by the edit control unit 14 according to the control information input from the input unit 11.

【００１２】図３は、図１における複合語処理部におけ
る複合語処理のうち、構成要素数による振り分け処理ま
でのフローチャートである。以下、各ステップに従って
順に説明する。ハイフンなどによって区切られた複合語
を検出し（Ｓ３１）、形態素解析を行う（Ｓ３２）。辞
書エントリを持たず、接辞処理等も受けなかった場合は
（Ｓ３３〜Ｓ３４）、個々の語を辞書引きし（Ｓ３
５）、構成要素数をカウントする（Ｓ３６）。構成要素
数が２語であれば（ａ）の処理へ、３語以上であれば
（ｂ）の処理へ送る（Ｓ３７）。FIG. 3 is a flowchart of the compound word processing in the compound word processing section in FIG. 1 up to the distribution processing according to the number of constituent elements. Hereinafter, each step will be described in order. A compound word delimited by hyphens or the like is detected (S31), and morphological analysis is performed (S32). If it does not have a dictionary entry and is not subjected to affix processing or the like (S33 to S34), the individual words are looked up in the dictionary (S3).
5) The number of components is counted (S36). If the number of constituent elements is 2 words, it is sent to the processing of (a), and if it is 3 words or more, it is sent to the processing of (b) (S37).

【００１３】図４は、未登録複合語の個々の構成要素を
辞書引きした結果の例である。図５は、構成要素数が２
語である場合の、構成要素の品詞候補による振り分け処
理のフローチャートである。以下、各ステップに従って
順に説明する。左側構成要素の品詞候補を検索し（Ｓ５
１）、その中に副詞がある場合は、図中の（ｃ）の処理
へ送る（Ｓ５２）。副詞がない場合は右側構成要素の品
詞候補を検索し（Ｓ５３）、その中に現在分詞がある場
合は、図中の（ｄ）の処理へ送る（Ｓ５４）。現在分詞
がない場合は、過去分詞があるかをチェックし、過去分
詞があれば、図中の（ｅ）の処理へ送る（Ｓ５５）。な
ければ、次は名詞があるかどうかをチェックし、名詞あ
れば、図中の（ｆ）の処理へ送る（Ｓ５６）。名詞なけ
れば未知語処理を行い、構文解析へ送る（Ｓ５７）。な
おこのフローチャートでは、流れに沿って処理を行え
ば、構成要素が多品詞語であるの場合にも対応できる
が、別に多品詞の優先順位情報を設け、それに応じて処
理を行うことも可能である。（以下余白）FIG. 4 shows an example of the result obtained by looking up the individual constituent elements of the unregistered compound word in the dictionary. In FIG. 5, the number of components is two.
9 is a flowchart of a process of allocating a part of speech candidate of a component in the case of a word. Hereinafter, each step will be described in order. Search for a part-of-speech candidate for the left component (S5
1) If there is an adverb in it, it is sent to the process of (c) in the figure (S52). If there is no adverb, the part-of-speech candidate of the right component is searched (S53), and if there is a current participle, it is sent to the process of (d) in the figure (S54). If there is no present participle, it is checked if there is a past participle, and if there is a past participle, it is sent to the process of (e) in the figure (S55). If not, next, it is checked whether or not there is a noun, and if there is a noun, it is sent to the process of (f) in the figure (S56). If there is no noun, unknown word processing is performed and sent to the syntactic analysis (S57). It should be noted that in this flowchart, if the process is performed in accordance with the flow, it is possible to deal with the case where the component is a multi-part-of-speech word, but it is also possible to separately provide priority information of the multi-part-of-speech and perform the process accordingly. is there. (Below margin)

【００１４】図６は、左側構成要素が副詞である場合の
処理フローチャートである。以下、各ステップに従って
順に説明する。副詞の係り先を右側構成要素として翻訳
処理を行い（Ｓ６１）、解析成功であれば（Ｓ６２）、
複合語全体を形容詞とし、得られた訳語を与えて構文解
析に送る（Ｓ６３）。解析失敗であれば（Ｓ６２）、図
５（ｇ）に戻り、Ｓ５３以降の処理を行う。図４の例で
はＷ１がこれに該当する。左側構成要素である”fastes
t”は品詞候補に副詞を含むので、右側の”growing”を
係り先として翻訳処理を行い、「最も早く成長する」と
いう訳語を得る。品詞を形容詞として翻訳を行った例が
図１２のＳ１である。FIG. 6 is a processing flow chart when the left component is an adverb. Hereinafter, each step will be described in order. Translation processing is performed using the adverb's destination as the right component (S61), and if the analysis is successful (S62),
The entire compound word is used as an adjective, the obtained translated word is given and sent to the syntactic analysis (S63). If the analysis is unsuccessful (S62), the process returns to FIG. In the example of FIG. 4, W1 corresponds to this. Left side component "fastes"
Since "t" includes an adverb in the part-of-speech candidate, the translation process is performed with "growing" on the right as the target, and the translated word "growing fastest" is obtained. An example of translation using a part of speech as an adjective is S1 in FIG.

【００１５】図７は、右側構成要素が現在分詞である場
合の処理フローチャートである。以下、各ステップに従
って順に説明する。区切り文字の左右を入れ替えて翻訳
処理を行い（Ｓ７１）、解析成功であれば（Ｓ７２）、
複合語全体を形容詞とし、得られた訳語を与えて構文解
析に送る（Ｓ７３）。解析失敗であれば（Ｓ７２）、図
５（ｈ）に戻り、Ｓ５５以降の処理を行う。図４の例で
はＷ２，Ｗ３がこれに該当する。右側構成要素である”
threatening””looking”は共に品詞候補に現在分詞を
含むので、左右を入れ替え、それぞれ”threatening li
fe””looking wistful”として翻訳処理を行う。前者
は他動詞、後者は自動詞として解析されそれぞれ「生命
を脅かす」「物欲しそうに見える」という訳語を得る。
品詞を形容詞として翻訳を行った例が図１２のＳ２，Ｓ
３である。FIG. 7 is a processing flowchart in the case where the right side component is the present participle. Hereinafter, each step will be described in order. The left and right of the delimiter are replaced and the translation processing is performed (S71). If the analysis is successful (S72),
The entire compound word is used as an adjective, the obtained translated word is given and sent to the syntactic analysis (S73). If the analysis is unsuccessful (S72), the process returns to FIG. In the example of FIG. 4, this corresponds to W2 and W3. The right component "
The threatening “” looking ”both include the present participle in the part-of-speech candidate, so swap the left and right to“ threatening li
The translation process is performed as fe “” looking wistful. ”The former is analyzed as a transitive verb, and the latter is analyzed as an intransitive verb to obtain the translated words“ life threatening ”and“ looks greedy ”.
An example of translation using a part of speech as an adjective is S2 and S in FIG.
It is 3.

【００１６】図８は、右側構成要素が過去分詞である場
合の処理フローチャートである。以下、各ステップに従
って順に説明する。右側要素の格フレームを検索し（Ｓ
８１）、格フレームがあれば（Ｓ８２）、左側要素の素
性とマッチングを行う（Ｓ８３）。格フレームがなけれ
ば（Ｓ８２）、図５（ｉ）に戻り、Ｓ５６以降の処理を
行う。左側構成要素の素性とマッチするものが格フレー
ムの要素にあり（Ｓ８４）、その要素がＯＢＪ以外であ
れば（Ｓ８５）、ＳＵＢＪの場合は”by”を補い（Ｓ８
６，８７）、それ以外の場合は格フレームに記載された
前置詞を補って訳語を生成し（Ｓ８８）、複合語全体の
品詞を形容詞として構文解析に送る（Ｓ８９）。マッチ
するものがなかったり（Ｓ８４）、マッチした要素がＯ
ＢＪであった場合は（Ｓ８５）、図５（ｉ）に戻り、Ｓ
５６以降の処理を行う。FIG. 8 is a processing flowchart when the right component is a past participle. Hereinafter, each step will be described in order. Search the case frame of the right element (S
81), if there is a case frame (S82), the feature of the left element is matched (S83). If there is no case frame (S82), the process returns to FIG. There is an element of the case frame that matches the feature of the left component (S84), and if that element is other than OBJ (S85), in the case of SUBJ, "by" is supplemented (S8).
6, 87), in other cases, a preposition written in the case frame is supplemented to generate a translated word (S88), and the part of speech of the entire compound word is sent as an adjective to the syntactic analysis (S89). There is no match (S84), or the matched element is O
If it is BJ (S85), the process returns to FIG.
The processing after 56 is performed.

【００１７】図４の例ではＷ４，Ｗ５がこれに該当す
る。右側構成要素である”owned””loaded”は共に品
詞候補に過去分詞を含む。”own””load”の格フレー
ムの例は図９（ａ），（ｂ）に示す。”government”
は”own”のＳＵＢＪと素性ＯＲＧが一致するので、”b
y”を補って”owned by government”で翻訳処理を行
い、「政府によって所有された」という訳語を得る。ま
た”propane”は”load”のＷＩＴＨと素性ＭＡＴが一
致するので、”with”を補って”loaded with propan
e”で翻訳処理を行い、「プロパンを積まれた」という
訳語を得る。品詞を形容詞として翻訳を行った例が図１
２のＳ４，Ｓ５である。In the example of FIG. 4, W4 and W5 correspond to this. "Owned" and "loaded", which are the right components, both include the past participle as a part-of-speech candidate. Examples of "own" and "load" case frames are shown in FIGS. 9 (a) and 9 (b). "Government"
Has the feature ORG that matches the SUBJ of "own", so "b"
The translation process of "owned by government" is supplemented with "y" to obtain the translated word "owned by the government". In addition, since "propane" has the same feature MAT as the "load" WITH, "with" is supplemented by "loaded with propan".
The translation process is performed by "e", and the translated word "loaded with propane" is obtained. Figure 1 shows an example of translation using part-of-speech as an adjective.
2 are S4 and S5.

【００１８】図１０は、右側構成要素が名詞である場合
の処理フローチャート出ある。以下、各ステップに従っ
て順に説明する。左側構成要素を前置修飾要素として翻
訳処理を行い（Ｓ１０１）、解析成功であれば（Ｓ１０
２）、複合語全体の品詞として形容詞と名詞を付与し、
形容詞の場合は得られた訳語に「の」を加え、名詞の場
合は得られた訳語をそのまま与えて構文解析に送る（Ｓ
８３）。解析失敗であれば（Ｓ１０２）、図５（ｊ）に
戻り、以降の処理を行う。図４の例ではＷ６がこれに該
当する。左側構成要素である”product”は品詞候補に
名詞を含むので、”new product”で翻訳処理を行い、
「新しい製品（の）」という訳語を得る。品詞を形容詞
または名詞として翻訳を行った例が図１２のＳ６であ
る。この例では形容詞の候補が取られ、「新しい製品
の」と訳出される。FIG. 10 shows a processing flowchart when the right component is a noun. Hereinafter, each step will be described in order. Translation processing is performed using the left-side component as a prefix modifier (S101), and if the analysis is successful (S10).
2), add adjectives and nouns as part-of-speech of all compound words,
In the case of an adjective, "no" is added to the obtained translated word, and in the case of a noun, the obtained translated word is given as it is and sent to the syntactic analysis (S
83). If the analysis is unsuccessful (S102), the process returns to FIG. In the example of FIG. 4, W6 corresponds to this. The left component "product" contains a noun in the part-of-speech candidate, so perform translation processing with "new product".
Get the translation "new product". An example of translation using a part of speech as an adjective or noun is S6 in FIG. In this example, the adjective candidates are taken and translated as "new product".

【００１９】図１１は、構成要素が３語以上の場合の処
理フローチャートである。以下、各ステップに従って順
に説明する。構成要素の品詞候補を検索し（Ｓ１１
１）、その中に等位接続詞を含むものがあれば（Ｓ１１
２）、複合語内部を翻訳し（Ｓ１１３）、解析成功であ
れば（Ｓ１１４）、訳語と品詞（形容詞、名詞）を与え
て構文解析に渡す（Ｓ１１５）。解析失敗であれば（Ｓ
１１４）、未知語処理を行って構文解析に渡す（Ｓ１１
６）。図４の例ではＷ７がこれに該当する。構成要素中
の”and”は品詞候補に等位接続詞を含むので、”husba
nd and wife”で翻訳処理を行い、「夫と妻（の）」と
いう訳語を得る。品詞を形容詞または名詞として翻訳を
行った例が図１２のＳ７である。この例では形容詞の候
補が取られ、「夫と妻の」と訳出される。FIG. 11 is a processing flowchart when the number of constituent elements is three or more. Hereinafter, each step will be described in order. The part-of-speech candidate of the component is searched (S11
1), if any of them contains coordinate conjunctions (S11)
2) The inside of the compound word is translated (S113), and if the analysis is successful (S114), the translated word and the part of speech (adjective, noun) are given and passed to the syntactic analysis (S115). If the analysis fails (S
114), unknown word processing is performed, and the result is passed to the syntax analysis (S11).
6). In the example of FIG. 4, W7 corresponds to this. Since "and" in the constituent elements includes coordinate conjunctions in the part-of-speech candidates, "husba"
"nd and wife" is translated and the translated word "husband and wife (no)" is obtained. An example of translation using a part of speech as an adjective or noun is S7 in FIG. In this example, adjective candidates are taken and translated as "husband and wife's".

【００２０】品詞候補の中に等位接続詞がなければ（Ｓ
１１２）、左端から２語を取って（Ｓ１１７）構成要素
が２語の場合の処理（Ｓ５１〜５６）を行う（Ｓ１１
８）。処理語、未知語と判定されれば（Ｓ１１９）、複
合語全体を未知語として構文解析へ送る（Ｓ１１６）。
未知語ではなく、何らかの品詞と訳語を付与されていれ
ば（Ｓ１１９）、右側の１語を含めて（Ｓ１２１）、更
に２語としてＳ１１８の処理を行い、これを最も右側の
構成要素まで繰り返し（Ｓ１２０）、最終結果を複合語
全体の品詞および訳語として構文解析へ送る（Ｓ１２
２）。If there is no coordinate conjunction in the part-of-speech candidate (S
112), two words are taken from the left end (S117), and the processing (S51 to 56) when the constituent element is two words is performed (S11).
8). If it is determined that the processed word is an unknown word (S119), the entire compound word is sent to the syntactic analysis as an unknown word (S116).
If it is not an unknown word but some part-of-speech and translation are added (S119), the word on the right side is included (S121), the process of S118 is performed as two more words, and this is repeated up to the rightmost component ( S120), the final result is sent to the syntactic analysis as the part of speech and translation of the entire compound word (S12).
2).

【００２１】図４の例ではＷ８がこれに該当する。ま
ず”flexible metal”でＳ５１〜Ｓ５６の処理を行う。
この例はＳ５６の条件に当てはまるため、図１０の処理
を行い、「柔軟な金属（の）」という訳語を与えられ、
品詞は形容詞または名詞となる。更に右側の”pipe”を
含め、”flexible-metal”と”pipe”の２語としてＳ５
１〜５６の処理を行う。この場合もＳ５６の条件に当て
はまるため、図１０の処理を行い、”flexible-metal”
は形容詞と解析され、「柔軟な金属のパイプ（の）」と
いう訳語を得る。品詞を形容詞または名詞として翻訳を
行った例が図１２のＳ８である。この例では形容詞の候
補が取られ、「柔軟な金属のパイプのプラント」と訳出
される。なお、上記の処理において複合語内部の翻訳処
理を行う場合、通常の解析文法を用いても、複合語内部
の統語的特徴に合わせた文法（名詞句に冠詞が付かな
い、前置修飾語は通常１つのみ、など）を用いてもよ
い。図１２は、本発明における未登録複合語処理によ
る、図４に挙げた未登録複合語を含む名詞句の翻訳例で
ある。In the example of FIG. 4, W8 corresponds to this. First, the processing of S51 to S56 is performed with "flexible metal".
Since this example satisfies the condition of S56, the process of FIG. 10 is performed, and the translation of “flexible metal ()” is given.
Part of speech can be an adjective or a noun. Furthermore, including the "pipe" on the right side, S5 as two words "flexible-metal" and "pipe"
The processes 1 to 56 are performed. In this case as well, the condition of S56 is satisfied, so the process of FIG. 10 is performed and "flexible-metal" is performed.
Is parsed as an adjective and gets the translation "flexible metal pipe". An example of translation using a part of speech as an adjective or noun is S8 in FIG. In this example, the adjective candidate is taken and translated as "flexible metal pipe plant". In addition, in the above process, when performing the translation process inside the compound word, even if a normal parsing grammar is used, a grammar that matches the syntactic features inside the compound word (no prefix is attached to the noun phrase, the prefix modifier is Usually only one, etc.) may be used. FIG. 12 is an example of translation of a noun phrase including the unregistered compound words shown in FIG. 4 by the unregistered compound word processing according to the present invention.

【００２２】[0022]

【効果】以上の説明から明らかなように、本発明による
と、以下のような効果がある。（１）請求項１に対応する効果：未登録複合語を構成要
素毎に辞書引きし、その結果から得られる複合語内部の
構造に応じて適切な翻訳情報を付与することで、未登録
複合語を含む文の翻訳精度を上げることができる。（２）請求項２〜５、７、８に対応する効果：特に、複
合語の構成要素数や、構成要素中のキーとなる語の品
詞、およびその有無によって処理を分け、精度の向上が
可能となる。（３）請求項６、９に対応する効果：キーとなる語を持
たない複合語は、全体を一つの未知語として扱うことに
より、誤解析や解析失敗を避けることが出来る。As is apparent from the above description, the present invention has the following effects. (1) Effect corresponding to claim 1: An unregistered compound word is dictionary-divided for each constituent element, and appropriate translation information is added according to the internal structure of the compound word obtained from the result. It is possible to improve the translation accuracy of sentences including words. (2) Effects corresponding to claims 2 to 5, 7, and 8: Particularly, the processing is divided according to the number of constituent elements of the compound word, the part of speech of the key word in the constituent elements, and the presence or absence thereof, thereby improving accuracy. It will be possible. (3) Effects corresponding to claims 6 and 9: A compound word having no key word can be treated as one unknown word to avoid erroneous analysis and analysis failure.

[Brief description of drawings]

【図１】本発明による機械翻訳装置の一実施例を説明
するための構成図である。FIG. 1 is a configuration diagram for explaining an embodiment of a machine translation device according to the present invention.

【図２】図１における翻訳部の処理を英日機械翻訳を
例にして示した構成図である。FIG. 2 is a configuration diagram showing processing of a translation unit in FIG. 1 by taking English-Japanese machine translation as an example.

【図３】図１における複合語処理部における複合語処
理のうち、構成要素数による振り分け処理までのフロー
チャートである。FIG. 3 is a flowchart of the compound word processing in the compound word processing unit in FIG. 1 up to the distribution processing according to the number of constituent elements.

【図４】本発明による未登録複合語の個々の構成要素
を辞書引きした結果の例を示す図である。FIG. 4 is a diagram showing an example of a result obtained by looking up individual constituent elements of an unregistered compound word according to the present invention in a dictionary.

【図５】本発明における構成要素数が２語である場合
の構成要素の品詞候補による振り分け処理のフローチャ
ートである。FIG. 5 is a flow chart of a process of allocating components by part-of-speech candidates when the number of components is 2 in the present invention.

【図６】本発明における左側構成要素が現在分詞であ
る場合の処理のフローチャートである。FIG. 6 is a flowchart of processing when the left-side component is the present participle in the present invention.

【図７】本発明における右側構成要素が現在分詞であ
る場合の処理のフローチャートである。FIG. 7 is a flowchart of a process when the right-side component is the present participle in the present invention.

【図８】本発明における右側構成要素が過去分詞であ
る場合の処理のフローチャートである。FIG. 8 is a flowchart of a process when the right component is a past participle in the present invention.

【図９】本発明における格フレームの例を示す図であ
る。FIG. 9 is a diagram showing an example of a case frame according to the present invention.

【図１０】本発明における右側構成要素が名詞である
場合の処理のフローチャートである。FIG. 10 is a flowchart of a process when the right component is a noun according to the present invention.

【図１１】本発明における構成要素が３語以上の場合
の処理のフローチャートである。FIG. 11 is a flowchart of a process in the case where the constituent elements of the present invention are three or more words.

【図１２】本発明における未登録複合語処理による翻
訳例を示す図である。FIG. 12 is a diagram showing an example of translation by unregistered compound word processing according to the present invention.

[Explanation of symbols]

１１…入力部、１２…原文記憶部、１３…訳文記憶部、
１４…編集制御部、１５…複合語検出部、１６…複合語
処理部、１７…翻訳部（翻訳処理部）、１８…表示制御
部、１９…表示部。11 ... Input unit, 12 ... Original sentence storage unit, 13 ... Translated sentence storage unit,
Reference numeral 14 ... Edit control unit, 15 ... Compound word detection unit, 16 ... Compound word processing unit, 17 ... Translation unit (translation processing unit), 18 ... Display control unit, 19 ... Display unit.

Claims

[Claims]

1. At least an input unit for inputting an original sentence, an original sentence storage unit for storing the original sentence input by the input unit, and a translation dictionary unit for storing knowledge information used for translation processing of the original sentence. A translation processing unit that translates the original sentence stored in the original sentence storage unit using the knowledge information; and a translation sentence storage unit that stores the translated sentence obtained by the translation process by the translation processing unit in association with the original sentence. , At least the display unit and the display control unit for displaying the original sentence or the translated sentence, the edit control unit for controlling the editing process for the original sentence and the translated sentence displayed on the display unit, and the special characters such as hyphens from the input original sentence. The compound word detection unit for detecting the compound word combined by the above, and the compound word detected by the compound word detection unit,
It consists of a compound word processing unit that divides the special characters into individual components as delimiters and looks up the dictionary, and gives translation information such as parts of speech and translation words to the entire compound word according to the internal structure obtained from the result. The compound word detected by the compound word detection unit does not have an entry in the translation processing unit as a whole,
When the translation processing unit does not undergo morpheme processing such as affix processing and number processing, the compound word processing unit obtains the number of constituent elements of the compound word and part-of-speech information obtained by looking up each constituent element in a dictionary. A machine translation device characterized in that appropriate translation information is given according to the combination and the like, and is used for translation processing of the original sentence.

2. In the compound word processing unit, when the number of constituent elements of the compound word is two and the left-side component of the hyphen is a part-of-speech as an adverb, the right-side constituent is used as the adverb's target 2. The first translation processing means for translating a sentence, generating a translated word from the result, assigning it to the entire compound word together with a part-of-speech candidate, and utilizing it for the translation processing of the original sentence. Machine translation device.

3. In the compound word processing unit, when the number of constituent elements of the compound word is two, which does not correspond to the first translation processing, and the part of speech of the component on the right side of the hyphen is the present participle of the verb, Second translation processing means for translating the inside of the compound word by changing the word order of the left and right elements, generating a translated word from the result, assigning it to the entire compound word together with the part-of-speech candidate, and utilizing it for the translation processing of the original sentence. The machine translation device according to claim 1, further comprising:

4. In the compound word processing unit, when the number of constituent elements of the compound word is two, does not correspond to the first translation processing, and the part-of-speech of the component on the right side of the hyphen is the past participle of the verb. , Based on the information of the translation dictionary unit, the case frame of the verb and the feature of the left component are matched, the inside of the compound word is translated with an appropriate preposition, and a translated word is generated from the result, 2. The machine translation device according to claim 1, further comprising a third translation processing unit that is provided to the entire compound word together with the part-of-speech candidate and is used for translation processing of the original sentence.

5. In the compound word processing unit, when the number of constituent elements of the compound word is two, does not correspond to the first translation process, and the right component of the hyphen is a noun, the inside of the compound word is 4. A fourth translation processing unit that performs translation processing, generates a translated word from the result, assigns it to the entire compound word together with a part-of-speech candidate, and uses it for translation processing of the original sentence.
The machine translation device described.

6. In the compound word processing unit, when the number of constituent elements is two and does not correspond to any of the first translation processing to the fourth translation processing, the entire compound word is regarded as one group. 2. The machine translation device according to claim 1, further comprising first unknown word processing means for processing an unknown word.

7. In the compound word processing unit, when the number of constituent elements is three or more and one of the constituent elements is a coordinate conjunction, translation processing is performed inside the compound word as a parallel phrase. The machine translation apparatus according to claim 1, further comprising a fifth translation processing unit that generates a translated word from the result, assigns it to the entire compound word together with the part-of-speech candidate, and uses it for translation processing of the original sentence.

8. In the compound word processing unit, when the number of constituent elements is 3 words or more and does not match the fifth translation processing, two words from the left of the constituent element are extracted and the first word is extracted.
When the processing of the first translation processing to the fourth translation processing is performed and any one of them is matched, the processing of the first translation processing to the fourth translation processing is performed again by a combination of the portion and one word on the right side. It has a sixth translation processing means for performing the processing, repeating the processing up to the last component, generating a translated word from the result, assigning it to the entire compound word together with the part-of-speech candidate, and utilizing it for the translation processing of the original sentence. The machine translation device according to claim 1.

9. In the compound word processing section, when the number of constituent elements is three or more and does not match any of the fifth translation processing and the sixth translation processing, the entire compound word is grouped into one group. And a second unknown word processing means for processing an unknown word as.