JPH06295311A - Machine translation system - Google Patents

Machine translation system

Info

Publication number
JPH06295311A
JPH06295311A JP5106051A JP10605193A JPH06295311A JP H06295311 A JPH06295311 A JP H06295311A JP 5106051 A JP5106051 A JP 5106051A JP 10605193 A JP10605193 A JP 10605193A JP H06295311 A JPH06295311 A JP H06295311A
Authority
JP
Japan
Prior art keywords
translation
processing
word
compound word
unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP5106051A
Other languages
Japanese (ja)
Inventor
Sakiko Honma
咲子 本間
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ricoh Co Ltd
Original Assignee
Ricoh Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ricoh Co Ltd filed Critical Ricoh Co Ltd
Priority to JP5106051A priority Critical patent/JPH06295311A/en
Publication of JPH06295311A publication Critical patent/JPH06295311A/en
Pending legal-status Critical Current

Links

Landscapes

  • Machine Translation (AREA)

Abstract

PURPOSE:To improve the translation accuracy of a sentence including an unregistered composite word by referring to a dictionary for the meaning of the unregistered composite word in each its constitutional element and applying proper translation information in accordance with the internal structure of the composite word obtained from the reference of the dictionary. CONSTITUTION:A composite word detecting part 15 detects a composite word combined through a specific character such as a hyphen and a composite word processing part 16 divides the detected composite word into individual constitutional elements by using the specific character as a delimiter and applies translation information such as a part of speech and a translated word to the whole composite word in accordance with the internal structure obtained from the reference of the dictionary. When the composite word detected by the detecting part 15 has no entry as a whole composite word and morphem processing such as affix processing and numeral processing for the composite word is not executed by a translation processing part 17, the processing part 16 gives proper translation information in accordance with the number of constitutional words of the composite word, speech part information obtained by the reference of the dictionary for the meaning of respective constitutional elements and the combination of the number and the information to utilize the information for the translation processing of the original sentence.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【技術分野】本発明は、機械翻訳装置に関し、より詳細
には、未登録複合語を構成要素毎に辞書引きし、その結
果から得られる複合語内部の構造に応じて適切な翻訳情
報を付与することで、未登録複合語を含む文の翻訳精度
を上げるようにした機械翻訳装置に関する。例えば、未
登録複合語の処理機能を有する機械翻訳装置に適用され
るものである。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a machine translation device, and more specifically, it extracts a dictionary of unregistered compound words for each constituent element and adds appropriate translation information according to the internal structure of the compound word obtained from the result. By doing so, the present invention relates to a machine translation device that improves the translation accuracy of a sentence including an unregistered compound word. For example, it is applied to a machine translation device having a function of processing unregistered compound words.

【0002】[0002]

【従来技術】機械翻訳などの自然言語処理において、シ
ステム辞書に登録されていない未知語の処理は重要な課
題である。特に複合語は、新造力(productivity)に富
んでいるため、その全てをシステム辞書に追加すること
は、辞書の容量を膨大にする上、実際上不可能である。
複合語全体が未登録であっても、構成要素の中に特に新
造力が強く、接辞辞書に登録されているものがあれば、
形態素処理で訳語や品詞などが付与される。また、数詞
など形態上の特徴が強いものは、やはり形態素処理でま
とめ上げることが可能である。辞書に未登録で、かつ形
態素処理を受けるパターンにも該当しない複合語の処理
法としては、複合語全体を未知語として処理し、幾つか
の品詞候補を与えて原文のまま訳出する方法がある。し
かし、ハイフンなどの特殊文字で結合された複合語は、
結合のパターンや構成要素間の意味関係にある程度の法
則性があり、辞書に未登録の場合でも、その法則に応じ
て訳語や品詞などの推定処理を行うことが望ましい。
2. Description of the Related Art In natural language processing such as machine translation, processing of unknown words not registered in a system dictionary is an important issue. In particular, since compound words are rich in productivity, it is practically impossible to add all of them to the system dictionary, because the capacity of the dictionary is enormous.
Even if the entire compound word is not registered, if there is something that has a particularly strong new creativity and is registered in the affix dictionary,
Morphological processing adds translations and parts of speech. Also, those with strong morphological characteristics such as numbers can be put together by morphological processing. As a processing method of a compound word that is not registered in the dictionary and does not correspond to a pattern that is subjected to morphological processing, there is a method of processing the entire compound word as an unknown word, giving some part-of-speech candidates, and translating the original text as it is. . However, compound words joined by special characters such as hyphens are
There is a certain degree of rule in the connection pattern and the semantic relationship between the constituents, and even if it is not registered in the dictionary, it is desirable to perform the process of estimating the translated word and the part of speech according to the rule.

【0003】このような問題を解決しようとする従来技
術としては、下記の3つがある。まず、特開平3−48
366号公報のものは、複合語内部を入力されたままの
語順で構文解析したり、ハイフンをスペースに置き換え
て原文全体をそのまま構文解析して処理しようとするも
のである。しかし、複合語の語順は通常文の語順と同じ
であるとは限らない。例えば、”money-saving”は通常
の語順とは逆で、目的語が動詞の前に来ており、これ
を”money saving”として解析しても正しい結果は得ら
れない。また、ハイフンをスペースに置き換えて、その
他の部分と区別せずに構文解析を行った場合、複合語の
構成要素が他の部分と紛れてしまうため、誤った解析を
行う可能性が高く、生成された訳文の理解容易性が非常
に低くなってしまう。
There are the following three conventional techniques for solving such a problem. First, JP-A-3-48
Japanese Patent No. 366 discloses an attempt to parse the inside of a compound word in the word order as it is input, or replace hyphens with spaces to parse and process the entire original sentence as it is. However, the word order of compound words is not always the same as the word order of ordinary sentences. For example, "money-saving" is the reverse of the normal word order, the object comes before the verb, and even if this is analyzed as "money saving", the correct result cannot be obtained. Also, if you replace the hyphen with a space and perform syntactic analysis without distinguishing it from other parts, the constituent elements of the compound word will be confused with other parts, so there is a high possibility that incorrect parsing will occur, and The intelligibility of the translated text is very low.

【0004】また、特開平4−262459号公報、及
び特開平2−281374号公報のものは、「名詞−分
詞」などの限られたパターンに対し、一意に訳出パター
ンを決めて、個々の構成要素の辞書引き結果から複合語
全体の訳語を生成しようとするものである。しかし、入
力パターンに対して一意の訳出パターンのみを設定して
いるため、どちらの手法を用いても次の例のような問題
が生じる。 例) college-trained people (a)「大学で訓練された人々」 (b)「大学が訓練した人々」 (c)「大学を訓練した人々」 (a)は正解釈、(b)は特開平4−262459号公
報での手法による翻訳例、(c)は特開平2−2813
74号公報での手法による翻訳例である。正解はハイフ
ン左側の名詞に場所格の格ラベル「で」を付与しなけれ
ばならないが、(b)においては名詞を一律右側の動詞
の主語として扱い、(c)においては、一律目的語とし
て扱うため、それぞれ誤った格ラベルを付与してしま
う。「名詞−過去分詞」における各構成要素の格関係は
一律ではなく、場所の他に原因、内容、行為者など様々
な関係があり得るが、従来技術では極く限られた場合に
しか対応できない。
In Japanese Patent Laid-Open No. 4-262459 and Japanese Patent Laid-Open No. 2-281374, a translation pattern is uniquely determined for a limited pattern such as "noun-participle", and individual configurations are provided. It is intended to generate the translated word of the entire compound word from the result of the dictionary lookup of the elements. However, since only a unique translation pattern is set for the input pattern, whichever method is used, the following problem occurs. Example) college-trained people (a) "People trained at college" (b) "People trained at college" (c) "People trained at college" (a) is a correct interpretation, (b) is JP-A An example of translation by the method disclosed in Japanese Patent Laid-Open No. 4-262459, (c) is Japanese Patent Laid-Open No. 2813/1990.
This is an example of translation by the method disclosed in Japanese Patent Publication No. 74. In the correct answer, the noun on the left side of the hyphen must be given the case label "de" of the place case, but in (b) the noun is treated as the subject of the verb on the right side, and in (c) it is treated as the uniform object. Therefore, the wrong case label is given to each. The case relation of each component in the "noun-past participle" is not uniform, and there may be various relations such as cause, content, actor, etc. in addition to the place, but the conventional technology can handle only limited cases. .

【0005】更に上記の従来技術では、対応している入
力パターンも少なく、右側が分詞でも左側が形容詞であ
る場合などには対応できず、構成要素が3語以上の場合
にも対応していない。特に特開平2−281374号公
報で対応しているのは「名詞−分詞」のみであり、比較
的出現頻度の高い、左側が副詞であったり、右側が名詞
であったりするパターンには全く対応していない。
Further, the above-mentioned conventional technique does not support a small number of input patterns, and cannot handle cases where the right side is a participle and the left side is an adjective, and the case where the constituent elements are three words or more is not supported. . In particular, Japanese Unexamined Patent Publication No. 2-281374 only deals with "noun-participle", and corresponds to patterns with relatively high frequency of appearance, where the left side is an adverb or the right side is a noun. I haven't.

【0006】[0006]

【目的】本発明は、上述のごとき実情に鑑みてなされた
もので、未登録複合語を構成要素毎に辞書引きし、その
結果から得られる複合語内部の構造に応じて適切な翻訳
情報を付与することで、未登録複合語を含む文の翻訳精
度を上げること、特に、複合語の構成要素数や、構成要
素中のキーとなる語の品詞、およびその有無によって処
理を分け、精度の向上を図ること、さらに、キーとなる
語を持たない複合語は、解析失敗や誤解析を避けるため
に全体を一つの未知語として扱うようにした機械翻訳装
置を提供することを目的としてなされたものである。
[Purpose] The present invention has been made in view of the above-mentioned circumstances, and searches an unregistered compound word into a dictionary for each constituent element, and obtains appropriate translation information according to the internal structure of the compound word obtained from the result. By adding it, the translation accuracy of sentences including unregistered compound words can be improved, and in particular, the processing can be divided according to the number of constituent elements of the compound word, the part of speech of the key word in the constituent elements, and the presence or absence of the The purpose was to improve and to provide a machine translation device that treats compound words that do not have a key word as one unknown word in order to avoid parsing failures and erroneous analysis. It is a thing.

【0007】[0007]

【構成】本発明は、上記目的を達成するために、(1)
少なくとも、原文を入力するための入力部と、該入力部
により入力された原文を記憶する原文記憶部と、前記原
文の翻訳処理に用いる知識情報を格納した翻訳辞書部
と、前記知識情報を用いて、前記原文記憶部に記憶され
た原文を翻訳処理する翻訳処理部と、該翻訳処理部によ
る翻訳処理によって得られた訳文を、原文に対応させて
記憶する訳文記憶部と、少なくとも前記原文または訳文
を表示する表示部および表示制御部と、該表示部に表示
された原文および訳文に対する編集処理を制御する編集
制御部と、入力された原文からハイフンなどの特殊文字
をはさんで結合された複合語を検出する複合語検出部
と、該複合語検出部で検出された複合語を、特殊文字を
デリミタとして個々の構成要素に分けて辞書引きし、そ
の結果から得られる内部構造に応じて複合語全体に対し
て品詞や訳語等の翻訳情報を与える複合語処理部とから
成り、前記複合語検出部で検出された複合語が、前記翻
訳処理部に全体としてのエントリを持たず、また翻訳処
理部において接辞処理、数詞処理などの形態素処理を受
けなかった場合には、前記複合語処理部において、複合
語の構成要素の数や、各構成要素を辞書引きして得られ
る品詞情報とその組合せなどに応じて適切な翻訳情報を
与え、原文の翻訳処理に利用すること、更には(2)前
記複合語処理部において、複合語の構成要素数が2語
で、かつハイフンの左側構成要素に品詞が副詞である場
合は、右側構成要素を副詞の係り先として複合語内部を
翻訳処理し、その結果から訳語を生成し、品詞候補と併
せて複合語全体に付与し、原文の翻訳処理に利用する第
1の翻訳処理手段を有すること、更には(3)前記複合
語処理部において、複合語の構成要素数が2語で、前記
第1の翻訳処理に該当せず、かつハイフンの右側構成要
素の品詞が動詞の現在分詞の場合は、左右の要素の語順
を入れ替えて複合語内部を翻訳処理し、その結果から訳
語を生成し、品詞候補と併せて複合語全体に付与し、原
文の翻訳処理に利用する第2の翻訳処理手段を有するこ
と、更には(4)前記複合語処理部において、複合語の
構成要素数が2語で、前記第1の翻訳処理に該当せず、
かつハイフンの右側構成要素の品詞が動詞の過去分詞の
場合は、前記翻訳辞書部の情報に基づいて、該動詞の格
フレームと左側構成要素の素性とのマッチングを行い、
適当な前置詞を補って複合語内部を翻訳処理し、その結
果から訳語を生成し、品詞候補と併せて複合語全体に付
与し、原文の翻訳処理に利用する第3の翻訳処理手段を
有すること、更には(5)前記複合語処理部において、
複合語の構成要素数が2語で、前記第1の翻訳処理に該
当せず、かつハイフンの右側構成要素が名詞の場合は、
複合語内部を翻訳処理し、その結果から訳語を生成し、
品詞候補と併せて複合語全体に付与し、原文の翻訳処理
に利用する第4の翻訳処理手段を有すること、更には
(6)前記複合語処理部において、構成要素数が2語
で、かつ前記第1の翻訳処理から第4の翻訳処理のいず
れにも該当しない場合は、複合語全体を一つのまとまり
として未知語処理する第1の未知語処理手段を有するこ
と、更には(7)前記複合語処理部において、構成要素
数が3語以上で、かつ構成要素のうち1つの品詞が等位
接続詞である場合は、複合語内部を並列句として翻訳処
理し、その結果から訳語を生成し、品詞候補と併せて複
合語全体に付与し、原文の翻訳処理に利用する第5の翻
訳処理手段を有すること、更には(8)前記複合語処理
部において、構成要素数が3語以上で、かつ前記第5の
翻訳処理にマッチしない場合は、構成要素の左から2語
を取り出して、前記第1の翻訳処理から第4の翻訳処理
の処理を行い、いずれかにマッチした場合は、該部分と
更に右側の1語との組合せにより再度前記第1の翻訳処
理から第4の翻訳処理の処理を行い、これを最後の構成
要素まで繰り返し、その結果から訳語を生成し、品詞候
補と併せて複合語全体に付与し、原文の翻訳処理に利用
する第6の翻訳処理手段を有すること、更には(9)前
記複合語処理部において、構成要素数が3語以上で、か
つ第5の翻訳処理又は第6の翻訳処理のいずれにもマッ
チしない場合には、複合語全体を一つのまとまりとして
未知語処理する第2の未知語処理手段を有することを特
徴としたものである。以下、本発明の実施例に基づいて
説明する。
In order to achieve the above object, the present invention provides (1)
At least an input unit for inputting an original sentence, an original sentence storage unit for storing the original sentence input by the input unit, a translation dictionary unit for storing knowledge information used for translation processing of the original sentence, and the knowledge information are used. A translation processing unit that translates the original sentence stored in the original sentence storage unit, a translation storage unit that stores the translated sentence obtained by the translation processing by the translation processing unit in association with the original sentence, and at least the original sentence or The display unit and the display control unit that display the translated text, the edit control unit that controls the editing process for the original sentence and the translated text displayed on the display unit, and the input source sentence are connected by a special character such as a hyphen. A compound word detection unit that detects a compound word, and a compound word detected by the compound word detection unit is divided into individual constituent elements using special characters as delimiters, and a dictionary is created. A compound word processing unit that gives translation information such as a part of speech or a translated word to the entire compound word according to the structure, and the compound word detected by the compound word detection unit is registered in the translation processing unit as a whole. If it does not have it and it has not undergone morpheme processing such as affix processing or number processing in the translation processing unit, the compound word processing unit obtains the number of constituent elements of the compound word and a dictionary lookup for each constituent element. Appropriate translation information is provided according to the part-of-speech information and the combination thereof, which is used for the translation process of the original sentence. Furthermore, (2) the compound word processing unit has two compound elements, and When the part of speech is an adverb in the left component of the hyphen, the inside of the compound word is translated using the right component as the adjunct of the adverb, a translated word is generated from the result, and it is given to the entire compound word together with the part of speech candidate. , Original text translation process And (3) the compound word processing unit has two compound elements, does not correspond to the first translation process, and is on the right side of the hyphen. When the part-of-speech of a constituent element is the present participle of a verb, the word order of the left and right elements is exchanged, and the inside of the compound word is translated, and a translated word is generated from the result, which is given to the entire compound word together with the part-of-speech candidate. Second translation processing means used for the translation processing of, and (4) in the compound word processing unit, the number of constituent elements of the compound word is two, which does not correspond to the first translation processing,
And when the part of speech of the right component of the hyphen is a past participle of the verb, based on the information of the translation dictionary part, the case frame of the verb and the feature of the left component are matched,
Having a third translation processing means for translating the inside of the compound word by supplementing an appropriate preposition, generating a translated word from the result, adding it to the entire compound word together with a part-of-speech candidate, and utilizing it for the translation processing of the original sentence. , And (5) In the compound word processing section,
When the number of constituent elements of the compound word is two, does not correspond to the first translation process, and the right-side constituent element of the hyphen is a noun,
Translate the inside of the compound word, generate a translated word from the result,
It has a fourth translation processing means that is given to the entire compound word together with the part-of-speech candidate and is used for the translation processing of the original sentence. Furthermore, (6) the compound word processing section has two constituent elements, and If the first translation process to the fourth translation process do not apply, a first unknown word processing unit that processes the entire compound word as an unknown word is further provided, and further (7) the above In the compound word processing unit, when the number of constituent elements is three or more and one of the constituent elements is a coordinate conjunction, translation processing is performed inside the compound word and a translated word is generated from the result. And a fifth translation processing unit which is added to the entire compound word together with the part-of-speech candidate and is used for the translation processing of the original sentence. Furthermore, (8) the compound word processing unit has three or more constituent elements. , And does not match the fifth translation process In the case, two words from the left of the constituent element are taken out, the processing from the first translation processing to the fourth translation processing is performed, and if any of them is matched, the portion is combined with one word on the further right side. Then, the processes from the first translation process to the fourth translation process are performed again, and this process is repeated up to the last component, and a translated word is generated from the result, and is added to the entire compound word together with the part-of-speech candidate. Having a sixth translation processing means used for the translation processing, and further (9) the compound word processing section having three or more constituent elements, whichever of the fifth translation processing or the sixth translation processing. If there is no match, the second unknown word processing means for processing the unknown word as a unit of the entire compound word is featured. Hereinafter, description will be given based on examples of the present invention.

【0008】図1は、本発明による機械翻訳装置の一実
施例を説明するための構成図で、図中、11は入力部、
12は原文記憶部、13は訳文記憶部、14は編集制御
部、15は複合語検出部、16は複合語処理部、17は
翻訳部(翻訳処理部)、18は表示制御部、19は表示
部である。入力部11から入力された原文は原文記憶部
12に送られる。翻訳部17は、編集制御部14の制御
下で記憶部12の原文を翻訳処理する。
FIG. 1 is a block diagram for explaining an embodiment of a machine translation apparatus according to the present invention, in which 11 is an input unit,
Reference numeral 12 is an original sentence storage unit, 13 is a translated sentence storage unit, 14 is an edit control unit, 15 is a compound word detection unit, 16 is a compound word processing unit, 17 is a translation unit (translation processing unit), 18 is a display control unit, and 19 is It is a display unit. The original text input from the input unit 11 is sent to the original text storage unit 12. The translation unit 17 translates the original sentence in the storage unit 12 under the control of the editing control unit 14.

【0009】すなわち、入力部11により入力された原
文は、原文記憶部12に記憶される。翻訳辞書部には前
記原文の翻訳処理に用いられる知識情報が格納されてい
る。翻訳処理部17は、前記翻訳辞書部の知識情報を用
いて、前記原文記憶部12に記憶された原文を翻訳処理
する。訳文記憶部13は、翻訳処理によって得られた訳
文を原文に対応させて記憶する。表示制御部18及び表
示部9は、少なくとも前記原文または訳文を表示する。
編集制御部14は、表示された原文および訳文に対する
編集処理を制御する。複合語検出部15は、入力された
原文からハイフンなどの特殊文字をはさんで結合された
複合語を検出する。複合語処理部16は、前記複合語検
出部15で検出された複合語を、特殊文字をデリミタと
して個々の構成要素に分けて辞書引きし、その結果から
得られた内部構造に応じて複合語全体に対して品詞や訳
語等の翻訳情報を与える。このような構成により、前記
複合語検出部15で検出された複合語が、翻訳辞書部に
全体としてのエントリを持たず、また翻訳処理部17に
おいて接辞処理、数詞処理などの形態素処理を受けなか
った場合には、前記複合語処理部16において、複合語
の構成要素の数や、各構成要素を辞書引きして得られる
品詞情報とその組合せなどに応じて適切な翻訳情報を与
え、原文の翻訳処理に利用する。
That is, the original sentence input by the input unit 11 is stored in the original sentence storage unit 12. Knowledge information used for the translation process of the original sentence is stored in the translation dictionary unit. The translation processing unit 17 uses the knowledge information of the translation dictionary unit to translate the original sentence stored in the original sentence storage unit 12. The translated text storage unit 13 stores the translated text obtained by the translation processing in association with the original text. The display control unit 18 and the display unit 9 display at least the original sentence or the translated sentence.
The editing control unit 14 controls the editing process for the displayed original sentence and translated sentence. The compound word detection unit 15 detects a compound word combined with a special character such as a hyphen from the input original sentence. The compound word processing unit 16 divides the compound word detected by the compound word detecting unit 15 into individual constituent elements by using special characters as delimiters and looks up the dictionary, and according to the internal structure obtained from the result, the compound word is extracted. Give translation information such as parts of speech and translations to the whole. With such a configuration, the compound word detected by the compound word detecting unit 15 does not have an entry in the translation dictionary unit as a whole, and the translation processing unit 17 is not subjected to morpheme processing such as affix processing or number processing. In such a case, the compound word processing unit 16 provides appropriate translation information according to the number of constituent elements of the compound word, the part-of-speech information obtained by looking up each constituent element in the dictionary, and the combination thereof, and Used for translation processing.

【0010】図2は、図1における翻訳部の処理を英日
機械翻訳を例にして示した構成図で、図中、21は形態
素解析部、22は構文解析部、23は変換部、24は生
成部、25は辞書、26は文法規則である。まず、入力
文である英文は、形態素解析部21で、辞書25の情報
をもとに個々の単語に認定される。また接辞辞書などを
参照して、システム辞書に未登録である語の一部に対
し、品詞や訳語などの情報を与える。次に構文解析部2
2で文法規則26を適用して文構造が決定され、係り受
け関係などを明らかにした木構造が作られる。該木構造
は、変換部23で出力言語である日本語の構造に即した
木に変換され、生成部24でノード毎に訳出される。
FIG. 2 is a block diagram showing the processing of the translation unit in FIG. 1 by taking English-Japanese machine translation as an example. In the figure, 21 is a morphological analysis unit, 22 is a syntax analysis unit, 23 is a conversion unit, and 24 is a conversion unit. Is a generation unit, 25 is a dictionary, and 26 is a grammar rule. First, the English sentence which is the input sentence is recognized as an individual word by the morphological analysis unit 21 based on the information in the dictionary 25. Further, by referring to an affix dictionary or the like, information such as a part of speech or a translated word is given to some of the words that are not registered in the system dictionary. Next, the parser 2
In step 2, the grammar rule 26 is applied to determine the sentence structure, and a tree structure that defines the dependency relations is created. The tree structure is converted by the conversion unit 23 into a tree conforming to the structure of the output language Japanese, and the generation unit 24 translates the tree for each node.

【0011】複合語検出部15は原文中からハイフンな
どの特殊文字によって結合された複合語を切り出す。複
合語処理部16では、形態素解析部21での処理結果を
受け、切り出された複合語が辞書引き情報や接辞処理情
報を持たない場合は、特殊文字をデリミタとして構成要
素単位で辞書引きし、その結果に応じて品詞や訳語など
の翻訳情報を与え、構文解析部22以降の処理に渡す。
得られた訳文は順次訳文記憶部13に格納される。編集
制御部14は表示制御部18を駆動し、原文記憶部12
に格納された英文と、訳文記憶部13に格納された日本
語文を対応付けて表示部19において表示する。ユーザ
はこの表示を見ながら後編集処理を行う。後編集処理
は、入力部11から入力される制御情報に従い、編集制
御部14で実行される。
The compound word detecting unit 15 cuts out compound words combined by special characters such as hyphens from the original sentence. The compound word processing unit 16 receives the processing result of the morpheme analysis unit 21, and when the cut out compound word does not have the dictionary lookup information or the affix processing information, the special character is used as a delimiter in the dictionary to construct the dictionary. According to the result, translation information such as a part-of-speech and a translated word is given and passed to the processing of the syntax analysis unit 22 and thereafter.
The obtained translated texts are sequentially stored in the translated text storage unit 13. The edit control unit 14 drives the display control unit 18, and the original sentence storage unit 12
The English sentence stored in the and the Japanese sentence stored in the translated sentence storage unit 13 are displayed on the display unit 19 in association with each other. The user performs post-editing processing while looking at this display. The post-editing process is executed by the edit control unit 14 according to the control information input from the input unit 11.

【0012】図3は、図1における複合語処理部におけ
る複合語処理のうち、構成要素数による振り分け処理ま
でのフローチャートである。以下、各ステップに従って
順に説明する。ハイフンなどによって区切られた複合語
を検出し(S31)、形態素解析を行う(S32)。辞
書エントリを持たず、接辞処理等も受けなかった場合は
(S33〜S34)、個々の語を辞書引きし(S3
5)、構成要素数をカウントする(S36)。構成要素
数が2語であれば(a)の処理へ、3語以上であれば
(b)の処理へ送る(S37)。
FIG. 3 is a flowchart of the compound word processing in the compound word processing section in FIG. 1 up to the distribution processing according to the number of constituent elements. Hereinafter, each step will be described in order. A compound word delimited by hyphens or the like is detected (S31), and morphological analysis is performed (S32). If it does not have a dictionary entry and is not subjected to affix processing or the like (S33 to S34), the individual words are looked up in the dictionary (S3).
5) The number of components is counted (S36). If the number of constituent elements is 2 words, it is sent to the processing of (a), and if it is 3 words or more, it is sent to the processing of (b) (S37).

【0013】図4は、未登録複合語の個々の構成要素を
辞書引きした結果の例である。図5は、構成要素数が2
語である場合の、構成要素の品詞候補による振り分け処
理のフローチャートである。以下、各ステップに従って
順に説明する。左側構成要素の品詞候補を検索し(S5
1)、その中に副詞がある場合は、図中の(c)の処理
へ送る(S52)。副詞がない場合は右側構成要素の品
詞候補を検索し(S53)、その中に現在分詞がある場
合は、図中の(d)の処理へ送る(S54)。現在分詞
がない場合は、過去分詞があるかをチェックし、過去分
詞があれば、図中の(e)の処理へ送る(S55)。な
ければ、次は名詞があるかどうかをチェックし、名詞あ
れば、図中の(f)の処理へ送る(S56)。名詞なけ
れば未知語処理を行い、構文解析へ送る(S57)。な
おこのフローチャートでは、流れに沿って処理を行え
ば、構成要素が多品詞語であるの場合にも対応できる
が、別に多品詞の優先順位情報を設け、それに応じて処
理を行うことも可能である。(以下余白)
FIG. 4 shows an example of the result obtained by looking up the individual constituent elements of the unregistered compound word in the dictionary. In FIG. 5, the number of components is two.
9 is a flowchart of a process of allocating a part of speech candidate of a component in the case of a word. Hereinafter, each step will be described in order. Search for a part-of-speech candidate for the left component (S5
1) If there is an adverb in it, it is sent to the process of (c) in the figure (S52). If there is no adverb, the part-of-speech candidate of the right component is searched (S53), and if there is a current participle, it is sent to the process of (d) in the figure (S54). If there is no present participle, it is checked if there is a past participle, and if there is a past participle, it is sent to the process of (e) in the figure (S55). If not, next, it is checked whether or not there is a noun, and if there is a noun, it is sent to the process of (f) in the figure (S56). If there is no noun, unknown word processing is performed and sent to the syntactic analysis (S57). It should be noted that in this flowchart, if the process is performed in accordance with the flow, it is possible to deal with the case where the component is a multi-part-of-speech word, but it is also possible to separately provide priority information of the multi-part-of-speech and perform the process accordingly. is there. (Below margin)

【0014】図6は、左側構成要素が副詞である場合の
処理フローチャートである。以下、各ステップに従って
順に説明する。副詞の係り先を右側構成要素として翻訳
処理を行い(S61)、解析成功であれば(S62)、
複合語全体を形容詞とし、得られた訳語を与えて構文解
析に送る(S63)。解析失敗であれば(S62)、図
5(g)に戻り、S53以降の処理を行う。図4の例で
はW1がこれに該当する。左側構成要素である”fastes
t”は品詞候補に副詞を含むので、右側の”growing”を
係り先として翻訳処理を行い、「最も早く成長する」と
いう訳語を得る。品詞を形容詞として翻訳を行った例が
図12のS1である。
FIG. 6 is a processing flow chart when the left component is an adverb. Hereinafter, each step will be described in order. Translation processing is performed using the adverb's destination as the right component (S61), and if the analysis is successful (S62),
The entire compound word is used as an adjective, the obtained translated word is given and sent to the syntactic analysis (S63). If the analysis is unsuccessful (S62), the process returns to FIG. In the example of FIG. 4, W1 corresponds to this. Left side component "fastes"
Since "t" includes an adverb in the part-of-speech candidate, the translation process is performed with "growing" on the right as the target, and the translated word "growing fastest" is obtained. An example of translation using a part of speech as an adjective is S1 in FIG.

【0015】図7は、右側構成要素が現在分詞である場
合の処理フローチャートである。以下、各ステップに従
って順に説明する。区切り文字の左右を入れ替えて翻訳
処理を行い(S71)、解析成功であれば(S72)、
複合語全体を形容詞とし、得られた訳語を与えて構文解
析に送る(S73)。解析失敗であれば(S72)、図
5(h)に戻り、S55以降の処理を行う。図4の例で
はW2,W3がこれに該当する。右側構成要素である”
threatening””looking”は共に品詞候補に現在分詞を
含むので、左右を入れ替え、それぞれ”threatening li
fe””looking wistful”として翻訳処理を行う。前者
は他動詞、後者は自動詞として解析されそれぞれ「生命
を脅かす」「物欲しそうに見える」という訳語を得る。
品詞を形容詞として翻訳を行った例が図12のS2,S
3である。
FIG. 7 is a processing flowchart in the case where the right side component is the present participle. Hereinafter, each step will be described in order. The left and right of the delimiter are replaced and the translation processing is performed (S71). If the analysis is successful (S72),
The entire compound word is used as an adjective, the obtained translated word is given and sent to the syntactic analysis (S73). If the analysis is unsuccessful (S72), the process returns to FIG. In the example of FIG. 4, this corresponds to W2 and W3. The right component "
The threatening “” looking ”both include the present participle in the part-of-speech candidate, so swap the left and right to“ threatening li
The translation process is performed as fe “” looking wistful. ”The former is analyzed as a transitive verb, and the latter is analyzed as an intransitive verb to obtain the translated words“ life threatening ”and“ looks greedy ”.
An example of translation using a part of speech as an adjective is S2 and S in FIG.
It is 3.

【0016】図8は、右側構成要素が過去分詞である場
合の処理フローチャートである。以下、各ステップに従
って順に説明する。右側要素の格フレームを検索し(S
81)、格フレームがあれば(S82)、左側要素の素
性とマッチングを行う(S83)。格フレームがなけれ
ば(S82)、図5(i)に戻り、S56以降の処理を
行う。左側構成要素の素性とマッチするものが格フレー
ムの要素にあり(S84)、その要素がOBJ以外であ
れば(S85)、SUBJの場合は”by”を補い(S8
6,87)、それ以外の場合は格フレームに記載された
前置詞を補って訳語を生成し(S88)、複合語全体の
品詞を形容詞として構文解析に送る(S89)。マッチ
するものがなかったり(S84)、マッチした要素がO
BJであった場合は(S85)、図5(i)に戻り、S
56以降の処理を行う。
FIG. 8 is a processing flowchart when the right component is a past participle. Hereinafter, each step will be described in order. Search the case frame of the right element (S
81), if there is a case frame (S82), the feature of the left element is matched (S83). If there is no case frame (S82), the process returns to FIG. There is an element of the case frame that matches the feature of the left component (S84), and if that element is other than OBJ (S85), in the case of SUBJ, "by" is supplemented (S8).
6, 87), in other cases, a preposition written in the case frame is supplemented to generate a translated word (S88), and the part of speech of the entire compound word is sent as an adjective to the syntactic analysis (S89). There is no match (S84), or the matched element is O
If it is BJ (S85), the process returns to FIG.
The processing after 56 is performed.

【0017】図4の例ではW4,W5がこれに該当す
る。右側構成要素である”owned””loaded”は共に品
詞候補に過去分詞を含む。”own””load”の格フレー
ムの例は図9(a),(b)に示す。”government”
は”own”のSUBJと素性ORGが一致するので、”b
y”を補って”owned by government”で翻訳処理を行
い、「政府によって所有された」という訳語を得る。ま
た”propane”は”load”のWITHと素性MATが一
致するので、”with”を補って”loaded with propan
e”で翻訳処理を行い、「プロパンを積まれた」という
訳語を得る。品詞を形容詞として翻訳を行った例が図1
2のS4,S5である。
In the example of FIG. 4, W4 and W5 correspond to this. "Owned" and "loaded", which are the right components, both include the past participle as a part-of-speech candidate. Examples of "own" and "load" case frames are shown in FIGS. 9 (a) and 9 (b). "Government"
Has the feature ORG that matches the SUBJ of "own", so "b"
The translation process of "owned by government" is supplemented with "y" to obtain the translated word "owned by the government". In addition, since "propane" has the same feature MAT as the "load" WITH, "with" is supplemented by "loaded with propan".
The translation process is performed by "e", and the translated word "loaded with propane" is obtained. Figure 1 shows an example of translation using part-of-speech as an adjective.
2 are S4 and S5.

【0018】図10は、右側構成要素が名詞である場合
の処理フローチャート出ある。以下、各ステップに従っ
て順に説明する。左側構成要素を前置修飾要素として翻
訳処理を行い(S101)、解析成功であれば(S10
2)、複合語全体の品詞として形容詞と名詞を付与し、
形容詞の場合は得られた訳語に「の」を加え、名詞の場
合は得られた訳語をそのまま与えて構文解析に送る(S
83)。解析失敗であれば(S102)、図5(j)に
戻り、以降の処理を行う。図4の例ではW6がこれに該
当する。左側構成要素である”product”は品詞候補に
名詞を含むので、”new product”で翻訳処理を行い、
「新しい製品(の)」という訳語を得る。品詞を形容詞
または名詞として翻訳を行った例が図12のS6であ
る。この例では形容詞の候補が取られ、「新しい製品
の」と訳出される。
FIG. 10 shows a processing flowchart when the right component is a noun. Hereinafter, each step will be described in order. Translation processing is performed using the left-side component as a prefix modifier (S101), and if the analysis is successful (S10).
2), add adjectives and nouns as part-of-speech of all compound words,
In the case of an adjective, "no" is added to the obtained translated word, and in the case of a noun, the obtained translated word is given as it is and sent to the syntactic analysis (S
83). If the analysis is unsuccessful (S102), the process returns to FIG. In the example of FIG. 4, W6 corresponds to this. The left component "product" contains a noun in the part-of-speech candidate, so perform translation processing with "new product".
Get the translation "new product". An example of translation using a part of speech as an adjective or noun is S6 in FIG. In this example, the adjective candidates are taken and translated as "new product".

【0019】図11は、構成要素が3語以上の場合の処
理フローチャートである。以下、各ステップに従って順
に説明する。構成要素の品詞候補を検索し(S11
1)、その中に等位接続詞を含むものがあれば(S11
2)、複合語内部を翻訳し(S113)、解析成功であ
れば(S114)、訳語と品詞(形容詞、名詞)を与え
て構文解析に渡す(S115)。解析失敗であれば(S
114)、未知語処理を行って構文解析に渡す(S11
6)。図4の例ではW7がこれに該当する。構成要素中
の”and”は品詞候補に等位接続詞を含むので、”husba
nd and wife”で翻訳処理を行い、「夫と妻(の)」と
いう訳語を得る。品詞を形容詞または名詞として翻訳を
行った例が図12のS7である。この例では形容詞の候
補が取られ、「夫と妻の」と訳出される。
FIG. 11 is a processing flowchart when the number of constituent elements is three or more. Hereinafter, each step will be described in order. The part-of-speech candidate of the component is searched (S11
1), if any of them contains coordinate conjunctions (S11)
2) The inside of the compound word is translated (S113), and if the analysis is successful (S114), the translated word and the part of speech (adjective, noun) are given and passed to the syntactic analysis (S115). If the analysis fails (S
114), unknown word processing is performed, and the result is passed to the syntax analysis (S11).
6). In the example of FIG. 4, W7 corresponds to this. Since "and" in the constituent elements includes coordinate conjunctions in the part-of-speech candidates, "husba"
"nd and wife" is translated and the translated word "husband and wife (no)" is obtained. An example of translation using a part of speech as an adjective or noun is S7 in FIG. In this example, adjective candidates are taken and translated as "husband and wife's".

【0020】品詞候補の中に等位接続詞がなければ(S
112)、左端から2語を取って(S117)構成要素
が2語の場合の処理(S51〜56)を行う(S11
8)。処理語、未知語と判定されれば(S119)、複
合語全体を未知語として構文解析へ送る(S116)。
未知語ではなく、何らかの品詞と訳語を付与されていれ
ば(S119)、右側の1語を含めて(S121)、更
に2語としてS118の処理を行い、これを最も右側の
構成要素まで繰り返し(S120)、最終結果を複合語
全体の品詞および訳語として構文解析へ送る(S12
2)。
If there is no coordinate conjunction in the part-of-speech candidate (S
112), two words are taken from the left end (S117), and the processing (S51 to 56) when the constituent element is two words is performed (S11).
8). If it is determined that the processed word is an unknown word (S119), the entire compound word is sent to the syntactic analysis as an unknown word (S116).
If it is not an unknown word but some part-of-speech and translation are added (S119), the word on the right side is included (S121), the process of S118 is performed as two more words, and this is repeated up to the rightmost component ( S120), the final result is sent to the syntactic analysis as the part of speech and translation of the entire compound word (S12).
2).

【0021】図4の例ではW8がこれに該当する。ま
ず”flexible metal”でS51〜S56の処理を行う。
この例はS56の条件に当てはまるため、図10の処理
を行い、「柔軟な金属(の)」という訳語を与えられ、
品詞は形容詞または名詞となる。更に右側の”pipe”を
含め、”flexible-metal”と”pipe”の2語としてS5
1〜56の処理を行う。この場合もS56の条件に当て
はまるため、図10の処理を行い、”flexible-metal”
は形容詞と解析され、「柔軟な金属のパイプ(の)」と
いう訳語を得る。品詞を形容詞または名詞として翻訳を
行った例が図12のS8である。この例では形容詞の候
補が取られ、「柔軟な金属のパイプのプラント」と訳出
される。なお、上記の処理において複合語内部の翻訳処
理を行う場合、通常の解析文法を用いても、複合語内部
の統語的特徴に合わせた文法(名詞句に冠詞が付かな
い、前置修飾語は通常1つのみ、など)を用いてもよ
い。図12は、本発明における未登録複合語処理によ
る、図4に挙げた未登録複合語を含む名詞句の翻訳例で
ある。
In the example of FIG. 4, W8 corresponds to this. First, the processing of S51 to S56 is performed with "flexible metal".
Since this example satisfies the condition of S56, the process of FIG. 10 is performed, and the translation of “flexible metal ()” is given.
Part of speech can be an adjective or a noun. Furthermore, including the "pipe" on the right side, S5 as two words "flexible-metal" and "pipe"
The processes 1 to 56 are performed. In this case as well, the condition of S56 is satisfied, so the process of FIG. 10 is performed and "flexible-metal" is performed.
Is parsed as an adjective and gets the translation "flexible metal pipe". An example of translation using a part of speech as an adjective or noun is S8 in FIG. In this example, the adjective candidate is taken and translated as "flexible metal pipe plant". In addition, in the above process, when performing the translation process inside the compound word, even if a normal parsing grammar is used, a grammar that matches the syntactic features inside the compound word (no prefix is attached to the noun phrase, the prefix modifier is Usually only one, etc.) may be used. FIG. 12 is an example of translation of a noun phrase including the unregistered compound words shown in FIG. 4 by the unregistered compound word processing according to the present invention.

【0022】[0022]

【効果】以上の説明から明らかなように、本発明による
と、以下のような効果がある。 (1)請求項1に対応する効果:未登録複合語を構成要
素毎に辞書引きし、その結果から得られる複合語内部の
構造に応じて適切な翻訳情報を付与することで、未登録
複合語を含む文の翻訳精度を上げることができる。 (2)請求項2〜5、7、8に対応する効果:特に、複
合語の構成要素数や、構成要素中のキーとなる語の品
詞、およびその有無によって処理を分け、精度の向上が
可能となる。 (3)請求項6、9に対応する効果:キーとなる語を持
たない複合語は、全体を一つの未知語として扱うことに
より、誤解析や解析失敗を避けることが出来る。
As is apparent from the above description, the present invention has the following effects. (1) Effect corresponding to claim 1: An unregistered compound word is dictionary-divided for each constituent element, and appropriate translation information is added according to the internal structure of the compound word obtained from the result. It is possible to improve the translation accuracy of sentences including words. (2) Effects corresponding to claims 2 to 5, 7, and 8: Particularly, the processing is divided according to the number of constituent elements of the compound word, the part of speech of the key word in the constituent elements, and the presence or absence thereof, thereby improving accuracy. It will be possible. (3) Effects corresponding to claims 6 and 9: A compound word having no key word can be treated as one unknown word to avoid erroneous analysis and analysis failure.

【図面の簡単な説明】[Brief description of drawings]

【図1】 本発明による機械翻訳装置の一実施例を説明
するための構成図である。
FIG. 1 is a configuration diagram for explaining an embodiment of a machine translation device according to the present invention.

【図2】 図1における翻訳部の処理を英日機械翻訳を
例にして示した構成図である。
FIG. 2 is a configuration diagram showing processing of a translation unit in FIG. 1 by taking English-Japanese machine translation as an example.

【図3】 図1における複合語処理部における複合語処
理のうち、構成要素数による振り分け処理までのフロー
チャートである。
FIG. 3 is a flowchart of the compound word processing in the compound word processing unit in FIG. 1 up to the distribution processing according to the number of constituent elements.

【図4】 本発明による未登録複合語の個々の構成要素
を辞書引きした結果の例を示す図である。
FIG. 4 is a diagram showing an example of a result obtained by looking up individual constituent elements of an unregistered compound word according to the present invention in a dictionary.

【図5】 本発明における構成要素数が2語である場合
の構成要素の品詞候補による振り分け処理のフローチャ
ートである。
FIG. 5 is a flow chart of a process of allocating components by part-of-speech candidates when the number of components is 2 in the present invention.

【図6】 本発明における左側構成要素が現在分詞であ
る場合の処理のフローチャートである。
FIG. 6 is a flowchart of processing when the left-side component is the present participle in the present invention.

【図7】 本発明における右側構成要素が現在分詞であ
る場合の処理のフローチャートである。
FIG. 7 is a flowchart of a process when the right-side component is the present participle in the present invention.

【図8】 本発明における右側構成要素が過去分詞であ
る場合の処理のフローチャートである。
FIG. 8 is a flowchart of a process when the right component is a past participle in the present invention.

【図9】 本発明における格フレームの例を示す図であ
る。
FIG. 9 is a diagram showing an example of a case frame according to the present invention.

【図10】 本発明における右側構成要素が名詞である
場合の処理のフローチャートである。
FIG. 10 is a flowchart of a process when the right component is a noun according to the present invention.

【図11】 本発明における構成要素が3語以上の場合
の処理のフローチャートである。
FIG. 11 is a flowchart of a process in the case where the constituent elements of the present invention are three or more words.

【図12】 本発明における未登録複合語処理による翻
訳例を示す図である。
FIG. 12 is a diagram showing an example of translation by unregistered compound word processing according to the present invention.

【符号の説明】[Explanation of symbols]

11…入力部、12…原文記憶部、13…訳文記憶部、
14…編集制御部、15…複合語検出部、16…複合語
処理部、17…翻訳部(翻訳処理部)、18…表示制御
部、19…表示部。
11 ... Input unit, 12 ... Original sentence storage unit, 13 ... Translated sentence storage unit,
Reference numeral 14 ... Edit control unit, 15 ... Compound word detection unit, 16 ... Compound word processing unit, 17 ... Translation unit (translation processing unit), 18 ... Display control unit, 19 ... Display unit.

Claims (9)

【特許請求の範囲】[Claims] 【請求項1】 少なくとも、原文を入力するための入力
部と、該入力部により入力された原文を記憶する原文記
憶部と、前記原文の翻訳処理に用いる知識情報を格納し
た翻訳辞書部と、前記知識情報を用いて、前記原文記憶
部に記憶された原文を翻訳処理する翻訳処理部と、該翻
訳処理部による翻訳処理によって得られた訳文を、原文
に対応させて記憶する訳文記憶部と、少なくとも前記原
文または訳文を表示する表示部および表示制御部と、該
表示部に表示された原文および訳文に対する編集処理を
制御する編集制御部と、入力された原文からハイフンな
どの特殊文字をはさんで結合された複合語を検出する複
合語検出部と、該複合語検出部で検出された複合語を、
特殊文字をデリミタとして個々の構成要素に分けて辞書
引きし、その結果から得られる内部構造に応じて複合語
全体に対して品詞や訳語等の翻訳情報を与える複合語処
理部とから成り、前記複合語検出部で検出された複合語
が、前記翻訳処理部に全体としてのエントリを持たず、
また翻訳処理部において接辞処理、数詞処理などの形態
素処理を受けなかった場合には、前記複合語処理部にお
いて、複合語の構成要素の数や、各構成要素を辞書引き
して得られる品詞情報とその組合せなどに応じて適切な
翻訳情報を与え、原文の翻訳処理に利用することを特徴
とした機械翻訳装置。
1. At least an input unit for inputting an original sentence, an original sentence storage unit for storing the original sentence input by the input unit, and a translation dictionary unit for storing knowledge information used for translation processing of the original sentence. A translation processing unit that translates the original sentence stored in the original sentence storage unit using the knowledge information; and a translation sentence storage unit that stores the translated sentence obtained by the translation process by the translation processing unit in association with the original sentence. , At least the display unit and the display control unit for displaying the original sentence or the translated sentence, the edit control unit for controlling the editing process for the original sentence and the translated sentence displayed on the display unit, and the special characters such as hyphens from the input original sentence. The compound word detection unit for detecting the compound word combined by the above, and the compound word detected by the compound word detection unit,
It consists of a compound word processing unit that divides the special characters into individual components as delimiters and looks up the dictionary, and gives translation information such as parts of speech and translation words to the entire compound word according to the internal structure obtained from the result. The compound word detected by the compound word detection unit does not have an entry in the translation processing unit as a whole,
When the translation processing unit does not undergo morpheme processing such as affix processing and number processing, the compound word processing unit obtains the number of constituent elements of the compound word and part-of-speech information obtained by looking up each constituent element in a dictionary. A machine translation device characterized in that appropriate translation information is given according to the combination and the like, and is used for translation processing of the original sentence.
【請求項2】 前記複合語処理部において、複合語の構
成要素数が2語で、かつハイフンの左側構成要素に品詞
が副詞である場合は、右側構成要素を副詞の係り先とし
て複合語内部を翻訳処理し、その結果から訳語を生成
し、品詞候補と併せて複合語全体に付与し、原文の翻訳
処理に利用する第1の翻訳処理手段を有することを特徴
とする請求項1記載の機械翻訳装置。
2. In the compound word processing unit, when the number of constituent elements of the compound word is two and the left-side component of the hyphen is a part-of-speech as an adverb, the right-side constituent is used as the adverb's target 2. The first translation processing means for translating a sentence, generating a translated word from the result, assigning it to the entire compound word together with a part-of-speech candidate, and utilizing it for the translation processing of the original sentence. Machine translation device.
【請求項3】 前記複合語処理部において、複合語の構
成要素数が2語で、前記第1の翻訳処理に該当せず、か
つハイフンの右側構成要素の品詞が動詞の現在分詞の場
合は、左右の要素の語順を入れ替えて複合語内部を翻訳
処理し、その結果から訳語を生成し、品詞候補と併せて
複合語全体に付与し、原文の翻訳処理に利用する第2の
翻訳処理手段を有することを特徴とする請求項1記載の
機械翻訳装置。
3. In the compound word processing unit, when the number of constituent elements of the compound word is two, which does not correspond to the first translation processing, and the part of speech of the component on the right side of the hyphen is the present participle of the verb, Second translation processing means for translating the inside of the compound word by changing the word order of the left and right elements, generating a translated word from the result, assigning it to the entire compound word together with the part-of-speech candidate, and utilizing it for the translation processing of the original sentence. The machine translation device according to claim 1, further comprising:
【請求項4】 前記複合語処理部において、複合語の構
成要素数が2語で、前記第1の翻訳処理に該当せず、か
つハイフンの右側構成要素の品詞が動詞の過去分詞の場
合は、前記翻訳辞書部の情報に基づいて、該動詞の格フ
レームと左側構成要素の素性とのマッチングを行い、適
当な前置詞を補って複合語内部を翻訳処理し、その結果
から訳語を生成し、品詞候補と併せて複合語全体に付与
し、原文の翻訳処理に利用する第3の翻訳処理手段を有
することを特徴とする請求項1記載の機械翻訳装置。
4. In the compound word processing unit, when the number of constituent elements of the compound word is two, does not correspond to the first translation processing, and the part-of-speech of the component on the right side of the hyphen is the past participle of the verb. , Based on the information of the translation dictionary unit, the case frame of the verb and the feature of the left component are matched, the inside of the compound word is translated with an appropriate preposition, and a translated word is generated from the result, 2. The machine translation device according to claim 1, further comprising a third translation processing unit that is provided to the entire compound word together with the part-of-speech candidate and is used for translation processing of the original sentence.
【請求項5】 前記複合語処理部において、複合語の構
成要素数が2語で、前記第1の翻訳処理に該当せず、か
つハイフンの右側構成要素が名詞の場合は、複合語内部
を翻訳処理し、その結果から訳語を生成し、品詞候補と
併せて複合語全体に付与し、原文の翻訳処理に利用する
第4の翻訳処理手段を有することを特徴とする請求項1
記載の機械翻訳装置。
5. In the compound word processing unit, when the number of constituent elements of the compound word is two, does not correspond to the first translation process, and the right component of the hyphen is a noun, the inside of the compound word is 4. A fourth translation processing unit that performs translation processing, generates a translated word from the result, assigns it to the entire compound word together with a part-of-speech candidate, and uses it for translation processing of the original sentence.
The machine translation device described.
【請求項6】 前記複合語処理部において、構成要素数
が2語で、かつ前記第1の翻訳処理から第4の翻訳処理
のいずれにも該当しない場合は、複合語全体を一つのま
とまりとして未知語処理する第1の未知語処理手段を有
することを特徴とする請求項1記載の機械翻訳装置。
6. In the compound word processing unit, when the number of constituent elements is two and does not correspond to any of the first translation processing to the fourth translation processing, the entire compound word is regarded as one group. 2. The machine translation device according to claim 1, further comprising first unknown word processing means for processing an unknown word.
【請求項7】 前記複合語処理部において、構成要素数
が3語以上で、かつ構成要素のうち1つの品詞が等位接
続詞である場合は、複合語内部を並列句として翻訳処理
し、その結果から訳語を生成し、品詞候補と併せて複合
語全体に付与し、原文の翻訳処理に利用する第5の翻訳
処理手段を有することを特徴とする請求項1記載の機械
翻訳装置。
7. In the compound word processing unit, when the number of constituent elements is three or more and one of the constituent elements is a coordinate conjunction, translation processing is performed inside the compound word as a parallel phrase. The machine translation apparatus according to claim 1, further comprising a fifth translation processing unit that generates a translated word from the result, assigns it to the entire compound word together with the part-of-speech candidate, and uses it for translation processing of the original sentence.
【請求項8】 前記複合語処理部において、構成要素数
が3語以上で、かつ前記第5の翻訳処理にマッチしない
場合は、構成要素の左から2語を取り出して、前記第1
の翻訳処理から第4の翻訳処理の処理を行い、いずれか
にマッチした場合は、該部分と更に右側の1語との組合
せにより再度前記第1の翻訳処理から第4の翻訳処理の
処理を行い、これを最後の構成要素まで繰り返し、その
結果から訳語を生成し、品詞候補と併せて複合語全体に
付与し、原文の翻訳処理に利用する第6の翻訳処理手段
を有することを特徴とする請求項1記載の機械翻訳装
置。
8. In the compound word processing unit, when the number of constituent elements is 3 words or more and does not match the fifth translation processing, two words from the left of the constituent element are extracted and the first word is extracted.
When the processing of the first translation processing to the fourth translation processing is performed and any one of them is matched, the processing of the first translation processing to the fourth translation processing is performed again by a combination of the portion and one word on the right side. It has a sixth translation processing means for performing the processing, repeating the processing up to the last component, generating a translated word from the result, assigning it to the entire compound word together with the part-of-speech candidate, and utilizing it for the translation processing of the original sentence. The machine translation device according to claim 1.
【請求項9】 前記複合語処理部において、構成要素数
が3語以上で、かつ第5の翻訳処理又は第6の翻訳処理
のいずれにもマッチしない場合には、複合語全体を一つ
のまとまりとして未知語処理する第2の未知語処理手段
を有することを特徴とする機械翻訳装置。
9. In the compound word processing section, when the number of constituent elements is three or more and does not match any of the fifth translation processing and the sixth translation processing, the entire compound word is grouped into one group. And a second unknown word processing means for processing an unknown word as.
JP5106051A 1993-04-08 1993-04-08 Machine translation system Pending JPH06295311A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP5106051A JPH06295311A (en) 1993-04-08 1993-04-08 Machine translation system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP5106051A JPH06295311A (en) 1993-04-08 1993-04-08 Machine translation system

Publications (1)

Publication Number Publication Date
JPH06295311A true JPH06295311A (en) 1994-10-21

Family

ID=14423833

Family Applications (1)

Application Number Title Priority Date Filing Date
JP5106051A Pending JPH06295311A (en) 1993-04-08 1993-04-08 Machine translation system

Country Status (1)

Country Link
JP (1) JPH06295311A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08297672A (en) * 1995-04-25 1996-11-12 Nec Corp Relevant translated word book preparing method
KR19980066877A (en) * 1997-01-29 1998-10-15 김광호 Morphological interpretation based on types of unregistered words
US6385569B1 (en) * 1999-03-11 2002-05-07 Sharp Kabushiki Kaisha Translator, translating method and recording medium having translating program recorded thereon
KR100716158B1 (en) * 2005-12-09 2007-05-10 한국전자통신연구원 Method and apparatus for selecting target word of unregistered word including hyphen

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08297672A (en) * 1995-04-25 1996-11-12 Nec Corp Relevant translated word book preparing method
KR19980066877A (en) * 1997-01-29 1998-10-15 김광호 Morphological interpretation based on types of unregistered words
US6385569B1 (en) * 1999-03-11 2002-05-07 Sharp Kabushiki Kaisha Translator, translating method and recording medium having translating program recorded thereon
KR100716158B1 (en) * 2005-12-09 2007-05-10 한국전자통신연구원 Method and apparatus for selecting target word of unregistered word including hyphen

Similar Documents

Publication Publication Date Title
McKeown Paraphrasing using given and new information in a question-answer system
KR100530154B1 (en) Method and Apparatus for developing a transfer dictionary used in transfer-based machine translation system
WO2003065245A1 (en) Translating method, translated sentence outputting method, recording medium, program, and computer device
JPS62163173A (en) Mechanical translating device
US20010029443A1 (en) Machine translation system, machine translation method, and storage medium storing program for executing machine translation method
Kempe et al. Parallel replacement in finite state calculus
JPH06295311A (en) Machine translation system
JP4007413B2 (en) Natural language processing system, natural language processing method, and computer program
JP2005173933A (en) Device, method and program for analyzing chinese
JP3919732B2 (en) Machine translation apparatus and machine translation program
Sheremetyeva et al. Meaning-Text Theory for textual input analysis and proofing in a generation system
JP3113257B2 (en) Machine translation equipment
KR20010057763A (en) Device and method for generating translated sentences based on partial translation patterns
Nouar et al. Deep Semantic Parsing Method to Capture the Structure of Multi-Relation Questions over Linked Data.
JP4033088B2 (en) Natural language processing system, natural language processing method, and computer program
JP2005092615A (en) Natural language processing system, natural language processing method, and computer program
JP2859661B2 (en) Machine translation equipment
JPS62272357A (en) System for displaying error of input sentence in mechanical translating system
JP2742059B2 (en) Dictionary editor for translation
JPH1031676A (en) Retrieval device for bilingual example sentence
JPH05290084A (en) Unregstered word translation system
JPH04213164A (en) Dictionary consulting system
JPH09319746A (en) Document analysis method and device
JP2003114890A (en) Translation device, translation method, translation server, and program
JPH09185623A (en) Language processing device and method