JPH05334339A

JPH05334339A - Machine translating system

Info

Publication number: JPH05334339A
Application number: JP4142737A
Authority: JP
Inventors: Akiko Niimi; 晶子新美
Original assignee: Brother Industries Ltd
Current assignee: Brother Industries Ltd
Priority date: 1992-06-03
Filing date: 1992-06-03
Publication date: 1993-12-17
Anticipated expiration: 2014-03-17
Also published as: JP2871300B2

Abstract

PURPOSE:To reduce the burden of postediting on a user by providing an electronic dictionary with a specific conjugation/inflection field. CONSTITUTION:The electronic dictionary 15 is provided with the conjugation/ inflection field wherein information indicating whether or not respective words as the constituent elements of a compound word are conjugated or inflected when an index word is the compound word can optionally be set and stored. Namely, when the input index word is the compound word, the user optionally sets and stores the information showing whether or not the respective words as the constituent elements are conjugated or inflected in the conjugation/ inflection field of the electronic dictionary 15. Then, the information is referred to during translation execution, all the words to be conjugated or inflected among the words constituting the index word are correctly conjugated or inflected. Therefore, the user can easily input the information for conjugation or inflection even unless the user have technical knowledge and the burden of postediting on the user is reducible.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、機械翻訳装置に係り、
特にユーザが機械翻訳装置用の電子辞書を構築する際の
辞書編集手段に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a machine translation device,
In particular, the present invention relates to dictionary editing means when a user constructs an electronic dictionary for a machine translation device.

【０００２】[0002]

【従来の技術】従来、機械翻訳などのシステムで用いる
電子辞書は予めシステム内に構築してある基本辞書だけ
でなく、ユーザが翻訳業務の中で随時必要な語を登録す
るユーザ辞書の二本立てで構成されている。機械で翻訳
を行うために状況に即したユーザ辞書が不可欠である。
しかし、ユーザ辞書は基本辞書とは異なり、機械翻訳に
対して専門知識の無いユーザが作成するので、ユーザの
手間を必要最低限にするためにもソース語の見出し語と
対訳、または既存の学習用辞書を見ればわかる程度の必
要最低限の情報だけをユーザに入力させ、後はデフォル
ト的に値を補うといった簡易的なユーザ辞書を構築させ
ることが多かった。従って、例えば日本語から英語に翻
訳する際のユーザ辞書を構築する際に、デフォルト的な
値としてそれぞれの言語的特徴を利用している。ユーザ
が用いる訳語が複数の単語から成り立っている複合語に
なった場合の活用は、通常英語の特性からその品詞に依
存してある程度決めることができると言える。例えば、
英語の名詞ならば「パソコン」が "personal computer
(s)" のように最後の単語、動詞ならば「共用する」が"
use(d) in common"のように先頭の単語、という具合い
にその品詞の特徴から活用する単語を決定し、必要なら
ば規則変化、不規則変化などの活用の種類をユーザに入
力させるという手法がとられていた。2. Description of the Related Art Conventionally, electronic dictionaries used in systems such as machine translation are not only basic dictionaries built in advance in the system, but also two user dictionaries for registering words that the user needs at any time during translation work. It is configured. A user dictionary suitable for the situation is indispensable for the machine translation.
However, unlike the basic dictionary, the user dictionary is created by a user who does not have specialized knowledge about machine translation. Therefore, in order to minimize the user's time and effort, the headword and parallel translation of the source word, or the existing learning It was often the case that the user inputs only the minimum necessary information that can be understood by looking at the user dictionary and then constructs a simple user dictionary in which the values are supplemented by default. Therefore, for example, when constructing a user dictionary for translating from Japanese to English, each linguistic feature is used as a default value. It can be said that, when the translated word used by the user is a compound word composed of a plurality of words, it can be determined to some extent depending on the part of speech from the characteristics of normal English. For example,
If it is an English noun, "PC" is "personal computer"
(s) "is the last word, and if it is a verb," share "is"
A method in which the word to be used is determined from the characteristics of the part of speech, such as the first word such as "use (d) in common", and the user inputs the type of usage such as rule change or irregular change if necessary. Was taken.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、実際に
は、システム内で勝手に値を決めてしまうとうまくいか
ないことがある。例えばソース語が日本語で翻訳語が英
語である日英機械翻訳システムにおいて、日本語の「ボ
ケ切れする」に当たる英語を"fray and break"としたと
すると、当然その翻訳語の見出し語の活用は過去形が"f
rayed and broke"、過去分詞形が"frayed and broken"
になるはずなのに、動詞は先頭の単語を活用させるとい
う手法があるために"fray"しか活用しない。そのため、
過去形が"frayed and break"、過去分詞形が"frayed an
d break"というように不自然な動詞の活用ができてしま
う。ユーザはこれらに対して、翻訳が終了し英文が生成
された後で毎回人手で修正しなくてはならないという問
題があった。However, in practice, it may not work if the value is arbitrarily decided in the system. For example, in a Japanese-English machine translation system in which the source word is Japanese and the translated word is English, if the English word "bokeh break" in Japanese is "fray and break", then naturally the headword of that translated word is used. The past tense is "f
rayed and broke ", past participle is" frayed and broken "
However, the verb uses only "fray" because there is a method of using the first word. for that reason,
The past tense is "frayed and break", the past participle is "frayed an"
Unnatural verbs such as "d break" can be used. There is a problem that the user has to manually correct these after the translation is completed and the English sentence is generated.

【０００４】本発明は、上述した問題点を解決するため
になされたものであり、ユーザが入力した見出し語が複
合語である場合はその見出し語を構成する単語のうち活
用するすべての単語を正しく活用させるための情報をユ
ーザが指定できるようにし、後編集などのユーザへの付
加を軽減する機械翻訳装置を提供することを目的とす
る。The present invention has been made in order to solve the above-mentioned problems, and when the entry word input by the user is a compound word, all the words to be utilized among the words constituting the entry word are used. It is an object of the present invention to provide a machine translation device that allows a user to specify information for correct utilization and reduces the addition to the user such as post-editing.

【０００５】[0005]

【課題を解決するための手段】この目的を達成するため
に本発明の機械翻訳装置は、見出し語と、その見出し語
が複数の単語から成り立っている複合語である場合に、
その構成要素である単語と、その各単語の品詞とを格納
するフィールドを有する電子辞書を備え、前記電子辞書
には、前記見出し語が複合語である場合に、その構成要
素である各単語が活用するか否かの情報を任意に設定・
格納できる活用フィールドを設けている。In order to achieve this object, a machine translation device of the present invention, when a headword and a headword are compound words composed of a plurality of words,
The electronic dictionary is provided with a word that is a component of the word and a field that stores a part of speech of each word, and in the electronic dictionary, when the entry word is a compound word, each word of the component is Arbitrarily set information on whether to utilize or not
There is a utilization field that can be stored.

【０００６】[0006]

【作用】上記の構成を有する本発明の機械翻訳装置に於
いては、電子辞書の活用フィールドに、入力された見出
し語が複合語である場合、その構成要素である各単語が
活用するか否かの情報がユーザによって任意に設定・格
納される。翻訳実行時にこれらの情報を参照し、これに
よって前記見出し語を構成する単語のうち活用する全て
の単語を正しく活用させる。In the machine translation apparatus of the present invention having the above configuration, when the entry word entered in the utilization field of the electronic dictionary is a compound word, whether or not each of the constituent words is utilized This information is arbitrarily set / stored by the user. This information is referred to when the translation is executed, so that all the words to be used among the words forming the headword are correctly used.

【０００７】[0007]

【実施例】以下、本発明を具体化した一実施例を図面を
参照して説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below with reference to the drawings.

【０００８】図１は、本発明に関する機械翻訳装置のハ
ードウェア構成を概略的に示すブロック図である。同図
に示されるように、この日英機械翻訳装置１０は、計算
機で構成されたデータ処理部１１と、ＣＲＴ等で構成さ
れる表示部１２と、キーボードなどで構成された入力部
１３と、走査型プリンタやレーザープリンタなどで構成
される印刷部１４と、ハードディスクや光ディスクなど
に格納された電子辞書１５から成る。FIG. 1 is a block diagram schematically showing the hardware configuration of a machine translation device according to the present invention. As shown in FIG. 1, the Japanese-English machine translation device 10 includes a data processing unit 11 including a computer, a display unit 12 including a CRT, an input unit 13 including a keyboard, and the like. The printing unit 14 includes a scanning printer or a laser printer, and an electronic dictionary 15 stored in a hard disk, an optical disk, or the like.

【０００９】前記電子辞書１５は、システムが内部に持
っている基本辞書とユーザが必要なものを登録できるユ
ーザ辞書を包含しており、ソース語としての日本語を解
析するために必要な日本語形態素解析辞書部３１と、日
本語構文解析辞書部３２と、日本語から翻訳語である英
語へ変換するために必要な変換辞書部３３と、翻訳語で
ある英語を生成するために必要な英語構文生成辞書部３
４と、英語形態素生成辞書部３５とから成る。The electronic dictionary 15 includes a basic dictionary that the system internally has and a user dictionary that allows the user to register necessary items, and is a Japanese word necessary for analyzing Japanese as a source word. Morphological analysis dictionary unit 31, Japanese syntax analysis dictionary unit 32, conversion dictionary unit 33 necessary for converting from Japanese to English which is a translation word, and English necessary for generating English which is a translation word Syntax generation dictionary unit 3
4 and an English morpheme generation dictionary unit 35.

【００１０】前記データ処理部１１内には、計算機の機
能を利用してソフトウェア的に実現された翻訳処理を行
う翻訳部１６と、前記入力部１２からの入力、翻訳の前
後の編集や辞書登録を行い、入力文や出力文をＣＲＴ１
３に送る編集部１７と、入出力されるデータを一時的に
格納するデータ格納部１８と、ユーザが前記編集部１７
の辞書登録メニューに従って前記入力部１３より入力し
た辞書情報を加工し、前記電子辞書１５に登録したり削
除する複合語編集部１９から成る。In the data processing unit 11, a translation unit 16 for performing translation processing realized by software using the function of a computer, an input from the input unit 12, editing before and after translation, and dictionary registration. And input and output sentences on CRT1
3, an editing unit 17 to be sent to the data processing unit 3, a data storage unit 18 for temporarily storing input / output data, and a user
The compound word editing unit 19 processes the dictionary information input from the input unit 13 according to the dictionary registration menu, and registers or deletes the dictionary information in the electronic dictionary 15.

【００１１】前記複合語編集部１９は、英語複合語中の
スペースを検出するスペース検出部４１と、複合語を構
成している各単語に分割して編集を行う単語分割処理部
４２と、ユーザが入力した前記各単語が活用するか否か
の情報を取り込みどの単語が活用するかという情報に読
み替えて一時的に格納する格納部４３と、前記フィール
ドに活用するという情報が格納されている前記各単語毎
に前記電子辞書１５中の前記英語形態素生成辞書部３５
を検索し、前記英語形態素生成辞書部３５内に活用情報
があるかどうかを確認する辞書検索部４４とから成る。The compound word editing section 19 includes a space detecting section 41 for detecting spaces in an English compound word, a word dividing processing section 42 for dividing each word constituting the compound word for editing. The storage unit 43 that temporarily stores the information that is input by each of the words, that is, whether the word is used or not, is temporarily stored after being replaced with the information indicating which word is used, and the information that is used in the field is stored. For each word, the English morpheme generation dictionary unit 35 in the electronic dictionary 15
And a dictionary search unit 44 for checking whether or not there is utilization information in the English morpheme generation dictionary unit 35.

【００１２】前記英語形態素生成辞書部３５内には、図
２に示すように、見出し語フィールド６１と、その見出
し語の品詞６２と、見出し語が複合語であれば構成単語
フィールド６３と、その構成単語の品詞フィールド６４
と、その構成単語が活用することを示す活用フィールド
６５からなる複合語情報フィールド６０が設けてある。
また、見出し語が一語で構成されている場合に必要であ
る規則活用か不規則活用かを示す値が入る活用形情報フ
ィールド６６がある。不規則活用を意味する値が不規則
活用である場合のみ過去形の情報を記述するための過去
形フィールド６７と過去分詞形の情報を記述する過去分
詞形フィールド６８とに情報が記述される。また、現在
分詞形を作る場合に必要になる語尾の子音を重ねるか規
則変化かを示す値が入る現在分詞形フィールド６９があ
る。In the English morpheme generation dictionary unit 35, as shown in FIG. 2, a headword field 61, a part-of-speech 62 of the headword, and a constituent word field 63 if the headword is a compound word. Part-of-speech field 64 of constituent words
And a compound word information field 60 including a utilization field 65 indicating that the constituent word is utilized.
Further, there is an inflectional type information field 66 in which a value indicating whether the inflection word is used regularly or irregularly is entered when the headword is composed of one word. Information is described in the past participle field 67 for describing the past participle information and the past participle form field 68 for describing the past participle form information only when the value indicating the irregular part utilization is the irregular use. Further, there is a present participle field 69 in which a value indicating whether a consonant at the end of a sentence is overlaid or a rule change, which is necessary when forming the present participle, is entered.

【００１３】次に、この機械翻訳システムの基本的な動
作について概略的に説明する。Next, the basic operation of this machine translation system will be schematically described.

【００１４】前記入力部１３から入力されたソース語の
入力文は、前記編集部１７で必要に応じて適切な形に編
集された後、前記翻訳部１６に送られる。前記翻訳部１
６は前記電子辞書１５を用いながら入力文を翻訳語に翻
訳し、その結果である出力文を前記データ格納部１８に
一時的に格納する。ユーザが前記入力部１３から前記編
集部１７を起動して必要な出力文を指定すると、前記編
集部１７は前記データ格納部１８から出力文を取り出し
て前記表示部１２に表示したり、あるいは前記印刷部１
４から出力させる。もし、出力文中に辞書情報に誤りが
ある語や、未登録語が見つかれば、次回の翻訳精度を上
げるために、ユーザは随時前記編集部１７を起動してユ
ーザ辞書を作成して前記電子辞書１５に登録したり、す
でにある辞書内容を修正したりする。The input sentence of the source word input from the input unit 13 is edited by the editing unit 17 into an appropriate form as needed, and then sent to the translation unit 16. The translation unit 1
Reference numeral 6 translates an input sentence into a translated word using the electronic dictionary 15, and temporarily stores the resulting output sentence in the data storage unit 18. When the user activates the editing unit 17 from the input unit 13 and specifies a necessary output sentence, the editing unit 17 retrieves the output sentence from the data storage unit 18 and displays it on the display unit 12, or Printing department 1
Output from 4. If a word with incorrect dictionary information or an unregistered word is found in the output sentence, the user activates the editing unit 17 at any time to create a user dictionary to improve the translation accuracy in the next time. Register in 15 or modify existing dictionary contents.

【００１５】日英機械翻訳システムで用いる電子辞書で
は、ソース語が一まとまりの単語であれば翻訳語も一ま
とまりとして扱うことが多い。そこでどうしても訳語も
複合語の形態をとることが多くなる。このような見出し
語の処理の流れを、未登録語「ボケ切れする」を例にと
って電子辞書内に登録される過程を図３に示す。In the electronic dictionary used in the Japanese-English machine translation system, if the source word is a group of words, the translated words are often handled as a group of words. Therefore, translated words often take the form of compound words. FIG. 3 shows a flow of processing of such an entry word, in which an unregistered word “out of focus” is registered in the electronic dictionary as an example.

【００１６】まず、動詞「ボケ切れする」を前記電子辞
書１５内に登録するために、図４に示すように、ユーザ
は前記編集部１７の辞書登録メニューを前記入力部１３
より起動し、登録したい見出し語である「ボケ切れす
る」を、次にその訳語"fray and break"を前記入力部１
３より入力する（Ｓ１）。英語は、その言語的特徴か
ら、複合語の多くは主となる語、つまり活用する語の位
置を品詞によって決定することができる。動詞では、例
えば「共用する」の"use(d) in common"のように、先頭
の"use"が活用する語、すなわち主動詞である。ところ
が、「ボケ切れする」の訳語"fray and break"の場合、
上記のルールには当てはまらずに活用は過去形が"fraye
d and broke"、過去分詞形が"frayed and broken" にな
り、主動詞を二つ持っている。First, in order to register the verb "out of focus" in the electronic dictionary 15, the user inputs the dictionary registration menu of the editing unit 17 to the input unit 13 as shown in FIG.
The input word "fray and break", which is the headword to be registered and which is to be registered, and then its translation "fray and break"
Input from 3 (S1). Due to the linguistic characteristics of English, in many compound words the position of the main word, that is, the word to be utilized, can be determined by the part of speech. In the verb, for example, "use (d) in common" of "shared" is a word used by the leading "use", that is, a main verb. However, in the case of the translated word "fray and break" for "out of focus",
The past tense is "fraye"
"d and broke", the past participle is "frayed and broken", and it has two main verbs.

【００１７】前記複合語編集部１９は前記編集部１７か
ら、ユーザに前記入力部１２より入力された訳語のデー
タを受け取る。その内部のスペース検出部４１は訳語内
にスペースが含まれているかどうかを判断する（Ｓ
２）。スペースが含まれていれば複合語であると判断
し、前記単語分割処理部４２で複合語処理に入り、スペ
ース毎に各単語に分割する（Ｓ３）。The compound word editing unit 19 receives, from the editing unit 17, the translated word data input by the user from the input unit 12. The space detection unit 41 therein determines whether or not a space is included in the translated word (S
2). If a space is included, it is determined that the word is a compound word, and the word division processing unit 42 starts compound word processing to divide the word into spaces (S3).

【００１８】前記複合語編集部１９は複合語処理のため
のメニューに入るように前記編集部１７に伝え、前記編
集部１７は訳語中の活用する語、すなわち主動詞が何番
目の単語かという質問を前記表示部１２に表示させ（Ｓ
４）、図５に示すように、ユーザは前記入力部１３より
「１」と「３」を入力する。この番号は前記複合語編集
部１８内の前記格納部４３に送られ、ここで一番目の"f
ray"と三番目の"break"がそれぞれ動詞であり、活用す
る単語であるという情報に読み変えられる。そして前記
格納部４３は前記単語分割処理部４２で"fray"、"an
d"、"break"に分けられている各単語に、活用する単語
は必ず元の見出し語と同一品詞であるということから、
Ｓ５から"fray"と"break"を動詞であるとし、各単語の
見出し語と、その品詞と、活用するという情報を格納す
る（Ｓ５）。The compound word editing unit 19 informs the editing unit 17 to enter a menu for compound word processing, and the editing unit 17 determines which word in the translated word is used, that is, what number word the main verb is. Display the question on the display unit 12 (S
4) As shown in FIG. 5, the user inputs “1” and “3” from the input unit 13. This number is sent to the storage unit 43 in the compound word editing unit 18, where the first "f
The information "ray" and the third "break" are verbs and are words to be utilized, and the storage unit 43 stores "fray" and "an" in the word division processing unit 42.
For each word divided into "d" and "break", the word used is always the same part of speech as the original headword, so
From S5, "fray" and "break" are assumed to be verbs, and the entry word of each word, its part of speech, and information to be used are stored (S5).

【００１９】前記複合語編集部１９内の辞書検索部４３
は前記電子辞書１５内の前記英語形態素生成辞書部３５
を検索し、"fray"と"break"の個々の辞書情報があるか
どうか調べる（Ｓ６）。個々の辞書情報が前記英語形態
素生成辞書部３５に存在する場合、構文情報他の辞書情
報を入力する（Ｓ８）。個々の辞書情報が前記英語形態
素生成辞書部部３５に存在しない場合、前記編集部１７
に活用形の情報や不規則変化の場合の分詞形など個々の
辞書の形態素情報を付与させるメニューを前記表示部１
２に表示させる（Ｓ７）。The dictionary search unit 43 in the compound word editing unit 19
Is the English morpheme generation dictionary unit 35 in the electronic dictionary 15.
Is searched to see if there is individual dictionary information for "fray" and "break" (S6). When the individual dictionary information exists in the English morpheme generation dictionary unit 35, the dictionary information such as the syntax information is input (S8). When the individual dictionary information does not exist in the English morpheme generation dictionary unit 35, the editing unit 17
The display unit 1 has a menu for adding morphological information of individual dictionaries such as inflectional information and participles in case of irregular change.
2 is displayed (S7).

【００２０】もし、訳語内にスペースが含まれていなけ
れば複合語処理には入らずに、直接形態素情報を付与さ
せるためのメニューに入る（Ｓ７）。If the translated word does not include a space, the compound word processing is not entered and a menu for directly adding morpheme information is entered (S7).

【００２１】必要な形態素情報が入力されたら、前記編
集部で他の辞書情報を入力させる（Ｓ８）。情報が揃っ
たところで、前記編集部１７は、ユーザにより前記入力
部１３から入力された前記データ格納部１８内の辞書情
報と、前記複合語編集部１９内の前記格納部４３に格納
してある複合語情報をまとめて前記電子辞書１５に登録
する（Ｓ９）。When the necessary morpheme information is input, other dictionary information is input by the editing section (S8). When the information is collected, the editing unit 17 stores the dictionary information in the data storage unit 18 input by the user from the input unit 13 and the storage unit 43 in the compound word editing unit 19. The compound word information is collectively registered in the electronic dictionary 15 (S9).

【００２２】以下、前記電子辞書１５内の前記英語形態
素生成辞書部３５に実際に情報が格納される順序を図６
を用いて説明する。「ボケ切れする」の訳語"fray and
break"の場合、前記Ｓ５で前記格納部４３に格納された
情報から、前記英語形態素生成辞書部３５内の各フィー
ルドにそれぞれ以下のように情報が格納される。まず"f
ray and break"が前記見出し語フィールド６１に記述さ
れ（Ｓ１１）、その品詞である"Ｖ（動詞）"が前記品詞
フィールド６２に記述される（Ｓ１２）。前記複合語情
報フィールド６０内の前記構成単語フィールド６３には
前記単語分割処理部４２で分割され前記格納部４３に格
納されている個々の見出し語である"fray"と"and"と"br
eak"がそれぞれ記述される（Ｓ１３）。前記品詞フィー
ルド６４には前記格納部４３に格納されている"fray"
と"break"が動詞であるという情報から"fray"と"break"
に品詞情報を記述し、活用しない"and"については品詞
の情報は不要なので何も記述しない（Ｓ１４）。前記活
用フィールド６５には"fray"と"break"について活用す
ることを示すフラグ"T"がたてられる（Ｓ１５）。The order in which information is actually stored in the English morpheme generation dictionary unit 35 in the electronic dictionary 15 is shown in FIG.
Will be explained. Translation of "out of focus""fray and
In the case of "break", the information is stored in the fields in the English morpheme generation dictionary unit 35 from the information stored in the storage unit 43 in S5 as follows.
“Ray and break” is described in the entry word field 61 (S11), and its part of speech “V (verb)” is described in the part of speech field 62 (S12). In the word field 63, the individual headwords “fray”, “and”, and “br” that are divided by the word division processing unit 42 and are stored in the storage unit 43.
"eak" is described (S13). In the part-of-speech field 64, "fray" stored in the storage unit 43 is described.
"Fray" and "break" from the fact that and "break" are verbs
The part-of-speech information is described in, and the part-of-speech information is unnecessary for "and" that is not used, so nothing is described (S14). In the utilization field 65, a flag "T" indicating that "fray" and "break" are utilized is set (S15).

【００２３】個々の単語の情報が前記英語形態素生成辞
書に既に登録されていなければ、前記Ｓ７で示されてい
るようにメニューで前記各フィールドに情報を記述して
いくことになる。その場合の格納手順は以下の通りであ
る。前出の"fray"の場合、複合語ではないので前記複合
語情報フィールド６０の中は何も記述されない。代わり
に前記活用形情報フィールド６６に規則変化を示す値が
記述される（Ｓ１６）。ここで不規則変化を示す値が記
述されると、前記過去形フィールド６７と前記過去分詞
形フィールド６８に必要な値が記述される（Ｓ１
７）。"break"の場合、前記活用形情報フィールド６６
に不規則変化を示す値と、前記過去形フィールド６７
に"broke"、前記過去分詞形フィールド６８に"broken"
が記述される。その後前記現在分詞形フィールド６９に
規則変化を示す値が記述される（Ｓ１８）。If the information of each word is not already registered in the English morpheme generation dictionary, the information is described in each of the fields by the menu as shown in S7. The storage procedure in that case is as follows. In the case of the above-mentioned "fray", since it is not a compound word, nothing is described in the compound word information field 60. Instead, a value indicating a rule change is described in the inflection type information field 66 (S16). When a value indicating irregular change is described here, necessary values are described in the past tense field 67 and the past participle field 68 (S1).
7). In the case of "break", the inflected information field 66
And a value indicating an irregular change in the past tense field 67.
"Broke" and "broken" in the past participle field 68
Is described. After that, a value indicating a rule change is described in the present participle field 69 (S18).

【００２４】以上の例はユーザが登録したい語が電子辞
書内に未登録であった場合を述べた。しかしながら、ユ
ーザがある見出し語の対訳語を変更したい場合であって
も、英訳語が複合語であれば新しい訳語を入力した時点
で同様のスペース検出が行われ、同様の効果が期待でき
る。また、今回の例のような見出し語は小数であるか
ら、ユーザインターフェースとしては品詞によって一応
活用語の候補となる単語を表示してユーザに確認を求
め、必要なものだけ上記のような処理に入るように応用
することも可能である。見出し語の中に"and"や"or"と
いった接続詞が入っていると、この例のように活用する
語が複数含まれている可能性が高いから、該当するもの
のみ今回の処理に優先的にかけてもよい。本発明は機械
翻訳装置で用いる複数言語間の辞書のみならず、言語を
扱う他の自然言語処理用の電子辞書やシソーラス、知識
データベース等の構築にも応用することができる。The above example describes the case where the word that the user wants to register is not registered in the electronic dictionary. However, even if the user wants to change the parallel translation of a certain headword, if the English translation is a compound word, the same space detection is performed at the time when a new translation is input, and the same effect can be expected. In addition, since the headwords like this example are decimals, the user interface displays words that are candidates for inflection words by part-of-speech and asks the user for confirmation, and only the necessary items are processed as described above. It is also possible to apply to enter. If the headword contains a conjunction such as "and" or "or", there is a high possibility that multiple words will be used, as in this example. Therefore, only the relevant word has priority for this process. You may call it. INDUSTRIAL APPLICABILITY The present invention can be applied not only to a dictionary between a plurality of languages used in a machine translation device but also to the construction of other natural language processing electronic dictionaries, thesauri, knowledge databases and the like.

【００２５】[0025]

【発明の効果】以上説明したことから明かなように、本
発明の複合語編集部を有する機械翻訳装置によれば、翻
訳などで用いる電子辞書に、見出し語が活用する場合そ
の構成要素であるすべての単語が正しく活用させるため
の情報を、専門知識の無いユーザが簡単に入力すること
ができ、後編集などのユーザへの付加を軽減することが
できる。As is apparent from the above description, according to the machine translation device having the compound word editing unit of the present invention, when the entry word is utilized in the electronic dictionary used for translation or the like, it is a constituent element. A user who does not have specialized knowledge can easily input information for correctly utilizing all words, and it is possible to reduce post-editing and the like to be added to the user.

[Brief description of drawings]

【図１】本発明の構成を示すブロック図である。FIG. 1 is a block diagram showing a configuration of the present invention.

【図２】本発明に関わる一実施例の電子辞書に格納され
ている辞書情報である。FIG. 2 is dictionary information stored in an electronic dictionary according to an embodiment of the present invention.

【図３】本発明に関わる一実施例の翻訳処理の流れを示
すフローチャートである。FIG. 3 is a flowchart showing a flow of a translation process according to an embodiment of the present invention.

【図４】本発明に関わる一実施例の日本語の見出し語、
英訳を入力するメニュー画面の図である。FIG. 4 is a Japanese headword according to an embodiment of the present invention;
It is a figure of the menu screen which inputs an English translation.

【図５】本発明に関わる一実施例の主動詞の順番を入力
する画面の図である。FIG. 5 is a diagram of a screen for inputting the order of main verbs according to an embodiment of the present invention.

【図６】本発明に関わる英語形態素生成辞書に情報を格
納する一実施例の流れを示すフローチャートである。FIG. 6 is a flowchart showing a flow of an embodiment for storing information in an English morpheme generation dictionary according to the present invention.

[Explanation of symbols]

１０日英機械翻訳装置１５電子辞書６１見出し語フィールド６３構成単語フィールド６４品詞フィールド６５活用フィールド 10 Japanese-English machine translation device 15 Electronic dictionary 61 Headword field 63 Constituent word field 64 Part-of-speech field 65 Utilization field

Claims

[Claims]

1. An electronic dictionary having a field for storing a headword and, when the headword is a compound word composed of a plurality of words, a word which is a constituent of the headword and a part of speech of each word. In the machine translation device provided with, in the electronic dictionary, when the headword is a compound word, a utilization field capable of arbitrarily setting and storing information on whether or not each word that is a constituent element thereof is utilized. A machine translation device characterized by being provided.