JPH09204429A

JPH09204429A - Machine translation equipment

Info

Publication number: JPH09204429A
Application number: JP8010106A
Authority: JP
Inventors: Yoko Oike; 陽子大池
Original assignee: Brother Industries Ltd
Current assignee: Brother Industries Ltd
Priority date: 1996-01-24
Filing date: 1996-01-24
Publication date: 1997-08-05

Abstract

(57)【要約】【課題】異音同表記の日本語について、かな漢字変換
時に入力された読みを参照することにより、精度の良い
翻訳を実現する。【解決手段】かな漢字変換結果バッファ領域３２に記
憶された日本語文の文字列を切り出し（Ｓ６１）、翻訳
形態素解析辞書４９内に、切り出した文字列と同じ表記
を持つ単語が複数あり（Ｓ６５・ＹＥＳ）、読みが異な
る場合（Ｓ６６・ＹＥＳ）、かな漢字変換の際に読み入
力バッファ領域３１に記憶した読みを参照して（Ｓ６
７）、辞書レコードを選択し（Ｓ６８）、翻訳する。 (57) [Abstract] [Problem] To realize accurate translation by referring to the pronunciation input at the time of Kana-Kanji conversion for Japanese with allophones. SOLUTION: A character string of a Japanese sentence stored in a kana-kanji conversion result buffer area 32 is cut out (S61), and a plurality of words having the same notation as the cut out character string are present in the translation morphological analysis dictionary 49 (S65: YES). ), If the reading is different (YES in S66), the reading stored in the reading input buffer area 31 at the time of kana-kanji conversion is referred to (S6).
7) Select a dictionary record (S68) and translate it.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、翻訳処理の効率化
を図り、自然な訳文を得ることのできる機械翻訳装置に
関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a machine translation apparatus which can improve efficiency of translation processing and obtain natural translated text.

【０００２】[0002]

【従来の技術】従来、コンピューターを利用して入力原
文（日本語）を自動的に機械翻訳し、その訳文（英語
等）を得る機械翻訳装置において、精度の良い訳文を得
るための開発が種々試みられている。2. Description of the Related Art Conventionally, various developments have been made to obtain an accurate translated text in a machine translation device that automatically machine-translates an input original text (Japanese) using a computer and obtains the translated text (English etc.). Being tried.

【０００３】この種の機械翻訳装置は、基本的には、
［かな］や［漢字］で構成される入力原文（日本語）を
形態素解析して、例えば単語等の所定の言語処理単位に
区分し、翻訳処理に必要な知識情報を格納した翻訳辞書
を参照して、各言語処理単位毎にその訳語（英語等）を
求め、これらの訳語を所定の訳文規則に従って結合し
て、原文に対する訳文を生成するように構成されてい
る。This type of machine translation device is basically
Morphological analysis of the input original sentence (Japanese) consisting of [Kana] and [Kanji] is divided into predetermined language processing units such as words, and the translation dictionary that stores knowledge information necessary for translation processing is referenced. Then, the translated words (English, etc.) are obtained for each language processing unit, and these translated words are combined according to a predetermined translation rule to generate a translated text for the original text.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、漢字で
表記される語には、読み方が２通り以上存在するものが
ある。例えば「陽子」には「ようこ」と「ようし」の２
通りの読みが存在し、それぞれ別の意味を持つ。「十
分」（「じゅっぷん」と「じゅうぶん」）や、「半月」
（「はんげつ」と「はんつき」）等も同様である。上記
のような機械翻訳装置では、入力された語が、どちらの
読みを持つ語かを判定できなかったため、間違って翻訳
されることがあった。However, some words written in kanji have two or more ways of reading. For example, "Yoko" and "Yoshi" for "Yoko"
There are street readings, each with a different meaning. "Enough"("Jupun" and "Jubun") and "Half-moon"
The same applies to (“Hangetsu” and “Hantsuki”). In the machine translation device as described above, it was not possible to determine which reading the input word had, so it was sometimes translated incorrectly.

【０００５】本発明は、上記の問題点を解決するために
なされたものであり、漢字を入力した際の読みを記憶し
ておき、漢字の読み方が複数あった場合に、その読みを
用いて、使用者が意図した読みに対応する語に翻訳する
ことにより、精度の高い翻訳を実現できる機械翻訳装置
を提供することを目的とする。The present invention has been made in order to solve the above-mentioned problems, and it stores a reading when a Chinese character is input, and uses the reading when there are a plurality of readings of the Chinese character. An object of the present invention is to provide a machine translation device capable of realizing highly accurate translation by translating into a word corresponding to the reading intended by the user.

【０００６】[0006]

【課題を解決するための手段】この目的を達成するため
に、本発明の請求項１に記載の機械翻訳装置は、日本語
の読み文字列を入力するための入力手段と、単語の読み
とその読みに対する表記を記憶したかな漢字変換辞書
と、前記入力手段により入力された読み文字列を前記か
な漢字変換辞書を用いて表記文字列に変換するかな漢字
変換手段と、そのかな漢字変換手段により変換された表
記文字列を記憶するかな漢字変換結果記憶手段と、日本
語を第二の言語に翻訳するために必要な、日本語の表記
やその表記の読みやその表記に対する第二の言語の訳語
等の情報を記憶した翻訳辞書と、前記かな漢字変換結果
記憶手段に記憶されている表記文字列を、前記翻訳辞書
に記憶された情報を参照しながら形態素解析する形態素
解析手段と、前記形態素解析手段により形態素解析され
た表記文字列を、前記翻訳辞書を用いて第二の言語に翻
訳する翻訳手段とを備え、さらに、前記かな漢字変換結
果記憶手段は、変換した表記文字列を、単語毎にその読
みと対応づけて記憶し、前記形態素解析手段は、前記翻
訳辞書に前記表示文字列と同じ表記の語が複数存在する
場合に、前記かな漢字変換結果記憶手段に記憶されたそ
の表記文字列の読みを参照して、形態素解析を行なうよ
うに構成している。In order to achieve this object, a machine translation device according to claim 1 of the present invention comprises an input means for inputting a Japanese reading character string, and a reading of a word. A kana-kanji conversion dictionary that stores the notation for the reading, a kana-kanji conversion device that converts the reading character string input by the input device into a notation character string using the kana-kanji conversion dictionary, and a notation converted by the kana-kanji conversion device. Kana-Kanji conversion result storage means for storing character strings and information necessary for translating Japanese into a second language, such as Japanese notation, reading of the notation, translation of the notation in the second language, etc. A morpheme analysis means for morphologically analyzing the stored translation dictionary and the written character string stored in the kana-kanji conversion result storage means with reference to the information stored in the translation dictionary; The notation character string morphologically analyzed by the elementary analysis means is provided with a translation means for translating it into a second language by using the translation dictionary, and the kana-kanji conversion result storage means further converts the notation character string into a word. Each of them is stored in association with its reading, and the morpheme analysis means, when there are a plurality of words having the same notation as the display character string in the translation dictionary, the notation characters stored in the kana-kanji conversion result storage means. The morphological analysis is performed by referring to the reading of the column.

【０００７】従って、形態素解析の際に、対象となる日
本語の表記文字列と同じ表記の語が翻訳辞書内に複数存
在した場合には、入力された際のその表記の読みを利用
して、どの語であるかを決定し、正しい訳語を得ること
ができる。Therefore, when a plurality of words having the same notation as the target Japanese notation character string exist in the translation dictionary during morphological analysis, the reading of the notation when input is used. , You can determine which word it is and get the correct translation.

【０００８】[0008]

【発明の実施の形態】以下、本発明を具体化した実施の
形態を図面を参照して説明する。Embodiments of the present invention will be described below with reference to the drawings.

【０００９】本発明は、日本語と第二言語との間の機械
翻訳装置に関するものであるが、ここでは日本語と英語
との間の翻訳装置、つまり日英翻訳装置を例に挙げる。The present invention relates to a machine translation device between Japanese and a second language. Here, a translation device between Japanese and English, that is, a Japanese-English translation device is taken as an example.

【００１０】まず、図１を参照して本実施形態の日英翻
訳装置全体の構成を説明する。First, the overall configuration of the Japanese-English translation apparatus of this embodiment will be described with reference to FIG.

【００１１】読み文字列や指示を入力するための入力手
段としてのキーボード等の入力装置１０と、図２のよう
な画面を出力するＣＲＴ等の出力装置５０は、装置全体
を制御するための中央処理装置（ＣＰＵ）２０に接続さ
れている。An input device 10 such as a keyboard as an input means for inputting a read character string or an instruction and an output device 50 such as a CRT for outputting a screen as shown in FIG. 2 are provided at the center for controlling the entire device. It is connected to the processing unit (CPU) 20.

【００１２】記憶手段であるＲＡＭ３０は、ＣＰＵ２０
に接続され、入力されたかな読み文字列を記憶するため
の読み入力バッファ領域３１と、その読み文字列をかな
漢字変換した結果のかな漢字変換文字列をそれに付随す
る情報と共に記憶するためのかな漢字変換結果バッファ
領域３２と、そのかな漢字変換文字列を翻訳した結果の
文字列を記憶するための翻訳出力バッファ領域３３と、
ポインタ情報等を記録するワークエリア３４とを格納し
ている。ここで、読み入力バッファ領域３１とかな漢字
変換結果バッファ領域３２とが、本発明のかな漢字変換
結果記憶手段を構成する。The RAM 30 serving as a storage means includes a CPU 20.
Connected to, and a kana-kanji conversion result for storing the kana-kanji conversion character string of the kana-kanji conversion result obtained by kana-kanji conversion of the kana-kanji conversion string area for storing the input kana-kanji character string together with the accompanying information. A buffer area 32, a translation output buffer area 33 for storing a character string obtained by translating the kana-kanji converted character string,
A work area 34 for recording pointer information and the like is stored. Here, the reading input buffer area 31 and the kana-kanji conversion result buffer area 32 constitute the kana-kanji conversion result storage means of the present invention.

【００１３】読み入力バッファ領域３１には、例えば図
３の［読み文字列記憶］に示すように、入力装置１０に
よって入力された読みが、先頭からの文字番号と共に記
憶される。また、かな漢字変換結果バッファ領域３２に
は、やはり図３に示すように、かな漢字変換された文字
列が、単語ごとに切れ目を付され、その単語に対応する
読みの先頭の文字番号と共に記憶される。In the reading input buffer area 31, the reading input by the input device 10 is stored together with the character number from the beginning as shown in [Reading character string storage] of FIG. 3, for example. Further, in the kana-kanji conversion result buffer area 32, as shown in FIG. 3, the kana-kanji converted character string is marked for each word and is stored together with the reading leading character number corresponding to the word. .

【００１４】プログラムや辞書等を格納するＲＯＭ４０
は、ＣＰＵ２０と接続され、かな漢字変換関連のもの
と、翻訳関連のものに大別される。ROM 40 for storing programs, dictionaries, etc.
Are connected to the CPU 20 and are roughly divided into those related to kana-kanji conversion and those related to translation.

【００１５】かな漢字変換関連としては、本発明のかな
漢字変換手段であるかな漢字変換プログラム４１と、本
発明のかな漢字変換辞書であるかな漢字変換辞書４８と
がある。かな漢字変換辞書４８には、図４に示すよう
に、それぞれの単語の読みや、その単語の表記（漢字
等）、その単語の品詞情報等が記憶されている。また、
単語同士の接続関係を品詞情報により規定するデータを
記憶した接続テーブルも合わせ持っている。The kana-kanji conversion programs include a kana-kanji conversion program 41, which is the kana-kanji conversion means of the present invention, and a kana-kanji conversion dictionary 48, which is the kana-kanji conversion dictionary of the present invention. As shown in FIG. 4, the kana-kanji conversion dictionary 48 stores the reading of each word, the notation (kanji, etc.) of the word, the part-of-speech information of the word, and the like. Also,
It also has a connection table that stores data that defines the connection relationship between words based on part-of-speech information.

【００１６】翻訳関連としては、翻訳プログラム部４２
と、本発明の翻訳辞書としての翻訳形態素解析辞書４９
とがある。この翻訳形態素解析辞書４９には、図５に示
すように、日本語の表記と、その読み、品詞等の情報が
記憶されている。ここでは、訳語（英語）の情報も共に
持たせているが、訳語については別の辞書を持つように
構成しても良い。訳語（英語）の品詞については、この
場合、ｖは動詞、ａは助動詞、ｄは代名詞の所有形、ｎ
は名詞を表す。Regarding translation, the translation program section 42
And a translation morphological analysis dictionary 49 as the translation dictionary of the present invention
There is. As shown in FIG. 5, the translated morphological analysis dictionary 49 stores information such as Japanese notation, its reading, and part of speech. Here, the translation word (English) information is also provided, but the translation word may be configured to have another dictionary. Regarding the part of speech of the translated word (English), in this case, v is a verb, a is an auxiliary verb, d is a pronoun possessive form, and n is
Represents a noun.

【００１７】翻訳プログラム部４２は、後述の形態素解
析プログラム４３、解析プログラム４４、変換プログラ
ム４５、生成プログラム４６、形態素生成プログラム４
７を格納している。The translation program unit 42 includes a morpheme analysis program 43, an analysis program 44, a conversion program 45, a generation program 46, and a morpheme generation program 4 which will be described later.
7 is stored.

【００１８】次に、このように構成された日英機械翻訳
装置の動作を、図６と図７のフローチャートを参照して
説明する。Next, the operation of the Japanese-English machine translation apparatus configured as described above will be described with reference to the flowcharts of FIGS. 6 and 7.

【００１９】入力装置１０からのキー入力を受け取り
（Ｓ１０）、その入力が読み文字列であった場合には
（Ｓ１２・ＹＥＳ）、そのかな文字コードがＲＡＭ３０
の読み入力バッファ領域３１に記憶される（Ｓ１４）。
例えば、読み文字列「わたしのなまえはようこです。」
が入力されると、例えば図３に示すように、そのかな文
字コード（わかりやすいように、図にはひらがなで示
す）が、先頭からの文字番号と共に読み入力バッファ領
域３１に記憶される。When a key input from the input device 10 is received (S10) and the input is a read character string (S12, YES), the kana character code is stored in the RAM 30.
Is stored in the reading input buffer area 31 (S14).
For example, the reading string "My name is Yoko."
Is input, the kana character code (indicated by hiragana in the figure for easy understanding) is stored in the read input buffer area 31 together with the character number from the beginning, as shown in FIG.

【００２０】入力が読みでもなく（Ｓ１２・ＮＯ）、変
換の指示でもない場合には（Ｓ１６・ＮＯ）、その他の
処理が行われる（Ｓ１８）。If the input is neither reading (S12 / NO) nor conversion instruction (S16 / NO), other processing is performed (S18).

【００２１】入力が変換の指示であれば（Ｓ１６・ＹＥ
Ｓ）、かな漢字変換が行われる。すなわち、図４に示す
ような、ＲＯＭ４０中のかな漢字変換辞書４８に基づい
て、かな漢字変換プログラム４１により、読み入力バッ
ファ領域３１に記憶されているかな文字コードが、例え
ば「私の名前は陽子です。」と、漢字かな混じり文に変
換され（Ｓ２０）、候補として出力装置５０に出力され
る（Ｓ２２）。具体例を、図２の画面左側に示す。If the input is a conversion instruction (S16.YE)
S), Kana-Kanji conversion is performed. That is, based on the Kana-Kanji conversion dictionary 48 in the ROM 40 as shown in FIG. 4, the Kana-Kanji conversion program 41 stores the Kana character code stored in the reading input buffer area 31, for example, "My name is proton. Is converted into a kanji-kana mixed sentence (S20) and output as a candidate to the output device 50 (S22). A specific example is shown on the left side of the screen in FIG.

【００２２】次のキー入力で（Ｓ２４）、確定キーが押
されなければ（Ｓ２６・ＮＯ）、次候補や文節の区切り
なおし等の処理をする（Ｓ２８）。When the confirmation key is not pressed by the next key input (S24) (S26, NO), the process of re-dividing the next candidate or phrase is performed (S28).

【００２３】確定キーが押されれば（Ｓ２６・ＹＥ
Ｓ）、日英翻訳に供せられる原文として確定処理をす
る。すなわち、かな漢字変換結果バッファ領域３２に、
この変換結果が記憶されると共に、Ｓ１２において読み
入力バッファ３１に記憶した、かな文字コードの先頭か
らの文字番号が、それぞれの単語と対応して記憶される
（Ｓ２０）。例えば、図３に示した［かな漢字変換結果
記憶］の例では、かな漢字変換結果の単語「私」は、読
み文字列の１番目から３番目（次の単語「の」は、読み
文字列の４番目からと示されているので、その１つ前ま
で）、すなわち「わたし」に対応し、単語「陽子」は、
読み文字列９番目から１１番目、すなわち「ようこ」に
対応する、ということが記憶される。If the confirm key is pressed (S26.YE
S): Finalize the original text for Japanese-English translation. That is, in the Kana-Kanji conversion result buffer area 32,
The conversion result is stored, and the character numbers from the beginning of the kana character code stored in the reading input buffer 31 in S12 are stored in association with each word (S20). For example, in the example of [Kana-Kanji conversion result storage] shown in FIG. 3, the word "I" in the Kana-Kanji conversion result is the first to third characters in the reading character string (the next word "no" is the reading character string 4). It is indicated from the th, so it corresponds to "I", and the word "proton" corresponds to
It is stored that it corresponds to the 9th to 11th reading character strings, that is, "Yoko".

【００２４】次に翻訳キーが押されると（Ｓ３２・ＹＥ
Ｓ）、翻訳処理に入る。Next, when the translation key is pressed (S32.YE
S), translation processing starts.

【００２５】まず、図５に示したような翻訳形態素解析
辞書４９を用いて、形態素解析プログラム４３により、
形態素解析処理がなされる（Ｓ３６）。以下、図８に基
づき詳しく説明する。First, the translation morphological analysis dictionary 49 as shown in FIG.
Morphological analysis processing is performed (S36). Hereinafter, a detailed description will be given with reference to FIG.

【００２６】まず、文字列の切り出しが行われる（Ｓ６
１）。すなわち、かな漢字変換結果バッファ領域３２に
記憶された表記の、前からいちばん長い文字列から順
に、翻訳形態素解析辞書４９に単語があるか否かをチェ
ックし、なければ、後ろから一つずつ短くした長さの単
語があるか否かをチェックしていく。動詞等の語尾が変
化する語は、終止形で検索する（語幹で検索するよう設
定してもよい）等、基本形で検索を可能にしておく。First, the character string is cut out (S6).
1). That is, it is checked whether or not there is a word in the translation morphological analysis dictionary 49 in order from the longest character string from the front of the notation stored in the kana-kanji conversion result buffer area 32, and if there is no, it is shortened one by one from the back. Check if there is a word of length. For words whose endings are changed, such as verbs, it is possible to search in the basic form, such as searching in the final form (may be set to search in the stem).

【００２７】例えば、「私の名前は陽子です。」という
原文であれば、まず、「私の名前は陽子です。」を切り
出し（Ｓ６１、Ｓ６２・ＮＯ）、図５に示すような翻訳
形態素解析辞書４９で日本語表記を検索をする（Ｓ６
３）が、そのような表記はないので（Ｓ６４・ＮＯ）、
文字列を一つ削って、「私の名前は陽子です」として、
また同様の処理を行う。この場合、次々に文字列を削っ
ていって、「私の」が切り出されたところで（Ｓ６１、
Ｓ６２・ＮＯ）、翻訳形態素解析辞書４９内の表記と一
致する（Ｓ６４・ＹＥＳ）。For example, in the case of the original sentence "My name is proton.", First, "My name is proton." Is cut out (S61, S62, NO), and the translation morphological analysis as shown in FIG. 5 is performed. Search the Japanese notation in the dictionary 49 (S6
3), but there is no such notation (S64, NO),
Delete one of the strings and say, "My name is Yoko."
The same processing is performed. In this case, the character strings are deleted one after another, and when "my" is cut out (S61,
(S62, NO), and matches the notation in the translation morphological analysis dictionary 49 (S64, YES).

【００２８】次いで、切り出された文字列と同じ表記
が、翻訳形態素解析辞書４９に複数存在するか否かを調
べる（Ｓ６５）。この場合、「私の」という表記は、複
数存在しないので（Ｓ６５・ＮＯ）、日本語の品詞情報
等で接続をチェックし（Ｓ６９）、接続がＯＫなら（Ｓ
７０・ＹＥＳ）、翻訳形態素解析辞書４９に基づき、そ
の単語に対応する訳語と品詞が、単語の切れ目と共に、
ワークエリア３４に一時記憶される（Ｓ７１）。Next, it is checked whether or not the same notation as the cut out character string exists in the translation morphological analysis dictionary 49 (S65). In this case, since there is not more than one "my" notation (S65 / NO), the connection is checked with Japanese part-of-speech information (S69), and if the connection is OK (S65).
70, YES), based on the translation morphological analysis dictionary 49, the translated word and the part of speech corresponding to the word, together with the break of the word,
It is temporarily stored in the work area 34 (S71).

【００２９】このようにして、「私の」、「名前」、
「は」までが切り出され、接続のチェックが通って記憶
されたところで、次に「陽子」が切り出される（Ｓ６
１、Ｓ６２・ＮＯ）。すると、翻訳形態素解析辞書４９
に表記は存在する（Ｓ６４・ＹＥＳ）が、その表記「陽
子」は２つ有り、複数である（Ｓ６５・ＹＥＳ）。翻訳
形態素解析辞書４９に登録されているレコード「陽子」
の読みには、「ようこ」と「ようし」が有り、別の読み
の単語なので（Ｓ６６・ＹＥＳ）、かな漢字変換結果バ
ッファ領域３２に記憶されているかな漢字変換結果の表
記と、読み入力バッファ３１に記憶されているその読み
を参照し（Ｓ６７）、どちらの読みの単語かを選択する
（Ｓ６８）。In this way, "my", "name",
"Ha" is cut out, and when the connection check is memorized, "proton" is cut out next (S6).
1, S62, NO). Then, the translation morphological analysis dictionary 49
Is present (S64, YES), but the notation "proton" is two and plural (S65, YES). Record "proton" registered in translation morphological analysis dictionary 49
The readings of “Kana-Kanji” and “Yoshi” are different reading words (S66, YES). Therefore, the Kana-Kanji conversion result notation stored in the Kana-Kanji conversion result buffer area 32 and the reading input buffer 31 The reading which is stored in is referred to (S67), and which reading word is selected (S68).

【００３０】例えば、図３のように記憶されている場
合、［かな漢字変換結果］の「陽子」という文字列は
「９」という数字と共に記憶されており、「陽子」の次
の単語は「１２」という数字と共に記憶されている。よ
って、［読み文字列記憶］の「９」番目に対応するかな
から、「１２」から１を引いた数字、すなわち「１１」
番目に対応するかなが、「陽子」に対応した読みである
ということが判定される（Ｓ６７）。For example, in the case of being stored as shown in FIG. 3, the character string "proton" of [Kana-Kanji conversion result] is stored together with the number "9", and the next word of "proton" is "12". It is memorized with the number. Therefore, since it corresponds to the "9" th of [reading character string storage], the number obtained by subtracting 1 from "12", that is, "11"
It is determined that the kana corresponding to the second is the reading corresponding to "proton" (S67).

【００３１】ここで「ようこ」という読みが判定された
ので、図５に示す翻訳形態素解析辞書４９中のレコード
「陽子」という表記の単語の中で、この読み「ようこ」
を持った単語を選択する（Ｓ６８）。Since the reading "YOKO" has been determined here, this reading "YOKO" is included in the word written as "proton" in the translation morphological analysis dictionary 49 shown in FIG.
The word with is selected (S68).

【００３２】次に接続のチェックを行い（Ｓ６９）、接
続がＯＫだと判断されれば（Ｓ７０・ＹＥＳ）、翻訳形
態素解析辞書４９をもとに、単語の訳語「Ｙｏｋｏ」と
品詞「ｎ（名詞）」が、単語の切れ目と共に、ワークエ
リア３４に記憶される。Next, the connection is checked (S69), and if it is determined that the connection is OK (S70, YES), the translated word "Yoko" and the part of speech "n (" Noun) ”is stored in the work area 34 together with the word break.

【００３３】以上の処理を繰り返し、「私の」「名前」
「は」「陽子」「です」「。」が、切れ目と、訳語と、
訳語の品詞と共に、ワークエリア３４に一時記憶され、
切り出す文字列がなくなると（Ｓ６１、Ｓ６２・Ｎ
Ｏ）、形態素解析処理は終了する。以上の形態素解析処
理過程が、本発明の形態素解析手段を構成する。By repeating the above processing, "my""name"
"Ha", "Yoko", "Dai", "."
It is temporarily stored in the work area 34 together with the part of speech of the translated word,
When there is no character string to be cut out (S61, S62 ・ N
O), the morphological analysis process ends. The above morphological analysis processing steps constitute the morphological analysis means of the present invention.

【００３４】図７に戻り、ここからは周知であるが、解
析プログラム４４により構文解析を行い、品詞の並びの
構造や、その係り受け関係等を求める（Ｓ３８）。Returning to FIG. 7, as is well known from here on, a parsing is performed by the analysis program 44 to obtain the structure of the part of speech sequence, its dependency relationship, etc. (S38).

【００３５】その後、変換プログラム４５により、原文
である日本語の構造を、訳文である英語の構造に変換す
る（Ｓ４０）。After that, the conversion program 45 converts the original Japanese structure into the translated English structure (S40).

【００３６】次に、生成プログラム４６により、訳文の
構文構造に基づいて、先にワークエリア３４に記憶され
ている、それぞれの単語の訳語候補を、訳文の言語形態
に従った語順に並び変え、日本語の原文に対する英語の
訳文を得る（Ｓ４２）。Next, the generation program 46 rearranges the translated word candidates of each word previously stored in the work area 34 based on the syntactic structure of the translated text, in the order of words according to the language form of the translated text, An English translation of the Japanese original is obtained (S42).

【００３７】そして、形態素生成プログラム４７によ
り、その訳文候補を構成する各訳語候補の基本形を語尾
変化させる等の変形処理をし（Ｓ４４）、その訳文を適
切な言語表現として完成させ、出力装置５０から図２の
画面右側のように出力する。Then, the morpheme generation program 47 performs transformation processing such as changing the ending of the basic form of each translated word candidate that constitutes the translated sentence candidate (S44), and completes the translated sentence as an appropriate language expression, and outputs the output device 50. To output as shown on the right side of the screen in FIG.

【００３８】その後、ユーザーのキー入力が確定キーな
ら（Ｓ５０・ＹＥＳ）、訳文を確定し（Ｓ５４）、後編
集をする等して（Ｓ５６）、翻訳を終了する。After that, if the user's key input is the confirm key (S50, YES), the translated sentence is confirmed (S54), post-editing is performed (S56), and the translation is completed.

【００３９】以上説明したように、本実施形態の機械翻
訳装置では、異音同表記の単語（例えば「陽子」には
「ようこ」と「ようし」との２通りの読みの単語が存在
する）については、その読みを参照して翻訳を行なうた
め、読み文字列「ようこ」をかな漢字変換した結果「陽
子」となった単語は、訳語は「Ｙｏｋｏ」となり、読み
文字列「ようし」をかな漢字変換し「陽子」とした単語
を翻訳すれば「ｐｒｏｔｏｎ」となる。従って、これま
でのように、同表記ではあるが、読みが異なる別の単語
として、間違って翻訳されることがなくなり、精度の高
い翻訳が実現される。As described above, in the machine translation device of this embodiment, there are two different pronunciation words (eg, "proton" and "yoko" and "youshi"). ), Since the translation is performed by referring to the reading, the word that became “Yoko” as a result of Kana-Kanji conversion of the reading character string “Yoko” has the translated word “Yoko” and the reading character string “Yoshi”. Kana-Kanji conversion and translation of the word "proton" results in "proton". Therefore, as in the past, it is possible to realize a highly accurate translation without being mistakenly translated as another word having the same notation but different reading.

【００４０】尚、本発明は、日英翻訳に限ったものでは
なく、日本語から他の言語に翻訳するの機械翻訳装置に
も応用可能なものである。The present invention is not limited to Japanese-English translation, but can be applied to a machine translation device for translating Japanese into another language.

【００４１】[0041]

【発明の効果】以上説明したことから明かなように、本
発明の請求項１に記載の機械翻訳装置によれば、日本語
を第二の言語に翻訳する際、異音同表記の単語について
は、日本語を入力した際の読みを参照するので、単語の
訳語を正しくつけることが可能となり、読みの違いによ
る誤翻訳を防ぐことができ、意図しない翻訳結果から別
の訳を探したり、翻訳をやり直したりすることなく、ス
ムーズに翻訳を行うことができる。As is apparent from the above description, according to the machine translation device of the first aspect of the present invention, when translating Japanese into a second language, words with different allophones are used. Refers to the reading when Japanese is input, so it is possible to correctly add the translation of the word, prevent mistranslation due to difference in reading, search for another translation from unintended translation results, It is possible to translate smoothly without re-translating.

[Brief description of drawings]

【図１】本実施形態の機械翻訳装置の制御部のブロック
図である。FIG. 1 is a block diagram of a control unit of a machine translation device of this embodiment.

【図２】本実施形態の機械翻訳装置の出力手段の画面の
表示形態の一例を示す図である。FIG. 2 is a diagram showing an example of a display form of a screen of an output unit of the machine translation device of this embodiment.

【図３】本実施形態のかな漢字変換結果記憶手段の記憶
内容の一例を示す図である。FIG. 3 is a diagram showing an example of stored contents of a kana-kanji conversion result storage means of the present embodiment.

【図４】本実施形態のかな漢字変換辞書の内容の概念図
である。FIG. 4 is a conceptual diagram of contents of a Kana-Kanji conversion dictionary according to the present embodiment.

【図５】本実施形態の翻訳形態素解析辞書の内容の概念
図である。FIG. 5 is a conceptual diagram of the contents of a translation morphological analysis dictionary of this embodiment.

【図６】本実施形態の処理の流れの前半を示すフローチ
ャートである。FIG. 6 is a flowchart showing the first half of the flow of processing of the present embodiment.

【図７】本実施形態の処理の流れの後半を示すフローチ
ャートである。FIG. 7 is a flowchart showing the latter half of the processing flow of this embodiment.

【図８】本実施形態の形態素解析の処理の流れを示すフ
ローチャートである。FIG. 8 is a flowchart showing a flow of morphological analysis processing according to the present embodiment.

【符号の説明】１０入力装置２０中央処理装置３０ＲＡＭエリア３１読み入力バッファ領域３２かな漢字変換結果バッファ領域４０ＲＯＭエリア４１かな漢字変換プログラム４３形態素解析プログラム４８かな漢字変換辞書４９翻訳形態素解析辞書[Explanation of symbols] 10 input device 20 central processing unit 30 RAM area 31 reading input buffer area 32 kana-kanji conversion result buffer area 40 ROM area 41 kana-kanji conversion program 43 morphological analysis program 48 kana-kanji conversion dictionary 49 translation morphological analysis dictionary

Claims

[Claims]

1. An input unit for inputting a Japanese reading character string, a Kana-Kanji conversion dictionary storing a reading of a word and a notation for the reading, and a reading character string input by the input unit, the Kana-Kanji character. Kana-Kanji conversion means for converting into a notation character string using a conversion dictionary, Kana-Kanji conversion result storage means for storing the notation character string converted by the Kana-Kanji conversion means, and necessary for translating Japanese into a second language A translation dictionary that stores information such as Japanese notation, reading of the notation, and translations of the notation in a second language, and the notation character string stored in the Kana-Kanji conversion result storage means A morpheme analysis means for performing a morpheme analysis while referring to the information stored in the second phrase using the translation dictionary for the notation character string morpheme analyzed by the morpheme analysis means. In a machine translation device including a translation unit that translates into words, the Kana-Kanji conversion result storage unit stores the converted notation character string in association with the reading of each word, and the morphological analysis unit includes the translation unit. When a plurality of words having the same notation as the display character string are present in the dictionary, the morphological analysis is performed by referring to the reading of the notation character string stored in the kana-kanji conversion result storage means. Machine translation device.