JP3161942B2

JP3161942B2 - Translation machine translation device

Info

Publication number: JP3161942B2
Application number: JP14791795A
Authority: JP
Inventors: 稔幸奥西
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1995-06-14
Filing date: 1995-06-14
Publication date: 2001-04-25
Anticipated expiration: 2016-04-25
Also published as: EP0749075A2; EP0749075A3; JPH096787A; US5873055A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、原文における各言語の
近傍に訳語を配置して出力する訳振り機械翻訳装置に関
する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a translation machine translation apparatus for arranging and outputting translated words near each language in an original sentence.

【０００２】[0002]

【従来の技術】従来、ある言語で書かれた文章を別の言
語に翻訳する際、入力文が複数行にわたるような長い文
では訳文と入力文の対応関係をとるのが難しい場合があ
る。特に、英語と日本語のように文の構造が大きく異な
る場合には「入力文の１行目の語句の訳が訳文では３行
目に存在する」という現象はしばしば見受けられる。2. Description of the Related Art Conventionally, when translating a sentence written in a certain language into another language, it is sometimes difficult to establish a correspondence between the translated sentence and the input sentence in a long sentence in which the input sentence extends over a plurality of lines. In particular, when the sentence structures are significantly different, such as in English and Japanese, the phenomenon that "the translation of the phrase on the first line of the input sentence exists on the third line in the translated sentence" is often seen.

【０００３】現在の機械翻訳システムは訳文の後編集操
作を前提としている以上、入力文と翻訳結果をつきあわ
せて見比べる過程で語句の対応関係を一目で見つけられ
ないことは、翻訳精度がもたらす以上の負担を利用者に
強いることになる。The current machine translation system presumes post-editing of a translated sentence, and the inability to find the correspondence between words at a glance in the process of comparing an input sentence and a translation result at a glance is due to the translation accuracy. Burden on the user.

【０００４】外国語の文の速読や粗読をする場合、ま
た、英語がある程度理解できる人や入力文がある程度簡
単な構造をしている時には、上記のような問題を抱える
１文単位の訳文が生成されるより、自分が知らない難し
い英単語にだけ訳語が振られていれば、後は自分で英文
を読んで文を理解する方が便利といったことがある。When reading or reading a foreign language sentence quickly, or when a person who can understand English to a certain extent or an input sentence has a somewhat simple structure, a one-sentence unit having the above-described problem is generated. If a translation is assigned only to a difficult English word that you do not know, it may be more convenient to read and understand the sentence yourself.

【０００５】このような需要に応える従来技術として、
特開平６−２４３１６２号公報「翻訳支援装置」および
特開平６−３２５０８１号公報「機械翻訳装置」があ
る。[0005] As a conventional technology to meet such demands,
JP-A-6-243162 discloses a "translation support device" and JP-A-6-325081 discloses a "machine translation device".

【０００６】まず、この特開平６−２４３１６２号公報
は、原文における各言語の近傍に訳語を配置して出力
し、原語と訳語の関係を明確にする翻訳支援装置に関す
るものである。この翻訳支援装置は、図１２に示すよう
に、キー入力部２１から入力されて文章メモリ２２に格
納される和文の文章データを漢字に変換するための辞書
データを格納するとともに、英文の文書データを翻訳す
るための訳語データを格納する辞書メモリ２３や、翻訳
支援処理に際して文書メモリ２２から読み出される原語
のうち対訳が不要である原語（英単語）を記憶する訳語
不要単語メモリ２４や文書メモリ２２から読み出した訳
語を格納する訳語レジスタ２５ｂなどから形成されるレ
ジスタ２５などにより構成されている。First, Japanese Patent Laid-Open No. Hei 6-243162 relates to a translation support apparatus for arranging and outputting a translated word in the vicinity of each language in an original sentence and clarifying the relationship between the original word and the translated word. As shown in FIG. 12, the translation support apparatus stores dictionary data for converting Japanese sentence data input from the key input unit 21 and stored in the sentence memory 22 into kanji, and also stores English document data. A dictionary memory 23 for storing translated word data for translating a word, a translation-free word memory 24 and a document memory 22 for storing original words (English words) that need not be translated among the original words read from the document memory 22 during the translation support processing. And a register 25 formed from a translated word register 25b for storing the translated word read from the.

【０００７】この翻訳支援装置２６の動作について以下
に説明する。The operation of the translation support device 26 will be described below.

【０００８】例えば、”I will purchase the restaura
nt from my uncle." という英文がキー入力部２１から
入力され、文書メモリ２２に格納されているとする。こ
のキー操作により翻訳が指示されると、ＣＰＵ２７は、
文書メモリ２２中の文字データが原語レジスタ２５ａに
格納され、原語の桁数を数えて原語桁数レジスタ２５ｃ
に格納するように制御し、その後、原語レジスタ２５ａ
に格納した原語データが訳語不要単語メモリ２４に格納
されているか否かをサーチし、その原語に訳語を付与す
ることが必要か否かを判別する。例えば、"I","will","
the","from","my"が訳語不要単語メモリ２４に格納され
ていたとすると、残りの"purchase","restaurant","unc
le"に対して訳語を付加する必要があるため、対応する
訳語を辞書メモリ２３からサーチし、訳語がある時は、
訳語を辞書メモリ２３から読み出して訳語レジスタ２５
ｂに格納するとともに、訳語の桁数を計算して訳語桁数
レジスタ２５ｄに記憶する。その後、原語、訳語の桁数
に応じて表示処理が行われ、表示メモリ２８を介して表
示部２９で表示される。このとき、次に示すように、単
語単位に区切って見やすく訳語が付与される。For example, "I will purchase the restaura
It is assumed that the English sentence “nt from my uncle.” is input from the key input unit 21 and stored in the document memory 22. When a translation is instructed by this key operation, the CPU 27
The character data in the document memory 22 is stored in the source language register 25a, and the number of digits of the source language is counted to obtain the source word digit register 25c.
To be stored in the original language register 25a.
Is searched for whether or not the original word data stored in the translation-necessary word memory 24 is stored, and it is determined whether or not it is necessary to add a translation word to the original word. For example, "I", "will", "
Assuming that the "," from "," my "are stored in the translation unnecessary word memory 24, the remaining" purchase "," restaurant "," unc "
Since it is necessary to add a translated word to "le", the corresponding translated word is searched from the dictionary memory 23, and when there is a translated word,
The translated word is read from the dictionary memory 23 and the translated word register 25
b, the number of digits of the translated word is calculated and stored in the translated word digit register 25d. Thereafter, display processing is performed according to the number of digits of the original language and the translated word, and the display is performed on the display unit 29 via the display memory 28. At this time, as shown below, translated words are assigned in units of words for easy viewing.

【０００９】次に、特開平６−３２５０８１号公報「機械翻訳装置」
について説明する。[0009] Next, Japanese Unexamined Patent Publication No. 6-325081 entitled "Machine Translator"
Will be described.

【００１０】この特開平６−３２５０８１号公報は、品
詞や訳語などの目的原語の情報だけを選択して提示し、
対象原語の読解を容易にする機械翻訳装置である。この
機械翻訳装置は、図１３に示すように、第１原語文を入
力するための入力手段３１と、解析辞書３２を利用して
形態素解析を行う形態素解析手段３３と、第１言語を品
詞の連鎖確立表３４を利用して第１原語文の各単語に対
応する最適品詞を求める品詞推定手段３５と、第１言語
と第２言語の情報とを対応させた出力を作成する出力形
態決定手段３６と、出力装置に出力する出力手段３７と
により構成される。[0010] This Japanese Patent Application Laid-Open No. Hei 6-325081 selects and presents only information of a target original language such as a part of speech or a translated word.
A machine translation device that facilitates reading of target language. As shown in FIG. 13, the machine translation device includes an input unit 31 for inputting a first original sentence, a morphological analysis unit 33 for performing a morphological analysis using an analysis dictionary 32, and a A part-of-speech estimating unit 35 for finding an optimal part-of-speech corresponding to each word of the first original sentence using the chain establishment table 34, and an output form determining unit for creating an output in which information of the first language and the information of the second language are associated with each other. And output means 37 for outputting to an output device.

【００１１】この機械翻訳装置３８の動作について以下
に説明する。The operation of the machine translation device 38 will be described below.

【００１２】例えば、"I will book the restaurant fo
r my uncle."という英文が入力手段３１により入力され
たとすると、形態素解析により形態素に分割され、各形
態素毎に辞書引きされる。その結果、英語などの多品詞
語では、下記に示すように形態素毎に複数の候補がセッ
トされる。For example, "I will book the restaurant fo
If the English sentence "r my uncle." is input by the input means 31, it is divided into morphemes by morphological analysis, and the dictionary is looked up for each morpheme. As a result, in a multi-speech word such as English, as shown below, A plurality of candidates are set for each morpheme.

【００１３】単語品詞候補 I 代名詞 will 助動詞／名詞 book 名詞／動詞 the 冠詞 restaurant 名詞 for 前置詞／接続詞 my 代名詞／感嘆詞 uncle 名詞上記例文では、"will","book","for","my"には複数の品
詞候補があって曖昧性がある。そこで、品詞推定手段３
５では品詞の連接確率を推定し、その連接確率に基づい
て品詞列の出現確率を算出し、確率が最大である品詞列
を選択する。この従来例では、品詞推定手段３５として
連接確率を直前２品詞までに制限するtrigramモデルを
利用し、さらに、単語内相対品詞出現確率も加味する。
今、この品詞推定手段３５により、入力文の品詞は下記
に示すように推定される。特に、直前の品詞が「助動
詞」であるなどのことから、"book"の品詞は「動詞」と
推定できる。Word Part-of-speech candidate I Pronoun will auxiliary verb / noun book noun / verb the article restaurant noun for preposition / connective my pronoun / exclamation uncle noun In the above example sentence, "will", "book", "for", "my" Has multiple parts of speech and is ambiguous. Therefore, the part of speech estimation means 3
In step 5, the part-of-speech connection probability is estimated, the appearance probability of the part-of-speech sequence is calculated based on the connection probability, and the part-of-speech sequence having the highest probability is selected. In this conventional example, a part-of-speech estimating means 35 uses a trigram model that limits the connection probability to the two preceding parts-of-speech, and further takes into account the relative part-of-speech appearance probability in words.
Now, the part of speech of the input sentence is estimated by the part of speech estimation means 35 as shown below. In particular, the part of speech of "book" can be estimated as a "verb" because the part of speech immediately before is an "auxiliary verb".

【００１４】単語品詞候補 I 代名詞 will ［助動詞］／名詞 book 名詞／［動詞］ the 冠詞 restaurant 名詞 for ［前置詞］／接続詞 my ［代名詞］／感嘆詞 uncle 名詞［］内が推定した品詞この後は、その品詞に対応した第２言語の情報を辞書か
ら得て、出力形態決定手段３６により第１言語の単語の
位置に対応させて第２言語の情報を付与して行けばよ
い。このとき、第２言語情報の内容のうち、よく知られ
た内容を頻繁に出力するのを抑制するために、辞書中に
出力抑制欄を設けてマークしておく。また、複数の用法
がある見出しに対しては用法毎に優先度を数値で記入し
ておくと一部の用法の出力だけを抑制することができ
る。例えば、例文の"book"は「名詞（本）」のときには
抑制し、「動詞（予約する）」のときは出力する。以上
の処理により、例文に対する出力結果は以下のようにな
る。The word part of speech candidate I pronoun will [auxiliary verb] / noun book noun / [verb] the article restaurant noun for [preposition] / conjunction my [pronoun] / exclamation uncle noun [] The information of the second language corresponding to the part of speech may be obtained from the dictionary, and the output form determining means 36 may add the information of the second language in correspondence with the position of the word of the first language. At this time, in order to suppress frequent output of well-known contents of the contents of the second language information, an output suppression column is provided and marked in the dictionary. Also, for a heading having a plurality of usages, if the priority is numerically entered for each usage, it is possible to suppress output of only some usages. For example, "book" in the example sentence is suppressed when it is "noun (book)", and is output when it is "verb (reserve)". With the above processing, the output result for the example sentence is as follows.

【００１５】 [0015]

【００１６】[0016]

【発明が解決しようとする課題】しかしながら、上記従
来の翻訳支援装置２６や機械翻訳装置３８では、以下の
（ａ），（ｂ）に示すような問題があった。However, the above-mentioned conventional translation support device 26 and machine translation device 38 have the following problems (a) and (b).

【００１７】（ａ）訳語の精度が悪い。(A) The accuracy of the translated word is poor.

【００１８】特開平６−２４３１６２号公報の翻訳支援
装置２６では、文の前後の単語に関係なく、毎回同じ訳
語（一番先に格納された訳や最も優先される訳）を付与
する。例えば、単語"book"に対しては品詞が「名詞」訳
が「本」という解釈が最も一般的なので、次のような上
述した例文でも「本」という誤った訳語を付与してしま
う。The translation support device 26 of Japanese Patent Laid-Open No. Hei 6-243162 assigns the same translation (the translation stored first or the translation with the highest priority) every time, regardless of the words before and after the sentence. For example, the most common interpretation of the word “book” is that the part of speech is “noun” and the translation is “book”, so even the following example sentence would give an incorrect translation of “book”.

【００１９】これに対して、特開平６−３２５０８１号公報の機械翻
訳装置３８では、前述したように直前２単語との連接品
詞をチェックするため、上記した入力文例の"book"には
「動詞：予約する」という正しい訳を与えることができ
る。[0019] On the other hand, in the machine translator 38 disclosed in Japanese Patent Laid-Open No. Hei 6-325081, as described above, the part-of-speech with the immediately preceding two words is checked. "Can be given the correct translation.

【００２０】しかしながら、特開平６−３２５０８１号
公報の機械翻訳装置３８において、例えば構文解析を使
ったとしても、品詞しか特定できないので、同じ品詞の
訳語の中で意味的な制限で決まる訳語を正しく選択する
ことはできない。例えば、"take"は以下のように、種々
の意味（訳語）を持つ英単語であるが、 take（動詞）手に取る、奪う、連れていく、乗る、食
べる、購読する、要する、受ける、取る、・・・品詞を「動詞」と推定するだけでは、その中のどの訳語
を選択していいかわからない。以下に示す入力文におい
ても「取る」という最も一般的な訳語を振ってしまう
が、実際には、目的語が「人」を意味する場合には「連
れていく」という訳語が相応している。However, in the machine translation apparatus 38 disclosed in Japanese Patent Laid-Open No. 6-325081, even if, for example, parsing is used, only the part of speech can be specified. You cannot choose. For example, "take" is an English word with various meanings (translations) as follows, but take (verb) take, take, take, take, ride, eat, subscribe, need, receive, Take ... It is not possible to estimate which part of speech is to be selected simply by estimating the part of speech as a "verb". In the input sentence shown below, the most common translation of "take" is used, but in practice, if the object means "person", the translation of "take" is appropriate .

【００２１】このように、誤った訳語が振られたこれらの２つの入力
文を読んだ利用者は誤った理解をしてしまう。もちろ
ん、特開平６−２４３１６２号公報と特開平６−３２５
０８１号公報のどちらにおいても複数の訳語候補を出力
することはできるが、利用者自身に訳語を選択させると
いう負担を強いることになる。[0021] As described above, a user who reads these two input sentences to which an incorrect translation has been assigned will misunderstand. Of course, JP-A-6-243162 and JP-A-6-325
Although both translations can output a plurality of candidate translations, it imposes a burden on the user to select a translation.

【００２２】（ｂ）本当に必要な単語だけに訳語が振ら
れない。(B) No translation is given only to words that are really needed.

【００２３】利用者にとって自分の「知らない」単語に
だけ訳語が振られていればそれで十分である。逆に、
「知っている」単語に訳語が振られていても邪魔なこと
が多い。特開平６−２４３１６２号公報と特開平６−３
２５０８１号公報では「訳語不要単語」、「出力抑制不
要フィールドの設定」、「訳語に対する優先度」を予め
設けておき、訳語を付与する単語を制限している。これ
らの「訳語不要単語」はシステムに固定であるのに対
し、ある単語を「知っている」とか「知らない」という
のは利用者によって大きく異なる。つまり、英語の「初
級者」にとって知らない単語（訳語が欲しい単語）であ
っても「上級者」にとっては既知の単語であることは十
分考えられるため、特開平６−２４３１６２号公報と特
開平６−３２５０８１号公報のように、どんな利用者が
使おうが抑制される訳語が決っているシステムでは本当
に役立つ訳語付与が期待できない。It is sufficient for a user to assign a translation only to a word that he does not know. vice versa,
Even if a translation is given to a word that "knows", it is often disturbing. JP-A-6-243162 and JP-A-6-3
In Japanese Patent Publication No. 25081, “words not requiring translation”, “setting of fields not requiring output suppression” , and “priority to translations” are provided in advance to limit the words to which translations are assigned. While these "translation-unnecessary words" are fixed in the system, "knowing" or "don't know" a certain word greatly differs depending on the user. In other words, even words that are not known to English “beginners” (words for which translations are desired) are sufficiently considered to be words that are known to “experts”. As in JP-A-6-325081, in a system in which the translation is determined to suppress the use of any user, a really useful translation can not be assigned.

【００２４】このように、（ａ）訳語の精度が悪かった
り、（ｂ）訳振りに不必要な単語が多くあると紛らわし
く、外国語の文の速読や粗読を効率よく容易に行うこと
ができないという問題があった。As described above, it is confusing that (a) the accuracy of the translated word is poor or (b) there are many words unnecessary for translation, and it is possible to efficiently and easily read a foreign language sentence quickly and roughly. There was a problem that can not be.

【００２５】本発明は、上記従来の問題を解決するもの
で、外国語の入力文の速読や粗読を効率よく容易に行う
ことができる訳振り機械翻訳装置を提供することを目的
とする。An object of the present invention is to solve the above-mentioned conventional problems, and an object of the present invention is to provide a translation machine translation apparatus capable of efficiently and easily reading an input sentence in a foreign language efficiently and easily. .

【００２６】[0026]

【課題を解決するための手段】本発明の訳振り機械翻訳
装置は、単語に対する品詞などの文法情報および訳語を
格納するとともに、該訳語毎に訳振りレベルが付与され
た単語辞書と、入力文を単語列に分割し、前記単語辞書
から該単語列の各単語に対する品詞などの文法情報およ
び訳語を得るとともに、該各単語の時制・人称・数を解
析する辞書引き・形態素解析手段と、前記辞書引き・形
態素解析手段で得られた文法情報に従って該単語列の単
語間の係り受け関係などの入力文章構造を決定する構文
解析手段と、前記構文解析手段で決定した入力文章構造
を翻訳文に対する文章構造に変換し、意味的な整合性を
検査して該単語に対する該訳語を選択する構文変換手段
と、前記構文変換手段で得られた訳語の並び順を決定
し、活用変化を調整して翻訳文を生成する翻訳文生成手
段とを備えた訳振り機械翻訳装置であって、利用者が、
該各単語の難易度、頻出度または重要度などのレベル情
報に基づいて設定された訳振りレベルを指定可能な訳振
りレベル設定手段と、前記訳振りレベル設定手段で指定
された訳振りレベルと前記単語辞書における訳振りレベ
ルとの関係により訳語を付与するか否かを判定する翻訳
出力判定手段と、前記翻訳文生成手段までの翻訳処理を
行なった後に、前記翻訳出力判定手段で判定した訳語に
対して、入力文の各単語とその訳語との対応をとって該
訳語を該入力文とともに見易い形式に整えて出力する原
語訳語対応出力手段と、をさらに備え、前記翻訳出力判
定手段は、前記訳振りレベル設定手段で指定された訳振
りレベルと前記単語辞書における訳振りレベルとの関係
により訳語を付与するか否かを判定するとともに、前記
解析処理中に得られる文の構文構造、前記変換処理中に
得られる訳語の文字種構成、前記変換処理中に得られる
訳語が分野指定されたものであるか否か、および、同一
訳語の付与位置間の距離のうち、少なくともいずれかの
情報を言語情報とし、該言語情報に基づいて訳語を付与
するか否かを判定することを特徴とするものであり、そ
のことにより上記目的が達成される。SUMMARY OF THE INVENTION A translation machine translation apparatus according to the present invention translates grammatical information such as part of speech of a word and a translated word.
At the same time, the translation level is assigned to each translation.
And the input sentence is divided into word strings,
Grammar information such as the part of speech for each word in the word string
And get the translated words and solve the tense, person, and number of each word.
A dictionary lookup / morphological analysis means for analyzing, and the dictionary lookup / form
According to the grammatical information obtained by the morphological analysis means, the word sequence
Syntax to determine input sentence structure such as dependency relation between words
Analysis means, and the input sentence structure determined by the syntax analysis means
Into a sentence structure for the translated sentence,
Syntax conversion means for checking and selecting the translation for the word
And the order of the translated words obtained by the syntax conversion means is determined.
Translation generator that generates translations by adjusting
A translation machine translation device having a step and
Level information such as difficulty, frequency or importance of each word
Translation that can specify the translation level set based on the
Specified by the translation level setting means and the translation level setting means
Translation level and translation level in the word dictionary
Translation to determine whether to add a translation word based on the relationship with
Output determination means and translation processing up to the translation sentence generation means.
After performing, the translated word determined by the translation output determination means
On the other hand, each word of the input sentence is
A source that prepares the translated word together with the input sentence in an easy-to-view format and outputs it.
And a translation output unit.
The translation means specifies the translation specified by the translation level setting means.
Between the translation level and the translation level in the word dictionary
And whether or not to add a translation by
The syntax structure of the sentence obtained during the parsing process,
Character type configuration of the translated word obtained, obtained during the conversion process
Whether the translated word is specified in the field, and is the same
At least one of the distances between
Use information as linguistic information and assign translations based on the linguistic information
It is characterized in that it is determined whether or not to do so, whereby the above object is achieved.

【００２７】さらに、好ましくは、本発明の訳振り機械
翻訳装置は、同一訳語の付与位置間の距離は、直前に付
与した同一訳語間の距離であることを特徴とするもので
あり、そのことにより上記目的が達成される。 Further, preferably, the translation machine translation apparatus of the present invention is arranged such that the distance between the positions at which the same translation is provided is immediately preceding.
It is characterized by the distance between the same translated words given , thereby achieving the above object.

【００２８】また、本発明の訳振り機械翻訳装置は、単
語に対する品詞などの文法情報および訳語を格納すると
ともに、該訳語毎に訳振りレベルが付与された単語辞書
と、入力文を単語列に分割し、前記単語辞書から該単語
列の各単語に対する品詞などの文法情報および訳語を得
るとともに、該各単語の時制・人称・数を解析する辞書
引き・形態素解析手段と、前記辞書引き・形態素解析手
段で得られた文法情報に従って該単語列の単語間の係り
受け関係などの入力文章構造を決定する構文解析手段
と、前記構文解析手段で決定した入力文章構造を翻訳文
に対する文章構造に変換し、意味的な整合性を検査して
該単語に対する該訳語を選択する構文変換手段と、前記
構文変換手段で得られた訳語の並び順を決定し、活用変
化を調整して翻訳文を生成する翻訳文生成手段とを備え
た訳振り機械翻訳装置であって、利用者が、該各単語の
難易度、頻出度または重要度などのレベル情報に基づい
て設定された訳振りレベルを指定可能な訳振りレベル設
定手段と、前記訳振りレベル設定手段で指定された訳振
りレベルと前記単語辞書における訳振りレベルとの関係
により訳語を付与するか否かを判定する翻訳出力判定手
段と、前記翻訳文生成手段までの翻訳処理を行なった後
に、前記翻訳出力判定手段で判定した訳語に対して、入
力文の各単語とその訳語との対応をとって該訳語を該入
力文とともに見易い形式に整えて出力する原語訳語対応
出力手段と、をさらに備え、前記翻訳出力判定手段は、
前記訳振りレベル設定手段で指定された訳振りレベルと
前記単語辞書における訳振りレベルとの関係により訳語
を付与するか否かを判定するとともに、前記構文解析処
理中に得られる文の構文構造、前記変換処理中に得られ
る訳語の文字種構成、前記変換処理中に得られる訳語が
分野指定されたものであるか否か、および、同一訳語の
付与位置間の距離のうち、少なくともいずれかの情報を
言語情報とし、該言語情報に基づいて訳語を付与するか
否かを判定し、さらに前記構文変換処理中に得られる同
一訳語の付与回数が、文書の所定範囲内で所定基準を越
えた場合、それ以降は該同一訳語を付与しないことを特
徴とするものであり、そのことにより上記目的が達成さ
れる。Further, the translation machine translation apparatus of the present invention
When storing grammatical information such as part of speech for words and translations
Both are word dictionaries to which a translation level is assigned for each of the translated words
And the input sentence is divided into word strings, and the word
Get grammatical information such as part of speech and translations for each word in the column
And a dictionary that analyzes the tense, person, and number of each word
Lookup / morpheme analysis means and the dictionary lookup / morpheme analyzer
The relationship between the words in the word string according to the grammatical information obtained in the column
Parsing means to determine input sentence structure such as receiving relation
And the translated sentence structure determined by the parsing means.
Into a sentence structure for
Syntactic conversion means for selecting the translation for the word;
Determine the order of the translated words obtained by the syntax conversion means, and
Translation generation means for generating a translation by adjusting
A translation machine translation device, wherein a user
Based on level information such as difficulty, frequency or importance
Translation level setting that can specify the translation level set
Translation means specified by the translation level setting means.
Between the translation level and the translation level in the word dictionary
Translation output determination method that determines whether to add a translated word based on
After performing the translation processing up to the step and the translation sentence generation means
Next, the translated word determined by the translation output
Correspondence between each word in the literary sentence and its translation is entered into the translation.
Supports original language translations that are output in an easy-to-read format along with the power sentences
Output means, and the translation output determination means,
The translation level specified by the translation level setting means;
Translated words according to the translation level in the word dictionary
To determine whether or not to add
Sentence structure obtained during processing, obtained during the conversion process
The character type configuration of the translated word, the translated word obtained during the conversion process is
Whether or not the field has been designated, and
Information on at least one of the distances between
Whether to add a translation based on the linguistic information
Is determined, and the same is obtained during the syntax conversion process.
The number of times a translated word is assigned exceeds a predetermined standard within a predetermined range of the document.
If the name is changed, it is noted that the same translation will not be added after that.
The purpose is achieved.
It is .

【００２９】さらに、好ましくは、本発明の訳振り機械
翻訳装置は、前記単語辞書中に付与する訳振りレベル
は、通常の辞書に付与された重要度マークに関する第１
情報と、前記入力文の各単語を構成する単語文字数に関
する第２情報と、品詞の種類に関する第３情報と、基本
辞書、ユーザ辞書または専門辞書など辞書の種類に関す
る第４情報、のうち少なくともいずれかの情報から構成
され、前記第１情報に基づいて訳振りレベルを設定する
際は、複数の辞書のうち、幾つの辞書に重要度マークが
付与されているか、また該重要度マークがどの程度の重
要度マークかに基づいて設定し、前記第２情報に基づい
て訳振りレベルを設定する際は、該単語文字数が一定の
文字数を越えるか否か、または該各単語の接頭辞および
接尾辞などの該各単語の構成により該各単語の意味が推
測可能か否かに基づいて設定し、前記第３情報に基づい
て訳振りレベルを設定する際は、特定品詞に対してレベ
ルを低く設定し、固有名詞および助動詞は訳振り対象か
ら除き、前記第３情報に基づいて訳振りレベルを設定す
る際は、基本辞書に登録されている単語は単語レベルを
低くし、ユーザ辞書または専門辞書などに登録されてい
る単語は単語レベルを高くする、ことで前記単語辞書中
に付与する訳振りレベルを設定することを特徴とするも
のであり、そのことにより上記目的が達成される。[0029] In addition, preferably, translation pretend machine translation equipment of the present invention, the swing level translation to be applied to in the word dictionary
Is the first of the importance marks attached to the normal dictionary.
Information and the number of word characters constituting each word of the input sentence.
Second information, third information on the type of part of speech,
For dictionary types such as dictionaries, user dictionaries, or specialized dictionaries
Composed of at least one of the fourth information
Setting a translation level based on the first information
In some cases, out of multiple dictionaries, how many dictionaries have importance marks
Has been given, and how important the importance mark is
Set based on the necessity mark and based on the second information
When setting the translation level, the number of words
Whether it exceeds the number of characters, or the prefix of each word and
The meaning of each word is deduced from the configuration of each word such as a suffix.
Set based on whether it is measurable or not, and based on the third information
When setting the translation level, level
Low, and proper nouns and auxiliary verbs can be translated
And set the translation level based on the third information.
The word registered in the basic dictionary
Lower, and it is registered in the user dictionary or specialized dictionary.
Words in the word dictionary by raising the word level
It is characterized by setting the translation level to be given to
Therefore, the above object is achieved .

【００３０】[0030]

【００３１】[0031]

【００３２】[0032]

【００３３】[0033]

【００３４】[0034]

【００３５】[0035]

【作用】本発明においては、入力された文を単語の形態
素列に分割し、その単語毎に単語辞書をひくとともに、
その単語の時制・人称・数などを解析する。この形態素
列における単語間の係り受けなど入力文章構造を決定
し、これを翻訳文に対する文章構造に変換し、意味的な
整合性を検査して単語に対する訳語を選択する。さら
に、この訳語の並びを決定し、活用変化を調整して翻訳
文を生成し、この翻訳文と入力文を見易い形式に整え、
この後、入力文の各単語と訳語との対応をとって出力す
る。このように、形態素解析、構文解析、構文変換さら
に翻訳文生成の翻訳処理を行った後に、入力文の各単語
と訳語との対応をとって訳振りをしているので、より精
度の高い訳語を出力することが可能となり、利用者は入
力文を正しく理解することができて、外国語の入力文の
速読や粗読を効率よく容易に行うことが可能となる。According to the present invention, an input sentence is divided into morpheme strings of words, and a word dictionary is searched for each word.
Analyze the tense, person, and number of the word. The input sentence structure such as the dependency between words in the morpheme sequence is determined, converted into the sentence structure for the translated sentence, and checked for semantic consistency to select a translated word for the word. Further, the arrangement of the translated words is determined, the translation is generated by adjusting the utilization change, and the translated sentence and the input sentence are arranged in an easy-to-view format.
Thereafter, the correspondence between each word of the input sentence and the translated word is output. In this way, after performing the morphological analysis, syntactic analysis, syntactic conversion, and translation processing for generating the translated sentence, translation is performed by associating each word of the input sentence with the translated word, so that a more accurate translated word is obtained. Can be output, the user can correctly understand the input sentence, and can quickly and easily read the input sentence in a foreign language efficiently and easily.

【００３６】また、各単語の難易度、頻出度および重要
度などのレベル情報に基づいて設定された訳振りレベル
設定手段で設定された訳振りレベルと、各単語の難易
度、頻出度および重要度などのレベル情報に基づいて設
定された単語辞書における訳振りレベルとの関係により
訳語を付与するか否かが判定されるので、利用者の語学
レベルに応じて訳語の不要な単語には訳語が付与され
ず、利用者の望む最低限必要な単語に、より精度の高い
適切な訳語を付与することが可能となって、利用者は外
国語の入力文の速読や粗読を効率良く容易に行うことが
可能となる。The translation level set by the translation level setting means set based on level information such as the degree of difficulty, frequency and importance of each word, and the level of difficulty, frequency and importance of each word Whether or not to add a translation is determined based on the relationship with the translation level in the word dictionary set based on level information such as the degree. Is not added, and it is possible to add more accurate and appropriate translations to the minimum required words desired by the user, and the user can efficiently read or roughly read foreign language input sentences efficiently This can be easily performed.

【００３７】この場合、単語辞書中に付与する訳振りレ
ベルを、通常の辞書に付与された重要度マークのうち、
重要度マークがどの程度の重要度マークか、どの程度多
くの該辞書に付与されているかに関する第１情報と、入
力文の各単語を構成する単語文字数に関する第２情報
と、品詞の種類に関する第３情報と、基本辞書、ユーザ
辞書および専門辞書など単語の辞書の種類に関する第４
情報とのうち、少なくともいずれかの情報などに基づい
て設定するが、第１〜第４の情報のうち情報が多い程、
必要な単語に正確な訳振りをし、かつ不要な単語に訳振
りしないという、よりきめ細かい正確な訳振りがなされ
る。In this case, the translation level given to the word dictionary is changed from the importance mark given to the normal dictionary.
First information about the degree of importance of the importance mark, the number of words assigned to the dictionary, the second information about the number of words constituting each word of the input sentence, and the second information about the type of part of speech. 3 information and 4th type of word dictionary such as basic dictionary, user dictionary and specialized dictionary
The information is set based on at least one of the information and the like.
A more detailed and accurate translation is made, so that necessary words are correctly translated and not unnecessary words.

【００３８】この単語文字数に基づいて行う訳振りレベ
ルの設定において、単語文字数が一定の文字数を越える
かどうかに基づいて設定するだけではなく、各単語の接
頭辞および接尾辞などの各単語の構成により各単語の意
味が推測可能か否かに基づいても設定するようにすれ
ば、さらによりきめ細かいすっきりとした訳振りがなさ
れる。In setting the translation level based on the number of word characters, not only is the setting based on whether the number of word characters exceeds a certain number of characters, but also the configuration of each word such as the prefix and suffix of each word. If the setting is made based on whether or not the meaning of each word can be guessed, a more detailed and clear translation is made.

【００３９】また、品詞の種類に基づいて行う訳振りレ
ベルの設定において、特定品詞は訳振りレベルを低く設
定し、固有名詞および助動詞は訳振り対象から除くよう
にすれば、訳振り状態がすっきりして外国語の入力文の
速読や粗読をより効率よく容易に行うことが可能とな
る。In setting the translation level based on the type of part of speech, the translation level of a specific part of speech is set low, and proper nouns and auxiliary verbs are excluded from the translation target. As a result, it is possible to more efficiently and easily read a foreign language input sentence quickly or roughly.

【００４０】さらに、訳語出力判定処理は、構文解析処
理、構文変換処理および翻訳文生成処理のうち少なくと
もいずれかの処理中に得られる言語情報に基づいても訳
語を付与するか否かを判定することが可能であり、より
きめ細かい正確な訳振りがなされる。Further, in the translated word output determination processing, it is determined whether or not to provide a translated word based on linguistic information obtained during at least one of the syntax analysis processing, the syntax conversion processing, and the translated sentence generation processing. And more precise translations are possible.

【００４１】この言語情報とは、構文解析処理中に得ら
れる文の構文構造、構文変換処理中に得られる訳語の文
字種構成、構文変換処理中に得られる訳語が分野指定さ
れたものであるか否か、および、同一訳語の付与位置間
の距離のうち少なくともいずれかの情報などであるが、
これら情報が多い程、必要な単語に正確な訳振りをし、
かつ不要な単語に訳振りしないという、よりきめ細かい
正確な訳振りがなされる。The linguistic information is the syntax structure of the sentence obtained during the syntax analysis process, the character type configuration of the translated word obtained during the syntax conversion process, and whether the translated word obtained during the syntax conversion process is specified in the field. No, and at least one of the distances between the positions where the same translation is given, etc.
The more information you have, the more accurate the words you need to translate,
In addition, a more detailed and accurate translation that does not translate into unnecessary words is performed.

【００４２】また、直前に付与した同一訳語間の距離に
よって訳語を付与するか否かを判定したり、また、構文
変換処理中に得られる同一訳語の付与回数が、文書の所
定範囲内で所定基準を越えた場合、それ以降は同一訳語
を付与しないようにすれば、一度訳語が付与されて人の
記憶に留まっているものや視野の範囲内で容易に参照可
能な訳語については付与する必要がなく、訳振り状態が
すっきりして外国語の入力文の速読や粗読をより効率よ
く容易に行うことが可能となる。It is also possible to determine whether or not to add a translation based on the distance between the same translations added immediately before, and to determine whether the number of times of the same translation obtained during the syntax conversion processing is within a predetermined range of the document. If the standard is exceeded, it is necessary not to assign the same translation after that, and it is necessary to assign translations that have been assigned once and remain in human memory or translations that can be easily referenced within the field of view As a result, the translation state is clear, and it is possible to more efficiently and easily read a foreign language input sentence quickly and roughly.

【００４３】[0043]

【実施例】以下、本発明の実施例について説明する。Embodiments of the present invention will be described below.

【００４４】図１は本発明の一実施例における訳振り機
械翻訳装置の基本構成を示すブロック図である。FIG. 1 is a block diagram showing a basic configuration of a translation machine translation apparatus according to an embodiment of the present invention.

【００４５】図１において、文字列および記号を入力す
る入力手段１は翻訳ＣＰＵ２を介して出力手段３に接続
され、訳振りに必要な単語に対応した訳語を入力文とと
もに出力手段３に出力し、出力手段３でそれを表示した
り印刷したりして出力する。また、この翻訳ＣＰＵ２は
記憶手段４と接続されており、翻訳ＣＰＵ２による翻訳
処理に必要な辞書データおよび処理結果を記憶手段４に
記憶している。In FIG. 1, an input means 1 for inputting a character string and a symbol is connected to an output means 3 via a translation CPU 2 and outputs a translated word corresponding to a word necessary for translation to the output means 3 together with an input sentence. Then, it is displayed or printed by the output means 3 and output. The translation CPU 2 is connected to the storage unit 4, and stores dictionary data and processing results required for translation processing by the translation CPU 2 in the storage unit 4.

【００４６】この翻訳ＣＰＵ２の構成について説明する
と、上記入力手段１が接続される訳振りレベル設定手段
５はユーザが入力手段１を介して自らの語学レベルに応
じた訳振りレベルを指定する。また、入力手段１が接続
される分野指定手段６はユーザが入力手段１を介して翻
訳専門分野を指定する。これらの訳振りレベル設定手段
５および分野指定手段６が接続される辞書引き・形態素
解析手段７は、単語辞書を引き、入力された文章を各形
態素列（単語列）に分割し、各単語の品詞などの文法情
報およびその訳語を得るとともに、各単語の時制・人称
・数などを解析する。この辞書引き・形態素解析手段７
が接続される構文解析手段８は、この文法情報に従っ
て、入力文の各形態素列における単語間の係り受けなど
の文構造をいわゆる構造解析木によって決定する。この
構文解析手段８が接続される構文変換手段９は、入力文
に対する構文解折木の構造を、翻訳文に対する構文解析
木に変換し、さらに意味的な整合性を検査して単語に対
する訳語を選択する。この構文変換手段９が接続される
翻訳文生成手段１０は、構文変換手段９で得られた訳語
の並び順を決定し、活用変化を調整して翻訳文を生成す
る。この翻訳文生成手段１０が接続される原語訳語対応
記憶手段１１は、形態素解析と、構文解析と、構文変換
と、翻訳文生成の翻訳処理を行なった後に、入力文の各
単語と訳語との対応をとってその位置関係を記憶する。
また、翻訳文生成手段１０が接続される翻訳出力判定手
段１２は、訳振りレベル設定手段５で指定した訳振りレ
ベルと単語辞書における原語の訳振りレベルとの関係に
より訳語を付与するか否かを判定する。この翻訳出力判
定手段１２が接続される訳語出力位置回数記憶手段１３
は訳語の出力位置回数を記憶する。また、翻訳出力判定
手段１２が接続される出力形式整形手段（原語訳語対応
出力手段）１４は出力手段３に接続され、出力手段３に
原文と翻訳文を見易い形式に整えて出力する。本実施例
の場合、出力形式整形手段１４は原語訳語対応出力手段
としても機能し、翻訳文生成手段１０までの翻訳処理を
行なった後に、翻訳出力判定手段１２で判定した訳語に
対して、入力文の各単語とその訳語との対応をとって訳
語を入力文とともに出力する。The configuration of the translation CPU 2 will be described. The translation level setting unit 5 to which the input unit 1 is connected specifies the translation level according to the language level of the user via the input unit 1 by the user. Further, the field designating means 6 to which the input means 1 is connected allows the user to designate a translation specialty field via the input means 1. The dictionary lookup / morpheme analysis means 7 to which the translation level setting means 5 and the field designation means 6 are connected draws a word dictionary, divides an input sentence into morpheme strings (word strings), and Obtain grammatical information such as part of speech and its translations, and analyze the tense, personal name, number, etc. of each word. This dictionary lookup / morphological analysis means 7
Is connected to the grammatical information, and determines the sentence structure such as the dependency between words in each morpheme sequence of the input sentence using a so-called structure analysis tree. The syntax conversion means 9 to which the syntax analysis means 8 is connected converts the structure of the syntax analysis tree for the input sentence into a syntax analysis tree for the translated sentence, further checks the semantic consistency, and converts the translation for the word. select. The translated sentence generating means 10 to which the syntax converting means 9 is connected determines the arrangement order of the translated words obtained by the syntax converting means 9 and adjusts the utilization change to generate a translated sentence. The source language translation correspondence storage unit 11 to which the translation sentence generation unit 10 is connected performs morphological analysis, syntax analysis, syntax conversion, and translation processing for generating a translated sentence, and then performs translation processing for each word of the input sentence and the translated word. The correspondence is stored and the positional relationship is stored.
The translation output determining means 12 to which the translated sentence generating means 10 is connected determines whether or not to provide a translated word based on the relationship between the translation level specified by the translation level setting means 5 and the translation level of the original word in the word dictionary. Is determined. Translation word output position count storage means 13 to which translation output determination means 12 is connected
Stores the number of output positions of the translated word. Further, an output format shaping unit (corresponding to an original language translation word) to which the translation output determining unit 12 is connected.
The output unit 14 is connected to the output unit 3, and outputs the original sentence and the translated sentence to the output unit 3 in an easily recognizable format. In the case of the present embodiment, the output format shaping means 14 also functions as an original language translated word corresponding output means. After performing the translation processing up to the translation sentence generating means 10, the translation output determining means 12 The translation is output together with the input sentence by associating each word of the sentence with its translation.

【００４７】また、この記憶手段４の構成について説明
すると、記憶手段４は、単語に対する品詞などの文法情
報および訳語を格納するとともに、各単語の見出し毎に
レベル情報が付与されている単語辞書４ａと、利用者が
指定した訳振りレベルを記憶する訳振りレベルバッファ
４ｂと、原文の単語番号と原語の対応をとる原語バッフ
ァ４ｃと、翻訳文の単語番号と訳語の対応をとる訳語バ
ッファ４ｄと、入力文中の各原語に対応する訳語が、通
常の翻訳文の文頭から何語目であるかの情報を格納する
原語訳語対応情報バッファ４ｅと、出力形式整形手段１
４により整えられた原文と訳文の出力形式を格納する出
力形式バッファ４ｆとから構成されている。The structure of the storage means 4 will be described. The storage means 4 stores grammatical information such as the part of speech of a word and a translated word, and a word dictionary 4a in which level information is provided for each word heading. A translation level buffer 4b for storing the translation level designated by the user, an original language buffer 4c for associating the word number of the original sentence with the original language, and a translated word buffer 4d for associating the word number of the translated sentence with the translated word. An original language translation word correspondence information buffer 4e for storing information as to the number of the translation word corresponding to each source word in the input sentence from the beginning of the normal translation sentence;
4 and an output format buffer 4f for storing the output format of the original sentence and the translated sentence.

【００４８】次に、この訳振り機械翻訳装置の具体的ハ
ード構成を図２を用いて説明する。この訳振り機械翻訳
装置は、各部を制御するメインＣＰＵ１５と、記憶手段
４の構成要素としてのメインメモリ４Ａおよび辞書メモ
リ４ａと、出力手段３としてのＣＲＴ（陰極線管）また
はＬＣＤ（液晶ディスプレイ）からなる表示手段３Ａ
と、入力手段１としてのキーボード１Ａと、翻訳を行う
翻訳モジュール１６と、これら各部を接続するバスライ
ン１７とから構成されている。Next, a specific hardware configuration of the translation machine translation device will be described with reference to FIG. This translation machine translation apparatus includes a main CPU 15 for controlling each unit, a main memory 4A and a dictionary memory 4a as components of the storage unit 4, and a CRT (cathode ray tube) or LCD (liquid crystal display) as an output unit 3. Display means 3A
, A keyboard 1A as the input means 1, a translation module 16 for performing translation, and a bus line 17 for connecting these components.

【００４９】この翻訳モジュール１６は、図３に示すよ
うに、バスライン１７を介して入力されるソース言語
を、所定の翻訳プログラムに従って翻訳してターゲット
言語としてバスライン１７に出力する翻訳ＣＰＵ２と、
この翻訳ＣＰＵ２で実行される翻訳プログラムを格納す
る翻訳プログラムメモリ１６Ａと、入力されたソース言
語の原文を各単語毎に格納するバッファＡと、辞書メモ
リ４ａに格納されている辞書データを参照して得た各単
語の品詞、訳語などの情報を格納するバッファＢと、ソ
ース言語の構造解析木に関する情報を格納するバッファ
Ｃと、構造解析木から変換されたターゲット言語の構造
解析木を格納するバッファＤと、ターゲット言語の構造
解析木に適切な付属語、例えば日本語ならば助詞や助動
詞などを補充して、ターゲット言語の形として整えられ
た文章を格納するバッファＥとを具備している。As shown in FIG. 3, the translation module 16 translates a source language input via the bus line 17 according to a predetermined translation program and outputs the translated language to the bus line 17 as a target language.
A translation program memory 16A for storing a translation program executed by the translation CPU 2, a buffer A for storing the textual input source language for each word by referring to the dictionary data stored in the dictionary memory 4a A buffer B for storing information such as the part of speech and translation of each obtained word, a buffer C for storing information about the source language structure analysis tree, and a buffer for storing a target language structure analysis tree converted from the structure analysis tree D and a buffer E for storing a sentence prepared in the form of the target language by supplementing an appropriate auxiliary word to the structure analysis tree of the target language, for example, particles or auxiliary verbs in Japanese.

【００５０】ここで、機械翻訳について説明する。Here, machine translation will be described.

【００５１】図４に示すような、様々な解析レベルがあ
り、図４の左上においてソース言語がキーボード１Ａを
介して入力されると、レベルＬ１の辞書引き、レベルＬ
２の形態素解析、レベルＬ３の構文解析、・・・と、解
析が進められる。機械翻訳は、どのレベルの解析処理ま
で行うかによって、大きく次の２つに分けられる。その
１つは、レベルＬ６のソース言語およびターゲット言語
のどちらにも依存しない概念である中間言語まで解析
し、そこから、レベルＬ７の文脈生成、レベルＬ８の意
味生成、レベルＬ９の構文生成、レベルＬ１０の形態素
生成と生成を進めて、ターゲット言語を生成していくピ
ボット方式である。もう１つは、上述のレベルＬ２の形
態素解析、レベルＬ３の構文解析、レベルＬ４の意味解
析およびレベルＬ５の文脈解析のいずれかまで解析を行
ってソース言語の内部構造を得、次に、この得られたソ
ース言語の内部構造と同レベルのターゲット言語の内部
構造に変換した後、ターゲット言語を生成するトランス
ファー方式である。As shown in FIG. 4, there are various analysis levels. When the source language is input via the keyboard 1A at the upper left of FIG.
The morphological analysis of 2, the syntax analysis of level L3,... Machine translation is roughly divided into the following two types depending on the level of analysis processing to be performed. One of them analyzes the intermediate language, which is a concept independent of both the source language and the target language of level L6, and generates a context of level L7, a semantic generation of level L8, a syntactic generation of level L9, and a level L9. This is a pivot method in which morpheme generation and generation of L10 are performed to generate a target language. The other is to perform an analysis up to one of the morphological analysis at the level L2, the syntax analysis at the level L3, the semantic analysis at the level L4, and the context analysis at the level L5 to obtain the internal structure of the source language. This is a transfer method in which a target language is generated after conversion into an internal structure of a target language at the same level as the obtained internal structure of the source language.

【００５２】上記構成によって、以下、図１〜図９を用
いて本発明の翻訳動作を説明する。ここでは、下記の英
文を日本語に翻訳して、より精度の高い訳語を出力する
場合を例にとって説明する。Hereinafter, the translation operation of the present invention will be described with reference to FIGS. Here, an example will be described in which the following English sentence is translated into Japanese and a more accurate translated word is output.

【００５３】I will take her child to the zoo. まず、キーボード１Ａなどの入力手段１で読み込まれた
原文は、図５に示すようにバッファＡに格納される。こ
のバッファＡは辞書引きのために原文は１単語毎（英語
の場合は空白毎）に分離され、さらに、語尾処理などの
形態素処理のために各単語も１文字づつ分離して格納さ
れる。図５では、一行が１単語に相当している。翻訳プ
ログラムメモリ１６Ａに記憶された翻訳プログラムに基
づく翻訳ＣＰＵ２の制御にて、辞書引き・形態素解析手
段５における処理において、バッファＡに格納された原
文に従って辞書メモリ４ａの辞書を用いて各単語の訳語
や、その単語に対する品詞などの文法情報などが得ら
れ、バッファＢに格納される。例えば、その各情報の一
部である品詞情報は、図６に示すように格納される。こ
こで、“will”“her”“to”は多品詞語であるが、翻
訳プログラムのうちの構文解析手段８における処理にお
いて一意に決定される。このようにして、辞書引き・形
態素解析手段７による辞書引き、形態素解析において、
辞書メモリ４ａの辞書が引かれ、入力された文章が各形
態素列（単語列）に分割され、この各単語に対する品詞
などの文法情報および訳語が得られ、さらに時制・人称
・数などが解析される。First, the original text read by the input means 1 such as the keyboard 1A is stored in the buffer A as shown in FIG. In this buffer A, the original sentence is separated for each word (each space in English) for dictionary lookup, and each word is separated and stored one character at a time for morphological processing such as ending processing. In FIG. 5, one line corresponds to one word. Under the control of the translation CPU 2 based on the translation program stored in the translation program memory 16A, in the processing of the dictionary lookup / morphological analysis means 5, the translation of each word is performed using the dictionary of the dictionary memory 4a in accordance with the original sentence stored in the buffer A. And grammatical information such as the part of speech for the word is obtained and stored in the buffer B. For example, part-of-speech information, which is a part of each information, is stored as shown in FIG. Here, “will”, “her”, and “to” are many parts of speech, but are uniquely determined in the processing of the parsing unit 8 of the translation program. In this way, in dictionary lookup and morphological analysis by the dictionary lookup / morphological analysis means 7,
The dictionary in the dictionary memory 4a is consulted, the input sentence is divided into morpheme strings (word strings), grammatical information such as part of speech and translations for each word are obtained, and tense / person / number are analyzed. You.

【００５４】この構文解析手段８における解析処理にお
いては、辞書メモリ４ａに格納された辞書データと文法
規則に従って、単語列の各単語間の係り受け関係を示す
構造解析木が例えば図７に示すように決定され、バッフ
ァＣに格納される。In the analysis processing by the syntax analysis means 8, a structure analysis tree indicating a dependency relationship between each word in the word string according to the dictionary data stored in the dictionary memory 4a and the grammatical rules, for example, as shown in FIG. And stored in the buffer C.

【００５５】構造解析木の決定は、次のようにして決定
される。すなわち、辞書メモリ４ａの文法規則から文 → 主部、述部主部 → 名詞句述部 → 助動詞、動詞、名詞句、前置詞句名詞句 → 代名詞名詞句 → 名詞前置詞句→ 前置詞、限定詞、名詞の規則が得られる。この規則は、例えば、１つ目の規則
は“文とは、主部と述部からできている”ということを
表す。以下、この規則に従って構造解析木が決定され
る。このように、構文解析手段８による構文解析におい
ては、各形態素列の単語間の係り受けなどの、文章の構
造（構造解析木）が決定される。The structure analysis tree is determined as follows. In other words, from the grammar rules of the dictionary memory 4a, the sentence → main part, predicate main part → noun phrase predicate → auxiliary verb, verb, noun phrase, preposition phrase noun phrase → pronoun noun phrase → noun preposition phrase → preposition, determiner, noun Is obtained. This rule indicates, for example, that the first rule is “a sentence is made up of a main part and a predicate”. Hereinafter, a structural analysis tree is determined according to this rule. As described above, in the syntax analysis by the syntax analysis unit 8, the structure of the sentence (structure analysis tree) such as the dependency between the words in each morpheme string is determined.

【００５６】この翻訳プログラムのうち、構文変換手段
９における意味解析処理では、辞書メモリ４ａの木構造
変換規則を用いて、図７に示すように、入力された英文
に対する構文解析木の構造が決定された後、図８に示す
ように、日本文に対する構文解析木に変換される。この
ようにして得られた結果がバッファＤに格納される。こ
の構文解析木の変換の際、辞書メモリ４ａの単語辞書中
の意味情報を用いて、図７中の構文解析木の主要な語句
の間を意味的にチェックすることで、原語に対する適切
な訳語を選択する。In the translation program, in the semantic analysis processing in the syntax conversion means 9, the structure of the syntax analysis tree for the input English sentence is determined as shown in FIG. 7 using the tree structure conversion rules of the dictionary memory 4a. Then, as shown in FIG. 8, it is converted into a parse tree for Japanese sentences. The result obtained in this way is stored in the buffer D. At the time of the conversion of the parse tree, by using the semantic information in the word dictionary of the dictionary memory 4a, semantic checking is performed between the main words of the parse tree in FIG. Select

【００５７】本実施例の単語辞書では、下記の（表１）
に示すような意味情報が格納されている。In the word dictionary of this embodiment, the following (Table 1)
Is stored.

【００５８】[0058]

【表１】 [Table 1]

【００５９】この（表１）に示すように、“take”の品
詞は「動詞」であり、その訳語には「とる」「受けと
る」「捕らえる」など幾つかの意味がある。動詞に対す
る意味情報には訳語選択条件が格納されている。具体的
には、「動詞の目的語がそこに列挙した意味カテゴリの
名詞であれば、その訳語をとる」ことを表している。As shown in Table 1, the part of speech of "take" is a "verb", and its translation has several meanings such as "take", "receive", and "capture". Translation term selection conditions are stored in the semantic information for the verb. Specifically, it indicates that "if the object of the verb is a noun of the semantic category listed there, the translation is taken".

【００６０】また、（表１）に示すように、“child”
の品詞は「名詞」であり、名詞の意味情報には訳語に対
する意味カテゴリが格納されている。訳語「子供」の意
味カテゴリは「人」であることがわかる。As shown in (Table 1), "child"
Is a “noun”, and the meaning information of the noun stores a semantic category for the translated word. It can be seen that the meaning category of the translated word "child" is "person".

【００６１】本例文では、構文解析結果により、“tak
e”の目的語が“child”であることがわかる。“chil
d”の「子供」の訳語の意味カテゴリが「人」であるた
め、“take”の訳語選択条件に「人」が含まれる訳語を
探すと、「連れて行く」が見つかるので、この例文で
は、“take”と“child”の訳語は、それぞれ「連れて
行く」と「子供」であることがわかる。In this example sentence, “tak
It turns out that the object of "e" is "child".
Since the semantic category of the translation of "child" of "d" is "person", when looking for a translation that includes "person" in the translation selection condition of "take", "take along" is found. , "Take" and "child" are translated to "take" and "child", respectively.

【００６２】このように、構文変換手段９による意味解
析においては、複数の構文解析の結果から意味的に正し
いものとそうでないものが判別され、各単語に対する訳
語を選択する。As described above, in the semantic analysis by the syntactic conversion means 9, a semantically correct one and a non-semantically correct one are discriminated from a plurality of syntactic analysis results, and a translated word for each word is selected.

【００６３】この翻訳プログラムのうち、翻訳文生成手
段１０における文脈解析処理では、構文変換手段９で得
られた日本文字『私彼女子供動物園連れて行
く。』に適切な助詞『は』『の』『に』『を』や助動詞
がつけられて、図９に示すような日本語の形にして、バ
ッファＥに格納する。この日本文『私は、彼女の子供を
動物園に連れて行く。』は、翻訳モジュール１６から出
力され、メインメモリ４Ａに格納されると共に、出力手
段３としての表示装置３Ａに表示可能である。In the context analysis processing in the translated sentence generation means 10 of the translation program, the Japanese character “I, her, children, zoo, and the zoo” obtained by the syntax conversion means 9 are taken. The appropriate particle "wa", "no", "ni", "wo" or an auxiliary verb is added to "", and the result is stored in the buffer E in a Japanese form as shown in FIG. This Japanese sentence "I take her child to the zoo. Are output from the translation module 16 and stored in the main memory 4A, and can be displayed on the display device 3A as the output means 3.

【００６４】このように、翻訳文生成手段１０による文
脈解析においては、話題が理解され、省略や曖昧さが取
り去られる。As described above, in the context analysis by the translated sentence generation means 10, the topic is understood, and the omission and ambiguity are removed.

【００６５】このとき、翻訳文生成手段１０からの制御
で原語訳語対応記憶手段１１が起動され、上記入力文中
の各原語に対応する訳語が、通常の翻訳文の文頭から何
語目であるかが、訳語番号として原語訳語対応情報バッ
ファ４ｅに設定される。この後、出力形式整形手段１４
では、原語バッファ４ｃ、訳語バッファ４ｄおよび原語
訳語対応情報バッファ４ｅのデータを基に原語（単語）
と訳語の対応をとった文字列が出力形式バッファ４ｆに
格納され、これが出力手段３としての表示装置３Ａなど
に、必要な単語に訳語を付与した訳振り形式で出力され
る。At this time, the original language translation word correspondence storage means 11 is started under the control of the translation sentence generation means 10, and the number of the translation word corresponding to each of the original words in the input sentence from the beginning of the normal translation sentence is determined. Is set in the source language translation information buffer 4e as the translation word number. Thereafter, the output format shaping means 14
Then, based on the data in the original language buffer 4c, the translated word buffer 4d, and the original language translated word correspondence information buffer 4e, the original language (word)
Is stored in the output format buffer 4f, and is output to the display device 3A or the like as the output means 3 in a translation format in which the required words are provided with the translated words.

【００６６】次に、利用者の語学レベルに応じて訳語の
不要な単語には訳語を付与せず、利用者の望む最低限必
要な単語に訳語を付与する場合について、本実施例の動
作を図１のブロック図、（表２）〜（表４）を参照して
詳しく説明する。Next, the operation of the present embodiment will be described for the case where a translation is not given to a word that does not require a translation according to the language level of the user, and a translation is given to the minimum required word desired by the user. This will be described in detail with reference to the block diagrams of FIG. 1 (Tables 2 to 4).

【００６７】まず、以下の文が入力手段１に入力された
ものとする。First, it is assumed that the following sentence has been input to the input means 1.

【００６８】The earth contains a large number of m
etals which are useful to man. この入力文は、上述した本発明の訳振り機械翻訳の場合
と同様に、辞書引き・形態素解析、構文解析、構文変換
を経て、翻訳文生成処理が行われる。The earth contains a large number of m
This input sentence is subjected to a dictionary creation / morphological analysis, syntax analysis, and syntax conversion, and a translated sentence generation process is performed in the same manner as in the case of the translation machine translation of the present invention described above.

【００６９】このとき、翻訳文生成手段１０からの制御
で原語訳語対応記憶手段１１が起動され、上記入力文中
の各原語に対応する訳語が、通常の翻訳文の文頭から何
語目であるかが、訳語番号として原語訳語対応情報バッ
ファ４ｅに設定される。この原語訳語対応情報バッファ
４ｅの例を下記の（表２）に示す。ちなみに、それそれ
の番号に対応する訳語（訳語バッファ４ｄ）と原語（原
語バッファ４ｃ）の記憶内容を下記の（表３）および
（表４）にそれぞれ示している。At this time, the original language translation word correspondence storage means 11 is started under the control of the translation sentence generation means 10, and the number of the translation word corresponding to each of the original words in the input sentence from the beginning of the normal translation sentence is determined. Is set in the source language translation information buffer 4e as the translation word number. The following (Table 2) shows an example of the source language translation information buffer 4e. Incidentally, the stored contents of the translated word (translated word buffer 4d) and the original word (original word buffer 4c) corresponding to the respective numbers are shown in the following (Table 3) and (Table 4), respectively.

【００７０】[0070]

【表２】 [Table 2]

【００７１】[0071]

【表３】 [Table 3]

【００７２】[0072]

【表４】 [Table 4]

【００７３】この後、出力形式整形手段１４では、原語
バッファ４ｃ、訳語バッファ４ｄおよび原語訳語対応情
報バッファ４ｅのデータを基に原語（単語）と訳語の対
応をとった文字列が出力形式バッファ４ｆに格納され、
これが出力手段３としての表示装置３Ａなどにより、下
記に示すように出力形式整形結果として、訳語を入力文
と共に出力する。Thereafter, the output format shaping means 14 converts the character string corresponding to the original word (word) to the translated word on the basis of the data of the original word buffer 4c, the translated word buffer 4d and the translated word correspondence information buffer 4e. Stored in
The translated word is output together with the input sentence as an output format shaping result by the display device 3A or the like as the output means 3 as shown below.

【００７４】 The earth contains a large number of metals which are useful to man. その地球含む多数の有益であるこのようにして訳振りされる場合の訳語出力判定処理に
ついて、図１０のフローチャートに沿って、下記の（表
５）を参照して詳しく説明する。The translated word output judging process in the case where the translation is performed in this manner is described in accordance with the flowchart of FIG. (Table 5).

【００７５】単語辞書の辞書メモリ４ａは、難易度や頻
度や重要度などのレベル情報に基づき各見出しにレベル
付けがなされている。本実施例では「上級／中級／初級
／超初級」の４つのレベルを想定する。下記の（表５）
に上記入力文例における各単語の各見出し単位にレベル
付けされた基本辞書内容を模式的に示している。In the dictionary memory 4a of the word dictionary, each heading is assigned a level based on level information such as difficulty level, frequency level, and importance level. In this embodiment, four levels of “advanced / intermediate / beginner / extreme” are assumed. The following (Table 5)
2 schematically shows the basic dictionary contents leveled for each heading unit of each word in the input sentence example.

【００７６】[0076]

【表５】 [Table 5]

【００７７】この（表５）において、例えば、“are”
は基本的な単語なので「超初級」が付与されており、
「超初級」の単語は訳振り不要とする。利用者は、訳振
りレベル設定手段５を使って自らの学力レベルとしての
訳振りレベルを設定する。本実施例では、「上級／中級
／初級」の３つのレベルを想定する。指定されたレベル
情報は訳振りレベルバッファ４ｂに格納される。In this (Table 5), for example, “are”
Is a basic word, so "beginner" is given,
No translation is needed for the word "super beginner". The user uses the translation level setting means 5 to set the translation level as his / her own academic ability level. In this embodiment, three levels of “advanced / intermediate / beginner” are assumed. The designated level information is stored in the translation level buffer 4b.

【００７８】最初に、訳振りレベルが「中級」である場
合について説明する。先の入力文と同じ入力文が入力さ
れ、同様にして各原語に対する単語単位の訳語が得ら
れ、原語バッファ４ｃと、訳語バッファ４ｄと、原語訳
語対応情報バッファ４ｅとに格納された状態になってい
るものとする。First, the case where the translation level is “intermediate” will be described. The same input sentence as the previous input sentence is input, and a word-by-word translation for each source word is obtained in the same manner, and stored in the source word buffer 4c, the target word buffer 4d, and the source word translation word correspondence information buffer 4e. It is assumed that

【００７９】翻訳出力判定手段１２において、図１０に
示すように、まず、ステップｓ１で先頭単語“the”に
着目する。ステップｓ２で単語のレベル情報を取り出
す。各見出し単位にレベル付けされた基本辞書を模式的
に示した（表５）にあるように「超初級」であることが
わかる。As shown in FIG. 10, the translation output judging means 12 first focuses on the first word "the" in step s1. In step s2, word level information is extracted. As shown in Table 5 schematically showing the basic dictionaries leveled for each heading unit, it can be seen that the dictionary is "super elementary".

【００８０】今、訳振りレベルは「中級」でイディオム
を構成しないので、ステップｓ３およびステップｓ５で
はＮＯであり、ステップｓ９でＹＥＳとなる。ステップ
ｓ１０で単語“the”のレベル情報が「超初級」または
「初級」であるかどうかを判定する。単語“the”のレ
ベル情報は「超初級」であり、この「初級」以下の単語
である“the”の訳語は、ステップｓ１１で出力しな
い。さらに、ステップｓ１６に進んで、入力された文の
全ての単語の処理は終了したかどうかを判断する。全て
の単語の処理が終了していないならば、ステップｓ１７
で次の単語“earth”に着目する。ステップｓ２で単語
のレベル情報を取り出すと「初級」なので、先頭単語
“the”の場合と同様に、ステップｓ１１でやはり訳語
は出力しない。Since the translation level is "intermediate" and does not constitute an idiom, NO is obtained in steps s3 and s5, and YES is obtained in step s9. In step s10, it is determined whether the level information of the word "the" is "super elementary" or "elementary". The level information of the word “the” is “super elementary”, and the translated word of “the” which is a word below the “elementary” is not output in step s11. Further, the process proceeds to step s16, where it is determined whether or not the processing of all the words of the input sentence has been completed. If all words have not been processed, step s17
Attention is paid to the next word "earth". When the level information of the word is extracted in step s2, the word is "beginner". Therefore, similarly to the case of the head word "the", no translated word is output in step s11.

【００８１】同様にして、ステップｓ１７で次の単語
“contain”に着目し、ステップｓ２でレベル情報を取
り出すと「中級」であり、ステップｓ９を介してステッ
プｓ１０でこれは「中級」であると判断され、ステップ
ｓ１２で“contain”の訳語を出力する。Similarly, attention is paid to the next word “contain” in step s17, and when the level information is extracted in step s2, it is “intermediate”. In step s10 through step s9, this is “intermediate”. It is determined, and the translated word of "contain" is output in step s12.

【００８２】この実際の出力は、出力形式整形手段１４
によって、原文と訳語の出力の形式が整形されたものが
出力形式バッファ４ｆに格納され、これが出力手段３に
より表示や印字などの形式で出力される。この時点での
出力手段３の出力状態は以下のようになっている。This actual output is output to the output format shaping means 14.
The output format of the original and translated words is stored in the output format buffer 4f, and output by the output means 3 in a format such as display or printing. The output state of the output means 3 at this time is as follows.

【００８３】さらに、ステップｓ１７で次の単語“a”に着目し、ス
テップｓ２でレベル情報を取り出し、ステップｓ３で
“a”以降の句が、“a large number of”というイディ
オムかどうかを判定し、イディオムであれば、利用者が
指定した訳振りレベルやその単語のレベル情報に関係な
く訳語を出力する。この時点での出力手段３の出力状態
は以下のようになっている。[0083] Further, in step s17, attention is paid to the next word "a", level information is extracted in step s2, and in step s3, it is determined whether or not the phrase after "a" is an idiom of "a large number of". If there is, the translated word is output irrespective of the translation level specified by the user and the level information of the word. The output state of the output means 3 at this time is as follows.

【００８４】以下、同様に処理が進み、結局、中級レベルでは、次の
ような出力結果が得られる。[0084] Thereafter, the process proceeds in the same manner, and eventually, at the intermediate level, the following output result is obtained.

【００８５】 The earth contains a large number of metals which are useful to man. 含む多数の金属有益な次に、利用者が、訳振りレベルに「上級」を指定した場
合の処理について、上記した「中級」の場合との相違点
を説明する。[0086] Next, regarding the processing when the user designates the "advanced" translation level, the above-mentioned "intermediate" The differences from the above case will be described.

【００８６】単語“the”と“earth”は上記と同様の判
断で出力されないが、次の単語“contain”は「中級」
であるのに対し、利用者が指定した訳振りレベルは「上
級」なので、ステップＳ１４では『出力しない』と判断
される。同様に、レベル情報「中級」の単語“metal”
と“useful”が出力しないと判断され、結局、上級レベ
ルでは、“a large number of”のイディオムだけの訳
語が出力されて、出力手段３の出力状態は以下のように
なる。Although the words “the” and “earth” are not output in the same manner as described above, the next word “contain” is “intermediate”.
On the other hand, since the translation level specified by the user is "advanced", it is determined in step S14 that "do not output". Similarly, the word "metal" of the level information "intermediate"
And "useful" are not output, and eventually, at the advanced level, only the idiom of "a large number of" is output, and the output state of the output means 3 is as follows.

【００８７】 The earth contains a large number of metals which are useful to man. 多数のこのようにして、利用者が指定する３つの訳振りレベル
を設定した時の出力形式整形結果を下記に示す。The output format shaping results when three translation levels specified by the user are set as described above are shown below.

【００８８】（初級レベルの例） The earth contains a large number of metals which are useful to man. 地球含む多数の金属有益である人（中級レベルの例）Ｔｈｅｅａｒｔｈｃｏｎｔａｉｎｓａｌａｒｇｅｎｕｍｂｅｒ
ｏｆｍｅｔａｌｓｗｈｉｃｈａｒｅｕｓｅｆｕｌｔｏｍａｎ．含む多数の金属有益な（上級レベルの例）Ｔｈｅｅａｒｔｈｃｏｎｔａｉｎｓａｌａｒｇｅｎｕｍｂｅｒ
ｏｆｍｅｔａｌｓｗｈｉｃｈａｒｅｕｓｅｆｕｌｔｏｍａｎ．多数のここで、単語辞書に格納しておく各単語のレベル情報の
付与基準の各具体例１〜４について説明する。このレベ
ル情報の付与基準を用いて、訳語出力判定手段１２は訳
語を出力するかどうかの判定をする。(Early Level Example) The earth contains a large number of metals which are useful to man. The earth including many metals is useful (an intermediate level example) The earth contains a large number
of metals which are used to man. Including many metals Informative (advanced level example) The earth contains a large number
of metals which are used to man. A large number of specific examples 1 to 4 of the criterion for assigning the level information of each word stored in the word dictionary will now be described. Using the reference of the level information, the translated word output determining unit 12 determines whether or not to output the translated word.

【００８９】（具体例１）単語辞書中の辞書メモリ４ａ
に付与する各単語のレベル情報を、複数の英和辞書のう
ち、幾つの辞書に重要度マークが付与されているかなど
に基づいて設定する。即ち、高い重要度マークが付与さ
れている辞書が多い単語は、それだけ基本的な単語であ
り、多くの人が訳や意味を知っているであろうというこ
とで、低い単語レベルを付与する。(Specific Example 1) Dictionary Memory 4a in Word Dictionary
Is set based on how many of the English-Japanese dictionaries have the importance mark added thereto. In other words, a word with many dictionaries to which a high importance mark is given is a basic word, and a low word level is given because many people will know the translation and meaning.

【００９０】このように、通常の辞書に付与された重要
度マークのうち、この重要度マークがどの程度の重要度
マークか、また、どの程度多くの辞書に付与されている
かに関する情報に基づいて、単語辞書中の辞書メモリ４
ａに付与する訳振りのレベル情報を設定する。As described above, of the importance marks assigned to the normal dictionary, based on the information on how many importance marks are assigned to the importance marks and how many dictionaries are assigned to the dictionaries. , Dictionary memory 4 in word dictionary
The level information of the translation given to a is set.

【００９１】（具体例２）単語辞書中の辞書メモリ４ａ
に付与する各単語のレベル情報を、見出しの文字構成に
よって決める。例えば、見出しの文字構成が１１文字以
上から構成される単語は、それだけ理解や記憶が困難な
単語であろうということで、高い単語レベルを付与す
る。例えば下記に示す２単語は１１文字以上の文字構成
であり、これら２単語には、訳語を付与する。(Specific Example 2) Dictionary Memory 4a in Word Dictionary
The level information of each word to be assigned to is determined by the character configuration of the heading. For example, a word whose heading is composed of 11 or more characters is given a high word level because it is a word that is difficult to understand and memorize. For example, the following two words have a character configuration of 11 or more characters, and a translation is added to these two words.

【００９２】supernatural 超自然の supercalender 紙に光沢をつけるまた、１１文字以上の文字構成の単語であっても、語構
成的にみれば、接頭辞や接尾辞などが連接して長くなっ
た単語であって、かつそれら断片の意味か、または単語
全体としての意味が容易に推測できて構成できるものに
は単語レベルを低く設定する。例えば、上記した２単語
はいずれも”super”の接頭辞を持ち、それ以外の部分
は、”natural”と”calender”とも基本的な単語であ
るが、前者は、”natural”から意味が容易に推測する
ことができるのに対して、後者の”calender”は全く意
味が変わってしまう。したがって、前者の単語レベルは
低く設定して訳語を振らないようにし、また、後者の単
語レベルは高く設定して訳語を振るようにする。Supernatural Supercalender Supernatural paper is made glossy. Even if the word has a character composition of 11 characters or more, it is a word whose prefix or suffix is concatenated and lengthened in terms of word composition. In addition, the word level is set low for those which can easily be deduced from the meaning of the fragments or the meaning of the whole word. For example, both of the above two words have a prefix of “super”, and the other part is a basic word for both “natural” and “calender”. While the latter "calender" has a completely different meaning. Therefore, the former word level is set low so that no translation is given, and the latter word level is set high so that the translation is given.

【００９３】なお、イディオムについては、ほとんどの
イディオムが１１文字を越えてしまうので、この付与基
準を用いれば、イディオムの単語レベル（複合語レベ
ル）にはすべて高い単語レベルが設定されることにな
る。この場合には、例えば、容易なイディオムの単語レ
ベル（複合語レベル）には低い単語レベルを付与して設
定するようにしても良い。Note that most idioms have more than 11 characters, so if this criterion is used, the word level (complex word level) of the idiom will be set to a higher word level. . In this case, for example, the word level (compound word level) of the easy idiom may be set by giving a low word level.

【００９４】このように、入力文の各単語を構成する単
語文字数に関する情報に基づいて、単語辞書中の辞書メ
モリ４ａに付与する訳振りのレベル情報を設定する。こ
の単語文字数が一定の文字数を越えるかどうか、かつ各
単語の接頭辞および接尾辞などの各単語の構成により各
単語の意味が推測可能か否かに基づいてレベル情報を設
定する。As described above, the translation level information to be assigned to the dictionary memory 4a in the word dictionary is set based on the information on the number of words constituting each word of the input sentence. Level information is set based on whether or not the number of words exceeds a certain number of characters, and whether or not the meaning of each word can be inferred by the configuration of each word such as the prefix and suffix of each word.

【００９５】（具体例３）単語辞書中の辞書メモリ４ａ
に付与する各単語のレベル情報を、品詞の種類によって
決める。即ち、特定の品詞には一律に低い単語レベルを
付与する。例えば、助動詞はその数も少ないこともあっ
て、訳振り対象外の品詞とする。同様に、固有名詞はカ
タカナ訳語になることや、誤った訳語になることが多い
ため、訳語を付与しないようにする。(Specific Example 3) Dictionary Memory 4a in Word Dictionary
The level information of each word to be assigned to is determined according to the type of part of speech. That is, a specific word class is uniformly assigned a low word level. For example, the auxiliary verb is a part of speech that is not subject to translation because of its small number. Similarly, since proper nouns often become katakana translations or erroneous translations, no translation is given.

【００９６】このように、品詞の種類に関する情報に基
づいて、単語辞書中の辞書メモリ４ａに付与する訳振り
のレベル情報を設定する。つまり、特定品詞は訳振りレ
ベルを低く設定し、固有名詞および助動詞などは訳振り
対象から除く。As described above, the translation level information to be assigned to the dictionary memory 4a in the word dictionary is set based on the information on the type of part of speech. That is, the specific part of speech is set at a low translation level, and proper nouns and auxiliary verbs are excluded from the translation target.

【００９７】（具体例４）単語辞書中の辞書メモリ４ａ
に付与する各単語のレベル情報を、どの種類の辞書に登
録されていたものであるかによって決める。即ち、基本
辞書に登録されている単語は、基本的な単語であろうと
いうことで、単語レベルを低く設定する。一方、ユーザ
辞書および専門辞書などに登録されている単語は、単語
自体またはその訳語が特殊なものであろうということ
で、単語レベルを高く設定する。(Specific Example 4) Dictionary Memory 4a in Word Dictionary
The level information of each word to be assigned to is determined according to which type of dictionary has been registered. That is, the word registered in the basic dictionary is likely to be a basic word, and the word level is set low. On the other hand, a word registered in a user dictionary, a specialized dictionary, or the like is set to a high word level because the word itself or its translation may be special.

【００９８】このように、基本辞書、ユーザ辞書および
専門辞書など単語の辞書の種類に関する情報に基づい
て、単語辞書中の辞書メモリ４ａに付与する訳振りのレ
ベル情報を設定する。As described above, the translation level information to be assigned to the dictionary memory 4a in the word dictionary is set based on the information on the type of the word dictionary such as the basic dictionary, the user dictionary, and the specialized dictionary.

【００９９】次に、以上のような単語辞書中の辞書メモ
リ４ａに付与する予め決まったレベル情報ではなく、上
記した翻訳（訳振り）処理（構文解析処理、構文変換処
理および翻訳文生成処理）中に得られる言語情報に基づ
いて、訳語出力判定手段１２が、訳語を付与するか否か
を判定する付与基準の各具体例５〜７について説明す
る。Next, instead of the predetermined level information given to the dictionary memory 4a in the word dictionary as described above, the above-mentioned translation (translation) processing (syntax analysis processing, syntax conversion processing and translated sentence generation processing) Specific examples 5 to 7 of the assignment criteria for the translated word output determining means 12 to determine whether to assign a translated word based on the linguistic information obtained therein will be described.

【０１００】（具体例５）構文解析の結果として得られ
る原文の構文構造を利用することで、訳語出力判定手段
１２は訳語を付与するか否かを判定する出力判定を行
う。即ち、構文解析が終わると、図７に示すように、主
部に相当する名詞句、述部の主動詞などの文の骨格とな
る構造を得ることができる。この構文解析情報を利用す
ることで、たとえ単語レベルが低いものであっても、文
の中で重要な役割を果たす語句には訳語を付与する。特
に、初級レベルでは、主語や述語の役割を担うような単
語は、どのようなレベルの単語でも訳語を付与し、ま
た、逆に、上級レベルでは、修飾語（関係節中の単語な
どを含む）の単語には訳語を付与しない。(Specific Example 5) By using the syntax structure of the original sentence obtained as a result of the syntax analysis, the translated word output determining means 12 makes an output determination for determining whether or not to add a translated word. That is, after the syntactic analysis, as shown in FIG. 7, it is possible to obtain a structure serving as a skeleton of a sentence such as a noun phrase corresponding to the main part or a main verb of a predicate. By using this parsing information, even if the word level is low, a word that plays an important role in the sentence is given a translation. In particular, at the elementary level, words that play the role of subject or predicate are assigned a translation at any level, and conversely, at the advanced level, modifiers (including words in relative clauses, etc.) No translation is given to the word in parentheses).

【０１０１】下記に示す入力文例について具体的に説明
すると、「初級」レベルでは、単語レベルで判定するだ
けではなく、Ｓ（主語）、Ｖ（動詞）、Ｏ（目的語）な
どの文の構成要素にも訳語を付与している。また、単語
レベルだけで判定する「中級」レベルにおいては、文の
構成要素というだけでは訳語は付与されない。例えば、
文の構成要素（主語）である「友達」の訳語は、「初
級」レベルでは付与されていたが、「中級」レベルでは
付与されていない。さらに、逆に、「上級」レベルで
は、単語レベルだけで判定すると、付与されるはずの
「派生する」などの訳語も、修飾語句としての関係節の
中なので抑制されている。Explaining specifically the following input sentence example, at the "elementary" level, not only the judgment at the word level but also the structure of sentences such as S (subject), V (verb), O (object), etc. Translations are also given to the elements. At the “intermediate” level, which is determined only at the word level, no translated word is given only by the constituent elements of the sentence. For example,
The translation of “friend”, which is a component (subject) of the sentence, was given at the “beginner” level, but was not given at the “intermediate” level. Conversely, at the “advanced” level, if the determination is made only at the word level, translations such as “derived” that should be given are also suppressed because they are in the relative clauses as modifiers.

【０１０２】（初級レベル：単語レベルおよび文構成要素で判定する。） Her friends extolled Japan where technology are derived from the theory・友達絶賛した技術派生する理論（中級レベル：単語レベルで判定する。） Her friends extolled Japan where technology are derived from the theory・絶賛した技術派生する理論（上級レベル：関係節などの修飾語には訳語を付与しない。） Her friends extolled Japan where technology are derived from the theory・絶賛したこのように、翻訳出力判定手段１２は、単語レベルだけ
ではなく、構文解析処理中に得られる文の構文構造（文
構成要素や修飾語句など）に基づいて訳語を付与するか
否かを判定する。(Beginner level: Judgment at the word level and sentence components.) Her friends extolled Japan where technology is derived from the theory. Friends acclaimed technology Derived theory (Intermediate level: Judgment at the word level.) Her friends extolled Japan where technology is derived from the theory ・ acclaimed technology derived theory (advanced level: no translation is given to modifiers such as relative clauses) Her friends extolled Japan where technology is derived from the theory ・ acclaimed In addition, the translation output determination unit 12 determines whether or not to add a translation based on not only the word level but also the sentence syntax structure (sentence constituent element, modifier, etc.) obtained during the syntax analysis processing.

【０１０３】（具体例６）構文変換の際に行われる意味
処理を用いた訳語選択により最も適切な訳語が決まる。
この訳語の文字の種類を判断することで、出力判定を行
う。例えば、英単語の発音をそのままカタカナにしたよ
うな訳語は、英文解読にとって有用な情報ではないの
で、どのようなレベルであったとしても訳語を付与しな
い。(Specific Example 6) The most appropriate translated word is determined by selecting a translated word using a semantic process performed during syntax conversion.
The output is determined by determining the type of the translated word. For example, a translated word in which the pronunciation of an English word is converted into katakana as it is is not useful information for deciphering an English sentence, and therefore, no translated word is added at any level.

【０１０４】下記に示す入力文例について具体的に説明
する。An example of an input sentence shown below will be specifically described.

【０１０５】（カタカナ訳語抑制前） The text describes an mechanism and programmig model for new software. テキスト記述するメカニズムプログラミングモデルソフトウェア（カタカナ訳語抑制後） The text describes an mechanism and programmig model for new software. 記述する上記したカタカナ訳語抑制後の訳振り結果においては、
すっきりと見やすくなっていることがわかる。(Before Katakana translation is suppressed) The text describes an mechanism and programmig model for new software. The text describes an mechanism and programmig model for new software. In the translation results after Katakana translation suppression,
It turns out that it is easy to see clearly.

【０１０６】このように、翻訳出力判定手段１２は、単
語レベルだけではなく、構文変換処理中に得られる訳語
の文字種構成に基づいて訳語を付与するか否かを判定す
る。As described above, the translation output determining means 12 determines whether or not to add a translation based on the character type configuration of the translation obtained during the syntax conversion processing, not only at the word level.

【０１０７】（具体例７）同じ単語であっても、分野によっては訳語が不要になる
場合がある。実用化レベルの機械翻訳システムとして、
翻訳する文書の分野を入力手段１を介して分野指定手段
６で利用者に指定させる。これは、訳語選択を適切に行
うためであるが、この分野情報を翻訳出力判定手段１２
で行う訳語付与の判定にも利用する。即ち、翻訳出力判
定手段１２は、分野指定された訳語（特殊な訳語）が選
択された場合には、単語のレベル情報が低くても訳語を
付与するように判断する。(Specific Example 7) Even for the same word, a translated word may not be necessary depending on the field. As a practical machine translation system,
The user specifies the field of the document to be translated by the field specifying means 6 via the input means 1. This is for performing translation proper selection, translation output determining unit this field information 12
It is also used for the determination of translation word assignment performed in. That is, when a translated word (special translated word) specified in the field is selected, the translation output determining means 12 determines that the translated word is added even if the level information of the word is low.

【０１０８】例えば、下記の例文の１つ目の出力判断結
果では、”attack”は単語レベルが低いので、訳語が付
与されないが、２つ目の出力判断結果では、「医学」分
野が指定された場合に、選択される訳語「発病」が訳振
りされる。For example, in the first output judgment result of the following example sentence, no translation is given because “attack” has a low word level, but the “medical” field is specified in the second output judgment result. In this case, the selected word "illness" is translated.

【０１０９】 She had an attack of fever. She had an attack of fever. 発病このように、翻訳出力判定手段１２は、構文変換処理中
に得られる訳語が分野指定されたものであるか否かの言
語情報に基づいて訳語を付与するか否かを判定する。In this way, the translation output determining means 12 determines whether or not the translated word obtained during the syntax conversion processing is a field-specified word. It is determined whether or not to add a translation based on the information.

【０１１０】（具体例８）この具体例８では、翻訳出力判定手段１２が、翻訳出力
位置回数記憶手段１３からの情報に基づいて訳語を付与
するか否かを判定する付与基準について説明する。(Specific Example 8) In this specific example 8, a description will be given of an assignment criterion in which the translation output determination means 12 determines whether or not to add a translation word based on information from the translation output position count storage means 13.

【０１１１】訳語を付与する度に、その訳語とその表示
位置を訳語出力位置回数記憶手段１３にて保持してお
く。この訳語出力位置回数記憶手段１３からの情報によ
って、その訳語を前回に表示した位置からある一定以上
の距離が離れている場合にのみ訳語を付与できるように
する。Each time a translated word is added, the translated word and its display position are stored in the translated word output position count storage means 13. Based on the information from the translated word output position number storage unit 13, the translated word can be added only when the translated word is at least a certain distance from the previously displayed position.

【０１１２】また、ある訳語の付与回数が文書のある範
囲内である基準を越えれば、以降はその訳語は付与しな
いようにする。また、全単語（訳語）を対象に直前に付
与した位置からの距離で訳語を付与するか否かを決める
ことも可能である。If the number of times a given translation is given exceeds a certain standard within a certain range of the document, the translation is not added thereafter. It is also possible to determine whether or not to assign a translation to all words (translations) based on the distance from the position immediately before the translation.

【０１１３】下記の例文の１つ目の訳振り結果（本規則
追加前）では、critical temperature（臨界温度）、ca
rbon（炭素）、heat（加熱する）、temperature（温
度）などの複数回出現する語句の訳も毎回、訳振りされ
ているが、その例文の２つ目の訳振り結果（本規則追加
後）では、２度目以降に出現する訳語が抑制されている
場合を示しており、すっきり見やすくなっていることが
わかる。In the first translation result of the following example sentence (before this rule is added), the critical temperature (critical temperature), ca
Words that appear multiple times, such as rbon (carbon), heat (heat), and temperature (temperature), are also translated each time, but the second translation result of the example sentence (after adding this rule) The figure shows a case where the translated words appearing after the second time are suppressed, and it can be seen that the translation is easy to see clearly.

【０１１４】（本規則追加前） In the first place,steel which contains very little carbon will be まず第一に含む炭素 milder than steel. Secondly,we can heat the carbon above a certain 穏やかな第二に加熱する炭素 critical temperature. At this critical temperature,changes begin 臨界温度臨界温度 to take place in the molecular structure. In the process known as 分子の構造 annealing,we heat the carbon above the critical temperature. We heat it 焼なまし加熱する炭素臨界温度加熱する up beyond the critical temperature,and then quench it in other liquid. 臨界温度抑制する液体 The temperature drop fixes the change in the carbon which occurred 温度炭素発生する at the critical temperature. But a bar of this hardened carbon is more 臨界温度堅くなった炭素 liable to fracture than normal carbon. We therefore heat it again to a 傾向がある破砕正常な炭素加熱する temperature below the critical temperature. 温度臨界温度（本規則追加後） In the first place,steel which contains very little carbon will be まず第一に含む炭素 milder than steel. Secondly,we can heat the carbon above a certain 穏やかな第二に加熱する critical temperature. At this critical temperature,changes begin 臨界温度 to take place in the molecular structure. In the process known as 分子の構造 annealing,we heat the carbon above the critical temperature. We heat it 焼なまし up beyond the critical temperature,and then quench it in other liquid. 抑制する液体 The temperature drop fixes the change in the carbon which occurred 温度発生する at the critical temperature. But a bar of this hardened carbon is more 堅くなった liable to fracture than normal carbon. We therefore heat it again to a 傾向がある破砕正常な temperature below the critical temperature. ここで、上記具体例１〜８を考慮した場合の訳語出力判
定手段１２の動作を図１１のフローチャートを用いて説
明する。(Before this rule is added) In the first place, steel which contains very little carbon will be firstly contained carbon milder than steel. Secondly, we can heat the carbon above a certain mildly secondarily heated carbon critical temperature.At this critical temperature, changes begin critical temperature to take place in the molecular structure.In the process known as molecular structure annealing, we heat the carbon above the critical temperature.We heat it Critical temperature heating up beyond the critical temperature, and then quench it in other liquid.The temperature drop fixes the change in the carbon which occurred temperature carbon occurs at the critical temperature.But a bar of this hardened carbon is more critical temperature hardened carbon liable to fracture than normal carbon.We therefore heat it again to a prone to fracture normal carbon heating temperature bel In the first place, steel which contains very little carbon will be carbon in the first place, mildly than steel.Secondly, we can heat the carbon above a certain mild ow the critical temperature. Critical temperature.At this critical temperature, changes begin Critical temperature to take place in the molecular structure.In the process known as Molecular structure annealing, we heat the carbon above the critical temperature.We heat it beyond the critical temperature, and then quench it in other liquid.The temperature drop fixes the change in the carbon which occurred temperature occurs at the critical temperature.But a bar of this hardened carbon is more liable to fracture than normal carbon. We therefore heat it again to a tendency of crushing Normal temperature below the critical temperature. The operation of the power determining means 12 will be described with reference to the flowchart of FIG.

【０１１５】図１１に示すように、まず、ステップＳ１
で先頭単語に着目する。次に、ステップＳ２で着目した
単語のレベル情報、品詞および訳語を取り出す。さら
に、ステップＡ１で着目した単語の品詞が助動詞または
固有名詞かどうかを判定し、ＹＥＳの場合、即ち、着目
した単語の品詞が助動詞または固有名詞の場合、ステッ
プＡ２で訳語は出力しない。ステップＳ１６で全ての単
語が着目されたかどうかを判定し、全ての単語が着目さ
れていないならば、ステップＳ１７で次の単語に着目
し、ステップＳ２を介してステップＡ１に進む。As shown in FIG. 11, first, in step S1
Focus on the first word. Next, in step S2, the level information, the part of speech, and the translated word of the focused word are extracted. Further, it is determined whether the part of speech of the focused word is an auxiliary verb or a proper noun in step A1, and if YES, that is, if the focused part of speech of the word is an auxiliary verb or a proper noun, no translated word is output in step A2. In step S16, it is determined whether or not all the words have been noticed. If not all the words have been noticed, the next word is noticed in step S17, and the process proceeds to step A1 via step S2.

【０１１６】さらに、ステップＡ１で着目した単語の品
詞が助動詞または固有名詞ではない場合、ステップＡ３
で着目している単語がユーザ辞書か専門辞書または、そ
の訳語は指定分野の訳語かどうかを判定する。ＹＥＳの
場合、ステップＡ４でその訳語は出力する。さらに、ス
テップＳ１６で全ての単語が着目されたかどうかを判定
し、全ての単語が着目されていないならば、ステップＳ
１７で次の単語に着目し、ステップＳ２を介してステッ
プＡ１に進む。Further, if the part of speech of the word focused in step A1 is not an auxiliary verb or proper noun, step A3
Then, it is determined whether the word of interest is a user dictionary or a specialized dictionary, or whether the translated word is a translated word in a specified field. If YES, the translated word is output in step A4. Further, it is determined in step S16 whether or not all the words have been noticed.
At 17, the next word is noted, and the process proceeds to step A1 via step S2.

【０１１７】このステップＡ１およびステップＡ３でＮ
Ｏの場合、ステップＡ５に進み、ステップＡ５で直前の
訳語付与単語から十分に離れているかどうかを判定す
る。直前の同一訳語付与単語から十分に離れていないな
らば、ステップＡ６でその訳語は出力しない。さらに、
ステップＳ１６で全ての単語が着目されたかどうかを判
定し、全ての単語が着目されていないならば、ステップ
Ｓ１７で次の単語に着目し、ステップＳ２を介してステ
ップＡ１に進む。At step A1 and step A3, N
In the case of O, the process proceeds to step A5, and in step A5, it is determined whether or not it is sufficiently distant from the immediately preceding translated word. If it is not sufficiently distant from the immediately preceding word with the same translation, the translation is not output in step A6. further,
In step S16, it is determined whether or not all the words have been noticed. If not all the words have been noticed, the next word is noticed in step S17, and the process proceeds to step A1 via step S2.

【０１１８】さらに、このステップＡ１およびステップ
Ａ３でＮＯ、ステップＡ５でＹＥＳの場合、ステップＳ
５に進み、ステップＳ５で訳振りレベルは「初級」かど
うかを判定する。訳振りレベルが「初級」の場合、ステ
ップＳ６でその着目している単語は、レベル情報が「超
初級」で、かつ文の修飾部かどうかを判定する。その単
語のレベル情報が「超初級」で、かつその単語は文の修
飾部である場合、ステップＳ７でその訳語は出力しな
い。また、その単語のレベル情報が「超初級」で、かつ
その単語が文の修飾部ではない場合、ステップＳ８でそ
の訳語は出力する。さらに、ステップＳ１６で全ての単
語が着目されたかどうかを判定し、全ての単語が着目さ
れていないならば、ステップＳ１７で次の単語に着目
し、ステップＳ２を介してステップＡ１に進む。Further, if NO in steps A1 and A3 and YES in step A5, step S
Then, in step S5, it is determined whether the translation level is "elementary". If the translation level is "elementary", it is determined in step S6 whether the word of interest has the level information "super elementary" and is a modified part of the sentence. If the level information of the word is “super elementary” and the word is a modifier of the sentence, the translated word is not output in step S7. If the level information of the word is “beginner” and the word is not a modification part of the sentence, the translated word is output in step S8. Further, it is determined whether or not all the words have been focused on in step S16. If not all the words have been focused on, the next word is focused on in step S17, and the process proceeds to step A1 via step S2.

【０１１９】ステップＡ１から同様にしてステップＳ７
に進んだ場合、ステップＳ７で着目した単語の訳語が、
カタカナだけで構成されているかどうかを判定する。そ
の訳語がカタカナだけで構成されている場合、ステップ
Ａ８でその訳語は出力しないようにする。さらに、ステ
ップＳ１６で全ての単語が着目されたかどうかを判定
し、全ての単語が着目されていないならば、ステップＳ
１７で次の単語に着目し、ステップＳ２を介してステッ
プＡ１に進む。From step A1 to step S7
When the process proceeds to step S7, the translation of the word of interest in step S7 is
It is determined whether or not it is composed only of katakana. If the translated word is composed only of katakana, the translated word is not output in step A8. Further, it is determined in step S16 whether or not all the words have been noticed.
At 17, the next word is noted, and the process proceeds to step A1 via step S2.

【０１２０】さらに、ステップＡ１から同様にしてステ
ップＳ７に進み、ステップＳ７で訳語がカタカナではな
くＮＯでステップＳ９に進んだ場合、ステップＳ９で訳
振りレベルは「中級」かどうかを判定する。訳振りレベ
ルが「中級」の場合、ステップＳ１０でその着目してい
る単語のレベル情報は、レベル情報が「超初級」または
「初級」かどうかを判定する。その単語のレベル情報が
「超初級」または「初級」の場合、ステップＳ１１でそ
の訳語は出力しないようにする。また、その単語のレベ
ル情報が「超初級」または「初級」ではない場合、ステ
ップＳ１２でその訳語は出力する。さらに、ステップＳ
１６で全ての単語が着目されたかどうかを判定し、全て
の単語が着目されていないならば、ステップＳ１７で次
の単語に着目し、ステップＳ２を介してステップＡ１に
進む。Further, the process proceeds from Step A1 to Step S7 in the same manner. If the translated word is not Katakana but NO and the process proceeds to Step S9, it is determined in Step S9 whether the translation level is "intermediate". If the translation level is "intermediate", it is determined in step S10 whether the level information of the word of interest is "super elementary" or "elementary". If the level information of the word is "beginner" or "beginner", the translated word is not output in step S11. If the level information of the word is not "beginner" or "beginner", the translated word is output in step S12. Further, step S
At 16, it is determined whether or not all the words have been noticed. If not all the words have been noticed, the next word is noticed at step S 17, and the process proceeds to step A 1 via step S 2.

【０１２１】さらに、ステップＡ１から同様にしてステ
ップＳ９に進み、ステップＳ９で訳振りレベルが「中
級」ではなくＮＯでステップＳ１３に進んだ場合、ステ
ップＳ１３でその注目している単語は、そのレベル情報
が「上級」で、かつその単語が文のＳＶＯ部かどうかを
判定する。その注目している単語のレベル情報が「上
級」で、かつその単語が文のＳＶＯ部の場合、ステップ
Ｓ１５でその訳語は出力する。さらに、ステップＳ１６
で全ての単語が着目されたかどうかを判定し、全ての単
語が着目されていないならば、ステップＳ１７で次の単
語に着目し、ステップＳ２を介してステップＡ１に進
む。Further, the process proceeds from step A1 to step S9 in the same manner. If the translation level is not “intermediate” in step S9 and the process proceeds to step S13 with NO, the word of interest in step S13 is the level of the word. It is determined whether the information is “advanced” and the word is the SVO part of the sentence. If the level information of the focused word is “advanced” and the word is the SVO part of the sentence, the translated word is output in step S15. Further, step S16
It is determined whether or not all the words have been noticed. If not all the words have been noticed, the next word is noticed in step S17, and the process proceeds to step A1 via step S2.

【０１２２】さらに、ステップＡ１から同様にしてステ
ップＳ１３に進み、ステップＳ１３でその単語のレベル
情報が「上級」で、かつその単語が文のＳＶＯ部ではな
くＮＯの場合、ステップＳ１４で訳語は出力せず、ステ
ップＳ１６で全ての単語が着目されたかどうかを判定
し、全ての単語が着目されていれば、訳振り処理を終了
する。Further, similarly from step A1, the process proceeds to step S13. If the level information of the word is "advanced" in step S13 and the word is not the SVO part of the sentence but NO, the translated word is output in step S14. Instead, it is determined in step S16 whether all the words have been noticed. If all the words have been noticed, the translation process ends.

【０１２３】このように、訳語を付与するか否かの判断
がきめ細かくなされて、利用者のレベルに応じて訳語の
不要な単語には訳語がより正確に抑制されるので、入力
文をより効率良く速読または粗読することができる。As described above, the determination as to whether or not to add a translated word is made finely, and the translated word is more accurately suppressed for a word that does not require a translated word according to the level of the user. Can read fast or roughly.

【０１２４】以上のように、本発明においては、入力手
段１から入力された文を単語の形態素列に分割し、その
単語毎に単語辞書をひくとともに、その単語の時制・人
称・数などを解析する。この形態素列における単語間の
係り受けなど入力文章構造を構文解析手段８が決定し、
これを翻訳文に対する文章構造に構文変換手段９が変換
し、意味的な整合性を検査して単語に対する訳語を選択
する。さらに、翻訳文生成手段１０は、この訳語の並び
を決定し、活用変化を調整して翻訳文を生成する。この
ように、形態素解析、構文解析、構文変換さらに翻訳文
生成の翻訳処理を行った後に、出力形式整形手段１４は
入力文の各単語と訳語との対応をとって訳振りをする。
この訳振りは、訳振りレベル設定手段５で設定された訳
振りレベルと単語辞書における訳振りレベルとの関係に
より、訳語出力判定手段１２が訳語を付与するか否かを
判定する。これによって、外国語の入力文の速読や粗読
を効率よく容易に行うことができる。As described above, in the present invention, the sentence input from the input means 1 is divided into a morpheme sequence of words, and a word dictionary is used for each word, and the tense, personal name, number, etc. of the word are determined. To analyze. The syntax analysis unit 8 determines the input sentence structure such as the dependency between words in this morpheme sequence,
The syntax conversion means 9 converts this into a sentence structure for the translated sentence, checks semantic consistency, and selects a translated word for the word. Further, the translated sentence generating means 10 determines the arrangement of the translated words, adjusts the utilization change, and generates a translated sentence. As described above, after performing the morphological analysis, the syntax analysis, the syntax conversion, and the translation processing for generating the translated sentence, the output format shaping unit 14 performs translation by associating each word of the input sentence with the translated word.
In this translation, the translated word output determining unit 12 determines whether or not to provide a translated word, based on the relationship between the translated level set by the translated level setting unit 5 and the translated level in the word dictionary. This makes it possible to efficiently and easily read a foreign language input sentence quickly or roughly.

【０１２５】[0125]

【発明の効果】以上のように本発明によれば、形態素解
析、構文解析、構文変換さらに翻訳文生成の翻訳処理を
行った後に、入力文の各単語と訳語との対応をとって訳
振りをするため、より精度の高い訳語を出力することが
でき、利用者は入力文を正しく理解することができて、
外国語の入力文の速読や粗読を効率よく容易に行うこと
ができる。As described above, according to the present invention, after performing a morphological analysis, a syntax analysis, a syntax conversion, and a translation process for generating a translated sentence, each word of the input sentence is corresponded to the translated word and translated. , It is possible to output a translation with higher accuracy, and the user can correctly understand the input sentence,
Fast and rough reading of foreign language input sentences can be performed efficiently and easily.

【０１２６】また、単語の見出し毎にレベル情報が付与
されており、ユーザが訳振りレベルを指定し、その指定
された訳振りレベルと原語（単語）のレベル情報との関
係により訳語を付与するか否かを判定するため、利用者
のレベルに応じて訳語の不要な単語には訳語が付与され
ず、利用者は入力文を効率良く速読や粗読することがで
きる。[0126] Level information is provided for each word heading. The user designates a translation level, and a translation is given according to the relationship between the designated translation level and the level information of the original word (word). Since it is determined whether or not the word is unnecessary for the translation according to the level of the user, the user can efficiently read the input sentence quickly or roughly.

[Brief description of the drawings]

【図１】本発明の一実施例における訳振り機械翻訳装置
の基本構成を示すブロック図である。FIG. 1 is a block diagram showing a basic configuration of a translation machine translation apparatus according to an embodiment of the present invention.

【図２】図１の訳振り機械翻訳装置の具体的ハード構成
を示すブロック図である。FIG. 2 is a block diagram showing a specific hardware configuration of the translation machine translation device of FIG. 1;

【図３】図２の翻訳モジュール１６の詳細な構成を示す
ブロック図である。FIG. 3 is a block diagram showing a detailed configuration of a translation module 16 in FIG. 2;

【図４】機械翻訳の概念を模式的に示す図である。FIG. 4 is a diagram schematically illustrating the concept of machine translation.

【図５】図３のバッファＡの格納内容を模式的に示す図
である。FIG. 5 is a diagram schematically showing stored contents of a buffer A of FIG. 3;

【図６】図３のバッファＢの格納内容を模式的に示す図
である。FIG. 6 is a diagram schematically showing the contents stored in a buffer B of FIG. 3;

【図７】図３のバッファＣの格納内容を模式的に示す図
である。FIG. 7 is a diagram schematically showing stored contents of a buffer C of FIG. 3;

【図８】図３のバッファＤの格納内容を模式的に示す図
である。FIG. 8 is a diagram schematically showing the contents stored in a buffer D of FIG. 3;

【図９】図３のバッファＥの格納内容を模式的に示す図
である。FIG. 9 is a diagram schematically showing the contents stored in a buffer E of FIG. 3;

【図１０】図１の訳語出力判定手段１２の動作の一例を
示すフローチャート図である。FIG. 10 is a flowchart showing an example of the operation of the translated word output determining means 12 of FIG. 1;

【図１１】図１の訳語出力判定手段１２の動作の他の例
を示すフローチャート図である。FIG. 11 is a flowchart showing another example of the operation of the translated word output determination means 12 of FIG. 1;

【図１２】従来の翻訳支援装置の構成を示すブロック図
である。FIG. 12 is a block diagram showing a configuration of a conventional translation support device.

【図１３】従来の機械翻訳装置の構成を示すブロック図
である。FIG. 13 is a block diagram illustrating a configuration of a conventional machine translation device.

[Explanation of symbols]

１入力手段１Ａキーボード２翻訳ＣＰＵ３出力手段３Ａ表示手段（表示装置）４記憶手段４Ａメインメモリ４ａ辞書メモリ４ｂ訳振りレベルバッファ４ｃ原語バッファ４ｄ訳語バッファ４ｅ原語訳語対応情報バッファ４ｆ出力形式バッファ５訳振りレベル設定手段６分野指定手段７辞書引き・形態素解析手段８構文解析手段９構文変換手段１０翻訳文生成手段１１原語訳語対応記憶手段１２訳語出力判定手段（翻訳出力判定手段）１３訳語出力位置回数記憶手段１４出力形式整形手段１５メインＣＰＵ１６翻訳モジュール１６Ａ翻訳プログラムメモリＡ〜Ｅバッファ Reference Signs List 1 input means 1A keyboard 2 translation CPU 3 output means 3A display means (display device) 4 storage means 4A main memory 4a dictionary memory 4b translation level buffer 4c original word buffer 4d translation word buffer 4e original language translation word correspondence information buffer 4f output format buffer 5 translation Assignment level setting means 6 Field designation means 7 Dictionary lookup / morphological analysis means 8 Syntax analysis means 9 Syntax conversion means 10 Translation sentence generation means 11 Original language translation word correspondence storage means 12 Translation word output determination means (translation output determination means) 13 Number of translation word output positions Storage means 14 output format shaping means 15 main CPU 16 translation module 16A translation program memory AE buffer

───────────────────────────────────────────────────── フロントページの続き (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06F 17/27 - 17/28 ──────────────────────────────────────────────────続き Continued on the front page (58) Field surveyed (Int.Cl. ⁷ , DB name) G06F 17/27-17/28

Claims

(57) [Claims]

(1) grammatical information such as a part of speech for a word;
The translations are stored, and a translation level is assigned to each translation.
A given word dictionary and an input sentence are divided into word strings.
When you get grammatical information such as part of speech for each word and translated words
Both are dictionary lookups for analyzing the tense, person, and number of each word.
The morphological analysis means and the grammatical information obtained by the dictionary lookup / morphological analysis means
The input sentence structure such as the dependency relationship between the words in the word string.
And parsing means for determining the granulation, versus an input sentence structure determined by said syntax analysis means translation
Into a sentence structure, and check for semantic consistency.
Syntactic conversion means for selecting the translation for the word; and determining the order of the translations obtained by the syntax conversion means.
A translation sentence generating means for adjusting the usage change to generate a translated sentence.
A translation machine translation device provided with a user, wherein the user can determine the degree of difficulty, frequency or importance of each word, etc.
Specify the translation level set based on the level information of
Possible translation level setting means, and a translation level designated by the translation level setting means.
Translated words according to the translation level in the word dictionary
After performing translation processing up to the translation output determining means and the translation sentence generating means for determining whether or not to add
The translation of the input sentence
Correspondence between each word and its translation is used to define the translation as the input sentence.
Output source for original language translations that output both in an easy-to-view format
And the translation output determination means, wherein the translation level setting means
Specified translation level and translation in the word dictionary
Determine whether to add a translation word based on the relationship with the level
Along with the syntax structure of the sentence obtained during the parsing process,
The character type configuration of the translated word obtained during the conversion process;
Whether the translation obtained during the process is specified in the field
Or the distance between the positions where the
At least one of the pieces of information is linguistic information, and based on the linguistic information,
Determining whether or not to add a translated word based on the
That translation pretend machine translation apparatus.

2. The distance between the positions where the same translation is given is
Characterized in that the distance between the same translation word imparted before
The translation machine translation device according to claim 1 .

3. grammatical information such as part of speech for a word and
The translations are stored, and a translation level is assigned to each translation.
A given word dictionary and an input sentence are divided into word strings.
When you get grammatical information such as part of speech for each word and translated words
Both are dictionary lookups for analyzing the tense, person, and number of each word.
The morphological analysis means and the grammatical information obtained by the dictionary lookup / morphological analysis means
The input sentence structure such as the dependency relationship between the words in the word string.
And parsing means for determining the granulation, versus an input sentence structure determined by said syntax analysis means translation
Into a sentence structure, and check for semantic consistency.
Syntactic conversion means for selecting the translation for the word; and determining the order of the translations obtained by the syntax conversion means.
A translation sentence generating means for adjusting the usage change to generate a translated sentence.
A translation machine translation device provided with a user, wherein the user can determine the degree of difficulty, frequency or importance of each word, etc.
Specify the translation level set based on the level information of
Possible translation level setting means, and a translation level designated by the translation level setting means.
Translated words according to the translation level in the word dictionary
After performing translation processing up to the translation output determining means and the translation sentence generating means for determining whether or not to add
The translation of the input sentence
Correspondence between each word and its translation is used to define the translation as the input sentence.
Output source for original language translations that output both in an easy-to-view format
And the translation output determination means, wherein the translation level setting means
Specified translation level and translation in the word dictionary
Determine whether to add a translation word based on the relationship with the level
Along with the syntax of the sentence obtained during the parsing process.
Structure, the character type configuration of the translated word obtained during the conversion process,
The translation obtained during the conversion process is specified in the field
Or not, and the distance between the positions where the same translation is given,
At least one of the information is linguistic information, and the linguistic information
To determine whether or not to add a translation based on the
The number of times the same translation is obtained during the syntax conversion
If the specified standard is exceeded within the specified range,
Pretend machine translation apparatus mean that the feature that it does not grant one translation.

4. A translation level assigned to the word dictionary.
Is the first information about the importance mark given to the normal dictionary.
And a second number relating to the number of word characters constituting each word of the input sentence.
Information, third information on the type of part of speech, and dictionary types such as a basic dictionary, user dictionary, or specialized dictionary
Fourth information, is composed of at least one of information of, when setting the translation swing level based on the first information regarding,
Among several dictionaries, several dictionaries are assigned importance marks.
And how important the importance mark is
Set based on whether the chromatography click, when setting the translation swing level based on the second information,
Whether the number of word characters exceeds a certain number of characters, or
The structure of each word, such as the prefix and suffix of each word
When setting the translation level based on whether or not the meaning of each word can be estimated, and setting the translation level based on the third information,
Set the level lower for specific parts of speech,
When the auxiliary verb is excluded from the translation target and the translation level is set based on the fourth information,
Words registered in the basic dictionary lower the word level,
Words registered in the user dictionary or specialized dictionary
By setting the word level higher, a translation level to be given in the word dictionary is set.
4. The translation machine translation device according to claim 1, wherein: