JP2016045751A

JP2016045751A - Machine translation device, machine translation method, machine translation program, and recording medium

Info

Publication number: JP2016045751A
Application number: JP2014170082A
Authority: JP
Inventors: 貢三浦; Mitsugi Miura
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2014-08-25
Filing date: 2014-08-25
Publication date: 2016-04-04
Anticipated expiration: 2034-08-25
Also published as: JP6459300B2

Abstract

PROBLEM TO BE SOLVED: To provide a machine translation device that selects a field dictionary used for translation of a translation target sentence with high speed and precision, a machine translation method, a machine translation program, and a recording medium.SOLUTION: A machine translation device includes: a field correspondence information storage unit for storing field correspondence information showing correspondence between a translation condition and a field; a dictionary storage unit for storing a plurality of field dictionaries preserving translation in accordance with the field; and a dictionary determination unit for determining the field dictionary used for translation of the translation target sentence from the plurality of field dictionaries based on the correspondence between the translation condition that coincides with a predetermined unit description in the translation target sentence and the field.SELECTED DRAWING: Figure 1

Description

本発明は、翻訳対象文章の分野を推定する機械翻訳装置、機械翻訳方法、機械翻訳プログラムおよび記録媒体に関する。 The present invention relates to a machine translation device, a machine translation method, a machine translation program, and a recording medium that estimate the field of a translation target sentence.

ある特定の自然言語で書かれた文章を他の自然言語へ翻訳する際、人手による翻訳ではなく、コンピュータを用いて機械的に翻訳を行う機械翻訳が広く使用されている。 When translating a sentence written in a specific natural language into another natural language, machine translation, in which a computer is used to translate mechanically, is widely used instead of manual translation.

この機械翻訳の翻訳精度を向上させる方法の一つに、分野辞書を使用する方法がある。分野辞書は、翻訳対象となる文章が特定の分野について記載されている場合に、訳語を制御するための辞書データを蓄積したものである。たとえば、英語の「court」を日本語へ翻訳する場合、「コート」や「裁判所」など複数の訳語が該当するが、翻訳対象文章の分野が未知の場合には、それらの訳語の中でどれが適切かを決定することができない。しかし、翻訳対象となる文章が「法律」に関するものならば「裁判所」、「スポーツ」に関するものならば「コート」と訳すのが順当である。このような、分野による訳語の制御を行うための辞書データを蓄積したものが、分野辞書である。分野辞書を使用することで、翻訳対象の文章の訳語を分野に応じて適切に選択することが可能になる。 One method for improving the translation accuracy of machine translation is to use a field dictionary. The field dictionary is a dictionary in which dictionary data for controlling a translation word is stored when a sentence to be translated is described in a specific field. For example, when translating English “court” into Japanese, multiple translations such as “Court” and “Court” are applicable. Can't determine if is appropriate. However, it is appropriate to translate “court” if the sentence to be translated is related to “law” and “coat” if it is related to “sports”. A field dictionary is a dictionary in which dictionary data for controlling translations according to fields is stored. By using the field dictionary, it becomes possible to appropriately select the translation of the sentence to be translated according to the field.

使用する分野辞書を選択する方法については、あらかじめ翻訳対象文章の分野がわかっている場合には、人手で分野を設定して使用する分野辞書を選択する方法が単純である。一方、翻訳対象文章の分野が事前にわかっていない場合や翻訳対象文章が多く存在して人手による分野設定に手間がかかる場合には、機械翻訳時に自動的に分野を推定することができれば便利である。しかし、誤って分野を推定して、本来の翻訳対象文章の分野以外の分野辞書を選択してしまうと、一斉に誤った訳語へ翻訳し、著しく翻訳精度が低下してしまう。 Regarding the method of selecting a field dictionary to be used, if the field of the sentence to be translated is known in advance, a method of manually setting the field and selecting the field dictionary to be used is simple. On the other hand, if the field of the text to be translated is not known in advance or if there are many text to be translated and it takes time to manually set the field, it would be convenient if the field could be estimated automatically during machine translation. is there. However, if the field is mistakenly estimated and a field dictionary other than the field of the original text to be translated is selected, translation into the wrong translated word is performed all at once, resulting in a significant decrease in translation accuracy.

このような問題を解決するための方法で代表的なものに、ベクトル演算を用いて翻訳対象文章の分野を推定する方法がある。たとえば特許文献１には、翻訳対象文章の単語ごとの出現頻度を一つの次元の要素とする多次元特徴ベクトルと各分野の代表特徴ベクトルとの近似度を計算する方法が記載されている。２つのベクトルの近似度の計算には、一般的に、コサイン類似度が使用されることが多い。コサイン類似度とは、２つのベクトルのなす角が小さいほど類似度が高いとする方法である。ベクトルxとベクトルyのなす角θの余弦は、式（１）のように、xとyの内積をxとyの長さ（ノルム）で割ることによって得られる。

As a typical method for solving such a problem, there is a method for estimating a field of a translation target sentence by using a vector operation. For example, Patent Document 1 describes a method for calculating the degree of approximation between a multidimensional feature vector having a frequency of appearance of each word of a translation target sentence as a one-dimensional element and a representative feature vector in each field. In general, cosine similarity is often used for calculating the degree of approximation of two vectors. The cosine similarity is a method in which the similarity is higher as the angle between two vectors is smaller. The cosine of the angle θ formed by the vector x and the vector y can be obtained by dividing the inner product of x and y by the length (norm) of x and y as shown in Equation (1).

また、特許文献２では、翻訳対象文章に含まれる単語の各分野辞書での出現頻度情報を用いている。 Moreover, in patent document 2, the appearance frequency information in each field | area dictionary of the word contained in a translation object sentence is used.

特開2007-080221号公報JP 2007-080221 A 特開2001-005812号公報JP 2001-005812 A

しかし、特許文献１の方法では、特徴ベクトルの要素となる単語数の次元をもつベクトルの近似度を各分野について計算するため、計算量が非常に多く、翻訳速度が遅い。たとえば、近似度としてコサイン類似度を使用する場合、特徴ベクトルの要素となる単語数と分野数を掛けた回数の乗算が必要となる。さらに分野の数が増加したり特徴ベクトルの要素となる単語数が増加したりすると、計算量が増加してしまう。また、翻訳対象文章の単語数が少ない場合は、推定の精度が低下しやすい。 However, in the method of Patent Document 1, since the degree of approximation of a vector having the dimension of the number of words that is an element of the feature vector is calculated for each field, the calculation amount is very large and the translation speed is slow. For example, when the cosine similarity is used as the degree of approximation, it is necessary to multiply the number of words that are the elements of the feature vector by the number of fields. Furthermore, if the number of fields increases or the number of words that are elements of feature vectors increases, the amount of calculation increases. In addition, when the number of words in the translation target sentence is small, the estimation accuracy tends to decrease.

特許文献２の方法では、分野辞書ごとに翻訳対象文章に含まれる単語の出現頻度を求めているため、異なる訳語を持つ単語が異なる分野辞書に多く含まれている場合に推定の精度が低下しやすい。また、翻訳対象文章の単語数が少ない場合にはさらに推定の精度が低下しやすい。 In the method of Patent Literature 2, since the appearance frequency of words included in the translation target sentence is obtained for each field dictionary, the accuracy of estimation decreases when many words having different translation words are included in different field dictionaries. Cheap. Further, when the number of words in the translation target sentence is small, the estimation accuracy is likely to be further reduced.

本発明の目的は、翻訳対象文章の翻訳に使用する分野辞書の選択を、高速かつ精度良く行う機械翻訳装置、機械翻訳方法、機械翻訳プログラムおよび記録媒体を提供することにある。 An object of the present invention is to provide a machine translation apparatus, a machine translation method, a machine translation program, and a recording medium that perform selection of a field dictionary used for translation of a translation target sentence at high speed and high accuracy.

上述の問題を解決するために、本発明の機械翻訳装置は、翻訳条件と分野との対応関係を示す分野対応情報を記憶する分野対応情報記憶部と、前記分野に応じた訳語を保持する複数の分野辞書を記憶する辞書記憶部と、翻訳対象文章内の所定の単位記載と合致する前記翻訳条件と前記分野との前記対応関係に基づいて、複数の前記分野辞書から前記翻訳対象文章の翻訳に使用する前記分野辞書を決定する辞書決定部とを備えることを特徴とする。 In order to solve the above-described problem, the machine translation apparatus of the present invention includes a field correspondence information storage unit that stores field correspondence information indicating a correspondence relationship between a translation condition and a field, and a plurality of words corresponding to the field. A dictionary storage unit for storing the field dictionary, and a translation of the sentence to be translated from a plurality of the field dictionaries based on the correspondence relationship between the field and the translation condition that matches a predetermined unit description in the sentence to be translated And a dictionary determining unit for determining the field dictionary to be used.

また、本発明の機械翻訳装置は、訳語と分野との対応関係を示す分野対応情報を記憶する分野対応情報記憶部と、前記分野に応じた前記訳語を保持する複数の分野辞書を記憶する辞書記憶部と、翻訳条件と前記翻訳条件に対応する前記訳語とを含む前記翻訳規則を記憶する翻訳規則記憶部と、翻訳対象文章内の所定の単位記載と合致する前記翻訳条件に対応する前記訳語と前記分野との前記対応関係に基づいて、複数の前記分野辞書から前記翻訳対象文章の翻訳に使用する前記分野辞書を決定する辞書決定部とを備えることを特徴とする。 In addition, the machine translation device of the present invention includes a field correspondence information storage unit that stores field correspondence information indicating a correspondence relationship between a translated word and a field, and a dictionary that stores a plurality of field dictionaries that hold the translated words corresponding to the field. A translation rule storage unit for storing the translation rule including a storage unit, a translation condition and the translation corresponding to the translation condition, and the translation corresponding to the translation condition matching a predetermined unit description in the translation target sentence And a dictionary determination unit that determines the field dictionary to be used for translating the sentence to be translated from a plurality of the field dictionaries based on the correspondence relationship between the field and the field.

また、本発明の機械翻訳方法は、翻訳対象文章内の所定の単位記載と合致する翻訳条件と、前記翻訳条件と分野との対応関係を示す分野対応情報に基づいて、前記分野に応じた訳語を保持する複数の分野辞書から前記翻訳対象文章の翻訳に使用する前記分野辞書を決定することを特徴とする。 Further, the machine translation method of the present invention is based on translation conditions that match a predetermined unit description in the sentence to be translated, and field correspondence information that indicates a correspondence relationship between the translation conditions and the fields, and is translated according to the field. The field dictionary to be used for translation of the translation target sentence is determined from a plurality of field dictionaries holding

また、本発明の機械翻訳方法は、翻訳条件と前記翻訳条件に対応する訳語とを含む翻訳規則と、翻訳対象文章内の所定の単位記載と合致する前記翻訳条件に対応する前記訳語と分野との対応関係を示す分野対応情報とに基づいて、前記分野に応じた前記訳語を保持する複数の分野辞書から前記翻訳対象文章の翻訳に使用する前記分野辞書を決定することを特徴とする。 The machine translation method of the present invention includes a translation rule including a translation condition and a translation corresponding to the translation condition, and the translation and field corresponding to the translation condition matching a predetermined unit description in the translation target sentence. The field dictionary to be used for translation of the translation target sentence is determined from a plurality of field dictionaries holding the translated words corresponding to the field based on field correspondence information indicating the correspondence relationship between the field and the field.

また、本発明の機械翻訳プログラムは、コンピュータに、翻訳条件と分野との対応関係を示す分野対応情報を記憶する分野対応情報記憶機能と、前記分野に応じた訳語を保持する複数の分野辞書を記憶する辞書記憶機能と、翻訳対象文章内の所定の単位記載と合致する前記翻訳条件と前記分野との前記対応関係に基づいて、複数の前記分野辞書から前記翻訳対象文章の翻訳に使用する前記分野辞書を決定する辞書決定機能とを実現させる。 In addition, the machine translation program of the present invention includes a field correspondence information storage function for storing field correspondence information indicating a correspondence relationship between translation conditions and fields, and a plurality of field dictionaries holding translated words corresponding to the fields. Based on the correspondence relationship between the dictionary storage function for storing and the translation condition that matches a predetermined unit description in the translation target sentence and the field, the translation target sentence is used from a plurality of the field dictionaries. A dictionary determination function for determining a field dictionary is realized.

また、本発明の機械翻訳プログラムは、コンピュータに、訳語と分野との対応関係を示す分野対応情報を記憶する分野対応情報記憶機能と、前記分野に応じた前記訳語を保持する複数の分野辞書を記憶する辞書記憶機能と、翻訳条件と前記翻訳条件に対応する前記訳語とを含む前記翻訳規則を記憶する翻訳規則記憶機能と、翻訳対象文章内の所定の単位記載と合致する前記翻訳条件に対応する前記訳語と前記分野との前記対応関係に基づいて、複数の前記分野辞書から前記翻訳対象文章の翻訳に使用する前記分野辞書を決定する辞書決定機能とを実現させる。 In addition, the machine translation program of the present invention includes a field correspondence information storage function for storing field correspondence information indicating a correspondence relationship between a translated word and a field, and a plurality of field dictionaries for holding the translation words corresponding to the field. Corresponding to the translation rule storage function for storing the translation rule including the dictionary storage function to store, the translation rule including the translation condition and the translation corresponding to the translation condition, and the translation condition matching the predetermined unit description in the translation target sentence Based on the correspondence relationship between the translated word and the field, a dictionary determination function for determining the field dictionary to be used for translation of the translation target sentence from a plurality of field dictionaries is realized.

本発明の機械翻訳装置、機械翻訳方法、機械翻訳プログラムおよび記録媒体により、翻訳対象文章の翻訳に使用する分野辞書の選択を、高速かつ精度良く行うことが可能になる。 With the machine translation device, machine translation method, machine translation program, and recording medium of the present invention, it becomes possible to select a field dictionary used for translation of a translation target sentence at high speed and with high accuracy.

本発明の実施形態の機械翻訳装置の構成例を示す図である。It is a figure which shows the structural example of the machine translation apparatus of embodiment of this invention. 本発明の実施形態の動作例を示す図である。It is a figure which shows the operation example of embodiment of this invention. 本発明の実施形態の機械翻訳装置の他の構成例を示す図である。It is a figure which shows the other structural example of the machine translation apparatus of embodiment of this invention. 本発明の実施形態の他の動作例を示す図である。It is a figure which shows the other operation example of embodiment of this invention. 辞書記憶部のデータ例を示す図である。It is a figure which shows the example of data of a dictionary memory | storage part. 翻訳規則のデータ例を示す図である。It is a figure which shows the example of data of a translation rule. 分野対応情報のデータ例を示す図である。It is a figure which shows the example of data of field corresponding | compatible information. 本発明の第二の実施形態の機械翻訳装置の構成例を示す図である。It is a figure which shows the structural example of the machine translation apparatus of 2nd embodiment of this invention. 本発明の第二の実施形態の動作例を示す図である。It is a figure which shows the operation example of 2nd embodiment of this invention. 分野分類木の作成例を示す図である。It is a figure which shows the example of creation of a field classification tree. 本発明の第四の実施形態の分野対応情報のデータ例を示す図である。It is a figure which shows the example of data of the field corresponding | compatible information of 4th embodiment of this invention. 本発明の実施形態の機械翻訳装置の他の構成例を示す図である。It is a figure which shows the other structural example of the machine translation apparatus of embodiment of this invention. 本発明の実施形態の機械翻訳装置の他の構成例を示す図である。It is a figure which shows the other structural example of the machine translation apparatus of embodiment of this invention.

図１に本発明の実施形態の機械翻訳装置の構成例を示す。 FIG. 1 shows a configuration example of a machine translation apparatus according to an embodiment of the present invention.

機械翻訳装置１は、辞書記憶部１１、分野対応情報記憶部１２および辞書決定部１３から構成される。辞書記憶部１１は、複数の分野辞書を記憶した部分である。分野辞書は、翻訳対象となる文章が特定の分野について記載されている場合に、訳語を制御するための辞書データを蓄積したものである。分野対応情報記憶部１２は、翻訳規則の条件部（翻訳条件）と分野の対応関係を示す分野対応情報を記憶する部分である。辞書決定部１３は、翻訳対象文章２と分野対応情報記憶部１２の翻訳条件を比較し、翻訳対象文章２内で合致した翻訳条件と分野対応情報に基づいて、辞書記憶部１１の複数の分野辞書から、翻訳対象文章２の翻訳に使用する分野辞書を決定する部分である。 The machine translation device 1 includes a dictionary storage unit 11, a field correspondence information storage unit 12, and a dictionary determination unit 13. The dictionary storage unit 11 is a part that stores a plurality of field dictionaries. The field dictionary is a dictionary in which dictionary data for controlling a translation word is stored when a sentence to be translated is described in a specific field. The field correspondence information storage unit 12 is a part that stores field correspondence information indicating a correspondence relationship between a translation rule condition part (translation condition) and a field. The dictionary determination unit 13 compares the translation conditions in the translation target sentence 2 and the field correspondence information storage unit 12, and based on the matching translation conditions and field correspondence information in the translation target sentence 2, a plurality of fields in the dictionary storage unit 11. This is a part for determining a field dictionary to be used for translation of the translation target sentence 2 from the dictionary.

上記のように機械翻訳装置１を構成することによって、翻訳対象文章の翻訳に使用する分野辞書の選択を、高速かつ精度良く行うことが可能になる。 By configuring the machine translation apparatus 1 as described above, it becomes possible to select a field dictionary used for translation of a translation target sentence at high speed and with high accuracy.

次に、図１の構成における機械翻訳装置１の辞書決定部１３の動作の例を図２に示す。 Next, FIG. 2 shows an example of the operation of the dictionary determination unit 13 of the machine translation apparatus 1 in the configuration of FIG.

まず、翻訳対象文章２と分野対応情報記憶部１２に記憶されている翻訳条件を比較する（Ｓ１０１）。次に、翻訳対象文章２内で合致する翻訳条件と分野対応情報記憶部１２の分野対応情報に基づいて、翻訳対象文章２の翻訳に使用する分野辞書を決定する（Ｓ１０２）。 First, the translation conditions 2 and the translation conditions stored in the field correspondence information storage unit 12 are compared (S101). Next, a field dictionary to be used for translation of the translation target sentence 2 is determined based on the matching translation conditions in the translation target sentence 2 and the field correspondence information in the field correspondence information storage unit 12 (S102).

上記のように機械翻訳装置１を動作させることによって、翻訳対象文章の翻訳に使用する分野辞書の選択を、高速かつ精度良く行うことが可能になる。 By operating the machine translation apparatus 1 as described above, it is possible to select a field dictionary used for translation of a translation target sentence at high speed and with high accuracy.

また、図３に、本発明の実施形態の機械翻訳装置の他の構成例を示す。 FIG. 3 shows another configuration example of the machine translation apparatus according to the embodiment of the present invention.

図３では、図１に対して、翻訳規則記憶部１３を追加している。翻訳規則記憶部１３は、翻訳規則の条件（翻訳条件）とその翻訳条件に対応する訳語を含む翻訳規則を記憶する部分である。辞書記憶部１１は図１の説明と同様である。分野対応情報記憶部１２は、上述の翻訳条件と分野の対応関係ではなく、訳語と分野の対応関係を示す分野対応情報を記憶する部分である。辞書決定部１３は、翻訳対象文章２と翻訳規則記憶部１３の翻訳条件を比較し、翻訳対象文章２内で合致した翻訳条件に対応する訳語と分野対応情報に基づいて、辞書記憶部１１の分野辞書から、翻訳対象文章２の翻訳に使用する分野辞書を決定する部分である。 In FIG. 3, a translation rule storage unit 13 is added to FIG. The translation rule storage unit 13 is a part for storing a translation rule including a translation rule condition (translation condition) and a translation corresponding to the translation condition. The dictionary storage unit 11 is the same as described in FIG. The field correspondence information storage unit 12 is a part that stores field correspondence information indicating a correspondence relationship between a translation word and a field, instead of the above-described translation condition and field correspondence. The dictionary determination unit 13 compares the translation conditions of the translation target sentence 2 and the translation rule storage unit 13, and based on the translation word corresponding to the translation condition matched in the translation target sentence 2 and the field correspondence information, This is a part for determining a field dictionary to be used for translation of the translation target sentence 2 from the field dictionary.

次に、図３の構成における機械翻訳装置１の辞書決定部１３の動作の例を図４に示す。 Next, an example of the operation of the dictionary determination unit 13 of the machine translation apparatus 1 in the configuration of FIG. 3 is shown in FIG.

まず、翻訳対象文章２と翻訳規則記憶部１３に記憶されている翻訳条件を比較する（Ｓ２０１）。次に、翻訳対象文章２内で合致する翻訳条件に対応する訳語と分野対応情報記憶部１２の分野対応情報に基づいて、翻訳対象文章２の翻訳に使用する分野辞書を決定する（Ｓ２０２）。 First, the translation condition 2 and the translation conditions stored in the translation rule storage unit 13 are compared (S201). Next, a field dictionary to be used for translation of the translation target sentence 2 is determined based on the translation corresponding to the translation condition that matches in the translation target sentence 2 and the field correspondence information in the field correspondence information storage unit 12 (S202).

次に、本発明の第一の実施の形態について、具体例を用いて説明する。 Next, the first embodiment of the present invention will be described using a specific example.

まず、構成例は図１と同様である。 First, the configuration example is the same as in FIG.

辞書記憶部１１のデータ例を図５に示す。図５では、「法律」「テニス」「バスケット」の分野辞書それぞれに、２つ、２つ、３つの辞書データが保持されている。この分野辞書を用いると、たとえば、英語の「court」は、「法律」の分野辞書が選択された場合には「法廷」あるいは「裁判所」、「テニス」の分野辞書が選択された場合には「コート」と翻訳されることになる。 An example of data stored in the dictionary storage unit 11 is shown in FIG. In FIG. 5, two, two, and three dictionary data are held in each of the “law”, “tennis”, and “basket” field dictionaries. Using this field dictionary, for example, “court” in English means “court” or “court” or “tennis” field dictionary when “law” field dictionary is selected. "Coat" will be translated.

なお、分野辞書の保持形式については、図５では、分野ごとに辞書データ（原語訳語ペア）を保持しているが、辞書データ単位に分野情報が付与される形式の分野辞書を使用しても構わない。 As for the storage format of the field dictionary, in FIG. 5, the dictionary data (original word translation word pair) is stored for each field. However, a field dictionary in a format in which field information is given in units of dictionary data may be used. I do not care.

次に、分野対応情報記憶部１２の説明のために、まず、翻訳規則について説明する。翻訳規則のデータ例を図６に示す。翻訳規則は条件部（翻訳条件）とアクション部から構成され、翻訳対象文章に条件部に合致する部分があった場合に、アクション部の動作を行うことを意味している。図６の場合、２つの翻訳規則が保持されており、それぞれの条件部は「隣接（federal、court）」と「共起（cager、traveling）」である。本実施形態では、「隣接（federal、court）」は「federal」と「court」が翻訳対象文書内に隣接して存在している場合、「共起（cager、traveling）」は「cager」と「traveling」の両方が翻訳対象文書内に存在する場合を意味する。なお、「共起」の判断基準については種々の手法が考えられるため、その手法を適用することも可能である。たとえば、条件部で指定した単語の間の文字数や単語数が所定の数以内である場合、あるいは、指定した単語が翻訳対象文章内に同時にそれぞれ所定の数以上存在する場合を「共起」と判断することも可能である。図６の翻訳規則では、「隣接（federal、court）」を適用する場合、その部分の「federal」を「連邦」、「court」を「裁判所」と翻訳し、また、「共起（cager、traveling）」を適用する場合は、「traveling」を「トラベリング」と翻訳する。 Next, in order to explain the field correspondence information storage unit 12, first, translation rules will be described. FIG. 6 shows an example of translation rule data. The translation rule is composed of a condition part (translation condition) and an action part, and means that the action part operates when there is a part that matches the condition part in the sentence to be translated. In the case of FIG. 6, two translation rules are held, and each condition part is “adjacent (federal, court)” and “co-occurrence (cager, traveling)”. In this embodiment, “adjacent (federal, court)” is “caged” and “court” is “cager” when “federal” and “court” are adjacent in the translation target document. This means that both “traveling” are present in the translation target document. Since various methods can be considered for the determination criterion of “co-occurrence”, it is also possible to apply the method. For example, when the number of characters or the number of words between the words specified in the condition part is within a predetermined number, or when the specified words are simultaneously present in the translation target sentence at a predetermined number or more, “co-occurrence” It is also possible to judge. In the translation rule of FIG. 6, when “adjacent (federal, court)” is applied, “federal” in that part is translated as “federal”, “court” is translated as “court”, and “co-occurrence (cager, When traveling) is applied, “traveling” is translated as “traveling”.

分野対応情報記憶部１２の例を図７に示す。この例では、翻訳規則の条件部（翻訳条件）と分野を関連付けたデータを保持しており、「隣接（federal、court）」の翻訳条件には「法律」が、「共起（cager、traveling）」の翻訳条件には「バスケット」が関連付けられている。図７では、１つの翻訳条件に１つの分野を関連付けているが、１つの翻訳条件に対して複数の分野を関連付けることも可能である。 An example of the field correspondence information storage unit 12 is shown in FIG. In this example, data that associates the field with the condition part (translation condition) of the translation rule is stored, and the translation condition of “adjacent (federal, court)” is “law”, but “co-occurrence (cager, traveling) ) "Is associated with" basket ". In FIG. 7, one field is associated with one translation condition, but a plurality of fields may be associated with one translation condition.

次に、辞書決定部１３の動作について説明する。辞書決定部１３では、図２のとおり、まず、翻訳対象文章２と分野対応情報記憶部１２に記憶されている翻訳条件を比較する（Ｓ１０１）。次に、翻訳対象文章２内で合致する翻訳条件と分野対応情報記憶部１２の分野対応情報に基づいて、翻訳対象文章２の翻訳に使用する分野辞書を決定する（Ｓ１０２）。なお、図３のように、図１に対して翻訳規則記憶部１３を追加し、図２のＳ１０１において、翻訳対象文章２と翻訳規則記憶部１３に記憶されている翻訳条件を比較しても、同じ結果が得られる。 Next, the operation of the dictionary determination unit 13 will be described. As shown in FIG. 2, the dictionary determination unit 13 first compares the translation target sentence 2 with the translation conditions stored in the field correspondence information storage unit 12 (S101). Next, a field dictionary to be used for translation of the translation target sentence 2 is determined based on the matching translation conditions in the translation target sentence 2 and the field correspondence information in the field correspondence information storage unit 12 (S102). As shown in FIG. 3, the translation rule storage unit 13 is added to FIG. 1, and the translation conditions 2 and the translation conditions stored in the translation rule storage unit 13 are compared in S101 of FIG. The same result is obtained.

簡単のため、図７の分野対応情報のうち「隣接（federal、court）」の翻訳条件のみが翻訳対象文章２内で合致した場合を例に挙げる。このとき、図７の分野対応情報を参照すると、「隣接（federal、court）」の翻訳規則には「法律」分野が関連付けられている。そのため、翻訳対象文章の分野は「法律」であると推定することができる。このように、翻訳対象文章２内で合致した翻訳条件が関連付けられている分野を分野対応情報記憶部１２の分野対応情報で調べることにより、分野を推定することが可能になる。このようにして推定した分野の分野辞書を、翻訳対象文章２の翻訳に使用する分野辞書として決定する。 For the sake of simplicity, an example will be given in which only the translation condition of “adjacent (federal, court)” in the field correspondence information in FIG. At this time, referring to the field correspondence information in FIG. 7, the “law” field is associated with the translation rule of “adjacent (federal, court)”. Therefore, it can be estimated that the field of the translation target sentence is “law”. As described above, the field can be estimated by examining the field associated with the matching translation condition in the translation target sentence 2 with the field correspondence information in the field correspondence information storage unit 12. The field dictionary of the field estimated in this way is determined as the field dictionary used for translation of the translation target sentence 2.

複数の翻訳規則が合致したときの辞書決定方法としては、合致した翻訳条件に関連付けられている分野を分野対応情報記憶部１２で調べ、合致した翻訳条件の合致回数から各分野に対する適合度を計算し、適合度が大きい分野を推定分野とする方法が考えられる。適合度の計算には、合致した翻訳条件の合致回数に基づき、合致回数が大きくなると適合度も大きくなる関数であれば、どのような関数を使用しても良い。 As a method for determining a dictionary when a plurality of translation rules are matched, the field corresponding information storage unit 12 checks the field associated with the matched translation condition, and calculates the fitness for each field from the number of matching translation conditions matched. However, a method in which a field having a high fitness is used as an estimation field can be considered. Any function may be used for the calculation of the fitness as long as the fitness increases as the number of matches increases based on the number of matches of the matched translation conditions.

例として、各翻訳条件の合致回数に関する一次関数を適合度の計算に使用する場合について説明する。分野対応情報記憶部１２に登録されている翻訳条件の数をa、分野の数をbとする。そして、i番目の翻訳条件（1≦i≦a）とj番目の分野（1≦j≦b）の間の重み係数をm(i,j)（m(i,j)≧0）としてあらかじめ決めて、分野対応情報記憶部１２等に保持しておく。分野対応情報記憶部１２にて関連付けられていない翻訳条件と分野の間の重み係数は0とする。このとき、i番目の翻訳条件の合致回数をx_iとすると、適合度は、たとえば、式（２）のような各翻訳条件の合致回数に関する一次関数で計算することができる。

As an example, a case will be described in which a linear function related to the number of matches of each translation condition is used for calculation of the fitness. The number of translation conditions registered in the field correspondence information storage unit 12 is a, and the number of fields is b. The weight coefficient between the i-th translation condition (1 ≦ i ≦ a) and the j-th field (1 ≦ j ≦ b) is m (i, j) (m (i, j) ≧ 0) in advance. It is determined and stored in the field correspondence information storage unit 12 or the like. The weighting factor between the translation condition and the field that are not associated in the field correspondence information storage unit 12 is assumed to be zero. At this time, if the number of matches of the i-th translation condition is x _i , the fitness can be calculated by a linear function relating to the number of matches of each translation condition, for example, Equation (2).

この式（２）の関数を適合度の計算に使用する場合、乗算回数は、分野対応情報記憶部１２に登録されている翻訳条件の数と分野数との積（a×b回）となる。しかし、関連付けられていない翻訳条件と分野の間の重み係数は0のため、合致した翻訳条件に関連付けられている分野を分野対応情報記憶部１２で調べ、関連付けられていない分野についての乗算（重み係数が0の乗算）を行わないことができる。このようにすることで、乗算回数を減らすことができる。たとえば、図７の分野対応情報のように、各翻訳条件に対して関連付けられている分野が１つの場合、乗算回数は合致した翻訳条件の数（最大a回）となる。翻訳対象文章と分野辞書の特徴ベクトルのコサイン類似度を使用する方法では、特徴ベクトルの要素となる単語数（c）と分野数（b）を掛けた回数（c×b回）の乗算を行う。本実施形態の方法では、特徴ベクトルの要素となる単語数より分野対応情報の翻訳条件の数を小さくすることで、乗算回数を減らすことができる。また、分野対応情報の各翻訳条件に関連付けられている分野数が小さくなれば、さらに乗算回数を減らすことができる。 When the function of Equation (2) is used for calculating the fitness, the number of multiplications is the product of the number of translation conditions registered in the field correspondence information storage unit 12 and the number of fields (a × b times). . However, since the weighting coefficient between the unrelated translation condition and the field is 0, the field associated with the matched translation condition is checked in the field correspondence information storage unit 12, and multiplication (weight) for the unrelated field is performed. (Multiplication with a coefficient of 0) can not be performed. In this way, the number of multiplications can be reduced. For example, when there is one field associated with each translation condition as in the field correspondence information of FIG. 7, the number of multiplications is the number of matching translation conditions (a maximum). In the method using the cosine similarity of the feature vector of the translation target sentence and the field dictionary, the number of words (c) that are the elements of the feature vector is multiplied by the number of fields (b) (c × b times). . In the method according to the present embodiment, the number of multiplications can be reduced by making the number of field condition information translation conditions smaller than the number of words that are elements of feature vectors. In addition, if the number of fields associated with each translation condition of the field correspondence information is reduced, the number of multiplications can be further reduced.

さらに、式（２）において、各翻訳条件に関連付けられている分野の重み係数をすべて1とすることも可能である。このとき、適合度は、各分野に関連付けられた翻訳条件の合致回数の合計となる。このようにすると、分野ごとに、翻訳条件の合致回数を数えて合計するだけになり、乗算回数が０回になるため、特徴ベクトルのコサイン類似度を計算する方法よりも、大幅に計算量を減らすことができる。 Furthermore, in the formula (2), it is possible to set all the weighting factors in the field associated with each translation condition to 1. At this time, the fitness is the total number of matching of the translation conditions associated with each field. In this way, the number of matching of the translation conditions is counted and totaled for each field, and the number of multiplications is 0. Therefore, the calculation amount is significantly larger than the method of calculating the cosine similarity of the feature vector. Can be reduced.

たとえば、図７の分野対応情報の場合、翻訳条件「隣接（federal、court）」の「法律」分野、および、翻訳条件「共起（cager、traveling）」の「バスケット」分野の重み係数を1、それ以外の重み係数を0とする。そして、「隣接（federal、court）」の翻訳条件の合致回数が１０回、「共起（cager、traveling）」の翻訳条件の合致回数が３回であったとすると、「法律」分野の適合度は10、「バスケット」分野の適合度は3と計算される。その結果、適合度がより大きい「法律」を翻訳対象文章の分野として推定することになる。また、「隣接（federal、court）」の「法律」分野の重み係数を1、「共起（cager、traveling）」の「バスケット」分野の重み係数を5とすると、「法律」分野の適合度は10、「バスケット」分野の適合度は15となる。その結果、適合度がより大きい「法律」を翻訳対象文章の分野として推定することになる。 For example, in the case of the field correspondence information in FIG. 7, the weighting factors of the “law” field of the translation condition “federal, court” and the “basket” field of the translation condition “cager, traveling” are set to 1. The other weighting factors are set to 0. If the number of matches for the translation condition of “adjacent (federal, court)” is 10 and the number of matches for the translation condition of “cager, traveling” is 3, the conformity in the “law” field. Is calculated as 10, and the fitness of the “basket” field is calculated as 3. As a result, a “law” having a higher fitness is estimated as the field of the translation target text. Also, if the weight factor for the “law” field of “adjacent (court)” is 1, and the weight factor for the “basket” field of “cager, traveling” is 5, the fitness of the “law” field Is 10, and the fitness of the “basket” field is 15. As a result, a “law” having a higher fitness is estimated as the field of the translation target text.

なお、翻訳条件の合致回数は、各翻訳条件に合致すると判断した回数とする。本実施形態の場合、「隣接」を条件とする翻訳規則の場合の合致回数は、指定された単語が隣り合って存在した回数となる。「共起」を条件とする翻訳条件の場合の合致回数は、指定された単語の翻訳対象文書内での出現回数の合計となる。 The number of times that the translation condition is met is the number of times that it is determined that the translation condition is met. In the case of the present embodiment, the number of matches in the case of a translation rule with “adjacent” as a condition is the number of times that the specified word is present adjacent to each other. The number of matches in the case of the translation condition with “co-occurrence” as a condition is the total number of appearances of the specified word in the translation target document.

また、翻訳対象文章の推定分野を１つに絞るのではなく、複数の分野を同時に適用し、複数の分野辞書を同時に翻訳時に利用することも可能である。 Further, it is possible to apply a plurality of fields at the same time and use a plurality of field dictionaries at the same time for translation, instead of narrowing the estimated field of the translation target sentence to one.

以上で説明したように、本発明の第一の実施形態では、合致した翻訳条件に対応付けられた分野を分野対応情報で調べて数えるため、特徴ベクトルのコサイン類似度を計算する方法に比べて計算量を少なくでき、分野推定の時間を短縮することが可能になる。また、翻訳対象文章の単語数が少なくても条件が合致する翻訳条件があれば分野を推定することができる。そのため、翻訳対象文章の翻訳に使用する分野辞書の選択を、高速かつ精度良く行うことが可能になる。 As described above, in the first embodiment of the present invention, since the field associated with the matched translation condition is examined and counted by the field correspondence information, it is compared with the method of calculating the cosine similarity of the feature vector. The amount of calculation can be reduced, and the time for field estimation can be shortened. Moreover, even if the number of words in the translation target sentence is small, the field can be estimated if there is a translation condition that satisfies the condition. Therefore, it becomes possible to select the field dictionary used for translation of the translation target sentence at high speed and with high accuracy.

なお、決定された分野辞書の訳語と翻訳規則の訳語が異なる場合もあり得る。このような場合の競合関係の処理方法としては、合致回数が多い翻訳条件については翻訳規則の訳語を優先して翻訳する、あるいは、競合する規則を除いて再度分野推定を行う、などの方法が考えられる。しかし、競合解消は、本発明の中核技術ではなく、特に解消手段を確定させる必要はないため、その手法は確定させていない。 Note that there may be a case where the translation of the determined field dictionary is different from the translation of the translation rule. In such a case, as a method of processing the competitive relationship, there is a method of preferentially translating the translation rule translation word for a translation condition with a high number of matches, or performing field estimation again without the conflicting rule. Conceivable. However, conflict resolution is not the core technology of the present invention, and it is not necessary to determine the resolution means, and therefore the method is not finalized.

次に、本発明の第二の実施の形態について説明する。 Next, a second embodiment of the present invention will be described.

本発明の第二の実施形態の構成例を図８に示す。図１に対して、分野辞書管理部１５と翻訳規則記憶部１３を追加している。翻訳規則記憶部１３は、翻訳規則を記憶する部分である。分野辞書管理部１５は、翻訳規則記憶部１３の翻訳規則と辞書記憶部１１の分野辞書とから分野対応情報を生成し、分野対応情報記憶部１２に記憶させる部分である。 A configuration example of the second embodiment of the present invention is shown in FIG. A field dictionary management unit 15 and a translation rule storage unit 13 are added to FIG. The translation rule storage unit 13 is a part that stores translation rules. The field dictionary management unit 15 is a part that generates field correspondence information from the translation rules in the translation rule storage unit 13 and the field dictionary in the dictionary storage unit 11 and stores the field correspondence information in the field correspondence information storage unit 12.

次に、分野辞書管理部１５の動作の例について図９を用いて説明する。 Next, an example of the operation of the field dictionary management unit 15 will be described with reference to FIG.

まず、辞書記憶部１１の分野辞書から分野分類木を作成する（Ｓ３０１）。分野分類木は、辞書データ（原語訳語ペア）から分野を推定するための条件木である。分野分類木の作成例を図１０に示す。分野単位に保持された辞書データを、辞書データに対して分野を関連付ける形式に変換している。 First, a field classification tree is created from the field dictionary in the dictionary storage unit 11 (S301). The field classification tree is a condition tree for estimating a field from dictionary data (original word translation word pair). An example of creating a field classification tree is shown in FIG. The dictionary data held in the field unit is converted into a format in which the field is associated with the dictionary data.

次に、分野分類木から辞書データを一つずつ取り出し、辞書データが翻訳規則記憶部１４のアクション部にあるかどうかを確認する（Ｓ３０２〜３０３）。ある場合には、翻訳条件と分野を連結して、分野対応情報記憶部に登録する（Ｓ３０４〜Ｓ３０６）。これを未確認の翻訳規則がなくなるまで繰り返し（Ｓ３０７）、未確認の翻訳規則がなくなれば、新たな辞書データを一つ取り出す（Ｓ３０２）。そして、Ｓ３０２〜Ｓ３０７のステップを、未確認の辞書データがなくなるまで繰り返す（Ｓ３０８）。 Next, dictionary data is extracted one by one from the field classification tree, and it is confirmed whether the dictionary data is in the action part of the translation rule storage part 14 (S302 to 303). In some cases, the translation condition and the field are linked and registered in the field correspondence information storage unit (S304 to S306). This is repeated until there is no unconfirmed translation rule (S307). When there is no unconfirmed translation rule, one new dictionary data is extracted (S302). The steps of S302 to S307 are repeated until there is no unconfirmed dictionary data (S308).

図１０の分野分類木の場合、図６の翻訳規則記憶部１４の翻訳規則のアクション部に存在する辞書データが見つかるまで、分野分類木から辞書データを取り出していく。「court → 法廷、裁判所」が翻訳規則記憶部１４に存在するので、この翻訳規則「隣接（federal、court）」と分野「法律」を連結して分野対応情報記憶部１２に登録する。同様に、「traveling → トラベリング」が翻訳規則記憶部１４に存在するので、この翻訳規則「共起（cager、traveling）」と「バスケット」を連結して分野対応情報記憶部１２に登録する。このようにして、図７の分野対応情報を得ることができる。 In the case of the field classification tree of FIG. 10, the dictionary data is extracted from the field classification tree until dictionary data existing in the action part of the translation rule in the translation rule storage unit 14 of FIG. 6 is found. Since “court → courtroom, court” exists in the translation rule storage unit 14, the translation rule “adjacent (federal, court)” and the field “law” are connected and registered in the field correspondence information storage unit 12. Similarly, since “traveling → travelling” exists in the translation rule storage unit 14, the translation rules “cager, traveling” and “basket” are connected and registered in the field correspondence information storage unit 12. In this way, the field correspondence information of FIG. 7 can be obtained.

ここで、分野辞書管理部１５での分野対応情報記憶部１２の生成は、翻訳対象文書２が入力された時点で行っても良いが、翻訳対象文書２が入力される以前にあらかじめ生成してデータを保持しておくことで、分野推定にかかる時間を短くすることができる。 Here, the generation of the field correspondence information storage unit 12 in the field dictionary management unit 15 may be performed at the time when the translation target document 2 is input, but is generated in advance before the translation target document 2 is input. By holding the data, the time required for field estimation can be shortened.

以上で説明したように、分野辞書管理部１５では、翻訳規則記憶部１４の翻訳規則と辞書記憶部１１の分野辞書から、翻訳条件と分野を関連付けた分野対応情報を生成することができる。この分野辞書管理部１５で生成した分野対応情報を用いることで、第一の実施形態で説明した、辞書決定部１３での分野推定を行うことが可能になる。これにより、第一の実施形態と同様に、合致した翻訳条件に対応付けられた分野を分野対応情報で調べて数えるため、ベクトル演算を行う方法に比べて計算量を少なくでき、分野推定の時間を短縮することが可能になる。また、翻訳対象文章の単語数が少なくても合致する翻訳条件があれば分野を推定することができる。そのため、第二の実施形態においても、翻訳対象文章の翻訳に使用する分野辞書の選択を、高速かつ精度良く行うことが可能になる。 As described above, the field dictionary management unit 15 can generate field correspondence information in which a translation condition and a field are associated from the translation rule in the translation rule storage unit 14 and the field dictionary in the dictionary storage unit 11. By using the field correspondence information generated by the field dictionary management unit 15, it is possible to perform field estimation by the dictionary determination unit 13 described in the first embodiment. As a result, as in the first embodiment, the field associated with the matching translation condition is examined and counted in the field correspondence information, so that the amount of calculation can be reduced compared with the method of performing the vector operation, and the time for field estimation is reduced. Can be shortened. Moreover, even if the number of words in the translation target sentence is small, the field can be estimated if there is a matching translation condition. Therefore, also in the second embodiment, it becomes possible to select a field dictionary used for translation of a translation target sentence at high speed and with high accuracy.

次に、本発明の第三の実施形態について説明する。 Next, a third embodiment of the present invention will be described.

第一および第二の実施形態では、分野対応情報として、翻訳条件と分野を関連付けたデータ（たとえば図７）を使用したが、第三の実施形態では、辞書データ（原語訳語ペア）と分野を関連付けたデータを使用する。 In the first and second embodiments, the data (for example, FIG. 7) in which the translation condition is associated with the field is used as the field correspondence information. However, in the third embodiment, the dictionary data (original language word pair) and the field are Use the associated data.

第三の実施形態の分野対応情報は、第二の実施形態で説明した分野分類木の生成方法と同様に、辞書記憶部１１の分野辞書を辞書データに対して分野を関連付ける形式に変換することで生成することができる。たとえば、図１０のような分野分類木を分野対応情報として使用することができる。また、さらに、分野分類木の辞書データを、翻訳規則記憶部１４の翻訳規則のアクション部に存在するもののみに削減することも可能である。 The field correspondence information of the third embodiment is obtained by converting the field dictionary in the dictionary storage unit 11 into a format for associating the field with dictionary data in the same manner as the field classification tree generation method described in the second embodiment. Can be generated. For example, a field classification tree as shown in FIG. 10 can be used as field correspondence information. Furthermore, it is also possible to reduce the dictionary data of the field classification tree to only those existing in the action part of the translation rule in the translation rule storage unit 14.

次に、第三の実施形態における分野推定方法について説明する。第三の実施形態の分野推定方法では、第一および第二の実施形態で使用した翻訳条件の合致回数の代わりに、翻訳規則のアクション部で訳出する辞書データ（原語訳語ペア）の訳出回数を使用する。翻訳条件の合致回数の代わりに辞書データの訳出回数を使用する点以外は、第一および第二の実施形態と同様の方法で分野推定を行うことができる。辞書データの訳出回数は、ある辞書データに対応する翻訳条件の合致回数を合計することで得られる。仮に、ある辞書データに対応する翻訳条件が一つである場合は、辞書データの訳出回数と、その辞書データに対応する翻訳条件の合致回数は同じになる。 Next, the field estimation method in the third embodiment will be described. In the field estimation method of the third embodiment, the number of translations of the dictionary data (source language translation pair) translated in the action part of the translation rule is used instead of the number of matching of the translation conditions used in the first and second embodiments. use. Field estimation can be performed in the same manner as in the first and second embodiments, except that the number of dictionary data translations is used instead of the number of translation condition matches. The number of translations of dictionary data can be obtained by summing up the number of matches of translation conditions corresponding to certain dictionary data. If there is only one translation condition corresponding to a certain dictionary data, the number of translations of the dictionary data and the number of matches of the translation conditions corresponding to the dictionary data are the same.

式（２）で例に挙げた適合度についても、翻訳条件の代わりに辞書データを使用して計算することができる。つまり、分野対応情報記憶部１２に登録されている辞書データの数をa、分野の数をb、i番目の辞書データ（1≦i≦a）とj番目の分野（1≦j≦b）の間の重み係数をm(i,j)（m(i,j)≧0）、i番目の辞書データの訳出回数をx_iとして、適合度を計算することができる。 The degree of matching exemplified in Expression (2) can also be calculated using dictionary data instead of translation conditions. That is, the number of dictionary data registered in the field correspondence information storage unit 12 is a, the number of fields is b, the i-th dictionary data (1 ≦ i ≦ a) and the j-th field (1 ≦ j ≦ b). The degree of fitness can be calculated with m (i, j) (m (i, j) ≧ 0) as the weighting coefficient between and the number of translations of the i-th dictionary data as x _i .

たとえば、図６の翻訳規則のうち「隣接（federal、court）」の翻訳規則の条件部が翻訳対象文章２内で合致したとする。この翻訳規則のアクション部は「訳出（federal → 連邦）」「訳出（court → 裁判所）」となっている。このアクション部の辞書データを図１０の分野分類木で確認すると、「court → 裁判所」が「法律」と対応している。式（２）において、対応関係がある辞書データと分野の間の重み係数をたとえば1として適合度を計算すると、「法律」分野の適合度が1、それ以外の分野の適合度が0となる。その結果、「法律」分野を翻訳対象文章２の分野として推定することができる。 For example, it is assumed that the conditional part of the translation rule “adjacent (federal, court)” of the translation rules in FIG. The action part of this translation rule is “translation (federal → federal)” and “translation (court → court)”. When the dictionary data of this action part is confirmed by the field classification tree of FIG. 10, “court → court” corresponds to “law”. In Equation (2), when the fitness is calculated by setting the weighting coefficient between the dictionary data having a correspondence relationship and the field as 1, for example, the fitness of the “law” field is 1, and the fitness of the other fields is 0. . As a result, the “law” field can be estimated as the field of the translation target sentence 2.

以上で説明したように、第三の実施形態では、第一および第二の実施形態における翻訳条件の合致回数の代わりに辞書データの訳出回数を用いることで、第一および第二の実施形態と同様に、分野推定を行うことが可能になる。そのため、第三の実施形態においても、翻訳対象文章の翻訳に使用する分野辞書の選択を、高速かつ精度良く行うことが可能になる。 As described above, in the third embodiment, the number of translations of dictionary data is used in place of the number of translation condition matches in the first and second embodiments, so that the first and second embodiments Similarly, field estimation can be performed. Therefore, also in the third embodiment, it becomes possible to select a field dictionary used for translation of a translation target sentence at high speed and with high accuracy.

次に、本発明の第四の実施形態について説明する。 Next, a fourth embodiment of the present invention will be described.

第四の実施形態では、分野対応情報として、翻訳規則のアクション部で訳出した辞書データの訳出回数を入力すると各分野の適合度を出力するニューラルネットワークの発火条件データを使用する。 In the fourth embodiment, as the field correspondence information, the firing condition data of the neural network that outputs the fitness of each field when the number of times of translation of the dictionary data translated by the action part of the translation rule is input is used.

分野辞書管理部１５では、辞書記憶部１１の分野辞書から発火条件データの生成を行う。まず、第二の実施形態と同様に、分野辞書から分野分類木を作成する。そして、作成した分野分類木を使用して、辞書データ（原語訳語ペア）と分野とのつながりをニューロン結合としたニューラルネットワークを構成する。図１０の分野分類木から得られるニューラルネットワークの例を図１１に示す。辞書データの訳出回数を入力すると、決められた発火条件に従って発火が行われ、各分野の適合度が出力されることを示している。m(i,j)は、i番目の辞書データとj番目の分野との間の重み係数である。辞書データの訳出回数と重み係数を掛けたものの総和が、あらかじめ決められている閾値を超えたときに発火したとみなされる。発火すると、辞書データの訳出回数と重み係数を掛けたものの総和が、適合度として出力される。ここで、たとえば、発火条件の閾値を0とすると、適合度は、第三の実施形態の式（２）を計算していることになる。このようにして得られたニューラルネットワークの、辞書データの訳出回数と各分野の適応度の関係を示す発火条件データを分野対応情報記憶部１２として保持する。 The field dictionary management unit 15 generates firing condition data from the field dictionary in the dictionary storage unit 11. First, as in the second embodiment, a field classification tree is created from a field dictionary. Then, using the created field classification tree, a neural network is constructed in which the connection between the dictionary data (original word translation word pair) and the field is a neuron connection. An example of a neural network obtained from the field classification tree in FIG. 10 is shown in FIG. When the number of dictionary data translations is input, firing is performed according to the determined firing conditions, and the degree of conformity of each field is output. m (i, j) is a weighting coefficient between the i-th dictionary data and the j-th field. When the sum of the number of times of translation of dictionary data multiplied by the weighting coefficient exceeds a predetermined threshold, it is considered to have fired. When fired, the sum of the number of translations of the dictionary data multiplied by the weighting coefficient is output as the fitness. Here, for example, when the threshold value of the ignition condition is set to 0, the fitness is calculated by the expression (2) of the third embodiment. Firing condition data indicating the relationship between the number of translations of dictionary data and the fitness of each field of the neural network obtained in this way is held as the field correspondence information storage unit 12.

辞書決定部１３では、分野対応情報記憶部１２に記憶された発火条件データを使用して、分野推定を行う。発火条件データからニューラルネットワークを構成し、翻訳対象文章２に適用された辞書データの訳出回数をニューラルネットワークに入力することで各分野への適合度を計算し、翻訳対象文章２の分野を推定することができる。たとえば、図６の翻訳規則のうち「隣接（federal、court）」が適用されたとする。この翻訳規則のアクション部は「訳出（federal → 連邦）」「訳出（court → 裁判所）」となっている。このアクション部の辞書データを図１１のニューラルネットワークに入力すると、「court → 裁判所」が「法律」と関連しているため、「法律」が分野として推定されることになる。 The dictionary determination unit 13 performs field estimation using the firing condition data stored in the field correspondence information storage unit 12. A neural network is constructed from the firing condition data, the number of translations of the dictionary data applied to the translation target sentence 2 is input to the neural network, the degree of adaptation to each field is calculated, and the field of the translation target sentence 2 is estimated be able to. For example, it is assumed that “adjacent (federal, court)” is applied among the translation rules of FIG. The action part of this translation rule is “translation (federal → federal)” and “translation (court → court)”. When the dictionary data of this action part is input to the neural network of FIG. 11, “law” is estimated as a field because “court → court” is related to “law”.

以上で説明したように、第四の実施形態では、第三の実施形態の訳出回数に関する一次関数（式（２））と同様の適合度の計算を、ニューラルネットワークを使用して行うことができる。また、ここまでの説明では、辞書データの訳出回数から各分野の適合度を導く方法について記載したが、辞書データを翻訳条件に置き換えることで、第一および第二の実施形態と同様に、翻訳条件の合致回数から各分野の適合度を計算することも可能である。 As described above, in the fourth embodiment, the calculation of the fitness similar to the linear function (equation (2)) regarding the number of translations in the third embodiment can be performed using a neural network. . In the description so far, the method for deriving the fitness of each field from the number of times the dictionary data is translated has been described. However, by replacing the dictionary data with the translation conditions, the translation can be performed as in the first and second embodiments. It is also possible to calculate the fitness of each field from the number of times the condition is met.

以上で説明したように、第四の実施形態においても、第一、第二および第三の実施形態と同様に、翻訳条件の合致回数から分野を推定することができ、翻訳対象文章の翻訳に使用する分野辞書の選択を、高速かつ精度良く行うことが可能になる。 As described above, in the fourth embodiment, as in the first, second, and third embodiments, the field can be estimated from the number of matching translation conditions, and the translation target sentence can be translated. The field dictionary to be used can be selected at high speed and with high accuracy.

以上で翻訳対象文章の分野を推定して分野辞書を決定する機械翻訳装置について説明したが、図１２のように、翻訳部１６を追加して、決定された分野辞書に従って翻訳を行うことが可能である。また、さらに、図１３のように。通信部１７を追加して、翻訳対象文章２をネットワーク上で入力して、翻訳結果３を通信経由で返信するようにすることも可能である。さらに、機械翻訳装置の各部をネットワーク上に分散配置して使用できるようにすることも可能である。 Although the machine translation device that estimates the field of the text to be translated and determines the field dictionary has been described above, it is possible to add a translation unit 16 and perform translation according to the determined field dictionary as shown in FIG. It is. Furthermore, as shown in FIG. It is also possible to add the communication unit 17 and input the translation target sentence 2 on the network and return the translation result 3 via communication. Furthermore, each part of the machine translation apparatus can be used by being distributed on the network.

また、所定の単語の「隣接」及び「共起」を翻訳条件の例として説明したが、単語だけでなく、文章の「隣接」あるいは「共起」等を翻訳条件としてもよい。ここで、前述の単語や文章を単位記載とも呼ぶ。 Further, although “adjacent” and “co-occurrence” of a predetermined word have been described as examples of translation conditions, not only words but “adjacent” or “co-occurrence” of sentences may be used as translation conditions. Here, the aforementioned words and sentences are also referred to as unit descriptions.

また、本発明の機械翻訳装置、機械翻訳方法、機械翻訳プログラムおよび記録媒体は、自然言語の翻訳だけでなく、漢字仮名変換や文章への分野タグの自動付与などへ応用することも可能である。 The machine translation device, machine translation method, machine translation program, and recording medium of the present invention can be applied not only to natural language translation but also to kanji kana conversion and automatic application of field tags to sentences. .

なお、上述する各実施の形態は、本発明の好適な実施の形態であり、本発明の要旨を逸脱しない範囲内において種々変更実施が可能である。 Each of the above-described embodiments is a preferred embodiment of the present invention, and various modifications can be made without departing from the scope of the present invention.

上記の実施形態の一部又は全部は、以下の付記のようにも記載されうるが、以下には限られない。 A part or all of the above-described embodiment can be described as in the following supplementary notes, but is not limited thereto.

（付記１）
翻訳条件と分野との対応関係を示す分野対応情報を記憶する分野対応情報記憶部と、
前記分野に応じた訳語を保持する複数の分野辞書を記憶する辞書記憶部と、
翻訳対象文章内の所定の単位記載と合致する前記翻訳条件と前記分野との前記対応関係に基づいて、複数の前記分野辞書から前記翻訳対象文章の翻訳に使用する前記分野辞書を決定する辞書決定部と
を備えることを特徴とする機械翻訳装置。 (Appendix 1)
A field correspondence information storage unit for storing field correspondence information indicating correspondence between translation conditions and fields;
A dictionary storage unit that stores a plurality of field dictionaries that hold translations according to the field;
Dictionary determination for determining the field dictionary to be used for translating the sentence to be translated from a plurality of the field dictionaries based on the correspondence relationship between the field and the translation condition that matches a predetermined unit description in the sentence to be translated And a machine translation device.

（付記２）
前記辞書決定部は、さらに、前記翻訳条件の前記翻訳対象文章内での合致回数と前記分野との前記対応関係に基づいて、前記分野ごとの適合度を計算し、前記適合度が大きい前記分野に対応する前記分野辞書を前記翻訳対象文章の翻訳に使用する前記分野辞書として決定する
ことを特徴とする付記１に記載の機械翻訳装置。 (Appendix 2)
The dictionary determination unit further calculates the degree of adaptation for each field based on the number of matches of the translation condition in the translation target sentence and the correspondence relationship with the field, and the field having a high degree of fitness The machine translation device according to appendix 1, wherein the field dictionary corresponding to is determined as the field dictionary used for translation of the translation target sentence.

（付記３）
前記機械翻訳装置は、さらに、
前記翻訳条件と前記翻訳条件に対応する前記訳語とを含む翻訳規則を記憶する翻訳規則記憶部と、
前記翻訳規則と前記分野辞書とから前記分野対応情報を生成する分野辞書管理部と
を備え、
前記辞書決定部は、さらに、前記分野対応情報を用いて、前記翻訳対象文章内の所定の前記単位記載と合致した前記翻訳条件に対応する前記分野を調べて前記適合度を計算し、
前記適合度は、前記分野に対応する前記翻訳条件の前記合致回数の増加に従って大きくなる
ことを特徴とする付記２に記載の機械翻訳装置。 (Appendix 3)
The machine translation device further includes:
A translation rule storage unit for storing a translation rule including the translation condition and the translation corresponding to the translation condition;
A field dictionary management unit that generates the field correspondence information from the translation rule and the field dictionary;
The dictionary determination unit further calculates, using the field correspondence information, the field corresponding to the translation condition that matches the predetermined unit description in the translation target sentence, and calculates the fitness.
The machine translation device according to appendix 2, wherein the degree of conformity increases as the number of matches of the translation condition corresponding to the field increases.

（付記４）
前記適合度は、さらに、前記翻訳条件の前記合致回数に関する一次関数を計算することによって得られる
ことを特徴とする付記３に記載の機械翻訳装置。 (Appendix 4)
The machine translation device according to appendix 3, wherein the fitness is further obtained by calculating a linear function related to the number of times the translation condition is matched.

（付記５）
前記適合度は、さらに、前記翻訳条件と前記分野との間の重み係数と、前記翻訳条件の前記合致回数とを乗算したものを、前記分野ごとに合計して計算され、
前記重み係数は、０以上の実数値であり、対応付けられていない前記翻訳条件と前記分野との間の重み係数は０、対応付けられている前記翻訳条件と前記分野との間の重み係数は０より大きい
ことを特徴とする付記４に記載の機械翻訳装置。 (Appendix 5)
The fitness is further calculated by multiplying a weighting factor between the translation condition and the field by the number of matches of the translation condition for each field,
The weighting factor is a real value greater than or equal to 0, the weighting factor between the unrelated translation condition and the field is 0, and the weighting factor between the associated translation condition and the field The machine translation apparatus according to appendix 4, wherein is greater than 0.

（付記６）
対応付けられている前記翻訳条件と前記分野との間の重み係数は１である
ことを特徴とする付記５に記載の機械翻訳装置。 (Appendix 6)
The machine translation device according to appendix 5, wherein a weighting factor between the translation condition and the field associated with each other is 1.

（付記７）
前記分野辞書管理部は、さらに、前記適合度の計算方法をニューラルネットワークの発火条件とした前記発火条件を前記分野対応情報とし、
前記辞書決定部は、さらに、前記翻訳条件の前記合致回数を前記発火条件のニューラルネットワークに入力して前記適合度を計算する
ことを特徴とする付記４から付記６に記載の機械翻訳装置。 (Appendix 7)
The field dictionary management unit further sets the firing condition, which is the neural network firing condition as the calculation method of the fitness, as the field correspondence information,
The machine translation device according to any one of appendix 4 to appendix 6, wherein the dictionary determination unit further calculates the fitness by inputting the number of matches of the translation condition into a neural network of the firing condition.

（付記８）
訳語と分野との対応関係を示す分野対応情報を記憶する分野対応情報記憶部と、
前記分野に応じた前記訳語を保持する複数の分野辞書を記憶する辞書記憶部と、
翻訳条件と前記翻訳条件に対応する前記訳語とを含む前記翻訳規則を記憶する翻訳規則記憶部と、
翻訳対象文章内の所定の単位記載と合致する前記翻訳条件に対応する前記訳語と前記分野との前記対応関係に基づいて、複数の前記分野辞書から前記翻訳対象文章の翻訳に使用する前記分野辞書を決定する辞書決定部と
を備えることを特徴とする機械翻訳装置。 (Appendix 8)
A field correspondence information storage unit that stores field correspondence information indicating correspondence between translated words and fields;
A dictionary storage unit that stores a plurality of field dictionaries that hold the translated words according to the field;
A translation rule storage unit that stores the translation rule including a translation condition and the translation corresponding to the translation condition;
The field dictionary used for translating the sentence to be translated from a plurality of the field dictionaries based on the correspondence relationship between the translation word corresponding to the translation condition and the field corresponding to the translation unit description in the sentence to be translated A machine translation apparatus comprising: a dictionary determination unit that determines

（付記９）
前記辞書決定部は、さらに、前記翻訳条件が前記翻訳対象文章内で合致した前記翻訳規則で訳出する前記訳語の訳出回数と前記分野との前記対応関係に基づいて、前記分野ごとの適合度を計算し、前記適合度が大きい前記分野に対応する前記分野辞書を前記翻訳対象文章の翻訳に使用する前記分野辞書として決定する
ことを特徴とする付記８に記載の機械翻訳装置。 (Appendix 9)
The dictionary determination unit further determines the fitness for each field based on the correspondence relationship between the field and the number of translations of the translated words translated by the translation rule in which the translation condition is matched in the sentence to be translated. 9. The machine translation device according to appendix 8, wherein the field dictionary corresponding to the field having a high degree of fitness is calculated as the field dictionary to be used for translation of the translation target sentence.

（付記１０）
前記機械翻訳装置は、さらに、
前記分野辞書から前記分野対応情報を生成する分野辞書管理部
を備え、
前記辞書決定部は、さらに、前記分野対応情報を用いて、前記翻訳条件が前記翻訳対象文章内で合致した前記翻訳規則で訳出する前記訳語に対応する前記分野を調べて前記適合度を計算し、
前記適合度は、前記分野に対応する前記訳語の前記訳出回数の増加に従って大きくなる
ことを特徴とする付記９に記載の機械翻訳装置。 (Appendix 10)
The machine translation device further includes:
A field dictionary management unit for generating the field correspondence information from the field dictionary,
The dictionary determining unit further uses the field correspondence information to check the field corresponding to the translated word translated by the translation rule whose translation condition matches in the translation target sentence, and calculates the fitness. ,
The machine translation apparatus according to appendix 9, wherein the fitness level increases as the number of translations of the translation corresponding to the field increases.

（付記１１）
前記適合度は、さらに、前記訳語の前記訳出回数に関する一次関数を計算することによって得られる
ことを特徴とする付記１０に記載の機械翻訳装置。 (Appendix 11)
The machine translation apparatus according to appendix 10, wherein the fitness is further obtained by calculating a linear function related to the number of translations of the translated word.

（付記１２）
前記適合度は、さらに、前記訳語と前記分野との間の重み係数と、前記訳語の前記訳出回数とを乗算したものを、前記分野ごとに合計して計算され、
前記重み係数は、０以上の実数値であり、対応付けられていない前記訳語と前記分野との間の重み係数は０、対応付けられている前記訳語と前記分野との間の重み係数は０より大きい
ことを特徴とする付記１１に記載の機械翻訳装置。 (Appendix 12)
The fitness is further calculated by multiplying, for each field, a product of a weighting factor between the translated word and the field and the number of translations of the translated word.
The weighting factor is a real value greater than or equal to 0, the weighting factor between the uncorresponding translation and the field is 0, and the weighting factor between the associated translation and the field is 0. The machine translation device according to attachment 11, wherein the machine translation device is larger.

（付記１３）
対応付けられている前記訳語と前記分野との間の重み係数は１である
ことを特徴とする付記１２に記載の機械翻訳装置。 (Appendix 13)
The machine translation apparatus according to appendix 12, wherein the weighting factor between the translated word and the field associated with each other is 1.

（付記１４）
前記分野辞書管理部は、さらに、前記適合度の計算方法をニューラルネットワークの発火条件とした前記発火条件を前記分野対応情報とし、
前記辞書決定部は、さらに、前記訳語の前記訳出回数を前記発火条件のニューラルネットワークに入力して前記適合度を計算する
ことを特徴とする付記１１から付記１３に記載の機械翻訳装置。 (Appendix 14)
The field dictionary management unit further sets the firing condition, which is the neural network firing condition as the calculation method of the fitness, as the field correspondence information,
The machine translation device according to appendix 11 to appendix 13, wherein the dictionary determination unit further calculates the fitness by inputting the number of translations of the translation into the neural network of the firing condition.

（付記１５）
前記機械翻訳装置は、さらに、
前記辞書決定部で決定された前記翻訳対象文章の翻訳に使用する前記分野辞書を使用して翻訳を行う翻訳部
を備えることを特徴とする付記１から付記１４に記載の機械翻訳装置。 (Appendix 15)
The machine translation device further includes:
15. The machine translation device according to any one of supplementary notes 1 to 14, further comprising: a translation unit that performs translation using the field dictionary used for translation of the translation target sentence determined by the dictionary determination unit.

（付記１６）
前記機械翻訳装置は、さらに、
前記翻訳対象文章をネットワーク経由で受信し、翻訳結果を返信するデータ通信部
を備えることを特徴とする付記１５に記載の機械翻訳装置。 (Appendix 16)
The machine translation device further includes:
The machine translation apparatus according to claim 15, further comprising a data communication unit that receives the translation target sentence via a network and returns a translation result.

（付記１７）
翻訳対象文章内の所定の単位記載と合致する翻訳条件と、前記翻訳条件と分野との対応関係を示す分野対応情報に基づいて、前記分野に応じた訳語を保持する複数の分野辞書から前記翻訳対象文章の翻訳に使用する前記分野辞書を決定する
ことを特徴とする機械翻訳方法。 (Appendix 17)
The translation from a plurality of field dictionaries holding translation words corresponding to the field based on the field condition information indicating a translation condition that matches a predetermined unit description in the sentence to be translated and the translation condition and the field A machine translation method comprising: determining the field dictionary to be used for translation of a target sentence.

（付記１８）
前記機械翻訳方法は、さらに、
前記翻訳条件の前記翻訳対象文章内での合致回数と前記分野との前記対応関係に基づいて、前記分野ごとの適合度を計算し、前記適合度が大きい前記分野に対応する前記分野辞書を前記翻訳対象文章の翻訳に使用する前記分野辞書として決定する
ことを特徴とする付記１７に記載の機械翻訳方法。 (Appendix 18)
The machine translation method further includes:
Based on the number of matches of the translation condition in the translation target sentence and the correspondence relationship with the field, the degree of matching for each field is calculated, and the field dictionary corresponding to the field with the high degree of matching is The machine translation method according to supplementary note 17, wherein the field dictionary is determined as the field dictionary used for translation of the translation target sentence.

（付記１９）
前記機械翻訳方法は、さらに、
前記翻訳条件と前記翻訳条件に対応する前記訳語とを含む翻訳規則と、前記分野辞書とから、前記翻訳条件と前記分野を対応付けた前記分野対応情報を生成し、前記分野対応情報を用いて、前記翻訳対象文章内の所定の前記単位記載と合致した前記翻訳条件に対応する前記分野を調べて前記適合度を計算し、前記適合度は、前記分野に対応する前記翻訳条件の前記合致回数の増加に従って大きくなる
ことを特徴とする付記１８に記載の機械翻訳方法。 (Appendix 19)
The machine translation method further includes:
From the translation rule including the translation condition and the translation corresponding to the translation condition, and the field dictionary, the field correspondence information in which the translation condition is associated with the field is generated, and the field correspondence information is used. , Examining the field corresponding to the translation condition that matches the predetermined unit description in the translation target sentence, and calculating the fitness, the fitness is the number of matches of the translation condition corresponding to the field The machine translation method according to appendix 18, wherein the machine translation method increases as the value increases.

（付記２０）
前記適合度は、さらに、前記翻訳条件の前記合致回数に関する一次関数を計算することによって得られる
ことを特徴とする付記１９に記載の機械翻訳方法。 (Appendix 20)
The machine translation method according to appendix 19, wherein the fitness is further obtained by calculating a linear function related to the number of matches of the translation condition.

（付記２１）
前記適合度は、さらに、前記翻訳条件と前記分野との間の重み係数と、前記翻訳条件の前記合致回数とを乗算したものを、前記分野ごとに合計して計算され、
前記重み係数は、０以上の実数値であり、対応付けられていない前記翻訳条件と前記分野との間の重み係数は０、対応付けられている前記翻訳条件と前記分野との間の重み係数は０より大きい
ことを特徴とする付記２０に記載の機械翻訳方法。 (Appendix 21)
The fitness is further calculated by multiplying a weighting factor between the translation condition and the field by the number of matches of the translation condition for each field,
The weighting factor is a real value greater than or equal to 0, the weighting factor between the unrelated translation condition and the field is 0, and the weighting factor between the associated translation condition and the field The machine translation method according to appendix 20, wherein is greater than zero.

（付記２２）
対応付けられている前記翻訳条件と前記分野との間の重み係数は１である
ことを特徴とする付記２１に記載の機械翻訳方法。 (Appendix 22)
The machine translation method according to appendix 21, wherein the weighting factor between the translation conditions associated with the field is 1.

（付記２３）
前記機械翻訳方法は、さらに、
前記適合度の計算方法をニューラルネットワークの発火条件とした前記発火条件を前記分野対応情報とし、前記翻訳条件の前記合致回数を前記発火条件のニューラルネットワークに入力して前記適合度を計算する
ことを特徴とする付記２０から付記２２に記載の機械翻訳方法。 (Appendix 23)
The machine translation method further includes:
Using the firing condition as the neural network firing condition as the neural network firing condition as the field correspondence information, and inputting the number of matches of the translation condition into the neural network of the firing condition to calculate the fitness. 23. The machine translation method according to supplementary note 20 to supplementary note 22, which is characterized.

（付記２４）
翻訳条件と前記翻訳条件に対応する訳語とを含む翻訳規則と、翻訳対象文章内の所定の単位記載と合致する前記翻訳条件に対応する前記訳語と分野との対応関係を示す分野対応情報とに基づいて、前記分野に応じた前記訳語を保持する複数の分野辞書から前記翻訳対象文章の翻訳に使用する前記分野辞書を決定する
ことを特徴とする機械翻訳方法。 (Appendix 24)
A translation rule including a translation condition and a translation corresponding to the translation condition, and field correspondence information indicating a correspondence relationship between the translation corresponding to the translation condition and the field corresponding to the predetermined unit description in the translation target sentence Based on the plurality of field dictionaries holding the translated words corresponding to the field, the field dictionary to be used for translation of the translation target sentence is determined.

（付記２５）
前記機械翻訳方法は、さらに、
前記翻訳条件が前記翻訳対象文章内で合致した前記翻訳規則で訳出する前記訳語の訳出回数と前記分野との前記対応関係に基づいて、前記分野ごとの適合度を計算し、前記適合度が大きい前記分野に対応する前記分野辞書を前記翻訳対象文章の翻訳に使用する前記分野辞書として決定する
ことを特徴とする付記２４に記載の機械翻訳方法。 (Appendix 25)
The machine translation method further includes:
Based on the number of translations of the translated words translated by the translation rule that matches the translation condition in the translation target sentence and the correspondence relationship with the field, the degree of fitness for each field is calculated, and the degree of matching is large. 25. The machine translation method according to appendix 24, wherein the field dictionary corresponding to the field is determined as the field dictionary used for translating the sentence to be translated.

（付記２６）
前記機械翻訳方法は、さらに、
前記分野辞書から前記分野対応情報を生成し、前記分野対応情報を用いて、前記翻訳条件が前記翻訳対象文章内で合致した前記翻訳規則で訳出する前記訳語に対応する前記分野を調べて前記適合度を計算し、前記適合度は、前記分野に対応する前記訳語の前記訳出回数の増加に従って大きくなる
ことを特徴とする付記２５に記載の機械翻訳方法。 (Appendix 26)
The machine translation method further includes:
The field correspondence information is generated from the field dictionary, and the field correspondence information is used to check the field corresponding to the translated word translated by the translation rule whose translation condition matches in the translation target sentence. 26. The machine translation method according to claim 25, wherein a degree of conformity is calculated, and the degree of conformity increases as the number of translations of the translation corresponding to the field increases.

（付記２７）
前記適合度は、さらに、前記訳語の前記訳出回数に関する一次関数を計算することによって得られる
ことを特徴とする付記２６に記載の機械翻訳方法。 (Appendix 27)
The machine translation method according to appendix 26, wherein the goodness of fit is further obtained by calculating a linear function related to the number of translations of the translated word.

（付記２８）
前記適合度は、さらに、前記訳語と前記分野との間の重み係数と、前記訳語の前記訳出回数とを乗算したものを、前記分野ごとに合計して計算され、
前記重み係数は、０以上の実数値であり、対応付けられていない前記訳語と前記分野との間の重み係数は０、対応付けられている前記訳語と前記分野との間の重み係数は０より大きい
ことを特徴とする付記２７に記載の機械翻訳方法。 (Appendix 28)
The fitness is further calculated by multiplying, for each field, a product of a weighting factor between the translated word and the field and the number of translations of the translated word.
The weighting factor is a real value greater than or equal to 0, the weighting factor between the uncorresponding translation and the field is 0, and the weighting factor between the associated translation and the field is 0. 28. The machine translation method according to appendix 27, which is larger.

（付記２９）
対応付けられている前記訳語と前記分野との間の重み係数は１である
ことを特徴とする付記２８に記載の機械翻訳方法。 (Appendix 29)
29. The machine translation method according to appendix 28, wherein the weighting factor between the translated word and the field associated with each other is 1.

（付記３０）
前記機械翻訳方法は、さらに、
前記適合度の計算方法をニューラルネットワークの発火条件とした前記発火条件を前記分野対応情報として生成し、前記訳語の前記訳出回数を前記発火条件のニューラルネットワークに入力して前記適合度を計算する
ことを特徴とする付記２７から付記２９に記載の機械翻訳方法。 (Appendix 30)
The machine translation method further includes:
Generating the firing condition as the field correspondence information using the method for calculating the fitness as a firing condition of a neural network, and calculating the fitness by inputting the number of translations of the translation into the neural network of the firing condition. 30. The machine translation method according to appendix 27 to appendix 29, wherein:

（付記３１）
前記機械翻訳方法は、さらに、
決定された前記翻訳対象文章の翻訳に使用する前記分野辞書を使用して翻訳を行う
ことを特徴とする付記１７から付記３０に記載の機械翻訳方法。 (Appendix 31)
The machine translation method further includes:
The machine translation method according to appendix 17 to appendix 30, wherein translation is performed using the field dictionary used for translating the determined translation target sentence.

（付記３２）
前記機械翻訳方法は、さらに、
前記翻訳対象文章をネットワーク経由で受信し、翻訳結果を返信する
ことを特徴とする付記３１に記載の機械翻訳方法。 (Appendix 32)
The machine translation method further includes:
32. The machine translation method according to appendix 31, wherein the translation target sentence is received via a network and a translation result is returned.

（付記３３）
コンピュータに、
翻訳条件と分野との対応関係を示す分野対応情報を記憶する分野対応情報記憶機能と、
前記分野に応じた訳語を保持する複数の分野辞書を記憶する辞書記憶機能と、
翻訳対象文章内の所定の単位記載と合致する前記翻訳条件と前記分野との前記対応関係に基づいて、複数の前記分野辞書から前記翻訳対象文章の翻訳に使用する前記分野辞書を決定する辞書決定機能と
を実現させるための機械翻訳プログラム。 (Appendix 33)
On the computer,
A field correspondence information storage function for storing field correspondence information indicating correspondence between translation conditions and fields;
A dictionary storage function for storing a plurality of field dictionaries that hold translations according to the field;
Dictionary determination for determining the field dictionary to be used for translating the sentence to be translated from a plurality of the field dictionaries based on the correspondence relationship between the field and the translation condition that matches a predetermined unit description in the sentence to be translated Machine translation program for realizing functions and.

（付記３４）
前記辞書決定機能は、さらに、前記翻訳条件の前記翻訳対象文章内での合致回数と前記分野との前記対応関係に基づいて、前記分野ごとの適合度を計算し、前記適合度が大きい前記分野に対応する前記分野辞書を前記翻訳対象文章の翻訳に使用する前記分野辞書として決定する
ことを特徴とする付記３３に記載の機械翻訳プログラム。 (Appendix 34)
The dictionary determination function further calculates the degree of adaptation for each field based on the number of matches of the translation condition in the sentence to be translated and the correspondence relationship with the field, and the field having a high degree of fitness 34. The machine translation program according to appendix 33, wherein the field dictionary corresponding to is determined as the field dictionary used for translation of the translation target sentence.

（付記３５）
前記機械翻訳プログラムは、コンピュータに、さらに、
前記翻訳条件と前記翻訳条件に対応する前記訳語とを含む翻訳規則を記憶する翻訳規則記憶機能と、
前記翻訳条件と前記分野辞書とから前記分野対応情報を生成する分野辞書管理機能と、
を実現させ、
前記辞書決定機能は、さらに、前記分野対応情報を用いて、前記翻訳対象文章内の所定の前記単位記載と合致した前記翻訳条件に対応する前記分野を調べて前記適合度を計算し、
前記適合度は、前記分野に対応する前記翻訳条件の前記合致回数の増加に従って大きくなる
ことを特徴とする付記３４に記載の機械翻訳プログラム。 (Appendix 35)
The machine translation program is further stored on a computer.
A translation rule storage function for storing a translation rule including the translation condition and the translation corresponding to the translation condition;
A field dictionary management function for generating the field correspondence information from the translation conditions and the field dictionary;
Realized,
The dictionary determining function further calculates the fitness by examining the field corresponding to the translation condition that matches the predetermined unit description in the translation target sentence using the field correspondence information,
The machine translation program according to appendix 34, wherein the fitness level increases as the number of matches of the translation conditions corresponding to the field increases.

（付記３６）
前記適合度は、さらに、前記翻訳条件の前記合致回数に関する一次関数を計算することによって得られる
ことを特徴とする付記３５に記載の機械翻訳プログラム。 (Appendix 36)
36. The machine translation program according to appendix 35, wherein the fitness is further obtained by calculating a linear function related to the number of matches of the translation condition.

（付記３７）
前記適合度は、さらに、前記翻訳条件と前記分野との間の重み係数と、前記翻訳条件の前記合致回数とを乗算したものを、前記分野ごとに合計して計算され、
前記重み係数は、０以上の実数値であり、対応付けられていない前記翻訳条件と前記分野との間の重み係数は０、対応付けられている前記翻訳条件と前記分野との間の重み係数は０より大きい
ことを特徴とする付記３６に記載の機械翻訳プログラム。 (Appendix 37)
The fitness is further calculated by multiplying a weighting factor between the translation condition and the field by the number of matches of the translation condition for each field,
The weighting factor is a real value greater than or equal to 0, the weighting factor between the unrelated translation condition and the field is 0, and the weighting factor between the associated translation condition and the field 37. The machine translation program according to appendix 36, wherein is greater than zero.

（付記３８）
対応付けられている前記翻訳条件と前記分野との間の重み係数は１である
ことを特徴とする付記３７に記載の機械翻訳プログラム。 (Appendix 38)
38. The machine translation program according to appendix 37, wherein the weighting factor between the associated translation condition and the field is 1.

（付記３９）
前記分野辞書管理機能は、さらに、前記適合度の計算方法をニューラルネットワークの発火条件とした前記発火条件を前記分野対応情報とし、
前記辞書決定機能は、さらに、前記翻訳条件の前記合致回数を前記発火条件のニューラルネットワークに入力して前記適合度を計算する
ことを特徴とする付記３６から付記３８に記載の機械翻訳プログラム。 (Appendix 39)
The field dictionary management function further sets the firing condition with the method for calculating the fitness as the firing condition of the neural network as the field correspondence information,
The machine translation program according to appendix 36 to appendix 38, wherein the dictionary determination function further inputs the number of matches of the translation condition into a neural network of the firing condition and calculates the fitness.

（付記４０）
コンピュータに、
訳語と分野との対応関係を示す分野対応情報を記憶する分野対応情報記憶機能と、
前記分野に応じた前記訳語を保持する複数の分野辞書を記憶する辞書記憶機能と、
翻訳条件と前記翻訳条件に対応する前記訳語とを含む前記翻訳規則を記憶する翻訳規則記憶機能と、
翻訳対象文章内の所定の単位記載と合致する前記翻訳条件に対応する前記訳語と前記分野との前記対応関係に基づいて、複数の前記分野辞書から前記翻訳対象文章の翻訳に使用する前記分野辞書を決定する辞書決定機能と
を実現させるための機械翻訳プログラム。 (Appendix 40)
On the computer,
A field correspondence information storage function for storing field correspondence information indicating the correspondence between translated words and fields;
A dictionary storage function for storing a plurality of field dictionaries that hold the translated words according to the field;
A translation rule storage function for storing the translation rule including a translation condition and the translation corresponding to the translation condition;
The field dictionary used for translating the sentence to be translated from a plurality of the field dictionaries based on the correspondence relationship between the translation word corresponding to the translation condition and the field corresponding to the translation unit description in the sentence to be translated A machine translation program for realizing a dictionary determination function for determining

（付記４１）
前記辞書決定機能は、さらに、前記翻訳条件が前記翻訳対象文章内で合致した前記翻訳規則で訳出する前記訳語の訳出回数と前記分野との前記対応関係に基づいて、前記分野ごとの適合度を計算し、前記適合度が大きい前記分野に対応する前記分野辞書を前記翻訳対象文章の翻訳に使用する前記分野辞書として決定する
ことを特徴とする付記４０に記載の機械翻訳プログラム。 (Appendix 41)
The dictionary determination function is further configured to determine a degree of fitness for each field based on the correspondence between the number of translations of the translated word translated by the translation rule with the translation condition matched in the translation target sentence and the field. 41. The machine translation program according to appendix 40, wherein the field dictionary corresponding to the field having a high degree of fitness is calculated and used as the field dictionary used for translation of the translation target sentence.

（付記４２）
前記機械翻訳プログラムは、コンピュータに、さらに、
前記分野辞書から前記分野対応情報を生成する分野辞書管理機能
を実現させ、
前記辞書決定機能は、さらに、前記分野対応情報を用いて、前記翻訳条件が前記翻訳対象文章内で合致した前記翻訳規則で訳出する前記訳語に対応する前記分野を調べて前記適合度を計算し、
前記適合度は、前記分野に対応する前記訳語の前記訳出回数の増加に従って大きくなる
ことを特徴とする付記４１に記載の機械翻訳プログラム。 (Appendix 42)
The machine translation program is further stored on a computer.
Realizing a field dictionary management function for generating the field correspondence information from the field dictionary,
The dictionary determination function further calculates, using the field correspondence information, the field corresponding to the translated word translated by the translation rule whose translation condition matches in the translation target sentence, and calculates the fitness. ,
The machine translation program according to appendix 41, wherein the fitness level increases as the number of translations of the translation corresponding to the field increases.

（付記４３）
前記適合度は、さらに、前記訳語の前記訳出回数に関する一次関数を計算することによって得られる
ことを特徴とする付記４２に記載の機械翻訳プログラム。 (Appendix 43)
The machine translation program according to appendix 42, wherein the fitness is further obtained by calculating a linear function related to the number of translations of the translated word.

（付記４４）
前記適合度は、さらに、前記訳語と前記分野との間の重み係数と、前記訳語の前記訳出回数とを乗算したものを、前記分野ごとに合計して計算され、
前記重み係数は、０以上の実数値であり、対応付けられていない前記訳語と前記分野との間の重み係数は０、対応付けられている前記訳語と前記分野との間の重み係数は０より大きい
ことを特徴とする付記４３に記載の機械翻訳装プログラム。 (Appendix 44)
The fitness is further calculated by multiplying, for each field, a product of a weighting factor between the translated word and the field and the number of translations of the translated word.
The weighting factor is a real value greater than or equal to 0, the weighting factor between the uncorresponding translation and the field is 0, and the weighting factor between the associated translation and the field is 0. 44. The machine translation program according to appendix 43, which is larger.

（付記４５）
対応付けられている前記訳語と前記分野との間の重み係数は１である
ことを特徴とする付記４４に記載の機械翻訳プログラム。 (Appendix 45)
45. The machine translation program according to appendix 44, wherein the weighting factor between the translated word and the field associated with each other is 1.

（付記４６）
前記分野辞書管理機能は、さらに、前記適合度の計算方法をニューラルネットワークの発火条件とした前記発火条件を前記分野対応情報として生成し、
前記辞書決定機能は、さらに、前記訳語の前記訳出回数を前記発火条件のニューラルネットワークに入力して前記適合度を計算する
ことを特徴とする付記４３から付記４５に記載の機械翻訳プログラム。 (Appendix 46)
The field dictionary management function further generates, as the field correspondence information, the firing condition in which the calculation method of the fitness is a firing condition of a neural network,
46. The machine translation program according to appendix 43 to appendix 45, wherein the dictionary determination function further calculates the fitness by inputting the number of translations of the translation into the neural network of the firing condition.

（付記４７）
前記機械翻訳プログラムは、コンピュータに、さらに、
前記辞書決定部で決定された前記翻訳対象文章の翻訳に使用する前記分野辞書を使用して翻訳を行う翻訳機能
を実現させるための付記３３から付記４６に記載の機械翻訳プログラム。 (Appendix 47)
The machine translation program is further stored on a computer.
47. The machine translation program according to appendix 33 to appendix 46 for realizing a translation function for performing translation using the field dictionary used for translation of the translation target sentence determined by the dictionary determination unit.

（付記４８）
前記機械翻訳プログラムは、コンピュータに、さらに、
前記翻訳対象文章をネットワーク経由で受信し、翻訳結果を返信するデータ通信部
を実現させるための付記４７に記載の機械翻訳プログラム。 (Appendix 48)
The machine translation program is further stored on a computer.
48. The machine translation program according to appendix 47 for realizing a data communication unit that receives the translation target sentence via a network and returns a translation result.

（付記４９）
付記３３から付記４８に記載の機械翻訳プログラムを記録した、コンピュータ読み取り可能な記録媒体。 (Appendix 49)
A computer-readable recording medium on which the machine translation program according to appendix 33 to appendix 48 is recorded.

１機械翻訳装置
１１辞書記憶部
１２分野対応情報記憶部
１３辞書決定部
１４翻訳規則記憶部
１５分野辞書管理部
１６翻訳部
１７通信部
２翻訳対象文章
３翻訳結果 DESCRIPTION OF SYMBOLS 1 Machine translation apparatus 11 Dictionary memory | storage part 12 Field correspondence information storage part 13 Dictionary determination part 14 Translation rule memory | storage part 15 Field dictionary management part 16 Translation part 17 Communication part 2 Translation object sentence 3 Translation result 3 Translation result

Claims

A field correspondence information storage unit for storing field correspondence information indicating correspondence between translation conditions and fields;
A dictionary storage unit that stores a plurality of field dictionaries that hold translations according to the field;
Dictionary determination for determining the field dictionary to be used for translating the sentence to be translated from a plurality of the field dictionaries based on the correspondence relationship between the field and the translation condition that matches a predetermined unit description in the sentence to be translated And a machine translation device.

The dictionary determination unit further calculates the degree of adaptation for each field based on the number of matches of the translation condition in the translation target sentence and the correspondence relationship with the field, and the field having a high degree of fitness The machine translation device according to claim 1, wherein the field dictionary corresponding to is determined as the field dictionary used for translation of the translation target sentence.

The machine translation device further includes:
A translation rule storage unit for storing a translation rule including the translation condition and the translation corresponding to the translation condition;
A field dictionary management unit that generates the field correspondence information from the translation rule and the field dictionary;
The dictionary determination unit further calculates, using the field correspondence information, the field corresponding to the translation condition that matches the predetermined unit description in the translation target sentence, and calculates the fitness.
The machine translation device according to claim 2, wherein the degree of conformity increases as the number of matches of the translation rule corresponding to the field increases.

The machine translation device according to claim 3, wherein the fitness is further obtained by calculating a linear function related to the number of matches of the translation condition.

A field correspondence information storage unit that stores field correspondence information indicating correspondence between translated words and fields;
A dictionary storage unit that stores a plurality of field dictionaries that hold the translated words according to the field;
A translation rule storage unit that stores the translation rule including a translation condition and the translation corresponding to the translation condition;
The field dictionary used for translating the sentence to be translated from a plurality of the field dictionaries based on the correspondence relationship between the translation word corresponding to the translation condition and the field corresponding to the translation unit description in the sentence to be translated A machine translation apparatus comprising: a dictionary determination unit that determines

The translation from a plurality of field dictionaries holding translation words corresponding to the field based on the field condition information indicating a translation condition that matches a predetermined unit description in the sentence to be translated and the translation condition and the field A machine translation method comprising: determining the field dictionary to be used for translation of a target sentence.

A translation rule including a translation condition and a translation corresponding to the translation condition, and field correspondence information indicating a correspondence relationship between the translation corresponding to the translation condition and the field corresponding to the predetermined unit description in the translation target sentence Based on the plurality of field dictionaries holding the translated words corresponding to the field, the field dictionary to be used for translation of the translation target sentence is determined.

On the computer,
A field correspondence information storage function for storing field correspondence information indicating correspondence between translation conditions and fields;
A dictionary storage function for storing a plurality of field dictionaries that hold translations according to the field;
Dictionary determination for determining the field dictionary to be used for translating the sentence to be translated from a plurality of the field dictionaries based on the correspondence relationship between the field and the translation condition that matches a predetermined unit description in the sentence to be translated Machine translation program for realizing functions and.

On the computer,
A field correspondence information storage function for storing field correspondence information indicating the correspondence between translated words and fields;
A dictionary storage function for storing a plurality of field dictionaries that hold the translated words according to the field;
A translation rule storage function for storing the translation rule including a translation condition and the translation corresponding to the translation condition;
The field dictionary used for translating the sentence to be translated from a plurality of the field dictionaries based on the correspondence relationship between the translation word corresponding to the translation condition and the field corresponding to the translation unit description in the sentence to be translated A machine translation program for realizing a dictionary determination function for determining

A computer-readable recording medium on which the machine translation program according to claim 8 or 9 is recorded.