JP6312243B2

JP6312243B2 - Translation apparatus, translation method, and translation program

Info

Publication number: JP6312243B2
Application number: JP2014068166A
Authority: JP
Inventors: 光昭小関
Original assignee: Zenrin Datacom Co Ltd
Current assignee: Zenrin Datacom Co Ltd
Priority date: 2014-03-28
Filing date: 2014-03-28
Publication date: 2018-04-18
Anticipated expiration: 2034-03-28
Also published as: JP2015191446A

Description

この発明は、例えば、地図上に記載される文字情報であるいわゆる地図注記などのように、大量に存在する名詞句の羅列データの翻訳を行うための装置、方法、プログラムに関する。 The present invention relates to an apparatus, a method, and a program for translating enumeration data of a large number of noun phrases such as so-called map notes that are character information described on a map.

従来から機械翻訳装置に関する種々の発明がなされている。例えば、後に記す特許文献１には、固有名詞を翻訳する場合であっても、固有名詞辞書を適切に整備しておくことにより正確に翻訳することができることが記載されている。また、同文献には、固有名詞辞書に登録された各固有名詞に対する捕捉語を記憶保持する捕捉語辞書をも用いることにより、固有名詞がどのようなものかについても翻訳結果に含めるようにすることが記載されている。これにより、翻訳結果をより理解し易いものとすることができるという点で効果がある。 Conventionally, various inventions relating to machine translation devices have been made. For example, Patent Document 1 described later describes that even when a proper noun is translated, it can be accurately translated by properly preparing a proper noun dictionary. In addition, the same document also includes a captured word dictionary that stores captured words for each proper noun registered in the proper noun dictionary, so that the proper result is included in the translation result. It is described. This is advantageous in that the translation result can be made easier to understand.

特開２００４−２２０４１６号公報JP 2004-220416 A

従来の機械翻訳装置の場合、翻訳のために種々の辞書が必要になっている。上述した特許文献１に記載の発明でも固有名詞などの翻訳辞書に加えて捕捉語辞書も必要なっている。翻訳のために辞書を検索する処理にはある程度時間がかかるために、大量の固有名詞などの名詞句の羅列の翻訳を行う場合には時間が掛かる。例えば、大量の名詞句の羅列の翻訳行う場合の例として、地図注記の翻訳を行う場合について考える。 In the case of a conventional machine translation device, various dictionaries are required for translation. The invention described in Patent Document 1 described above also requires a captured word dictionary in addition to a translation dictionary such as proper nouns. Since it takes a certain amount of time to search the dictionary for translation, it takes time to translate a list of noun phrases such as a large number of proper nouns. For example, consider the case of translating map notes as an example of translating a large number of noun phrases.

地図には、例えば、市街図、道路図、広域図、地方図、全国図といった種々のものがあり、さらに同じ種類の地図でも縮尺の異なる複数の地図が存在する。このように、種類や縮尺の異なる全ての地図を対象にした場合、地図注記は数千万件も存在する。そして、数字、アルファベット、アイコンのみにより構成される地図注記などの翻訳が必要ないものは除外し、重複する地図注記は１つに集約したとしても、翻訳対象となる地図注記は数百万件も存在する。 There are various maps such as city maps, road maps, regional maps, regional maps, and national maps, and there are a plurality of maps of different scales even with the same type of map. Thus, when all types of maps of different types and scales are targeted, there are tens of millions of map notes. And even if map annotations consisting only of numbers, alphabets, and icons are excluded, those that do not need to be translated are excluded, and even if multiple overlapping map notes are consolidated into one, millions of map notes are subject to translation. Exists.

この数百万件の地図注記を、従来の辞書を参照する方式で多言語に機械翻訳する場合には、数週間を要しているのが現状である。また、地図注記は、地図上の地物が無くなったり、新たにできたりした場合には、必ず変わるため、所定のタイミングで現地調査をして更新し、翻訳し直す必要がある。 It takes several weeks to machine-translate these millions of map notes into multiple languages using a conventional dictionary reference method. In addition, map notes always change when features on the map are lost or newly created, so it is necessary to conduct a field survey at a predetermined timing to update and re-translate.

このように、地図注記は、翻訳を繰り返す必要があるため、できるだけ高速に翻訳したいとする要求が従来からある。このことは地図注記だけでなく、住所リスト、電話帳リスト、顧客リストなどに記載された、例えば、地名、名称、名前、住所といった多数の名詞句の羅列を翻訳する場合にも同様に生じている課題である。 As described above, since map annotation needs to be repeatedly translated, there is a conventional request to translate as fast as possible. This also occurs when translating a large number of noun phrases such as place names, names, names, addresses, etc., not only in map notes but also in address lists, phone book lists, customer lists, etc. It is a problem.

以上のことに鑑み、この発明は、大量に存在する名詞句の羅列データを、従来の機械翻訳装置に比べて飛躍的に速く、且つ、正確に翻訳できるようにすることを目的とする。 In view of the above, an object of the present invention is to make it possible to translate a large amount of noun phrase enumeration data much faster and more accurately than conventional machine translation devices.

上記課題を解決するため、請求項１に記載の発明の翻訳装置は、
翻訳対象言語に応じて、検索する名詞句の種類の異なる複数の翻訳辞書の中から使用する辞書を選択して抽出し、その使用順序を定めて並べ替え、名詞句の読み仮名の長いもの順に辞書内容を並べた翻訳用辞書を作成する辞書作成手段と、
翻訳対象の１以上の名詞句と当該１以上の名詞句のそれぞれの読み仮名とを有する読み仮名付与済みデータを、他の読み仮名付与済みデータと区別可能な態様で１つの変数に格納する格納手段と、
前記辞書作成手段により作成された前記翻訳用辞書の辞書データを１件ずつ読み込み、前記変数に格納されている前記読み仮名付与済みデータの翻訳対象の名詞句を、前記翻訳用辞書の前記辞書データを用いて目的とする言語に全文置換する全文置換手段と
を備えることを特徴とする。 In order to solve the above problem, a translation device according to claim 1 is provided:
Select and extract the dictionary to be used from multiple translation dictionaries with different types of noun phrases to be searched according to the language to be translated. A dictionary creation means for creating a dictionary for translation of the dictionary contents;
Storage that stores reading-kana-added data having one or more noun phrases to be translated and the reading-kana of each of the one or more noun phrases in a variable that is distinguishable from other reading-kana-giving data. Means,
The dictionary data of the dictionary for translation created by the dictionary creating means is read one by one, and the noun phrase to be translated of the data with the kana added stored in the variable is converted to the dictionary data of the dictionary for translation. And a full-text replacement means for replacing the full-text into the target language.

この請求項１に記載の発明の翻訳装置によれば、辞書作成手段は、翻訳対象言語に応じて翻訳用辞書を作成する。格納手段は、読み仮名付与済みデータを、他の読み仮名付与済みデータと区別可能な態様で１つの変数に格納する。全文置換手段は、辞書作成手段により作成された前記翻訳用辞書の辞書データを１件ずつ読み込み、格納手段によって所定の変数に格納されている読み仮名付与済みデータの名詞句を、読み込んだ辞書の辞書データを用いて目的とする言語に全文置換する。 According to the translation device of the first aspect of the present invention, the dictionary creating means creates the translation dictionary according to the language to be translated. The storage means stores the reading-kana-added data in one variable in a manner distinguishable from other reading-kana-giving data. The full-text replacement means reads the dictionary data of the dictionary for translation created by the dictionary creation means one by one, and the noun phrase of the reading-kana-given data stored in a predetermined variable by the storage means Full-text replacement with the target language using dictionary data.

これにより、翻訳対象の読み仮名付与済みデータを所定の変数に格納して、格納された多数の読み仮名付与済みデータの名詞句を一括して処理対象とすることができる。そして、翻訳用辞書の辞書データを１件ずつ読み込んで、読み仮名が一致する辞書データを用いて、翻訳対象の読み仮名付与済みデータの名詞句を目的とする言語に置換するといった従来に無い手法で名詞句の翻訳ができる。この場合、読み仮名の１つ１つに基づいて翻訳辞書を参照する場合に比べて、大幅に翻訳に掛かる処理時間を短縮できる。 As a result, it is possible to store the translation-ready kana-given data in a predetermined variable and collectively process the stored many noun phrases of the read-kana-given data. Then, the dictionary data of the dictionary for translation is read one by one, and the noun phrase of the read-named kana-given data to be translated is replaced with the target language by using the dictionary data having the same reading-kana. Can translate noun phrases. In this case, the processing time required for translation can be greatly shortened as compared with the case where the translation dictionary is referred to based on each reading kana.

この発明によれば、大量に存在する名詞句の羅列データを、従来の機械翻訳装置に比べて飛躍的に速く、且つ、正確に翻訳できる。 According to the present invention, enumerated data of a large number of noun phrases can be translated significantly faster and more accurately than conventional machine translation devices.

実施の形態の翻訳装置の構成例と動作の概要を説明するためのブロック図である。It is a block diagram for demonstrating the example of a structure of the translation apparatus of embodiment, and the outline | summary of operation | movement. 翻訳対象データファイル１１１の格納データ（辞書データ）の例を説明するための図である。It is a figure for demonstrating the example of the storage data (dictionary data) of the translation object data file. 読み仮名付与済みデータファイル１１３の格納データの例を説明するための図である。It is a figure for demonstrating the example of the stored data of the reading pseudonymized data file 113. FIG. 使用辞書作成部１１４の処理について説明するためのブロック図である。It is a block diagram for demonstrating the process of the use dictionary creation part. 全文置換処理部１１６で行われる第１の処理について説明するための図である。It is a figure for demonstrating the 1st process performed in the full text replacement process part. 全文置換処理部１１６において実行される変数への読み仮名の格納処理プログラムの一例を示す図である。It is a figure which shows an example of the storage process program of the reading pseudonym to the variable performed in the full text replacement process part. 各読み仮名付与済みデータの読み仮名が、変数＄ｊｐに格納された状態の例を示す図である。It is a figure which shows the example of the state by which the reading pseudonym of each reading pseudonym assignment | providing data was stored in variable $ jp. 全文置換処理部１１６で行われる第２の処理について説明するための図である。It is a figure for demonstrating the 2nd process performed in the full text replacement process part. 全文置換処理部１１６において実行される変数＄ｊｐに格納された読み仮名を目的とする言語に全文置換するプログラムの一例を示す図である。It is a figure which shows an example of the program which substitutes the full text to the target language for the reading pseudonym stored in the variable $ jp performed in the full text replacement processing part. 翻訳用使用辞書ファイル１１５に用意される翻訳用使用辞書データの例を説明するための図である。It is a figure for demonstrating the example of the use dictionary data for translation prepared in the use dictionary file for translation. 図１０に示した辞書により、変数＄ｊｐの格納データを全文置換処理した場合の状態を示す図である。It is a figure which shows the state at the time of carrying out the full-text replacement process for the storage data of variable $ jp with the dictionary shown in FIG. 翻訳済みデータファイル１１７の格納データの例を説明するための図である。It is a figure for demonstrating the example of the stored data of the translated data file. 実施の形態の翻訳装置で行われる翻訳対象データに対して読み仮名を付与する処理を説明するためのフローチャートである。It is a flowchart for demonstrating the process which provides a reading pseudonym with respect to the translation object data performed with the translation apparatus of embodiment. 実施の形態の翻訳装置で行われる地図注記の翻訳処理を説明するためのフローチャートである。It is a flowchart for demonstrating the translation process of the map note performed with the translation apparatus of embodiment.

以下、図を参照しながら、この発明の装置、方法、プログラムの一実施の形態について説明する。この発明は、地図注記、電話帳リスト、顧客リストなどに記載された地名、名称、名前、住所といった、大量に存在する名詞句の羅列データについて翻訳を行う場合に適用可能なものである。「名詞句」との文言は、物体、物質、人物、場所など具体的な対象を示す語句を意味する。この明細書において、翻訳対象となる「名詞句の羅列」の具体例として、地図注記、電話帳リスト、顧客リストなどに記載された、例えば、地名、名称、名前、住所といったものを想定している。 Hereinafter, embodiments of the apparatus, method, and program of the present invention will be described with reference to the drawings. The present invention can be applied to translation of enumerated data of a large number of noun phrases such as place names, names, names, and addresses described in map notes, telephone directory lists, customer lists, and the like. The term “noun phrase” means a phrase indicating a specific object such as an object, a substance, a person, or a place. In this specification, as a specific example of “a list of noun phrases” to be translated, for example, a place name, a name, a name, an address described in a map note, a phone book list, a customer list, etc. Yes.

以下においては、地図注記の翻訳を行う場合を例にして具体的に説明する。地図注記は、上述もしたように大量に存在し、頻繁に更新されるため、その更新の都度、翻訳し直さなければならないものである。また、近年おいては、外国から日本を訪れる観光客やビジネスマンも増えてきており、最新の地図の地図注記を所定の外国語に翻訳して表した外国語対応の最新の地図を迅速に提供することが望まれている。しかも、外国語は、中国語（簡体）、中国語（繁体）、英語、韓国語などというように、複数の言語に対応することが求められる。 In the following, a specific description will be given by taking as an example the case of translating a map note. Map annotations exist in large quantities as described above, and are frequently updated. Therefore, map annotations must be re-translated each time they are updated. In recent years, the number of tourists and business people visiting Japan from abroad has increased, and the latest maps for foreign languages, which are translated from the map annotations of the latest maps into the specified foreign languages, can be quickly displayed. It is hoped to provide. Moreover, foreign languages are required to correspond to a plurality of languages such as Chinese (simplified), Chinese (traditional), English, and Korean.

このような背景の下、以下に説明する実施の形態の装置、方法、プログラムは、大量に存在する地図注記を、従来の機械翻訳装置に比べて、飛躍的に速く、しかも適切に、且つ、複数の言語に翻訳することを実現している。 Under such a background, the apparatus, method, and program according to the embodiments described below are capable of converting a large amount of map annotations significantly faster and more appropriately than a conventional machine translation apparatus, and It has been translated into multiple languages.

［ビッグデータ翻訳装置の構成例］
まず、この実施の形態のビッグデータ翻訳装置（以下、単に翻訳装置と記載する。）の構成例と動作の概要について説明する。図１は、この実施の形態の翻訳装置の構成例と動作の概要を説明するためのブロック図である。なお、図１において、翻訳対象データＦ１１１などの記載における文字「Ｆ」は、ファイル（file）の略称として用いている。 [Configuration example of big data translation device]
First, a configuration example and an outline of operation of a big data translation apparatus (hereinafter simply referred to as a translation apparatus) of this embodiment will be described. FIG. 1 is a block diagram for explaining a configuration example and an outline of operation of the translation apparatus according to this embodiment. In FIG. 1, the letter “F” in the description of the translation target data F111 or the like is used as an abbreviation for a file.

図１に示す翻訳装置において、制御部１０１は、当該翻訳装置の各部を制御する機能を実現し、記憶装置１０２は情報記憶保持機能を実現し、操作部１０３は、ユーザーインターフェース機能を実現する。 In the translation apparatus shown in FIG. 1, the control unit 101 realizes a function for controlling each unit of the translation apparatus, the storage device 102 realizes an information storage holding function, and the operation unit 103 realizes a user interface function.

読み仮名辞書群１０４は、固有名詞読み仮名辞書、一般名詞読み仮名辞書、接頭語読み仮名辞書、接尾語読み仮名辞書、カタカナ語読み仮名辞書などの複数の読み仮名辞書を備え、日本語の地図注記データに対する読み仮名を提供する。なお、図示していないが、接頭語読み仮名辞書には、固有名詞接頭語読み仮名辞書、一般名詞接頭語読み仮名辞書、カタカナ語接頭語読み仮名辞書がある。同様に、接尾語読み仮名辞書には、固有名詞接尾語読み仮名辞書、一般名詞接尾語読み仮名辞書、カタカナ語接尾語読み仮名辞書がある。 The reading kana dictionary group 104 includes a plurality of reading kana dictionaries such as a proper noun reading kana dictionary, a general noun reading kana dictionary, a prefix reading kana dictionary, a suffix reading kana dictionary, and a katakana reading kana dictionary, and a Japanese map. Provides reading kana for note data. Although not shown, the prefix reading kana dictionary includes a proper noun prefix reading kana dictionary, a general noun prefix reading kana dictionary, and a katakana prefix reading kana dictionary. Similarly, the suffix reading kana dictionary includes a proper noun suffix reading kana dictionary, a general noun suffix reading kana dictionary, and a katakana suffix reading kana dictionary.

また、固有名詞読み仮名辞書、一般名詞読み仮名辞書、カタカナ語読み仮名辞書のそれぞれは、部分一致により読み仮名の置換が可能なものである。また、各接頭語読み仮名辞書は、接頭語の読み仮名を提供するものであるので、前方一致により読み仮名の置換が可能なものである。また、各接尾語読み仮名辞書は、接尾語の読み仮名を提供するものであるので、後方一致により読み仮名の置換が可能なものである。 Each of the proper noun reading kana dictionary, the general noun reading kana dictionary, and the katakana reading kana dictionary can replace the reading kana by partial matching. Each prefix reading kana dictionary provides the reading kana of the prefix, so that the reading kana can be replaced by front matching. Each suffix reading kana dictionary provides a reading kana for the suffix, so that the reading kana can be replaced by backward matching.

言語別翻訳辞書群１０５は、この実施の形態では、日本語の地図注記データの読み仮名を、目的とする外国語に翻訳するための言語別の翻訳辞書を備える。この実施の形態では、中国語（簡体）、中国語（繁体）、英語、韓国語の４言語に対応するため、４言語の翻訳辞書を備える。更に、各言語の翻訳辞書は、大きく分けると、一般名詞に関する辞書やカタカナ語に関する辞書を備える。一般名詞に関する辞書は、一般名詞接尾語辞書、一般名詞接頭語辞書、一般名詞辞書からなる。カタカナ語に関する辞書は、カタカナ語固有名詞接尾語辞書、カタカナ語固有名詞接頭語辞書、カタカナ語固有一般名詞辞書、カタカナ語一般名詞接尾語辞書、カタカナ語一般名詞接頭語辞書、カタカナ語一般名詞辞書からなる。 In this embodiment, the language-specific translation dictionary group 105 includes language-specific translation dictionaries for translating reading kana of Japanese map note data into a target foreign language. In this embodiment, a four-language translation dictionary is provided to support four languages: Chinese (simplified), Chinese (traditional), English, and Korean. Furthermore, the translation dictionaries for each language are roughly divided into a dictionary for general nouns and a dictionary for katakana. The dictionary for general nouns consists of a general noun suffix dictionary, a general noun prefix dictionary, and a general noun dictionary. The dictionary for Katakana is Katakana proper noun suffix dictionary, Katakana proper noun prefix dictionary, Katakana proper noun dictionary, Katakana general noun suffix dictionary, Katakana general noun prefix dictionary, Katakana general noun dictionary Consists of.

また、一般名詞辞書、カタカナ語固有名詞辞書、カタカナ語一般名詞辞書のそれぞれは、部分一致により読み仮名などに対応する翻訳データ（翻訳語句）の置換が可能なものである。また、各接頭語辞書は、接頭語に対応する翻訳データを提供するものであるので、前方一致により接頭語の翻訳データの置換が可能なものである。また、各接尾語辞書は、接尾語の翻訳データを提供するものであるので、後方一致により接尾語の翻訳データの置換が可能なものである。 Each of the general noun dictionary, the katakana proper noun dictionary, and the katakana general noun dictionary can replace translation data (translation words) corresponding to reading kana and the like by partial matching. Each prefix dictionary provides translation data corresponding to the prefix, so that prefix translation data can be replaced by prefix matching. Also, each suffix dictionary provides suffix translation data, so that suffix translation data can be replaced by backward matching.

翻訳対象データファイル１１１は、翻訳対象となる多数の地図注記データを保持する。図２は、翻訳対象データファイル１１１の格納データの例を説明するための図である。図２に示すように、翻訳対象データファイル１１１には、「エリアコード、属性番号、地図注記データ」からなる翻訳対象データが多数保持されている。この翻訳対象データファイル１１１には、例えば、図示しない外部インターフェイスなどを通じて外部機器から提供された翻訳対象データなどが格納される。 The translation object data file 111 holds a large number of map note data to be translated. FIG. 2 is a diagram for explaining an example of data stored in the translation object data file 111. As shown in FIG. 2, the translation target data file 111 holds a large number of translation target data composed of “area code, attribute number, map annotation data”. The translation target data file 111 stores, for example, translation target data provided from an external device through an external interface (not shown).

読み仮名付与部１１２は、翻訳対象データファイル１１１に格納されている多数の地図注記データのそれぞれについて、これに含まれている漢字部分を、読み仮名辞書群１０４の読み仮名辞書を用いて読み仮名に変換する。読み仮名付与部１１２は、変換した読み仮名を含む、読み仮名付与済みデータを形成して、読み仮名付与済みデータファイル１１３に記録する。読み仮名付与済みデータファイル１１３は、ハードディスクなどの大容量記録媒体に作成され、読み仮名付与部１１２によって形成された読み仮名付与済みデータを蓄積する。 The reading kana giving unit 112 reads the kanji portion included in each of a large number of map annotation data stored in the translation object data file 111 by using the reading kana dictionary of the reading kana dictionary group 104. Convert to The reading kana giving unit 112 forms reading kana-given data including the converted reading kana and records it in the reading kana-given data file 113. The reading-kana-added data file 113 is created on a large-capacity recording medium such as a hard disk, and stores reading-kana-added data formed by the reading-kana adding unit 112.

図３は、読み仮名付与済みデータファイル１１３の格納データの例を説明するための図である。図３に示すように、読み仮名付与済みデータファイル１１３に格納された読み仮名付与済みデータは、「エリアコード、属性番号、地図注記データ、読み仮名」からなるものである。なお、読み仮名は、読み仮名を取得した辞書が異なれば、それらは分離して示されている。すなわち、図３において、「悪石島」は、「悪石」部分の読み仮名は、例えば固有名詞辞書から取得され、「島」部分の読み仮名は、一般名詞接尾語辞書から取得されたために、「あくせきじま」というように、「あくせき」と「じま」がそれぞれ異なる辞書から取得されたことが分かる態様で読み仮名付与済みデータに格納されている。 FIG. 3 is a diagram for explaining an example of data stored in the read-named data file 113. As shown in FIG. 3, the reading-kana-added data stored in the reading-kana-added data file 113 includes “area code, attribute number, map note data, and reading-kana”. In addition, if the dictionary which acquired the reading pseudonym differs, they will be shown separately. That is, in FIG. 3, “Akishijima” is obtained by reading the kana of the “Evilstone” part from the proper noun dictionary, for example, and the kana of the “island” part from the general noun suffix dictionary. Like “Akaseki Jima”, “Akaseki” and “Jima” are stored in the kana-given data in such a manner that it can be seen that they are obtained from different dictionaries.

この実施の形態において、使用辞書作成部１１４は、翻訳対象言語（何語に翻訳するのか）に応じて、翻訳に使用する翻訳用使用辞書を翻訳用使用辞書ファイル１１５に作成する。具体的に使用辞書作成部１１４は、翻訳対象言語に応じて言語別翻訳辞書群１０５の翻訳辞書の中から使用する辞書を選択して抽出する。そして、その使用順序を定めて抽出した辞書を並べ変えて結合し、更に各辞書の内容（辞書データ）を地図注記の読み仮名の長いもの順に並べた翻訳用使用辞書を翻訳用使用辞書ファイル１１５に作成する。 In this embodiment, the use dictionary creation unit 114 creates a translation use dictionary to be used for translation in the translation use dictionary file 115 according to the language to be translated (in which language it is translated). Specifically, the use dictionary creation unit 114 selects and extracts a dictionary to be used from the translation dictionaries in the language-specific translation dictionary group 105 according to the translation target language. Then, the extracted dictionaries are arranged and combined, and the translation use dictionary in which the contents of the dictionaries (dictionary data) are arranged in the order of the longest readings of the map notes is used. To create.

なお、各辞書データには、使用辞書作成部１１４により、その辞書データが属する辞書に応じて、部分一致、前方一致、後方一致のいずれにより検索語の置換が可能なものかを示す情報も付加される。例えば、一般名詞辞書、カタカナ語固有一般名詞辞書、カタカナ語一般名詞辞書の辞書データには部分一致により使用されるものである情報が付加される。また、一般名詞接頭語辞書、カタカナ語固有名詞接頭語辞書、カタカナ語一般名詞接頭語辞書には前方一致により使用されるものである情報が付加される。また、一般名詞接尾語辞書、カタカナ語固有名詞接尾語辞書、カタカナ語一般名詞接尾語辞書の辞書データには後方一致により使用されるものである情報が付加される。 In addition, information indicating whether the search word can be replaced by partial match, forward match, or backward match is added to each dictionary data according to the dictionary to which the dictionary data belongs by the use dictionary creation unit 114. Is done. For example, information used for partial matching is added to the dictionary data of the general noun dictionary, the Katakana proper noun dictionary, and the Katakana general noun dictionary. In addition, information that is used by forward matching is added to the general noun prefix dictionary, the katakana proper noun prefix dictionary, and the katakana general noun prefix dictionary. Further, information used for backward matching is added to the dictionary data of the general noun suffix dictionary, the katakana proper noun suffix dictionary, and the katakana general noun suffix dictionary.

全文置換処理部１１６は、大容量メモリ１１６Ｍを備え、大きく以下の２つの処理を行う。第１の処理は、読み仮名付与済みデータファイル１１３の読み仮名付与済みデータを読み込んで、読み仮名付与済みデータごとにセパレータを付しながら、全読み仮名付与済みデータを１つの変数に格納する処理を行う。第２の処理は、使用辞書作成部１１４が作成した翻訳用使用辞書ファイル１１５の辞書データを１件ずつ読み込み、上記の変数に格納した読み仮名付与済みデータの読み仮名を辞書データの翻訳データで置換する処理を行う。 The full-text replacement processing unit 116 includes a large-capacity memory 116M and performs the following two processes. The first process reads the reading-kana-given data in the reading-kana-given data file 113, and stores all the reading-kana-given data in one variable while adding a separator to each reading-kana-given data. I do. In the second process, the dictionary data in the translation use dictionary file 115 created by the use dictionary creation unit 114 is read one by one, and the reading pseudonym of the reading-kana-given data stored in the variables is converted into the translation data of the dictionary data. Perform the replacement process.

これにより、全文置換処理部１１６は、読み仮名付与済みデータの地図注記データに対する翻訳データを含む翻訳済みデータを形成し、これを翻訳済みデータファイル１１７に記録する。翻訳済みデータファイル１１７は、ハードディスクなどの大容量記録媒体に作成され、全文置換処理部１１６によって形成された翻訳済みデータを蓄積する。 As a result, the full-text replacement processing unit 116 forms translated data including translation data for the map annotation data of the reading-kana-added data, and records this in the translated data file 117. The translated data file 117 is created on a large-capacity recording medium such as a hard disk, and stores translated data formed by the full-text replacement processing unit 116.

このように、この実施の形態の翻訳装置は、全文置換処理部１１６の機能により、読み仮名付与済みデータファイル１１３の全読み仮名付与済みデータを所定の変数に格納することにより、全読み仮名付与済みデータをメモリ１１６Ｍに格納して、一括して処理の対象にする。そして、整えた翻訳用使用辞書ファイル１１５の辞書データを１件ずつ読み込んで、メモリ１１６Ｍ上の全ての読み仮名付与済みデータを対象にして、翻訳用使用辞書ファイル１１５の翻訳データを付与する処理を行う。換言すれば、メモリ１１６Ｍ上の全ての読み仮名付与済みデータの各読み仮名を、翻訳用使用辞書ファイル１１５の翻訳データにより全文置換するようにしている。これにより、数多く存在する地図注記の超高速な翻訳を可能にしている。 As described above, the translation apparatus according to this embodiment stores the full-reading kana-given data in the reading-kana-given data file 113 in a predetermined variable by the function of the full-text replacement processing unit 116, thereby giving the full-reading kana The stored data is stored in the memory 116M and is collectively processed. Then, processing is performed for reading the prepared dictionary data in the translation use dictionary file 115 one by one and assigning the translation data in the translation use dictionary file 115 to all the read-named data on the memory 116M. Do. In other words, all the reading pseudonyms of all the reading pseudonym-added data on the memory 116M are replaced by the full text with the translation data of the translation use dictionary file 115. This enables ultra-high-speed translation of many existing map notes.

以下、この実施の形態の翻訳装置の主要部である、使用辞書作成部１１４において行われる処理と、全文置換処理部１１６において行われる処理の詳細について説明する。 Hereinafter, the details of the processing performed in the use dictionary creation unit 114 and the processing performed in the full-text replacement processing unit 116, which are the main parts of the translation apparatus of this embodiment, will be described.

［使用辞書作成部１１４の処理］
図４は、使用辞書作成部１１４の処理について説明するためのブロック図である。上述もし、また、図４にも示すように、言語別翻訳辞書群１０５には、言語別に、例えば、一般名詞接尾語辞書、一般名詞接頭語辞書、一般名詞辞書、カタカナ語固有名詞接尾語辞書、カタカナ語固有名詞接頭語辞書、カタカナ語固有一般名詞辞書、カタカナ語一般名詞接尾語辞書、カタカナ語一般名詞接頭語辞書、カタカナ語一般名詞辞書が用意されている。 [Process of Use Dictionary Creation Unit 114]
FIG. 4 is a block diagram for explaining the processing of the use dictionary creation unit 114. As described above and as shown in FIG. 4, the language-specific translation dictionary group 105 includes, for example, a general noun suffix dictionary, a general noun prefix dictionary, a general noun dictionary, a katakana proper noun suffix dictionary for each language. , A Katakana proper noun prefix dictionary, a Katakana proper noun dictionary, a Katakana general noun suffix dictionary, a Katakana general noun prefix dictionary, and a Katakana general noun dictionary are prepared.

このため、使用辞書作成部１１４は、翻訳対象言語に応じて、言語別翻訳辞書群１０５の翻訳辞書の中から使用する辞書を選択して抽出する。そして、その使用順序を定めて辞書を並べ変えて結合し、更に各辞書の内容（辞書データ）を、地図注記の読み仮名の長いもの順に並べた翻訳用使用辞書を翻訳用使用辞書ファイル１１５に作成する。なお、地図注記の読み仮名の長いもの順に並べるのは、最長一致法により、読み仮名付与済みデータの名詞句を置換するためである。また、各辞書データには、上述もしたように、その辞書データが属する辞書に応じて、部分一致、前方一致、後方一致のいずれにより検索語の置換が可能なものかを示す情報も付加される。 Therefore, the use dictionary creation unit 114 selects and extracts a dictionary to be used from the translation dictionaries in the language-specific translation dictionary group 105 according to the translation target language. Then, the use order is determined and the dictionaries are rearranged and combined. Further, the translation use dictionary in which the contents of the dictionaries (dictionary data) are arranged in the order of the longest readings of the map notes are stored in the translation use dictionary file 115. create. The reason why the map annotations are arranged in the order of the longest reading kana is to replace the noun phrases in the data having been given the reading kana by the longest match method. In addition, as described above, information indicating whether a search word can be replaced by partial match, forward match, or backward match is added to each dictionary data according to the dictionary to which the dictionary data belongs. The

また、辞書の選択や並び順は、翻訳対象言語ごとの優先度や必要性に応じて行う。辞書の選択や並び順の一例について説明する。韓国語は表音文字であり、カタカナで表現された日本語は、その音（おん）に対応する韓国語（ハングル文字）に置き換えることができる。このため、翻訳対象言語が韓国語の場合、使用辞書作成部１１４は、カタカナ語の辞書は使用せず、韓国語への翻訳用の一般名詞接尾語辞書、一般名詞接頭語辞書、一般名詞辞書を使用辞書として選択する。 Further, the selection and arrangement order of dictionaries are performed according to the priority and necessity of each language to be translated. An example of dictionary selection and arrangement order will be described. Korean is a phonetic character, and Japanese expressed in katakana can be replaced with a Korean (Hangul character) corresponding to the sound. For this reason, when the language to be translated is Korean, the use dictionary creation unit 114 does not use a katakana dictionary, but a general noun suffix dictionary, a general noun prefix dictionary, a general noun dictionary for translation into Korean. Select as your dictionary.

そして、使用辞書作成部１１４は、選択した辞書の並び順として、例えば、接尾語、接頭語を翻訳した上で、残りの一般名詞を翻訳するために、（１）一般名詞接尾語辞書→（２）一般名詞接頭語辞書→（３）一般名詞辞書の順に並べて結合する。各辞書内は、上述もしたように、地図注記に対応する読み仮名の長いもの順に並べる。このようにして、地図注記を韓国語に翻訳するための翻訳用使用辞書を翻訳用使用辞書ファイル１１５に形成する。 Then, the use dictionary creation unit 114 translates the suffix and prefix as the selected dictionary order, for example, and translates the remaining general nouns by (1) General noun suffix dictionary → ( 2) General noun prefix dictionary → (3) General noun dictionary arranged in order. As described above, each dictionary is arranged in the order of long reading kana corresponding to the map notes. In this manner, a translation use dictionary for translating map notes into Korean is formed in the translation use dictionary file 115.

また、漢字で表された日本語の一般名詞は、中国語でも同じ漢字で表される場合も多い。このため、翻訳対象言語が中国語である場合には、上述した９つの全ての辞書を用いるが、カタカナ語の翻訳を優先させるため、カタカナ語に関する辞書を先に使用し、次に一般名詞に関する辞書を使用するように使用辞書を並べ変えて結合する。各辞書内は、上述もしたように、地図注記に対応する読み仮名の長いもの順に並べる。このようにして、地図注記を中国語に翻訳するための翻訳用使用辞書を翻訳用使用辞書ファイル１１５に形成する。 In addition, Japanese common nouns expressed in Chinese characters are often expressed in the same Chinese characters in Chinese. For this reason, when the language to be translated is Chinese, all the nine dictionaries described above are used. However, in order to prioritize the translation of Katakana, the dictionary related to Katakana is used first, and then the general noun is used. Rearrange the used dictionaries to use dictionaries and combine them. As described above, each dictionary is arranged in the order of long reading kana corresponding to the map notes. In this manner, a translation use dictionary for translating map notes into Chinese is formed in the translation use dictionary file 115.

また、翻訳対象言語が英語の場合には、日本語とは全く異なる言語であるため、上述した９つの全ての辞書を用い、その使用順も最初に一般名詞に関する辞書、次にカタカナ語に関する辞書というように並べて結合することができる。この場合においても、各辞書内は、上述もしたように、地図注記に対応する読み仮名の長いもの順に並べておく。このようにして、地図注記を英語に翻訳するための翻訳用使用辞書を翻訳用使用辞書ファイル１１５に形成する。 In addition, when the translation target language is English, it is a completely different language from Japanese, so all nine of the above-mentioned dictionaries are used, and the order of use is first a dictionary for general nouns and then a dictionary for katakana. Can be combined side by side. Even in this case, the respective dictionaries are arranged in the order of the long reading kana corresponding to the map notes as described above. In this manner, a translation use dictionary for translating map notes into English is formed in the translation use dictionary file 115.

なお、ここで説明した翻訳対象言語ごとの優先度や必要性に応じた辞書の選択や並び順は一例であり、この他にも種々の態様とすることができる。また、上述した９つの辞書以外にも使用する辞書が存在する場合には、その辞書についても考慮することは言うまでもない。また、翻訳対象言語ごとに特別に設けられた辞書が存在する場合もあり、その場合には、その翻訳対象言語が選択された場合には、当該辞書も使用される辞書として選択されることになる。 Note that the selection and arrangement order of dictionaries according to the priority and necessity for each translation target language described here are merely examples, and various other modes can be used. Needless to say, if there are dictionaries to be used other than the nine dictionaries described above, these dictionaries are also considered. In addition, there may be a dictionary specially provided for each translation target language. In that case, when the translation target language is selected, the dictionary is also selected as a dictionary to be used. Become.

［全文置換処理部１１６の処理］
［第１の処理（変数への格納処理）］
図５、図６、図７のそれぞれは、全文置換処理部１１６で行われる第１の処理である読み仮名付与済みデータファイル１１３の全ての読み仮名付与済みデータを１つの変数に格納する処理について説明するための図である。 [Process of Full Text Replacement Processing Unit 116]
[First processing (storage processing to variable)]
Each of FIGS. 5, 6, and 7 relates to a process of storing all reading-kana-given data in the reading-kana-given data file 113 as a first process performed by the full-text replacement processing unit 116 in one variable. It is a figure for demonstrating.

この内、図５は、全文置換処理部１１６で行われる当該第１の処理について説明するための図である。図５に示すように、全文置換処理部１１６で行われる第１の処理は、読み仮名付与済みデータファイル１１３の読み仮名付与済みデータを読み込んで、読み仮名付与済みデータごとにセパレータを付しながら、全読み仮名付与済みデータを１つの変数に格納する。これにより、読み仮名付与済みデータファイル１１３の全ての読み仮名付与済みデータが、全文置換処理部１１６のメモリ１１６Ｍ上に展開される。 Among these, FIG. 5 is a figure for demonstrating the said 1st process performed in the full text replacement process part 116. FIG. As shown in FIG. 5, the first process performed by the full-text replacement processing unit 116 reads the reading-kana-given data in the reading-kana-given data file 113 and attaches a separator to each reading-kana-given data. Then, the fully read pseudonymized data is stored in one variable. As a result, all of the reading-kana-added data in the reading-kana-added data file 113 is expanded on the memory 116M of the full-text replacement processing unit 116.

図６は、全文置換処理部１１６において実行される読み仮名付与済みデータの変数への格納プログラムの一例を示す図である。図６に示すプログラムにより、図３を用いて説明したように、「エリアコード、属性番号、地図注記、読み仮名」からなる読み仮名付与済みデータのそれぞれは、他の読み仮名付与済みデータと分離可能な態様で変数＄ｈｅａｄ［＄ｉ］に格納される。更に、読み仮名付与済みデータのそれぞれの読み仮名だけが変数＄ｊｐに格納される。これら変数に格納された読み仮名付与済みデータと、当該読み仮名付与済みデータの読み仮名とは、同じ順番で各変数に格納されるので、そのそれぞれの対応関係は維持される。 FIG. 6 is a diagram illustrating an example of a program for storing the reading-kana-given data that is executed in the full-text replacement processing unit 116. As described with reference to FIG. 3, the program shown in FIG. 6 separates each reading-kana-assigned data composed of “area code, attribute number, map note, and reading-kana” from other reading-kana-giving data. Stored in variable $ head [$ i] in a possible manner. Furthermore, only each reading pseudonym of the reading-kana-given data is stored in the variable $ jp. Since the reading-kana-added data stored in these variables and the reading-kana of the reading-kana-added data are stored in each variable in the same order, the corresponding relationship is maintained.

図７は、図３に示した読み仮名付与済みデータファイル１１３の各読み仮名付与済みデータの読み仮名が、変数＄ｊｐに格納された状態を示す図である。図６に示したプログラムを実行することにより、各読み仮名付与済みデータの読み仮名だけが、他の読み仮名と分離可能な態様で変数＄ｊｐに格納される。これにより、各読み仮名付与済みデータの各読み仮名がメモリ１１６Ｍ上に格納され、一括して処理の対象にできる。 FIG. 7 is a diagram showing a state where the reading pseudonym of each reading pseudonym-assigned data of the reading pseudonym assigned data file 113 shown in FIG. 3 is stored in the variable $ jp. By executing the program shown in FIG. 6, only the reading kana of each reading kana-given data is stored in the variable $ jp in a manner separable from other reading kana. As a result, each reading-kana of each reading-kana-given data is stored on the memory 116M and can be processed at once.

［第２の処理（全文置換処理）］
図８、図９、図１０、図１１のそれぞれは、全文置換処理部１１６で行われる第２の処理である翻訳用使用辞書ファイル１１５の辞書データを用いて、上述したように変数＄Ｊｐに格納された読み仮名を目的とする言語に置換する処理について説明するための図である。 [Second processing (full-text replacement processing)]
8, 9, 10, and 11, as described above, the variable $ Jp is set in the variable $ Jp using the dictionary data of the translation use dictionary file 115, which is the second process performed by the full-text replacement processing unit 116. It is a figure for demonstrating the process which replaces the stored reading pseudonym with the target language.

この内、図８は、全文置換処理部１１６で行われる当該第２の処理について説明するための図である。図８に示すように、全文置換処理部１１６で行われる第２の処理は、翻訳用使用辞書ファイル１１５の辞書データを１件ずつ読み込み、変数＄ｊｐに格納されている読み仮名付与済みデータの読み仮名を当該辞書データの翻訳データに置換する処理を行う。そして、変数＄ｈｅａｄ［＄ｉ］に格納された読み仮名付与済みデータと、これに対する変数＄ｊｐの置換された翻訳データとを対応付けて翻訳済みデータを形成し、これを翻訳済みデータファイル１１７に記録する。これにより、翻訳対象の地図注記を、目的とする翻訳対象言語に翻訳できる。 Among these, FIG. 8 is a figure for demonstrating the said 2nd process performed in the full-text replacement process part 116. FIG. As shown in FIG. 8, the second process performed by the full-text replacement processing unit 116 reads the dictionary data of the translation use dictionary file 115 one by one and stores the reading-kana-added data stored in the variable $ jp. Processing to replace the reading pseudonym with the translation data of the dictionary data is performed. Then, the kana-given data stored in the variable $ head [$ i] is associated with the translation data in which the variable $ jp is replaced with this to form translated data, which is converted into the translated data file 117. To record. Thereby, the map note to be translated can be translated into the target language to be translated.

図９は、全文置換処理部１１６において実行される変数＄Ｊｐに格納された読み仮名を目的とする言語に全文置換するプログラムの一例を示す図である。図９に示すプログラムは、後方一致により用いられる辞書の辞書データを用いては、後方一致により置換処理を行う。また、当該プログラムは、前方一致により用いられる辞書の辞書データを用いては、前方一致により置換処理を行う。同様に、当該プログラムは、部分一致により用いられる辞書の辞書データを用いては、部分一致により置換処理を行う。 FIG. 9 is a diagram showing an example of a program that replaces the full text with the target language of the reading pseudonym stored in the variable $ Jp, which is executed by the full text replacement processing unit 116. The program shown in FIG. 9 performs replacement processing by backward matching using dictionary data of a dictionary used by backward matching. In addition, the program uses the dictionary data of the dictionary used for forward matching to perform replacement processing by forward matching. Similarly, the program performs replacement processing by partial matching using dictionary data of a dictionary used by partial matching.

そして、図９に示すプログラムは、置換処理終了後においては、変数＄ｈｅａｄ［＄ｉ］に格納された読み仮名付与済みデータと、これに対する変数＄ｊｐの置換された翻訳データとを対応付けて翻訳済みデータを形成し、これを出力する。実際には、形成された翻訳済みデータは翻訳済みデータファイル１１７に記録される。このようにして、翻訳対象の地図注記が目的とする翻訳対象言語に翻訳される。 Then, after the replacement process is completed, the program shown in FIG. 9 associates the read-named data stored in the variable $ head [$ i] with the translation data in which the variable $ jp is replaced. Form translated data and output it. Actually, the formed translated data is recorded in the translated data file 117. In this way, the map note to be translated is translated into the target language to be translated.

ここで、図７に示したように、変数＄ｊｐに格納された翻訳対象の地図注記に対応する読み仮名を、英語に翻訳する場合を例にして、全文置換処理部１１６で行われる第２の処理である全文置換処理について具体的に説明する。 Here, as shown in FIG. 7, the second sentence performed by the full-text replacement processing unit 116, taking as an example the case where the reading kana corresponding to the map note to be translated stored in the variable $ jp is translated into English. The full-text replacement process, which is the above process, will be specifically described.

図１０は、変数＄ｊｐに格納された翻訳対象の地図注記に対応する読み仮名を、英語に翻訳する場合に、翻訳用使用辞書ファイル１１５に用意される翻訳用使用辞書データの例を説明するための図である。ここでは、説明を簡単にするため、図１０に示すように、辞書Ａ〜Ｅの５つの辞書が使用する辞書として選択されて抽出され、辞書Ａ〜Ｅの順番で並べられたものとする。そして、辞書Ａ〜Ｅのそれぞれには、辞書データが１つだけ存在しているものとする。 FIG. 10 illustrates an example of the translation use dictionary data prepared in the translation use dictionary file 115 when the kana corresponding to the map note to be translated stored in the variable $ jp is translated into English. FIG. Here, in order to simplify the description, it is assumed that five dictionaries A to E are selected and extracted as dictionaries to be used and arranged in the order of dictionaries A to E, as shown in FIG. It is assumed that only one dictionary data exists in each of the dictionaries A to E.

図１１は、図１０に示した辞書により、変数＄ｊｐの格納データを全文置換処理した場合の状態を示す図である。図１１（Ａ）は、図７に示した内容と同じであり、変数＄ｊｐに、３つの地図注記データの読み仮名「あくせきじま」、「ひずえじま」、「ヲウボウかはん」が格納されている場合を示している。 FIG. 11 is a diagram showing a state when the full text replacement processing is performed on the stored data of the variable $ jp using the dictionary shown in FIG. FIG. 11 (A) is the same as that shown in FIG. 7, and in the variable $ jp, three readings of map annotation data “Akase Jima”, “Hizue Jima”, “Owaku Kahan” "Is stored.

そして、全文置換処理部１１６は、この例の場合、まず、翻訳用使用辞書ファイル１１５から、辞書Ａの辞書データを読み込み、部分一致により、変数＄ｊｐの格納データの全文置換を行う。すなわち、全文置換処理部１１６は、辞書Ａの辞書データに応じて、「ヲウボウ」を「Ｗｏｂｏｕ」に置換せよという命令を出す。ここで、「ヲウボウ」は、名詞句の任意の位置にある「ヲウボウ」を置換の対象にすることを意味する。この場合、図１１（Ｂ）に示すように、３番目の名詞句の読み仮名の「ヲウボウ」が、辞書Ａの辞書データの翻訳データ「Ｗｏｂｏｕ」に置換される。 In this example, the full text replacement processing unit 116 first reads the dictionary data of the dictionary A from the translation use dictionary file 115, and performs full text replacement of the stored data of the variable $ jp by partial matching. That is, full-text replacement processing unit 116 issues an instruction to replace “Wobo” with “Wobou” according to the dictionary data of dictionary A. Here, “Wow” means that “Wow” in any position of the noun phrase is to be replaced. In this case, as shown in FIG. 11 (B), the reading name “Wobo” of the third noun phrase is replaced with the translation data “Wobo” of the dictionary data of the dictionary A.

なお、この例において、全文置換処理部１１６が出す命令において、部分一致、前方一致、後方一致の区別は、置換対象の読み仮名に付される「，（カンマ）」の位置により判別できる。すなわち、前方一致により置換を指示する場合には、「，ヲウボウ」となり、後方一致で置換を指示する場合には、「ヲウボウ，」となる。また、部分一致で置換を指示する場合には、「ヲウボウ」となり、「，（カンマ）」は付されないものとなる。 In this example, in the command issued by the full-text replacement processing unit 116, the distinction between partial match, forward match, and backward match can be determined by the position of “, (comma)” added to the reading target kana. That is, when the replacement is instructed by the front matching, “, BACK” is indicated, and when the replacement is instructed by the backward matching, the “OUT,” is indicated. Further, when the replacement is instructed by partial matching, “NO” is given and “, (comma)” is not added.

次に、全文置換処理部１１６は、翻訳用使用辞書ファイル１１５から、辞書Ｂの辞書データを読み込み、前方一致により、変数＄ｊｐの格納データの全文置換を行う。すなわち、全文置換処理部１１６は、辞書Ｂの辞書データに基づいて、「，あくせき」を「Ａｋｕｓｅｋｉ」に置換せよという命令を出す。ここで、「，あくせき」は、名詞句の前方にある「あくせき」を置換の対象にすることを意味する。この場合、図１１（Ｃ）に示すように、１番目の名詞句の読み仮名の前側（前方）に位置する「あくせき」が、辞書Ｂの辞書データの翻訳データ「Ａｋｕｓｅｋｉ」に置換される。 Next, the full-text replacement processing unit 116 reads the dictionary data of the dictionary B from the translation use dictionary file 115 and performs full-text replacement of the stored data of the variable $ jp by forward matching. That is, the full-text replacement processing unit 116 issues an instruction to replace “, Akuseki” with “Akuseki” based on the dictionary data of the dictionary B. Here, “, akuseki” means that “akuseki” in front of the noun phrase is to be replaced. In this case, as shown in FIG. 11C, “akuseki” located in front of (in front of) the reading of the first noun phrase is replaced with translation data “Akuseki” of the dictionary data of the dictionary B. .

次に、全文置換処理部１１６は、翻訳用使用辞書ファイル１１５から、辞書Ｃの辞書データを読み込み、後方一致により、変数＄ｊｐの格納データの全文置換を行う。すなわち、全文置換処理部１１６は、辞書Ｃの辞書データに基づいて、「かはん，」を「Ｒｉｖｅｒｓｉｄｅ」に置換せよという命令を出す。ここで、「かはん，」は、名詞句の後方にある「かはん」を置換の対象にすることを意味する。この場合、図１１（Ｄ）に示すように、３番目の名詞句の読み仮名の後側（後方）に位置する「かはん」が、辞書Ｃの辞書データの翻訳データ「ＲｅｖｅｒＳｉｄｅ」に置換される。 Next, the full text replacement processing unit 116 reads the dictionary data of the dictionary C from the translation use dictionary file 115 and performs full text replacement of the stored data of the variable $ jp by backward matching. That is, the full-text replacement processing unit 116 issues an instruction to replace “KAHAN,” with “River side” based on the dictionary data of the dictionary C. Here, “Kahan,” means that “Kahan” behind the noun phrase is to be replaced. In this case, as shown in FIG. 11 (D), “Kahan” located behind (behind) the reading noun of the third noun phrase becomes the translation data “Reverse Side” of the dictionary data of the dictionary C. Replaced.

次に、全文置換処理部１１６は、翻訳用使用辞書ファイル１１５から、辞書Ｄの辞書データを読み込み、前方一致により、変数＄ｊｐの格納データの全文置換を行う。すなわち、全文置換処理部１１６は、辞書Ｄの辞書データに基づいて、「，ひずえ」を「Ｈｉｚｕｅ」に置換せよという命令を出す。ここで、「，ひずえ」は、名詞句の前方にある「ひずえ」を置換の対象にすることを意味する。この場合、図１１（Ｅ）に示すように、２番目の名詞句の読み仮名の前側（前方）に位置する「ひずえ」が、辞書Ｄの辞書データの翻訳データ「Ｈｉｚｕｅ」に置換される。 Next, the full text replacement processing unit 116 reads the dictionary data of the dictionary D from the translation use dictionary file 115 and performs full text replacement of the stored data of the variable $ jp by forward matching. That is, the full-text replacement processing unit 116 issues an instruction to replace “, haze” with “Hizue” based on the dictionary data of the dictionary D. Here, “, hizue” means that “hizue” in front of the noun phrase is to be replaced. In this case, as shown in FIG. 11E, “hizue” located in front of (in front of) the reading of the second noun phrase is replaced with the translation data “Hizue” of the dictionary data of the dictionary D. .

最後に、全文置換処理部１１６は、翻訳用使用辞書ファイル１１５から、辞書Ｅの辞書データを読み込み、後方一致により、変数＄ｊｐの格納データの全文置換を行う。すなわち、全文置換処理部１１６は、辞書Ｅの辞書データに基づいて、「じま，」を「Ｉｓｌａｎｄ」に置換せよという命令を出す。ここで、「じま，」は、名詞句の後方にある「じま」を置換の対象にすることを意味する。この場合、図１１（Ｆ）に示すように、１番目と２番目の２つの名詞句の読み仮名の後側（後方）に位置する「じま」が、辞書Ｅの辞書データの翻訳データ「Ｉｓｌａｎｄ」に置換される。このようにして、翻訳用使用辞書ファイル１１５に作成された翻訳用使用辞書の辞書データにより、変数＄ｊｐ内の読み仮名を翻訳データに置き換える全文置換が行われる。 Finally, the full text replacement processing unit 116 reads the dictionary data of the dictionary E from the translation use dictionary file 115 and performs full text replacement of the stored data of the variable $ jp by backward matching. That is, the full-text replacement processing unit 116 issues an instruction to replace “Jima,” with “Island” based on the dictionary data of the dictionary E. Here, “Jima,” means that “Jima” behind the noun phrase is to be replaced. In this case, as shown in FIG. 11 (F), “Jima” located at the back side (back) of the first and second noun phrases is “translated data“ of the dictionary data of the dictionary E “ Island ”. In this way, full-text replacement is performed in which the reading pseudonym in the variable $ jp is replaced with the translation data by using the dictionary data of the translation usage dictionary created in the translation usage dictionary file 115.

そして、全文置換処理部１１６は、上述もしたように、変数＄ｈｅａｄ［＄ｉ］に格納された読み仮名付与済みデータと、これに対する変数＄ｊｐの置換された翻訳データとを対応付けて翻訳済みデータを形成し、これを翻訳済みデータファイル１１７に記録する。図１２は、翻訳済みデータファイル１１７の格納データの例を説明するための図である。図１２に示すように、翻訳済みデータファイル１１７の格納データである翻訳済みデータは、「エリアコード、属性番号、地図注記、読み仮名、翻訳データ」からなる。このようにして、翻訳対象データファイル１１１の各翻訳対象データに対して、対応する翻訳データを付加した翻訳済みデータが、翻訳済みデータファイル１１７に得られる。 Then, as described above, the full-text replacement processing unit 116 translates the kana-given data stored in the variable $ head [$ i] in association with the translation data in which the variable $ jp is replaced. Complete data is recorded and recorded in the translated data file 117. FIG. 12 is a diagram for explaining an example of data stored in the translated data file 117. As shown in FIG. 12, the translated data that is stored data of the translated data file 117 is composed of “area code, attribute number, map note, reading kana, translation data”. In this way, translated data obtained by adding corresponding translation data to each translation target data in the translation target data file 111 is obtained in the translated data file 117.

なお、図１１に示した例においては、名詞句を構成する語句が２つしかないために、部分一致の例と前方一致の例が同じになっている。しかし、本来の部分一致置換は、例えば「○○映画村」という翻訳対象データが存在し、「置換方式＝部分一致，翻訳対象＝映画，翻訳データ＝ｍｏｖｉｅ」という辞書データで翻訳することが指示されたとする。この場合には、「映画」を「ｍｏｖｉｅ」に置換せよという命令が出される。この命令により、「○○映画村」が、「○○ｍｏｖｉｅ村」というように、「映画」が「ｍｏｖｉｅ」に置き換えられるのが本来の部分一致による置換である。 In the example shown in FIG. 11, since there are only two words constituting the noun phrase, the partial match example and the forward match example are the same. However, the original partial match replacement has, for example, translation target data “XX movie village”, and it is instructed to translate with dictionary data “substitution method = partial match, translation target = movie, translation data = movie”. Suppose that In this case, an instruction to replace “movie” with “movie” is issued. By this command, “movie movie village” is replaced with “movie movie village”, and “movie” is replaced with “movie” in the original partial match replacement.

［実施の形態の翻訳装置での翻訳処理のまとめ］
次に、この実施の形態の翻訳装置で行われる地図注記の翻訳処理について、図１３、図１４のフローチャートを参照しながらまとめる。この実施の形態の翻訳装置で行われる処理は、大きく分けると翻訳対象データである地図注記データへの読み仮名の付与処理と、読み仮名が付与された地図注記データの翻訳処理とに分けることができる。読み仮名が付与された地図注記データは、地図注記データを種々の言語に翻訳する場合に共通して利用することができるため、地図注記データに変更が生じない限り、１回だけ行えばよい。これに対して、地図注記の翻訳処理は、翻訳対象言語がことなれば別途実行する必要のある処理である。 [Summary of Translation Processing in Translation Device of Embodiment]
Next, the map annotation translation processing performed by the translation apparatus of this embodiment will be summarized with reference to the flowcharts of FIGS. The processing performed in the translation apparatus of this embodiment can be broadly divided into reading kana assignment processing for map annotation data that is data to be translated and translation processing for map annotation data to which reading kana is given. it can. Since the map note data to which the reading pseudonym is assigned can be used in common when the map note data is translated into various languages, it may be performed only once unless the map note data is changed. On the other hand, the map annotation translation process is a process that needs to be executed separately if the language to be translated is different.

［読み仮名の付与処理］
図１３は、この実施の形態の翻訳装置で行われる翻訳対象データに対して読み仮名を付与する処理を説明するためのフローチャートである。この図１３のフローチャートに示す処理は、当該翻訳装置の制御部１０１において実行され、制御部１０１が読み仮名付与部１１２を制御することによって行われる。 [Yomikana assignment process]
FIG. 13 is a flowchart for explaining a process of assigning a reading pseudonym to translation target data performed by the translation apparatus of this embodiment. The processing shown in the flowchart of FIG. 13 is executed by the control unit 101 of the translation apparatus, and is performed by the control unit 101 controlling the reading pseudonym assignment unit 112.

当該翻訳装置の制御部１０１は、操作部１０３を通じて当該翻訳装置の使用者からの読み仮名の付与処理の実行指示を受け付けると、図１３に示す翻訳処理を実行する。この場合、制御部１０１は、読み仮名付与部１１２を制御し、翻訳対象データファイル１１１に蓄積されている各翻訳対象データの地図注記に対して、読み仮名辞書群１０４の各辞書を用いて、読み仮名を付与する処理を行う（ステップＳ１０１）。 When the control unit 101 of the translation apparatus receives an instruction to execute the reading pseudonym assignment process from the user of the translation apparatus through the operation unit 103, the control unit 101 executes the translation process illustrated in FIG. In this case, the control unit 101 controls the reading kana giving unit 112 to use each dictionary of the reading kana dictionary group 104 for the map annotation of each translation target data stored in the translation target data file 111. Processing for assigning a reading pseudonym is performed (step S101).

このステップＳ１０１の処理において読み仮名付与部１１２は、図２に示した翻訳対象データファイル１１１の各翻訳対象データに対して、その地図注記に対する読み仮名を付与して、図３に示した読み仮名付与済みデータを作成する。ここで作成された読み仮名付与済みデータは、読み仮名付与済みデータファイル１１３に蓄積される。 In the process of step S101, the reading pseudonym assigning unit 112 assigns a reading pseudonym for the map note to each translation target data in the translation target data file 111 shown in FIG. 2, and the reading pseudonym shown in FIG. Create granted data. The created reading-kana-added data is stored in the reading-kana-given data file 113.

このようにして、翻訳対象データファイル１１１の全ての翻訳対象データに対応する読み仮名付与済みデータが、読み仮名付与済みデータファイル１１３に格納されると、この図１３に示す処理は終了する。 In this way, when the reading pseudonymized data corresponding to all the translation target data in the translation target data file 111 is stored in the reading pseudonymized data file 113, the processing shown in FIG.

［地図注記の翻訳処理］
図１４は、この実施の形態の翻訳装置で行われる地図注記の翻訳処理を説明するためのフローチャートである。この図１４のフローチャートに示す処理は、当該翻訳装置の制御部１０１において実行され、制御部１０１が使用辞書作成部１１４と、全文置換処理部１１６とを制御することによって行われる。 [Translation processing of map notes]
FIG. 14 is a flowchart for explaining a map annotation translation process performed by the translation apparatus of this embodiment. The processing shown in the flowchart of FIG. 14 is executed by the control unit 101 of the translation apparatus, and is performed by the control unit 101 controlling the use dictionary creation unit 114 and the full-text replacement processing unit 116.

当該翻訳装置の制御部１０１は、操作部１０３を通じて当該翻訳装置の使用者からの翻訳処理の実行指示を受け付けると、図１３に示す翻訳処理を実行する。まず、制御部１０１は、操作部１０３を通じて使用者からの翻訳対象言語の選択入力を受け付ける（ステップＳ２０１）。 When receiving a translation processing execution instruction from the user of the translation device through the operation unit 103, the control unit 101 of the translation device executes the translation processing illustrated in FIG. First, the control unit 101 receives a selection input of a language to be translated from the user through the operation unit 103 (step S201).

次に、制御部１０１は、使用辞書作成部１１４を制御し、選択された翻訳対象言語に応じた翻訳用使用辞書を、翻訳用使用辞書ファイル１１５に作成する処理を行う（ステップＳ２０２）。このステップＳ２０２の処理は、上述もしたように、使用辞書作成部１１４が、翻訳対象言語に応じて、翻訳に使用する辞書を言語別翻訳辞書群１０５から選択して抽出する。そして、使用辞書作成部１１４が、抽出した辞書をその使用順序を定めて並べ替える共に、地図注記の読み仮名の長いもの順に辞書データを並べた翻訳用使用辞書を翻訳用使用辞書ファイル１１５に作成する。 Next, the control unit 101 controls the use dictionary creation unit 114 to perform a process of creating a translation use dictionary corresponding to the selected translation target language in the translation use dictionary file 115 (step S202). In the process of step S202, as described above, the use dictionary creation unit 114 selects and extracts a dictionary to be used for translation from the language-specific translation dictionary group 105 according to the translation target language. Then, the use dictionary creation unit 114 rearranges the extracted dictionaries in the order of use, and creates a use dictionary for translation in the use dictionary file 115 for translation in which dictionary data is arranged in the order of the longest readings of map notes. To do.

次に、制御部１０１は、全文置換処理部１１６を制御して、読み仮名付与済みデータファイル１１３に格納されている読み仮名付与済みデータを、所定の変数に格納する処理を行う（ステップＳ２０３）。このステップＳ２０３の処理は、図５〜図７を用いて説明したように、変数＄ｈｅａｄ［＄ｉ］に読み仮名付与済みデータを、変数＄ｊｐに読み仮名付与済みデータの読み仮名を格納する処理である。 Next, the control unit 101 controls the full-text replacement processing unit 116 to perform processing for storing the reading-kana-added data stored in the reading-kana-added data file 113 in a predetermined variable (step S203). . In the process of step S203, as described with reference to FIG. 5 to FIG. 7, the read-named data is stored in the variable $ head [$ i], and the read-named data of the read-named data is stored in the variable $ jp. It is processing.

この後、制御部１０１は、全文置換処理部１１６を制御して、翻訳用使用辞書ファイル１１５の辞書データを１件ずつ読み込み、変数＄ｊｐに格納した読み仮名を辞書データの翻訳データで全文置換する処理を行う（ステップＳ２０４）。このステップＳ２０４の処理は、図８〜図１１を用いて説明した処理であり、変数＄ｊｐに格納された地図注記の読み仮名は、翻訳用使用辞書ファイル１１５に格納されている各翻訳用使用辞書データの翻訳データで全文置換される。 Thereafter, the control unit 101 controls the full text replacement processing unit 116 to read the dictionary data in the translation use dictionary file 115 one by one, and replaces the reading pseudonym stored in the variable $ jp with the translation data of the dictionary data. The process which performs is performed (step S204). The processing in step S204 is the processing described with reference to FIGS. 8 to 11, and the reading kana of the map note stored in the variable $ jp is used for each translation stored in the translation usage dictionary file 115. Full text is replaced with translation data of dictionary data.

そして、制御部１０１は、更に全文置換処理部１１６を制御し、全文置換処理部１１６が、翻訳済みデータを作成し、これを翻訳済みデータファイル１１７に記録する処理を行う（ステップＳ２０５）。このステップＳ２０５において作成される翻訳済みデータは、変数＄ｈｅａｄ［＄ｉ］に格納された読み仮名付与済みデータと、これに対する変数＄ｊｐの置換された翻訳データとを対応付けて形成されるものである。このステップＳ２０５の処理の後、この図１４に示す翻訳処理は終了する。これにより、翻訳済みデータファイル１１７には、図１２に示した態様の翻訳済みデータが蓄積される。 The control unit 101 further controls the full-text replacement processing unit 116, and the full-text replacement processing unit 116 creates translated data and records it in the translated data file 117 (step S205). The translated data created in step S205 is formed by associating the reading-kana-added data stored in the variable $ head [$ i] with the translated data in which the variable $ jp is replaced with this. It is. After the processing in step S205, the translation processing shown in FIG. 14 ends. As a result, the translated data in the form shown in FIG. 12 is accumulated in the translated data file 117.

なお、上述もしたように、図１２に示した例の翻訳済みデータは、地図注記を英語に翻訳したものである。従って、図１４に示した翻訳処理において、ステップＳ２０１で目的とする言語を翻訳対象言語として選択することにより、地図注記を目的とする言語に翻訳した翻訳済みデータが作成できる。 As described above, the translated data in the example shown in FIG. 12 is a map note translated into English. Therefore, in the translation process shown in FIG. 14, by selecting the target language as the translation target language in step S201, translated data translated into the target language for the map annotation can be created.

［中国語への翻訳の場合の特例処理］
上述した実施の形態では、中国語（簡体）、中国語（繁体）、英語、韓国語の４言語に翻訳することを想定した。当該４言語のどの言語に翻訳する場合にも、この発明を適用することが可能である。しかし、中国語（簡体）、中国語（繁体）に翻訳する場合であって、日本語の地図注記の漢字部分については、これに付与された読み仮名を用いずに、単漢字変換により翻訳が可能である。 [Special handling for translation into Chinese]
In the embodiment described above, it is assumed that translation is performed into four languages of Chinese (simplified), Chinese (traditional), English, and Korean. The present invention can be applied to any of the four languages. However, when translating into Chinese (simplified) and Chinese (traditional), the kanji part of the map note in Japanese can be translated by single kanji conversion without using the reading kana given to it. Is possible.

すなわち、日本語の漢字は、中国語（簡体）、中国語（繁体）と共通に用いることができるものも多く、通常、日本語の漢字に対して、対応する中国語の簡体文字、中国語の繁体文字が決まる。このため、日本語の地図注記の漢字部分は、読み仮名を用いずに、単漢字変換により翻訳する。簡単には、日本語の漢字をそのまま用いるようにしてもよいし、日本語の漢字から中国語の簡体文字、中国語の繁体文字を特定して、これを翻訳結果として用いるようにしてもよい。なお、日本語のかたかなやひらがなについては、上述したこの発明によって翻訳できる。 That is, there are many Japanese kanji characters that can be used in common with Chinese (simplified) and Chinese (traditional). Usually, the corresponding Chinese simplified characters, Chinese Traditional characters are determined. For this reason, the kanji portion of the Japanese map note is translated by single kanji conversion without using the reading kana. For simplicity, Japanese Kanji characters may be used as they are, or Simplified Chinese characters and Traditional Chinese characters may be identified from Japanese Kanji characters and used as translation results. . Note that Japanese Katakana and Hiragana can be translated by the above-described invention.

［実施の形態の効果］
上述した実施の形態の翻訳装置によれば、大量に存在する地図注記を、従来の機械翻訳装置に比べて飛躍的に速く、且つ、正確に翻訳できる。 [Effect of the embodiment]
According to the translation apparatus of the embodiment described above, a large amount of map notes can be translated significantly faster and more accurately than conventional machine translation apparatuses.

また、地図注記について読み仮名を付与し、この付与した読み仮名を翻訳する構成としているので、実際の読み方に即した翻訳ができる。不自然な翻訳となることが無い。 Moreover, since the reading kana is given to the map note, and the given reading kana is translated, the translation according to the actual reading can be performed. There is no unnatural translation.

そして、数百万件の地図注記を翻訳する場合、従来は翻訳に数週間かかっていたが、これを数時間に短縮することができた。 And when translating millions of map notes, it used to take weeks to translate, but it could be reduced to hours.

［エリアコードと属性番号の利用］
なお、上述した実施の形態においては、図３を用いて説明したように、読み仮名付与済みデータファイル１１３の各格納データには、エリアコードと属性番号とが付されている。このため、翻訳用変数はエリア別、属性別で用意する。そして、翻訳対象データに読み仮名を付与した読み仮名付与済みデータを、各変数に振り分けて格納し、これを結合することにより、エリア別、属性別に、読み仮名付与済みデータを格納した１つの変数が形成できる。この場合、エリアコードと属性番号とが、各読み仮名付与済みデータのヘッダ情報となる。 [Use of area code and attribute number]
In the above-described embodiment, as described with reference to FIG. 3, each storage data of the reading pseudonymized data file 113 is assigned an area code and an attribute number. For this reason, translation variables are prepared for each area and attribute. Then, the reading-kana-assigned data with the reading-kana added to the data to be translated is distributed to each variable and stored, and by combining them, one variable storing the reading-kana-added data by area and attribute Can be formed. In this case, the area code and the attribute number serve as header information of each reading-kana given data.

また、上述した実施の形態では、言語別翻訳辞書群の各翻訳辞書には、エリアコードや属性番号が無い場合を例にして説明したが、これに限るものではない。実際には、辞書データもエリア別、属性別に編成される。すなわち、言語別翻訳辞書群の各翻訳辞書が有する辞書データのそれぞれにも、翻訳対象データの場合と同様にして、エリアコードと属性番号とが付与されている。同じ文字からなる地図注記でも、エリアが異なれば読み方が違う場合も有れば、同じ文字からなる地図注記でも、属性が異なれば読み方が違う場合もあるためである。前者の例としては、「新宿」との記載は、「しんじゅく」と読む地名もあれば、「あらじゅく」と読む地名もある。また、後者の例としては、「浅草」と記載されていても、地名では「あさくさ」と読ませ、寺院の名称としては「せんそう」と読ませるといった例がある。 In the above-described embodiment, the case where each translation dictionary in the language-specific translation dictionary group has no area code or attribute number has been described as an example. However, the present invention is not limited to this. In practice, dictionary data is also organized by area and attribute. That is, the area code and the attribute number are assigned to each dictionary data included in each translation dictionary in the language-specific translation dictionary group as in the case of the translation target data. This is because even a map note made up of the same characters may be read differently if the area is different, and a map note made up of the same characters may be read differently if the attributes are different. As an example of the former, the description of “Shinjuku” includes a place name that reads “Shinjuku” and a place name that reads “Arajuku”. As an example of the latter, there is an example in which “Asakusa” is read as the place name, and “Senso” is read as the temple name, even though “Asakusa” is described.

そこで、使用辞書作成部１１４は、翻訳用使用辞書を作成するに当り、各翻訳辞書の各辞書データが備えるエリアコードと属性番号をも含めて辞書データを構成する。そして、各辞書の辞書データは、エリアコード順であって、属性番号順で、且つ読み仮名の文字数の長いもの順に並べる。そして、エリアコードと、属性番号と、読み仮名とが一致した場合にのみ、読み仮名付与済みデータの読み仮名を、翻訳用使用辞書ファイル１１５の辞書データの翻訳データで置換する。このようにすることによって、エリアや属性が異なることによって読み仮名が違う同じ表記の地図注記に対しても、その適切な読み仮名に応じた適切な翻訳結果を確実に付与できる。 Therefore, the use dictionary creation unit 114 constructs dictionary data including an area code and an attribute number included in each dictionary data of each translation dictionary when creating the translation use dictionary. The dictionary data of each dictionary is arranged in order of area code, in order of attribute number, and in descending order of the number of characters in the reading kana. Then, only when the area code, the attribute number, and the reading kana match, the reading kana of the reading kana given data is replaced with the translation data of the dictionary data in the translation use dictionary file 115. By doing in this way, the appropriate translation result according to the appropriate reading pseudonym can be reliably provided also to the map notation of the same notation with different readings and kana depending on an area and an attribute.

また、エリアコードと、読み仮名とが一致した場合にのみ、読み仮名付与済みデータの読み仮名を、翻訳用使用辞書ファイル１１５の辞書データの翻訳データで置換することもできる。同様に、属性番号と、読み仮名とが一致した場合にのみ、読み仮名付与済みデータの読み仮名を、翻訳用使用辞書ファイル１１５の辞書データの翻訳データで置換することもできる。前者の場合には、例えば、翻訳対象が住所などのように、属性番号を有しないデータを翻訳する場合に適用できる。後者の場合には、単一のエリアの地図注記を翻訳する場合など、翻訳対象のデータが属するエリアが単一のエリアに限られている場合に適用できる。 In addition, only when the area code and the reading kana match, the reading kana of the data to which the reading kana has been added can be replaced with the translation data of the dictionary data in the translation use dictionary file 115. Similarly, only when the attribute number and the reading kana match, the reading kana of the reading-kana-given data can be replaced with the translation data of the dictionary data in the translation use dictionary file 115. In the former case, for example, the present invention can be applied when translating data that does not have an attribute number such as an address to be translated. The latter case can be applied when the area to which the data to be translated belongs is limited to a single area, such as when a map note of a single area is translated.

［変形例など］
なお、上述した実施の形態では、地図注記データを翻訳する場合を例にして説明したが、これに限るものではない。例えば、電話帳リスト、顧客リスト、住所リストといった多数の名前、住所などの名詞句の羅列が含まれるものの当該多数の名詞句の羅列を翻訳する場合に、この発明を適用できる。 [Modifications, etc.]
In the above-described embodiment, the case where the map annotation data is translated has been described as an example. However, the present invention is not limited to this. For example, the present invention can be applied to a case where a large number of noun phrases such as a phone book list, a customer list, and an address list are included, but a large number of noun phrases are translated.

また、上述した実施の形態では、地図注記データを翻訳する場合の翻訳対象言語には、例えば、中国語（簡体）、中国語（繁体）、英語、韓国語の４言語に対応可能なものとして説明したが、これに限るものではない。目的とする言語の翻訳辞書を用意するだけで、地図注記などの名詞句の羅列を種々の言語に翻訳できる。 In the above-described embodiment, the language to be translated when the map annotation data is translated is, for example, capable of supporting four languages: Chinese (simplified), Chinese (traditional), English, and Korean. Although explained, it is not limited to this. A list of noun phrases such as map notes can be translated into various languages simply by preparing a translation dictionary for the target language.

また、読み仮名が付与されて形成された読み仮名付与済みデータは、各言語に翻訳する場合に共通に利用できるので、図１３に示した翻訳対象データに対して読み仮名を付与する処理は、地図注記に変更が無い限り、１回だけ行えばよい。 Moreover, since the reading-kana-added data formed by adding the reading-kana can be used in common when translated into each language, the process of giving the reading-kana to the translation target data shown in FIG. As long as there are no changes to the map notes, this only needs to be done once.

また、上述した実施の形態では、使用辞書作成部１１４が、言語別翻訳辞書群１０５から翻訳対象言語に応じて、必要となる辞書を選択して抽出し、それらの並び順を整えた翻訳用使用辞書を翻訳用使用辞書ファイル１１５に作成するものとして説明した。しかし、これに限るものではない。必要となる辞書を、必要となる順番で、シーケンシャルに読み出すようにしてももちろんよい。この場合、使用辞書作成部１１４は、全文置換処理部１１６に対して、使用する辞書とその使用順を指示するものとなる。 In the above-described embodiment, the use dictionary creation unit 114 selects and extracts necessary dictionaries from the language-specific translation dictionary group 105 according to the language to be translated, and arranges the arrangement order thereof. The description has been made assuming that the use dictionary is created in the use dictionary file 115 for translation. However, it is not limited to this. Of course, the necessary dictionaries may be read sequentially in the required order. In this case, the use dictionary creation unit 114 instructs the full text replacement processing unit 116 to use the dictionary and the order of use.

また、図２などに示した地図注記データは、その一例として簡単なものを示したが、実際には、漢字、カタカナ、ひらがな、英数字記号が混在する場合がある。このため、漢字やカタカナの一般名詞については、上述した方式で翻訳を行う。また、漢字の固有名詞については、その読み仮名をローマ字変換する。また、カタカナを韓国語に翻訳する場合には、音（おん）に応じて韓国に置き換える。英数字記号は、何もせずにそのまま用いる。というように、ハイブリッドな翻訳が行われる。 In addition, the map annotation data shown in FIG. 2 and the like is shown as a simple example, but in reality, kanji, katakana, hiragana, and alphanumeric symbols may be mixed. For this reason, kanji and katakana common nouns are translated in the manner described above. In addition, for kanji proper nouns, their kana are converted to Roman characters. If katakana is translated into Korean, it is replaced with Korean according to the sound. Use alphanumeric characters as they are without doing anything. Thus, a hybrid translation is performed.

［その他］
上述した実施の形態の説明からも分かるように、請求項に記載した翻訳装置の翻訳辞書は、実施の形態の翻訳装置の言語別翻訳辞書群の各翻訳辞書に対応し、また、請求項に記載した翻訳装置の辞書作成手段は、実施の形態の翻訳装置の使用辞書作成部１１４に対応している。また、請求項に記載した翻訳装置の読み仮名辞書は、実施の形態の翻訳装置の読み仮名辞書群１０４の各読み仮名辞書に対応し、請求項に記載した翻訳装置の読み仮名付与手段は、実施の形態の翻訳装置の読み仮名付与部１１２に対応している。 [Others]
As can be seen from the description of the above-described embodiment, the translation dictionary of the translation device described in the claims corresponds to each translation dictionary of the language-specific translation dictionary group of the translation device of the embodiment. The dictionary creation means of the translation device described corresponds to the use dictionary creation unit 114 of the translation device of the embodiment. Moreover, the reading-kana dictionary of the translation apparatus described in the claim corresponds to each reading-kana dictionary of the reading-kana dictionary group 104 of the translation apparatus according to the embodiment. This corresponds to the reading pseudonym assigning unit 112 of the translation apparatus according to the embodiment.

また、請求項に記載した翻訳装置の格納手段、全文置換手段のそれぞれは、実施の形態の翻訳装置の全文置換処理部１１６に対応している。 Each of the storage unit and the full text replacement unit of the translation device described in the claims corresponds to the full text replacement processing unit 116 of the translation device of the embodiment.

また、図１４を用いて説明した翻訳処理は、この発明の翻訳方法の一実施の形態が適用されたものであり、図１４を用いて説明した翻訳処理を実行するプログラムは、この発明の翻訳プログラムの一実施の形態が適用されたものである。また、図１に示した読み仮名付与部１１２、使用辞書作成部１１４、全文置換処理部１１６の各機能は、制御部１０１で実行されるプログラムにより、制御部１０１の機能として実現することもできる。 The translation process described with reference to FIG. 14 is the one to which an embodiment of the translation method of the present invention is applied. The program for executing the translation process described with reference to FIG. An embodiment of the program is applied. 1 may be realized as a function of the control unit 101 by a program executed by the control unit 101. The reading-kana adding unit 112, the use dictionary creation unit 114, and the full-text replacement processing unit 116 illustrated in FIG. .

１０１…制御部、１０２…記憶装置、１０３…操作部、１０４…読み仮名辞書群、１０５…言語別翻訳辞書群、１１１…使用辞書作成部、１１２…翻訳用使用辞書ファイル、１１３…翻訳対象データファイル、１１４…読み仮名付与部、１１５…読み仮名付与済みデータファイル、１１６…全文置換処理部、１１６Ｍ…メモリ、１１７…翻訳済みデータファイル DESCRIPTION OF SYMBOLS 101 ... Control part, 102 ... Memory | storage device, 103 ... Operation part, 104 ... Reading kana dictionary group, 105 ... Translation dictionary group according to language, 111 ... Use dictionary creation part, 112 ... Use dictionary file for translation, 113 ... Translation object data File: 114 ... Yomikana adding unit, 115 ... Yomikana-added data file, 116 ... Full text replacement processing unit, 116M ... Memory, 117 ... Translated data file

Claims

Select and extract the dictionary to be used from multiple translation dictionaries with different types of noun phrases to be searched according to the language to be translated. A dictionary creation means for creating a dictionary for translation of the dictionary contents;
Storage that stores reading-kana-added data having one or more noun phrases to be translated and the reading-kana of each of the one or more noun phrases in a variable that is distinguishable from other reading-kana-giving data. Means,
The dictionary data of the dictionary for translation created by the dictionary creating means is read one by one, and the noun phrase to be translated of the data with the kana added stored in the variable is converted to the dictionary data of the dictionary for translation. A translation device comprising: full-text replacement means for replacing the full-text with a target language using

The translation device according to claim 1,
The translation apparatus, wherein the plurality of translation dictionaries include a plurality of dictionaries having different search word matching methods for front matching, backward matching, and partial matching.

A translation device according to claim 1 or 2, wherein
Attribute information indicating the attribute of the noun phrase included in the reading-kana-given data is added to the reading-kana-giving data stored in the variable,
The dictionary data of the translation dictionary is associated with attribute information indicating the attribute of the noun phrase included in the dictionary data,
The full-text replacing means converts the noun phrase of the reading-kana-given data stored in the variable into a target language using dictionary data of the translation dictionary having the same attribute information as the reading-kana-given data. A translation device characterized by replacing.

A translation device according to claim 1 or 2, wherein
Area information indicating a region to which a location relating to the noun phrase included in the reading pseudonymized data belongs is added to the reading pseudonymized data stored in the variable,
Area information indicating a region to which a place relating to the noun phrase included in the dictionary data belongs is added to the dictionary data of the translation dictionary,
The full-text replacement means converts the noun phrase of the reading-kana-given data stored in the variable into a target language using dictionary data of the translation dictionary having the same area information as the reading-kana-given data. A translation device characterized by replacing.

A translation device according to claim 1 or 2, wherein
The phonetic affixed data stored in the variable includes attribute information indicating attributes of the noun phrase included in the phonetic affixed data, and a place related to the noun phrase included in the phonetic affixed data. Area information indicating the region to which it belongs,
In the dictionary data of the translation dictionary, attribute information indicating the attribute of the noun phrase included in the dictionary data, and area information indicating a region to which a place related to the noun phrase included in the dictionary data belongs, are added.
The full-text replacement means uses the dictionary data of the translation dictionary having the same attribute information and area information as the reading-kana-given data for the noun phrase of the reading-kana-giving data stored in the variable. A translation device characterized in that it is replaced with a language.

The dictionary creation means selects and extracts dictionaries to be used from multiple translation dictionaries with different types of noun phrases to be searched according to the language to be translated, determines the order of use, rearranges them, and reads noun phrases A dictionary creation step for creating a dictionary for translation in which the dictionary contents are arranged in the order of long kana;
One variable in a manner in which the storage means can distinguish the reading-kana-added data having one or more noun phrases to be translated and the respective reading-kana of the one or more noun phrases from other reading-kana-giving data. Storing process to store in,
Full-text replacement means reads the dictionary data of the dictionary for translation created by the dictionary creation means one by one, and translates the noun phrase to be translated of the reading-kana given data stored in the variable And a full-text replacing step of replacing the full-text with a target language using the dictionary data of the dictionary.

The computer installed in the translation device
Select and extract the dictionary to be used from multiple translation dictionaries with different types of noun phrases to be searched according to the language to be translated. A dictionary creation means for creating a dictionary for translation of the dictionary contents;
Storage that stores reading-kana-added data having one or more noun phrases to be translated and the reading-kana of each of the one or more noun phrases in a variable that is distinguishable from other reading-kana-giving data. Means,
The dictionary data of the dictionary for translation created by the dictionary creating means is read one by one, and the noun phrase to be translated of the data with the kana added stored in the variable is converted to the dictionary data of the dictionary for translation. A translation program that functions as full-text replacement means for replacing the full-text with a target language using.