JP2003108553A

JP2003108553A - Machine translation device, machine translation method and machine translation program

Info

Publication number: JP2003108553A
Application number: JP2001296556A
Authority: JP
Inventors: Yumiko Yoshimura; 裕美子吉村
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2001-09-27
Filing date: 2001-09-27
Publication date: 2003-04-11
Anticipated expiration: 2021-09-27
Also published as: JP4031921B2

Abstract

PROBLEM TO BE SOLVED: To improve translation accuracy even in e-mail and a Web notice board few in information obtained from an individual document in translation using context information. SOLUTION: The application 10 outputs text data to a related document information acquiring part 6 via a control part 2. The related document information acquiring part 6 makes access to a document data storage part 9, acquires reference series document data related to the text data as reference document information, and outputs the data to a translation environment setting part 7. The translation environment setting part 7 stores information for facilitating a translation among the reference document information in a translation environment storage part 8 as translation environment information. A translation part 3 translates the text data by using knowledge of a dictionary part 4 and the translation environment information of the translation environment storage part 8. The reference document information includes information on a plurality of reference series document data related to the text data, and even when information required for the translation is few in the text data, the translation environment information for facilitating the translation can be obtained by using the reference document information.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、電子メール送受信
ツール、Ｗｅｂブラウザ等において第１言語と第２言語
との間の翻訳を行うものに好適な機械翻訳装置、機械翻
訳方法及び機械翻訳プログラムに関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a machine translation device, a machine translation method, and a machine translation program suitable for translating between a first language and a second language in an electronic mail transmitting / receiving tool, a Web browser, and the like. .

【０００２】[0002]

【従来の技術】近年、パソコンの普及率は高く、インタ
ーネット環境が整備されるに伴って、電子メールやＷｅ
ｂ（world wide web）サイトを利用した海外との情報の
やり取りが盛んになってきた。海外との情報の授受の際
には、言語の相違について考慮する必要がある。機械翻
訳ソフトの翻訳精度も向上してきており、電子メールソ
フトやＷｅｂブラウザソフトから機械翻訳ソフトを呼び
出して、翻訳処理を行うこともできるようになってい
る。また、Ｗｅｂ上で翻訳サービスを行うサイトも増加
している。2. Description of the Related Art In recent years, the popularity of personal computers has been high, and as the Internet environment has been improved, e-mail and We
The exchange of information with foreign countries using b (world wide web) sites has become popular. When exchanging information with foreign countries, it is necessary to consider language differences. The translation accuracy of machine translation software has been improved, and it is also possible to call machine translation software from e-mail software or Web browser software to perform translation processing. Also, the number of sites providing translation services on the Web is increasing.

【０００３】ところで、翻訳精度を向上させるために、
文脈情報を利用した翻訳方法が採用されることがある。
この文書翻訳技術においては、文書全体を前もって解析
して各種情報を抽出し、抽出した内容を翻訳に利用する
と共に、文書の翻訳を進めながら翻訳した文の情報を後
の文の翻訳に利用する手法が採用される。By the way, in order to improve the translation accuracy,
A translation method using context information may be adopted.
In this document translation technology, the entire document is analyzed in advance to extract various information, the extracted contents are used for translation, and the information of the translated sentence is used for translation of the later sentence while the document is being translated. The method is adopted.

【０００４】ところが、電子メールの場合には、通常、
１件の文章の長さが短く、個々の文も簡潔なスタイルの
ものが多いことから、メール全体から情報を得ようとし
ても適切に翻訳するための十分な情報を得ることは困難
である。However, in the case of electronic mail,
Since the length of one sentence is short and each sentence is often in a simple style, it is difficult to obtain sufficient information for proper translation even when trying to obtain information from the entire email.

【０００５】また、最近、インターネット上では情報交
換の場として、Ｗｅｂ上の掲示板システムが利用される
ことがある。この掲示板システムにおいて扱われる文書
サイズも比較的小さく、共通の話題に対する一連のやり
取りの中には、極めて短い記事も存在する。また、掲示
板の構成として、関連した話題に対するやり取りをまと
めて表示するスレッド表示でなく、個々の記事を時系列
的に列挙するタイプのもの（以下、順次表示という）も
ある。このような順次表示タイプの場合には、１ページ
内には複数の話題がランダムに夫々独立して記述され、
個々の関連を有する記事の翻訳に適した情報だけを抽出
することは極めて困難である。Further, recently, a bulletin board system on the Web is sometimes used as a place for exchanging information on the Internet. The document size handled by this bulletin board system is also comparatively small, and an extremely short article exists in a series of exchanges for a common topic. In addition, as a bulletin board configuration, there is also a type of bulletin board in which individual articles are listed in time series (hereinafter referred to as sequential display), instead of a threaded display that collectively displays interactions with related topics. In the case of such a sequential display type, a plurality of topics are randomly and independently described in one page,
It is extremely difficult to extract only the information that is suitable for translating articles having individual relationships.

【０００６】[0006]

【発明が解決しようとする課題】このように、従来、電
子メールやＷｅｂ掲示板等の電子情報の授受について
は、個々の文書内には翻訳処理に利用可能な情報が少な
いことが多く、文脈情報を利用した翻訳方法を適用して
も十分な翻訳精度を得ることができないという問題点が
あった。As described above, in the past, when sending and receiving electronic information such as electronic mails and Web bulletin boards, there is often little information available for translation processing in individual documents, and context information is available. There is a problem that sufficient translation accuracy cannot be obtained even if the translation method using is applied.

【０００７】本発明は、電子メールや電子掲示板等の電
子情報のやり取りの特徴を利用して、翻訳対象の文書以
外の関連を有する文書から翻訳処理に利用可能な情報を
抽出することにより、翻訳精度を向上させることができ
る機械翻訳装置、機械翻訳方法及び機械翻訳プログラム
を提供することを目的とする。The present invention utilizes the characteristics of electronic information exchange such as electronic mail and electronic bulletin boards to extract information that can be used for translation processing from a document having a relationship other than the document to be translated, thereby performing translation. An object is to provide a machine translation device, a machine translation method, and a machine translation program that can improve accuracy.

【０００８】[0008]

【課題を解決するための手段】本発明の請求項１に係る
機械翻訳装置は、複数の文書データが格納された文書デ
ータ保持手段にアクセスして、前記複数の文書データの
うち原文データと関連を有する参照系列の文書データを
参照文書情報として取得する関連文書情報取得手段と、
前記関連文書情報取得手段が取得した参照文書情報から
前記原文データの翻訳に利用可能な翻訳環境情報を取得
して翻訳環境保持手段に保持させる翻訳環境設定手段
と、第１言語と第２言語との翻訳に必要な知識情報を有
する辞書と前記翻訳環境保持手段に保持された翻訳環境
情報とを利用して、前記原文データの翻訳を行う翻訳手
段とを具備したものであり、本発明の請求項１３に係る
機械翻訳方法は、複数の文書データが格納された文書デ
ータ保持手段にアクセスして、前記複数の文書データの
うち原文データと関連を有する参照系列の文書データを
参照文書情報として取得する関連文書情報取得処理と、
前記関連文書情報取得処理において取得した参照文書情
報から前記原文データの翻訳に利用可能な翻訳環境情報
を取得して翻訳環境保持手段に保持させる翻訳環境設定
処理と、第１言語と第２言語との翻訳に必要な知識情報
を有する辞書と前記翻訳環境保持手段に保持された翻訳
環境情報とを利用して、前記原文データの翻訳を行う翻
訳処理とを具備したものであり、本発明の請求項１４に
係る機械翻訳プログラムは、コンピュータに、複数の文
書データが格納された文書データ保持手段にアクセスし
て、前記複数の文書データのうち原文データと関連を有
する参照系列の文書データを参照文書情報として取得す
る関連文書情報取得処理と、前記関連文書情報取得処理
において取得した参照文書情報から前記原文データの翻
訳に利用可能な翻訳環境情報を取得して翻訳環境保持手
段に保持させる翻訳環境設定処理と、第１言語と第２言
語との翻訳に必要な知識情報を有する辞書と前記翻訳環
境保持手段に保持された翻訳環境情報とを利用して、前
記原文データの翻訳を行う翻訳処理とを実行させるもの
である。A machine translation apparatus according to claim 1 of the present invention accesses a document data holding means in which a plurality of document data is stored, and relates to the original text data among the plurality of document data. Related document information acquisition means for acquiring document data of a reference sequence having as reference document information,
A translation environment setting unit that obtains translation environment information that can be used for translation of the original text data from the reference document information obtained by the related document information obtaining unit and stores the translation environment information in the translation environment holding unit, a first language and a second language. The present invention further comprises a translation means for translating the original text data by utilizing a dictionary having knowledge information necessary for translating and translation environment information held by the translation environment holding means. The machine translation method according to Item 13 accesses a document data holding unit in which a plurality of document data is stored, and acquires reference series of document data among the plurality of document data as reference document information that is related to the original text data. Related document information acquisition process,
A translation environment setting process for obtaining translation environment information that can be used for translation of the original text data from the reference document information obtained in the related document information obtaining process, and causing the translation environment holding unit to hold the translation environment information, a first language and a second language. The present invention comprises a translation process for translating the original text data by using a dictionary having knowledge information necessary for translating and translation environment information held by the translation environment holding means. The machine translation program according to Item 14 accesses a document data holding unit in which a plurality of pieces of document data are stored in a computer, and refers to the reference series of document data among the plurality of pieces of document data as a reference document. The related document information acquisition process acquired as information, and the translation available from the reference document information acquired in the related document information acquisition process for translation of the original text data. A translation environment setting process for acquiring environment information and retaining it in the translation environment retaining means, a dictionary having knowledge information necessary for translation between the first language and the second language, and translation environment information retained in the translation environment retaining means. Is used to execute a translation process for translating the original text data.

【０００９】本発明の請求項１において、文書データ保
持手段は、複数の文書データを格納する。関連文書情報
取得手段は、文書データ保持手段に格納されている複数
の文書データのうち原文データと関連を有する参照系列
の文書データを参照文書情報として取得する。これによ
り、参照文書情報は原文データに関連を有する参照系列
の複数の文書データを含んでおり、原文データに翻訳精
度を向上させるために必要な情報が少ない場合でも、参
照文書情報には翻訳精度を向上させるために必要な情報
が多く含まれる。翻訳環境設定手段は、取得された参照
文書情報から原文データの翻訳に利用可能な翻訳環境情
報を取得して翻訳環境保持手段に保持させる。辞書は第
１言語と第２言語との翻訳に必要な知識情報を有する。
翻訳手段は辞書と保持された翻訳環境情報とを利用し
て、原文データの翻訳を行う。In claim 1 of the present invention, the document data holding means stores a plurality of document data. The related document information acquisition means acquires, as reference document information, reference series of document data among the plurality of pieces of document data stored in the document data holding means, which is related to the original text data. As a result, the reference document information includes a plurality of document data of the reference series that are related to the original text data, and even if the original text data has a small amount of information necessary for improving the translation accuracy, the reference document information has the translation accuracy. It contains a lot of information needed to improve The translation environment setting means acquires the translation environment information that can be used to translate the original text data from the acquired reference document information, and causes the translation environment holding means to hold the translation environment information. The dictionary has knowledge information necessary for translating the first language and the second language.
The translation means translates the original text data using the dictionary and the held translation environment information.

【００１０】本発明の請求項１３において、文書データ
保持手段に格納されている複数の文書データのうち原文
データと関連を有する参照系列の文書データが参照文書
情報として取得される。取得された参照文書情報から原
文データの翻訳に利用可能な翻訳環境情報が取得され
て、翻訳環境保持手段に保持される。原文データの翻訳
処理時には、第１言語と第２言語との翻訳に必要な知識
情報を有する辞書だけでなく、翻訳環境保持手段に保持
された翻訳環境情報も利用され、これにより、翻訳精度
が向上する。In the thirteenth aspect of the present invention, the document data of the reference series which is related to the original text data among the plurality of document data stored in the document data holding means is acquired as the reference document information. Translation environment information that can be used to translate the original text data is acquired from the acquired reference document information, and is stored in the translation environment holding means. During the translation process of the original text data, not only the dictionary having the knowledge information necessary for the translation between the first language and the second language but also the translation environment information held in the translation environment holding means is used, whereby the translation accuracy is improved. improves.

【００１１】本発明の請求項１４において、コンピュー
タに、複数の文書データが格納された文書データ保持手
段にアクセスして、複数の文書データのうち原文データ
と関連を有する参照系列の文書データを参照文書情報と
して取得する関連文書情報取得処理と、関連文書情報取
得処理において取得した参照文書情報から原文データの
翻訳に利用可能な翻訳環境情報を取得して翻訳環境保持
手段に保持させる翻訳環境設定処理と、第１言語と第２
言語との翻訳に必要な知識情報を有する辞書と翻訳環境
保持手段に保持された翻訳環境情報とを利用して、原文
データの翻訳を行う翻訳処理とを実行させる。In the fourteenth aspect of the present invention, the computer is made to access the document data holding means in which a plurality of document data is stored, and the document data of the reference series having a relation with the original text data among the plurality of document data is referred to. Related document information acquisition processing acquired as document information and translation environment setting processing for acquiring translation environment information that can be used for translation of original text data from the reference document information acquired in the related document information acquisition processing and holding it in the translation environment holding means. And first language and second
Using a dictionary having knowledge information necessary for translation with a language and the translation environment information held by the translation environment holding means, a translation process for translating the original text data is executed.

【００１２】[0012]

【発明の実施の形態】以下、図面を参照して本発明の実
施の形態について詳細に説明する。図１は本発明の一実
施の形態に係る機械翻訳装置を示すブロック図である。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described below in detail with reference to the drawings. FIG. 1 is a block diagram showing a machine translation device according to an embodiment of the present invention.

【００１３】本実施の形態は、複数の文書の中から、個
々の文書に含まれる情報量が少ない複数の文書につい
て、相互に関連を有する文書同士のチェーン（文書セッ
ト）を収集することで、チェーン内の各文書に含まれる
情報を利用して翻訳を行うことにより、翻訳精度を向上
させるようにしたものである。The present embodiment collects a chain (document set) of mutually related documents from a plurality of documents having a small amount of information contained in each document. By using the information contained in each document in the chain to perform translation, the translation accuracy is improved.

【００１４】本実施の形態においては、例えば、電子メ
ールを介した電子情報の授受、又はＷｅｂサイトを利用
した電子情報の授受に適用した例について説明する。In the present embodiment, for example, an example of application to the exchange of electronic information via electronic mail or the exchange of electronic information using a website will be described.

【００１５】図１において、機械翻訳装置は、装置全体
の制御を行う制御部２、外部との入力及び出力のインタ
ーフェースを行う入力部１及び出力部５、入力部１を介
して入手された入力文書を翻訳するための翻訳部３、翻
訳部３が翻訳時に用いる各種翻訳知識を収納した辞書部
４、関連文書情報取得部６、翻訳環境設定部７及び翻訳
環境記憶部８によって構成されている。In FIG. 1, the machine translation apparatus includes a control section 2 for controlling the entire apparatus, an input section 1 and an output section 5 for interfacing with the outside for input and output, and an input obtained via the input section 1. A translation unit 3 for translating a document, a dictionary unit 4 that stores various translation knowledge used by the translation unit 3 for translation, a related document information acquisition unit 6, a translation environment setting unit 7, and a translation environment storage unit 8. .

【００１６】機械翻訳装置は、各種のアプリケーション
１０によって起動される。図１の機械翻訳装置の各部の
構成はプログラムによっても実現可能である。この場合
には、機械翻訳装置は機械翻訳プログラムに相当し、各
種アプリケーションは、機械翻訳プログラムを起動する
アプリケーションプログラムに相当する。アプリケーシ
ョンプログラムは、機械翻訳プログラムを内部に組込ん
でいてもよい。例えば、アプリケーションプログラムと
しては、電子メール送受信プログラムや電子掲示板プロ
グラム等があり、更に、単体ソフトウェアにかぎらず、
Ｗｅｂブラウザ上で起動するプログラムでもよい。The machine translation device is activated by various applications 10. The configuration of each unit of the machine translation apparatus in FIG. 1 can be realized by a program. In this case, the machine translation device corresponds to a machine translation program, and the various applications correspond to application programs that activate the machine translation program. The application program may have a machine translation program incorporated therein. For example, the application program includes an electronic mail transmission / reception program, an electronic bulletin board program, etc., and is not limited to a stand-alone software,
It may be a program activated on a web browser.

【００１７】文書データ記憶部９は、複数の文書データ
を記憶する。例えば、文書データ記憶部９は、電子メー
ルソフトによって送受信された電子情報、Ｗｅｂアクセ
スによって送受信する電子情報等を記憶する。即ち、文
書データ記憶部９は、アプリケーション１０がアクセス
可能であればよく、例えば、インターネット上のハード
ディスク装置等によって構成されることもある。The document data storage unit 9 stores a plurality of document data. For example, the document data storage unit 9 stores electronic information transmitted and received by electronic mail software, electronic information transmitted and received by Web access, and the like. That is, the document data storage unit 9 has only to be accessible by the application 10, and may be configured by, for example, a hard disk device on the Internet.

【００１８】アプリケーション１０は、図示しない所定
の伝送路を介して送受信する文書データを文書データ記
憶部９に記憶させると共に、文書データ記憶部９に記憶
されている文書データを読み出して利用する。The application 10 causes the document data storage unit 9 to store the document data to be transmitted / received via a predetermined transmission path (not shown), and reads and uses the document data stored in the document data storage unit 9.

【００１９】なお、アプリケーション１０が電子メール
送受信プログラムである場合には、処理する文書データ
を格納する文書データ記憶部９内の場所及び構造等の情
報を、予め後述する関連文書情報取得部６に登録してお
く。また、アプリケーション１０がＷｅｂ上の掲示板シ
ステムを利用するソフトウェア等である場合には、基本
的にはそのページのソース情報から格納場所を取得する
ことができるので、予め登録の必要はない。これによ
り、アプリケーション１０は、文書データ記憶部９内の
各文書データに確実にアクセスすることができるように
なっている。When the application 10 is an electronic mail transmission / reception program, information such as the location and structure in the document data storage unit 9 for storing the document data to be processed is stored in advance in the related document information acquisition unit 6 which will be described later. Register Further, when the application 10 is software or the like that uses a bulletin board system on the Web, basically, the storage location can be acquired from the source information of the page, so there is no need to register in advance. As a result, the application 10 can reliably access each document data in the document data storage unit 9.

【００２０】本実施の形態においては、関連文書情報取
得部６は、後述するように、制御部２に制御されて、翻
訳対象から、各チェーンを構成する各文書を相互に関連
付けている情報（参照関係情報）を取得するようになっ
ている。更に、関連文書情報取得部６は、文書データ記
憶部９にアクセスして、参照関係情報に基づいてチェー
ンを収集し、チェーン内の各文書データを参照文書情報
として取得するようになっている。In the present embodiment, the related document information acquisition unit 6 is controlled by the control unit 2 as will be described later, and information (corresponding to each document constituting each chain from the translation target) is associated with each other. Reference relationship information) is acquired. Further, the related document information acquisition unit 6 accesses the document data storage unit 9, collects chains based on the reference relation information, and acquires each document data in the chain as reference document information.

【００２１】関連文書情報取得部６は取得した参照文書
情報を翻訳環境設定部７に供給する。翻訳環境設定部７
は、後述するように、参照文書情報から翻訳処理に有用
である情報（以下、翻訳環境情報という）を抽出し、抽
出した翻訳環境情報を翻訳環境記憶部８に出力する。翻
訳環境記憶部８は、翻訳環境設定部７が抽出取得した翻
訳環境情報を記憶する。The related document information acquisition unit 6 supplies the acquired reference document information to the translation environment setting unit 7. Translation environment setting section 7
As will be described later, extracts information useful for translation processing (hereinafter referred to as translation environment information) from the reference document information and outputs the extracted translation environment information to the translation environment storage unit 8. The translation environment storage unit 8 stores the translation environment information extracted and acquired by the translation environment setting unit 7.

【００２２】入力部１はアプリケーション１０から送ら
れた翻訳対象文書を制御部２に供給する。制御部２は、
翻訳対象文書が入力されると、関連文書情報取得部６に
対して参照文書情報取得処理を実行させる。The input unit 1 supplies the translation target document sent from the application 10 to the control unit 2. The control unit 2
When the translation target document is input, the related document information acquisition unit 6 is caused to execute the reference document information acquisition process.

【００２３】なお、アプリケーション１０から送られて
くる情報は、文字コードからなる文書そのものでもよい
し、文書名、格納場所情報だけでもよい。文書名及び格
納場所情報のみが入力された場合でも、制御部２は、関
連文書情報取得部６を介して文書データ記憶部９から文
字データからなる原文情報そのものを取得させることが
できる。また、アプリケーション１０から文字コードか
らなる文書そのものが送られてくる場合でも、文字コー
ドデータと共に元データの格納場所情報（又はファイル
情報）をも受理する構成にしてもよいし、文字データに
基づいて関連文書情報取得部６を介して元データ個所を
サーチさせる構成にしてもよい。The information sent from the application 10 may be the document itself consisting of character codes, or only the document name and storage location information. Even if only the document name and the storage location information are input, the control unit 2 can acquire the original text information itself composed of character data from the document data storage unit 9 via the related document information acquisition unit 6. In addition, even when the document itself including the character code is sent from the application 10, the storage location information (or file information) of the original data may be received together with the character code data, or based on the character data. The original data location may be searched through the related document information acquisition unit 6.

【００２４】関連文書情報取得部６は、参照文書情報取
得処理を実行する場合には、以前に取得した情報を再度
取得しないように、翻訳環境記憶部８内の情報を参照す
る。翻訳環境記憶部８は翻訳環境情報の記憶が終了する
と終了信号を制御部２に返す。When the reference document information acquisition processing is executed, the related document information acquisition unit 6 refers to the information in the translation environment storage unit 8 so as not to acquire the previously acquired information again. The translation environment storage unit 8 returns an end signal to the control unit 2 when the storage of the translation environment information is completed.

【００２５】制御部２は、翻訳環境設定処理（翻訳環境
情報の記憶）が終了したことを示す信号を受け取ると、
翻訳部３に翻訳対象文書を送り、翻訳処理を実行させ
る。翻訳部３は辞書部４に格納されている翻訳知識及び
翻訳環境情報を用いて、第１の言語によって記述されて
いる翻訳対象文書を第２言語の文章に変換して制御部２
に出力する。制御部２は、翻訳部３からの翻訳結果を出
力部５を介してアプリケーション１０に送出するように
なっている。When the control unit 2 receives a signal indicating that the translation environment setting process (storage of translation environment information) is completed,
The document to be translated is sent to the translation unit 3 to be translated. The translation unit 3 uses the translation knowledge and the translation environment information stored in the dictionary unit 4 to convert the translation target document described in the first language into a sentence in the second language to control the control unit 2
Output to. The control unit 2 is adapted to send the translation result from the translation unit 3 to the application 10 via the output unit 5.

【００２６】辞書部４は、翻訳部３の翻訳処理に必要な
翻訳知識を記憶している。例えば、翻訳部３が日英・英
日双方向の翻訳処理を行うものとすると、辞書部４は夫
々の翻訳方向毎に以下の構成を有しいてる。The dictionary unit 4 stores the translation knowledge necessary for the translation process of the translation unit 3. For example, assuming that the translation unit 3 performs bilingual translation processing between English and Japanese, the dictionary unit 4 has the following configuration for each translation direction.

【００２７】4-1．基本辞書部 (a) 語彙部第１言語の語彙の各々についての活用情報、意味情報、
訳語情報等。4-1. Basic dictionary part (a) Vocabulary part Utilization information, meaning information, for each vocabulary of the first language,
Translation information, etc.

【００２８】(b) 形態素解析規則第１言語の入力文を形態素解析するための知識。(B) Morphological analysis rule Knowledge for morphological analysis of input sentences in the first language.

【００２９】(c) 構文・意味解析規則第１言語の入力文を形態素解析した後、構文的・意味的
な解析を行うための知識。(C) Syntax / Semantic Analysis Rule Knowledge for performing syntactic / semantic analysis after morphological analysis of the input sentence of the first language.

【００３０】(d) 変換規則構文・意味解析された結果の第１言語の意味構造を第２
言語の意味構造へ変換するための知識。(D) The conversion rule syntax / semantic analysis result is the second semantic structure of the first language.
Knowledge to translate into the semantic structure of language.

【００３１】(e) 構文生成規則第２言語の意味構造から第２言語の単語列を生成するた
めの知識。(E) Syntax generation rule Knowledge for generating a word string of the second language from the semantic structure of the second language.

【００３２】(f) 形態素生成規則第２言語の語の活用を反映し、最終的な訳文を出力する
ための知識。(F) Morphological Generation Rule Knowledge for outputting the final translated sentence, which reflects the use of words in the second language.

【００３３】4-2. 専門用語辞書部複数の分野別辞書が用意されており、入力文書の内容に
より翻訳時に用いる辞書を選択することが出来る。語彙
部のみからなる。4-2. Technical term dictionary section A plurality of field-specific dictionaries are prepared, and the dictionary used for translation can be selected according to the contents of the input document. It consists only of vocabulary.

【００３４】4-3. ユーザ辞書部ユーザ定義用の辞書。翻訳する文書に合わせてユーザが
複数作成することができる。各々について(a)と(b)があ
る。4-3. User dictionary section A dictionary for user definition. The user can create a plurality according to the document to be translated. There are (a) and (b) for each.

【００３５】(a) 語彙部ユーザが新規登録した用語についての活用情報、意味情
報、訳語情報等。(A) Vocabulary section Utilization information, meaning information, translated word information, etc. regarding terms newly registered by the user.

【００３６】(b) 訳語学習部ある第１言語の語句に対してユーザが学習させた訳語の
情報を格納する。(B) Translated word learning unit Stores information on translated words learned by the user with respect to a certain phrase in the first language.

【００３７】なお、辞書部４としては、原文のパターン
と訳し方のパターンとを蓄積した翻訳パターン辞書を含
む構成にすることも可能である。The dictionary unit 4 may be configured to include a translation pattern dictionary in which original sentence patterns and translation patterns are accumulated.

【００３８】次に、このように構成された実施の形態の
作用について図２の説明図及び図３乃至図５のフローチ
ャートを参照して説明する。図２は翻訳環境記憶部８内
の情報の格納例を示している。Next, the operation of the embodiment thus constructed will be described with reference to the explanatory view of FIG. 2 and the flow charts of FIGS. FIG. 2 shows an example of storage of information in the translation environment storage unit 8.

【００３９】いま、アプリケーション１０が電子メール
送受信プログラムであるものとし、第１言語を使用する
１人以上のユーザと第２言語を使用する１人以上のユー
ザとが、所定の標題について複数回の電子情報のやり取
りを行っているものとする。また、これらの電子情報
（電子メールの送信データ及び受信データ）を記憶した
文書データ記憶部９に対して、アプリケーション１０
は、第１言語を使用するユーザ（以下、単にユーザとい
う）の操作によってアクセスを行うことができるものと
する。Now, assuming that the application 10 is an electronic mail transmission / reception program, one or more users who use the first language and one or more users who use the second language receive a plurality of times for a predetermined title. It is assumed that electronic information is exchanged. In addition, an application 10 is provided for the document data storage unit 9 that stores these electronic information (e-mail transmission data and reception data).
Can be accessed by an operation of a user who uses the first language (hereinafter, simply referred to as a user).

【００４０】アプリケーション１０は、ユーザによる第
１言語の原文データを翻訳対象文書（翻訳原文）として
入力部１から制御部２に供給する。例えば、アプリケー
ション１０が電子メール送受信プログラムである場合に
は、このプログラム内に翻訳処理の実行を呼び出すため
のプログラムを埋め込むことによって、ユーザーはプロ
グラム上の所定の操作によって、送信しようとする原文
データの翻訳処理を実行させることが可能である。The application 10 supplies the original text data of the first language by the user to the control section 2 from the input section 1 as a translation target document (translation original text). For example, when the application 10 is an e-mail transmission / reception program, by embedding a program for calling the execution of translation processing in this program, the user can perform a predetermined operation on the program to send the original text data to be transmitted. It is possible to execute a translation process.

【００４１】制御部３は、入力部１を介して原文データ
が入力されると、関連文書情報取得部６を起動する。図
３は制御部２にアプリケーション１０から原文データが
入力されてから、訳文を出力するまでの処理の流れを示
している。図３のステップＳ301 において、関連文書情
報取得部６は、原文データに付属されている情報を解析
して、参照関係情報を取得する。When the original text data is input via the input unit 1, the control unit 3 activates the related document information acquisition unit 6. FIG. 3 shows the flow of processing from the input of the original text data from the application 10 to the control unit 2 to the output of the translated text. In step S301 of FIG. 3, the related document information acquisition unit 6 analyzes the information attached to the original text data and acquires the reference relation information.

【００４２】いま、入力部１を介して入力された原文デ
ータは、翻訳対象文書自体と付属情報との組のデータに
よって構成されているものとする。付属情報は、メール
のヘッダ情報や、サブジェクト情報、送信者、あて先、
Ｃｃ先等の諸情報に相当する。この場合には、文書のチ
ェーンを収集するために必要な参照関係情報としては、
サブジェクト情報、送信者、あて先、Ｃｃ先及びメッセ
ージＩＤ等が考えられる。これらの情報は、一定の形式
で原文データ内に組込まれているので、例えば文字列の
サーチによって容易に取得することができる。Now, it is assumed that the original text data input via the input unit 1 is composed of data of a set of a document to be translated and auxiliary information. The attached information is the header information of the email, subject information, sender, destination,
Corresponds to various information such as Cc destination. In this case, the reference relationship information needed to collect the document chain is:
The subject information, sender, destination, Cc destination, message ID, etc. can be considered. Since these pieces of information are incorporated in the original text data in a fixed format, they can be easily obtained by searching a character string, for example.

【００４３】また、電子メールの返信時には、前回用い
た文を引用することが多く、また、引用文であることを
示すタグが本文中に付加されることも多い。従って、関
連文書情報取得部６は、引用文であることを示すタグ及
び引用文の内容が一致するものについて参照系列の文書
データであるものと判断することも可能である。Further, when replying an electronic mail, a sentence used last time is often quoted, and a tag indicating a quoted sentence is often added to the text. Therefore, the related document information acquisition unit 6 can also determine that the tag indicating the quote and the content of the quote are the same as the reference series document data.

【００４４】ステップＳ301 における解析及び情報の取
得が終了すると、次に、関連文書情報取得部６は、ステ
ップＳ302 において翻訳環境記憶部８を参照し、入力デ
ータ内の翻訳原文の翻訳に関して、すでに過去に参照文
書を参照して翻訳環境として登録されているか否かを調
べる。既に当該翻訳原文の識別情報がいずれかの区分の
MessageＩＤの中に含まれていれば、処理をステップＳ3
03 からステップＳ306に移行して翻訳処理を実行する。When the analysis and information acquisition in step S301 are completed, the related document information acquisition unit 6 next refers to the translation environment storage unit 8 in step S302, and the translation original text in the input data has already been translated. Check whether the document is registered as a translation environment by referring to the reference document. If the identification information of the original translation is already in
If it is included in the MessageID, the process is step S3.
The process moves from 03 to step S306 to execute the translation process.

【００４５】一方、当該翻訳原文の識別情報がいずれの
区分のMessageＩＤ中にも含まれていなければ、この翻
訳原文についての参照文書取得処理は行なわれていない
ので、処理をステップＳ304 に移行して、参照文書情報
の取得処理を行う。On the other hand, if the identification information of the translated original sentence is not included in any of the MessageIDs of any of the categories, the reference document acquisition process for this translated original sentence has not been performed. Therefore, the process proceeds to step S304. , Obtains reference document information.

【００４６】即ち、関連文書情報取得部６は、ステップ
Ｓ301 で取得した参照関係情報を元にして、入力された
翻訳原文の作成時に参照された文書、および、当該翻訳
原文が作成された後に当該翻訳原文を参照して作成され
た文書を文書データ記憶部９から取得する。この場合に
は、関連文書情報取得部６は、参照文書情報として取得
した文書が更に参照した文書があれば、文書チェーン
（参照系列）の出発点又は終点まで遡って文書を取得す
る。That is, the related document information acquisition unit 6 uses the reference relation information acquired in step S301 as a basis, and refers to the document referred to when the input translation source text is created and the translation source text after the source text is created. A document created by referring to the translated original sentence is acquired from the document data storage unit 9. In this case, if there is a document further referred to by the document acquired as the reference document information, the related document information acquisition unit 6 traces back to the starting point or the end point of the document chain (reference sequence) to acquire the document.

【００４７】なお、関連文書情報取得部６による参照文
書情報の取得の仕方としては以下の方法が考えられる。The following method can be considered as a method of acquiring the reference document information by the related document information acquisition unit 6.

【００４８】即ち、送信済みホルダー、受信フォルダ
ー、その他保存用ホルダー内を、同一サブジェクト、お
よび"Re:"、"RE:"等の付加されたサブジェクトの項目を
時系列的にサーチし、取得する方法がある。また、プロ
グラム内でメッセージＩＤが完備している場合には、こ
のメッセージＩＤをキーにして参照系列を取得すること
ができる。参照系列をたどり１つのメールを取得する毎
に、翻訳環境記憶部８のいずれかの区分にあるMessage
ＩＤと一致するか否かをチェックし、一致するものがあ
れば既にその先は過去に参照ずみであると判断すること
ができ、この時点で参照データの取得処理を終了する。
ただし、時系列的に新しい参照系列が見つかった場合に
は、検出した参照文書情報の追加のみは省略せずに行
う。That is, the same subject and the items of the subject to which "Re:", "RE:", etc. are added are searched and obtained in a time-series manner in the sent folder, the receiving folder, and other storage folders. There is a way. If the message ID is complete in the program, the reference sequence can be acquired using this message ID as a key. Every time one mail is obtained by tracing the reference sequence, the Message in any of the categories in the translation environment storage unit 8
It is checked whether or not it matches the ID, and if there is a match, it can be determined that the destination has already been referred to in the past, and the reference data acquisition processing ends at this point.
However, when a new reference sequence is found in time series, only the detected reference document information is added without omission.

【００４９】また、MessageＩＤとの一致が見られない
ものについても、以下のような場合には処理を終了する
ことができる。例えば、同一サブジェクトでも、送信
者、受信者、Cｃ先が明かに食い違いが見つかった場合
には、系列の区切りと見なして参照系列の検出処理を終
了する。また、同一プログラムで送られたメールである
のに、In-Reply-to情報が途絶えた場合にも、他の系列
の始発点と見なすことができるので、サーチを終了す
る。これ以外にも、種々な情報を利用してサーチ方法を
適宜工夫することが可能である。Further, even if the message ID does not match, the process can be terminated in the following cases. For example, even when the sender, the receiver, and the Cc destination are clearly discriminated from each other even in the same subject, the reference sequence detection processing is terminated by regarding it as a sequence delimiter. Further, even if the mail is sent by the same program, but the In-Reply-to information is lost, it can be considered as the starting point of another series, so the search ends. Other than this, it is possible to devise a search method as appropriate using various information.

【００５０】参照文書の取得が終ると、制御部２は、処
理をステップＳ305 に移行して、翻訳環境設定部７を起
動する。翻訳環境設定部７は、取得した参照文書情報の
データを用いて、翻訳原文に適合した翻訳環境を判定す
る。When the acquisition of the reference document is completed, the control unit 2 shifts the processing to step S305 and activates the translation environment setting unit 7. The translation environment setting unit 7 uses the data of the acquired reference document information to determine the translation environment suitable for the translation original sentence.

【００５１】図４は図３中のステップＳ305 の処理を具
体的に示している。まず、翻訳環境設定部７は、ステッ
プＳ401 において、参照文書情報全体の中の内容語を抽
出する。次に、翻訳環境設定部７は、ステップＳ402 に
おいて抽出した内容語を用いて、一連の文書の内容（分
野）を推定する処理を行う。この推定処理については、
参考文献「辞書管理方法および辞書管理システムおよび
翻訳方法および翻訳システム」に詳述されている方法を
適用することができる。なお、この文献に記載された推
定処理は、特定の分野毎に関連のある語のセットを定義
しておき、その定義された語と抽出された語との一致状
況によって、文書の分野を推定する方法を採用してい
る。FIG. 4 specifically shows the process of step S305 in FIG. First, the translation environment setting unit 7 extracts content words in the entire reference document information in step S401. Next, the translation environment setting unit 7 uses the content word extracted in step S402 to estimate the content (field) of the series of documents. For this estimation process,
The methods detailed in the reference “Dictionary Management Method and Dictionary Management System and Translation Method and Translation System” can be applied. Note that the estimation process described in this document defines a set of related words for each specific field, and estimates the field of the document based on the matching status between the defined word and the extracted word. Is adopted.

【００５２】分野の推定処理が終了すると、翻訳環境設
定部７は、ステップＳ403 に処理を移行して、翻訳時に
使用する専門用語辞書の選定を行う。この方法として
は、ステップＳ402 で推定された分野に応じて、どの専
門用語辞書を用いるとよいかの対応表を用意しておき、
それに従って、選択を行えば良い。下記表１はこの対応
表の例を示している。When the field estimation process is completed, the translation environment setting unit 7 shifts the process to step S403 to select a technical term dictionary to be used at the time of translation. As this method, a correspondence table indicating which technical term dictionary should be used according to the field estimated in step S402 is prepared,
The selection should be made accordingly. Table 1 below shows an example of this correspondence table.

【００５３】専門辞書の選定が終ると、翻訳環境設定部７は、次のス
テップＳ404 において、ステップＳ304 、Ｓ401 〜Ｓ40
3 で得た情報を翻訳環境記憶部８に反映させる処理を行
う。この場合には、翻訳環境記憶部８のいずれかの区分
が当該翻訳原文の参照系列と一致し、すでに記載がある
場合には、それに書き加えが行われ、新規の場合には新
区分が追加される。ステップＳ401 で抽出した内容語
は、英語の文書か日本語の文書かによって、夫々「内容
語Ｅ」、「内容語Ｊ」欄にリストアップされる。新規項
目の場合には、サブジェクト情報はSubjectに記載され
る。参照されたメッセージ識別情報は、MessageＩＤに
列挙される。推定された分野情報は「分野」欄に、選定
された専門用語辞書の種類名は「使用専門辞書」欄に書
きこまれる。[0053] When the selection of the specialized dictionary is completed, the translation environment setting unit 7 proceeds to the next step S404, where steps S304, S401 to S40 are performed.
Processing for reflecting the information obtained in 3 in the translation environment storage unit 8 is performed. In this case, if one of the categories in the translation environment storage unit 8 matches the reference sequence of the translation original sentence and is already described, it is added to it, and if it is new, a new category is added. To be done. The content words extracted in step S401 are listed in the "content word E" and "content word J" columns, respectively, depending on whether they are English documents or Japanese documents. In the case of a new item, subject information is described in Subject. The referenced message identification information is listed in MessageID. The estimated field information is written in the “field” column, and the type name of the selected technical term dictionary is written in the “used technical dictionary” column.

【００５４】図２はステップＳ404 による翻訳環境記憶
部８の更新処理によって記憶される翻訳環境情報を示し
ていてる。図２の「Subject:」、「Message ＩＤ：」、
「分野：」、「使用専門辞書：」、「内容語Ｅ：」及び
「内容語Ｊ：」は、夫々下記の内容を示している。FIG. 2 shows the translation environment information stored by the update processing of the translation environment storage unit 8 in step S404. In Figure 2, "Subject:", "Message ID:",
“Field:”, “specialized dictionary for use:”, “content word E:” and “content word J:” indicate the following contents, respectively.

【００５５】Subject:本実施の形態においては、サブジ
ェクトが共通するものを１つの文書セット（チェーン）
として認識する。ここではそのサブジェクト名を記憶す
る。Subject: In the present embodiment, the common subject is one document set (chain).
Recognize as. Here, the subject name is stored.

【００５６】実施の仕方によっては、サブジェクト名以
外から参照関係情報を取得することもあり得るので（ア
プリケーションが電子掲示板の場合等）、その場合に
は、複数のサブジェクト名を記憶するようにする。Depending on the method of implementation, the reference relationship information may be obtained from other than the subject name (when the application is an electronic bulletin board, etc.). In that case, a plurality of subject names are stored.

【００５７】Message ＩＤ：アプリケーションが管理す
る文書識別情報。実施の形態によっては番号であった
り、ファイル名であったりする。Message ID: Document identification information managed by the application. Depending on the embodiment, it may be a number or a file name.

【００５８】分野：前回の翻訳時に翻訳部３により翻訳
原文がいずれの分野の文書と判定されたか、あるいは、
翻訳環境設定部７により参照文書情報はいずれの分野の
文書と判定されたのかについての情報を記憶する。Field: in which field the translation original sentence was judged by the translation unit 3 at the time of the previous translation, or
The translation environment setting unit 7 stores information about which field the reference document information is determined to be a document.

【００５９】使用専門辞書：前回の翻訳時に、ユーザが
どの辞書の使用を指定したか、あるいは翻訳直前の翻訳
環境設記憶部８内の分野情報に基づいて、翻訳部３が自
動で選択した専門用語辞書は何であったか、あるいは翻
訳環境設定部７により参照文書情報に対して判定された
分野に対応する専門辞書の種類は何かについての情報を
記憶する。Specialized dictionary to be used: The special dictionary automatically selected by the translation unit 3 based on which dictionary the user specified to use at the time of the previous translation, or the field information in the translation environment storage unit 8 immediately before translation. Information about what the term dictionary was, or what kind of specialized dictionary corresponds to the field determined by the translation environment setting unit 7 for the reference document information is stored.

【００６０】内容語Ｅ：参照文書履歴中で、もともと英
語で書かれた文書に生起した内容語のリストを記憶す
る。図２の例では単純な列挙にとどめたが、頻度情報を
持たせて、後の翻訳部３での利用の制御に用いてもよ
い。Content word E: In the reference document history, a list of content words originally generated in the document written in English is stored. In the example of FIG. 2, only simple enumeration is performed, but frequency information may be added and used for controlling the use in the translation unit 3 later.

【００６１】内容語Ｊ：参照文書履歴中で、もともと日
本語で書かれた文書に生起した内容語のリストを記憶す
る。言語の違い以外は内容語Ｅと同じ。Content word J: A list of content words originally generated in a document written in Japanese is stored in the reference document history. Same as content word E except for language differences.

【００６２】このような翻訳環境記憶部８の更新が終了
すると、図３のステップＳ306 に処理を移行して、翻訳
部３が起動される。図５は図３のステップＳ306 の具体
的なフローを示し、翻訳原文中の１文に対する翻訳部３
の翻訳処理を示している。なお、入力された翻訳原文全
体の翻訳には、文書末まで図５のフローが繰り返され
る。When the updating of the translation environment storage unit 8 is completed, the processing shifts to step S306 in FIG. 3 and the translation unit 3 is activated. FIG. 5 shows a specific flow of step S306 of FIG.
Shows the translation process of. For the translation of the entire input translation source text, the flow of FIG. 5 is repeated until the end of the document.

【００６３】翻訳部３は、制御部２から翻訳原文が送ら
れると、先ず、ステップＳ501 において、当該翻訳原文
が新しいブロックの第１文か否かを判断する。アプリケ
ーション１０が電子メールの送受信プログラムである場
合には、１ブロックは１メッセージに対応する。従っ
て、この場合には、メッセージの先頭の文のみが「新し
いブロック」と判断される。ブロックの先頭の文であれ
ば、翻訳部３は、ステップＳ502 に移行して、翻訳環境
記憶部８中の該当する区分の翻訳環境情報を読み込む。
ブロックの先頭以外の文については、翻訳環境情報を新
たに読み込むことなく、前文で用いた環境と同一の翻訳
環境情報を用いる。こうして、ブロック毎に対応する翻
訳環境情報を用いるように制御される。When the translation source sentence is sent from the control unit 2, the translation unit 3 first determines in step S501 whether the translation source sentence is the first sentence of a new block. When the application 10 is an electronic mail transmission / reception program, one block corresponds to one message. Therefore, in this case, only the first sentence of the message is determined as the "new block". If it is the first sentence of the block, the translation unit 3 proceeds to step S502 to read the translation environment information of the corresponding section in the translation environment storage unit 8.
For the sentences other than the beginning of the block, the same translation environment information as the environment used in the previous sentence is used without newly reading the translation environment information. In this way, control is performed so that the translation environment information corresponding to each block is used.

【００６４】翻訳環境の準備が整うと、翻訳部３は、次
のステップＳ503 において、辞書部４の下記知識を用い
て、辞書引き及び形態素解析処理を行う。When the translation environment is ready, in the next step S503, the translation unit 3 uses the following knowledge of the dictionary unit 4 to perform dictionary lookup and morphological analysis processing.

【００６５】なお、4-2．専門用語辞書部の中で用いる辞書の種類
は、翻訳環境情報中に設定されている辞書とする。ユー
ザ辞書4-3．については、翻訳環境情報との関連付けを
行っていないので、ユーザが別途指定した辞書を用いる
ようにする。この場合でも、翻訳環境記憶部８が記憶す
る項目の１つとしてユーザ辞書の項目を設けることで、
翻訳精度を一層向上させることができる。[0065] In addition, 4-2. The type of dictionary used in the technical term dictionary section is the dictionary set in the translation environment information. User dictionary 4-3. With respect to the above, since it is not associated with the translation environment information, a dictionary specified separately by the user should be used. Even in this case, by providing an item of the user dictionary as one of the items stored in the translation environment storage unit 8,
The translation accuracy can be further improved.

【００６６】次に、翻訳部３は、ステップＳ504 におい
て、4-1．(c)構文・意味解析規則を用いて、入力原文の
構文・意味解析を行う。この段階で、入力言語の解析結
果の構造が構築されている。次に、翻訳部３は、ステッ
プＳ505 に処理を移行して、4-1．(a)語彙部、(d)変換
規則、4-2．語彙部、4-3．(a)語彙部、(b)訳語学習部の
知識を用いて、入力言語の構造を出力言語の構造に変換
する処理を行う。この段階で、出力言語の言語的な構造
と共に、入力単語に対する訳語の決定も行う。Next, in step S504, the translation unit 3 4-1. (c) The syntax and semantic analysis of the input source text is performed using the syntax and semantic analysis rules. At this stage, the structure of the analysis result of the input language is constructed. Next, the translation unit 3 shifts the processing to step S505, and 4-1. (a) Vocabulary part, (d) Conversion rule, 4-2. Vocabulary section, 4-3. Using the knowledge of (a) vocabulary part and (b) translated word learning part, processing for converting the structure of the input language into the structure of the output language is performed. At this stage, the translation of the input word is determined along with the linguistic structure of the output language.

【００６７】本実施の形態においては、訳語の決定に際
して、翻訳環境情報に設定されている分野名、専門用語
辞書、内容語Ｅ，Ｊの情報を下記の手法によって用い
る。In the present embodiment, when determining the translated word, the information of the field name, the technical term dictionary, and the content words E and J set in the translation environment information is used by the following method.

【００６８】* 専門用語辞書入力文中に、基本辞書にはなく、専門用語辞書にのみ登
録されている合成語等が現われていれば、ステップＳ50
3 で専門用語辞書の登録語の辞書引きが行われ、その訳
語がここで選択される。また、基本辞書にも専門用語辞
書にも登録されている語が入力文中にあった場合にも、
専門用語辞書に定義されている訳語が採用される。こう
して、当該専門分野に即した訳語で訳出されることにな
る。（ユーザ辞書についても同様） * 分野名基本辞書中の訳語に分野情報がついているものがある。
以下に「base」の例を示す。野球分野であれば、「ベー
ス」と訳し、軍事分野であれば「基地」と訳し、それ以
外であれば「基礎」と訳すことを示している。よって、
翻訳環境情報に設定された分野に合わせて、同じ見出し
語でも異なる訳語に訳し分けることができる。* If a compound word or the like registered in the technical term dictionary but not in the basic dictionary appears in the input text of the technical term dictionary, step S50.
In step 3, the registered words in the technical term dictionary are looked up, and their translated words are selected here. Also, when there is a word in the input sentence that is registered in both the basic dictionary and the technical term dictionary,
The translation terms defined in the technical term dictionary are adopted. In this way, the translation will be performed in a translation word that matches the specialized field. (The same applies to user dictionaries.) * Some translations in the field name basic dictionary have field information.
An example of "base" is shown below. In the field of baseball, it is translated as "base", in the field of military it is translated as "base", and in other cases it is translated as "basic". Therefore,
The same headword can be translated into different translations according to the field set in the translation environment information.

【００６９】（例） base: 基礎、ベース(分野=野球)、基地(分野=軍事) * 内容語Ｅ，Ｊいま、MessageＩＤ=000015の文書を翻訳しようとしてい
るとする。その文書の中に「The program featured wor
ks by Picasso.」という文があったとする。基本辞書部
の「work」の登録内容が以下の通りだとすると、通常は
第１定義である「仕事」が「work」の訳語として採用さ
れる。ところが、翻訳環境情報として内容語Ｊの中の語
を一覧すると、「作品」という語があり、これが「wor
k」の訳語定義群の中の一つと一致する。このような場
合に、一致した訳語を優先して用いる、という制御の仕
方をする。一方、MessageＩＤ=000010の文書を翻訳する
場合には、同じ「work」に対して、内容語Ｊにある「作
業」を優先する。こうすることで、翻訳を介したメール
やメッセージのやり取りを通して、一貫性のある翻訳が
保ちやすくなる。(Example) base: Basic, base (field = baseball), base (field = military) * Content words E and J Now, suppose that a document with MessageID = 000015 is being translated. In that document, "The program featured wor
Suppose there is a sentence "ks by Picasso." If the registered contents of “work” in the basic dictionary section are as follows, the first definition “work” is usually adopted as the translation of “work”. However, when listing the words in the content word J as translation environment information, there is the word "work", which is "wor
Matches one of the translation definition groups of "k". In such a case, the matched translated word is preferentially used and controlled. On the other hand, when translating a document with MessageID = 000010, “work” in the content word J is prioritized with respect to the same “work”. This will help maintain consistent translations through email and message exchanges through translations.

【００７０】（例） work: 仕事、作業、労働、作品こうして、出力言語の構造が構築されると、翻訳部３は
処理をステップＳ506に移行して、4-1．(e)構文生成規
則を用いて、２次元的な構造を１次元的な構造に変換す
る。最後に、翻訳部３は、ステップＳ507 において、4-
1．(f)形態素生成規則を用いて、個々の語の表層形態を
生成し、最終的な訳文を出力する。(Example) work: work, work, labor, work When the structure of the output language is constructed in this way, the translation unit 3 shifts the processing to step S506, and 4-1. (e) Convert a two-dimensional structure into a one-dimensional structure using the syntax generation rule. Finally, in step S507, the translation unit 3 4-
1. (f) The surface morphology of each word is generated using the morpheme generation rule, and the final translated sentence is output.

【００７１】翻訳部３から出力された訳文は、制御部２
に送られる。制御部２は図３のステップＳ307 におい
て、訳文を出力部５を介してアプリケーション１０に出
力する。こうして、アプリケーション１０である電子メ
ール送受信プログラム上で、翻訳原文に対する第２言語
の翻訳文が得られる。The translated text output from the translation unit 3 is the control unit 2
Sent to. The control unit 2 outputs the translated text to the application 10 via the output unit 5 in step S307 of FIG. In this way, the translated sentence in the second language with respect to the translated original sentence can be obtained on the electronic mail transmission / reception program which is the application 10.

【００７２】次に、アプリケーション１０がＷｅｂ上の
掲示板システムである場合について作用を説明する。な
お、説明を簡略化するために、アプリケーションが異な
ることに基づく相違点のみを説明する。なお、掲示板シ
ステムの電子情報（掲示板に表示される文書データ）を
記憶したインターネット上の文書データ記憶部９に対し
て、第１言語を使用するユーザが、アプリケーション１
０によってアクセスを行うものとする。Next, the operation will be described when the application 10 is a bulletin board system on the Web. To simplify the description, only the differences based on the different applications will be described. It should be noted that a user who uses the first language is assigned to the application 1 by using the document data storage unit 9 on the Internet that stores electronic information (document data displayed on the bulletin board) of the bulletin board system.
It is assumed that 0 is used for access.

【００７３】第１の相違点は、図３のステップＳ301 に
おける付属情報の解析処理であり、第２の相違点は、図
３のステップＳ304 における参照文書情報の取得処理で
あり、第３の相違点は、ステップＳ501 においてブロッ
クを考慮する点である。The first difference is the additional information analysis process in step S301 of FIG. 3, and the second difference is the reference document information acquisition process in step S304 of FIG. 3, which is the third difference. The point is that blocks are considered in step S501.

【００７４】先ず、第１の相違点について説明する。First, the first difference will be described.

【００７５】Ｗｅｂ上の掲示板では、翻訳対象及び文書
データ記憶部９に格納されている文書がｈｔｍｌソース
ファイルであることが考えられる。この場合、ソースフ
ァイル内に埋め込まれた原文データ以外の例えばタグ等
の各種構造情報が付属情報に相当する。On the bulletin board on the Web, it is conceivable that the translation object and the document stored in the document data storage unit 9 are html source files. In this case, various structural information such as tags other than the original text data embedded in the source file corresponds to the attached information.

【００７６】ところで、掲示板の構成はＷｅｂサイト毎
に異なる。例えば、１ページに投稿メッセージが時系列
的に列挙されている順次表示の掲示板で、返信関係のリ
ンクボタンが用意されていない形態（以下、「タイプ
Ａ」という）の掲示板がある。また、特定のメッセージ
がいずれのメッセージの返信であるかの情報をリンク付
きで表示する形態（以下、「タイプＢ」という）の掲示
板もある。更に、参照関係が枝分かれツリーや、インデ
ントを使って表されている形態（以下、「タイプＣ」と
いう）の掲示板もある。これらのタイプの識別は、ステ
ップＳ301 において、関連文書情報取得部６がページ内
のテーブルの使用法等を判別することによって予め行
う。By the way, the structure of the bulletin board differs for each Web site. For example, there is a bulletin board in which posting messages are listed in a time series on one page and a link button for reply is not prepared (hereinafter referred to as “type A”). There is also a bulletin board in a form (hereinafter, referred to as "type B") in which information indicating which message a particular message is a reply to is displayed with a link. Furthermore, there is a bulletin board in which the reference relationship is expressed using a branching tree or indentation (hereinafter referred to as "type C"). These types are identified in advance in step S301 by the related document information acquisition unit 6 determining how to use the table in the page.

【００７７】「タイプＡ」の掲示板においては、各メッ
セージは番号で管理されており、返信関係を示す情報と
して、サブジェクト名に参照したメッセージ番号が付記
される等の表示形態をとるのが普通である。そこで、こ
の場合には、関連文書情報取得部６は、ステップＳ301
において、各メッセージの参照メッセージ番号、サブジ
ェクト、投稿者等を参照関係情報として取得する。In the "type A" bulletin board, each message is managed by a number, and it is usual to take a display form such that a message number referred to a subject name is added as information indicating a reply relationship. is there. Therefore, in this case, the related document information acquisition unit 6 performs step S301.
In, the reference message number, subject, poster, etc. of each message are acquired as reference relationship information.

【００７８】「タイプＢ」の掲示板については、関連文
書情報取得部６は、ステップＳ301において、当該メッ
セージのサブジェクト、投稿者情報の他に、返信の元と
なったデータの所在、ファイル名等を取得する。掲示板
で通常用いられる以下に示す表示の仕方（パターンＡ
１，Ａ２）を記憶しておき、そのパターンを入力のｈｔ
ｍｌファイル内でサーチすることで、返信の元となった
データを検出することができる。なお、これらのパター
ンを、ユーザが容易に追加できるような構成にしておく
ことも可能である。For the "type B" bulletin board, the related document information acquisition unit 6 determines in step S301 the location of the data that is the source of the reply, the file name, etc. in addition to the subject of the message and the poster information. get. The following display method (Pattern A) that is usually used in bulletin boards
1, A2) is stored and the pattern is input
By performing a search in the ml file, the data that is the source of the reply can be detected. It should be noted that these patterns can be configured so that the user can easily add them.

【００７９】パターンＡ１：これは（$変数）さんの（$
変数_参照先）に対する返信ですパターンＡ２：このコメントは（$変数_参照先）へ返信
です ; ; 同様に、当該メッセージを参照して作成されたメッセー
ジの検出についても、以下のようなパターンＢ１を用意
しておき、リンク関係をたどりながらメッセージを取得
することができる。Pattern A1: This is ($ variable) 's ($
This is a reply to (variable_reference) Pattern A2: This comment is a reply to ($ variable_reference);; Similarly, the following pattern B1 is also used for detecting a message created by referring to the message. You can get messages while following the link relations.

【００８０】パターンＢ１:このメッセージに対する返
信を見る ; ; また、「タイプＣ」の掲示板については、ｈｔｍｌ形式
の構造を解析することで容易に参照関係情報の表示部分
の検出が可能である。下記表示例は、その表示の一例で
ある。テーブル形式を用いて表されることが多く、サブ
ジェクト部分は通常実メッセージデータへのリンクとな
っている。この中から、もっとも単純なレベルでは、サ
ブジェクト中の「Re」「RE」「re」等の使用法をキーに
参照・被参照関係にあるメッセージデータの所在、ファ
イル名等を参照関係情報として取得すればよい。更に、
テーブル内入れ子関係等の構造を解析することにより、
サブジェクト名が変わっていても参照・被参照関係を検
出することができる。Pattern B1: See reply to this message;; For the "type C" bulletin board, it is possible to easily detect the display portion of the reference relation information by analyzing the html format structure. The following display example is an example of the display. Often expressed using a table format, the subject part is usually a link to the actual message data. From these, at the simplest level, the usage of "Re", "RE", "re", etc. in the subject is used as a key, and the location of the message data in the referenced relationship and the file name etc. are acquired as reference relationship information. do it. Furthermore,
By analyzing the structure such as the nesting relation in the table,
Even if the subject name is changed, the reference / reference relationship can be detected.

【００８１】示例> No. 発言順 ; ; 5 5 ソフトウェアの購入方法について花田一郎 2001-06-10 6 6 ユーザ意見の収集について鈴木宏 2001-06-11 7 ・Re: ユーザ意見の収集について佐藤孝 2001-06-11 9 ・・Re: ユーザ意見の収集について山田智子 2001-06-13 7 8 警告表示井上悟 2001-06-12 ; ; これらのいずれのタイプについても、入力が原文データ
のアドレス、存在場所、ファイル名のみであるような場
合には、付属情報解析の前に、文書データ記憶部９にア
クセスして原文データを取得するところから処理を始め
る必要がある。Example> No. Speaking order ;; 5 5 About purchasing software Ichiro Hanada 2001-06-10 6 6 About collecting user opinions Hiroshi Suzuki 2001-06-11 7 ・ Re: About collecting user opinions Taka Sato 2001-06-11 9 ・・ Re: Collection of user opinions Tomoko Yamada 2001-06-13 7 8 Warning display Satoru Inoue 2001-06-12 ;; For both of these types, the input is the address of the text data, In the case where only the location and the file name are present, it is necessary to start the process from the place where the original data is acquired by accessing the document data storage unit 9 before analyzing the attached information.

【００８２】次に、第２の相違点、即ち、参照文書情報
の探し方について説明する。Next, the second difference, that is, how to search for reference document information will be described.

【００８３】タイプＡのＷｅｂ上の掲示板の場合には、
ページ内に参照系列に属する文書が存在する可能性があ
るので、最初に、ページ上の各メッセージ毎に、参照系
列を同ページ内で検索する。ページ内の検索が終了し、
その中に参照系列に属するメッセージ（文書）が存在し
ない場合には、次に前ページの参照を行う。なお、前ペ
ージのデータを検出するためには、ｈｔｍｌ内で「前ペ
ージ」、「次ページ」等の文字列がボタンやリンクにな
っている個所を検索することで、容易に前ページのデー
タの検出が可能である。前ページのデータを取得する
と、同様に参照系列のメッセージ番号（参照メッセージ
番号）と一致するメッセージを順次検索して、参照系列
の出発点までたどりながら、参照文書情報を系列的に取
得する。In the case of a type A Web bulletin board,
Since there is a possibility that a document belonging to the reference sequence exists in the page, first, the reference sequence is searched in the page for each message on the page. The search on the page is completed,
If there is no message (document) belonging to the reference sequence, the previous page is referred to. In addition, in order to detect the data of the previous page, you can easily find the data of the previous page by searching the place where the character string such as "previous page" or "next page" is a button or link in html. Can be detected. When the data on the previous page is acquired, similarly, the message that matches the message number (reference message number) of the reference sequence is sequentially searched, and the reference document information is acquired in sequence while tracing to the starting point of the reference sequence.

【００８４】タイプＢの掲示板の場合には、前記パター
ンのサーチにより見つかった個所の「$変数_参照先」に
リンクがはられているデータを取得し、同様に参照先が
無くなるまで、メッセージの参照系列を取得する。In the case of the type B bulletin board, the data linked to the "$ variable_reference" of the part found by the search of the pattern is acquired, and similarly, until the reference disappears, the message Get reference series.

【００８５】タイプＣの掲示板の場合には、参照関係の
表示部分の構造から参照・被参照のメッセージのセット
の情報が取得可能なので、取得された情報にあるメッセ
ージを順次取得していけばよい。In the case of the type C bulletin board, information on a set of referenced / referenced messages can be acquired from the structure of the display portion of the reference relationship, so the messages in the acquired information can be sequentially acquired. .

【００８６】また、翻訳環境記憶部８のいずれかの区分
にMessageＩＤが記載済みのメッセージが見つかった時
点で以降の参照データ取得処理を省略するのは、タイプ
Ａ，Ｂ，Ｃとも同様である。Similar to types A, B and C, the subsequent reference data acquisition processing is omitted when a message in which the MessageID has been described is found in any of the categories of the translation environment storage unit 8.

【００８７】次に、第３の相違点、即ち、入力される翻
訳訳文書が複数のブロックによって構成されている場合
について説明する。Next, the third difference, that is, the case where the input translated document is composed of a plurality of blocks will be described.

【００８８】本実施の形態においては、掲示板システム
では、通常テーブル形式を用いてひとまとまりに表され
るメッセージの集合をブロックと定義する。Ｗｅｂ上の
掲示板システムで扱う文書は、タイプＡの掲示板のよう
に、複数ブロックによって構成されていることがある。
この場合には、翻訳部３は、ブロックの先頭の文であれ
ば、ステップＳ501 から処理をステップＳ502 に移行
し、翻訳環境記憶部８中の該当する区分の翻訳環境情報
を読み込む。ブロック先頭以外では、翻訳環境情報を新
たに読み込むことなく、前文で用いた環境と同じ翻訳環
境情報を用いる。こうして、１つのページに複数のメッ
セージがある場合には、夫々の翻訳環境情報を用いるよ
うに制御することができる。In the present embodiment, the bulletin board system defines a block as a set of messages that are normally expressed in a group using a table format. A document handled by a bulletin board system on the Web may be composed of a plurality of blocks like a type A bulletin board.
In this case, the translation unit 3 moves the process from step S501 to step S502 if the sentence is the head sentence of the block, and reads the translation environment information of the corresponding section in the translation environment storage unit 8. Except for the beginning of the block, the same translation environment information as the environment used in the previous sentence is used without newly reading the translation environment information. In this way, when there are a plurality of messages on one page, it is possible to control to use each translation environment information.

【００８９】なお、タイプＢの掲示板の場合には、入力
文書全体を１ブロックとみなし、その第１文であるか否
かをステップＳ501 において判断すればよい。In the case of the type B bulletin board, the entire input document may be regarded as one block, and whether or not it is the first sentence may be determined in step S501.

【００９０】他の作用はアプリケーション１０がメール
送受信プログラムである場合と同様であり、メッセージ
のブロック毎に、取得した参照文書の情報を用いて翻訳
環境情報を設定し、翻訳環境情報に基づいて翻訳処理を
行って訳文書を出力する。Other operations are the same as when the application 10 is a mail transmission / reception program, and the translation environment information is set for each block of the message using the information of the acquired reference document, and the translation is performed based on the translation environment information. Process and output the translated document.

【００９１】このように、本実施の形態においては、翻
訳対象に応じた参照関係情報を用いて文書データ記憶部
から関連を有する参照文書情報を検出し、参照文書情報
に基づいて翻訳を補助する翻訳環境情報を設定し、設定
した翻訳環境情報を用いて翻訳処理を行っており、翻訳
精度を向上させることができる。これにより、比較的短
い文書の電子メールや電子掲示板上のメッセージの翻訳
に際しても、返信履歴等を利用して、履歴内の文書から
翻訳を補助する翻訳環境情報を取得して有効利用するこ
とができる。また、１ページに複数の話題が並行して掲
載される掲示板システムにおいても、話題毎に適した翻
訳環境情報を用いて高い翻訳精度の翻訳結果を得ること
ができる。As described above, in the present embodiment, the reference relationship information corresponding to the translation target is used to detect the related reference document information from the document data storage unit, and the translation is assisted based on the reference document information. The translation environment information is set, and the translation process is performed using the set translation environment information, so that the translation accuracy can be improved. As a result, even when translating an e-mail of a relatively short document or a message on an electronic bulletin board, it is possible to use the reply history and the like to obtain translation environment information for assisting translation from the document in the history and effectively use it. it can. Further, even in a bulletin board system in which a plurality of topics are posted in parallel on one page, it is possible to obtain a translation result with high translation accuracy by using the translation environment information suitable for each topic.

【００９２】図６は本発明の他の実施の形態において採
用される翻訳環境設定処理を示すフローチャートであ
る。図６において図４と同一の手順には同一符号を付し
て説明を省略する。FIG. 6 is a flow chart showing a translation environment setting process adopted in another embodiment of the present invention. 6, the same steps as those in FIG. 4 are designated by the same reference numerals, and the description thereof will be omitted.

【００９３】本実施の形態は翻訳環境設定部７の処理が
図１の実施の形態と異なるのみである。翻訳環境設定部
７は、図６のステップＳ412 において、対訳用語を作成
する。即ち、翻訳環境設定部７は、参照文書情報全体又
はその一部に対する文脈情報を用いた翻訳処理によっ
て、収集したチェーン内の単語の訳語として最適な訳語
を取得する。The present embodiment is different only in the processing of the translation environment setting unit 7 from the embodiment of FIG. The translation environment setting unit 7 creates bilingual terms in step S412 of FIG. That is, the translation environment setting unit 7 obtains an optimal translated word as a translated word of the collected words in the chain by the translation process using the context information for the entire reference document information or a part thereof.

【００９４】図７は図６のステップＳ404 において更新
される翻訳環境記憶部８の内容を示している。図７にお
いて図２と同一の記憶内容については説明を省略する。FIG. 7 shows the contents of the translation environment storage unit 8 updated in step S404 of FIG. In FIG. 7, description of the same storage contents as in FIG. 2 will be omitted.

【００９５】図７において、対訳用語はステップＳ412
において取得されたものである。第１言語のある語に対
して最適の第２言語の訳語を決める手段の典型には、4-
1(d)変換規則によるものがある。例えば、「plane」に
は大きくわけて「飛行機」と「平面」の異なる意味があ
る。4-1(d)変換規則の中には、どちらかの訳語が適切か
を決める知識として以下のような訳出決定のための規則
が多数蓄積されている。In FIG. 7, the parallel translation term is step S412.
It was acquired in. A typical means for determining the optimum second language translation for a certain word in the first language is 4-
1 (d) There are some conversion rules. For example, “plane” has two different meanings, “airplane” and “plane”. In the 4-1 (d) conversion rules, a large number of rules for determining translation as follows are accumulated as knowledge for deciding which translation is appropriate.

【００９６】plane(主語)+ bomb/crash/explode(主動
詞) → plane=飛行機（意味：bomb, crash, explodeの主語として用いられた
planeは「飛行機」と訳す）このような規則のいずれかが選択され訳語が決定される
ことで、「plane=飛行機」という対訳情報を作成するこ
とができる。Plane (subject) + bomb / crash / explode (main verb) → plane = airplane (Meaning: used as the subject of bomb, crash, explode
(Plane is translated as “airplane”) By selecting one of these rules and determining the translation, it is possible to create parallel translation information “plane = airplane”.

【００９７】図７の対訳用語は、翻訳原文に属するチェ
ーン内の文書の訳語として最適と判断されたものであ
る。従って、翻訳部３は、翻訳処理に際して、図７の対
訳用語を最優先して翻訳を行う。The bilingual terms in FIG. 7 are determined to be optimal as the translated words of the documents in the chain that belong to the original translation. Therefore, the translation unit 3 gives the highest priority to the translation term of FIG.

【００９８】他の構成及び作用は図１の実施の形態と同
様である。Other configurations and operations are similar to those of the embodiment shown in FIG.

【００９９】このように、本実施の形態においても図１
の実施の形態と同様の効果を得ることができる。As described above, FIG. 1 is also used in this embodiment.
It is possible to obtain the same effect as that of the above embodiment.

【０１００】なお、上記実施の形態において用いた情報
は一例にすぎず、請求の範囲を逸脱しない範囲であらゆ
る情報の利用が可能である。また、アプリケーションプ
ログラムが扱うデータ情報の詳細化に合わせて、履歴情
報を取得する手段にも多様な形態が可能である。また、
上記各実施の形態は、取得の細かい仕様について限定す
るものではない。また、アプリケーションプログラムに
ついては、上記実施の形態においては電子メール送受信
プログラム又は電子掲示板プログラムを想定したが、翻
訳を介した自然言語文データのやり取りを行うあらゆる
コミュニケーションプログラムに適用可能である。The information used in the above embodiments is merely an example, and any information can be used without departing from the scope of the claims. Further, in accordance with the detailing of the data information handled by the application program, various forms can be used for the means for acquiring the history information. Also,
The above embodiments do not limit the detailed specifications of acquisition. Further, although the application program is assumed to be the electronic mail transmission / reception program or the electronic bulletin board program in the above-mentioned embodiment, it can be applied to any communication program for exchanging natural language sentence data through translation.

【０１０１】即ち、本発明は翻訳を介したメッセージの
やり取りに適用することができ、この場合には、第１言
語と第２言語の語句の対を翻訳環境情報として保存する
ので、翻訳を介した結果まったく別の意味の内容の会話
のようになってしまうという問題を回避しやすくなる。That is, the present invention can be applied to the exchange of messages through translation. In this case, since the pair of words in the first language and the second language is stored as the translation environment information, the translation is not performed. As a result, it becomes easier to avoid the problem that the conversation becomes like a completely different meaning.

【０１０２】[0102]

【発明の効果】以上説明したように本発明によれば、電
子メールや電子掲示板等の電子情報のやり取りの特徴を
利用して、翻訳対象の文書以外の関連を有する文書から
翻訳処理に利用可能な情報を抽出することにより、翻訳
精度を向上させることができるという効果を有する。As described above, according to the present invention, it is possible to utilize the characteristic of electronic information exchange such as electronic mail and electronic bulletin board for the translation process from the related document other than the translation target document. By extracting such information, the translation accuracy can be improved.

[Brief description of drawings]

【図１】本発明の一実施の形態に係る機械翻訳装置を示
すブロック図。FIG. 1 is a block diagram showing a machine translation device according to an embodiment of the present invention.

【図２】翻訳環境記憶部８内の情報の格納例を示す説明
図。FIG. 2 is an explanatory diagram showing a storage example of information in a translation environment storage unit 8.

【図３】図１中の制御部２にアプリケーション１０から
原文データが入力されてから、訳文を出力するまでの処
理の流れを示すフローチャート。3 is a flowchart showing a flow of processing from input of original text data from an application 10 to output of a translated text in a control unit 2 in FIG.

【図４】図３中のステップＳ305 の翻訳環境設定処理を
具体的に示すフローチャート。4 is a flowchart specifically showing a translation environment setting process in step S305 in FIG.

【図５】図３のステップＳ306 の翻訳処理を具体的に示
すフローチャート。FIG. 5 is a flowchart specifically showing the translation process of step S306 in FIG.

【図６】本発明の他の実施の形態において採用される翻
訳環境設定処理を示すフローチャート。FIG. 6 is a flowchart showing a translation environment setting process adopted in another embodiment of the present invention.

【図７】図６のステップＳ404 において更新される翻訳
環境記憶部８の内容を示す説明図。7 is an explanatory diagram showing the contents of a translation environment storage unit 8 updated in step S404 of FIG.

[Explanation of symbols]

２…制御部、３…翻訳部、４…辞書部、６…関連文書情
報取得部、７…翻訳環境設定部、８…翻訳環境記憶部、
９…文書データ記憶部、１０…アプリケーション。2 ... control unit, 3 ... translation unit, 4 ... dictionary unit, 6 ... related document information acquisition unit, 7 ... translation environment setting unit, 8 ... translation environment storage unit,
9 ... Document data storage unit, 10 ... Application.

Claims

[Claims]

1. Related document information for accessing, as reference document information, document data holding means in which a plurality of document data is stored, and reference sequence document data having a relation with original text data among the plurality of document data is acquired. An acquisition unit, a translation environment setting unit that acquires translation environment information that can be used to translate the original text data from the reference document information acquired by the related document information acquisition unit, and stores the translation environment information in the translation environment holding unit; A translation means for translating the original text data using a dictionary having knowledge information necessary for translation into the second language and the translation environment information held in the translation environment holding means. Machine translation device.

2. The related document information acquisition means obtains reference relationship information that correlates document data of the reference series by analyzing the original text data, and uses the reference relationship information to obtain the reference document. The machine translation apparatus according to claim 1, wherein the machine translation apparatus acquires information.

3. The machine translation device according to claim 2, wherein the related document information acquisition means obtains the reference relation information by analyzing information attached to the original text data.

4. The reference relationship information is at least one of subject information, a sender, a destination, a Cc destination, and a message ID when the original text data is transmission / reception data of an electronic mail. The machine translation device according to claim 2.

5. When the original text data is data posted on a bulletin board on the web, the reference relationship information includes a reference message number of each message, a subject, a contributor, and the location of the data that is the source of the reply. The machine translation device according to claim 2, wherein the machine translation device is at least one of file names.

6. The machine translation apparatus according to claim 1, wherein the document data of the reference sequence is document data of a response regarding the same content.

7. The related document information acquisition unit refers to the translation environment information stored in the translation environment storage unit, and does not acquire the reference document information corresponding to the registered translation environment information. The machine translation device according to claim 1.

8. The machine translation apparatus according to claim 1, wherein the related document information acquisition unit acquires the document data of the reference series in the order of creation.

9. The machine translation device according to claim 1, wherein the related document information acquisition unit acquires the document data of the reference sequence in a reverse order of the creation time.

10. The dictionary includes a plurality of pieces of knowledge information necessary for translation, and the translation environment information includes information for selecting knowledge information to be used for translation from a plurality of pieces of knowledge information included in the dictionary. The machine translation device according to claim 1, wherein:

11. The translation environment holding means stores the phrase of the first language and the phrase of the second language included in the reference document information of the reference sequence as translation environment information, and the translation means The machine translation apparatus according to claim 1, wherein the translation environment information is used when determining a translation word.

12. The translation environment holding means includes the first language and the second language included in the reference document information of the reference sequence.
2. The machine translation device according to claim 1, wherein the translation means stores a translation term as a translation environment information, and the translation unit uses the translation term when determining a translation word.

13. Related document information for accessing, as reference document information, document data holding means in which a plurality of document data is stored, and reference sequence document data having a relation with original text data among the plurality of document data is acquired. An acquisition process, a translation environment setting process for acquiring translation environment information that can be used for translation of the original text data from the reference document information acquired in the related document information acquisition process, and causing the translation environment holding unit to hold the translation environment information; A translation process for translating the original text data by using a dictionary having knowledge information necessary for translation into a second language and the translation environment information held in the translation environment holding means. Machine translation method.

14. A computer accesses a document data holding means in which a plurality of document data is stored, and acquires, as reference document information, document data of a reference series having a relation with original text data among the plurality of document data. A related document information acquisition process, a translation environment setting process for acquiring translation environment information that can be used for translation of the original text data from the reference document information acquired in the related document information acquisition process, and holding the translation environment information in a translation environment holding unit; To execute a translation process for translating the original text data by using a dictionary having knowledge information necessary for translation into one language and a second language and the translation environment information held in the translation environment holding means. Machine translation program.