JPH11224251A

JPH11224251A - Method and device for machine translation

Info

Publication number: JPH11224251A
Application number: JP10038105A
Authority: JP
Inventors: Toshiyuki Sugio; 俊之杉尾
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1998-02-04
Filing date: 1998-02-04
Publication date: 1999-08-17

Abstract

PROBLEM TO BE SOLVED: To realize a machine translating method which meets both requirements of accurate translation and high speed translation. SOLUTION: A source language character string is inputted for a certain range or group according to a time series (step S1) and character strings in the certain range are collated against an example dictionary (step S2). The record of a source language pattern including a character string of the source language is retrieved through the above collation. Then one or more target language patterns of the retrieved record are combined to compose a translation sentence, which is outputted (step S4). If there is deficient information (step S3), the records of the source language pattern corresponding to the information and target language pattern are added as new records to the example dictionary (step S5). Then the updated example dictionary is used to repeat the processing after the collation operation.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、第１の言語を第２
の言語に翻訳する機械翻訳方法および装置に関し、特
に、同時通訳等、実時間での翻訳処理が要求される形態
での利用を想定した機械翻訳方法および装置に関する。[0001] The present invention relates to a method for converting a first language into a second language.
In particular, the present invention relates to a machine translation method and apparatus that is intended for use in a form that requires real-time translation processing such as simultaneous interpretation.

【０００２】[0002]

【従来の技術】近年の計算機ネットワークの普及によ
り、計算機上で扱うテキスト文書（以下、電子化文書と
呼ぶ）の流通が活発となり、個人で計算機を利用するユ
ーザにとっても、外国語で記述された電子化文書を海外
から受信したり、外国語の電子化文書を海外へ発信する
機会も増大している。この状況は、更に、文書処理装置
や携帯型情報処理装置に翻訳機能を組み込んで利用する
という要求を誘引しており、電子化文書に限らず、例え
ば、音声データの形態である文書（以下、音声文書と呼
ぶ）の翻訳、更には、電話に翻訳機能を付加した翻訳電
話への発展が期待されている。このような形態で利用さ
れる機械翻訳装置には、従来から機械翻訳装置に求めら
れてきた正確に翻訳するという要件（＝第１の要件）に
加えて、実時間で高速に翻訳するという第２の要件が重
要となる。2. Description of the Related Art With the spread of computer networks in recent years, the distribution of text documents (hereinafter referred to as "electronic documents") handled on computers has become active, and even users who use computers by themselves have been described in foreign languages. Opportunities for receiving digitized documents from overseas and transmitting digitized documents in foreign languages to foreign countries are also increasing. This situation further induces a demand for incorporating and using a translation function in a document processing device or a portable information processing device, and is not limited to a digitized document. For example, a document in the form of audio data (hereinafter, referred to as a document) It is expected that the translation of a telephone document (hereinafter referred to as a voice document) will be further developed. The machine translation device used in such a form has a requirement for accurate translation (= first requirement) conventionally required of a machine translation device and a requirement for high-speed translation in real time. The second requirement is important.

【０００３】従来の機械翻訳装置の翻訳過程を大まかに
分類すると、解析手段、変換手段、生成手段の３種によ
り構成されている。解析手段は、第１の言語（以下、原
言語と呼ぶ）で記述された入力文を解析して、原言語の
構造を反映した構文木を作成する。変換手段は、解析手
段が作成した目的言語の構文木を第２の言語（以下、目
的言語と呼ぶ）の構造を反映した構文木に変換する。生
成手段は、変換手段が作成した目的言語の構文木から目
的言語の翻訳文を生成する。The translation process of a conventional machine translation device can be roughly classified into three types: an analysis unit, a conversion unit, and a generation unit. The analysis unit analyzes an input sentence described in a first language (hereinafter, referred to as a source language) and creates a syntax tree reflecting the structure of the source language. The conversion means converts the syntax tree of the target language created by the analysis means into a syntax tree reflecting a structure of a second language (hereinafter, referred to as a target language). The generation unit generates a translation of the target language from the syntax tree of the target language created by the conversion unit.

【０００４】しかしながら、前記の方法または手段で
は、解析手段は、原言語の入力文を完全に入力し終わる
まで待機して原言語の構文木を作成し、変換手段は、そ
の前に動作する解析手段が入力文の完全な構文木を作成
するまで待機して原言語の構文木を目的言語の構文木に
変換し、更に、生成手段はその前に動作する変換手段が
目的言語の構文木を完全に作成するまで待機して目的言
語の構文木から目的言語の翻訳文を作成するといったよ
うに、各手段は、その前段階の手段の動作の完了を待た
ないと動作を開始することができない。However, in the above method or means, the analyzing means waits until the input sentence of the source language is completely input to create the syntax tree of the source language, and the converting means operates the analysis tree which operates before that. The source language parse tree is converted into the target language parse tree by waiting until the means generates a complete parse tree of the input sentence. Each unit cannot start its operation without waiting for the completion of the operation of the unit at the previous stage, such as creating a translation of the target language from the syntax tree of the target language after waiting until it is completely created .

【０００５】即ち、従来の機械翻訳装置によれば、順序
に依存する手段で構成されていたため、各手段は独立に
動作を行うことができず、よって、高速化の改良も各手
段の動作の範囲内でしか行うことができず、実時間で高
速に翻訳するという上述した第２の要件を満足する機械
翻訳装置を実現するには限界があった。That is, according to the conventional machine translation apparatus, each means cannot be operated independently because it is constituted by means depending on the order. Therefore, the improvement of the speed is not improved. There is a limit in realizing a machine translation apparatus that can perform translation only within the range and satisfies the above-described second requirement of performing high-speed translation in real time.

【０００６】このような課題を解決するため、例えば、
「文献：特開平８−１７１５６６号公報（機械翻訳装
置）」に開示されるような機械翻訳装置が提案されてい
る。この装置は、処理を中断する処理中断手段を有する
モジュールを複数備え、各モジュールの処理中断手段
は、当該モジュールの処理を中断できるか否かを所定の
規則に基づいて判定し、中断できると判定した場合、そ
の時点までに当該モジュールで得られている処理結果を
当該モジュールの処理結果を受け取る他のモジュールに
送り、中断できないと判定した場合、当該モジュールの
処理を続けるようにしている。また、この装置の各モジ
ュールは、他のモジュールに既に送った処理結果との整
合性が保たれるように、他のモジュールに送ろうとして
いる処理結果を調整する出力結果調整手段を備えてお
り、このような構成により、実時間で高速に翻訳すると
いう上記の第２の要件を満たそうとしている。In order to solve such a problem, for example,
There has been proposed a machine translation apparatus as disclosed in "Document: JP-A-8-171566 (machine translation apparatus)". This apparatus includes a plurality of modules having processing interruption means for interrupting the processing, and the processing interruption means of each module determines whether or not the processing of the module can be interrupted based on a predetermined rule, and determines that the processing can be interrupted. In this case, the processing result obtained by the module up to that point is sent to another module that receives the processing result of the module, and if it is determined that the processing cannot be interrupted, the processing of the module is continued. Also, each module of this apparatus is provided with an output result adjusting means for adjusting the processing result to be sent to another module so that the consistency with the processing result already sent to the other module is maintained. With such a configuration, it is intended to satisfy the above-described second requirement that translation is performed at high speed in real time.

【０００７】[0007]

【発明が解決しようとする課題】上記文献に示されてい
る機械翻訳装置では、それより以前の機械翻訳装置より
は概して高速である翻訳装置を実現することはできる。
しかしながら、正確に翻訳するという前記の第１の要件
も考慮した場合、処理中断のタイミングは、当該モジュ
ールに入力されるデータの意味的な妥当性に制約される
ので、結局、各モジュールの順序依存の制約は解消され
ないことになり、意味的に完璧な翻訳を行おうとする
と、翻訳の各過程を中断できなくなる。従って、この機
械翻訳装置では、それより以前の機械翻訳装置の処理時
間に中断の判断を行う時間が加わり、最悪の場合には、
それより以前の機械翻訳装置より処理速度は更に遅くな
ることが予想される。With the machine translation apparatus disclosed in the above document, it is possible to realize a translation apparatus that is generally faster than earlier machine translation apparatuses.
However, considering the first requirement of accurate translation, the timing of the processing interruption is restricted by the semantic validity of the data input to the module. Will not be resolved, and if you try to perform a semantically perfect translation, you will not be able to interrupt the translation process. Therefore, in this machine translation apparatus, a time for determining the interruption is added to the processing time of the earlier machine translation apparatus, and in the worst case,
It is expected that the processing speed will be even slower than earlier machine translation devices.

【０００８】例えば、日本語を英語に翻訳する場合、日
本語の述語は文末に位置し、また、その文の意図が否定
であるかどうかも文末の述語の活用形をみなければ判断
できないため、入力文の日本語の全てを解析しなけれ
ば、目的言語である英語の文を生成することができな
い。つまり、前記の文献に開示される機械翻訳装置をも
ってしても、翻訳過程の手順の順序依存性は解消されな
いので、正確に翻訳するという第１の要件を満たしなが
ら、実時間で高速に翻訳するという第２の要件も満たす
ことのできる機械翻訳装置は実現できなった。For example, when translating Japanese into English, the Japanese predicate is located at the end of the sentence, and it cannot be determined whether the intention of the sentence is negative without looking at the usage of the predicate at the end of the sentence. Unless all of the Japanese in the input sentence is analyzed, an English sentence, which is the target language, cannot be generated. In other words, even with the machine translation apparatus disclosed in the above-mentioned document, the order dependency of the translation process is not eliminated, so that the translation is performed in real time and at high speed while satisfying the first requirement of accurate translation. A machine translation device that can also satisfy the second requirement has not been realized.

【０００９】このような点から、正確に翻訳するという
第１の要件を満たすと同時に、実時間で高速に翻訳する
ことができるという第２の要件も満たすことのできる機
械翻訳方法および装置の実現が望まれていた。In view of the above, realization of a machine translation method and apparatus capable of satisfying the first requirement for accurate translation and also satisfying the second requirement for high-speed translation in real time. Was desired.

【００１０】[0010]

【課題を解決するための手段】本発明は、前述の課題を
解決するため次の構成を採用する。〈請求項１の構成〉入力文書から時系列に従って原言語
のデータを一定範囲ずつ入力するステップと、原言語パ
ターンと目的言語パターンの組を格納する用例辞書を用
いて、入力された一定範囲のデータの情報を含む原言語
パターンの組を検索するステップと、検索した組の目的
言語パターンを一つ以上組み合わせて翻訳文を合成し、
かつ、合成を行う際に不足する情報があった場合は、こ
の情報に対応する原言語パターンと目的言語パターンの
組を新たな組として用例辞書に追加するステップとを繰
り返して実行し、入力文書を翻訳することを特徴とする
機械翻訳方法である。The present invention employs the following structure to solve the above-mentioned problems. <Structure of Claim 1> A step of inputting source language data in a predetermined range from an input document according to a time series, and using an example dictionary for storing a set of a source language pattern and a target language pattern, Searching for a set of source language patterns including information of data; combining one or more target language patterns of the searched set to synthesize a translated sentence;
In addition, if there is insufficient information when performing the synthesis, the step of adding a set of the source language pattern and the target language pattern corresponding to this information to the example dictionary as a new set is repeatedly executed to execute the input document. Is a machine translation method characterized by translating.

【００１１】〈請求項１の説明〉請求項１の発明は、翻
訳文の作成時に不足部分があった場合は、この不足部分
を新たな用例として追加しながら翻訳処理を行い、不足
部分がないデータを翻訳文として出力するようにしたも
のである。これにより、翻訳処理における順序依存性を
解消し、正確に翻訳するという条件を満たしながら、実
時間で高速に翻訳することができるという効果が得られ
る。<Explanation of Claim 1> According to the invention of claim 1, if there is a missing part at the time of creating a translated sentence, the translation processing is performed while adding the missing part as a new example, and there is no missing part. The data is output as a translation. As a result, there is an effect that translation can be performed in real time and at high speed while eliminating the order dependency in the translation process and satisfying the condition of performing accurate translation.

【００１２】入力文書は、例えば電子化文書であるが、
音声媒体であってもよい。また、原言語および目的言語
とは例えば日本語と英語であるが、どのような言語であ
っても適用可能である。また、検索の結果、一定範囲の
データの情報を含む原言語パターンの組は複数存在して
いても構わない。更に、翻訳文を合成する場合、組み合
わせる目的言語パターンは一つあるいは複数いずれであ
ってもよい。The input document is, for example, an electronic document.
It may be an audio medium. The source language and the target language are, for example, Japanese and English, but any language is applicable. Also, as a result of the search, a plurality of sets of source language patterns including information of a certain range of data may exist. Furthermore, when synthesizing a translated sentence, one or more target language patterns may be combined.

【００１３】〈請求項２の構成〉入力文書から時系列に
従って原言語のデータを一定範囲ずつ入力するタイミン
グゲートと、原言語パターンと目的言語パターンの組か
らなるレコードを複数格納する用例辞書と、タイミング
ゲートから入力された原言語の一定範囲のデータと、用
例辞書の原言語パターンを照合し、一定範囲のデータに
一致する情報を含む原言語パターンのレコードを検索す
る照合手段と、照合手段が検索したレコードの目的言語
パターンを一つ以上組み合わせて翻訳文を合成し、か
つ、合成を行う際に不足する情報があった場合は、この
情報に対応する原言語パターンと目的言語パターンの組
を新たなレコードとして用例辞書に追加する予測変換手
段を備えたことを特徴とする機械翻訳装置である。<Structure of Claim 2> A timing gate for inputting source language data from the input document in a fixed range according to time series, an example dictionary for storing a plurality of records each comprising a set of source language pattern and target language pattern, Matching means for matching a certain range of source language data input from the timing gate with a source language pattern in the example dictionary and searching for a source language pattern record including information matching the certain range of data, If a translated sentence is synthesized by combining one or more target language patterns of the searched records, and there is insufficient information when performing the synthesis, the set of the source language pattern and target language pattern corresponding to this information is A machine translation device comprising a predictive conversion unit for adding a new record to an example dictionary as a new record.

【００１４】〈請求項２の説明〉請求項２の発明は、請
求項１の発明の機械翻訳方法を実現した機械翻訳装置の
発明である。タイミングゲートは、例えば、一定時間毎
に原言語の文字列を入力する。照合手段は、タイミング
ゲートに入力された一定範囲の文字列と用例辞書の原言
語パターンとを照合し、一定範囲の文字列を含む原言語
パターンのレコードを検索する。ここで、一定範囲の文
字列を含む原言語パターンとは、一定範囲の文字列と原
言語パターンが完全一致している場合と、原言語パター
ンの一部分に、一定範囲の文字列が一致している場合と
を含むものとする。<Explanation of Claim 2> The invention of claim 2 is an invention of a machine translation apparatus which realizes the machine translation method of the invention of claim 1. The timing gate inputs, for example, a character string in the source language at regular intervals. The collating means collates a certain range of the character string input to the timing gate with the source language pattern of the example dictionary, and searches for a record of the source language pattern including the certain range of the character string. Here, the source language pattern including a certain range of character strings is defined as a case where the certain range of the character string completely matches the source language pattern, and a case where the certain range of the character string matches a part of the source language pattern. Is included.

【００１５】予測変換手段は、照合手段で検索したレコ
ードの目的言語パターンを一つ用いて、または複数組み
合わせて翻訳文を合成する。ここで、翻訳文を合成する
場合に不足情報が存在するとは、照合手段で検索された
レコードで原言語パターンに一定範囲の文字列が一致し
ていない部分があり、その結果、目的言語パターンで特
定できない情報が存在する場合である。そこで、予測変
換手段は、このような情報を新たなレコードとして用例
辞書に追加する。The predictive conversion means synthesizes a translated sentence using one target language pattern of the record searched by the matching means or a combination of a plurality of target language patterns. Here, when there is missing information when synthesizing a translated sentence, it means that there is a portion where a certain range of character strings does not match the source language pattern in the record searched by the matching means, and as a result, the target language pattern This is when there is information that cannot be specified. Therefore, the predictive conversion unit adds such information as a new record to the example dictionary.

【００１６】照合手段は、それ以降の原言語文字列の照
合を、追加されたレコードを含めて行うので、新たに照
合する原言語文字列の中で、追加されたレコードの原言
語パターンに一致するものがあれば、予測変換手段は、
これを用いて翻訳文を合成し、出力する。The collation means performs collation of the subsequent source language character strings including the added record. Therefore, in the source language character strings to be newly collated, the collation means matches the source language pattern of the added record. If there is something to do, the predictive conversion means
This is used to synthesize and output a translated sentence.

【００１７】このように、翻訳文として不足情報がない
場合は、そのまま翻訳結果として出力しながら、不足情
報は新たな用例として追加し、それ以降の照合処理を続
行するので、実時間で高速に翻訳することができるとい
う条件を満たしながら、正確な翻訳文が得られるという
条件も満たすことができる。As described above, if there is no missing information as a translated sentence, the missing information is added as a new example while the translation result is output as it is, and subsequent matching processing is continued. While satisfying the condition that translation can be performed, the condition that an accurate translation can be obtained can be satisfied.

【００１８】また、請求項２の発明においても、入力文
書は、文字列以外に音声媒体等であってもよく、原言語
および目的言語もどのような言語であっても適用可能で
ある。Also in the second aspect of the invention, the input document may be a voice medium or the like other than the character string, and the source language and the target language may be any language.

【００１９】[0019]

【発明の実施の形態】以下、本発明の実施の形態を図面
を用いて詳細に説明する。《具体例》図１は本発明の機械翻訳方法の具体例を示す
フローチャートであるが、この説明に先立ち、本発明の
機械翻訳装置を説明する。Embodiments of the present invention will be described below in detail with reference to the drawings. << Specific Example >> FIG. 1 is a flowchart showing a specific example of the machine translation method of the present invention. Prior to this description, the machine translation apparatus of the present invention will be described.

【００２０】〈構成〉図２は、本発明の機械翻訳装置の
具体例の構成図である。図の装置は、タイミングゲート
２、入力バッファ３、照合手段４、予測変換手段５、用
例辞書６からなり、また、図中、１は原言語文字列から
なる入力文書、７は目的言語文字列からなる出力文書を
示している。<Structure> FIG. 2 is a diagram showing the structure of a specific example of the machine translation apparatus according to the present invention. The apparatus shown in the figure comprises a timing gate 2, an input buffer 3, a matching means 4, a predictive conversion means 5, and an example dictionary 6. In the figure, 1 is an input document consisting of a source language character string, and 7 is a target language character string. Is shown.

【００２１】タイミングゲート２は、入力文書１から原
言語のデータを時系列に従って入力するためのモジュー
ルである。入力バッファ３は、タイミングゲート２で入
力したデータのうち、未処理のデータを一時保管するた
めのバッファである。照合手段４は、入力データである
原言語文字列と用例辞書６中の用例レコードとを照合
し、原言語文字列に適合する用例を検索するモジュール
である。予測変換手段５は、照合手段４が検索した用例
を元に不足情報を予測しながら原言語文字列を目的言語
文字列に変換するモジュールである。用例辞書６は、照
合手段４と予測変換手段５の動作に利用される用例を登
録した辞書である。The timing gate 2 is a module for inputting source language data from the input document 1 in chronological order. The input buffer 3 is a buffer for temporarily storing unprocessed data among the data input by the timing gate 2. The matching unit 4 is a module that matches a source language character string as input data with an example record in the example dictionary 6 and searches for an example that matches the source language character string. The prediction conversion unit 5 is a module that converts a source language character string into a target language character string while predicting missing information based on the example searched by the matching unit 4. The example dictionary 6 is a dictionary in which examples used for operations of the matching unit 4 and the predictive conversion unit 5 are registered.

【００２２】図３は、用例辞書６の構成を示す説明図で
ある。ここで、図３の（ａ）は、用例辞書６の構成を表
し、（ｂ）は用例辞書６の実際の構成例を表している。FIG. 3 is an explanatory diagram showing the structure of the example dictionary 6. Here, FIG. 3A shows the configuration of the example dictionary 6, and FIG. 3B shows the actual configuration example of the example dictionary 6.

【００２３】図３（ａ）に示すように、用例辞書６には
「／」で区切られた二つのフィールド（１０１，１０
２）から構成される形式の複数のレコードが任意の数だ
け格納される。As shown in FIG. 3A, the example dictionary 6 has two fields (101, 10
An arbitrary number of records of the format composed of 2) are stored.

【００２４】用例辞書６の最初のフィールド１０１に
は、タイミングゲート２を経て入力される原言語文字列
との照合の対象となるパターン（以下、原言語パターン
と呼ぶ）が記述される。原言語パターンには、「＄」と
数字で表記される任意の単語への照合を示す記号（以
下、任意パターンと呼ぶ）と、「＝｛…｝」で表記され
る任意パターンの条件（以下、属性条件と呼ぶ）の記述
が許されている。The first field 101 of the example dictionary 6 describes a pattern to be collated with a source language character string input through the timing gate 2 (hereinafter referred to as a source language pattern). The source language pattern includes a symbol (hereinafter, referred to as an arbitrary pattern) indicating matching with an arbitrary word represented by a numeral “＄” and a condition of an arbitrary pattern represented by “= ｛. , Attribute conditions) are allowed.

【００２５】属性条件の記述は、図３（ｂ）の構成例に
おいては、便宜上「品詞：変化形」としているが、目的
に応じて、条件となる属性を列挙しても構わない。任意
パターンの数字部分は、この任意パターンの照合位置を
識別するために付与される。従って、原言語パターン中
に任意パターンを複数記述する場合には、任意パターン
の数字部分を区別して記述することになる。The description of the attribute condition is "part of speech: variant" for convenience in the configuration example of FIG. 3B, but the attribute serving as the condition may be listed according to the purpose. The numeral portion of the arbitrary pattern is given to identify the collation position of the arbitrary pattern. Therefore, when a plurality of arbitrary patterns are described in the source language pattern, the numerical portions of the arbitrary patterns are distinguished and described.

【００２６】例えば、任意の名詞句に照合する任意パタ
ーンが、原言語パターン中に１個存在する場合には、
「＄１＝｛名詞句｝」と記述する（図中、１０３に示
す）。また、動詞の未然形に照合する任意パターンは、
「＄１＝｛動詞：未然｝」として記述される（図中、１
０４に示す）。For example, when there is one arbitrary pattern in the source language pattern to be matched with an arbitrary noun phrase,
It is described as “{1 = {noun phrase}” (shown at 103 in the figure). Also, the arbitrary pattern that matches the verb's intact form is
It is described as "{1 = {verb: unpredictable}".
04).

【００２７】一方、用例辞書６のレコードの２番目のフ
ィールド１０２には、原言語パターンに適合した部分を
目的言語に変換するためのパターン（以下、目的言語パ
ターンと呼ぶ）が記述される。目的言語パターンの表記
は、原言語パターンと同様である。但し、変換時に、属
性条件を変更することが可能であり、例えば、原言語の
名詞である任意パターンを目的言語の名詞に変換した後
に、変換された目的言語の名詞を目的言語の動詞の完了
形に変換する場合には、任意パターンの数字部分を共通
にして、原言語パターンに「＄１＝｛名詞｝」を目的言
語パターンに「＄１＝｛動詞：完了｝」を記述する（図
中、１０５に示す）。On the other hand, in the second field 102 of the record of the example dictionary 6, a pattern (hereinafter, referred to as a target language pattern) for converting a portion conforming to the source language pattern into a target language is described. The notation of the target language pattern is the same as that of the source language pattern. However, it is possible to change the attribute condition at the time of conversion. For example, after converting an arbitrary pattern that is a noun in the source language into a noun in the target language, the noun in the converted target language is converted to a verb in the target language. In the case of conversion to the form, the number part of the arbitrary pattern is made common, and “＄ 1 = ｛noun｝” is described in the source language pattern and “＄ 1 = ｛verb: complete｝” is described in the target language pattern (see FIG. Medium, 105).

【００２８】〈動作〉次に、本発明の機械翻訳方法を図
１のフローチャートを用いて説明する。先ず、タイミン
グゲート２は、入力文書１から原言語文字列を時系列に
従って入力する（ステップＳ１）。そして、このデータ
は入力バッファ３に格納され、照合手段４は、この入力
バッファ３に格納されている原言語文字列を取り出し
て、用例辞書６の用例と照合し、この原言語文字列を含
む用例を検索する（ステップＳ２）。<Operation> Next, the machine translation method of the present invention will be described with reference to the flowchart of FIG. First, the timing gate 2 inputs a source language character string from the input document 1 in chronological order (step S1). Then, this data is stored in the input buffer 3, and the collating means 4 extracts the source language character string stored in the input buffer 3, compares it with the example in the example dictionary 6, and includes the original language character string. An example is searched (step S2).

【００２９】その後、予測変換手段５は、照合手段４が
検索した用例の目的言語パターンを一つ以上組み合わ
せ、翻訳文を合成して出力する（ステップＳ４）。ここ
で、予測変換手段５は、合成を行う際に不足情報がある
かを判定し（ステップＳ３）、不足情報がなかった場合
にステップＳ４に移行する。一方、ステップＳ３におい
て、不足情報があった場合は、この不足情報に対応する
原言語パターンの一部と目的言語パターンの一部とから
なる組を新たな用例として用例辞書６に追加し（ステッ
プＳ５）、ステップＳ２に戻る。Thereafter, the predictive conversion means 5 combines one or more target language patterns of the example searched by the collating means 4, synthesizes and outputs a translated sentence (step S4). Here, the predictive conversion means 5 determines whether or not there is missing information when performing synthesis (step S3). If there is no missing information, the process proceeds to step S4. On the other hand, if there is missing information in step S3, a set consisting of a part of the source language pattern and a part of the target language pattern corresponding to the missing information is added to the example dictionary 6 as a new example (step S3). S5), returning to step S2.

【００３０】そして、このような動作を繰り返すことに
より入力文書１の原言語文字列を目的言語の翻訳文とし
て出力する。By repeating such operations, the source language character string of the input document 1 is output as a translation of the target language.

【００３１】次に、上記のタイミングゲート２〜予測変
換手段５の動作を各モジュール毎に説明する。図４は、
タイミングゲート２の動作を示すフローチャートであ
る。先ず、タイミングゲート２は、予め設定されている
一定時間が経過するまで、入力文書１から原言語文字列
を逐次入力して入力バッファ３に格納する（ステップＳ
１０１、Ｓ１０２）。ステップＳ１０２で、一定時間が
経過したことが判明すれば、入力バッファ３に原言語文
字列が存在するかを判定し（ステップＳ１０３）、入力
バッファ３に原言語文字列がまだ格納されていない状態
であるならば、タイミングゲート２は入力文書１の到着
待ちの状態であるので、ステップＳ１０１に戻り、次の
一定時間が経過するまで入力動作を繰り返す。Next, the operation of the timing gate 2 to the predictive conversion means 5 will be described for each module. FIG.
5 is a flowchart showing the operation of the timing gate 2. First, the timing gate 2 sequentially inputs a source language character string from the input document 1 and stores it in the input buffer 3 until a predetermined time has elapsed (step S).
101, S102). If it is determined in step S102 that the predetermined time has elapsed, it is determined whether a source language character string exists in the input buffer 3 (step S103), and a state in which the source language character string has not been stored in the input buffer 3 yet. If so, the timing gate 2 is in a state of waiting for the input document 1 to arrive, so the process returns to step S101 and repeats the input operation until the next fixed time elapses.

【００３２】また、ステップＳ１０３において、入力バ
ッファ３に原言語文字列が存在すれば、入力バッファ３
に格納されている原言語文字列を照合手段４に転送する
（ステップＳ１０４）。In step S103, if the source language character string exists in the input buffer 3,
Is transferred to the matching means 4 (step S104).

【００３３】しかる後に、タイミングゲート２は、入力
バッファ３に格納されている原言語文字列を消去するこ
とにより入力バッファ３を初期化する（ステップＳ１０
５）。最後にタイミングゲート２は、入力文書１の全て
のデータの入力が完了したかどうかを検査し（ステップ
Ｓ１０６）、未入力のデータが存在するならば、ステッ
プＳ１０１に戻り、上述した処理を繰り返す。一方、ス
テップＳ１０６で、入力文書１の全てのデータの入力が
完了していた場合、タイミングゲート２は動作を終了す
る。Thereafter, the timing gate 2 initializes the input buffer 3 by deleting the source language character string stored in the input buffer 3 (step S10).
5). Finally, the timing gate 2 checks whether the input of all the data of the input document 1 has been completed (step S106), and if there is any uninput data, returns to step S101 and repeats the above-described processing. On the other hand, if it is determined in step S106 that all data of the input document 1 has been input, the timing gate 2 ends the operation.

【００３４】尚、タイミングゲート２は動作を終了した
場合であっても、本装置の入力となる新たな入力文書１
が到着すると再起動するような設定にしておけば、ステ
ップＳ１０１以降の処理を適宜繰り返すことができる。It should be noted that even if the operation of the timing gate 2 is completed, the new input document
If the setting is made such that the system is restarted when arrives, the processing from step S101 can be repeated as appropriate.

【００３５】次に、照合手段４の動作を説明する。図５
は、照合手段４の動作を示すフローチャートである。照
合手段４は、上述したステップＳ１０５に示すタイミン
グゲート２の動作に同期して、タイミングゲート２から
原言語文字列を受け取り、照合手段４に設けられている
検索対象である文字列を一時格納する記憶域（以下、検
索バッファと呼ぶ）に追加・格納する（ステップＳ２０
１）。Next, the operation of the matching means 4 will be described. FIG.
5 is a flowchart showing the operation of the matching means 4. The matching unit 4 receives the source language character string from the timing gate 2 in synchronization with the operation of the timing gate 2 shown in step S105 described above, and temporarily stores the character string to be searched provided in the matching unit 4. Add / store in a storage area (hereinafter referred to as a search buffer) (step S20)
1).

【００３６】次に、この時点で検索バッファに格納され
ている文字列と用例辞書６に格納されているレコード
（以下、用例レコードと呼ぶ）の原言語パターンを照合
する（ステップＳ２０２）。ここで、当該検索文字列が
適合する用例レコードが存在するかどうかを検査する
（ステップＳ２０３）。尚、ステップＳ２０３の適合す
るか否かの判断は、当該検索文字列が、用例レコードの
原言語パターンに部分適合する場合（＝検索文字列が原
言語パターンに含まれる場合）も真であるとする。Next, the character string stored in the search buffer at this time is compared with the source language pattern of a record (hereinafter, referred to as an example record) stored in the example dictionary 6 (step S202). Here, it is checked whether or not there is an example record that matches the search character string (step S203). It should be noted that the determination as to whether or not matching is made in step S203 is true when the search character string partially matches the source language pattern of the example record (= when the search character string is included in the source language pattern). I do.

【００３７】ステップＳ２０３において、適合する用例
レコードが存在しないと判明した場合には、ステップＳ
２０１に戻り、上述したステップＳ２０１〜Ｓ２０３の
処理を繰り返す。これにより、新たな原言語文字列がタ
イミングゲート２から入力され、検索バッファに格納さ
れる。以下、新たな検索バッファの文字列で照合動作が
繰り返される。If it is determined in step S203 that there is no matching example record, the process proceeds to step S203.
Returning to 201, the processing of steps S201 to S203 described above is repeated. As a result, a new source language character string is input from the timing gate 2 and stored in the search buffer. Thereafter, the collating operation is repeated with the character string in the new search buffer.

【００３８】また、ステップＳ２０３で、適合する用例
レコードが存在すると判明した場合、照合手段４は、当
該検索文字列と適合した用例レコード（複数存在しても
構わない）を予測変換手段５に転送する（ステップＳ２
０４）。If it is determined in step S203 that there is a matching example record, the matching unit 4 transfers the matching example matching record (a plurality of example records may exist) to the predictive conversion unit 5. (Step S2
04).

【００３９】最後に、照合手段４は、検索バッファに格
納されている文字列を消去することにより、検索バッフ
ァを初期化し（ステップＳ２０５）、動作を終了する。
尚、照合手段４は、動作を終了した場合であっても、タ
イミングゲート２から新たな原言語文字列が到着すると
再起動するような設定にしておけば、ステップＳ２０１
以降の動作を適宜繰り返すことができる。Finally, the collating means 4 initializes the search buffer by erasing the character string stored in the search buffer (step S205), and ends the operation.
Even if the collating unit 4 has completed the operation, if it is set to restart when a new source language character string arrives from the timing gate 2, step S 201.
Subsequent operations can be repeated as appropriate.

【００４０】次に、予測変換手段５の動作を説明する。
図６は、予測変換手段５の動作を示すフローチャートで
ある。予測変換手段５は、先ず、ステップＳ２０４に示
す照合手段４の動作に同期して、照合手段４から検索文
字列に適合した用例レコードを受け取る（ステップＳ３
０１）。ここで、受け取った一つ以上の用例レコードの
目的言語パターンに従って翻訳文を合成する（ステップ
Ｓ３０２）。その結果、情報が不足するために完全な翻
訳文が生成できない場合は（ステップＳ３０３）、この
不足情報部分を含む新たな用例レコードを作成して用例
辞書６に登録する（ステップＳ３０４）。Next, the operation of the predictive conversion means 5 will be described.
FIG. 6 is a flowchart showing the operation of the predictive conversion means 5. First, the predictive conversion unit 5 receives an example record matching the search character string from the matching unit 4 in synchronization with the operation of the matching unit 4 shown in step S204 (step S3).
01). Here, a translated sentence is synthesized according to the target language pattern of the received one or more example records (step S302). As a result, if a complete translation cannot be generated due to lack of information (step S303), a new example record including this missing information portion is created and registered in the example dictionary 6 (step S304).

【００４１】ステップＳ３０４の後、予測変換手段５
は、ステップＳ３０１に戻り、照合手段４から新たな検
索文字列と、検索文字列に適合した用例レコードを受け
取り、ステップＳ３０１以降の処理を繰り返す。After step S304, the prediction conversion means 5
Returns to step S301, receives a new search character string and an example record matching the search character string from the matching unit 4, and repeats the processing from step S301.

【００４２】また、ステップＳ３０３において、完全な
翻訳文が生成できた場合、予測変換手段５は、この翻訳
文を出力文書７として出力し（ステップＳ３０５）、動
作を終了する。尚、予測変換手段５は、動作を終了した
場合であっても、照合手段４から新たな検索文字列と、
この検索文字列に適合した用例レコードが到着すると再
起動するような設定にしておけば、ステップＳ３０１以
降の動作を適宜繰り返すことができる。If a complete translation is generated in step S303, the predictive conversion means 5 outputs this translation as an output document 7 (step S305), and ends the operation. Note that, even when the operation is completed, the predictive conversion unit 5 outputs a new search character string
If the setting is made to restart when an example record that matches the search character string arrives, the operation after step S301 can be repeated as appropriate.

【００４３】以上のような機械翻訳装置の動作は、機械
翻訳装置の役割を行うコンピュータのプログラムによる
制御で実現することができる。従って、そのプログラム
をフロッピディスクやＣＤ−ＲＯＭといった種々の記録
媒体に記録してから、一般のコンピュータにインストー
ルしたり、あるいはネットワークを介してダウンロード
するといった手段により、本発明の機械翻訳装置を実現
することができる。The operation of the machine translation apparatus as described above can be realized by control by a program of a computer serving as the machine translation apparatus. Therefore, the machine translation apparatus of the present invention is realized by means of recording the program on various recording media such as a floppy disk and a CD-ROM, and then installing the program on a general computer or downloading the program via a network. be able to.

【００４４】以下、入力文書１として具体的な例を用い
て、上記の動作を更に説明する。ここで、入力文書１と
して、（原文）「質の高い情報を発信できない企業は成長しな
いと言われている」なる日本語文書が入力され、この日本語文書を英語に翻
訳する場合を想定して説明する。尚、用例辞書６には、
図３（ｂ）に示すレコードが格納されているものとす
る。Hereinafter, the above operation will be further described using a specific example as the input document 1. Here, it is assumed that a Japanese document "(Original)" It is said that a company that cannot transmit high-quality information will not grow "is input and that this Japanese document is translated into English. Will be explained. The example dictionary 6 contains
It is assumed that the record shown in FIG. 3B is stored.

【００４５】先ず、タイミングゲート２が、一定時間
（例えば、数秒）の間、上記の原文の一部を入力バッフ
ァ３に入力する（ステップＳ１０１〜１０３）。その結
果、「質の高い情報を」が入力バッファ３に格納された
ものとする。次に、タイミングゲート２は、入力バッフ
ァ３に格納されている上記の原言語文字列「質の高い情
報を」を照合手段４に転送する（ステップＳ１０４）。First, the timing gate 2 inputs a part of the original sentence to the input buffer 3 for a fixed time (for example, several seconds) (steps S101 to S103). As a result, "high quality information" is assumed to be stored in the input buffer 3. Next, the timing gate 2 transfers the source language character string “high quality information” stored in the input buffer 3 to the matching unit 4 (step S104).

【００４６】この動作に同期して、照合手段４はタイミ
ングゲート２から原言語文字列「質の高い情報を」を受
け取り、検索バッファに追加・格納する（ステップＳ２
０１）。次に、照合手段４は、検索バッファに格納され
ている検索文字列「質の高い情報を」と、用例辞書６の
各用例レコードの原言語パターンを照合する（ステップ
Ｓ２０２）。In synchronization with this operation, the collating means 4 receives the source language character string "high quality information" from the timing gate 2 and adds / stores it in the search buffer (step S2).
01). Next, the matching unit 4 checks the search character string “high quality information” stored in the search buffer against the source language pattern of each example record in the example dictionary 6 (step S202).

【００４７】その結果、レコード１０３とレコード１０
６の原言語パターンが適合するので（ステップＳ２０
２）、この検索文字列「質の高い情報を」と、適合した
レコード１０３「／＄１＝｛名詞句｝を／＄１＝｛名詞
句｝／」と、レコード１０６「／質の高い情報／high-q
uality information／」を予測変換手段５に転送する
（ステップＳ２０４）。As a result, record 103 and record 10
6 match the source language pattern (step S20).
2), the search character string "high quality information", the matched record 103 "/ {1 = {noun phrase} / {1 = {noun phrase} /"", and the record 106" / high quality information / High-q
uality information / ”is transferred to the predictive conversion means 5 (step S204).

【００４８】この動作に同期して、予測変換手段５は、
上記の文字列と二つのレコードを受け取る（ステップＳ
３０１）。次に、予測変換手段５は、受け取った二つの
レコードの目的言語パターンに従って翻訳文を合成する
（ステップＳ３０２）。この場合は、レコード１０３の
目的言語パターン「＄１＝｛名詞句｝」にレコード１０
６の目的言語パターン「high-quality information」を
適用し、「high-quality information」なる翻訳文１が
合成される。この翻訳文１には不足情報は存在しないの
で（ステップＳ３０３）、そのまま出力文書７として出
力される（ステップＳ３０５）。In synchronization with this operation, the prediction conversion means 5
Receive the above character string and two records (step S
301). Next, the predictive conversion unit 5 synthesizes a translated sentence according to the target language patterns of the two received records (step S302). In this case, the target language pattern “{1 = {noun phrase}”
The target language pattern “high-quality information” of No. 6 is applied, and the translated sentence 1 “high-quality information” is synthesized. Since there is no missing information in the translated sentence 1 (step S303), it is output as it is as the output document 7 (step S305).

【００４９】以上、一連の動作の間に一定時間経過する
ので、タイミングゲート２は、上記の原文の一部を新た
に入力バッファ３に入力する（ステップＳ１０１〜Ｓ１
０３）。その結果、「発信できない」が入力バッファ３
に格納されたものとする。As described above, since a fixed time elapses during a series of operations, the timing gate 2 newly inputs a part of the original sentence to the input buffer 3 (steps S101 to S1).
03). As a result, "cannot send" is displayed in input buffer 3
Shall be stored in

【００５０】次に、タイミングゲート２は、入力バッフ
ァ３に格納されている上記の原言語文字列「発信できな
い」を照合手段４に転送する（ステップＳ１０４）。こ
の動作に同期して照合手段４はタイミングゲート２から
原言語文字列「発信できない」を受け取り、検索バッフ
ァに追加・格納する（ステップＳ２０１）。Next, the timing gate 2 transfers the source language character string "cannot be transmitted" stored in the input buffer 3 to the collating means 4 (step S104). In synchronization with this operation, the collation means 4 receives the source language character string "cannot be transmitted" from the timing gate 2 and adds / stores it in the search buffer (step S201).

【００５１】次に、照合手段４は、検索バッファに格納
されている検索文字列「発信できない」と、用例辞書６
の各用例レコードの原言語パターンを照合する（ステッ
プＳ２０２）。その結果、レコード１０５とレコード１
０７の原言語パターンが適合するので（ステップＳ２０
３）、この検索文字列「発信できない」と、適合したレ
コード１０５「／＄１＝｛名詞｝できない＄２＝｛名
詞｝は／have to be ＄１＝｛動詞：完了｝，unless ot
herwise ＄２＝｛名詞｝／」と、レコード１０７「／発
信／transmission／」を予測変換手段５に転送する（ス
テップＳ２０４）。Next, the collating means 4 determines that the search character string "cannot be transmitted" stored in the search buffer is "not sent".
The source language pattern of each example record is compared (step S202). As a result, record 105 and record 1
07 match the source language pattern (step S20).
3), this search character string "cannot be transmitted", and the matching record 105 "/ {1 = {noun} cannot} 2 = {noun} is / have to be {1 = {verb: complete}, unless ot"
Herwise {2 = {noun} / "and the record 107" / transmission / transmission / "are transferred to the predictive conversion means 5 (step S204).

【００５２】この動作に同期して予測変換手段５は、上
記の文字列と二つのレコードを受け取る（ステップＳ３
０１）。次に、予測変換手段５は、受け取った二つのレ
コードの目的言語パターンに従って翻訳文を合成する
（ステップＳ３０２）。この場合は、レコード１０５の
目的言語パターン「＄１＝｛名詞句｝」にレコード１０
７の目的言語パターン「transmission」を適用し、更
に、適用した＄１の部分「transmission」を動詞の完了
形「transmitted」に変換し、「have to be transmitte
d,unless otherwise ＄２＝｛名詞｝」なる翻訳文２が
合成される。この翻訳文には不足情報として＄２が存在
するので（ステップＳ３０３）、この不足情報部分を含
む「／＄２＝｛名詞｝は／＄２＝｛名詞｝／」が追加レ
コードとして用例辞書６に登録される（ステップＳ３０
４）。そして、翻訳文２から不足部分を除いた「have t
o be transmitted,unless otherwise」が翻訳文３とし
て保留される。In synchronization with this operation, the predictive conversion means 5 receives the character string and two records (step S3).
01). Next, the predictive conversion unit 5 synthesizes a translated sentence according to the target language patterns of the two received records (step S302). In this case, the target language pattern “{1 = {noun phrase}”
7 is applied to the target language pattern "transmission", and the applied "1" part "transmission" is converted to the verb completed form "transmitted", and "have to be transmitted"
d, unless otherwise {2 = {noun} ”is synthesized. Since there is $ 2 as missing information in this translated sentence (step S303), "/ {2 = {noun} is / {2 = {noun} /""including this missing information portion is used as an additional record as an example dictionary 6. (Step S30)
4). Then, “have t
o be transmitted, unless otherwise ”is retained as translation 3.

【００５３】以上、一連の動作の間に一定時間経過する
ので、タイミングゲート２は、上記の原文の一部を新た
に入力バッファ３に入力する（ステップＳ１０１〜Ｓ１
０３）。その結果、「企業は成長しない」が入力バッフ
ァ３に格納されたものとする。As described above, since a certain time elapses during a series of operations, the timing gate 2 newly inputs a part of the original sentence to the input buffer 3 (steps S101 to S1).
03). As a result, it is assumed that “the company does not grow” is stored in the input buffer 3.

【００５４】次に、タイミングゲート２は、入力バッフ
ァ３に格納されている上記の原言語文字列「企業は成長
しない」を照合手段４に転送する（ステップＳ１０
４）。この動作に同期して照合手段４はタイミングゲー
ト２から原言語文字列「企業は成長しない」を受け取
り、検索バッファに追加・格納する（ステップＳ２０
１）。Next, the timing gate 2 transfers the above-mentioned source language character string "the company does not grow" stored in the input buffer 3 to the collating means 4 (step S10).
4). In synchronization with this operation, the collation means 4 receives the source language character string "company does not grow" from the timing gate 2 and adds / stores it in the search buffer (step S20).
1).

【００５５】次に、照合手段４は、検索バッファに格納
されている検索文字列「企業は成長しない」と、用例辞
書６の各用例レコードの原言語パターンを照合する（ス
テップＳ２０２）。その結果、レコード１０４とレコー
ド１０８とレコード１０９、および、不足情報を含むレ
コードとして新たに登録された追加レコードの原言語パ
ターンが照合するので（ステップＳ２０３）、この検索
文字列「企業は成長しない」と、適合したレコード１０
４「／＄１＝｛動詞：未然｝ない／not be able to ＄
１＝｛動詞：原形｝／」と、レコード１０８「／企業／
company」と、レコード１０９「／成長する／grow up
／」と、追加レコード「／＄２＝｛名詞｝は／＄２＝
｛名詞｝」を予測変換手段５に転送する（ステップＳ２
０４）。Next, the collating means 4 collates the search character string "company does not grow" stored in the search buffer with the source language pattern of each example record in the example dictionary 6 (step S202). As a result, since the source language patterns of the records 104, 108, 109, and the additional record newly registered as the record including the lack information are collated (step S203), this search character string "the company does not grow". And the matched record 10
4 “/ ＄ 1 = ｛verb: not yet possible / not be able to ＄
1 = {verb: prototype} / ”and record 108“ / company /
company "and record 109" / grow up
/ ”And additional record“ / ＄ 2 = {noun} is / ＄ 2 =
Is transferred to the predictive conversion means 5 (step S2).
04).

【００５６】この動作に同期して予測変換手段５は、上
記の文字列と四つのレコードを受け取る（ステップＳ３
０１）。次に、予測変換手段５は、受け取った四つのレ
コードの目的言語パターンに従って翻訳文を合成する
（ステップＳ３０２）。この場合は、追加レコードの目
的言語パターン「＄２＝｛名詞｝」にレコード１０８の
目的言語パターン「company」を適用し、更に、レコー
ド１０４の目的言語パターン「not be able to ＄１＝
｛動詞：原形｝」にレコード１０９の目的言語パターン
「grow up」を適用し、「company not be able to grow
up」なる翻訳文４が生成される。更に、以前の処理で
保留されていた翻訳文３と合わせて、翻訳文５が生成さ
れ（ステップＳ３０２）、「have to be transmitted,u
nless otherwise company not be able to grow up」が
出力される（ステップＳ３０５）。In synchronization with this operation, the predictive conversion means 5 receives the character string and four records (step S3).
01). Next, the predictive conversion unit 5 synthesizes a translated sentence according to the target language patterns of the received four records (step S302). In this case, the target language pattern “company” of the record 108 is applied to the target language pattern “＄ 2 = {noun}” of the additional record, and the target language pattern “not beable to ＄ 1 =
Apply the target language pattern “grow up” of record 109 to {verb: original form} and “company not be able to grow”
The translated sentence 4 “up” is generated. Further, a translated sentence 5 is generated together with the translated sentence 3 held in the previous process (step S302), and "have to be transmitted, u" is generated.
"nless otherwise company not be able to grow up" is output (step S305).

【００５７】同様にして、残りの原言語文字列「と言わ
れている」も、レコード１１０により「,it is said」
なる翻訳文６に変換され、結果として、翻訳文１、翻訳
文５および翻訳文６が順次出力され、以下の訳文が得ら
れる。In the same manner, the remaining source language character string “said” is also recorded in the record 110 as “, it is said”.
The translated sentence 6 is converted into the translated sentence 6, and as a result, the translated sentence 1, the translated sentence 5, and the translated sentence 6 are sequentially output, and the following translated sentence is obtained.

【００５８】（訳文）「high-quality information hav
e to be transmitted,unless otherwise company not b
e able to grow up, it is said.」(Translation) "high-quality information hav
e to be transmitted, unless otherwise company not b
e able to grow up, it is said.

【００５９】〈効果〉以上説明したように、上記具体例
によれば、原言語から目的言語への照合で不足分を予測
しながら翻訳文を生成するようにしたので、各手段の入
力と出力とが制約を満たしていない場合に、その制約を
満たす要件を予測することができるので、翻訳過程の順
序依存性を解消し、実時間で高速かつ正確に翻訳するこ
とができる。<Effects> As described above, according to the above specific example, a translated sentence is generated while predicting a shortage by collation from the source language to the target language. In the case where と does not satisfy the constraint, it is possible to predict a requirement that satisfies the constraint, so that order dependency of the translation process can be resolved, and translation can be performed quickly and accurately in real time.

【００６０】これを上記の（原文）を例にして説明する
と、以下のようになる。例えば、従来の文献に示されて
いるような機械翻訳装置では、上記（原文）のような文
に対しては、文末の「言われている」までの一つの単位
の翻訳しなければ、正確に翻訳するという第１の要件を
満足することができない。ところが、このような処理を
行った場合は、高速に翻訳するという第２の要件が満足
できなくなってしまう。一方、高速に翻訳するために、
上記の（原文）を「発信できない」で一旦区切ると、
「質の高い情報を発信できない」と「企業は成長しない
と言われている」の２文を別々に翻訳してしまうことも
起こり得る。例えば、「high-quality information can
not be transmitted.」と「it is said that company c
annot growup.」の訳文を得ることになり、本具体例に
おける上記の（訳文）のような正確な翻訳文を得ること
ができない。This will be described below by taking the above (original text) as an example. For example, in a machine translation apparatus as shown in a conventional document, a sentence such as the above (original sentence) cannot be correctly translated unless one unit is translated up to the end of the sentence. Cannot be satisfied with the first requirement of translation into However, when such processing is performed, the second requirement of high-speed translation cannot be satisfied. On the other hand, for fast translation,
Once the above (original text) is separated by "cannot send",
It is possible that two sentences, "cannot transmit high-quality information" and "the company is said to not grow", may be translated separately. For example, "high-quality information can
not be transmitted. '' and `` it is said that company c
annot growup. ", and an accurate translation such as the above (translation) in this specific example cannot be obtained.

【００６１】《利用形態》以上の具体例では、日本語の
文書を英語の文書に高速に翻訳する機械翻訳装置として
説明したが、翻訳する言語は任意の言語であっても構わ
ない。また、入出力媒体は電子化文書であるとして説明
したが、本発明の機械翻訳方法および装置は、入出力媒
体として時間制約が強い音声媒体にも適用可能である。
例えば、このような装置としては音声翻訳装置や翻訳電
話等に適用することができる。<Usage Form> In the above specific example, the machine translation apparatus for translating a Japanese document into an English document at high speed has been described. However, the language to be translated may be any language. Although the input / output medium has been described as an electronic document, the machine translation method and apparatus of the present invention can be applied to an audio medium having a strong time constraint as the input / output medium.
For example, such a device can be applied to a voice translation device, a translation telephone, and the like.

[Brief description of the drawings]

【図１】本発明の機械翻訳方法の具体例を示すフローチ
ャートである。FIG. 1 is a flowchart showing a specific example of the machine translation method of the present invention.

【図２】本発明の機械翻訳装置の具体例を示す構成図で
ある。FIG. 2 is a configuration diagram showing a specific example of the machine translation device of the present invention.

【図３】本発明の機械翻訳装置における用例辞書の構成
説明図である。FIG. 3 is an explanatory diagram of a configuration of an example dictionary in the machine translation device of the present invention.

【図４】本発明の機械翻訳装置におけるタイミングゲー
トの動作を示すフローチャートである。FIG. 4 is a flowchart showing the operation of a timing gate in the machine translation device of the present invention.

【図５】本発明の機械翻訳装置における照合手段の動作
を示すフローチャートである。FIG. 5 is a flowchart showing the operation of the matching means in the machine translation device of the present invention.

【図６】本発明の機械翻訳装置における予測変換手段の
動作を示すフローチャートである。FIG. 6 is a flowchart showing the operation of the predictive conversion means in the machine translation device of the present invention.

[Explanation of symbols]

１入力文書２タイミングゲート３入力バッファ４照合手段５予測変換手段６用例辞書７出力文書 DESCRIPTION OF SYMBOLS 1 Input document 2 Timing gate 3 Input buffer 4 Matching means 5 Predictive conversion means 6 Example dictionary 7 Output document

Claims

[Claims]

1. A step of inputting source language data in a predetermined range from an input document according to time series, and using an example dictionary for storing a set of a source language pattern and a target language pattern, Searching for a set of source language patterns including the information of the target language pattern, combining one or more target language patterns of the searched set to synthesize a translated sentence, and when there is insufficient information when performing the synthesis. Translating the input document by repeatedly executing a step of adding a set of a source language pattern and a target language pattern corresponding to the information to the example dictionary as a new set.

2. A timing gate for inputting source language data in a fixed range from an input document according to a time series, an example dictionary for storing a plurality of records each consisting of a source language pattern and a target language pattern, and an input from the timing gate. Data of a certain range of the source language is compared with the source language pattern of the example dictionary,
A matching unit for searching for a record of a source language pattern including information matching the data in the certain range; and combining one or more target language patterns of the record searched by the matching unit to synthesize a translated sentence, and If there is insufficient information at the time of performing, the predictive conversion means for adding a set of a source language pattern and a target language pattern corresponding to this information to the example dictionary as a new record. Translator.