JPH06295309A

JPH06295309A - Machine translation system, language analyzing device and digital copying device

Info

Publication number: JPH06295309A
Application number: JP5105137A
Authority: JP
Inventors: Masumi Narita; 真澄成田; Masayuki Kameda; 雅之亀田; Takashi Katooka; 隆加登岡; Yoshihisa Oguro; 慶久大黒; Ayako Oono; 亜矢子大野
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1993-04-06
Filing date: 1993-04-06
Publication date: 1994-10-21

Abstract

PURPOSE:To enable a user to easily grasp the structure of an original sentence and to quickly understand the sentence. CONSTITUTION:An input part 1 inputs a natural language sentence and a an original sentence inputted from the input part 1 so as to be translated is stored in an original sentence storing part 2. A translation dictionary part 6 stores knowledge information to be used for translation processing. A translation part 5 executes the translation processing of the inputted original sentence by using the contents of the dictionary part 6. A translated sentence as a result of the translation processing of the translation part 5 is stored in a translated sentence storing part 3. A display part 7 displays the original sentence and the translated sentence. An edition control part 4 executes the edition processing the original sentence or the translated sentence displayed on the display part 7. The translation part 5 recognizes the style of the sentence pattern of the original sentence and outputs the recognized sentence pattern information.

Description

Detailed Description of the Invention

【０００１】[0001]

【技術分野】本発明は、機械翻訳装置及び言語解析装置
並びにディジタル複写装置に関し、より詳細には、ユー
ザが原文の構造を容易に把握することができ、文理解を
迅速に行なえるようにした機械翻訳装置及び言語解析装
置並びにディジタル複写装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a machine translation device, a language analysis device, and a digital copying device. More specifically, the user can easily understand the structure of an original sentence and can quickly understand the sentence. The present invention relates to a machine translation device, a language analysis device, and a digital copying device.

【０００２】[0002]

【従来技術】機械翻訳装置は、第一言語文（原文）を辞
書を用いて形態素に分割して品詞等の情報を得る形態素
解析部と、該形態素解析部によって分割された単語列の
構文を辞書および文法規則を用いて解析する構文解析部
と、該構文解析部で得られた構文の構造を第二言語にお
ける構文構造に変換する変換部と、該変換部によって得
られた第二言語の構文構造に従って翻訳文を生成する訳
文生成部から構成されている。さらに、前記構文解析部
では第一言語文中の主動詞の辞書情報を参照して第一言
語文の文型候補を複数抽出して解析を行ない、その中か
ら最適な文型を選択するという手法を採用していること
が多い。しかし、このようにして選択された文型情報は
翻訳の内部処理においてのみ使用されるだけで、第一言
語文（原文）や第二言語文（訳文）に該文型情報を表示
してユーザの文理解を助けるといった試みはなされてい
ない。但し、現在市販されている翻訳ソフトの中には、
主語に当たる部分と述語に当たる部分のみをマーキング
するものがあるが、これ以外の情報は表示されないので
入力文の文型情報を完全に提供するものではない。2. Description of the Related Art A machine translation device divides a first language sentence (original sentence) into morphemes using a dictionary to obtain information such as a part of speech and a morpheme analysis unit, and a syntax of a word string divided by the morpheme analysis unit. A syntactic analysis unit that analyzes using a dictionary and a grammar rule, a conversion unit that converts the syntactic structure obtained by the syntactic analysis unit into a syntactic structure in the second language, and a second language obtained by the conversion unit. It is composed of a translation generation unit that generates a translation according to the syntactic structure. Further, the syntax analysis unit employs a method of referring to the dictionary information of the main verb in the first language sentence, extracting and analyzing a plurality of sentence pattern candidates of the first language sentence, and selecting an optimal sentence pattern from them. I often do However, the sentence pattern information selected in this way is only used in the internal processing of translation, and the sentence pattern information is displayed in the first language sentence (original sentence) or the second language sentence (translated sentence) to display the sentence of the user. No attempt has been made to help understanding. However, some of the translation software currently on the market
There are some marking only the part corresponding to the subject and the part corresponding to the predicate, but since no other information is displayed, the sentence pattern information of the input sentence is not completely provided.

【０００３】また、入力文の文型認識に関する従来技術
を記載した公知文献としては、例えば、特開平１−１８
９７６７号公報、あるいは特開平３−２２２０６６号公
報がある。特開平１−１８９７６７号公報に提案されて
いる「文並列認識方式」は、候補の素性とスロット条件
との比較回数を減らし、効率的に文型を認識できるよう
にするために、文型を構成する文法格を用言との位置関
係や表層文法格により整理した有限個のポジションにお
けるスロット条件として表現し、候補に対し一つのポジ
ションのみを複数の文型に渡って並列にチェックするも
のである。すなわち、構文解析部において複数の文型候
補の中から最適な文型を効率的に決定する方法に関する
もので、選択された文型に関する情報を入力文や訳文に
表示するというものではない。Further, as a publicly known document describing a conventional technique relating to sentence pattern recognition of an input sentence, there is, for example, Japanese Patent Laid-Open No. 1-18.
There are 9767 and Japanese Patent Laid-Open No. 3-220206. The "sentence parallel recognition method" proposed in Japanese Unexamined Patent Publication No. 1-189767 configures a sentence pattern in order to reduce the number of comparisons between candidate features and slot conditions and to efficiently recognize the sentence pattern. The grammatical case is expressed as a slot condition in a finite number of positions arranged by the positional relationship with the grammar and the surface grammatical case, and only one position is checked in parallel for a candidate over multiple sentence patterns. That is, the present invention relates to a method for efficiently determining an optimum sentence pattern from a plurality of sentence pattern candidates in the syntactic analysis unit, and does not display information on the selected sentence pattern in an input sentence or a translated sentence.

【０００４】また、特開平３−２２２０６６号公報に提
案されている「機械翻訳装置」は、入力文の文型を指定
することによって、正しい翻訳結果を迅速に得るように
するために、予め入力文中に単語の文型を指定する記号
を挿入して、翻訳実行時に該記号を検出して、その指定
された文型に基づいて構文解析を行なうものである。す
なわち、翻訳処理に先だち、入力文に文型情報を指定し
てやることにより、正しい翻訳結果を迅速に得ることを
目的とするものであり、やはり文型情報の表示に関わる
ものではない。また、言語処理部を設けたディジタル複
写装置に関しては、例えば特開平４−７７９６５号公報
において、認識した文字から単語を抽出し、抽出された
単語ごとにその訳語を検索し、得られた訳語情報を付加
して出力する複写装置が提案されているが、入力画像を
文単位に認識して解析や翻訳を行なった結果や、文の文
型情報までも出力するものは提案されていない。Further, the "machine translation device" proposed in Japanese Patent Laid-Open No. 3-222066 pre-registers an input sentence in advance in order to promptly obtain a correct translation result by designating the sentence pattern of the input sentence. A symbol for designating the sentence pattern of a word is inserted in, the symbol is detected at the time of executing translation, and the syntactic analysis is performed based on the designated sentence pattern. That is, the purpose is to promptly obtain a correct translation result by designating the sentence pattern information in the input sentence prior to the translation process, and is not related to the display of the sentence pattern information. Regarding a digital copying apparatus provided with a language processing unit, for example, in Japanese Unexamined Patent Publication No. 4-77965, words are extracted from the recognized characters, the translated words are searched for each extracted word, and the obtained translated word information is obtained. Although a copying apparatus that adds and outputs a sentence has been proposed, there is no proposal that outputs a result obtained by recognizing an input image on a sentence-by-sentence basis and performing analysis and translation, and sentence pattern information of a sentence.

【０００５】[0005]

【目的】本発明は、上述のごとき実情に鑑みてなされた
もので、ユーザが原文の構造を容易に把握することがで
き、文理解が迅速に行なえるようにした機械翻訳装置及
び言語解析装置並びにディジタル複写装置を提供するこ
とを目的としてなされたものである。[Object] The present invention has been made in view of the above-mentioned circumstances, and a machine translation device and a language analysis device in which a user can easily grasp the structure of an original sentence and quickly understand the sentence. The present invention is also intended to provide a digital copying machine.

【０００６】[0006]

【構成】本発明は、上記目的を達成するために、（１）
自然言語文を入力するための入力部と、該入力部から入
力された翻訳対象となる原文を記憶する原文記憶部と、
翻訳処理に使用する知識情報を収容した翻訳辞書部と、
該翻訳辞書部の内容を用いて前記入力された原文の翻訳
処理を実行する翻訳部と、該翻訳部による翻訳処理の結
果得られた訳文を記憶する訳文記憶部と、前記原文およ
び訳文を表示する表示部と、前記各構成部を制御すると
ともに、前記表示部で表示された原文または訳文の編集
処理を行なう編集制御部とから成り、前記翻訳部は原文
の文型を認定し、認定した文型情報を出力すること、更
には、（２）前記翻訳部は、認定した文型情報を原文側
か訳文側のどちらかに、あるいは両方に出力すること、
更には、（３）前記翻訳部は、認定した文型情報を文型
構成要素別に構成要素としてのまとまりがわかるように
出力すること、更には、（４）前記（３）において前記
認定した文型の文型構成要素別に原文および訳文を分割
して出力すること、或いは、（５）自然言語文を入力す
るための入力部と、該入力部から入力された文を解析す
るのに必要な知識情報を収容した解析辞書部と、該解析
辞書部の内容を用いて前記入力された自然言語文の解析
処理を実行する解析部と、該解析部による解析処理の結
果を表示する表示部と、前記各構成部を制御するととも
に、前記表示部で表示された内容の編集処理を行なう編
集制御部とから成り、前記解析部は入力文の文型を認定
し、認定した文型情報を出力すること、或いは、（６）
原稿画像の画像情報を読み取る画像読取部と、該画像読
取部で読み取った情報を記憶する読取情報記憶部と、該
読取情報記憶部に格納された画像情報を出力する出力部
を有するディジタル複写装置において、格納された画像
情報から文字を認識する文字認識部と、認識された文字
情報を請求項１記載の機械翻訳装置あるいは請求項５記
載の言語解析装置によって処理する言語処理部と、前記
各構成部を制御する制御部とを有し、前記出力部は言語
処理部の結果を付加して出力することを特徴としたもの
である。すなわち、翻訳対象となる原文の文型を認定
し、認定した文型情報を出力するようにしたことを特徴
とするものである。以下、本発明の実施例に基づいて説
明する。In order to achieve the above object, the present invention provides (1)
An input unit for inputting a natural language sentence, an original sentence storage unit for storing an original sentence to be translated input from the input unit,
A translation dictionary unit containing knowledge information used for translation processing,
A translation unit that executes a translation process of the input original sentence using the contents of the translation dictionary unit, a translation sentence storage unit that stores a translated sentence obtained as a result of the translation process by the translation unit, and the original sentence and the translated sentence are displayed. And a control unit for controlling each of the constituent units and for editing the original sentence or the translated sentence displayed on the display unit, the translation unit certifying the sentence pattern of the original sentence, and the certified sentence pattern. Outputting information, and further, (2) the translation unit outputs the recognized sentence pattern information to either the original sentence side or the translated sentence side, or both.
Further, (3) the translating unit outputs the recognized sentence pattern information so that the unity of each sentence pattern constituent element can be understood, and (4) the sentence pattern of the recognized sentence pattern in (3) above. Outputting the original sentence and the translated sentence by dividing them by component, or (5) accommodating an input unit for inputting a natural language sentence and knowledge information necessary for analyzing the sentence input from the input unit The analysis dictionary unit, an analysis unit that executes an analysis process of the input natural language sentence by using the contents of the analysis dictionary unit, a display unit that displays a result of the analysis process by the analysis unit, and each of the configurations. And an editing control section for controlling the section and performing editing processing of the content displayed on the display section, wherein the analyzing section certifies the sentence pattern of the input sentence and outputs the certified sentence pattern information, or ( 6)
Digital copying apparatus having an image reading unit for reading image information of a document image, a read information storage unit for storing information read by the image reading unit, and an output unit for outputting the image information stored in the read information storage unit A character recognition unit that recognizes a character from stored image information; a language processing unit that processes the recognized character information by the machine translation device according to claim 1 or the language analysis device according to claim 5; It has a control part which controls a constituent part, and the above-mentioned output part adds the result of a language processing part, and outputs it. That is, the feature is that the sentence pattern of the original sentence to be translated is certified and the certified sentence pattern information is output. Hereinafter, description will be given based on examples of the present invention.

【０００７】図１は、本発明による機械翻訳装置の一実
施例を説明するための構成図で、図中、１は入力部、２
は原文記憶部、３は訳文記憶部、４は編集制御部、５は
翻訳部、６は翻訳辞書部、６ａは文法規則、６ｂは単語
辞書、７は表示部である。入力部１は自然言語文を入力
するためのもので、該入力部１により入力された翻訳対
象とする原文を原文記憶部２に記憶する。翻訳辞書部６
には翻訳処理に使用する知識情報を収容してある。該翻
訳辞書部６の内容を用いて前記入力された原文の翻訳処
理を翻訳部５で実行する。該翻訳部５で翻訳処理された
結果の訳文は訳文記憶部３に記憶される。表示部７は前
記原文及び訳文が表示される。前記各々の構成部を制御
するとともに、前記表示部で表示された原文または訳文
の編集処理は編集制御部４で行なわれる。前記翻訳部５
は、原文の文型を認定し、認定した文型情報を出力す
る。FIG. 1 is a block diagram for explaining one embodiment of a machine translation apparatus according to the present invention, in which 1 is an input unit and 2 is
Is an original sentence storage unit, 3 is a translated sentence storage unit, 4 is an edit control unit, 5 is a translation unit, 6 is a translation dictionary unit, 6a is a grammatical rule, 6b is a word dictionary, and 7 is a display unit. The input unit 1 is for inputting a natural language sentence, and stores an original sentence to be translated input by the input unit 1 in the original sentence storage unit 2. Translation dictionary section 6
Contains the knowledge information used for the translation process. The translation unit 5 uses the contents of the translation dictionary unit 6 to translate the input original sentence. The translated text resulting from the translation processing by the translation unit 5 is stored in the translated text storage unit 3. The display unit 7 displays the original sentence and the translated sentence. The edit control unit 4 controls the respective constituent units and edits the original sentence or the translated sentence displayed on the display unit. The translation unit 5
Certifies the sentence pattern of the original sentence and outputs the certified sentence pattern information.

【０００８】図２は、図１における翻訳部動作処理を説
明するためのフローチャートである。以下、各ステップ
に従って順に説明する。まず、原文が入力されると（St
ep１１）、翻訳部では原文の形態素解析を行なう（Step
１２）。形態素解析では、単語辞書６ｂを用いて原文か
ら形態素を抽出し、抽出された形態素がどのように結合
して語を形成しているかを解析する。この解析結果は、
次の構文解析処理（Step１３）に渡される。構文解析で
は、前記形態素解析によって得られた品詞情報などを基
に、文法規則６ａ中の構文解析規則を用いて木構造を生
成する。このとき、入力文中の用言をキーとして、該用
言の取り得るすべての文型候補の中から最適な文型を優
先解とする木構造が生成される。用言が取り得る文型情
報は、図３に示すように単語辞書６ｂに記述されてい
る。構文解析処理によって得られた入力文の文型情報
は、図４に示されるように、入力文番号とともにバッフ
ァＡに格納される（Step１４）。続いて、構文解析結果
は変換・生成処理（Step１５）に渡される。変換・生成
処理では、文法規則６ａ中の変換・生成規則を用いて訳
文を生成し、その結果とバッファＡに格納されている文
型情報を出力する（Step１６）。FIG. 2 is a flow chart for explaining the translation unit operation processing in FIG. Hereinafter, each step will be described in order. First, when the original text is entered (St
ep11), the translation unit performs morphological analysis of the original sentence (Step
12). In the morpheme analysis, morphemes are extracted from the original sentence using the word dictionary 6b, and how the extracted morphemes are combined to form a word is analyzed. The result of this analysis is
It is passed to the next syntax analysis process (Step 13). In the syntactic analysis, the tree structure is generated using the syntactic analysis rule in the grammatical rule 6a based on the part-of-speech information obtained by the morphological analysis. At this time, using the adjective in the input sentence as a key, a tree structure in which an optimal sentence pattern is set as a priority solution from all possible sentence pattern candidates of the adjective is generated. Sentence pattern information that can be taken by the idiom is described in the word dictionary 6b as shown in FIG. The sentence pattern information of the input sentence obtained by the syntactic analysis process is stored in the buffer A together with the input sentence number as shown in FIG. 4 (Step 14). Then, the syntax analysis result is passed to the conversion / generation process (Step 15). In the conversion / generation processing, a translated sentence is generated using the conversion / generation rules in the grammar rule 6a, and the result and the sentence pattern information stored in the buffer A are output (Step 16).

【０００９】今、”It helps you decide whether to i
nstall your system as a stand-alone, server, or da
taless workstation．”という英文を日本語に翻訳する
場合を例にとり、本発明の実施例における翻訳処理につ
いて説明する。この入力文は、 Step１２および Step１
３の処理を経て、図５に示されるような木構造が生成さ
れる。この場合、入力文中に含まれる用言の中で最上位
にある”help”に注目し、図３に示す”help”の辞書情
報を基にして最適な文型が選択される。ここで、本発明
の実施例では、以下の表１に示す文型番号が用言の辞書
情報として記述されているものとする。Now, "It helps you decide whether to i
nstall your system as a stand-alone, server, or da
taless workstation. The translation process in the embodiment of the present invention will be described by taking an example of translating an English sentence "" into Japanese. The input sentence is Step 12 and Step 1.
A tree structure as shown in FIG. 5 is generated through the process of 3. In this case, paying attention to "help" at the highest level among the phrases included in the input sentence, the optimum sentence pattern is selected based on the dictionary information of "help" shown in FIG. Here, in the embodiment of the present invention, it is assumed that the sentence pattern numbers shown in Table 1 below are described as dictionary information of the vocabulary.

【００１０】[0010]

【表１】 [Table 1]

【００１１】上記表１の文型ＳＰ１は、主語（Ｓ）と述
語（Ｖ）からなる文型を示し、文型ＳＰ２は、主語
（Ｓ）と述語（Ｖ）と補語（Ｃ）からなる文型を示す。
文型ＳＰ３は、主語（Ｓ）と述語（Ｖ）と目的語（Ｏ）
からなる文型を示し、文型ＳＰ４は主語（Ｓ）と述語
（Ｖ）と第１目的語（Ｏ）と第２目的語（Ｏ）からなる
文型を示す。文型ＳＰ５は、主語（Ｓ）と述語（Ｖ）と
目的語（Ｏ）と補語（Ｃ）からなる文型を示す。図３か
らわかるように、動詞”help”が取り得る文型として
は、ＳＰ１（第１文型）、ＳＰ３（第３文型）、ＳＰ５
（第５文型）等がある。これらの複数の文型候補の中か
ら最適な文型を選択する方法としては、辞書に記述する
文型候補に重みづけを行なった上でスコアリングする方
法や、候補をひとつずつチェックし、当該の文型を構成
する要素が出現しない等により解析失敗が検出された段
階でバックトラックして別の候補をチェックするという
方法をとったり、前述した特開平１−１８９７６７号公
報に記載されているように複数の文型候補を並列にチェ
ックして選択するなど種々の方法が可能である。The sentence pattern SP1 in Table 1 above shows a sentence pattern consisting of the subject (S) and the predicate (V), and the sentence pattern SP2 shows a sentence pattern consisting of the subject (S), the predicate (V) and the complement (C).
Sentence pattern SP3 has subject (S), predicate (V) and object (O)
The sentence pattern SP4 represents a sentence pattern composed of the subject (S), the predicate (V), the first object (O), and the second object (O). The sentence pattern SP5 indicates a sentence pattern including a subject (S), a predicate (V), an object (O), and a complement (C). As can be seen from FIG. 3, the sentence patterns that the verb "help" can take are SP1 (first sentence pattern), SP3 (third sentence pattern), and SP5.
(5th sentence pattern) and so on. As a method of selecting an optimal sentence pattern from these plural sentence pattern candidates, the sentence pattern candidates described in the dictionary are weighted and then scored, or the candidates are checked one by one and the relevant sentence pattern is selected. A method of backtracking to check another candidate at the stage when an analysis failure is detected due to no appearance of constituent elements, or a plurality of sentence patterns as described in JP-A-1-189767 mentioned above. Various methods are possible, such as checking and selecting candidates in parallel.

【００１２】ここでは、いずれかの方法により、図５に
示されるような解析結果が得られたとする。この場合、
入力文の文型はＳＰ５（第５文型）と決定され、この文
型情報はバッファＡに格納される。このあと、変換・生
成処理（Step１５）が実行され、その結果である訳文と
バッファＡに格納されている文型情報が、図６に示され
るように画面に出力される（Step１６）。図６において
は、入力文の文型情報が、文型構成要素を示す記号列と
して原文の末尾に括弧つきで表示されている。この文型
情報の表示は、図６のように原文側だけにとどまらず、
訳文側にも行なえるように設定することが可能である。Here, it is assumed that the analysis result as shown in FIG. 5 is obtained by either method. in this case,
The sentence pattern of the input sentence is determined to be SP5 (fifth sentence pattern), and this sentence pattern information is stored in the buffer A. After that, the conversion / generation process (Step 15) is executed, and the resulting translated sentence and the sentence pattern information stored in the buffer A are output to the screen as shown in FIG. 6 (Step 16). In FIG. 6, the sentence pattern information of the input sentence is displayed in parentheses at the end of the original sentence as a symbol string indicating a sentence pattern constituent element. The display of this sentence pattern information is not limited to the original sentence side as shown in FIG.
It can be set so that it can be performed on the translation side as well.

【００１３】次に、認定した文型情報を文型構成要素別
に構成要素としてのまとまりがわかるように出力する場
合の実施例を示す。前記入力文”It helps you decide
whether to install your system as a standalone, se
rver, or dataless workstation．”を例にとり説明す
る。図７は、翻訳部の他の動作処理を説明するためのフ
ローチャートである。以下、各ステップに従って順に説
明する。入力文の形態素解析処理（Step２２）および構
文解析処理（Step２３）に関しては、図２で説明したの
と同様に実行され、その結果、図５に示す木構造が生成
される。このとき、抽出された文型情報は、図８に示す
ように文型構成要素と個々の構成要素の開始位置情報と
してバッファＢに格納される（Step２４）。このあと、
変換・生成処理（Step２５）が実行され、訳文とバッフ
ァＢに格納されている情報が図９に示されるように画面
に出力される。図９においては、文型構成要素Ｓ，Ｖ，
Ｏ，Ｃは、各々の構成要素を形成している語群の開始文
字の直下に表示されているが、各構成要素を形成してい
る語群の開始文字と最終文字の位置情報を記憶しておい
て中央値を計算することによって語群の中央に表示させ
ることも可能である。また、図１０に示すように、いず
れの場合にも、各構成要素としてのまとまりが視覚的に
容易に認識できるように各構成要素単位に下線を施して
表示させることも可能である。Next, an embodiment will be described in which the certified sentence pattern information is output for each sentence pattern component so that the group as a component can be understood. The input sentence "It helps you decide
whether to install your system as a standalone, se
rver, or dataless workstation. 7 is an example. FIG. 7 is a flowchart for explaining another operation process of the translation unit. Hereinafter, description will be made in order according to each step. Morphological analysis process (Step 22) of input sentence and syntactic analysis process ( Step 23) is executed in the same manner as described in Fig. 2, and as a result, the tree structure shown in Fig. 5 is generated.At this time, the extracted sentence pattern information has sentence pattern components as shown in Fig. 8. And the start position information of each component are stored in the buffer B (Step 24).
The conversion / generation process (Step 25) is executed, and the translated text and the information stored in the buffer B are output to the screen as shown in FIG. In FIG. 9, sentence pattern components S, V,
O and C are displayed immediately below the start character of the word group forming each constituent element, but store the position information of the start character and the last character of the word group forming each constituent element. It is also possible to display in the center of the word group by calculating the median value in advance. Further, as shown in FIG. 10, in any case, it is possible to display each component unit by underlining it so that a group as each component can be easily visually recognized.

【００１４】次に、認定した文型の文型構成要素別に原
文および訳文を分割して出力する場合の実施例を示す。
前記入力文”It helps you decide whether to install
your system as a standalone,server,or dataless wo
rkstation．”を例にとり、説明する。図１１は、翻訳
部の更に他の動作処理を説明するためのフローチャート
である。以下、各ステップに従って順に説明する。入力
文の形態素解析処理（Step３２）および構文解析処理
（Step３３）に関しては図２で説明したのと同様に実行
され、その結果、図５に示す木構造が生成される。この
とき、抽出された文型情報は、図１２に示すように文型
構成要素と当該の構成要素を形成している単語列がバッ
ファＣに格納される（Step３４）。このあと、変換・生
成処理（Step３５）が実行され、バッファＣに格納され
た情報が、図１３に示すように、その訳出結果とともに
対応表示の形で画面に出力される。Next, an embodiment in which an original sentence and a translated sentence are divided and output according to the sentence pattern constituents of the recognized sentence pattern will be described.
The input sentence "It helps you decide whether to install
your system as a standalone, server, or dataless wo
rkstation. 11 is a flow chart for explaining still another operation process of the translation unit. Hereinafter, description will be made in order according to each step. Morphological analysis process of input sentence (Step 32) and syntax analysis The processing (Step 33) is executed in the same manner as described with reference to Fig. 2, and as a result, the tree structure shown in Fig. 5 is generated.At this time, the extracted sentence pattern information has a sentence pattern structure as shown in Fig. 12. The element and the word string forming the corresponding constituent element are stored in the buffer C (Step 34), after which the conversion / generation processing (Step 35) is executed, and the information stored in the buffer C is shown in FIG. As shown, the translation result is output to the screen in the form of a correspondence display.

【００１５】図１４は、本発明による言語解析装置の一
実施例を説明するための構成図で、図中、１１は入力
部、１２は編集制御部、１３は解析部、１４は解析辞書
部、１４ａは文法規則、１４ｂは単語辞書、１５は表示
部である。入力部１１は自然言語文を入力するためのも
ので、該入力部１１から入力された文を解析するのに必
要な知識情報を解析辞書部１４に収容する。該解析辞書
部１４の内容を用いて、前記入力された自然言語文の解
析処理を解析部１３で実行する。該解析部１３で解析処
理された結果は表示部１５で表示される。編集制御部１
２は、前記各構成部を制御するとともに、前記表示部１
５で表示された内容の編集処理を行う。前記解析部１３
は、入力文の文型を認定し、認定した文型情報を出力す
る。また、解析部１３は図２に示すフローチャートのSt
ep１４までの処理を実行する。従って図１５に示すよう
に、入力文に対する文型情報のみが画面に表示される。FIG. 14 is a block diagram for explaining an embodiment of the language analysis apparatus according to the present invention. In the figure, 11 is an input unit, 12 is an edit control unit, 13 is an analysis unit, and 14 is an analysis dictionary unit. , 14a is a grammar rule, 14b is a word dictionary, and 15 is a display unit. The input unit 11 is for inputting a natural language sentence, and the knowledge information necessary for analyzing the sentence input from the input unit 11 is stored in the analysis dictionary unit 14. Using the contents of the analysis dictionary unit 14, the analysis unit 13 executes the analysis process of the input natural language sentence. The result of the analysis processing by the analysis unit 13 is displayed on the display unit 15. Editing control unit 1
2 controls each of the above-mentioned components, and at the same time, the display unit 1
The content displayed in 5 is edited. The analysis unit 13
Validates the sentence pattern of the input sentence and outputs the certified sentence pattern information. In addition, the analysis unit 13 uses the St
The processing up to ep14 is executed. Therefore, as shown in FIG. 15, only the sentence pattern information for the input sentence is displayed on the screen.

【００１６】図１６は、本発明によるディジタル複写装
置の一実施例を説明するための構成図で、図中、２１は
画像読取部、２２は読取情報記憶部、２３は文字認識
部、２４は言語処理部、２５は制御部、２６は出力部で
ある。画像読取部２１は、原稿画像の画像情報を読み取
るためのもので、該画像読取部２１で読み取った情報を
読取情報記憶部２２に記憶する。出力部２６は、前記読
取情報記憶部２２に格納された画像情報を出力する。格
納された画像情報から文字を文字認識部２３で認識す
る。言語処理部２４は、該文字認識部２３で認識された
文字情報を、図１に示した機械翻訳装置（請求項１）あ
るいは図１４に示した言語解析装置（請求項５）によっ
て処理する。２４の言語処理部として、請求項１に記載
した機械翻訳装置あるいは請求項５に記載した言語解析
装置のいずれを使用することも可能である。ディジタル
複写装置にこれらの言語処理部を付加したことにより、
入力原稿に文型情報が付加されて出力されるようにな
る。前記各構成部は制御部２５により制御する。前記出
力部２６は言語処理部２４の結果を付加して出力する。FIG. 16 is a block diagram for explaining one embodiment of the digital copying apparatus according to the present invention. In the figure, 21 is an image reading unit, 22 is a read information storage unit, 23 is a character recognition unit, and 24 is a character recognition unit. A language processing unit, 25 is a control unit, and 26 is an output unit. The image reading unit 21 is for reading the image information of the document image, and stores the information read by the image reading unit 21 in the read information storage unit 22. The output unit 26 outputs the image information stored in the read information storage unit 22. The character recognition unit 23 recognizes a character from the stored image information. The language processing unit 24 processes the character information recognized by the character recognition unit 23 by the machine translation device shown in FIG. 1 (claim 1) or the language analysis device shown in FIG. 14 (claim 5). As the language processing unit 24, either the machine translation device described in claim 1 or the language analysis device described in claim 5 can be used. By adding these language processing units to the digital copying machine,
Sentence pattern information is added to the input document and output. The respective components are controlled by the controller 25. The output unit 26 adds and outputs the result of the language processing unit 24.

【００１７】[0017]

【効果】以上の説明から明らかなように、本発明による
と、以下のような効果がある。（１）入力文の文型を認定してその文型情報が画面に表
示されたり、あるいは用紙に印字されるので、原文の構
造把握が容易になり、それだけ文理解が促進されるとい
う効果が得られる。（２）文型構成要素をそのまとまりごとに表示すること
も可能としたので、文型を構成する要素とそれに対応す
る語群が容易に把握でき、文理解を大幅に促進するとい
う効果が得られる。（３）文型構成要素単位に原文を分割し、訳文もこれに
対応させる形で出力できるようにしたので、構成要素単
位ごとに原文の意味を咀嚼できるという効果も得られ
る。（４）請求項６に記載のディジタル複写装置によると、
入力原稿に入力文の文型情報が翻訳結果とともに付加さ
れて出力されるので、文字入力のためにキーボードを操
作する必要がなく、翻訳作業や言語解析作業が普通に複
写機を使うのと同様に、翻訳や解析の対象となる原稿を
イメージスキャナにセットしてコピーボタンを押すとい
う極めて簡単な作業になる。As is apparent from the above description, the present invention has the following effects. (1) Since the sentence pattern of the input sentence is recognized and the sentence pattern information is displayed on the screen or printed on the paper, the structure of the original sentence can be easily grasped, and the sentence understanding can be promoted. . (2) Since it is also possible to display sentence pattern constituents for each group, it is possible to easily grasp the elements that form a sentence pattern and the corresponding word groups, and to obtain an effect of greatly promoting sentence comprehension. (3) Since the original sentence is divided into sentence pattern component units and the translated sentence can be output in a form corresponding to this, the effect of being able to masticate the meaning of the original sentence for each component unit is also obtained. (4) According to the digital copying apparatus of claim 6,
Sentence pattern information of the input sentence is added to the input manuscript together with the translation result and output, so there is no need to operate the keyboard to input characters, and translation and language analysis work is the same as using a copier normally. , It is an extremely simple work of setting the original document to be translated or analyzed in the image scanner and pressing the copy button.

[Brief description of drawings]

【図１】本発明による機械翻訳装置の一実施例を説明
するための構成図である。FIG. 1 is a configuration diagram for explaining an embodiment of a machine translation device according to the present invention.

【図２】図１における翻訳部の動作処理を説明するた
めのフローチャートである。FIG. 2 is a flowchart for explaining an operation process of a translation unit in FIG.

【図３】図１における単語辞書の記述内容を示す図で
ある。FIG. 3 is a diagram showing description contents of a word dictionary in FIG.

【図４】本発明における入力文の文型情報を示す図で
ある。FIG. 4 is a diagram showing sentence pattern information of an input sentence according to the present invention.

【図５】本発明における入力文の木構造を示す図であ
る。FIG. 5 is a diagram showing a tree structure of an input sentence according to the present invention.

【図６】本発明における原文と訳文との出力画面を示
す図である。FIG. 6 is a diagram showing an output screen of an original sentence and a translated sentence according to the present invention.

【図７】図１における翻訳部の他の動作処理を説明す
るためのフローチャートである。FIG. 7 is a flowchart for explaining another operation process of the translation unit in FIG.

【図８】本発明における抽出された文型情報を示す図
である。FIG. 8 is a diagram showing extracted sentence pattern information according to the present invention.

【図９】本発明における原文と訳文の他の出力画面を
示す図である。FIG. 9 is a diagram showing another output screen for an original sentence and a translated sentence according to the present invention.

【図１０】本発明における原文と訳文の更に他の出力
画面を示す図である。FIG. 10 is a diagram showing still another output screen of the original sentence and the translated sentence in the present invention.

【図１１】図１における翻訳部の更に他の動作処理を
説明するためのフローチャートである。FIG. 11 is a flowchart for explaining still another operation process of the translation unit in FIG.

【図１２】本発明における抽出された文型情報を示す
図である。FIG. 12 is a diagram showing extracted sentence pattern information according to the present invention.

【図１３】本発明における原文と訳文の更に他の出力
画面を示す図である。FIG. 13 is a diagram showing still another output screen of the original sentence and the translated sentence in the present invention.

【図１４】本発明による言語解析装置の一実施例を説
明するための構成図である。FIG. 14 is a configuration diagram for explaining an embodiment of a language analysis device according to the present invention.

【図１５】本発明による言語解析装置における入力文
に対する文型情報のみを表示する出力画面を示す図であ
る。FIG. 15 is a diagram showing an output screen for displaying only sentence pattern information for an input sentence in the language analysis device according to the present invention.

【図１６】本発明によるディジタル複写装置の一実施
例を説明するための構成図である。FIG. 16 is a configuration diagram for explaining an embodiment of a digital copying apparatus according to the present invention.

[Explanation of symbols]

１…入力部、２…原文記憶部、３…訳文記憶部、４…編
集制御部、５…翻訳部、６…翻訳辞書部、６ａ…文法規
則、６ｂ…単語辞書、７…表示部。DESCRIPTION OF SYMBOLS 1 ... Input part, 2 ... Original text storage part, 3 ... Translation storage part, 4 ... Editing control part, 5 ... Translation part, 6 ... Translation dictionary part, 6a ... Grammar rule, 6b ... Word dictionary, 7 ... Display part.

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁵ 識別記号庁内整理番号ＦＩ技術表示箇所Ｇ０６Ｋ 9/00 Ｚ 8623−5Ｌ (72)発明者大黒慶久東京都大田区中馬込１丁目３番６号株式会社リコー内 (72)発明者大野亜矢子東京都大田区中馬込１丁目３番６号株式会社リコー内─────────────────────────────────────────────────── ─── Continuation of the front page (51) Int.Cl. ⁵ Identification code Internal reference number FI Technical display location G06K 9/00 Z 8623-5L (72) Inventor Yoshihisa Oguro 1-3-3 Nakamagome, Ota-ku, Tokyo No. 6 In Ricoh Co., Ltd. (72) Inventor Ayako Ono 1-3-3 Nakamagome, Ota-ku, Tokyo In Ricoh Co., Ltd.

Claims

[Claims]

1. An input unit for inputting a natural language sentence,
An original sentence storage unit that stores an original sentence to be translated that is input from the input unit, a translation dictionary unit that stores knowledge information used for translation processing, and a content of the translation dictionary unit that is used to store the input original sentence. A translation unit that executes a translation process, a translation storage unit that stores a translated sentence obtained as a result of the translation process by the translation unit, a display unit that displays the original sentence and the translated sentence, and a control unit that controls each of the constituent units. It consists of an edit control unit that edits the original sentence or translated sentence displayed on the display unit,
The machine translation device, wherein the translation unit certifies the sentence pattern of the original sentence and outputs the certified sentence pattern information.

2. The machine translation device according to claim 1, wherein the translation unit outputs the recognized sentence pattern information to either the original sentence side or the translated sentence side, or both.

3. The machine translation device according to claim 1, wherein the translation unit outputs the recognized sentence pattern information for each sentence pattern component so that a group of the components can be understood.

4. The machine translation device according to claim 3, wherein the original sentence and the translated sentence are divided and output for each sentence pattern component of the recognized sentence pattern.

5. An input unit for inputting a natural language sentence,
An analysis dictionary unit containing knowledge information required to analyze a sentence input from the input unit; and an analysis unit that executes analysis processing of the input natural language sentence using the contents of the analysis dictionary unit. A display unit that displays a result of analysis processing by the analysis unit, and an edit control unit that controls each of the components and edits the content displayed on the display unit. A language analysis device characterized by recognizing the sentence pattern of a sentence and outputting the certified sentence pattern information.

6. An image reading unit that reads image information of a document image, a read information storage unit that stores information read by the image reading unit, and an output unit that outputs the image information stored in the read information storage unit. In a digital copying apparatus having
A character recognition unit that recognizes characters from stored image information,
A language processing unit for processing the recognized character information by the machine translation device according to claim 1 or the language analysis device according to claim 5, and a control unit for controlling each of the components, and the output unit includes a language. A digital copying apparatus characterized by adding and outputting the result of a processing unit.