JPS62267871A

JPS62267871A - Japanese word producing device

Info

Publication number: JPS62267871A
Application number: JP61109631A
Authority: JP
Inventors: Toshihiko Yokogawa; 横川　壽彦; Yasuyuki Numata; 泰之沼田
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1986-05-15
Filing date: 1986-05-15
Publication date: 1987-11-20
Anticipated expiration: 2012-07-02
Also published as: JP2626722B2

Abstract

PURPOSE:To show the noun phrase without losing the naturalness of the Japanese language by discriminating the character types of the translated words of a noun obtained from a dictionary means and deciding the connecting description between nouns in response to said character type in case a foreign language sentence includes a noun phrase containing plural nouns. CONSTITUTION:When a foreign language sentence includes a noun phrase containing plural nouns, the character types of the equivalents of the noun obtained from a dictionary means are discriminated. Based on this discriminated character type, the connecting description between nouns is decided for production of a Japanese sentence. For instance, a translated sentence producing part 26 decides whether each noun forming a noun phrase is equal to a single Chinese word, 2 Chinese words, a KATAKANA (square form of Japanese syllabary) character, a Japanese word or the description of a direct foreign language like English, etc. Then the part 26 produces the connecting description between nouns in terms of a surface layer in accordance with the descriving way of the noun, i.e., the character type of a word.

Description

【発明の詳細な説明】技術分野本発明は［１本語生成装置、とくに［１動翻訳装置に有
用な１１本１塾生成装置に関する。DETAILED DESCRIPTION OF THE INVENTION Technical Field The present invention relates to an 11-word, 1-juku generation device useful for a 1-language translation device, particularly a 1-language translation device.

従」く十表林ｒたとえば英１：ｈなどの外国１；ｈの文からそれに対応
するＩＩ木詰の文を作成する場合、人力された英文の形
ｉＥ１素を解析し、その構文を解析し、その文構造を変
換し、そののち日本語の訳文を生成する。For example, when creating a corresponding II Kizume sentence from a foreign sentence such as English 1: h, the human-generated form iE1 element of the English sentence is analyzed, and its syntax is analyzed. Then, the sentence structure is converted, and a Japanese translation is then generated.

岡知のように日本語文では一般に、複数個の名詞からな
る名詞句は、隣接する２つの名詞の間に助詞「の」を挿
入したり、中点「・」を挿入したり、何も挿入せず複数
の名詞の合成語としたり、はたまた英語などの外国語の
単語をそのまま表記する場合はスペースをはさんで列記
したりするなど、様々な形で表記される。In Japanese sentences, as in Okachi, noun phrases consisting of multiple nouns are generally created by inserting the particle ``no'' between two adjacent nouns, inserting a midpoint ``・'', or inserting nothing at all. It is written in various ways, such as as a compound word of multiple nouns, or when words in a foreign language such as English are written as they are, they are listed with a space between them.

従来方式による英日自動翻訳装置では、このような複数
個の名詞からなる名詞句が対応する日本語に翻訳される
と、単に、複数個の名詞の羅列か、あるいは各名詞間に
すべて助詞の「の」が挿入されるなど１日本語として不
自然な形で表記されることが多かった。With conventional English-Japanese automatic translation devices, when such a noun phrase consisting of multiple nouns is translated into the corresponding Japanese, it is simply a list of multiple nouns or a particle between each noun. It was often written in an unnatural way as a single Japanese word, such as by inserting ``no''.

しかし現実の日本語文はかなり複雑である。たとえば英
語の名詞句″ａ　ｗｏｍａｎ　ｄｉｒｅｃｔｏｒ”の場
合。However, real Japanese sentences are quite complex. For example, in the case of the English noun phrase "a woman director."

説明を簡単にするために、“ｗｏｍａｎ“に対応する日
本語として一般に「女」と「女性」があり、”ｄｉｒｅ
ｃｔｏｒ”に対応する日本語として一般に「ディレクタ
ー」と「編集者」があるとする。この場合−股に、’ａ
　ｗｏｍａｎ　ｄｉｒｅｃｔｏｒ’の訳語として「女性
ディレクター」　「女性編集者」　「女のディレクター
」　「女の編集者」などがあり、それぞれ文中における
用法ないしはニュアンスが微妙に異なることがある。し
たがって従来方式のように、−律に「ｎに、複数個の名
詞を羅列したり、あるいは各名詞間にすべて助詞の「の
」を挿入しただけでは、１１本語として不自然なことが
多い。To simplify the explanation, the Japanese words for "woman" are generally "Onna" and "Josei," and "dire" is the Japanese equivalent of "woman."
Assume that the Japanese words corresponding to "director" generally include "director" and "editor." In this case - in the crotch, 'a
Translations of the word ``woman director'' include ``female director,'' ``female editor,'' ``female director,'' and ``female editor,'' each of which may have slightly different usages or nuances in a sentence. Therefore, as in the conventional method, if you list multiple nouns in 'n' in the -rison, or just insert the particle 'no' between each noun, it will often be unnatural as a 11-hongo. .

目的末完ＩＩはこのような従来技術の欠点を解消し。the purpose Suekan II eliminates these drawbacks of the conventional technology.

名詞句の表記に関してより自然な日本語が生成される１
１木、１ｈ生成装置を提供することを目的とする。More natural Japanese is generated regarding the notation of noun phrases1
The purpose is to provide a 1-tree, 1-h generation device.

構　　成本発明は上記の目的を達成させるため、外国語の単語に
ついてそれに対応する日本語の訳語に関する辞どデータ
を格納した辞書手段と、形通素および構文が解析された
外国語の文について辞書手段を索引して対応する日本語
の文を生成する日本語生成手段とを有し、日本語生成手
段は、外国語の文に複数個の名詞を含む名詞句があると
、辞書手段より得られた名詞の訳語の字種を判別し、そ
の字種に応じて名詞相互間の連結表記を決定して日本語
の文を生成する日本語生成装置を特徴としたものである
。以下、本発明の実施例に基づいて具体的に説明する。Structure: In order to achieve the above object, the present invention includes a dictionary means storing dictionary data regarding Japanese translations corresponding to words in a foreign language, and a dictionary storing dictionary data regarding Japanese translations corresponding to words in a foreign language, and a dictionary regarding sentences in a foreign language whose morphology and syntax have been analyzed. and a Japanese language generating means that indexes the means and generates a corresponding Japanese sentence, and the Japanese language generating means is configured to generate a Japanese sentence that is better than the dictionary means when there is a noun phrase containing multiple nouns in the foreign language sentence. The present invention is characterized by a Japanese language generation device that determines the character type of the translated word of the noun that is entered, determines the connection notation between the nouns according to the character type, and generates a Japanese sentence. Hereinafter, the present invention will be specifically explained based on examples.

第１図を参照すると、本発明による自動翻訳装置の訳文
生成方式を英日自動翻訳装置に適用した実施例の全体構
成が示されている。なお本発明は、英語を［１本語に翻
訳する英日自動翻訳装置のみならず、他の外国語を日本
語に翻訳する自動翻訳装置にも効果的に適用されること
は、言うまでもない。Referring to FIG. 1, there is shown the overall configuration of an embodiment in which the translation generation method of the automatic translation device according to the present invention is applied to an English-Japanese automatic translation device. It goes without saying that the present invention is effectively applied not only to an English-Japanese automatic translation device that translates English into one language, but also to an automatic translation device that translates other foreign languages into Japanese.

同実施例は入力部１０を有し、日本語に翻訳すべき英文
テキスト１２がこれにより入力される。人力部ｌＯはた
とえば、英数字キーなどの文字キーや機濠キーなどを有
するキーボード、紙に記録された英文テキストを読み取
る光学的文字縁Ｊ１１１　”３７）（ＯＣＲ）　、およ
び（または）磁気ディスクなどの記憶媒体に記録された
英文テキストを読み込むファイル記憶装置などを含んで
よい。The embodiment has an input section 10, through which an English text 12 to be translated into Japanese is input. For example, the human power department 10 may include a keyboard having character keys such as alphanumeric keys and mechanical keys, an optical character frame J111 "37) (OCR) for reading English text recorded on paper, and/or a magnetic disk. It may include a file storage device for reading English text recorded on a storage medium.

人力ｆｉｌＯにより人力された英文テキストは、１部１
編集部１４に読み込まれ、翻訳の前処理が行なわれる。The English text manually created by filO is 1 part 1.
It is read into the editing section 14 and preprocessed for translation.

ここでは、主として文の認定と未知語の処理を行なう、
これは形ｙ５素解析の−・部として機廃する。Here, we mainly perform sentence recognition and processing of unknown words.
This will be abandoned as the - part of the form y5 elementary analysis.

市編集された英文データは、ｉｉｆ編集で得られた情報
とともに形１ム素解析１−ｉ１１１Ｂに転送される。形
通素解析部１６では、単語辞書１８を索引して文に分；
１し、英文の形通素を解析し、未知語の処理、固有名詞
、時の表現、数の表現などの各種のまとめあげを行ない
、付加疑問、同格の認定などの文全体の処理を行なう。The edited English data is transferred to the form 1 elemental analysis 1-i111B together with the information obtained by the IIF editing. The morphological analysis unit 16 indexes the word dictionary 18 and divides it into sentences;
1. Analyze the morphological elements of English sentences, process unknown words, summarize various types of expressions such as proper nouns, time expressions, and number expressions, and process entire sentences such as additional questions and appositive recognition.

その形ｊ８素解析ルールは解析ルールファイル３Ｂに格
納されている。The form j8 elementary analysis rule is stored in the analysis rule file 3B.

こうして形通素解析された英文データは、形通素解析で
得られた辞書情報とともに構文解析Ｉ部２０に転送され
る。構文解析１部２０は、本実施例ではｃｆｇ文法（ｃ
ｏｎｔｅｘｔ−ｆｒｅｅ　ｇｒａｍｍａｒ：文脈口由文
法）ルールを英文データに適用して文についてｂｏｔｔ
ｏｌｌ−ｕｐ、　ｒｉｇｈｔ−ｔｏ−ｌｅｆｔに表層構
造の解析を行ない、すべての構文的可能性を見つけ出す
機能部である。The English sentence data subjected to the morphological analysis in this manner is transferred to the syntactic analysis section I 20 together with the dictionary information obtained by the morphological analysis. In this embodiment, the syntactic analysis part 1 20 uses cfg grammar (c
Apply the ontext-free grammar (context-free grammar) rules to English data to analyze sentences.
This is a functional unit that analyzes the surface structure in an all-up, right-to-left manner and finds all syntactic possibilities.

構文解析１部２０で構文解析（バーズ）された英文デー
タは、その解析情報とともに構文解析１１部２２に送ら
れる。ここでは、構文解析工による表層的なバーズ結果
から、構造記述を適用して解を選択する。これによって
英語文の確からしい解析木を作成し、その構造を作る。The English data parsed (barsed) by the parser 1 section 20 is sent to the parser 11 section 22 together with the analysis information. Here, a structural description is applied to select a solution from the surface bar results obtained by a parser. This creates a reliable parse tree for the English sentence and creates its structure.

これらの構文解析ルールはやはり、解析ルールファイル
３Ｂに格納されている。These parsing rules are also stored in the parsing rule file 3B.

構文解析された英文データは、解析木のデータとして構
造変換部２４に転送される。構造変換部２４では、英語
文の中間的構造である構文木から対応する日本語文の構
文木を作成し、日本語文を訳出しやすい日本語基底構造
に変換する。The parsed English data is transferred to the structure conversion unit 24 as parse tree data. The structure conversion unit 24 creates a syntax tree for a corresponding Japanese sentence from a syntax tree that is an intermediate structure of an English sentence, and converts the Japanese sentence into a basic Japanese structure that is easy to translate.

こうして構造変換された日本語の構文木を示す構文木デ
ータは訳文生成部２６に送出され、後者にて訳文の生成
が行なわれる。これは、日本語の構文木の木構造から１
１本１１１〜の文を生成する機能である。まず、１，１
す１１「１を１１本１：ｂのそれに一致させ、るため、
ｊセ１序の入換えを行なって木構造を変更する構文生成
を行ない１次に形通素生成を行なって構文木においてｔ
ｏｐ−ｄｏｗｎ、　ｌｅｆｔ−ｔｏ−ｒｉｇｈｔに訳文
を生成する。The syntax tree data representing the Japanese syntax tree whose structure has been converted in this way is sent to the translation generation unit 26, and the latter generates a translation. This is 1 from the tree structure of the Japanese syntax tree.
This is a function that generates one sentence from 111 onwards. First, 1,1
11 "In order to match 1 with that of 11 1:b,
j ce 1 Perform syntax generation to change the tree structure by permuting the ordinal order, and 1st form generation is performed to generate t in the syntax tree.
Translations are generated op-down and left-to-right.

１訳文生成された日本語文データ、すなわち訳文データ
は、後編集部３０に送られる。後編集部３０では、翻訳
処理に利用した情報を使用し、辞書１Ｂを索引して訳文
データを修正し、より自然な日本語文を完成する。この
日本語文データは出力部３２に転送され、翻訳された「
ｊ氷詰文３４として出力部３２かう出力される。出力部
３２は、たとえばプリンタ、ディスプレイ、および（ま
たは）磁気ディスクなどのファイル記憶装置を含む。The Japanese sentence data for which one translated sentence has been generated, that is, the translated sentence data, is sent to the post-editing section 30. The post-editing unit 30 uses the information used in the translation process to index the dictionary 1B and correct the translated data to complete a more natural Japanese sentence. This Japanese sentence data is transferred to the output unit 32 and translated "
This is outputted to the output section 32 as a frozen sentence 34. Output unit 32 includes, for example, a printer, a display, and/or a file storage device such as a magnetic disk.

これらの一連の翻訳処理の流れは、本装置全体の制御を
統括する制御部３８によって制御される。The flow of a series of these translation processes is controlled by a control unit 38 that controls the entire apparatus.

ｃｎ語辞；！ｊ１８には、本実施例では英語および日本
語の単語についての辞−；データが格納され、解析ルー
ルファイル３６には、形通素解析および４＋４文解析の
ルールデータが格納されている。cn word dictionary;! j18 stores lexical data for English and Japanese words in this embodiment, and the analysis rule file 36 stores rule data for morphological analysis and 4+4 sentence analysis.

制御部３８には、操作表示部４０が接続されている。操
作表示部３８は、操作者から本装置に様々な指示をケえ
る、たとえば翻訳指示キー、カーソルキーなどの操作キ
ーや、入力英語文テキスト、翻訳結果の日本語文、辞書
情報などの中間データ、操作者に対する様々な指示など
を可視表示するティスプレィやインジケータを有する。An operation display section 40 is connected to the control section 38 . The operation display section 38 allows the operator to give various instructions to the device, such as operation keys such as translation instruction keys and cursor keys, input English text, translated Japanese text, intermediate data such as dictionary information, etc. It has a display and indicators that visually display various instructions to the operator.

なお、それらの操作表示機能の多くは、入力部１０にキ
ーボードを備えている場合はそのキーボードに、また出
力部３２にディスプレイを備えている場合はそのディス
プレイに含まれるように構成してよい。Note that many of these operation display functions may be configured to be included in the keyboard if the input unit 10 is equipped with a keyboard, or in the display if the output unit 32 is equipped with a display.

ところで構文解析工部２０では、形通素解析された英文
データについて、ファイル記憶装置３６に格納されてい
るｃｆｇ文法ルールを英文にｒ　ｉｇｈｔ−ｔｏ−１ｅ
ｆｔに適用してその文について可能性のあるすべての構
文解を導出する。この解は一般に構造木の形で理解され
る。これは、１つの文ごとにそれに含まれる１１語また
は句が修飽関係および格関係などの従属ないしは共起関
係によって相１７二に関連づけられ、たとえば親、ｒ−
１孫といった相ｌｌ：の従属関係を示すものである。各
単語または句は、構造木の笛点すなわちノードの位置を
占める。By the way, the syntactic analysis unit 20 converts the cfg grammar rules stored in the file storage device 36 into English text for the English data that has been subjected to the morphological analysis.
ft to derive all possible syntactic solutions for that sentence. This solution is generally understood in the form of a structural tree. This means that the 11 words or phrases contained in each sentence are related to each other through subordination or co-occurrence relationships such as saturation relationships and case relationships, such as parent, r-
This indicates the dependent relationship of phase II: such as one grandchild. Each word or phrase occupies the position of a whistle point or node in the structure tree.

こうして構文−１−のまとまりと従属関係を規定するデ
ータが得られると、これは構文解析ＩＩ部２４に送られ
る。このデータは、１ｔｉ述した構文木の形で容易に理
解される。これは、さらに構造変換部２４で１１本、：
／１文の構造に変換され、訳文生成部２６では、それに
含まれる各ノートごとに訳文を生成してゆく、構造木に
おけるノードの処理は、ｔｏｐ−ｄｏｗｎ、　ｌｅｆｔ
−ｔｏ−ｒｉｇｈｔ　テ行なう。When the data defining the set of syntax-1- and the dependency relationship is obtained in this way, it is sent to the syntax analysis section II 24. This data is easily understood in the form of the syntax tree described above. This is further converted into 11 lines by the structure conversion unit 24:
The processing of the nodes in the structure tree is as follows:
-to-right Do it.

本実施例における訳文生成部２６では、［１本語文を生
成する過程で、複数個の名１；−を含む名詞句について
次のような処理を行なう。すなわち、名詞句を形成する
各基１ｉ１４について、それが−字の快語であるか、二
字以りの１菓１語であるか、カタカナ語であるか、和語
であるか、英語などの外国語そのままの表記であるかな
ど、名５：・４の表記の仕方に応じて、すなわち単語の
字種に応じて表層的に名詞相互間の連結表記を生成して
いる。In the translation generation unit 26 of this embodiment, in the process of generating one main language sentence, the following process is performed on a noun phrase containing a plurality of nouns 1;-. In other words, for each group 1i14 that forms a noun phrase, whether it is a pleasant word with a - character, a word with two or more characters, Katakana, Japanese, English, etc. Depending on how the name 5:.4 is written, such as whether it is written as it is in the foreign language, or not, a connection notation between nouns is generated on the surface depending on the character type of the word.

本実施例では、たとえば解析ルールファイル３Ｂの一部
に連結表記テーブルが用意され、このような名詞句を生
成する場合、この連結表記テーブルを参照し、これに応
じた連結表記を選択するように構成されている。連結表
記テーブルの例を第２図に示す。In this embodiment, for example, a concatenation notation table is prepared as part of the analysis rule file 3B, and when such a noun phrase is to be generated, this concatenation notation table is referred to and a concatenation notation corresponding to the noun phrase is selected. It is configured. An example of a concatenation notation table is shown in FIG.

このテーブルでは、名詞句を形成する各名詞のうち隣接
する２つの名詞について、両者が一字の漢語（漢語ｌ）
であるか、工学以上の漢語（漢語２）であるか、カタカ
ナ語であるか、和語であるか、英語などの外国語そのま
まの表記であるかなどの組合せに応じて、ｒｌＪないし
「４」のコードが割り当てられる。この連結状態を示す
コードは、「１」で助詞「の」の挿入を規定し、「２」
で中点「・」の挿入を規定し、「３」で１字分の一スペ
ースの挿入を規定し、「４」で助詞も中点もスペースも
挿入しないことを規定している。なお、この連結表記テ
ーブルで英語とあるが、一般にこれは、英＋ｉ５にかぎ
らず外国語の文字をそのまま１１木、：ｈ文中に使用し
て表記する１１語の場合を包含している。In this table, for each noun that forms a noun phrase, two adjacent nouns are both Chinese words (Kanji l) that have one character.
RlJ or ” code will be assigned. The code indicating this connected state is "1" which specifies the insertion of the particle "no", and "2" which specifies the insertion of the particle "no".
``3'' specifies the insertion of a midpoint ``・'', ``3'' specifies the insertion of one space for one character, and ``4'' specifies that no particle, midpoint, or space be inserted. Note that although English is used in this concatenated notation table, this generally includes not only English+i5 but also 11 words in which foreign language characters are used as they are in the 11-tree, :h sentence.

本実施例における訳文生成部２６は、生成した日本語文
をシフトＪＩＳコードで表わしている。そこで、１１本
語中語の字種をこの文字コードにより判定し、その判定
結果に従って連結表記テーブルを参照して連結コードを
得る。その処理フローを第３図に−示す。The translated sentence generation unit 26 in this embodiment expresses the generated Japanese sentence in shift JIS code. Therefore, the character types of the 11 Japanese and Chinese characters are determined using this character code, and a concatenated code is obtained by referring to the concatenated notation table according to the determination result. The processing flow is shown in FIG.

この例では一１構文木の形に構文解析された英文におけ
る１つの名詞句を形成する連続した名詞Ｘ１（ｉ＝１．
２．、、、ｎ）について訳出を行なうものである。In this example, consecutive nouns X1 (i=1...
2. , , n).

先頭の英語名詞から順に処理を行なうが（５ｏ）、まず
先頭の英語名詞とそれに続く英語名詞をそれぞれレジス
タＮ’ＡとＮＯに格納する　（５１，５１）、そこでレ
ジスタＮＡおよびＮＢの英単語について辞書１８を索引
しく５３．５５）　、その日本語訳語をそれぞれレジス
タＪＡおよびＪＢに格納する（５４．５８）。Processing is performed in order from the first English noun (5o). First, the first English noun and the following English nouns are stored in registers N'A and NO (51, 51), and then the English words in registers NA and NB are stored. The dictionary 18 is indexed (53.55) and its Japanese translations are stored in registers JA and JB, respectively (54.58).

そこで訳文生成部２Ｂは、レジスタＪＡおよびＪＢの内
容に１いてＦＵＮＩ（ＪＡ、ＪＲ）なる処理を行なう、
この処理ＦＵＮＩＡ、ＪＢ）は、レジスタＪＡとＪＢの
内容の表記の関係から一義的に決まる連結表記を発生す
る処理である。たとえば、本実施例では、レジスタＪ’
Ａと」Ｂに格納されている単語の先頭文字の文字コード
によってその単語の字種を判定し、これに基づいて第２
図に示す連結表記テーブルを参照して対応する連結コー
ドを発生する。たとえばシフトＪＩＳコードでは、カタ
カナ文字は０！８３４０から０１８３９８に定められて
いる。なお１判定した字種が漢語であるときは、さらに
そのｆｆ′語全体の字数も判断し、１字の漢語であるか
、２字以上のｂ　６６であるかを決定する。その処理結
果、すなわち連結コードはレジスタＮＯに格納される（
５７）。Therefore, the translation generation unit 2B performs a process called FUNI (JA, JR) based on the contents of registers JA and JB.
This process (FUNIA, JB) is a process that generates a concatenated notation that is uniquely determined from the notation relationship of the contents of registers JA and JB. For example, in this embodiment, register J'
The character type of the word is determined based on the character code of the first character of the word stored in "A" and "B," and the second character type is determined based on this.
A corresponding concatenation code is generated by referring to the concatenation notation table shown in the figure. For example, in the Shift JIS code, katakana characters are defined as 0!8340 to 018398. Note that when the character type determined as 1 is Chinese, the number of characters in the entire ff' word is also determined, and it is determined whether it is Chinese with one character or b66 with two or more characters. The processing result, that is, the concatenated code, is stored in register NO (
57).

そこで、レジスタＪＡに格納されている日本語単語にレ
ジスタＮＯの内容に応じて、助詞「の」の付加、中点「
・」の付加、１字分のスペースの付加、何も付加しない
などの処理ＦＵＮ２（ＪＡ、Ｎｏ）を行ない、その結果
をレジスタＹＡＫＵにセットする。レジスタＹＡＫＵは
、こうして生成された日本語訳が順次格納される記憶手
段である。Therefore, depending on the contents of register NO, the particle "no" is added to the Japanese word stored in register JA, and the midpoint "no" is added to the Japanese word stored in register JA.
Processing FUN2 (JA, No) such as adding ", adding a space for one character, adding nothing, etc. is performed, and the result is set in register YAKU. Register YAKU is a storage means in which the Japanese translations generated in this way are sequentially stored.

このような・連の処理を１つの英語名１書４句に含まれ
る仝名、ｉ！Ｎ１−Ｎｎについて先頭から順に実行する
　（５９，１３０）、これによって、ｒａｔなる名Ｊ４
の羅列でなく、助、７．４　ｒの」や中点「・」、スペ
ースなどが適νＪに挿入された自然な１１本ＬＩＴ’Ｓ
名詞句が生成される。こうして生成された訳文は、後編
集部３ｏで後処理が行なわれ、操作表示部４ｏに？’＋
７視表示されるとともに、出力部３２にて日本文３４と
してたとえば印字出力される。This kind of process of ``ren'' is performed by the nickname, i!, included in one English name, one book, and four phrases. N1-Nn are executed in order from the beginning (59, 130), thereby the name rat J4
It is not a list of ``, ``7.4 r'', a middle point ``・'', a space, etc. are inserted into the appropriate νJ, making it a natural 11-line LIT'S.
A noun phrase is generated. The translation generated in this way is post-processed in the post-editing section 3o, and displayed on the operation display section 4o. '＋
It is displayed visually, and is also printed out as Japanese text 34 by the output unit 32, for example.

ところで、上述の処理ＦＵＮＩ（ＪＡ、ＪＢ）では日本
語名詞の表層構造すなわち字種を判定するのに文字コー
ドを利用するように構成されていた。しかし、このよう
にしないで、またはこれに加えて、辞、１：１８に連結
表記に関する表記情報として字種のデータを領ｉえ、英
単語により辞書１８を索引して日本語訳語を得るときに
この表記情報も入手し、これにノ、（づいて前述の助詞
「の」の付加、中点「φ」の付加、１字分のスペースの
付加、何も付加しないなどの処理を行なうように構成し
てもよい。その場合、辞、Ｉ；１８は、第４図に示すよ
うに各エントリのレコードに訳１：５の表記情報が登録
されている。そこで、このような辞、Ｈｓを使用する場
合は、ステ７プ５７における処理ＦＩＩＮＩ（ＪＡ、Ｊ
Ｂ）では、レジスタＪＡおよびＪＢに格納されている辞
書エントリから表記情報を取り出し、これに従って第２
図の連結表記テーブルを索引して連結コードを得る処理
を行なう。By the way, the above-mentioned processing FUNI (JA, JB) is configured to use character codes to determine the surface structure of Japanese nouns, that is, the character type. However, instead of doing this, or in addition to this, when receiving character type data as orthographic information regarding connected notation in ji, 1:18, and indexing the dictionary 18 using English words to obtain the Japanese translation. Next, we obtain this notation information and perform processing such as adding the particle ``no'' (as described above), adding the midpoint ``φ'', adding a space for one character, and not adding anything. In that case, for the word I;18, notation information of translation 1:5 is registered in the record of each entry as shown in Figure 4.Therefore, such a word, Hs When using the process FIINI (JA, J
In B), the notation information is extracted from the dictionary entries stored in registers JA and JB, and the second
A process is performed to obtain a concatenation code by indexing the concatenation notation table in the figure.

このような辞、！；　ｉ　ａの表記情報による連結表記
の処理は、文字コードによる処理に比べて単語の意味ま
で考慮したきめ細かい処理を行なうことができる。すべ
ての名詞について文字コードから字種を判定して連結テ
ーブルを索引するような方式をとらず、表記情報を全部
辞書情報から得るように構成してもよい。また文字コー
ドから′ｒ１定する方式の補助的手法として辞書情報か
ら得る方式を併用してもよく、その場合は文字コードか
ら判定する方式の効率の良さと、これでは対処しきれな
い例外的ケースを辞書情報により適切に処理できるきめ
細かさの両方の利点が得られる。Words like this! ; i The process of concatenated notation using the notation information of a can perform more detailed processing that takes into account the meaning of the word, compared to the process using character codes. Instead of determining the character types of all nouns from character codes and indexing the concatenation table, it is also possible to obtain all the notation information from dictionary information. In addition, a method obtained from dictionary information may be used as an auxiliary method to the method of determining 'r1 from the character code. This provides the benefits of both fine-grained processing, which can be processed more appropriately using dictionary information.

効　　１東未発１９１によれば、１１本１；Δの生成において、名
詞句を形成する名１；Ｉ４の字種に応じて、各名、；１
１間の連結に助、ｉ、４　ｒの」を付加するか否かなど
の処理が行なわれる。この方式によれば、比較内筒「ｎ
な処理で１１木語の［１然さを失うことなく名詞句を表
記することができる。したがって名詞句の表記に関して
より自然な１１本語が生成される。Effect 1According to 191 Azuma, 191, in the production of 11 books 1;Δ, each name forms a noun phrase, 1; depending on the character type of I4, each name;
Processing such as whether or not to add "suke, i, 4 r" to the connection between 1 and 1 is performed. According to this method, the comparison inner cylinder "n
With this process, it is possible to write noun phrases in 11 wooden languages without losing their originality. Therefore, 11 more natural words are generated with respect to the notation of the noun phrase.

[Brief explanation of drawings]

第１図は、本発明によるし１本語生成装置を英■［Ｉリ
ノ翻訳装置に適用した実施例の全体構成を示すａ脂ブロ
ック図、第２図は、第１図に示す実施例における解析ルールファ
イルに格納されている名詞の連結表記テーブルの例を示
す説明図、第３図は、同実施例の１１木１１．〜生成処理における
名詞句の処理の例を示すフロー図、第４図は、同実施例における単語辞書に格納されている
英語の単語についての辞１与情報の例を示す説明図であ
る。主要部分の符号の説明１０、、、入力部１６１．形ｙム素解析部１８、、、弔１悟辞書２０．２２．　、構文解析部２４、、、構造変換部２６、、、訳文生成部３２、、、出力部３６、、、解析ルール３８、、、制御部４０１．操作表示部FIG. 1 is a block diagram showing the overall configuration of an embodiment in which the Japanese language generation device according to the present invention is applied to an English language translation device. FIG. 3 is an explanatory diagram showing an example of a noun concatenation notation table stored in the analysis rule file. - Flowchart showing an example of noun phrase processing in the generation process. FIG. 4 is an explanatory diagram showing an example of idiom 1 given information about English words stored in the word dictionary in the same embodiment. Explanation of symbols of main parts 10, Input section 161. Shape y elemental analysis unit 18,,, Condolences 1 Satoru Dictionary 20.22. , Syntax analysis unit 24, , Structure conversion unit 26, , Translation generation unit 32, , Output unit 36, , Analysis rule 38, , Control unit 401. Operation display section

Claims

[Claims] 1. Dictionary means storing dictionary data regarding Japanese translations corresponding to words in a foreign language, and indexing the dictionary means for foreign language sentences whose shapes and syntax have been analyzed. a Japanese language generating means for generating a corresponding Japanese sentence based on the foreign language sentence; 1. A Japanese language generation device that generates a Japanese sentence by determining a character type of a translated word of the noun, and determining a connecting notation between the nouns according to the character type. 2. In the device according to claim 1, the translated word is represented by a character code, and the Japanese language generation means determines the character type of the noun based on the character code of the first character of the noun. Characteristic Japanese generation device. 3. The Japanese language generation device according to claim 2, wherein the Japanese language generation means further determines the character type of the noun based on the number of characters forming the noun. 4. In the device according to claim 1, the dictionary means stores dictionary data regarding character types of Japanese translations for nouns, and the Japanese language generation means stores dictionary data obtained from the dictionary means. A Japanese language generation device characterized by determining the character type of a noun from data.