JPS6398774A

JPS6398774A - Language analyzing device

Info

Publication number: JPS6398774A
Application number: JP61243197A
Authority: JP
Inventors: Toshihiko Yokogawa; 横川　壽彦
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1986-10-15
Filing date: 1986-10-15
Publication date: 1988-04-30
Anticipated expiration: 2011-03-04
Also published as: JPH0821033B2

Abstract

PURPOSE:To analyze a proper noun into plural types of meaning in accordance with the context by using a property information adding means to give the property information to the character strings which are not recorded into a dictionary means and the character strings whose property information are not recorded into the dictionary means among those input character strings. CONSTITUTION:An input processing part 14 reads out the stored input sentences and outputs them to a unit segmenting part 16. A dictionary retrieving part 22 retrieves a reference dictionary 20 for the input sentences sent from the part 16 and divided into dictionary consulting units. A character string if serves as a proper noun is outputted to a proper noun processing part 24; while the character strings which are not equal to proper nouns are outputted to a processing part 36 and stored in a dictionary information storing table 36a of the part 36. A default property information adding part 34 is read out of the table 36a and sent to the part 22 through the part 16. The property information is added to the proper nouns having no property information according to the retrieving result of the dictionary 20 obtained by the part 22. These proper nouns are stored in the table 36a. A syntax structure analyzing part 38 analyzes the syntax structure of an input sentence that is read out of the table 36a and undergone the morpheme analysis.

Description

【発明の詳細な説明】技術分野本発明は言語解析装置、特に自動翻訳装置に有用な言語
解析装置に関する。TECHNICAL FIELD The present invention relates to a language analysis device, particularly to a language analysis device useful for automatic translation devices.

従来技術例えば英語等の外国語の文からそれに対応する日本語の
文を作成する場合、人力された英文の形７Ｕ素を解析し
、その構文を解析し、その文構造を変換し、その後に日
本語の訳文を生成する。Prior Art For example, when creating a corresponding Japanese sentence from a sentence in a foreign language such as English, the human-generated form 7U elements of the English sentence are analyzed, the syntax is analyzed, the sentence structure is converted, and then the sentence structure is converted. Generate a Japanese translation.

すなわち、辞書を検索することによって入力文を構成す
る各申請等の形ｊＥｉ素を解析し、これらの形１６素に
ついての品詞等の情報を得る。その後。That is, by searching a dictionary, the form jEi elements of each application forming the input sentence are analyzed, and information such as parts of speech for these 16 forms is obtained. after that.

得られた品詞等の情報に基づいて、各々の単語またはこ
れらの単語により形成されるブロック相互の修飾関係を
文法ルールにより解析することにより、入力文の構造の
解析、すなわち構文解析を行う、さらに、解析された構
文に基づいて入力文の文構造を日本語の構文配列の順序
に変換し、変換された配列に従って日本語の形態素を生
成し、日本語文を生成する。Based on the obtained information such as parts of speech, the structure of the input sentence is analyzed by analyzing the modifying relationships between each word or the blocks formed by these words using grammatical rules. , converts the sentence structure of the input sentence into the order of the Japanese syntactic arrangement based on the parsed syntax, generates Japanese morphemes according to the converted arrangement, and generates a Japanese sentence.

このような訳文生成の際に行われる形態素解析において
は、スペース、コンマ、コロン等のデリミツタにより入
力文を辞書引き単位に区分し、この辞書引き単位につい
て辞書を検索することにより、単語等の形態素の品詞そ
の他の情報を得ている。その場合に、例えば固有名詞は
、文脈により２以上の異なった意味に用いられるものが
ある。In the morphological analysis performed when generating a translated sentence, the input sentence is divided into dictionary lookup units using delimiters such as spaces, commas, and colons, and the dictionary is searched for these dictionary lookup units to identify morphemes such as words. The part of speech and other information are obtained. In this case, for example, some proper nouns can have two or more different meanings depending on the context.

例えばｒ（ｌｓａｋａ　Ｃ１ｔｙＪは、ｒＯｓａｋａ　
Ｃ１ｔｙ　ｄｅｃｉｄｅｄ・・・」のように団体を表す
意味で使用される場合と、「・・・ｉｎ　０ｓａｋａ　
Ｃ１ｔｙ　Ｊのように場所を表す意味で使用される場合
とがある。従来、固有名詞は１個の意味のみを辞書に記
憶していたから、このような使い分けには対応できず、
解析精度が低かった。For example, r(lsaka C1tyJ is rOsaka
There are cases where it is used to represent a group, such as ``C1ty decided...'', and cases where it is used to indicate a group, such as ``...in 0saka...''.
There are cases where it is used to indicate a place, such as C1ty J. Traditionally, only one meaning of a proper noun was stored in a dictionary, so it was not possible to handle this type of usage.
Analysis accuracy was low.

目的本発明はこのような従来技術の欠点を解消し、固有名詞
を文脈に応じて複数の意味に解析を行うことのできる言
語解析装置を提供することを１」的とする。OBJECTS OF THE INVENTION It is an object of the present invention to provide a language analysis device capable of solving the drawbacks of the prior art and analyzing a proper noun into multiple meanings depending on the context.

構　　成本発明は上記の目的を達成させるため、所定の言語の文
字列を人力する入力手段と、入力手段から入力された文
字列の検索に用いられる辞書手段と、入力された文字列
について辞書手段を検索する検索手段と、検索手段によ
る検索の結果、入力された文字列のうち辞書手段に記録
されていない文字列および辞書手段に性質情報が記録さ
れていない文字列について、性質情報を封手する性質情
報封手手段とを有し、性質情報付４手段は、性質情報の
ない文字列に複数の性質情報を付惧することを特徴とし
たものである。以下、本発明の一実施例に基づいて具体
的に説明する。Structure In order to achieve the above object, the present invention includes an input means for manually inputting a character string in a predetermined language, a dictionary means used for searching the character string input from the input means, and a dictionary means for searching the input character string. As a result of the search by the search means, character strings that are not recorded in the dictionary means and character strings for which character information is not recorded in the dictionary means are sealed with property information. The fourth property information attaching means is characterized in that a plurality of property information is attached to a character string without property information. Hereinafter, a detailed explanation will be given based on one embodiment of the present invention.

第１図には、本発明による言語解析装ごを英日自！ｌ１
１１訳装置に適用した一実施例が示されている。なお、
本発明は、英語の入力文の形態素解析以外にも用いるこ
とができ、英語を日本語に翻訳する英日翻訳装置のみな
らず、ある１つの言語を他の言語に翻訳する自動翻訳装
置にも効果的に適用されることは、言うまでもない。Figure 1 shows the language analysis equipment according to the present invention. l1
An example of application to a No. 11 translation device is shown. In addition,
The present invention can be used for purposes other than morphological analysis of English input sentences, and can be used not only for English-Japanese translation devices that translate English into Japanese, but also for automatic translation devices that translate one language into another language. Needless to say, it can be applied effectively.

本実施例は入力処理部１４を有し、入力処理部１４には
入力装置１２からデータが入力される。入力表２ｔ１２
は例えば、英数字キー等の文字キーや機能キー等を有す
るキーボード、紙に記録された英字テキストを読み取る
光学的文字読み取り装置、磁気ディスクの読み取り装置
等を含む。This embodiment has an input processing section 14, into which data is input from the input device 12. Input table 2t12
includes, for example, a keyboard having character keys such as alphanumeric keys, function keys, etc., an optical character reading device for reading alphabetic text recorded on paper, a magnetic disk reading device, and the like.

入力処理部１４は入力文字列バッファ１４ａを有し、入
力表！！１２から入力された英語の入力文を入力文字列
バッファ１４ａに記憶する。入力処理部１４は入力文字
列バッファ１４ａに記憶された入力文を読み出して単位
切出し部１Ｂに出力する。単位切出し部１６は、デリミ
ツタテーブル１８の検索によって、入力処理部１４から
送られた入力文の辞書引き単位切出しを行う機能部であ
る。デリミツタテーブル１８にはスペース、コンマ等の
デリミツタが格納されている。The input processing unit 14 has an input character string buffer 14a, and has an input table! ! The English input sentence input from 12 is stored in the input character string buffer 14a. The input processing section 14 reads the input sentence stored in the input character string buffer 14a and outputs it to the unit extraction section 1B. The unit extraction unit 16 is a functional unit that performs dictionary lookup unit extraction of the input sentence sent from the input processing unit 14 by searching the delimiter table 18. The delimiter table 18 stores delimiters such as spaces and commas.

単位切出し部ＩＢはデリミツタテーブル１８からデリミ
ツタを読み出し、入力処理部１４から送られる入力文を
デリミツタの存在する部分で区切ることにより、参照辞
書２０を検索する場合の単位となる文字列に分割する０
分割された文字列は辞書検索部２２に入力される。The unit extraction unit IB reads delimiters from the delimiter table 18, and divides the input sentence sent from the input processing unit 14 into character strings that serve as units when searching the reference dictionary 20 by dividing the input sentence at the portion where the delimiter exists. 0
The divided character strings are input to the dictionary search section 22.

辞書検索部２２は、単位切出し部１６から送られる辞書
引き単位に分割された入力文について参照辞書２０を検
索する。参照辞書２０には、例えば第２図に示すように
、構文の文字列のエントリ、その品詞、性質情報等が記
録されている。参照辞書２０には同図に示す固有名詞の
他、その他の品詞、例えば動詞、形容詞等の文字列も記
録されている。なお、同図において品詞としての固有名
詞という記録は、後述する登録固有名詞処理を行うもの
を意味し、通常の文法上の固有名詞を表すものではない
、また、性質情報とは、その固有名詞が表すものを示し
、１つとは限らない、これは後述するように、固有名詞
が使用される場合に応じて複数の性質を表すからである
。The dictionary search section 22 searches the reference dictionary 20 for the input sentence divided into dictionary lookup units sent from the unit extraction section 16 . In the reference dictionary 20, for example, as shown in FIG. 2, entries of character strings of syntax, their parts of speech, property information, etc. are recorded. In addition to the proper nouns shown in the figure, the reference dictionary 20 also records character strings of other parts of speech, such as verbs and adjectives. In addition, in the figure, the record of a proper noun as a part of speech refers to the registered proper noun processing described later, and does not represent a normal grammatical proper noun. indicates what it represents, and is not limited to one. This is because, as will be described later, a proper noun represents a plurality of properties depending on the case in which it is used.

辞書検索部２２は辞書引き単位に分割された文字列につ
いて参照辞書２Ｇを検索し、文字列が固有名詞である場
合には固有名詞処理部２４に出力し、後述する固有名詞
の処理を行う、また、固有名詞でない場合には処理部３
６に出力し、処理部３６の辞書情報保存テーブル３８ａ
に保存する。The dictionary search unit 22 searches the reference dictionary 2G for the character string divided into dictionary lookup units, and if the character string is a proper noun, outputs it to the proper noun processing unit 24 to perform proper noun processing to be described later. In addition, if it is not a proper noun, the processing unit 3
6 and the dictionary information storage table 38a of the processing unit 36.
Save to.

固有名詞処理部２４は、直前文末処理部２Ｂ、直ｊｉｉ
ｊ固有名詞処理部２８、固有名詞自身処理部３０、直前
固有名詞および固有名詞処理部３２、デフォルト性質情
報付与部３４からなる。The proper noun processing unit 24 includes the immediately preceding sentence-final processing unit 2B,
j It consists of a proper noun processing section 28, a proper noun itself processing section 30, a previous proper noun and proper noun processing section 32, and a default property information adding section 34.

直前文末処理部２８は、辞書検索部２２により検索され
人力された文字列の直前の文字列が文末であるか否かを
判断し、直前の文字列が文末である場合には処理すべき
文字列の先頭の大文字を小文字に変換して辞書検索部２
２に送り、辞書検索部２２に再度参照辞書２０の検索を
行わせる。この再度の検索によっても検索されなかった
文字列は未登録の固有名詞と判断され、処理部３Ｂに送
られ、辞書情報保存テーブル３６ａに保存される。また
直前の文字列が文末でない場合に、後述するように性質
情報未知の固有名詞として処理部３Ｂに送り、辞書情報
保存テーブル３８ａに登録する。The immediately preceding sentence end processing unit 28 determines whether the character string immediately preceding the character string searched and manually entered by the dictionary search unit 22 is the end of the sentence, and if the immediately preceding character string is the end of the sentence, the character string to be processed is processed. Dictionary search unit 2 converts the first uppercase letter of the column to lowercase letter
2, and causes the dictionary search unit 22 to search the reference dictionary 20 again. Character strings that are not found in this second search are determined to be unregistered proper nouns, and are sent to the processing unit 3B and stored in the dictionary information storage table 36a. If the immediately preceding character string is not at the end of a sentence, it is sent to the processing unit 3B as a proper noun with unknown property information and registered in the dictionary information storage table 38a, as will be described later.

直前固有名詞処理部２８は、直前文末処理部２Ｂから送
られた直前の文字列の性質情報を解析し、固有名詞自身
処理部３０に出力する。固有名詞自身処理部３０は、解
析すべき固有名詞の性質情報を調べ、後述するように、
固有名詞と直前の固有名詞のいずれかに性質情報が未登
録の場合に、他方の登録された性質情報により固有名詞
と直前の固有名詞をまとめて解析し、処理ｇ２Ｉ３Ｂの
辞書情報保存テーブル３８ａに保存する。The immediately preceding proper noun processing section 28 analyzes the property information of the immediately preceding character string sent from the immediately preceding sentence end processing section 2B, and outputs it to the proper noun itself processing section 30. The proper noun itself processing unit 30 examines the property information of the proper noun to be analyzed, and as described later,
If property information is not registered for either the proper noun or the immediately preceding proper noun, the proper noun and the immediately preceding proper noun are analyzed together using the registered property information of the other, and stored in the dictionary information storage table 38a of processing g2I3B. save.

直前固有名詞および固有名詞処理８ＩＩ３２は、固有名
詞自身処理部３０から送られた解析すべき固有名調およ
び直前の固有名詞の性質情報の共通部分を調べ、共通部
分によりこれらの固有名詞を解析し、処理部３６に出力
し、処理部３Ｂの辞−１情報保存テーブル３８ａに保存
する。The immediately preceding proper noun and proper noun processing 8II32 checks the common part of the property information of the proper name tone to be analyzed and the immediately preceding proper noun sent from the proper noun itself processing unit 30, and analyzes these proper nouns based on the common part. , is output to the processing unit 36 and stored in the dictionary-1 information storage table 38a of the processing unit 3B.

デフォルト性質情報付与部３４は辞書情報保存テーブル
３８ａから読み出され、単位切出し部１６を通して辞書
検索部２２に送られ、辞書検索部２２により参照辞書２
０を検索した結果、性質情報のない固有名詞について、
性質情報を付与する。固有名詞は使用される状況に応じ
て各種の性質を有することが有効であるため、付与され
る性質情報は必要と思われるものをすべて付与する０例
えば、「人、場所、団体、その他」が付与される。デフ
ォルト性質情報付与部３４は、固有名詞に性質情報を付
与した後、これらのデータを処理部３６に送り、処理部
３６の辞書情報保存テーブル３８ａに保存する。The default property information adding section 34 is read out from the dictionary information storage table 38a, sent to the dictionary search section 22 through the unit extraction section 16, and is sent to the dictionary search section 22 by the dictionary search section 22.
As a result of searching for 0, for proper nouns without property information,
Add property information. It is effective for proper nouns to have various properties depending on the situation in which they are used, so all necessary property information should be added. For example, "person, place, organization, etc." Granted. After adding property information to the proper noun, the default property information adding unit 34 sends this data to the processing unit 36 and stores it in the dictionary information storage table 38a of the processing unit 36.

処理部３Ｂは、辞書情報保存テーブル３６ａを有し、直
前固有名詞および固有名詞処理部３２、デフォルト性質
情報付与部３４または辞書検索部２２から送られたデー
タを辞書情報保存テーブル３８ａに保存した後、保存さ
れたデータを読み出して構文解析部３８に出力する。構
文解析部３８は辞書情報保存テーブル３８ａから読み出
された形態素解析の行われた入力文の構文解析を行う。The processing unit 3B has a dictionary information storage table 36a, and after storing data sent from the immediately preceding proper noun and proper noun processing unit 32, the default property information adding unit 34, or the dictionary search unit 22 in the dictionary information storage table 38a. , reads the saved data and outputs it to the syntax analysis unit 38. The syntactic analysis unit 38 performs syntactic analysis of the morphologically analyzed input sentence read from the dictionary information storage table 38a.

第３図に示すフローチャートにより、本装置の動作を説
明する。The operation of this device will be explained with reference to the flowchart shown in FIG.

まず、入力装置１２から英語の入力文を入力処理部１４
に読み込む（１００）。入力処理部１４に読み込まれた
入力文は入力文字列バッファ１４ａに格納される。入力
文字列バッファ１４ａに記憶された入力文は単位切り出
し部１６に読み出される。First, an English input sentence is input from the input device 12 to the input processing unit 14.
(100). The input sentence read into the input processing unit 14 is stored in the input character string buffer 14a. The input sentence stored in the input character string buffer 14a is read out by the unit extraction section 16.

単位切り出し部１６では、入力文が入力されるとプリミ
ツトテーブル１８からデリミツタを読み出し、辞書引き
単位の切り出しが行われる（１０２）。When the input sentence is input, the unit extraction section 16 reads delimiters from the primitive table 18 and performs extraction in dictionary lookup units (102).

すなわち、入力された入力文を構成する文字列は、スペ
ース、コロン等のデリミツタのある部分で区切ることに
より、参照辞書２０を検索する場合のｔｔｉ位である検
索キー文字列に、文字列の先頭から順に分割される。分
割された辞書引き単位、すなわち検索キー文字列が終り
か否かを判断しく１０４）　、検索キー文字列がまだあ
る場合（終りでない場合）には検索キー文字列を辞書検
索部２２に送る。That is, by separating the character strings constituting the input sentence with delimiters such as spaces and colons, the beginning of the character string can be added to the search key string, which is the tti position when searching the reference dictionary 20. It is divided in order from It is determined whether or not the divided dictionary lookup unit, that is, the search key string, is the end (104). If there is still a search key string (not the end), the search key string is sent to the dictionary search unit 22.

辞書検索部２２に検索キー文字列が送られると、辞書検
索部２２はこの検索キー文字列について参照辞書２０を
検索する（１０Ｂ）　、第２図に示すような参照辞書２
０のエントリに検索キー文字夕鴫があるか否かを判断し
く１０８）　、　エントリがある場合には参照辞書２０
に記憶されている品詞情報を読み出し、検案キー文字列
が固有名詞か否かを判断する（１１０）　。When the search key string is sent to the dictionary search unit 22, the dictionary search unit 22 searches the reference dictionary 20 for this search key string (10B).The reference dictionary 2 as shown in FIG.
108) to determine whether the entry 0 has the search key character Yushu (108), and if there is an entry, refer to the reference dictionary 20.
The part-of-speech information stored in is read out, and it is determined whether the test key character string is a proper noun (110).

検索キー文字列が固有名詞でない場合には、辞書検索部
２２は参照辞書２０から読み出したデータを処理部３Ｂ
に送り、辞書情報保存テーブル３８ａに記録する（１１
２）　、辞書情報保存テーブル３８ａにデータが保イＦ
されると、処理部３６から単位すＪり出し部１６にデー
タが保存された旨の入力および直前に保存された検案キ
ー文字列のデータが入力される。If the search key character string is not a proper noun, the dictionary search unit 22 uses the data read from the reference dictionary 20 in the processing unit 3B.
and records it in the dictionary information storage table 38a (11
2) The data is saved in the dictionary information storage table 38a.
Then, the processing section 36 inputs an input to the effect that the data has been saved and the data of the examination key character string that was saved immediately before.

これによりステップ１０２に戻り、再び単位切り出しｆ
ｌｌｌＢにおいて辞書引き単位の切り出しを行う。As a result, the process returns to step 102 and the unit cutout f
In lllB, dictionary lookup units are extracted.

ステップ１１０において検索キー文字列が固有名詞の場
合には、辞書検索部２２は参照辞書２０から読み出した
その固有名詞（以下単に固有名詞という）のデータを、
処理部３８の辞書情報保存テーブル３８ａから単位切り
出し部１６を通して辞書検索部２２に入力されている直
前の検索キー文字列のデータとともに、固有名詞処理部
２４に送り、固有名詞処理部２４において辞書登録固有
名詞の処理が行われる（＋１４）。If the search key character string is a proper noun in step 110, the dictionary search unit 22 reads the data of the proper noun (hereinafter simply referred to as proper noun) from the reference dictionary 20,
It is sent from the dictionary information storage table 38a of the processing unit 38 to the proper noun processing unit 24 along with the data of the previous search key character string inputted to the dictionary search unit 22 through the unit extraction unit 16, and is registered in the dictionary in the proper noun processing unit 24. Proper noun processing is performed (+14).

ここで辞書登録固有名詞の処理を第４図に示すフローチ
ャートにより説ＩＪＪする。Here, the processing of proper nouns registered in the dictionary will be explained using the flowchart shown in FIG.

辞書検索部２２から固有名詞処理部２４に送られたデー
タは、直前文末処理部２６を通して直前固有名詞処理部
２８に入力される。辞ど登録固有名詞の処理においては
、直前文末処理部２６は機能しない。The data sent from the dictionary search section 22 to the proper noun processing section 24 is input to the immediately preceding proper noun processing section 28 through the immediately preceding sentence end processing section 26. In processing registered proper nouns, the immediately preceding sentence end processing unit 26 does not function.

直前固有名詞処理部２８において、固有名詞の直前にあ
る検索キー文字列が、参照辞書２０に未登録の固有名詞
、すなわち後述する辞書未登録固有名詞処理を行われた
ものであるか台かを判断しく２００）　、未登録の固有
名詞である場合には、固有名詞および直前の未登録の固
有名詞の全体を固有名詞の性質情報をもつ固有名詞と判
断しく２０２）、これらのデータを処理部３６に送り、
辞書情報保存テーブル３６ａに記録する（２１８）。The immediately preceding proper noun processing unit 28 determines whether the search key character string immediately preceding the proper noun is a proper noun that has not been registered in the reference dictionary 20, that is, has undergone dictionary-unregistered proper noun processing to be described later. If the proper noun is an unregistered proper noun, the proper noun and the entire previous unregistered proper noun are judged to be proper nouns that have proper noun property information202), and these data are processed by the processing unit. Send to 36,
It is recorded in the dictionary information storage table 36a (218).

直前固有名詞処理部２８において、ステップ２００で、
固有名詞の直前にある検索キー文字列が未登録の固有名
詞でないと判断された場合には、固有名詞の直前にある
検索キー文字列が参照辞書２０に登録された固有名詞か
否かを判断する（２０４）。固有名詞の直前にある検索
キー文字列が登録された固有名詞である場合には、この
直前の固有名詞の性質情報が未知（Ｕｎｋｎｏｗｎ）　
、すなわち参照辞書２０に登録されていないものである
か否かを判断する（２０Ｂ）　。In the immediately preceding proper noun processing unit 28, in step 200,
If it is determined that the search key string immediately before the proper noun is not an unregistered proper noun, it is determined whether the search key string immediately before the proper noun is a proper noun registered in the reference dictionary 20. (204). If the search key string immediately before a proper noun is a registered proper noun, the property information of the proper noun immediately before it is unknown.
That is, it is determined whether or not it is not registered in the reference dictionary 20 (20B).

直前の固有名詞の性質情報が未知の場合には、ステップ
２０２に進み、固有名詞および直前の固有名詞の全体を
固有名詞の性質情報をもつ固有名詞と判断しく２０２）
　、直前固有名詞処理部２８はこれらのデータを処理部
３Ｂに送る。処理部３６に送られたデータは、辞書情報
保存テーブル３ｅａに記録される（２１８）　。If the property information of the immediately preceding proper noun is unknown, the process proceeds to step 202, and the proper noun and the entire immediately preceding proper noun are determined to be proper nouns having proper noun property information (202).
, the immediately preceding proper noun processing unit 28 sends these data to the processing unit 3B. The data sent to the processing unit 36 is recorded in the dictionary information storage table 3ea (218).

直前固有名詞処理部２８においてステップ２０Ｂで、直
前の固有名詞の性質情報が未知（Ｕｎｋｎｏｗｎ）でな
い、すなわち参照辞書２０に登録されていると判断され
た場合には、これらのデータは直前固有名詞処理部２８
から固有名詞自身処理部３０に送られる。固有名詞自身
処理Ｆ′Ａ３０において、固有名詞の性質情報が未知（
Ｕｎｋｎｏｗｎ）であるか否かを判断する（２０８）　
、固有名詞の性質情報が未知である場合には、固有名詞
自身処理部３０は固有名詞および直前の固有名詞の全体
を直前の固有名詞の性質情報をもつ固有名詞と判断しく
２１０）　、これらのデータを処理部３６に送る。処理
部３６に送られたデータは、辞、■情報保存テーブル３
８ａに記録される（２１８）　。If the immediately preceding proper noun processing unit 28 determines in step 20B that the property information of the immediately preceding proper noun is not unknown, that is, it is registered in the reference dictionary 20, these data are subjected to the immediately preceding proper noun processing. Part 28
The proper noun itself is sent to the proper noun processing unit 30. In proper noun self processing F'A30, property information of the proper noun is unknown (
(208)
, when the property information of a proper noun is unknown, the proper noun itself processing unit 30 determines that the proper noun and the immediately preceding proper noun are all proper nouns that have the property information of the immediately preceding proper noun (210). The data is sent to the processing section 36. The data sent to the processing unit 36 is stored in the information storage table 3.
8a (218).

固有名詞自身処理部３０において、固有名詞自身の性質
情報が未知でない、すなわち参照辞書２０に登録されて
いると判断された場合には、固有名詞［１身処理ｆ１！
３０はこれらのデータを直前固有名詞および固有名詞処
理部３２に送る。直前固有名詞および固有名詞処理部３
２において、固有名詞と直前の固有名詞との間に性質情
報の共通性があるか否かを判断しく２１２）　、共通性
がある場合には、固有名詞と直前の固有名詞の全体を、
共通部分の性質情報をもつ固有名詞と判断しく２１４）
　、これらのデータを処理部３Ｂに送る。処理部３６に
送られたデータは、辞書情報保存テーブルＨａに記録さ
れる（２１８）　。When the proper noun itself processing unit 30 determines that the property information of the proper noun itself is not unknown, that is, it is registered in the reference dictionary 20, the proper noun [1 body processing f1!
30 sends these data to the immediately preceding proper noun and proper noun processing section 32 . Immediately preceding proper noun and proper noun processing unit 3
In step 2, it is determined whether or not there is a commonality of property information between the proper noun and the immediately preceding proper noun (212), and if there is a commonality, the entire proper noun and the immediately preceding proper noun are
214)
, these data are sent to the processing section 3B. The data sent to the processing unit 36 is recorded in the dictionary information storage table Ha (218).

固有名詞と直前の固有名詞との間に性質情報の共通性が
ない場合には、固有名詞を直１ｉ１の固有名詞とは別に
、参照辞書２０から検索された性質情報をもった固有名
詞と判断しく２１Ｂ）　、そのデータを処理部３６に送
る。処理部３６に送られたデータは、辞書情報保存テー
ブル３８ａに記録される（２１８．ｌ。If there is no common property information between the proper noun and the immediately preceding proper noun, the proper noun is determined to be a proper noun that has property information retrieved from the reference dictionary 20, in addition to the proper noun directly 1i1. 21B) and sends the data to the processing section 36. The data sent to the processing unit 36 is recorded in the dictionary information storage table 38a (218.l).

第３図に戻って、ステップ１０日において参照辞書２０
のエントリに検索キー文字列がない場合には、検索キー
文字列の１文字目が大文字か否かを判断しく１１６）　
、大文字でない場合には辞書検索部２２はこの検索キー
文字列を未登録語と判断して処理部３８に送り、辞書情
報保存テーブル３８ａに記録する（１１８）　。Returning to FIG. 3, in step 10, the reference dictionary 20
If there is no search key string in the entry, determine whether the first character of the search key string is an uppercase letter (116).
, if it is not a capital letter, the dictionary search unit 22 determines that this search key character string is an unregistered word, sends it to the processing unit 38, and records it in the dictionary information storage table 38a (118).

１文字目が大文字の場合には、辞書検索部２２からこの
検索キー文字列のデータが直前の検索キー文字列のデー
タとともに、固有名詞処理部２４に送られ、辞書未登録
固有名詞の処理が行われる（１２０）　。If the first character is a capital letter, the data of this search key string is sent from the dictionary search section 22 to the proper noun processing section 24 along with the data of the previous search key string, and proper nouns not registered in the dictionary are processed. It is done (120).

第５図により辞書未登録固有名詞の処理について説明す
る。The processing of proper nouns not registered in the dictionary will be explained with reference to FIG.

検索キー文字列のデータは直前の検索キー文字列のデー
タとともに、直前文末処理部２６に送られ、直前文末処
理部２６において直前の検索キー文字列の末尾が文末候
補か否かを判断する（３００）　。The data of the search key string is sent to the immediately preceding sentence end processing section 26 along with the data of the immediately preceding search key string, and the immediately preceding sentence end processing section 26 determines whether the end of the immediately preceding search key string is an end of sentence candidate ( 300).

この文末候補か否かの判断は、直前の検索キー文字夕１
の末尾が屯独のピリオド（、）等の文末候補であるか否
かを判断することにより行う。To determine whether this is a candidate for the end of a sentence, use the previous search key character
This is done by determining whether the end of is a candidate for the end of a sentence, such as a period (,) for tundoku.

直前の検索キー文字列の末Ｊｌｌ＝が文末候補である場
合には、直前文末処理部２Ｂから直前固有名、＋４処理
部２８にデータが送られ、直前固有名詞処理部２８は直
前の検索キー文字列を文末とｒ４断しく３０２）、検索
キー文字列の１文字］１の大文字を小文字に直して辞書
検索部２２に送る。If the last word Jll= of the immediately preceding search key character string is a candidate for the last sentence, the data is sent from the immediately preceding sentence end processing section 2B to the immediately preceding proper name, +4 processing section 28, and the immediately preceding proper noun processing section 28 uses the immediately preceding search key Convert the character string to the end of the sentence and r4 (302), convert the uppercase letters of the search key character string [1] to lowercase letters, and send it to the dictionary search unit 22.

辞書検索部２２は小文字に直された検索キー文字列につ
いて参照辞書２０を検索しく３０４）　、参照辞書２０
にエントリがあるか否かを判断する（３０Ｂ）。エント
リがある場合には、辞書検索部２２は参照辞書２０から
検索したデータを処理部３６に送り、辞書情報保存テー
ブル３Ｅｔａに記録する（３０８）　、エントリがない
場合には辞書検索部２２は、この検索キー文字列の１文
字目を大文字に戻して未登録の固有名詞として処理部３
６に送り、辞書情報保存テーブル３８ａに記録する（３
１０）　。The dictionary search unit 22 searches the reference dictionary 20 for the search key character string converted to lowercase letters (304), the reference dictionary 20
It is determined whether there is an entry in (30B). If there is an entry, the dictionary search unit 22 sends the data searched from the reference dictionary 20 to the processing unit 36 and records it in the dictionary information storage table 3Eta (308); if there is no entry, the dictionary search unit 22 The processing unit 3 returns the first character of this search key string to uppercase and treats it as an unregistered proper noun.
6 and recorded in the dictionary information storage table 38a (3
10).

ステップ３００において、直前文末処理部２６が直前の
検索キー文字列の末尾を文末候補でないと判断した場合
には、直前文末処理部２６から直前固有名詞処理部２８
にそのデータが送られ、直前固有名詞処理部２８は直前
の検索キー文字列を文末でないと判断する（３１２）　
、直前固有名詞処理部２８から固有名詞自身処理部３０
にデータが送られ、固有名詞自身処理部３０は検索キー
文字列を性質情報が未知の固有名詞と判断する（３１４
）　。In step 300, if the immediately preceding sentence end processing unit 26 determines that the end of the immediately preceding search key character string is not a sentence ending candidate, the immediately preceding sentence ending processing unit 26 sends the immediately preceding proper noun processing unit 28
The data is sent to , and the immediately preceding proper noun processing unit 28 determines that the immediately preceding search key character string is not at the end of the sentence (312).
, from the immediately preceding proper noun processing section 28 to the proper noun itself processing section 30
The data is sent to , and the proper noun processing unit 30 determines that the search key character string is a proper noun whose property information is unknown (314
).

固有名詞自身処理部３０はデータを直前固有名詞処理部
２８に戻し、直前固有名詞処理部２８において辞書登録
固有名詞の処理が行われる（３１Ｂ）　、この辞書登録
固有名詞の処理は第４図に示すものと同じである。The proper noun itself processing section 30 returns the data to the immediately preceding proper noun processing section 28, where the proper noun registered in the dictionary is processed (31B). This processing of the proper noun registered in the dictionary is shown in FIG. Same as shown.

第３図に戻って、ステップ１０４において切り出された
辞書引き単位が終りの場合には、辞書検索部２２はデフ
ォルト性質情報材！ｊ部３４にその旨の信号を送り、デ
フォルト性質情報材ｑ一部３４は処理部３６の辞書情報
保存テーブル３６ａに記録された情報を読み出し、固有
名詞へのデフォルト性質情報の付与を行う（１２２）　
。Returning to FIG. 3, if the dictionary lookup unit cut out in step 104 is the end, the dictionary search unit 22 uses the default property information material! A signal to that effect is sent to the j part 34, and the default property information material q part 34 reads out the information recorded in the dictionary information storage table 36a of the processing part 36, and adds default property information to the proper noun (122 )
.

第６図により固有名詞へのデフォルト性質情報の付与の
動作を説明する。The operation of adding default property information to a proper noun will be explained with reference to FIG.

デフォルト性質情報付４部３４において、まず辞書情報
保存テーブル３６ａから読み出されたデータの先頭にポ
インタをセットする（４００）　、すなわち、各検索キ
ー文字列に分割され、それぞれ参照辞書２０の検索によ
り情報が付与された入力文の先頭の検索キー文字列にポ
インタをセットする。ポインタの指示する検索キー文字
列が固有名詞か否かを判断しく４０２）　、固有名詞の
場合にはその固有名詞の性質情報が未知（Ｕｎｋｎｏｗ
ｎ）か否かを判断する（４０４）　、固有名詞でない場
合にはステップ４０８に進み、ポインタを次の検索キー
文字列にすすめる。In the default property information appended section 34, a pointer is first set at the beginning of the data read from the dictionary information storage table 36a (400). Set the pointer to the search key string at the beginning of the input sentence to which information has been added. It is determined whether the search key character string indicated by the pointer is a proper noun (402), and if it is a proper noun, the property information of the proper noun is unknown (402).
n) is determined (404), and if it is not a proper noun, the process proceeds to step 408, and the pointer is advanced to the next search key character string.

ステップ４０４においてその固有名詞の性質情報が未知
である場合には、デフォルト性質情報の付与を行う（４
０Ｂ）　、デフォルト性質情報の付かは、第７図の下部
に示すように、性質情報が未知の固有名詞にデフォルト
の性質情報を付与する。例えば同図数丁部に示すように
、性質情報が未知の固有名詞ｒ　Ｊｏｈｎｓｏｎ　Ｊに
、すべての性質情報、１人、場所、団体、その他」を付
与する。性質情報が未知の固有名詞に対して、このよう
にすべての性質情報を付４ｊ−することにより、後の構
文解析において、その固有名詞を複数の性質のいずれに
も解析できる余地を残しておくことができる。If the property information of the proper noun is unknown in step 404, default property information is added (4
0B) When attaching default property information, default property information is added to a proper noun whose property information is unknown, as shown in the lower part of FIG. For example, as shown in the several pages of the same figure, all property information such as "one person, place, group, etc." is added to the proper noun r Johnson J, whose property information is unknown. By attaching all property information to a proper noun whose property information is unknown in this way, we leave room for the proper noun to be analyzed into any of multiple properties in later parsing. be able to.

ステップ４０４において固有名詞の性質情報が未知でな
い場合には、ステップ４０Ｂに進み、ポインタを次の検
索キー文字列にすすめる。If the property information of the proper noun is not unknown in step 404, the process proceeds to step 40B, where the pointer is advanced to the next search key character string.

ポインタの示す検索キー文字列が終りか否かを判断しく
４０８）　、終りでない場合にはステップ４０２に戻り
、次の検索キー文字列について固有名詞か台かを’Ｉ′
ｌ断する。ポインタの示す検索キー文字列が終りの場合
にはデフォルト情報の付かが終γする。It is determined whether the search key string indicated by the pointer is the end (408), and if it is not the end, the process returns to step 402 and the next search key string is determined as 'I' whether it is a proper noun or a platform.
I cut it off. If the search key character string indicated by the pointer is at the end, the addition of default information ends γ.

固有名詞へのデフォルト性質情報の付与が終ｒした後、
データは辞書情報保存テーブル３６ａに記録される。そ
の後、処理ａｆ！３Ｂから辞書情報保存テーブル３８ａ
に記録されたデータを構文解析部３８に出力しく１２４
）　、本実施例の形態素解析が終Ｙする。After the default property information has been assigned to the proper noun,
The data is recorded in the dictionary information storage table 36a. After that, process af! 3B to dictionary information storage table 38a
124 to output the data recorded in the syntax analysis unit 38.
), the morphological analysis of this example ends.

以上説ＩＪ１シた本装置の動作を入力文の例をあげて説
ＩＪＩする。The operation of this device explained above will be explained using an example of an input sentence.

第７図に示すような入力文ｒ　Ｉｎ　Ｔｏｋｙｏ　Ｓｔ
ａｔｉｏｎＭｒ、　Ｗａｉｔｅｒ　ｍｅｔ　Ｊｏｈｎｇ
ｏｎＪが入力された場合について説明する。まず、この
入力文を人力処理部１４に読み込む人力処理（１００）
を行う０次に辞、町ｉ引き単位の！／Ｊり出しく１０２
）を行い、スペースにより」；記の入力文を各単語に分
割する。まずｒＩｎＪについて参照辞書２０の検索を行
う（１０Ｂ）　、　　ｒ　Ｉｎ」は、参照辞書２０にエ
ントリがあり（１０８）　、固有名詞ではないので（１
１０）　、　参照辞書２０から検索されたデータを辞書
情報保存テーブル３８ａに記録する（１１２）　。Input sentence r In Tokyo St as shown in Figure 7
Mr. ation, Waiter met John
A case where onJ is input will be explained. First, human processing (100) reads this input sentence into the human processing unit 14.
The next step is to do the following, and the town i pull unit! /J Exit 102
) and divides the input sentence ";" into each word by spaces. First, the reference dictionary 20 is searched for rInJ (10B), since there is an entry for "rIn" in the reference dictionary 20 (108) and it is not a proper noun (10B).
10) Record the data retrieved from the reference dictionary 20 in the dictionary information storage table 38a (112).

次にｒ　Ｔｏｋｙｏ　Ｊについて参照辞書２０の検索を
行う（１０８）　、ｒ　Ｔｏｋｙｏ　Ｊは、参照辞書２
０にエントリがなく　（１０Ｂ）　、　１文字目が大文
字なので（１１Ｂ）　。Next, the reference dictionary 20 is searched for r Tokyo J (108), r Tokyo J is searched in the reference dictionary 2
There is no entry in 0 (10B) and the first character is a capital letter (11B).

辞書未登録固有名詞の処理を行う（１２０）　、第５図
に進み、直前はｒ’ＩｎＪであり文末候補がないから−
（３００）、ｒＩｎＪを文末でないと認定しく３１２）
、ｒ　Ｔｏｋｙｏ　４を性質情報未知（Ｕｎｋｎｏｗｎ
）の固有名詞としく３１４）　、辞書登録固有名詞の処
理を行う（３１Ｂ）　。Processing proper nouns not registered in the dictionary (120) Proceeds to Figure 5, since the immediately preceding one is r'InJ and there is no candidate for the end of the sentence.
(300), rInJ should be recognized as not the end of the sentence312)
, r Tokyo 4 with unknown property information (Unknown
) as a proper noun (314), and processes the proper noun registered in the dictionary (31B).

７ＪＳ４図に進み、直前のｒ　ＩｎＪが未登録の固有名
詞ではな（（２００）　、登録固有名詞でもないから（
２０４）　、単独で自分自身の性質情報をもった固有名
詞、すなわち性質情報未知（ｌＪｎｋｎｏｗｎ）の固有
名−一とじとして記録する（２１６）。Proceed to Figure 7JS4, and the immediately preceding r InJ is not an unregistered proper noun ((200), nor is it a registered proper noun (
204), a proper name having its own property information alone, that is, a proper name with unknown property information (lJnknown) is recorded as a single unit (216).

７ｔＳ３図に戻り、次にｒｓｔａｔｉｏｎ　Ｊについて
参照辞書２０の検索を行う（１０Ｂ）　、ｒｓｔａｔｉ
ｏｎ　Ｊは参照辞書２０にエントリがあり（１０８）　
、固有名詞であるから（１１０）　、辞書登録固有名詞
の処理を行う（１１４）　、　Ｅｒ５４図に進み、直前
のｒ　Ｔｏｋ７ａ　Ｊが未登録の固有名詞であるから（
２００）、ｒ　Ｔｏｋｙ。Returning to the 7tS3 diagram, next the reference dictionary 20 is searched for rstation J (10B), rstati
on J has an entry in the reference dictionary 20 (108)
, Because it is a proper noun (110), Process the proper noun registered in the dictionary (114) Proceed to the Er54 diagram, and because the immediately preceding r Tok7a J is an unregistered proper noun (
200), r Tokyo.

５ｔａｔｉｏｎ　Ｊ全体をｒＳｔａｔｉｏｎ　Ｊの性質
情報「場所、団体」を有する固有名詞として記録する（
２０２）。Record the entire 5tation J as a proper noun with the property information "location, organization" of rStation J (
202).

次にｒＭｒ、Ｊについて第３図の参照辞書２０の検索を
行う（１０Ｂ）　、　　ｒＭｒ、　Ｊは、参照辞書２０
にエントリがあり、固有名詞であるから（’　Ｉ　Ｏ）
　、　辞４Ｈ登録固有名詞の処理を行う（ｔｉａ）　、
第４図に進み、直前のｒｓｔａｔｉｏｎ　Ｊは未登録の
固有名詞ではなく（２００）　、登録固有名詞であり（
２０４）　、性質情報「場所・団体」が未知ではない（
２０す、　　ｒＭｒ、　Ｊは、性質情報が「人」であり
、未知ではないから（２０８）　、直前のｒｓｔａｔｉ
ｏｎ　ＪとｒＭｒ、　Ｊとで性質情報の共通部分がある
か否かを′１′１断する（２１２）　。Next, the reference dictionary 20 in FIG. 3 is searched for rMr, J (10B), rMr, J is the reference dictionary 20.
Since there is an entry in and it is a proper noun (' I O)
, Processing 4H registered proper nouns (tia) ,
Proceeding to Figure 4, the immediately preceding rstation J is not an unregistered proper noun (200), but a registered proper noun (
204), the property information “location/group” is not unknown (
20S, rMr, J has the property information “person” and is not unknown (208), so the immediately preceding rstati
It is determined whether or not there is a common portion of property information between on J, rMr, and J (212).

ｒ　５ｔａｔｉｏｎ　Ｊは「場所、団体」、ｒＭｒ、Ｊ
は「人」でこれらには共通部分がないので、ｒＭｒ、　
Ｊは単独で、性質情報「人」の固有名詞として登録する
（２１Ｂ）　。r 5tation J is "place, organization", r Mr, J
is a "person" and there is no common part between them, so rMr,
J is registered alone as a proper noun of the property information "person" (21B).

次に再びｆＩＳ３図に戻り、ｒＷａｌｔｅｒＪについて
参照辞書２０を検索する（１０Ｂ）　、　　ｒＷａｌｔ
ｅｒＪは参照辞書２０にエントリがあり（ｉｏｓ）　、
固有名詞であるから（１１０）　、辞書登録固有名詞の
処理を行う（１１４）　、第４図に進み、直曲ノｒＭｒ
、　」は未登録の固有名詞ではなく　（２００）　、登
録固有名詞であり（２０４）　、性質情報が「人」で未
知ではなく（２０Ｂ）、ｒ　ＷａｌｔｅＢの性質情報「
人、場所、団体」も未知ではないので（２０８）　、性
質情報の共通部分を′Ｉ断する（２１２）　、性質情報
の共通部分子犬」があるので、「にｒ、　Ｗａｌｔｅｒ
Ｊをまとめて性質情報「人」をもつ固有名詞として記録
する（２１４）　。Next, return to the fIS3 diagram again and search the reference dictionary 20 for rWalterJ (10B), rWalt
erJ has an entry in the reference dictionary 20 (ios),
Since it is a proper noun (110), the proper noun registered in the dictionary is processed (114). Proceed to FIG.
, " is not an unregistered proper noun (200), but a registered proper noun (204), and the property information is "person" and is not unknown (20B), and the property information of rWalteB is "
Since "persons, places, and groups" are not unknown (208), there is a "common part of property information" (212), so "Ni r, Walter
J is collectively recorded as a proper noun with the property information "person" (214).

次に「■ｅｔ　Ｊについて参照辞書２０を検索し、エン
トリがあり（１０８）　、固有名詞ではないから（１１
０）　、参照辞書２０から検索されたデータを辞書情報
保存テーブル３８ａに記録する（１１２）　。Next, I searched the reference dictionary 20 for "■et J, and there was an entry (108), and it was not a proper noun (11).
0), the data retrieved from the reference dictionary 20 is recorded in the dictionary information storage table 38a (112).

さらに、ｒＪｏｈｎｓｏｎ　Ｊについて参照辞書２０を
検索する（１０Ｂ）　、　　ｒ　Ｊｏｈｎｓｏｎ　Ｊは
エントリがなく（１０８）　、　　１文字目は大文字で
あるから（ＩＩＥＩ）　、辞書未登録固有名詞の処理を
行う（１２０）　、第５図に進み、直前のｒＢｅｔＪは
文末候補がないから（３００）、ｒｍｅｔＪを文末でな
いと認定しく３１２）、ｒ　Ｊｏｈｎｓｏｎ　Ｊを性質
情報未知（Ｕｎｋｎｏｗｎ）の固有名詞とみなしく３１
４）　、辞書登録固有名詞の処理を行う（３１Ｅｉ）。Furthermore, the reference dictionary 20 is searched for rJohnson J (10B), and since there is no entry for rJohnson J (108), and the first character is a capital letter (IIEI), proper nouns not registered in the dictionary are processed (120). , Proceeding to Figure 5, since there is no candidate for the end of the sentence for the immediately preceding rBetJ (300), rmetJ is recognized as not the end of the sentence (312), and r Johnson J is recognized as a proper noun with unknown property information (31).
4) Processes proper nouns registered in the dictionary (31Ei).

第４図に進み、直前のｒＢｅｔＪは未登録の固有名詞で
はなく　（２００）　、登録固有名詞でもないから（２
０４）、ｒ　ＪａｈｎＳｏｎ　」は弔独で性質情報未知
（Ｕｎｋｎｏｗｎ）の固有名詞として記録される。Proceeding to Figure 4, the previous rBetJ is not an unregistered proper noun (200), nor is it a registered proper noun (200).
04), r JahnSon'' is recorded as a proper noun with unknown property information.

以上の処理の後、第６図の固有名詞へのデフォルト性質
情報の付与が行われる。After the above processing, default property information is added to the proper noun shown in FIG.

ポインタを先頭の辞書引き中位であるｒＩｎＪにセット
する（４００）　、固有名詞ではないから（４０２）　
、ポインタを進め（４０８）　、　　ｒ　Ｔｏｋｙｏ　
Ｊにセットする。　　ｒＴｏｋｙｏ　Ｊは固有名詞であ
り（４０２）　、性質情報は前記の登録固有名ｊ４の処
理においてｒＴａｋｙｏ　５ｔａｔｉｏｎ　Ｊ全体を場
所、団体としており、未知（Ｕｎｋｎｏｗｎ）　テはな
いから（４０４）　、ポインタを進め（４０８）、ｒ　
５ｔａｔｉｏｎ　Ｊにセットする。Set the pointer to rInJ, which is the middle of the dictionary lookup at the beginning (400), because it is not a proper noun (402)
, advance the pointer (408) , r Tokyo
Set to J. rTokyo J is a proper noun (402), and the nature information indicates that the entire rTokyo 5tation J is a place and group in the process of the registered proper name j4, and there is no unknown (404), so move the pointer forward ( 408), r
5tation J.

ｒｓｔａｔｉｏｎ　Ｊ　　ｒＭｒ、　Ｊは同様に固有名
詞であり（４０２）　、性質情報が未知ではないから（
４０４）　、ポインタを進め（４０Ｂ）、［膳ｅｔ　Ｊ
は固有名詞ではないから（４０２）　、ポインタを進め
る（４０８）。rstation J rMr, J is also a proper noun (402), and the property information is not unknown (
404), advance the pointer (40B), [Zen et J
Since is not a proper noun (402), the pointer is advanced (408).

ｒ　Ｊｏｂｎｓｏｎ　Ｊは固有名詞であり（４０２）　
、性質情報が未知であるから（４０４）　、デフォルト
性質情報の付与を行い（４０Ｂ）　、第７図に示すよう
に。r Jobnson J is a proper noun (402)
Since the property information is unknown (404), default property information is added (40B), as shown in FIG.

ｒ　Ｊｏｈｎｓｏｎ　Ｊに性質情報［人、場所、団体、
その他］を付享する。r Johnson J. Characteristic information [person, place, organization,
Others].

以丑のように本実施例によれば、英語の入力文を検索キ
ー文字列に分割して、まず参照辞書２０により検索し、
参照辞書２０に固有名詞としてエントリがあった場合に
登録固有名詞の処理を行う、登録固有名詞の処理は、直
前の検索キー文字夕１を考慮し、直前の検索キー文字列
が固有名詞である場合には直前の検索キー文字列および
対象の固有名詞の性質情報を調べ、いずれかの性質情報
がない場合には他方の性質情報を付与し、いずれも性質
情報がある場合には共通部分をこれらの固有名詞の性質
情報としている。したがって、性質情報のない固有名調
に性質情報を適切に打手するとともに、付かされている
性質情報をより適切な性質情報にしぼることができる。As described above, according to this embodiment, an English input sentence is divided into search key character strings, which are first searched using the reference dictionary 20,
Registered proper noun processing is performed when there is an entry as a proper noun in the reference dictionary 20.The registered proper noun processing takes into consideration the immediately preceding search key character string 1 and determines that the immediately preceding search key string is a proper noun. In this case, check the property information of the previous search key string and target proper noun, and if there is no property information for either, add the property information of the other, and if both have property information, remove the common part. This is used as property information of these proper nouns. Therefore, it is possible to appropriately add property information to a proper name tone without property information, and to narrow down the added property information to more appropriate property information.

これにより後の構文解析において、解析を有効に行うこ
とができ、適切な翻訳を行うことができる。This allows for effective parsing and appropriate translation in subsequent syntactic analysis.

また、参照辞書２０に未登録の文字列について、１文字
目が大文字でかつ直面の文字列が文末と判断された場合
には、大文字を小文字に変換して再度参照辞書２０を検
索しているから、文頭の文字列についても参照辞書２０
による検索を行うことができる。さらに、文頭でない箇
所に大文字で始まる文字列が現れた場合には固有名詞と
判断し、この固有名詞の性質情報はその前後に性質情報
の登録された固有名詞がある場合にはこれによって付与
される。したがって、参照２ｉ１！占２０に未登録の固
有名詞をある程度解析することができる。Further, regarding a character string that is not registered in the reference dictionary 20, if the first character is a capital letter and the facing character string is determined to be the end of a sentence, the uppercase character is converted to a lowercase character and the reference dictionary 20 is searched again. Also refer to dictionary 20 for character strings at the beginning of sentences.
You can search by. Furthermore, if a character string that starts with a capital letter appears in a place that is not the beginning of a sentence, it is determined to be a proper noun, and the property information of this proper noun will be added if there is a proper noun with property information registered before or after it. Ru. Therefore, reference 2i1! It is possible to analyze proper nouns that are not registered in Uruguay 20 to some extent.

さらに、性質情報の付ｊされていない固有名詞には必要
な性質情報をすべて付与し、その語の処理においてこれ
らの性質情報のうち不要のものを除去するようにしてい
るから、あらかじめ性質情報が知られていない固有名詞
または未登録の固有名詞についても解析を行うことがで
きる。Furthermore, all necessary property information is attached to proper nouns that do not have property information attached, and unnecessary pieces of property information are removed during processing of the word, so that property information is not included in advance. It is also possible to analyze unknown or unregistered proper nouns.

また、特定の固有名詞については性質情報を複数個付与
し、前後の固有名詞の性質情報により適切な性質情報を
選択するようにしているから、複数種類の性質情報を有
する固有名詞を解析する場合にその前後関係に応じて適
切な性質情報を選択することができ、入力文の解析を有
効に行うことができる。In addition, multiple pieces of property information are assigned to a specific proper noun, and the appropriate property information is selected based on the property information of the preceding and following proper nouns, so when analyzing a proper noun that has multiple types of property information. Appropriate property information can be selected according to the context, and input sentences can be analyzed effectively.

効　　果本発明によれば、固有名詞の性質情報を辞書に複数保存
するとともに、辞書に性質情報が未登録の固有名詞には
複数の性質情報を付与しているから、固有名詞が使用さ
れる状況に応じて適切な性質情報を手え、入力文の解析
を有効に行うことができる。Effects According to the present invention, multiple pieces of property information of proper nouns are stored in the dictionary, and multiple pieces of property information are assigned to proper nouns for which property information is not registered in the dictionary, so that proper nouns can be used. It is possible to obtain appropriate property information depending on the situation and effectively analyze input sentences.

[Brief explanation of the drawing]

第１図は本発明による言語解析装置の一実施例を示すブ
ロック図、第２図は第１図の参照辞書に記憶されるデータの一例を
示す図、第３図は第１図の装置の全体の動作を示すフローチャー
ト、第４図は第３図に示す動作のうち、辞書登録固有名詞の
処理を示すフローチャート、第５図は第３図に示す動作のうち、辞書未登録固有名詞
の処理を示すフローチャート、第６図は第３図に示す動
作のうち、デフォルト性質情報の封手処理を示すフロー
チャート。第７図は第１図の装置において入力文が処理され、辞書
情報保存テーブルに格納されるデータが変化する例を示
す図である。一要部／）の符号の説す１１４、、、入力処理部１Ｂ、、、単位切出し部１８、、、デリミツタテーブル２０、、、参照辞書２２、、、辞書検索部２４、、、固有名詞処理部２Ｂ、、、直前文末処理部２８、、、直前固有名詞処理部３０、、、固有名詞自身処理部３２、、、直前固有名詞および固有名詞処理部３４、、
、デフォルト性質情報付与部３１３、、、処理部FIG. 1 is a block diagram showing an embodiment of the language analysis device according to the present invention, FIG. 2 is a diagram showing an example of data stored in the reference dictionary of FIG. 1, and FIG. A flowchart showing the overall operation; Figure 4 is a flowchart showing the processing of proper nouns registered in the dictionary among the operations shown in Figure 3; Figure 5 is a flowchart showing the processing of proper nouns not registered in the dictionary among the operations shown in Figure 3. FIG. 6 is a flowchart showing sealing processing of default property information among the operations shown in FIG. 3. FIG. 7 is a diagram showing an example in which an input sentence is processed in the apparatus shown in FIG. 1 and data stored in a dictionary information storage table changes. 1 14, ... Input processing section 1B, ... Unit extraction section 18, ... Delimiter table 20, ... Reference dictionary 22, ... Dictionary search section 24, ... Noun processing unit 2B, Immediately preceding sentence-end processing unit 28, Immediately preceding proper noun processing unit 30, Proper noun itself processing unit 32, Immediately preceding proper noun and proper noun processing unit 34,...
, default property information adding unit 313, , processing unit

Claims

[Claims] 1. An input means for inputting a character string in a predetermined language; a dictionary means used for searching the character string input from the input means; and a dictionary means for searching the input character string. a search means for searching, and as a result of the search by the search means, among the input character strings, character strings that are not recorded in the dictionary means and character strings for which property information is not recorded in the dictionary means;
A language analysis device comprising: property information adding means for adding property information, wherein the property information adding means adds a plurality of property information to the character string without property information. 2. The device according to claim 1, further comprising: unit cutting means for dividing the input character string into units for searching the dictionary means; A language analysis device comprising: analysis means for analyzing the character string together with the immediately preceding character string after the character string is searched by the search means. 3. In the apparatus according to claim 2, when either the character string or the immediately preceding character string has property information and the other character does not have property information, the analysis means determines whether the property information is absent. A language analysis device characterized in that a character string is given property information of another character string. 4. In the device according to claim 2, when the character string and the immediately preceding character string both have property information, the analyzing means may extract common property information from the character string and the immediately preceding character string. A language analysis device characterized in that the property information of 5. The language analysis device according to any one of claims 2 to 4, wherein the character string analyzed by the analysis means and the immediately preceding character string are proper nouns. . 6. In the device according to claim 2, when the character string starts with an uppercase letter and the immediately preceding character string is at the end of a sentence, the analyzing means converts the first uppercase letter of the character string to a lowercase letter. After that, the language analysis device is characterized in that the search means searches the dictionary means, and if the result of the search is that the character string is not registered in the dictionary means, the character string is analyzed as an unregistered proper noun.