JP3358100B2

JP3358100B2 - Japanese question message analysis method and device

Info

Publication number: JP3358100B2
Application number: JP09104195A
Authority: JP
Inventors: 林　　良彦
Original assignee: Nippon Telegraph and Telephone Corp; NTT Inc
Current assignee: Nippon Telegraph and Telephone Corp; NTT Inc
Priority date: 1995-04-17
Filing date: 1995-04-17
Publication date: 2002-12-16
Anticipated expiration: 2017-12-16
Also published as: JPH08287092A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、日本語質問メッセージ
解析方法及び装置に係り、特に、情報検索システムにお
いて用いられるテキスト形式で入力された日本語メッセ
ージを解析し、当該メッセージにおける情報要求を効率
的に構造化する日本語メッセージ構造化装置における日
本語質問メッセージ解析方法及び装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method and apparatus for analyzing a Japanese question message, and more particularly, to analyzing a Japanese message input in a text format used in an information retrieval system, and efficiently reducing an information request in the message. The present invention relates to a method and an apparatus for analyzing a Japanese question message in a Japanese message structuring apparatus that structurally structures.

【０００２】[0002]

【従来の技術】情報検索システムにおいて、利用者が希
望する情報要求をどのような形式で表現するかが大きな
ポイントとなる。一般には、利用者が複数のキーワード
を自ら選択して、選択されたキーワードを論理的に結合
して情報要求を表現する形式がある。例えば、氏名（店
名）が不明である場合に、電話番号を検索する場合に、
探索する電話番号の職業（または職種）、住所を『職業
ＡＮＤ住所』の形態で、“飲食業ＡＮＤ港区”とい
うキーワードを入力する。この場合かなりの数の検索対
象データが出現したために絞り込む場合には、“寿司屋
ＡＮＤ港区麻布”というキーワードを入力する。さらに
検索データを絞り込む場合には、“寿司屋ＡＮＤ港区麻
布３丁目”というキーワードを入力する。この例は、キ
ーワードを論理式（ＡＮＤ）で結合した場合を示した
が、反対に、論理式（ＯＲ）を用いてキーワードを変更
し、検索条件を緩和してデータベースを検索する方法等
がある。2. Description of the Related Art In an information retrieval system, a major point is how to express an information request desired by a user. Generally, there is a format in which a user selects a plurality of keywords by himself and logically combines the selected keywords to express an information request. For example, if you do not know your name (store name) and want to search for a phone number,
The occupation (or occupation type) and the address of the telephone number to be searched are entered in the form of "occupation AND address" and a keyword of "restaurant business AND Minato-ku" is input. In this case, a keyword "sushi restaurant AND Minato-ku Azabu" is input when narrowing down because a considerable number of search target data has appeared. In order to further narrow down the search data, the keyword "sushi restaurant AND 3 Azabu, Minato-ku" is input. This example shows a case where keywords are combined by a logical expression (AND). Conversely, there is a method of changing a keyword using a logical expression (OR) and relaxing a search condition to search a database. .

【０００３】他には、検索対象分野のトピックに基づい
てブラウジングを繰り返すことにより必要な情報に到達
する方法や、情報要求を疑似自然言語文で表現する方法
等がある。また、利用者が記述する質問メッセージを言
語解析し、情報要求を抽出し、構造化する方法もある。[0003] Other methods include a method of arriving at necessary information by repeating browsing based on a topic in a search target field, and a method of expressing an information request by a pseudo natural language sentence. There is also a method in which a question message described by a user is subjected to language analysis, and information requests are extracted and structured.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、上記従
来の選択されたキーワードを論理的に結合する方法は、
利用者の多様化かつ複雑な情報要求をキーワードの論理
結合形式で表現することは困難である。However, the above-mentioned conventional method of logically combining selected keywords is as follows.
It is difficult to express diversified and complicated information requirements of users in the form of logical combination of keywords.

【０００５】また、対象分野のトピックに基づいたブラ
ウジングを繰り返す方法や疑似自然言語文で表現する方
法は、いずれも利用者が多様化した場合、かつ複雑な要
求には対応できないという問題がある。また、利用者が
記述する質問メッセージを言語解析して、情報要求を導
出する方法は、質問メッセージには通常、情報要求を表
現する文以外の文も多く含まれるため、それらの文のう
ち、情報要求を表現するためには、どの文を重点的に解
析すべきであるかを規定する方略を明らかにするのは困
難であるため、質問メッセージ全体に渡って一様な言語
解析処理を行う必要がある。[0005] In addition, the method of repeating browsing based on the topic of the target field and the method of expressing by a pseudo-natural language sentence have a problem that when users are diversified and complicated requests cannot be coped with. In addition, a method of linguistically analyzing a question message described by a user to derive an information request is based on the fact that a question message usually includes many sentences other than a sentence expressing an information request. In order to express the information request, it is difficult to clarify the strategy that defines which sentence should be focused on. Therefore, perform a uniform linguistic analysis process over the entire question message. There is a need.

【０００６】本発明は、上記の点に鑑みなされたもの
で、利用者が情報検索システムを利用するにあたって、
キーワードの論理結合形式によって情報要求を表現しな
ければならない困難さに対して、利用者が自然な質問メ
ッセージによって情報検索システムに対して情報要求を
与えることが可能な日本語質問メッセージ解析方法及び
装置を提供することを目的とする。[0006] The present invention has been made in view of the above points, and when a user uses an information retrieval system,
Japanese Query Message Analysis Method and Apparatus that Allows Users to Give Information Requests to Information Retrieval Systems Using Natural Question Messages Against Difficulty of Expressing Information Requests by Logical Combination of Keywords The purpose is to provide.

【０００７】[0007]

【課題を解決するための手段】図１は、本発明の原理を
説明するための図である。本発明は、テキスト形式で入
力された日本語質問メッセージを解析する日本語質問メ
ッセージ解析方法であって、日本語質問メッセージが入
力され（ステップ１）、入力された処理対象の日本語質
問メッセージについて、特定の表現文字列に注目して、
情報要求の核となる部分を述べている情報要求文を抽出
し（ステップ２）、抽出された情報要求文に含まれる参
照表現を情報要求文の特性に基づいて解消し（ステップ
３）、抽出された複数の情報要求文が同一の対象物に対
する情報要求を述べている場合に、該複数の情報要求文
の構造を統合して、日本語質問メッセージにおける主た
る情報要求を構造化し（ステップ４）、構造化された情
報要求文を出力する（ステップ５）。FIG. 1 is a diagram for explaining the principle of the present invention. The present invention relates to a method for analyzing a Japanese question message which analyzes a Japanese question message input in a text format, wherein a Japanese question message is input (step 1) and the input processing target Japanese question message is analyzed. , Paying attention to a specific expression string,
An information request sentence describing the core part of the information request is extracted (step 2), and the reference expression included in the extracted information request sentence is resolved based on the characteristics of the information request sentence (step 3). If the plurality of information request statements described above state information requests for the same object, the structures of the plurality of information request statements are integrated to structure the main information request in the Japanese question message (step 4). Then, a structured information request statement is output (step 5).

【０００８】また、本発明は、情報要求文を抽出する際
に（ステップ２）、日本語質問メッセージを構成する日
本語文が果たす機能的役割である情報要求やユーザの目
的であるゴール言明の文タイプと、該情報要求及び該ゴ
ール言明を判定するための表現文字列パターンの組合せ
が予め格納されている文タイプ判定辞書を参照し、日本
語質問メッセージが指定された文タイプに対応するかを
文字列の照合により判定し、日本語質問メッセージから
情報要求の機能を有する文を抽出する。Further, according to the present invention, when extracting the information request sentence (step 2), the information request which is a functional role played by the Japanese sentence constituting the Japanese question message and the sentence of the goal statement which is the purpose of the user are provided. Referring to a sentence type determination dictionary in which a combination of the type and the information request and the expression character string pattern for determining the goal statement is stored in advance, it is determined whether the Japanese question message corresponds to the specified sentence type. Judgment is made by comparing character strings, and a sentence having an information request function is extracted from the Japanese question message.

【０００９】また、本発明は、情報要求文が抽出される
と、抽出された情報要求文に対して形態素・構文解析を
行い、日本語質問メッセージについて、固有な意味構造
パターンをエントリ毎に予め格納した意味構造パターン
辞書を参照して、エントリと形態素・構文解析された情
報要求文の解析結果とを照合し、情報要求文の意味構造
を求める。Further, according to the present invention, when an information request sentence is extracted, a morpheme / syntax analysis is performed on the extracted information request sentence, and a unique semantic structure pattern is previously specified for each entry in the Japanese question message. By referring to the stored semantic structure pattern dictionary, the entry is compared with the analysis result of the information request sentence that has been subjected to morpheme / syntax analysis, and the semantic structure of the information request sentence is obtained.

【００１０】また、本発明は、参照表現を解消する際に
（ステップ３）、日本語質問メッセージにおける情報要
求文に含まれ得る参照表現のパターンと、参照表現の参
照先を探索する探索方向と、探索対象となる日本語文の
文タイプ、参照表現の言語表現単位を予め格納した参照
表現解消制御テーブルを参照して、探索対象となる日本
語文の文タイプが指定された文タイプに対応するかを判
定し、対応する場合に、参照表現の解消処理を制御す
る。Further, according to the present invention, when the reference expression is eliminated (step 3), the pattern of the reference expression that can be included in the information request sentence in the Japanese question message, the search direction for searching for the reference destination of the reference expression, and Referencing the sentence type of the Japanese sentence to be searched and the linguistic expression unit of the reference expression in advance by referring to the reference expression resolution control table, and checking whether the sentence type of the searched Japanese sentence corresponds to the specified sentence type Is determined, and if it corresponds, the reference expression resolution process is controlled.

【００１１】図２は、本発明の原理構成図である。本発
明は、テキスト形式で入力された日本語質問メッセージ
を解析する日本語質問メッセージ解析装置であって、日
本語質問メッセージを入力する入力手段１と、入力され
た処理対象の日本語質問メッセージについて、特定の表
現文字列に注目して、情報要求の核となる部分を述べて
いる情報要求文を抽出する情報要求文抽出手段４と、情
報要求文抽出手段４により抽出された情報要求文に参照
表現が含まれる場合に、参照表現を情報要求文の特性に
基づいて解消する参照表現解消手段１０と、抽出された
複数の情報要求文が同一の対象物に対する情報要求を述
べている場合に、複数の情報要求文の構造を統合して、
日本語質問メッセージにおける主たる情報要求を構造化
する情報要求構造化手段１１と、情報要求構造化手段１
１により統合された情報要求文を出力する出力手段１２
とを有する。FIG. 2 is a diagram showing the principle of the present invention. The present invention relates to a Japanese-language question message analyzer for analyzing a Japanese-language question message input in a text format, comprising: an input unit 1 for inputting a Japanese-language question message; An information request sentence extracting unit 4 that extracts an information request sentence that describes a core part of an information request by focusing on a specific expression character string, and an information request sentence extracted by the information request sentence extracting unit 4. When a reference expression is included, a reference expression canceling unit 10 that resolves the reference expression based on the characteristics of the information request sentence, and a case where the plurality of extracted information request sentences state an information request for the same object. , Unifying the structure of multiple information request statements,
Information request structuring means 11 for structuring a main information request in a Japanese question message, and information request structuring means 1
Output means 12 for outputting the information request sentence integrated by 1
And

【００１２】また、上記の情報要求文抽出手段４は、日
本語質問メッセージを構成する日本語文が果たす機能的
役割である情報要求やユーザの目的であるゴール言明の
文タイプと、該情報要求及び該ゴール言明を判定するた
めの表現文字列パターンの組合せが予め格納されている
文タイプ判定辞書２と、文タイプ判定辞書２を参照し
て、日本語質問メッセージが指定された文タイプに対応
するかを文字列の照合により判定する文タイプ判定手段
３と、文タイプ判定手段３を起動させ、日本語質問メッ
セージから情報要求の機能を有する文を抽出する文タイ
プ判定起動手段４１とを含む。Further, the information request sentence extracting means 4 includes a function for requesting information, which is a functional role played by a Japanese sentence constituting a Japanese question message, and a sentence type for a goal statement, which is a purpose of a user. The sentence type determination dictionary 2 in which combinations of expression character string patterns for determining the goal statement are stored in advance, and the Japanese question message corresponds to the specified sentence type with reference to the sentence type determination dictionary 2. A sentence type determination unit 3 that determines whether a sentence has a character string, and a sentence type determination activation unit 41 that activates the sentence type determination unit 3 and extracts a sentence having an information request function from a Japanese question message.

【００１３】また、本発明の日本語質問メッセージ解析
装置は、情報要求文抽出手段４により抽出された情報要
求文に対して形態素・構文解析を行う日本語基本解析手
段５と、日本語質問メッセージについて、固有な意味構
造パターンをエントリ毎に予め格納した意味構造パター
ン辞書６と、意味構造パターン辞書６を参照して、エン
トリと日本語基本解析手段５により解析された情報要求
文の解析結果とを照合して、情報要求文の意味構造を求
める意味構造解析手段７を更に有する。The Japanese-language question message analyzing apparatus according to the present invention comprises: a basic Japanese-language analysis unit 5 for performing morpheme and syntax analysis on the information request sentence extracted by the information request sentence extraction unit 4; , A semantic structure pattern dictionary 6 in which a unique semantic structure pattern is stored in advance for each entry, and an analysis result of the entry and the information request sentence analyzed by the Japanese basic analysis means 5 with reference to the semantic structure pattern dictionary 6. And a semantic structure analyzing means 7 for determining the semantic structure of the information request sentence by comparing

【００１４】また、上記の参照表現解消手段１０は、日
本語質問メッセージにおける情報要求文に含まれ得る参
照表現のパターンと、参照表現の参照先を探索する探索
方向と、探索対象となる日本語文の文タイプ、該参照表
現の言語表現単位を予め格納した参照表現解消制御テー
ブル８と、参照表現解消制御テーブル８と、文タイプ判
定手段３を用いて、参照表現の解消処理を制御する参照
表現解消制御手段９とを含む。[0014] The above-mentioned reference expression eliminating means 10 includes a reference expression pattern that can be included in an information request sentence in a Japanese question message, a search direction for searching for a reference destination of the reference expression, and a Japanese sentence to be searched. Using the reference expression elimination control table 8, the reference expression elimination control table 8, and the sentence type determination unit 3 in which the language type of the reference expression and the linguistic expression unit of the reference expression are stored in advance. And a cancellation control means 9.

【００１５】[0015]

【作用】本発明は、入力された日本語質問メッセージか
ら特定の表現文字列に基づいて情報要求の核となる部分
を述べている文を抽出する。このとき、抽出された文が
所定の文タイプに合致していない場合には、情報要求文
とはしないように制御することにより、情報要求とは直
接関係がない文を抽出することがないため、後段の処理
において、無駄な処理が回避される。According to the present invention, a sentence describing a core part of an information request is extracted from an input Japanese question message based on a specific expression character string. At this time, if the extracted sentence does not match the predetermined sentence type, by controlling not to be an information request sentence, a sentence that is not directly related to the information request is not extracted. In the subsequent processing, useless processing is avoided.

【００１６】また、本発明は、情報要求文としての特性
を用いて解析及び参照情報の解消を行う。これにより、
抽出された情報要求文中に主体となるものを指し示すた
めの参照要件が含まれている場合には、参照表現の参照
先を求め、参照して、同定し、参照表現の解消を行う。In the present invention, analysis and cancellation of reference information are performed using characteristics as information request sentences. This allows
If the extracted information request sentence includes a reference requirement for indicating the main subject, a reference destination of the reference expression is obtained, referenced, identified, and the reference expression is resolved.

【００１７】また、本発明は、複数の情報要求の対象物
の同一性を判定し、同一性が確認できる場合には、複数
の情報要求文の意味構造を統合して単一の意味構造とし
て、処理対象の日本語メッセージにおける情報要求を構
造化して、出力することが可能である。Further, according to the present invention, the identity of a plurality of information request objects is determined, and when the identity can be confirmed, the semantic structures of a plurality of information request sentences are integrated into a single semantic structure. It is possible to structure and output an information request in a Japanese message to be processed.

【００１８】これにより、情報検索システムの利用者
は、データベースを検索する際に、複雑かつ多様な情報
要求の指示を自然な日本語質問メッセージを入力するこ
とにより行うことができる。Thus, the user of the information retrieval system can perform complicated and diverse information request instructions by searching the database by inputting a natural Japanese question message.

【００１９】[0019]

【実施例】以下、図面と共に本発明の一実施例を説明す
る。図３は、本発明の一実施例の日本語質問メッセージ
解析装置の構成を示す。同図に示す日本語質問メッセー
ジ解析装置は、入力部１、文タイプ判定辞書２、文タイ
プ判定部３、情報要求文抽出部４、日本語基本解析部
５、意味構造パターン辞書６、意味構造解析部７、参照
表現解消制御テーブル８、参照表現解消制御部９、参照
表現解消部１０、情報要求構造化部１１、出力部１２及
び処理テーブル１３より構成される。An embodiment of the present invention will be described below with reference to the drawings. FIG. 3 shows the configuration of the Japanese-language question message analysis device according to one embodiment of the present invention. The Japanese question message analyzer shown in FIG. 1 includes an input unit 1, a sentence type determination dictionary 2, a sentence type determination unit 3, an information request sentence extraction unit 4, a Japanese basic analysis unit 5, a semantic structure pattern dictionary 6, and a semantic structure. It comprises an analysis unit 7, a reference expression elimination control table 8, a reference expression elimination control unit 9, a reference expression elimination unit 10, an information request structuring unit 11, an output unit 12, and a processing table 13.

【００２０】入力部１は、テキスト形式の日本語質問メ
ッセージが入力されると、当該メッセージを文毎に分
け、図６に後述する処理テーブル１３に格納しておく。
なお、入力部１は、特定のコンピュータシステムのファ
イルシステムやターミナルに接続されてもよいし、近年
飛躍的な発展・普及をとげているコンピュータネットワ
ークへ接続されてもよい。When a text-based Japanese question message is input, the input unit 1 divides the message into sentences and stores the message in a processing table 13 described later with reference to FIG.
Note that the input unit 1 may be connected to a file system or a terminal of a specific computer system, or may be connected to a computer network that has been rapidly developing and spreading in recent years.

【００２１】文タイプ判定辞書２は、予め文タイプ（情
報要求タイプ・ゴール言明タイプ）とそれを判定するた
めの表現文字列パターンの組合せが格納され、文タイプ
判定部３により参照される辞書である。文タイプ判定部
３は、文タイプ判定辞書２を参照して、処理対象の入力
された日本語質問メッセージより抽出された文が指定さ
れた文タイプに対応するかを文字列の照合により判定す
る。The sentence type determination dictionary 2 stores a combination of a sentence type (information request type / goal assertion type) and an expression character string pattern for determining the sentence type in advance. is there. The sentence type determination unit 3 refers to the sentence type determination dictionary 2 to determine whether a sentence extracted from the input Japanese question message to be processed corresponds to the specified sentence type by comparing character strings. .

【００２２】情報要求文抽出部４は、処理テーブル１３
内の日本語質問メッセージを読出し、文タイプ判定部３
を起動して日本語質問メッセージから情報要求の機能を
有する文を抽出する。詳しくは、情報要求文抽出部４
は、抽出すべき文タイプを“情報要求”と指定して文タ
イプ判定部３を起動し、入力された日本語質問メッセー
ジ中に含まれる情報要求を直接記述した文を抽出する。
このとき、文タイプ判定部３は、文タイプ判定辞書２を
参照して、エントリと、処理対象である日本語文との文
字列照合を行う。The information request sentence extracting unit 4 includes a processing table 13
Reads the Japanese question message in the sentence type determination unit 3
To extract a sentence having an information request function from the Japanese question message. For details, the information request sentence extracting unit 4
Specifies the sentence type to be extracted as "information request", starts the sentence type determination unit 3, and extracts a sentence directly describing the information request included in the input Japanese question message.
At this time, the sentence type determination unit 3 refers to the sentence type determination dictionary 2 and performs character string collation between the entry and the Japanese sentence to be processed.

【００２３】日本語基本解析部５は、日本語文に対し
て、形態素・構文解析を行う。解析方法は、既存のシス
テムを用いるものとし、本発明ではその詳細を規定しな
い。意味構造パターン辞書６は、予め、日本語質問メッ
セージにおいて固有な意味構造パターンが格納されてお
り、意味構造解析部７により参照される。The Japanese basic analysis unit 5 performs a morpheme / syntax analysis on a Japanese sentence. The analysis method uses an existing system, and the details are not specified in the present invention. The semantic structure pattern dictionary 6 stores a semantic structure pattern unique to the Japanese question message in advance, and is referred to by the semantic structure analysis unit 7.

【００２４】意味構造解析部７は、意味構造パターン辞
書６の内容と、エントリと情報要求文抽出部４により抽
出された情報要求文に対する日本語基本解析部５の解析
結果（形態素、構文解析結果）とを照合して、情報要求
文の意味構造を求める。参照表現解消制御テーブル８
は、日本語質問メッセージにおける情報要求文に含まれ
得る参照表現と意味構造における意味属性をキーとし
て、予め参照表現の参照先を探索する探索方向、探索対
象となる日本語文の文タイプ、参照先の表現の言語表現
単位が格納されたテーブルであり、参照表現解消制御部
９により参照される。The semantic structure analysis unit 7 analyzes the contents of the semantic structure pattern dictionary 6 and the results of analysis (morpheme, syntax analysis result) of the entry and the information request sentence extracted by the information request sentence extraction unit 4. ) To determine the semantic structure of the information request sentence. Reference expression resolution control table 8
The search direction to search the reference destination of the reference expression in advance, the sentence type of the Japanese sentence to be searched, the reference destination, using the reference expression that can be included in the information request sentence in the Japanese question message and the semantic attribute in the semantic structure as keys Is a table in which the linguistic expression unit of the expression is stored, and is referred to by the reference expression resolution control unit 9.

【００２５】参照表現解消制御部９は、参照表現解消制
御テーブル８と文タイプ判定部３を用いて、参照表現の
解消処理を制御する。詳しくは、意味構造解析部７より
入力された情報要求文の意味構造に基づいて、参照表現
解消制御テーブル８から得た探索方向に関する情報に基
づいて参照先を含む可能性がある候補文を選択する。こ
のとき、参照表現解消制御部９は、参照表現解消制御テ
ーブル８から得た文タイプの情報を用いて、文タイプ判
定部３を起動して、候補文の適格性を判定する。The reference expression elimination control unit 9 controls the reference expression elimination process using the reference expression elimination control table 8 and the sentence type determination unit 3. More specifically, based on the semantic structure of the information request sentence input from the semantic structure analysis unit 7, a candidate sentence that may include a reference destination is selected based on the information on the search direction obtained from the reference expression resolution control table 8. I do. At this time, the reference expression elimination control unit 9 activates the sentence type determination unit 3 using the sentence type information obtained from the reference expression elimination control table 8, and determines the eligibility of the candidate sentence.

【００２６】参照表現解消部１０は、参照表現の参照先
を求める。詳しくは、参照表現解消制御部９により候補
文が得られれば、実際に参照表現の参照先を求める参照
表現解消部１０を起動して、入力された情報要求文に含
まれる必要な参照表現の解消を行う。ここで、参照表現
の解消とは、例えば、情報要求の対象物が「そのために
必要なケーブル」という記述により、指し示される場合
「その」が何を指しているかを判定することができなけ
れば、「ケーブル」の特性を規定することができないた
め、参照表現「その」について探索して、指し示されて
いる内容を取得し、当該参照表現を解消するものであ
る。The reference expression elimination unit 10 obtains a reference destination of the reference expression. More specifically, when the candidate expression is obtained by the reference expression elimination control unit 9, the reference expression elimination unit 10 that actually obtains the reference destination of the reference expression is activated, and the necessary reference expression included in the input information request sentence is started. Perform the cancellation. Here, the resolution of the reference expression means, for example, if the object of the information request is indicated by the description of "the cable necessary for that", if it is not possible to determine what "the" indicates , "Cable" cannot be defined, so that the reference expression "that" is searched, the pointed content is acquired, and the reference expression is eliminated.

【００２７】情報要求構造化部１１は、入力された日本
語メッセージから複数の情報要求文が抽出された場合、
情報要求の対象物の同一性を判定することにより、処理
対象の日本語メッセージにおける情報要求を構造化す
る。詳しくは、上記の各処理により、情報要求の対象物
の同一性を判定し、同一性が確認できる場合には、複数
の情報要求文の意味構造を統合して、単一の意味構造と
して処理対象の日本語メッセージにおける情報要求を構
造化する。When a plurality of information request sentences are extracted from the input Japanese message, the information request structuring unit 11
By determining the identity of the object of the information request, the information request in the Japanese message to be processed is structured. In detail, by the above processes, the identity of the object of the information request is determined, and if the identity can be confirmed, the semantic structures of the plurality of information request sentences are integrated and processed as a single semantic structure. Structure the information request in the target Japanese message.

【００２８】出力部１２は、対象の日本語質問メッセー
ジから抽出し、構造化した情報要求を後段に接続される
装置へ出力する。上記の日本語質問メッセージ解析装置
を実現するハードウェアとしては、例えば、ＣＰＵとメ
モリを含む装置であればよく、コンピュータシステム等
の装置によって実現することが可能である。The output unit 12 extracts a target Japanese question message and outputs a structured information request to a device connected at a later stage. The hardware for realizing the Japanese-language question message analysis device may be any device including a CPU and a memory, for example, and can be realized by a device such as a computer system.

【００２９】図４は、本発明の一実施例の日本語質問メ
ッセージ解析装置の動作の概要を示すフローチャートで
ある。ステップ１０１）入力部１により利用者が情報要求を
テキスト形式で記載した図５に示すような日本語質問メ
ッセージが入力されると、図６に示すように、当該メッ
セージは処理テーブル１３に転送される。FIG. 4 is a flowchart showing the outline of the operation of the Japanese-language question message analyzing apparatus according to one embodiment of the present invention. Step 101) When the user inputs a Japanese question message in which the information request is described in text format by the input unit 1 as shown in FIG. 5, the message is transferred to the processing table 13 as shown in FIG. You.

【００３０】ステップ１０２）情報要求抽出部４は、
処理テーブル１３から取得した日本語質問メッセージに
ついて、抽出すべき文タイプを“情報要求”として文タ
イプ判定部３を起動する。ステップ１０３）文タイプ判定部３は、文タイプとそ
れを判定するための表現文字列パターンの組合せを格納
した文タイプ判定辞書２のエントリと、処理対象である
日本語文との文字照合を行い、日本語質問メッセージ中
に含まれる情報要求を直接記述した文（情報要求文）を
抽出し、情報要求文抽出部４に転送する。例えば、図５
の例では、第３文の『そのために必要な接続用のケーブ
ルを探していますが、』及び第４文の『どこのお店に売
っているかご存じの方はいらっしゃいませんか。』が該
当する。Step 102) The information request extraction unit 4
For the Japanese question message acquired from the processing table 13, the sentence type to be extracted is set to "information request" and the sentence type determination unit 3 is started. Step 103) The sentence type determination unit 3 performs character matching between an entry in the sentence type determination dictionary 2 storing a combination of the sentence type and an expression character string pattern for determining the sentence type, and a Japanese sentence to be processed. A sentence (information request sentence) directly describing the information request included in the Japanese question message is extracted and transferred to the information request sentence extracting unit 4. For example, FIG.
In the example above, the third sentence "I'm looking for the necessary connection cable," and the fourth sentence "Do you know where you are selling? ] Is applicable.

【００３１】ステップ１０４）情報要求文抽出部４
は、情報要求文を文タイプ判定部３より取得して、日本
語基本解析部５に転送する。なお、情報要求文抽出部４
の動作は図７を用いて詳述する。ステップ１０５）日本語基本解析部５は、情報要求文
抽出部４より転送された情報要求文の形態素・構文解析
を行い、その解析結果を意味構造解析部７に転送する。Step 104) Information request sentence extraction unit 4
Acquires the information request sentence from the sentence type determination unit 3 and transfers it to the Japanese basic analysis unit 5. The information request sentence extracting unit 4
Will be described in detail with reference to FIG. Step 105) The Japanese basic analysis unit 5 analyzes the morpheme / syntax of the information request sentence transferred from the information request sentence extraction unit 4, and transfers the analysis result to the semantic structure analysis unit 7.

【００３２】ステップ１０６）意味構造解析部７は、
日本語質問メッセージにおいて固有な意味構造パターン
を格納した意味構造パターン辞書６におけるエントリと
日本語基本解析部５の解析結果とを照合することにより
情報要求文の意味構造を求める。求められた意味構造
は、参照表現解消制御部９に転送される。Step 106) The semantic structure analysis unit 7
The semantic structure of the information request sentence is obtained by comparing the entry in the semantic structure pattern dictionary 6 storing the unique semantic structure pattern in the Japanese question message with the analysis result of the Japanese basic analysis unit 5. The obtained meaning structure is transferred to the reference expression elimination control unit 9.

【００３３】ステップ１０７）参照表現解消制御部９
は、日本語質問メッセージにおける情報要求文に含まれ
得る参照表現のパターンと、参照表現の参照先を探索す
る探索方向と探索対象となる日本語文の文タイプ、参照
表現の言語表現単位が格納されている参照表現解消制御
テーブル８を、入力された情報要求文の意味構造に基づ
いて検索し、参照先を含む可能性のある候補文を選択す
る。Step 107) Reference expression cancellation control section 9
Stores the reference expression pattern that can be included in the information request sentence in the Japanese question message, the search direction for searching for the reference destination of the reference expression, the sentence type of the Japanese sentence to be searched, and the linguistic expression unit of the reference expression. The reference expression elimination control table 8 is searched based on the semantic structure of the input information request sentence, and a candidate sentence that may include a reference destination is selected.

【００３４】ステップ１０８）参照表現解消制御部９
は、文タイプ判定部３を起動して、選択した候補文の適
格性を判定する。ステップ１０９）参照表現解消制御部９により候補文
が得られると、実際に参照表現の参照先を求める参照表
現解消部１０を起動して、入力された情報要求文に含ま
れる必要な参照表現の解消を行う。Step 108) Reference expression cancellation control section 9
Activates the sentence type determination unit 3 to determine the eligibility of the selected candidate sentence. Step 109) When the candidate expression is obtained by the reference expression elimination control unit 9, the reference expression elimination unit 10 for actually obtaining the reference destination of the reference expression is activated, and the necessary reference expression included in the input information request sentence is started. Perform the cancellation.

【００３５】ステップ１１０）情報要求構造化部１１
は、上記の処理によって入力された日本語メッセージか
ら複数の情報要求文が抽出された場合には、情報要求の
対象文の同一性を判定する。ステップ１１１）情報要求構造化部１１は、同一性が
確認できる場合は、複数の情報要求部の意味構造を統合
して単一の意味構造として処理対象の日本語メッセージ
における情報要求を構造化する。Step 110) Information request structuring unit 11
If a plurality of information request sentences are extracted from the Japanese message input by the above process, the same is determined for the target sentence of the information request. Step 111) If the identity can be confirmed, the information request structuring unit 11 integrates the semantic structures of the plurality of information request units and structures the information request in the Japanese message to be processed as a single semantic structure. .

【００３６】ステップ１１２）出力部１２は、上記の
手順により、対象の日本語質問メッセージから抽出し、
構造化した情報要求を後段に接続される装置に出力す
る。次に、図５に示す日本語質問メッセージを例として
日本語質問メッセージ解析装置を利用した日本語質問メ
ッセージ解析システムを詳細に説明する。Step 112) The output unit 12 extracts from the target Japanese question message by the above procedure,
The structured information request is output to a device connected at a later stage. Next, a Japanese question message analysis system using a Japanese question message analysis device will be described in detail using the Japanese question message shown in FIG. 5 as an example.

【００３７】i) 入力処理まず、図５に示す日本語質問メッセージが入力部１に入
力される。入力部１は、入力された日本語質問メッセー
ジを文の集合として図６に示すようにテーブル構造に分
解して格納する。図６に示す処理テーブル１３のエント
リは、５つのフィールドからなり、第１のフィールドに
は、文番号、第２のフィールドには、文の文字列、第３
のフィールドには文タイプ、第４のフィールドには、文
サブタイプ、第５のフィールドには、文の構造に関する
情報を保持するメモリ領域へのポインタが格納される。
この処理テーブル１３における各エントリの第３から第
５のフィールドは、初期値として情報が存在しないこと
を示す記号である“ＮＩＬ”が格納される。入力部１
は、図６に示される処理テーブル１３の内容を情報要求
文抽出部４へ転送する。I) Input Processing First, a Japanese question message shown in FIG. The input unit 1 decomposes and stores the input Japanese question message as a set of sentences into a table structure as shown in FIG. The entry of the processing table 13 shown in FIG. 6 includes five fields. The first field has a sentence number, the second field has a sentence character string, and the third field.
The fourth field stores a sentence type, the fourth field stores a sentence subtype, and the fifth field stores a pointer to a memory area that holds information on the structure of a sentence.
In the third to fifth fields of each entry in the processing table 13, “NIL”, which is a symbol indicating that no information exists, is stored as an initial value. Input unit 1
Transfers the contents of the processing table 13 shown in FIG.

【００３８】ii) 情報要求文抽出処理次に、情報要求文抽出部４の動作論理の概要を説明す
る。図７は、本発明の一実施例の情報要求文抽出部の詳
細な動作を示すフローチャートである。ステップ２０１）まず、初期値として、入力されるメ
ッセージの文のカウントｉを１とし、入力メッセージの
文の総数をＮとする。Ii) Information Request Sentence Extraction Processing Next, the outline of the operation logic of the information request sentence extraction unit 4 will be described. FIG. 7 is a flowchart showing a detailed operation of the information request sentence extracting unit according to one embodiment of the present invention. Step 201) First, as an initial value, the count i of the sentence of the input message is set to 1, and the total number of the sentences of the input message is set to N.

【００３９】ステップ２０２）入力部１より、図６に
示す形態の処理テーブル１３に日本語質問メッセージが
入力される。ステップ２０３）情報要求文抽出部４は、抽出すべき
文タイプを“情報要求”と指定して文タイプ判定部３を
起動する。Step 202) From the input unit 1, a Japanese question message is input to the processing table 13 in the form shown in FIG. Step 203) The information request sentence extracting unit 4 activates the sentence type determining unit 3 by designating the sentence type to be extracted as "information request".

【００４０】ステップ２０４）情報要求文抽出部４
は、入力された日本語質問メッセージ中に含まれる情報
要求を直接記述した文タイプを取得できた場合には、ス
テップ２０５に移行し、取得できない場合には、ステッ
プ２０６に移行する。ステップ２０５）情報要求の文タイプを取得できた場
合には、抽出された文タイプ（即ち、情報要求）と副作
用として得られる文サブタイプを該当する処理テーブル
１３のエントリに記入する。ここで、文サブタイプと
は、文タイプ「情報要求」に対して取得される当該文の
要旨を示し、図６の第３文『そのために必要な接続用の
ケーブルを探していますが、』の文サブタイプは“物品
の入手”となる。情報要求は処理テーブル１３上（図
６）のエントリの第３フィールドに、文サブタイプは第
４フィールドにそれぞれ記入する。Step 204) Information request sentence extraction unit 4
Moves to step 205 if the sentence type directly describing the information request included in the input Japanese question message can be obtained, and moves to step 206 otherwise. Step 205) If the sentence type of the information request can be obtained, the extracted sentence type (that is, the information request) and the sentence subtype obtained as a side effect are entered in the corresponding entry of the processing table 13. Here, the sentence subtype indicates the gist of the sentence acquired for the sentence type "information request", and the third sentence in FIG. 6 "I am looking for a necessary connection cable," Is sentence subtype "get goods". The information request is entered in the third field of the entry on the processing table 13 (FIG. 6), and the sentence subtype is entered in the fourth field.

【００４１】ステップ２０６）メッセージのカウント
ｉをインクリメントする。ステップ２０７）メッセージのカウントｉがＮより大
きい場合には処理を終了し、まだ、処理すべきメッセー
ジがある場合には、ステップ２０２に移行する。Step 206) The count i of the message is incremented. Step 207) If the message count i is larger than N, the process is terminated. If there is a message to be processed, the process proceeds to Step 202.

【００４２】次に、文タイプ判定辞書２について説明す
る。図８は、本発明の一実施例の文タイプ判定辞書の例
を示す。同図に示すように、文タイプ判定辞書２は、同
図（Ａ）に示すように表現文字列テーブルと、同図
（Ｂ）に示すように文タイプ判定テーブルの２つの部分
から構成される。Next, the sentence type determination dictionary 2 will be described. FIG. 8 shows an example of a sentence type determination dictionary according to an embodiment of the present invention. As shown in the figure, the sentence type determination dictionary 2 is composed of two parts: an expression character string table as shown in the figure (A), and a sentence type determination table as shown in the figure (B). .

【００４３】同図（Ａ）の表現文字列テーブルは、各エ
ントリが３つのフィールドを有するテーブル構造であ
り、第１のフィールドにはエントリＩＤ、第２のフィー
ルドには当該表現文字列のもつ機能を表現する機能Ｉ
Ｄ、第３のフィールドには表現文字列が格納される。The expression character string table of FIG. 9A has a table structure in which each entry has three fields. The first field has an entry ID, and the second field has the function of the expression character string. Function I that expresses
D, an expression character string is stored in the third field.

【００４４】同図（Ｂ）に示す文タイプ判定テーブル
は、３つのフィールドを持つテーブル構造であり、第１
のフィールドには文タイプ、第２のフィールドには文サ
ブタイプが格納され、第３のフィールドには各々が文タ
イプ、文サブタイプを判定するための表現文字列パター
ンの組合せを表現する機能ＩＤリストと呼ぶ構造の先頭
要素へのポインタのリストが格納される。The sentence type determination table shown in FIG. 4B has a table structure having three fields.
Field stores a sentence type, a second field stores a sentence subtype, and a third field stores a function type that represents a combination of a sentence type and a character string pattern for determining the sentence subtype. A list of pointers to the first element of a structure called a list is stored.

【００４５】第１のフィールドの文タイプには同図に示
すように「情報要求」と「ゴール言明」等が設定されて
いる。「ゴール言明」とは、ユーザ自身の目標（ゴー
ル）を表現するものである。第３フィールドの機能ＩＤ
リストにおける各セル要素は、２つのフィールドを持っ
ている。機能ＩＤリストのフィールド内の第１のフィー
ルドには、機能ＩＤ、第２のフィールドには後続し得る
セルへのポインタのリストが格納される。In the sentence type of the first field, "information request" and "goal statement" are set as shown in FIG. The “goal statement” expresses a user's own goal (goal). Function ID of the third field
Each cell element in the list has two fields. The first field in the function ID list field stores a function ID, and the second field stores a list of pointers to cells that can follow.

【００４６】文タイプ判定部３は、図８（Ｂ）に示す文
タイプ判定辞書２の文タイプ判定テーブルのエントリ
と、処理対象である日本語文との文字列照合を行うこと
により、処理対象文が指定された文タイプを持つかを判
定する。以下に文タイプ判定部３の動作論理の概要を説
明する。The sentence type determination unit 3 compares the entry of the sentence type determination table of the sentence type determination dictionary 2 shown in FIG. 8B with a Japanese sentence to be processed, thereby obtaining a sentence to be processed. Determines whether has the specified statement type. The outline of the operation logic of the sentence type determination unit 3 will be described below.

【００４７】図９は、本発明の一実施例の文タイプ判定
部の動作の概要を示すフローチャートである。同図に示
す処理は４つのブロック（ステップ３００、４００、５
００、６００）に分けられる。それらの詳細は、図１
０、図１１、図１２、図１３に詳細に説明する。以下に
示す各フローチャートにおいて、以下のような記号及び
関数を示す。・ｉ，ｊ，ｋ，Ｌ，Ｍ，Ｎ … 整数・Ｒ１，Ｒ２，Ｒ３，Ｒ４ … 集合・｛｝… 空集合・｛Ａ，Ｂ，Ｃ｝ … Ａ，Ｂ，Ｃの３つの要素を持つ
集合・＋ … 集合に対する和集合を作る演算・Ｓ：対象日本語文の文字列・Ｌ（ｉ）：エントリＩＤ＝ｉである表現文字列リスト
のエントリ・Ｌ（ｉ）．fid, Ｌ（ｉ）．string：同エントリにお
ける機能ＩＤ，表現文字列・Ｔ（ｉ）：文タイプ判定テーブルにおけるｉ番目のエ
ントリ・Ｔ（ｉ）．type：該エントリにおける文タイプ・Ｔ（ｉ）．pointer(k)：該エントリにおける機能ＩＤ
リストにおけるｋ番目のポインタ・search(X,Y) ：文字列Ｙが文字列Ｘの部分文字列であ
る場合、文字列Ｘの文字列Ｙにおける先頭位置を返す関
数。部分文字列でない場合は０を返す。・comb(i,X）：集合Ｘからｉ子の要素を選択することに
より生成可能な集合を要素とする集合を生成する関数。
Ｘの要素数をＮとすれば、comb（Ｎ，Ｘ）＝｛Ｘ｝であ
る。・check-overlap （Ｘ）：Ｘは集合Ｒ２の要素であり、
｛…，｛pos-i, length-i, fid-i｝, …｝という構造を
持つ。check-overlap(Ｘ）は、Ｘの要素を全て調べ、部
分文字列の先頭位置（pos)、長さ（len ）の情報から、
Ｘに格納されている各部分文字列において、オーバーラ
ップが生じるかをチェックし、yes/noを返却する関数で
ある。・check-list(pointer, Ｘ）：pointer は、文タイプ判
定テーブルのあるエントリにおける機能ＩＤリストに格
納されたポインタである。Ｘは、集合Ｒ３の要素であ
り、｛…，｛pos-i, length-i,fid-i …｝という構造を
持つ。check-list(pointer, Ｘ）は、Ｘの各要素におけ
る機能ＩＤ（fid-i)とpointer からポイントされる機能
ＩＤリストにおける各要素セルに格納された機能ＩＤが
全て一致するかをチェックし、yes/noを返却する関数で
ある。・compute-length（Ｘ）：Ｘは集合Ｒ４の要素であり、
｛｛…，｛pos, length-i,fid-i ｝, …｝, ｉ，ｋ｝と
いう構造を持つ。compute-length（Ｘ）は、Ｘの第１要
素である集合｛…，｛pos, length-i,fid-i ｝, …｝の
各要素を走査し、文字列長（len-i ）の合計を求める。・compute-items （Ｘ）：Ｘは集合Ｒ４の要素であり、
｛｛…，｛pos, length-i,fid-i ｝, …｝, ｉ，ｋ｝と
いう構造を持つ。compute-items （Ｘ）は、Ｘの第１要
素である集合｛…，｛pos, length-i,fid-i ｝, …｝の
要素数を求める。FIG. 9 is a flowchart showing the outline of the operation of the sentence type determination unit according to one embodiment of the present invention. The processing shown in FIG. 4 includes four blocks (steps 300, 400, 5).
00, 600). See Figure 1 for details.
0, FIG. 11, FIG. 12, and FIG. In the following flowcharts, the following symbols and functions are shown.・ I, j, k, L, M, N ... integer ・ R1, R2, R3, R4 ... set ・｛｝ ... empty set ・ {A, B, C} ... has A, B, C Set · + ··· Operation to create a union to the set · S: Character string of target Japanese sentence · L (i): Entry of expression string list with entry ID = i · L (i). fid, L (i). string: Function ID and expression character string in the entry ・ T (i): i-th entry in the sentence type determination table ・ T (i). type: sentence type in the entry • T (i). pointer (k): function ID in the entry
K-th pointer in the list search (X, Y): a function that returns the starting position of the character string X in the character string Y when the character string Y is a partial character string of the character string X. Returns 0 if not a substring. Comb (i, X): a function for generating a set having a set that can be generated by selecting an element of an i child from the set X as an element.
If the number of elements of X is N, comb (N, X) = {X}. Check-overlap (X): X is an element of the set R2,
It has the structure ｛…, {pos-i, length-i, fid-i},…｝. check-overlap (X) checks all the elements of X, and from the information on the head position (pos) and length (len) of the substring,
This function checks whether an overlap occurs in each partial character string stored in X and returns yes / no. Check-list (pointer, X): pointer is a pointer stored in the function ID list of a certain entry in the sentence type determination table. X is an element of the set R3, and has a structure of {..., {pos-i, length-i, fid-i ...}}. check-list (pointer, X) checks whether the function ID (fid-i) in each element of X and the function ID stored in each element cell in the function ID list pointed to by the pointer all match, This function returns yes / no. • compute-length (X): X is an element of the set R4,
{..., {pos, length-i, fid-i}, ...}, i, k}. compute-length (X) scans each element of the set {..., {pos, length-i, fid-i}, ...} which is the first element of X, and sums the character string length (len-i). Ask for. • compute-items (X): X is an element of the set R4,
{..., {pos, length-i, fid-i}, ...}, i, k}. compute-items (X) calculates the number of elements of a set {..., {pos, length-i, fid-i},.

【００４８】ステップ３００）まず、文タイプ判定部
３は、図８（Ａ）の表現文字列テーブルを参照して、対
象日本語文の部分文字列となっている表現文字列を求め
る。処理の詳細を図１０に示す。ステップ４００）ステップ３００により抽出された表
現文字列の対象文における可能な組合せパターンを求め
る。処理の詳細を図１１に示す。Step 300) First, the sentence type determination unit 3 refers to the expression character string table of FIG. 8A to obtain an expression character string that is a partial character string of the target Japanese sentence. FIG. 10 shows the details of the processing. Step 400) A possible combination pattern in the target sentence of the expression character string extracted in step 300 is obtained. FIG. 11 shows the details of the processing.

【００４９】ステップ５００）組合せパターンと文タ
イプ判定テーブル（図８（Ｂ））の照合を行い、文タイ
プを判定する。処理の詳細を図１２に示す。ステップ６００）文タイプ判定テーブルにおける複数
のエントリとの照合が成功する場合、ひとつのエントリ
を選択する。処理の詳細を図１３に示す。Step 500) The combination pattern and the sentence type determination table (FIG. 8B) are collated to determine the sentence type. FIG. 12 shows the details of the processing. Step 600) If matching with a plurality of entries in the sentence type determination table is successful, one entry is selected. FIG. 13 shows details of the processing.

【００５０】図１０は、本発明の一実施例の文タイプ判
定部の第１の動作を説明するフローチャートである。同
図は、上記のステップ３００の詳細な動作を示す。第１
の動作は、対象日本語文の部分文字列となっている表現
文字列を求める。ステップ３０１）ｉを１とし、Ｎに表現文字列リスト
のエントリ数を設定し、Ｒ１は空集合｛｝とする。FIG. 10 is a flow chart for explaining the first operation of the sentence type judging section according to one embodiment of the present invention. FIG. 10 shows the detailed operation of the above-described step 300. First
Operation finds an expression character string that is a partial character string of the target Japanese sentence. Step 301) i is set to 1, N is set to the number of entries in the expression character string list, and R1 is an empty set ｛｝.

【００５１】ステップ３０２）部分文字列の先頭位置
（ｐｏｓ）がpos ＝serach( Ｓ，Ｌ（ｉ）．string）≠
０であれば、ステップ３０３に移行し、pos ＝serach(
Ｓ，Ｌ（ｉ）．string）＝０であればステップ３０４に
移行する。ステップ３０３）Ｒ１＝Ｒ１＋｛pos, length （Ｌ
(i).string），Ｌ（ｉ）．fid ｝; とする。Step 302) The start position (pos) of the partial character string is pos = search (S, L (i) .string)}
If 0, the process proceeds to step 303, where pos = serach (
S, L (i). If (string) = 0, the process proceeds to step 304. Step 303) R1 = R1 + ｛pos, length (L
(i) .string), L (i). fid｝;

【００５２】ステップ３０４）ｉをインクリメントす
る（ｉ＝ｉ＋１）。ステップ３０５）ｉ＞Ｎであれば、ステップ３０６に
移行し、ｉ≦Ｎであればステップ３０２に移行する。ステップ３０６）Ｒが空集合（｛｝）であれば、対象
日本語文を指定された文タイプを持たないとして処理を
終了する。Ｒが空集合でなければステップ３０２に移行
する。Step 304) i is incremented (i = i + 1). Step 305) If i> N, proceed to step 306; if i ≦ N, proceed to step 302. Step 306) If R is an empty set (｛｝), the process ends because the target Japanese sentence does not have the specified sentence type. If R is not an empty set, the process proceeds to step 302.

【００５３】図１１は、本発明の一実施例の文タイプ判
定部の第２の動作を説明するためのフローチャートであ
る。第２の動作は、抽出された表現文字列の対象文にお
ける可能な組合せパターンを求めるものである。ステップ４０１）Ｎ＝Ｒ１の要素数とし、Ｒ２＝comb
（１，Ｒ１）＋…＋comb（Ｎ−１，Ｒ１）＋Ｒ１とす
る。FIG. 11 is a flow chart for explaining the second operation of the sentence type judging section according to one embodiment of the present invention. The second operation is to determine a possible combination pattern of the extracted expression string in the target sentence. Step 401) N = the number of elements of R1, and R2 = comb
(1, R1) +... + Comb (N-1, R1) + R1.

【００５４】ステップ４０２）ｉ−１とし、Ｍ＝Ｒ２
の要素数とし、Ｒ３＝｛｝とする。ステップ４０３）Ｒ２（ｉ）に格納されている各部分
文字列においてオーバーラップが生じるかをチェック
し、生じない場合（ｎｏ）には、Ｒ３＝Ｒ３＋Ｒ２
（ｉ）とする。Step 402) Set i-1 and M = R2
And R3 = ｛｝. Step 403) It is checked whether or not overlap occurs in each of the partial character strings stored in R2 (i). If no overlap occurs (no), R3 = R3 + R2
(I).

【００５５】ステップ４０４）ｉをインクリメントす
る（ｉ＝ｉ＋１）。ステップ４０５）ｉ＞Ｌであればステップ４０６に移
行し、ｉ≦Ｌであればステップ４０２に移行する。ステップ４０６）Ｒ３＝｛｝であれば、対象日本語文
は指定された文タイプを持たないとして処理を終了す
る。Ｒ３が空集合でなければステップ４０３に移行す
る。Step 404) i is incremented (i = i + 1). Step 405) If i> L, proceed to step 406; if i ≦ L, proceed to step 402. Step 406) If R3 = ｛｝, the target Japanese sentence does not have the specified sentence type, and the process ends. If R3 is not an empty set, the process proceeds to step 403.

【００５６】図１２は、本発明の一実施例の文タイプ判
定部の第３の動作を説明するためのフローチャートであ
る。第３の動作は、組合せパターンと文タイプ判定テー
ブルの照合を行い、文タイプを判断するものである。ステップ５０１）ｉ＝１とし、Ｌに文タイプ判定テー
ブルのエントリ数を設定し、Ｒ４を空集合（｛｝）とす
る。FIG. 12 is a flow chart for explaining the third operation of the sentence type judging section according to one embodiment of the present invention. The third operation is to compare the combination pattern with the sentence type determination table to determine the sentence type. Step 501) Set i = 1, set the number of entries in the sentence type determination table to L, and set R4 to an empty set (｛｝).

【００５７】ステップ５０２）エントリにおける文タ
イプＴ（ｉ）.type が指定された文タイプと一致しなけ
れば、処理を終了する。ステップ５０３）ｊ＝１とし、Ｍに文タイプ判定テー
ブル（図８（Ｂ））のｉ番目のエントリＴ（ｉ）の機能
ＩＤリストのエントリ数を設定する。Step 502) If the sentence type T (i) .type in the entry does not match the specified sentence type, the process ends. Step 503) Set j = 1, and set M to the number of entries in the function ID list of the i-th entry T (i) in the sentence type determination table (FIG. 8B).

【００５８】ステップ５０４）ｋ＝１とし、ＮにＲ３
の要素数を設定する。ステップ５０５）文タイプ判定テーブルのあるエント
リにおける機能ＩＤリストにおける機能ＩＤリストに格
納されたポインタcheck-list（Ｔ（ｉ）．pointer(j),
Ｒ３（ｋ））において、要素セルに格納された機能ＩＤ
が全て一致する場合には、Ｒ４＝Ｒ４＋｛Ｒ３（ｋ），
ｉ，ｊ｝とする。Step 504) Set k = 1 and set N to R3
Set the number of elements of Step 505) The pointer check-list (T (i) .pointer (j), stored in the function ID list of the function ID list in a certain entry of the statement type determination table.
R3 (k)), the function ID stored in the element cell
Are all the same, R4 = R4 + ｛R3 (k),
i, j}.

【００５９】ステップ５０６）ｋをインクリメントす
る（ｋ＝ｋ＋１）。ステップ５０７）ｋ＞Ｎであれば、ステップ５０８に
移行し、ｋ≦Ｎであればステップ５０５に移行する。ステップ５０８）ｊをインクリメントする（ｊ＝ｊ＋
１）。Step 506) Increment k (k = k + 1). Step 507) If k> N, proceed to step 508; if k ≦ N, proceed to step 505. Step 508) Increment j (j = j +
1).

【００６０】ステップ５０９）ｊ＞Ｍであれば、ステ
ップ５１０に移行し、ｊ≦Ｍであればステップ５０５に
移行する。ステップ５１０）ｉをインクリメントする（ｉ＝ｉ＋
１）。ステップ５１１）ｉ＞Ｌであればステップ５１２に移
行し、ｉ≦Ｌであれば、ステップ５０５に移行する。Step 509) If j> M, proceed to step 510; if j ≦ M, proceed to step 505. Step 510) Increment i (i = i +
1). Step 511) If i> L, proceed to step 512; if i ≦ L, proceed to step 505.

【００６１】ステップ５１２）Ｒ４が空集合（｛｝）
であれば、対象日本語文は指定された文タイプを持たな
いとして処理を終了する。ステップ５１３）Ｒ４の要素数が１なら、当該要素の
文タイプと文サブタイプの組を返却して処理を終了す
る。Ｒ４の要素数が１より大きい場合には、ステップ５
０４に移行する。Step 512) R4 is an empty set (｛｝)
If so, the processing is terminated assuming that the target Japanese sentence does not have the specified sentence type. Step 513) If the number of elements of R4 is 1, a set of the sentence type and the sentence subtype of the element is returned, and the process ends. If the number of elements of R4 is greater than 1, step 5
Move to 04.

【００６２】図１３は、本発明の一実施例の文タイプ判
定部の第４の動作を説明するためのフローチャートであ
る。第４の動作は、文タイプ判定テーブルにおける複数
のエントリとの照合が成功する場合、一つのエントリを
選択する。ステップ６０１） select＝１とし、length＝compute-
length（Ｒ４（１））とし、items ＝compute-items
（Ｒ４（１））とする。FIG. 13 is a flow chart for explaining the fourth operation of the sentence type judgment unit according to one embodiment of the present invention. The fourth operation is to select one entry when matching with a plurality of entries in the sentence type determination table is successful. Step 601) Select = 1, length = compute-
length = R4 (1), items = compute-items
(R4 (1)).

【００６３】ステップ６０２）ｉ＝２とし、ＮにＲ４
のエントリ数を設定する。ステップ６０３） length＜compute-items （Ｒ４
（ｉ））であれば、ステップ６０４に移行し、length≧
compute-items （Ｒ４（ｉ））であれば、ステップ６０
６に移行する。Step 602) When i = 2, N is R4
Set the number of entries. Step 603) length <compute-items (R4
If (i)), the process proceeds to step 604, where length ≧
If it is compute-items (R4 (i)), step 60
Move to 6.

【００６４】ステップ６０４） items ＞compute-item
s （Ｒ４（ｉ））であれば、ステップ６０５に移行し、
items ≦compute-items （Ｒ４（ｉ））であれば、ステ
ップ６０６に移行する。ステップ６０５） select＝ｉ、length＝compute-leng
th（Ｒ４（ｉ））、items ＝compute-items （Ｒ４
（ｉ））とする。ステップ６０６）ｉをインリメントする（ｉ＝ｉ＋
１）。Step 604) items> compute-item
If s (R4 (i)), the process proceeds to step 605,
If items ≦ compute-items (R4 (i)), the process proceeds to step 606. Step 605) select = i, length = compute-leng
th (R4 (i)), items = compute-items (R4
(I)). Step 606) Increment i (i = i +
1).

【００６５】ステップ６０７）ｉ＞Ｎであれば、ステ
ップ６０８に移行し、ｉ≦Ｎであれば、ステップ６０３
に移行する。ステップ６０８）Ｎの当該要素の文タイプと文サブタ
イプの組を返却して処理を終了する。Step 607) If i> N, proceed to step 608. If i ≦ N, step 603
Move to Step 608) Return the set of the sentence type and the sentence subtype of the element of N and end the process.

【００６６】図５に示す日本語質問メッセージに対する
文タイプ判定部３の動作の概要を上記の図９〜１３のフ
ローチャートに基づいて説明する。（１）まず、第１文『初めて質問します。』は、ステ
ップ３００により図８（Ｂ）の表現文字列テーブルに格
納されている文字列のいずれとも照合しないことが分か
る。従って、情報要求文とは判断されない。The outline of the operation of the sentence type determination unit 3 for the Japanese question message shown in FIG. 5 will be described with reference to the flowcharts of FIGS. (1) First sentence, "I will ask you a question for the first time. ] Does not match any of the character strings stored in the expression character string table of FIG. Therefore, it is not determined as an information request statement.

【００６７】（２）次に、第２文『１７インチモニタ
ーをＶＡ０６６に接続したいと考えています。』は、ス
テップ３００により図８（Ｂ）の表現文字列テーブルに
格納されているエントリＩＤ＝１３の表現のみと照合一
致する。このエントリ機能ＩＤは８００であり、ステッ
プ５００により、これは、“情報要求”の文タイプを規
定する表現文字列パターンの組合せとは判断されない。
従って、情報要求文とは判定されない。(2) Next, the second sentence “I want to connect a 17-inch monitor to VA066”. ] Matches and matches only the expression of entry ID = 13 stored in the expression character string table of FIG. The entry function ID is 800, and according to step 500, this is not determined to be a combination of expression character string patterns that define the sentence type of "information request".
Therefore, it is not determined as an information request statement.

【００６８】（３）次に、第３文『そのために必要な
接続用ケーブルを探していますが、』は、ステップ３０
０により図８（Ｂ）の表現文字列テーブルに格納されて
いるエントリＩＤ＝３の表現文字列のみと照合する。こ
のエントリは機能ＩＤが１００であり、ステップ５００
により、これは、“情報要求”の文タイプを規定する表
現文字列パターンの組合せと判定される。同時に、文サ
ブタイプが“対象物品提示”であることが副作用として
得られる。(3) Next, the third sentence “I am looking for a connection cable necessary for that,
With 0, it is checked against only the expression character string of entry ID = 3 stored in the expression character string table of FIG. This entry has a function ID of 100 and a step 500
Thus, this is determined to be a combination of expression character string patterns that define the sentence type of “information request”. At the same time, the fact that the sentence subtype is "target article presentation" is obtained as a side effect.

【００６９】（４）第４文『どのお店に売っているか
ご存じの方はいらっしゃいませんか。』は、ステップ３
００により図８（Ｂ）の表現文字列テーブルに格納され
ている３つのエントリ、即ち、エントリＩＤ＝５，エン
トリＩＤ＝７，エントリＩＤ＝１１の表現文字列と照合
し、これらの機能ＩＤは、それぞれ、２００、３００、
４００である。ステップ３００の結果である集合Ｒ１
は、Ｒ１＝｛｛１，１１，２００｝｝，｛｛１２，７，３０
０｝｝，｛｛１９，１０，４００｝｝となる。また、ステップ４００の処理結果である集合Ｒ
３は、Ｒ３＝｛｛｛１，１１，２００｝｝，｛｛１２，７，３
００｝｝，｛｛１９，１０，４００｝｝，｛｛１，１
１，２００｝，｛１２，７，３００｝｝，｛｛１，１
１，２００｝，｛１９，１０，４００｝｝，｛｛１２，
７，３００｝，｛１９，１０，４００｝｝，｛｛１，１
１，２００｝，｛１２，７，３００｝，｛１９，１０，
４００｝｝｝となる。さらに、このＲ３に対してステップ５００によ
る文タイプ判定テーブルとの照合を行うと、その結果と
して、集合Ｒ４が、Ｒ４＝｛｛｛｛１２，７，３００｝，｛１９，１０，４
００｝｝，１，２｝，｛｛｛１，１１，２００｝，｛１
２，７，３００｝｝，｛１９，１０，４００｝｝，１，
１｝｝と得られる。これは、“情報要求”の文タイプを与える
部分文字列の組合せが２通りあることを示す。よって処
理は、ステップ６００に移行する。ステップ６００で
は、照合した表現文字列によってカバーされる文字列の
長さがカウントされ比較される。ここでは、Ｒ４の第２
要素で与えられる組合せがカバーする文字列長（＝２
８）の方が第１要素で与えられる組合せがカバーする文
字列長（＝１７）より長いので、第２要素で与えられる
組合せが選択される。よって、文タイプ＝“情報要求”
が確認され、さらに、文サブタイプ＝“物品の入手”が
副作用として得られる。(4) Fourth sentence "Do you know which store you are selling? Is Step 3
00, the three entries stored in the expression string table of FIG. 8B are compared with the expression strings of entry ID = 5, entry ID = 7, and entry ID = 11. , Respectively, 200, 300,
400. Set R1 resulting from step 300
R1 = {1,11,200}, {12,7,30
0}, {19, 10, 400}. Also, the set R which is the processing result of step 400
3, R3 = {1, 11, 200}, {12, 7, 3}
00, 19, 10, 400, 1, 1
1,200｝, {12,7,300}, ｛｛1,1
1,200｝, ｛19,10,400 ｛｛, ｛｛12,
7,300, {19,10,400}, {1,1
1,200｝, {12,7,300}, {19,10,
400｝｝｝. Further, when this R3 is compared with the sentence type determination table in step 500, as a result, the set R4 becomes R4 = {12, 7, 300}, {19, 10, 4}
00, 1, 2, {1, 11, 200}, {1
2,7,300}, {19,10,400}, 1,
1｝｝ is obtained. This indicates that there are two combinations of partial character strings giving the sentence type of "information request". Therefore, the process proceeds to step 600. In step 600, the length of the character string covered by the matched expression string is counted and compared. Here, the second of R4
Character string length covered by the combination given by the element (= 2
8) is longer than the character string length (= 17) covered by the combination provided by the first element, so the combination provided by the second element is selected. Therefore, sentence type = "information request"
Is confirmed, and the sentence subtype = “obtain article” is obtained as a side effect.

【００７０】（５）最後に、第５文『よろしくお願い
致します。』は、図８（Ｂ）の表現文字列リストに格納
されている文字列のいずれとも照合しないため、第１文
の場合と同様に（ステップ３００）情報要求文とは判定
されない。以上の処理により、第３文『そのために必要
な接続用ケーブルを探していますが、』に対して、文タ
イプ（“情報要求”）及び文サブタイプ（“物品の入
手”）が求められ、該当する処理テーブル１３のエント
リの第３、第４フィールドに格納される。この状況を図
１４に示す。(5) Finally, the fifth sentence, "Thank you. Does not match any of the character strings stored in the expression character string list in FIG. 8B, and therefore is not determined to be an information request sentence (step 300) as in the case of the first sentence. As a result of the above processing, the sentence type (“information request”) and the sentence subtype (“acquisition of goods”) are requested for the third sentence “I am looking for the necessary connection cable.” The information is stored in the third and fourth fields of the corresponding entry of the processing table 13. This situation is shown in FIG.

【００７１】文タイプ判定部３によって、文タイプが
“情報要求”であると判定された日本語質問メッセージ
の文は、入力された日本語文に対して形態素・構文解析
を行う日本語基本解析部５に転送される。 iii) 日本語基本解析処理図５に示す日本語質問メッセージにおける第３文に対す
る日本語基本解析の結果の概要を図１５に示す。同図に
示すように、基本解析の結果は、文節を節点、文節間の
係り受け関係を枝、係り受け関係の属性の枝へのラベ
ル、最終文節を根とする構文木構造として表現され、処
理テーブル１３の第５フィールドへの根節点のアドレス
（根節点へのポインタ）が格納される。ここで、各節点
には、各文節に対する形態素解析の結果が保持されるも
のとする。なお、形態素解析や構文解析の処理は通常の
日本語処理システムに具備されているものであるので、
その実現方法については本発明では限定するものではな
い。The sentence of the Japanese question message whose sentence type is determined to be “information request” by the sentence type determining unit 3 is a Japanese basic analyzing unit that performs morphological and syntactic analysis on the input Japanese sentence. 5 is transferred. iii) Basic Japanese Analysis Process FIG. 15 shows an outline of the result of the basic Japanese analysis of the third sentence in the Japanese question message shown in FIG. As shown in the figure, the result of the basic analysis is expressed as a syntax tree structure in which a clause is a node, a dependency relation between the clauses is a branch, a label to a branch of an attribute of the dependency relation, and a final clause is a root, The address of the root node (pointer to the root node) to the fifth field of the processing table 13 is stored. Here, it is assumed that each node holds the result of the morphological analysis for each phrase. Note that the processing of morphological analysis and syntax analysis is provided in a normal Japanese processing system.
The present invention is not limited by the present invention.

【００７２】文タイプ判定部３によって、文タイプが
“情報要求”であると判定され、日本語基本解析部５に
よる解析処理が行われた日本語質問メッセージの文は、
意味構造解析部７へ転送される。意味構造解析部７は、
日本語質問メッセージにおいて、固有な意味構造パター
ンを格納した、予め用意された意味構造パターン辞書６
におけるエントリと日本語基本解析部５の解析結果とを
照合することにより、情報要求文の意味構造を求める。The sentence type determination unit 3 determines that the sentence type is “information request”, and the sentence of the Japanese question message analyzed by the Japanese basic analysis unit 5 is:
It is transferred to the semantic structure analysis unit 7. The semantic structure analysis unit 7
A prepared semantic structure pattern dictionary 6 that stores unique semantic structure patterns in Japanese question messages.
Is compared with the analysis result of the Japanese basic analysis unit 5 to determine the semantic structure of the information request sentence.

【００７３】図１６は、本発明の一実施例の意味構造パ
ターン辞書６のエントリ例を示す。矢印の左側は、統合
すべき構文木構造であり、矢印の右側は照合が成功した
場合に変換すべき構文木構造を示している。同図に示さ
れるように、辞書のエントリは、部分的な構文木構造と
して表現されている。また、同図において、“＄ＨＥＡ
Ｄ”、“＄Ｘ”等はマッチする部分木構造を保持するた
めの変数であり、｛｝内に列挙される“ＴＹＰＥ＝＝
“情報要求”等の表現は、各節点自体への制約を示す。
例えば、当該表現は、該当する文タイプが“情報要求”
でなければならないという制約を表現している。同様
に、“ＳｕｂＴｙｐｅ＝＝“対象物品の提示”は、文サ
ブタイプに関する制約を表現している。FIG. 16 shows an example of entries in the semantic structure pattern dictionary 6 according to one embodiment of the present invention. The left side of the arrow indicates a syntax tree structure to be integrated, and the right side of the arrow indicates a syntax tree structure to be converted when the collation is successful. As shown in the figure, the dictionary entry is expressed as a partial syntax tree structure. In the same figure, "@HEA
"D", "$ X", etc. are variables for holding the matching subtree structure, and "TYPE ==
Expressions such as “information request” indicate restrictions on each node itself.
For example, the expression has a corresponding sentence type of “information request”.
Expresses the constraint that it must be Similarly, “SubType ==“ presentation of target article ”expresses a constraint on a sentence subtype.

【００７４】意味構造解析部７は、入力された構文木構
造のなるべく多くの部分が意味構造パターン辞書６にエ
ントリされている部分構文木構造で覆われるように照合
処理を行い、照合が成功した場合、構文木構造と属性と
属性値のペアの集合により、表現される指定された意味
構造へと変換する。図１５に示された日本語基本解析部
５の処理結果は、意味構造解析部７により図１６
（Ａ），（Ｂ）にエントリ例を示す意味構造パターン辞
書６のエントリとの照合処理によって、図１７に示すよ
うな意味構造へと変換される。図１７において、“：ｎ
ｅｅｄｅｄ−ｆｏｒ”や“：ｉｔｅｍ−ｆｏｒ”等の意
味属性名を表す標識は、意味構造パターン辞書６におけ
る対応するエントリにより与えられる。The semantic structure analysis unit 7 performs the matching process so that as much of the input syntax tree structure as possible is covered by the partial syntax tree structure entered in the semantic structure pattern dictionary 6, and the matching is successful. In this case, the data is converted into a designated semantic structure expressed by a set of a syntax tree structure and pairs of attributes and attribute values. The processing result of the Japanese basic analysis unit 5 shown in FIG.
17A and 17B, the entry is converted into a semantic structure as shown in FIG. In FIG. 17, ": n
Indicators indicating semantic attribute names such as "eded-for" and ": item-for" are given by corresponding entries in the semantic structure pattern dictionary 6.

【００７５】意味構造解析部７の処理結果である意味構
造は、その記憶領域へのポインタが処理テーブル１３の
第５フィールドへ格納されることにより、保持される。
この結果は、さらに、参照表現解消制御テーブル８と文
タイプ判定部３を用いて参照表現の解消処理を制御する
参照表現解消制御部９へ転送される。The semantic structure which is the result of the processing by the semantic structure analyzing unit 7 is held by storing a pointer to the storage area in the fifth field of the processing table 13.
This result is further transferred to the reference expression elimination control unit 9 that controls the reference expression elimination process using the reference expression elimination control table 8 and the sentence type determination unit 3.

【００７６】iv) 参照表現解消処理参照表現解消制御部９は、入力された情報要求文の意味
構造に基づいて、上記の参照表現解消制御テーブル８か
ら得た探索方向に関する情報に基づいて参照先を含む可
能性のある候補文を選択する。この際、上記の参照表現
解消制御テーブル８から得た文タイプの情報を用いて文
タイプ判定部３を起動して、候補文の適格性を判定す
る。Iv) Reference expression elimination processing The reference expression elimination control unit 9 determines the reference destination based on the search direction information obtained from the reference expression elimination control table 8 based on the semantic structure of the input information request sentence. Is selected. At this time, the sentence type determining unit 3 is activated using the sentence type information obtained from the reference expression elimination control table 8 to determine the eligibility of the candidate sentence.

【００７７】図１８は、本発明の一実施例の参照表現解
消制御部の動作を示すフローチャートである。ステップ７０１）参照表現解消制御部９は、処理対象
の情報９応急文の意味構造を走査し、属性値として表さ
れる言語表現解析部分に含まれる参照表現を抽出する。
例えば、「その」、「あの」等の文脈指示御等による参
照表現を抽出する。FIG. 18 is a flowchart showing the operation of the reference expression cancellation control unit according to one embodiment of the present invention. Step 701) The reference expression elimination control unit 9 scans the semantic structure of the information 9 to be processed and extracts the reference expression included in the linguistic expression analysis part represented as the attribute value.
For example, a reference expression based on a context instruction such as “that” or “that” is extracted.

【００７８】ステップ７０２）初期設定として、カウ
ントｉ＝１とし、Ｎを抽出した参照表現の数、Ｌを処理
対象文のメッセージ先頭からの位置とする。ステップ７０３）次に、これらの参照表現と意味構造
における意味属性キーとして、参照表現の参照先を探索
する探索方向、探索対象となる日本語文の文タイプ、参
照先の用言の言語表現単位を格納した、予め用意された
参照表現解消テーブル８を検索し、これらの制御情報を
取得する。Step 702) Initially, count i = 1, N is the number of reference expressions extracted, and L is the position of the statement to be processed from the beginning of the message. Step 703) Next, as the semantic attribute key in the reference expression and the semantic structure, the search direction for searching the reference destination of the reference expression, the sentence type of the Japanese sentence to be searched, and the linguistic expression unit of the reference target idiom. The stored reference expression elimination table 8 is retrieved and the control information is obtained.

【００７９】ステップ７０４）以下、ステップ７０３
の結果に基づいて処理対象の参照表現を含む文に近い文
から順に指定された探索方向に沿って一文ずつ参照先が
求められるまで以下の処理を行う。まず、探索方向Ｄが
同一文である場合に、ステップ７０５に移行し、異なる
場合には、ステップ７０８に移行する。Step 704) Hereinafter, step 703
Based on the result of the above, the following processing is performed in order from the sentence closest to the sentence including the reference expression to be processed along the designated search direction one by one until the reference destination is obtained. First, if the search direction D is the same sentence, the process proceeds to step 705; otherwise, the process proceeds to step 708.

【００８０】ステップ７０５）参照表現解消制御部９
は、文タイプ判定部３を起動して、探索方向Ｄが同一文
である場合には、参照表現解消部１０を起動して、実際
に参照表現の参照先を求める。ステップ７０６）処理が成功した場合にはステップ７
０７に移行し、失敗の場合には、ステップ７１９に移行
する。Step 705) Reference expression cancellation control section 9
Starts the sentence type determination unit 3 and, when the search direction D is the same sentence, starts the reference expression resolution unit 10 and actually obtains the reference destination of the reference expression. Step 706) If the processing is successful, go to Step 7
07, and if unsuccessful, the flow proceeds to step 719.

【００８１】ステップ７０７）意味構造における処理
対象の参照表現の部分を参照先に対する意味構造で置換
し、ステップ７１９に移行する。ステップ７０８）カウントｊに１を設定する。ステップ７０９）参照表現解消制御部９は、探索方向
Ｄが前方である場合には、ステップ７１０に移行し、そ
れ以外である場合には、ステップ７１１に移行する。Step 707) Replace the part of the reference expression to be processed in the semantic structure with the semantic structure for the reference destination, and proceed to step 719. Step 708) The count j is set to 1. Step 709) If the search direction D is forward, the reference expression elimination control unit 9 proceeds to Step 710; otherwise, proceeds to Step 711.

【００８２】ステップ７１０）第（Ｌ−ｊ）文を候補
文とし、ステップ７１２に移行する。ステップ７１１）参照表現解消制御部９は、探索方向
Ｄが“前方”以外の場合には、第（Ｌ＋ｊ）文を候補文
とする。Step 710) The (Lj) -th sentence is set as a candidate sentence, and the flow advances to step 712. Step 711) When the search direction D is other than “forward”, the reference expression elimination control unit 9 sets the (L + j) th sentence as a candidate sentence.

【００８３】ステップ７１２）文タイプ判定部３を起
動する。ステップ７１３）文タイプ判定部３は、候補の文が指
定された文タイプを有するかを判定し、有している場合
には、ステップ７１４に移行し、有していない場合に
は、ステップ７１７に移行する。Step 712) The sentence type judgment section 3 is started. Step 713) The sentence type determination unit 3 determines whether the candidate sentence has the specified sentence type. If so, the process proceeds to step 714; otherwise, the process proceeds to step 717. Move to

【００８４】ステップ７１４）上記の判定が合格であ
れば、参照表現解消制御部９は、参照表現解消部１０を
起動して、実際に参照表現の参照先を求める。なお、こ
の処理自体は、表現表記の同一性や、シソーラスを用い
た意味的な同一性のチェック等の既存の技術で実現され
るものとし、その詳細は、本発明の要旨には直接関連が
ないので、説明は省略する。Step 714) If the above judgment is successful, the reference expression elimination control unit 9 activates the reference expression elimination unit 10 and actually obtains the reference destination of the reference expression. It is assumed that this processing itself is realized by an existing technology such as checking the identity of the expression notation and the semantic identity using a thesaurus, and details thereof are directly related to the gist of the present invention. Since there is no description, the description is omitted.

【００８５】ステップ７１５）参照表現の参照先が得
られた場合には、ステップ７１６に移行し、取得できな
った場合には、ステップ７１７に移行する。ステップ７１６）参照表現の部分を参照先に対する意
味構造で置換し、ステップ７２０に移行する。Step 715) If the reference destination of the reference expression has been obtained, the flow shifts to step 716; otherwise, the flow shifts to step 717. Step 716) Replace the part of the reference expression with the semantic structure for the reference destination, and proceed to Step 720.

【００８６】ステップ７１７）カウントｊをインリメ
ントする（ｊ＝ｊ＋１）。ステップ７１８）探索方向Ｄが“前方”であるとき
に、ｊ＜１または、探索方向Ｄが“後方”時にｊ＜質問
メッセージにおける文数であれば、ステップ７１９に移
行し、それ以外であれば、ステップ７０８に移行する。Step 717) The count j is incremented (j = j + 1). Step 718) When j <1 when the search direction D is “forward” or j <the number of sentences in the question message when the search direction D is “backward”, the process proceeds to step 719, otherwise. , To step 708.

【００８７】ステップ７１９）カウントｉをインクリ
メントする（ｉ＝ｉ＋１）。ステップ７２０）カウントｉが抽出した参照表現の数
Ｎより大きい場合には、処理を終了し、参照表現の数に
満たない場合には、ステップ７０３に移行する。Step 719) The count i is incremented (i = i + 1). Step 720) If the count i is larger than the extracted number N of reference expressions, the process ends. If the count i is less than the number of reference expressions, the process proceeds to Step 703.

【００８８】なお、上記のフローチャートの動作におい
て、参照表現解消にあたっては、参照先を含む候補文の
基本解析が一般には必要である。本発明では、参照表現
解消部１０は、必要に応じて日本語基本解析部を起動す
るものと仮定する。以下では、図１８、図１９の動作論
理による図１７の例に対する参照表現解消制御部９の動
作を説明する。In the operation of the above-described flowchart, a basic analysis of a candidate sentence including a reference destination is generally required to resolve a reference expression. In the present invention, it is assumed that the reference expression elimination unit 10 activates the Japanese basic analysis unit as needed. Hereinafter, the operation of the reference expression cancellation control unit 9 with respect to the example of FIG. 17 based on the operation logic of FIGS. 18 and 19 will be described.

【００８９】ここで、図１７の例を説明するに足りる参
照表現解消制御テーブル８の構成例を説明する。図２０
は、本発明の一実施例の参照表現解消制御テーブルの例
を示す。当該参照表現解消制御テーブル８は、各エント
リが５つのフィールドを持つテーブル構造である。第１
フィールドには、参照表現、第２フィールドには、対応
する意味属性、第３フィールドには、候補文の探索方
向、第４フィールドには、候補文の文タイプ、第５フィ
ールドには、参照先表現の言語表現単位が格納される。
まず、上記のステップ７０１により、意味属性 “:needed-for ” に対する属性値である〈その〉の部分が唯一の参照表現として抽出される。こ
の判定は、“その”に対する形態素解析の結果から、品
詞が指示詞と同定されることにより行われる。Here, an example of the configuration of the reference expression elimination control table 8 sufficient to explain the example of FIG. 17 will be described. FIG.
9 shows an example of a reference expression elimination control table according to an embodiment of the present invention. The reference expression elimination control table 8 has a table structure in which each entry has five fields. First
The field is a reference expression, the second field is the corresponding semantic attribute, the third field is the search direction of the candidate sentence, the fourth field is the sentence type of the candidate sentence, and the fifth field is the reference destination. The linguistic expression unit of the expression is stored.
First, in step 701, the <value> portion, which is the attribute value for the semantic attribute ": needed-for", is extracted as a unique reference expression. This determination is made by identifying the part of speech as a demonstrative from the result of the morphological analysis for "that".

【００９０】次に、ステップ７０３により参照表現解消
制御テーブル８を検索し、探索方向Ｄ＝“前方”、文タ
イプＴ＝“ゴール言明”、ゴール言明Ｓ＝“ゴール言明
文の命題部分”が得られる。次にステップ７０９により
第２（＝３−１）文が候補文となり、当該文が“ゴール
の言明”の文タイプを持つかが、文タイプ判定部３によ
り判定される。ここでは、文タイプ判定部３の動作の説
明は省略するが、図８に示す文タイプ判定辞書２を用い
て、図９〜図１３に示す動作論理を実施すると、文タイ
プは“ゴール言明”と判定される。よって、処理はステ
ップ７０５に移行し、参照先は、メッセージの第２文 “１７インチモニタをＶＡ０６６に接続したいと考えて
います。”であり、ゴール言明文の命題部分として、〈１７インチモニターをＶＡ０６６に接続する〉と解消
される。以上により、本例に対する参照表現解消制御部
９は、処理を終了する。以上により、図５の日本語質問
メッセージにおける第３文の意味構造は、図２１（Ａ）
に示すようになり、当該構造の格納領域の先頭アドレス
が処理テーブル１３の第５フィールドに格納される。Next, the reference expression elimination control table 8 is searched in step 703 to obtain a search direction D = “forward”, a sentence type T = “goal statement”, and a goal statement S = “proposition part of goal statement”. Can be Next, in step 709, the second (= 3-1) sentence becomes a candidate sentence, and the sentence type determining unit 3 determines whether the sentence has a sentence type of "statement of goal". Here, the description of the operation of the sentence type determination unit 3 is omitted, but when the operation logic shown in FIGS. 9 to 13 is performed using the sentence type determination dictionary 2 shown in FIG. 8, the sentence type becomes “goal statement”. Is determined. Accordingly, the process proceeds to step 705, and the reference destination is the second sentence of the message, "I want to connect a 17-inch monitor to VA066." Connect to VA066>. As described above, the reference expression elimination control unit 9 for the present example ends the processing. As described above, the semantic structure of the third sentence in the Japanese question message in FIG. 5 is as shown in FIG.
The head address of the storage area of the structure is stored in the fifth field of the processing table 13.

【００９１】また、図５の日本語質問メッセージにおけ
る第４文では、「どこのお店に売っている」かをユーザ
が尋ねているところの対象文「接続用のケーブル」が省
略されている。このような言語現象は、ゼロ代名詞と呼
ばれ、日本語の談話構造において頻繁に表れる一種の参
照表現である。この参照先の解消も、参照表現制御部９
により行われる。ゼロ代名詞に対する同定処理も既存の
技術で実現されることを前提とするため、ここでは、ゼ
ロ代名詞の参照先解消に関する参照表現解消部１０の動
作の詳細について規定しないが、結果としては、前文に
おける〈ケーブル〉が参照先として、同定されることを
想定する。第４文に対するここまでの処理による意味構
造は図２１（Ｂ）に示すようになる。In the fourth sentence of the Japanese question message shown in FIG. 5, the target sentence "connection cable" where the user inquires "where the store is sold" is omitted. . Such linguistic phenomena are called zero pronouns, and are a kind of reference expression frequently appearing in Japanese discourse structure. The resolution of the reference destination is also performed by the reference expression control unit 9.
It is performed by Since it is assumed that the identification process for the zero pronoun is also realized by the existing technology, the details of the operation of the reference expression resolving unit 10 regarding the resolution of the reference destination of the zero pronoun are not specified here. It is assumed that <cable> is identified as a reference. The semantic structure of the fourth sentence by the processing so far is as shown in FIG.

【００９２】以上の処理によって入力された日本語質問
メッセージから唯一の情報要求文が抽出され、その意味
構造が得られた場合、それを当該メッセージにおける情
報要求として、出力部１２から出力する。 v) 情報要求統合処理一方、複数の情報要求文が抽出され、それぞれの意味構
造が得られた場合、情報要求構造化部１１は、それぞれ
の情報要求文における情報要求対象物の同一性を判定
し、同一性が確認できる場合は、複数の情報要求文の意
味構造を統合して、単一の意味構造として処理対象の日
本語メッセージにおける情報要求を構造化する。この統
合の処理は、自然言語処理や人口知能の研究開発分野で
広く知られている素性構成のユニフィケーション処理に
より行う。以上の処理手順によって、対象の日本語質問
メッセージから抽出され、構造化された情報要求は、出
力部１２を介して後段に接続される装置へと出力され
る。When only one information request sentence is extracted from the input Japanese question message by the above processing and its semantic structure is obtained, it is output from the output unit 12 as an information request in the message. v) Information request integration processing On the other hand, when a plurality of information request sentences are extracted and their semantic structures are obtained, the information request structuring unit 11 determines the identity of the information request object in each information request sentence. However, if the identity can be confirmed, the information request in the Japanese message to be processed is structured as a single semantic structure by integrating the semantic structures of a plurality of information request sentences. This integration processing is performed by natural language processing and unification processing of feature configurations widely known in the field of research and development of artificial intelligence. According to the above processing procedure, the information request extracted and structured from the target Japanese question message is output via the output unit 12 to a device connected to the subsequent stage.

【００９３】情報要求構造化部１１の動作論理の概要を
図２２に示す。図５に示す日本語質問メッセージから図
２１（Ａ），（Ｂ）に示す２つの情報要求文の意味構造
が得られている状況における情報要求構造化部１１の動
作を説明する。ステップ８０１）情報要求構造化部１１は、意味属性
“:head ”に対する属性値ある項の同一性をチェックす
る。同一性が成立しなければステップ８０２に移行し、
成立する場合には、ステップ８０３に移行する。FIG. 22 shows an outline of the operation logic of the information request structuring unit 11. The operation of the information request structuring unit 11 in a situation where the meaning structures of the two information request sentences shown in FIGS. 21A and 21B are obtained from the Japanese question message shown in FIG. 5 will be described. Step 801) The information request structuring unit 11 checks the identity of an item having an attribute value for the semantic attribute “: head”. If the identity is not established, proceed to step 802,
If the condition holds, the process proceeds to step 803.

【００９４】ステップ８０２）同一性が成立しない場
合には、統合不可として処理を終了する。ステップ８０３）各意味構造における意味属性文サブ
タイプの属性値を比較し、各意味構造における意味属性
文サブタイプの属性値を最も詳細な情報を持つ属性値に
置換する。Step 802) If the identity is not established, it is determined that the integration is impossible and the process is terminated. Step 803) Compare the attribute values of the semantic attribute sentence subtype in each semantic structure, and replace the attribute value of the semantic attribute sentence subtype in each semantic structure with the attribute value having the most detailed information.

【００９５】ステップ８０４）各情報要求文に対する
意味構造間でのユニフィケーション処理を行う（同一の
意味属性を持つ場合は、それぞれの属性値の同一性をチ
ェックする。これらについての全ての同一性が確認でき
れば、各意味構造の和を取る）。Step 804) Perform unification processing between the semantic structures for each information request statement (if they have the same semantic attribute, check the identity of each attribute value. All the identity of these is confirmed. If possible, take the sum of each semantic structure).

【００９６】ステップ８０５）ユニフィケーションが
成功すれば、ステップ８０６に移行し、失敗すれば、ス
テップ８０７に移行する。ステップ８０６）ユニフィケーションが成功した場合
には、その結果を情報要求構造とし、処理を終了する。Step 805) If the unification succeeds, the process proceeds to step 806. If the unification fails, the process proceeds to step 807. Step 806) If the unification succeeds, the result is used as an information request structure, and the process ends.

【００９７】ステップ８０７）ユニフィケーションが
失敗した場合には、各意味構造をそれぞれ独立した情報
要求構造とし、処理を終了する。次に上記の図２２のフ
ローチャートに従って、詳細に説明する。まず、ステッ
プ８０１によって、標識“:head ”を持つ部分構造の同
一性をチェックする。図１６、図１７の入力例において
は、双方とも、標識“:head ”を持つものは、名詞「ケ
ーブル」で一致するため、これは、成功する。このよう
な同一の単語ではなくとも、「フロッピー」と「ＦＤ」
のように同じ意味であるが異表記の単語が用いられる場
合は「二次記憶」と「ハードディスク」のように文脈状
況においては同一の単語とみなせる場合がある。本発明
においては、このような意味の同一性のチェックは、既
存の表記ゆらぎの吸収技術やシソーラスを用いた意味チ
ェック等により達成されるものと仮定し、その詳細につ
いては規定しない。Step 807) If the unification fails, each semantic structure is set to an independent information request structure, and the processing is terminated. Next, a detailed description will be given according to the flowchart of FIG. First, in step 801, the identity of the partial structure having the indicator “: head” is checked. In the input examples of FIG. 16 and FIG. 17, both of those having the sign “: head” match with the noun “cable”, so that this is successful. Even if these words are not the same, "floppy" and "FD"
When words with the same meaning but different notations are used, such as "secondary storage" and "hard disk", they may be regarded as the same word in context context. In the present invention, it is assumed that such a check of semantic identity is achieved by an existing notation fluctuation absorption technique or a semantic check using a thesaurus, and details thereof are not specified.

【００９８】次に、ステップ８０３において、統合され
て得られる意味構造の文サブタイプ属性に与える属性値
を決定する。この属性値は、統合対象となっている各意
味構造における“文サブタイプ”属性の中で最も詳細な
情報を表現しているものとする。この属性は、図２１
（Ａ）では、“ＳＥＥＫ−ＯＢＪＥＣＴ”であり、図２
１（Ｂ）では、“ＨＯＷ−ＴＯ−ＧＥＴ”である。よっ
て、後者がより詳細な情報を表現するものとして選択さ
れる。実際の処理では、図２１（Ａ）の構造の“文サブ
タイプ”属性の属性値が“ＳＥＥＫ−ＯＢＪＥＣＴ”か
ら“ＨＯＷ−ＴＯ−ＧＥＴ”へ書換えられる。Next, in step 803, an attribute value to be given to the sentence subtype attribute of the semantic structure obtained by integration is determined. This attribute value is assumed to represent the most detailed information in the “sentence subtype” attribute in each semantic structure to be integrated. This attribute is shown in FIG.
FIG. 2A shows “SEEK-OBJECT” and FIG.
In 1 (B), it is “HOW-TO-GET”. Therefore, the latter is selected as expressing more detailed information. In the actual processing, the attribute value of the “sentence subtype” attribute in the structure of FIG. 21A is rewritten from “SEEK-OBJECT” to “HOW-TO-GET”.

【００９９】次に処理の制御は、ステップ８０４に移行
する。ステップ８０４では、人口知能や自然言語処理の
分野では広く知られている素性構造のユニフィケーショ
ンと呼ばれる構造統合の処理が行われる。この処理は、
対象の各構造に含まれる属性と属性値のペアの整合性を
チェックし、各構造の情報を包合する最小の情報を持つ
統合された構造を求める処理である。説明中の例に対し
ては、図２３に示す構造が最終的な情報要求構造として
生成され、出力部１２へ転送される。この例では、図２
１（Ｂ）に示す構造に“：ｈｅａｄ”標識以外の情報が
存在しなかったため、図２１（Ａ）の構造における“文
サブタイプ”属性のみが書換えられた構造がそのまま最
終的な統合された情報要求構造となる。この結果は、出
力部１２を介して後段に接続される装置へと転送され
る。Next, the control of the process proceeds to step 804. In step 804, a structure integration process called unification of a feature structure, which is widely known in the field of artificial intelligence and natural language processing, is performed. This process
This is a process of checking the consistency of an attribute-attribute value pair included in each target structure, and finding an integrated structure having the minimum information that encompasses the information of each structure. For the example being described, the structure shown in FIG. 23 is generated as the final information request structure and transferred to the output unit 12. In this example, FIG.
Since there is no information other than the “: head” indicator in the structure shown in FIG. 1 (B), the structure in which only the “sentence subtype” attribute in the structure of FIG. 21 (A) is rewritten is finally integrated as it is. It becomes an information request structure. This result is transferred to a device connected to the subsequent stage via the output unit 12.

【０１００】このような例に対して、統合される構造に
おいて、統合元の２つの構造双方における情報の和が取
られる場合の例を図２４（Ａ）に示し、同じ標識を持つ
部分構造が同一性を持たないために、統合ができない場
合の例を図２４（Ｂ）に示す。この例では、双方の構造
が“needed-for”の標識を持つが、それらの項が違う事
態を記述しているため、両者の構造を統合することはで
きない。このような場合、２つの情報要求構造が出力部
１２を介して出力される。In contrast to such an example, FIG. 24A shows an example in which, in a structure to be integrated, a sum of information in both structures of an integration source is obtained, and a partial structure having the same marker is obtained. FIG. 24B shows an example in which integration is not possible due to lack of identity. In this example, both structures have a "needed-for" sign, but they cannot be combined because their terms describe different situations. In such a case, two information request structures are output via the output unit 12.

【０１０１】なお、本発明の日本語質問メッセージ解析
装置を以下のように適用することも可能である。例え
ば、コンピュータネットワークにおける電子掲示板に投
稿される質問メッセージを本発明の日本語質問メッセー
ジ解析装置を用いて一定のタイミングで解析し、それら
における情報要求を抽出することによって、抽出された
情報要求に適合する情報をコンピュータネットワーク上
に蓄積された質問・回答メッセージのデータベースから
検索することが可能となる。The Japanese-language question message analyzer of the present invention can be applied as follows. For example, a question message posted on an electronic bulletin board in a computer network is analyzed at a certain timing using the Japanese question message analyzer of the present invention, and information requests in those are extracted, thereby conforming to the extracted information request. It is possible to search for information to be performed from a database of question / answer messages stored on a computer network.

【０１０２】さらに、本発明の日本語質問メッセージ解
析装置を電話によるサービス提供に適用する場合には、
入力部１に入力されるデータは、テキストデータである
ので、音声入力された音声データを音声認識処理により
認識された結果をテキストデータに変換する機能を入力
部１の前段に設けることで音声入力による情報検索も可
能となる。このとき、センタ側のオペレータが検索結果
を表示手段上で視認して利用者に電話音声で返答するこ
とも可能である。また、出力部１２から出力される情報
要求によりデータベースを検索した結果を、音声に変換
して利用者に返すことも可能である。Further, when the Japanese question message analysis device of the present invention is applied to service provision by telephone,
Since the data input to the input unit 1 is text data, by providing a function of converting the result of voice input voice data recognized by voice recognition processing into text data at a stage prior to the input unit 1, voice input can be performed. Can also retrieve information. At this time, the operator on the center side can visually recognize the search result on the display unit and reply to the user by telephone voice. Further, it is also possible to convert the result of searching the database in response to the information request output from the output unit 12 into voice and return it to the user.

【０１０３】なお、本発明は、上記の実施例に限定され
ることなく、特許請求の範囲内で種々変更・応用が可能
である。The present invention is not limited to the above embodiments, but can be variously modified and applied within the scope of the claims.

【０１０４】[0104]

【発明の効果】上述のように、本発明によれば、情報検
索システムに対する複雑かつ多様な情報要求の指示を記
述した自然な日本語メッセージに記述される情報要求を
構造化することができる。そのため、本発明による日本
語メッセージ構造化機能は、情報検索システムに組み込
まれることにより、従来のキーワードの論理結合形式で
情報要求を表現するものより柔軟な情報検索システムを
実現するためのコンポーネントとなり得る。As described above, according to the present invention, it is possible to structure an information request described in a natural Japanese message in which instructions of complicated and various information requests to the information retrieval system are described. Therefore, the Japanese message structuring function according to the present invention can be a component for realizing an information search system that is more flexible than the conventional one that expresses an information request in a logical combination format of keywords by being incorporated into the information search system. .

【０１０５】また、本発明による日本語質問メッセージ
構造化装置においては、形態素解析や構文解析といった
文法規則を必要とする基本的な言語解析処理や、参照表
現の参照先を同定する参照表現解消処理が、処理対象の
メッセージ全体から質問メッセージの特定に基づいて選
択される必要な部分のみに適用されるため、（１）処理が高速である；（２）メッセージ全体を処理対象とする通常のメッセー
ジ解析手法に比べ高い解析精度が達成されるという利点
がある。また、質問メッセージの特性に基づいて解析さ
れる意味構造に基づいて参照表現解消処理を制御するた
め、参照表現解消についても高い精度を見込むことがで
きる。In the apparatus for structuring a Japanese question message according to the present invention, a basic language analysis process requiring grammatical rules such as morphological analysis and syntax analysis, and a reference expression resolution process for identifying a reference destination of a reference expression Is applied only to the necessary part selected based on the identification of the question message from the whole message to be processed, and (1) the processing is fast; (2) a normal message to process the whole message There is an advantage that higher analysis accuracy is achieved as compared with the analysis method. Further, since the reference expression elimination process is controlled based on the semantic structure analyzed based on the characteristics of the question message, it is possible to expect high accuracy in reference expression elimination.

【０１０６】また、本発明を種々のメッセージ処理シス
テム等に多様に適用することが可能である。Further, the present invention can be variously applied to various message processing systems and the like.

[Brief description of the drawings]

【図１】本発明の原理を説明するための図である。FIG. 1 is a diagram for explaining the principle of the present invention.

【図２】本発明の原理構成図である。FIG. 2 is a principle configuration diagram of the present invention.

【図３】本発明の一実施例の日本語質問メッセージ解析
装置の構成図である。FIG. 3 is a configuration diagram of a Japanese question message analysis device according to an embodiment of the present invention.

【図４】本発明の一実施例の日本語質問メッセージ解析
装置の動作の概要を示すフローチャートである。FIG. 4 is a flowchart showing an outline of the operation of the Japanese-language question message analysis device according to one embodiment of the present invention.

【図５】本発明の一実施例の入力される日本語質問メッ
セージの例を示す図である。FIG. 5 is a diagram illustrating an example of a Japanese question message input according to an embodiment of the present invention;

【図６】本発明の一実施例の図５のメッセージを分解し
た例を示す図である。FIG. 6 is a diagram showing an example of disassembling the message of FIG. 5 according to one embodiment of the present invention.

【図７】本発明の一実施例の情報要求文抽出部の詳細な
動作を示すフローチャートである。FIG. 7 is a flowchart showing a detailed operation of an information request sentence extracting unit according to one embodiment of the present invention.

【図８】本発明の一実施例の文タイプ判定辞書の例を示
す図である。FIG. 8 is a diagram illustrating an example of a sentence type determination dictionary according to an embodiment of the present invention.

【図９】本発明の一実施例の文タイプ判定部の動作の概
要を示すフローチャートである。FIG. 9 is a flowchart illustrating an outline of an operation of a sentence type determination unit according to an embodiment of the present invention.

【図１０】本発明の一実施例の文タイプ判定部の第１の
動作を説明するためのフローチャートである。FIG. 10 is a flowchart illustrating a first operation of a sentence type determination unit according to one embodiment of the present invention.

【図１１】本発明の一実施例の文タイプ判定部の第２の
動作を説明するためのフローチャートである。FIG. 11 is a flowchart illustrating a second operation of the sentence type determination unit according to one embodiment of the present invention.

【図１２】本発明の一実施例の文タイプ判定部の第３の
動作を説明するためのフローチャートである。FIG. 12 is a flowchart illustrating a third operation of the sentence type determination unit according to one embodiment of the present invention.

【図１３】本発明の一実施例の文タイプ判定部の第４の
動作を説明するためのフローチャートである。FIG. 13 is a flowchart illustrating a fourth operation of the sentence type determination unit according to one embodiment of the present invention.

【図１４】本発明の一実施例の図５のメッセージ判定後
の処理テーブルの例を示す図である。FIG. 14 is a diagram illustrating an example of a processing table after the message determination in FIG. 5 according to an embodiment of the present invention;

【図１５】本発明の一実施例の日本語質問メッセージに
おける日本語基本解析の結果の概要を示す図である。FIG. 15 is a diagram illustrating an outline of a result of a basic Japanese analysis of a Japanese question message according to an embodiment of the present invention.

【図１６】本発明の一実施例の意味構造パターン辞書の
エントリの例を示す図である。FIG. 16 is a diagram illustrating an example of an entry of a semantic structure pattern dictionary according to an embodiment of the present invention.

【図１７】本発明の一実施例の意味構造への変換の例を
示す図である。FIG. 17 is a diagram illustrating an example of conversion to a semantic structure according to an embodiment of the present invention.

【図１８】本発明の一実施例の参照表現解消制御部の動
作を示すフローチャートである。FIG. 18 is a flowchart illustrating an operation of a reference expression cancellation control unit according to an embodiment of the present invention.

【図１９】本発明の一実施例の参照表現解消制御部の動
作を示すフローチャートである。FIG. 19 is a flowchart illustrating an operation of a reference expression cancellation control unit according to an embodiment of the present invention.

【図２０】本発明の一実施例の参照表現解消制御テーブ
ルを示す図である。FIG. 20 is a diagram illustrating a reference expression elimination control table according to an embodiment of the present invention.

【図２１】本発明の一実施例の意味構造を示す図であ
る。FIG. 21 is a diagram showing a semantic structure of an embodiment of the present invention.

【図２２】本発明の一実施例の情報要求構造化部の動作
を示すフローチャートである。FIG. 22 is a flowchart showing the operation of the information request structuring unit according to one embodiment of the present invention.

【図２３】本発明の一実施例の統合処理の例を示す図で
ある。FIG. 23 is a diagram illustrating an example of an integration process according to an embodiment of the present invention.

【図２４】本発明の一実施例の統合できる例／統合不可
の例を示す図である。FIG. 24 is a diagram showing an example where integration is possible / an example where integration is not possible according to an embodiment of the present invention.

[Explanation of symbols]

１入力部，入力手段２文タイプ判定辞書３文タイプ判定部，文タイプ判定手段４情報要求文抽出部，情報要求文抽出手段５日本語基本解析部，日本語基本解析手段６意味構造パターン辞書７意味構造解析部，意味構造解析手段８参照表現解消制御テーブル９参照表現解消制御部，参照表現解消制御手段１０参照表現解消部，参照表現解消手段１１情報要求構造化部，情報要求構造化手段１２出力部，出力手段１３処理テーブル４１文タイプ起動手段 DESCRIPTION OF SYMBOLS 1 Input part, input means 2 Sentence type judgment dictionary 3 Sentence type judgment part, sentence type judgment means 4 Information request sentence extraction part, Information request sentence extraction means 5 Japanese basic analysis part, Japanese basic analysis means 6 Semantic structure pattern dictionary 7 semantic structure analysis unit, semantic structure analysis unit 8 reference expression elimination control table 9 reference expression elimination control unit, reference expression elimination control unit 10 reference expression elimination unit, reference expression elimination unit 11 information request structuring unit, information request structuring unit 12 output unit and output means 13 processing table 41 statement type activation means

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開昭63−106042（ＪＰ，Ａ) 特開平２−297157（ＪＰ，Ａ) 特開平５−233730（ＪＰ，Ａ) 太田竜男，質問応答システムの構築, ＦＵＪＩＴＳＵ，日本，富士通株式会社，1994年９月10日，第45巻第５号, 第436頁乃至第442頁 (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06F 17/30 ＪＩＣＳＴファイル（ＪＯＩＳ)────────────────────────────────────────────────── ─── Continuation of the front page (56) References JP-A-63-106042 (JP, A) JP-A-2-297157 (JP, A) JP-A-5-233730 (JP, A) Tatsuo Ota, Question and answer System construction, FUJITSU, Japan, Fujitsu Limited, September 10, 1994, Vol. 45, No. 5, 436 to 442 (58) Fields investigated (Int. Cl. ⁷ , DB name) G06F 17/30 JICST file (JOIS)

Claims

(57) [Claims]

1. A method of analyzing a Japanese question message input in a text format, the method comprising: analyzing a Japanese question message input in a text format; Paying attention to a specific expression character string, an information request sentence that describes a core part of the information request is extracted. When the extracted information request sentence includes a reference expression, the information request sentence The reference expression is resolved based on the characteristics of the information request statement, and when the plurality of extracted information request statements state an information request for the same object, the structures of the plurality of information request statements are integrated. A method of analyzing a Japanese question message, comprising structuring a main information request in the Japanese question message and outputting a structured information request sentence.

2. When extracting the information request sentence, an information request which is a functional role played by a Japanese sentence constituting the Japanese question message, a sentence type of a goal statement which is a user's purpose, and the information Referencing a sentence type determination dictionary in which a combination of a request and an expression string pattern for determining the goal statement is stored in advance, and collating the character string to determine whether the Japanese question message corresponds to a specified sentence type. The method according to claim 1, wherein a sentence having an information request function is extracted from the Japanese question message.

3. When the information request sentence is extracted, morpheme / syntax analysis is performed on the extracted information request sentence, and a unique semantic structure pattern is previously stored for each entry of the Japanese question message. 2. The Japanese question message analysis according to claim 1, wherein the entry is compared with an analysis result of the information request sentence that has been subjected to morphological / syntactic analysis with reference to the semantic structure pattern dictionary. Method.

4. When resolving the reference expression, a pattern of a reference expression that can be included in the information request sentence in the Japanese question message, a search direction for searching for a reference destination of the reference expression, and a search target. The reference sentence type of the Japanese sentence and the linguistic expression unit of the reference expression are referenced in advance to determine whether or not the sentence type of the Japanese sentence to be searched corresponds to the specified sentence type with reference to the reference expression resolution control table. 2. The Japanese question message analysis method according to claim 1, wherein, in a corresponding case, the reference expression resolution processing is controlled.

5. A Japanese-language question message analyzing apparatus for analyzing a Japanese-language question message input in a text format, comprising: input means for inputting the Japanese-language question message; An information request sentence extracting unit that extracts an information request sentence that describes a core part of an information request by focusing on a specific expression character string of the message; and the information request sentence extracted by the information request sentence extraction unit. When a sentence includes a reference expression, a reference expression resolving means for resolving the reference expression on the basis of the characteristics of the information request sentence, and a plurality of extracted information request sentences describing an information request for the same object. Information request structuring means for unifying the structure of the plurality of information request sentences to structure a main information request in the Japanese question message; Japanese inquiry message analyzing apparatus characterized by an output means for outputting the information request statements that have been integrated by Zoka means.

6. The information request sentence extracting means includes: an information request which is a functional role played by a Japanese sentence constituting the Japanese question message and a sentence type of a goal statement which is a user's purpose; A sentence type determination dictionary in which a combination of expression character string patterns for determining a goal statement is stored in advance, and referring to the sentence type determination dictionary, whether the Japanese question message corresponds to a specified sentence type And a sentence type determination activating unit that activates the sentence type determination unit and extracts only an information request sentence having an information request function from the Japanese question message. The Japanese question message analyzer according to claim 5.

7. A Japanese basic analysis means for performing morpheme / syntax analysis on the information request sentence extracted by the information request sentence extraction means, and a unique semantic structure pattern for each of the Japanese question messages for each entry. And referring to the semantic structure pattern dictionary stored in advance, and comparing the entry with the analysis result of the information request sentence analyzed by the Japanese basic analysis means, 6. The Japanese question message analysis device according to claim 5, further comprising a semantic structure analyzing means for obtaining the semantic structure of the Japanese question message.

8. The reference expression resolving means includes: a reference expression pattern that can be included in the information request sentence in the Japanese question message; a search direction for searching for a reference destination of the reference expression; Using the reference expression elimination control table pre-stored the sentence type of the word sentence and the linguistic expression unit of the reference expression, the reference expression elimination control table, and the sentence type determination means, identify the characteristics of the object of the information request. 5. The Japanese-language question message analysis device according to claim 4, further comprising: reference-expression-resolving control means for controlling a reference-expression resolving process.