JPH05342258A

JPH05342258A - Natural language processing system

Info

Publication number: JPH05342258A
Application number: JP4150389A
Authority: JP
Inventors: Hisao Mase; 久雄間瀬; Hiroyuki Kinukawa; 博之絹川; Hiroshi Tsuji; 洋辻; Satoru Wakayama; 哲和歌山
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1992-06-10
Filing date: 1992-06-10
Publication date: 1993-12-24

Abstract

PURPOSE:To provide a natural language processing system which can prevent the execution of unnecessary operation caused by responsing an erroneous meaning analytic result and can prevent an input which is unfair for the system. CONSTITUTION:The system selects/extracts corresponding message information from a table storing message information which should be displayed for a user based on word data of a part specified its failure by meaning analysis and information expressing the meaning analytic part at the time of natural language analysis, prepares a message by burying word data into message information and alarms it. Thus, the user can confirm the meaning analytic result before the execution of the operation so that the execution of the unnecessary operation can be prevented. Besides, even when the operation corresponding to the meaning analytic result is not allowed by the system, the user can know it before the operation so that the input which is unfair for the system can be prevented.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、入力される自然語の示
す意味に対応して動作する自然語処理システムに関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a natural language processing system that operates in accordance with the meaning of an input natural language.

【０００２】[0002]

【従来の技術】入力される自然語の示す意味に対応して
動作する自然語処理システムでは、自然語の意味解析の
内容，結果に関する情報を利用者に報知することによっ
て、利用者の作業効率を向上させることができる。2. Description of the Related Art In a natural language processing system that operates in accordance with the meaning of an input natural language, the efficiency of the user's work is improved by notifying the user of the content and result of the semantic analysis of the natural language. Can be improved.

【０００３】これに関連する技術としては、例えば、特
開昭６３−３１７８７９号公報に記載されているよう
に、第１言語から第２言語へ文章を翻訳した結果を表示
する際に、係り受け解析結果を括弧等を用いて明示する
ことによって、係り受け解析結果を利用者に把握させる
技術がある。As a technique related to this, for example, as described in Japanese Patent Laid-Open No. 63-317879, when displaying the result of translating a sentence from a first language to a second language, a dependency is displayed. There is a technique that allows the user to understand the dependency analysis result by clearly indicating the analysis result using parentheses or the like.

【０００４】また、特開昭６３−１０３３７９号公報に
記載されているように、第１言語を第２言語に変換する
際に、翻訳結果だけでなく、入力した第１言語あるいは
翻訳結果である第２言語の出力に対応させて、第１言語
および第２言語に関する情報を付加情報として出力する
技術がある。Also, as described in Japanese Patent Laid-Open No. 63-103379, when converting the first language into the second language, not only the translation result but also the input first language or translation result. There is a technique of outputting information about the first language and the second language as additional information corresponding to the output of the second language.

【０００５】さらに、特開平２−１２０９７６号公報に
記載されるように、自然語入力にたいして、入力された
自然語の意味を確認するために確認文を生成し、表示す
る技術がある。Further, as described in Japanese Patent Application Laid-Open No. 2-120976, there is a technique for generating and displaying a confirmation sentence in order to confirm the meaning of the input natural language with respect to the input of the natural language.

【０００６】[0006]

【発明が解決しようとする課題】上記従来技術では、シ
ステムの動作結果を表示する時に係り受け解析結果等の
付加情報を併せて出力するだけなので、入力された自然
語の意味解析結果が誤っている場合でも（不要な）動作
（翻訳）を実行してしまい、その結果、利用者の作業効
率を低下させるという問題がある。In the above-mentioned prior art, since the additional information such as the dependency analysis result is also output together when the operation result of the system is displayed, the input semantic analysis result of the natural language is erroneous. Even if they do, (unnecessary) actions (translations) are executed, and as a result, there is a problem that the work efficiency of the user is reduced.

【０００７】また、入力された自然語の意味解析結果に
対応した動作がシステムで許容されない場合、その旨を
利用者に報知する手段を保持していないため、システム
にとっての不正入力を未然に防止できないという問題が
ある。Further, when the system does not allow the operation corresponding to the result of the semantic analysis of the input natural language, the system does not have a means for notifying the user of the result, so that the system is prevented from illegal input. There is a problem that you can not.

【０００８】さらに、特開平２−１２０９７６号公報に
記載される技術においては、確認文の生成のベースとな
る意味解析は正しい結果を得る、または強引に解析結果
を得るという前提に立っており、結果として意味解析が
十分に行われなかった場合でも十分でないことが意味解
析処理において意識されていない。これにより、確認文
を表示され、確認を求められたユーザは入力自然語文の
更新に当たってどのように修正すれば良いかが分からな
いという状態が発生する。Further, in the technique disclosed in Japanese Patent Laid-Open No. 2-120976, it is premised that the semantic analysis, which is the basis of the confirmation sentence generation, obtains a correct result or forcibly obtains the analysis result. As a result, even if the semantic analysis is not performed sufficiently, it is not conscious of the insufficientness in the semantic analysis processing. As a result, the confirmation sentence is displayed, and the user who is requested to confirm does not know how to correct the input natural language sentence when updating it.

【０００９】本発明の目的は、誤った意味解析結果に応
じた不要な動作の実行を防ぐことのできる自然語処理シ
ステムを提供することにある。An object of the present invention is to provide a natural language processing system capable of preventing the execution of an unnecessary operation according to the result of incorrect semantic analysis.

【００１０】本発明の他の目的は、システムにとって不
正な入力を防止することのできる自然語処理システムを
提供することにある。Another object of the present invention is to provide a natural language processing system capable of preventing illegal input for the system.

【００１１】[0011]

【課題を解決するための手段】本発明によれば、入力し
た自然語の意味解析で解析不良と特定される部分に関し
て、特定された部分の自然語を含むメッセージを出力す
る出力手段を設け、その出力メッセージに対する応答入
力に対応して動作する自然語処理システムが提供され
る。According to the present invention, there is provided output means for outputting a message including a natural language of a specified part regarding a part specified as an analysis failure by the semantic analysis of the input natural language. A natural language processing system that operates in response to a response input to the output message is provided.

【００１２】また、本発明によれば、入力した自然語の
解析結果に対応した動作がシステムで許容されないこと
を検出する手段と、その検出結果に応答して、入力した
自然語の示す意味に対応する動作が許容されないことを
報知する手段とを備えた自然語処理システムが提供され
る。Further, according to the present invention, means for detecting that the system does not allow the operation corresponding to the analysis result of the input natural language, and the meaning indicated by the input natural language in response to the detection result. A natural language processing system including means for notifying that the corresponding action is not permitted.

【００１３】さらに、本発明によれば、自然語の意味解
析が失敗する可能性のある解析箇所を認定する手段と、
その解析箇所に関しての意味解析失敗の原因，その原因
に対してシステムがどのような解析をしたかという解析
方法，意味解析の失敗を防ぐために利用者がすべき対処
方法に関する情報を出力する手段とを備えた自然語処理
システムが提供される。Further, according to the present invention, means for recognizing an analysis point where the semantic analysis of natural language may fail,
A means for outputting information about the cause of the failure of the semantic analysis regarding the analysis location, the analysis method of what kind of analysis the system performed for the cause, and the coping method that the user should take to prevent the failure of the semantic analysis. A natural language processing system is provided.

【００１４】[0014]

【作用】入力された自然語の意味解析で特定される部分
の自然語を含むメッセージとしては、例えば次の内容の
ものが考えられる。The message including the natural language of the part specified by the semantic analysis of the input natural language may have the following contents, for example.

【００１５】（１）自然語の意味解析が失敗したことを
表したもの（２）意味解析のどの箇所で失敗したかを表したもの（３）何が原因で意味解析が失敗したか（解析失敗原
因）を表したもの（４）意味解析の失敗原因に対してシステムがどのよう
な解析処理をしたか（解析方法）を表したもの（５）意味解析の失敗を防ぐために利用者がすべき処置
内容（対処方法）を表したもの（６）意味解析は成功したが、その意味解析結果に対応
する動作がシステムで許容できないことを表したもの（７）意味解析結果に対応する動作がシステムで許容で
きない場合に、利用者がすべき処置を表したもの入力した自然語の意味解析結果に対応する動作を実行す
る前に、上述の（１）〜（７）等のメッセージを出力す
ることにより、利用者は意味解析の不良を確認でき、ま
た、必要に応じて意味解析結果を修正できるので、誤っ
た意味解析結果に応じた不要な動作の実行を防ぐことが
でき、利用者の作業効率が向上する。また、意味解析結
果に対応する動作がシステムで許容されない場合でも、
出力メッセージを参照することにより、その旨を動作前
に知ることができる。また、それに対応する処置に関す
るメッセージを得ることができるので、システムにとっ
ての不正入力を防止することができる。(1) A failure of the semantic analysis of natural language (2) A failure of the semantic analysis (3) What caused the failure of the semantic analysis (analysis (4) Representation of failure cause (4) Representation of what kind of analysis processing (analysis method) was performed by the system for the cause of failure of semantic analysis (5) User's action to prevent failure of semantic analysis (6) Semantic analysis was successful, but the system indicated that the action corresponding to the semantic analysis result was not acceptable (7) The action corresponding to the semantic analysis result If the system does not allow it, it indicates the action to be taken by the user. Before executing the operation corresponding to the result of the semantic analysis of the input natural language, output the messages such as (1) to (7) above. By doing so, the user Since the analysis failure can be confirmed and the semantic analysis result can be corrected if necessary, it is possible to prevent the unnecessary operation from being performed according to the incorrect semantic analysis result, and improve the work efficiency of the user. In addition, even if the system does not allow the operation corresponding to the semantic analysis result,
By referring to the output message, it is possible to know that before the operation. In addition, since a message regarding the corresponding treatment can be obtained, it is possible to prevent illegal input to the system.

【００１６】[0016]

【実施例】以下、本発明の実施例を詳細に説明する。EXAMPLES Examples of the present invention will be described in detail below.

【００１７】本発明は、日本語に限らず英語、ドイツ語
等の自然語を用いたシステムに適用可能であるが、ここ
では日本語によるデータベース検索システム（以下、単
にシステムと呼ぶ。）を例として説明する。本実施例に
おいて、利用者から計算機に入力されるのは、主として
データベースの検索を指示する日本語文であり、一般に
（例外もある。）検索対象と、その検索対象に対する検
索条件とを含んだ日本語文である。The present invention is applicable not only to Japanese but also to systems using natural languages such as English and German. Here, a database search system in Japanese (hereinafter simply referred to as a system) is used as an example. As described below. In the present embodiment, the user inputs to the computer mainly Japanese sentences instructing the search of the database, and generally (including exceptions) a search target and a search condition for the search target in Japan. It is a word sentence.

【００１８】図１は、本実施例の処理概要を示したもの
である。FIG. 1 shows an outline of processing of this embodiment.

【００１９】先ず、利用者に日本語を入力させる
（１）。本実施例では、キーボードから日本語を入力す
る。日本語入力１においては、結果的に日本語文字列１
ａをシステムに入力できれば良く、したがって、日本語
の入力装置としては、キーボードの他に、手書き文字入
力装置、音声入力装置などでも容易に代替可能である。First, the user is prompted to input Japanese (1). In this embodiment, Japanese is input from the keyboard. Japanese input 1 results in Japanese character string 1
It suffices that a can be input to the system, and therefore, as the Japanese input device, a handwritten character input device, a voice input device, etc. can be easily substituted in addition to the keyboard.

【００２０】次に、日本語解析処理部２において、単語
辞書１２と、対象とするデータベース１６の構成情報を
定義した操作指示対象定義情報格納テーブル１３とを参
照して、入力された日本語文字列１ａを解析する。即
ち、日本語文字列１ａを構成する語（単語、熟語、文節
など）を識別し、日本語文字列１ａを構成する各語の意
味（品詞）を認定し、文節の修飾関係を認定し、日本語
文字列１ａの解析結果を表す中間表現２ａを生成する。Next, the Japanese analysis processing unit 2 refers to the word dictionary 12 and the operation instruction target definition information storage table 13 defining the configuration information of the target database 16 to input the input Japanese character. Analyze column 1a. That is, the words (words, phrases, phrases, etc.) that make up the Japanese character string 1a are identified, the meaning (part of speech) of each word that makes up the Japanese character string 1a is identified, and the qualifying relations of the phrases are identified. An intermediate expression 2a representing the analysis result of the Japanese character string 1a is generated.

【００２１】本実施例では、入力された自然語の意味解
析で特定される部分の自然語を含むメッセージとして、
次の３種類を考える。In this embodiment, as a message containing a natural language of a part specified by the semantic analysis of the input natural language,
Consider the following three types.

【００２２】（１）意味解析が失敗した部分について、
その原因が何であるかという「解析失敗原因」を表した
もの。(1) Regarding the part where the semantic analysis fails,
What expresses the "cause of analysis failure", which is the cause.

【００２３】（２）意味解析の失敗原因（上記（１））
に対してシステムがどのような解析をしたかという「解
析方法（解析方針）」を表したもの。(2) Cause of failure in semantic analysis ((1) above)
"Analysis method (analysis policy)" that indicates what kind of analysis the system performed.

【００２４】（３）意味解析の失敗を防ぐために利用者
がどのような処置をすればよいのかという「対処方法」
を表したもの。(3) "Handling method" of what the user should do to prevent the failure of the semantic analysis
Which represents.

【００２５】上記（１）の「意味解析が失敗した部分」
に関して、本実施例では、解析が失敗する可能性のある
解析箇所として、次の２箇所を考える。"Part where semantic analysis has failed" in (1) above
Regarding this, in the present embodiment, the following two points are considered as the analysis points where the analysis may fail.

【００２６】（ａ）入力された自然語を構成する単語に
多義が存在し、前後の単語の意味からはその意味を一義
に決定不能な場合。(A) In the case where there are ambiguous meanings in the words constituting the input natural language and the meanings of the preceding and following words cannot be uniquely determined.

【００２７】（ｂ）入力された自然語の中に、単語辞書
１２に定義されていない未知語が存在する場合。(B) A case where an unknown word which is not defined in the word dictionary 12 exists in the input natural language.

【００２８】一方、本実施例では、入力された自然語の
意味解析結果に対応する動作がシステムで許容できない
場合の例として、（ｃ）検索対象を検出できないために、このままでは、
解析結果を検索コマンドに変換できない場合を考える。On the other hand, in the present embodiment, as an example of the case where the system does not allow the operation corresponding to the input semantic analysis result of the natural language, (c) the search target cannot be detected.
Consider the case where the analysis result cannot be converted into a search command.

【００２９】本実施例では、解析が失敗する可能性のあ
る解析箇所の認定，あるいは意味解析結果に対応する動
作がシステムで許容されないことの認定を、解析処理プ
ログラム内に手続き処理的に定義することで実現する。
即ち、処理プログラムにおいて、ある特定のステップに
至ったならば、その解析が失敗している可能性がある，
あるいは解析結果がシステムで許容されないとシステム
は認定する。そして、そのステップにおいて、語データ
と、その解析箇所に対応して定義したフラグ（以下、
「解析コード（１４０１）」と呼ぶ。）とを解析状況デ
ータへ格納する処理を実行する。この方法は、解析箇所
の認定がプログラムとして実現されているので、保守が
困難な反面、処理速度の高速化が期待できる。In the present embodiment, the qualification of the analysis location where the analysis may possibly fail, or the certification that the system does not permit the operation corresponding to the result of the semantic analysis, is defined in the analysis processing program in a procedural manner. It will be realized.
That is, if a certain step is reached in the processing program, the analysis may have failed.
Alternatively, the system certifies that the analysis result is not acceptable by the system. Then, in that step, the word data and the flag defined corresponding to the analysis location (hereinafter,
It is called “analysis code (1401)”. ) And are stored in the analysis status data. In this method, since the certification of the analysis point is realized as a program, it is difficult to maintain, but it can be expected to increase the processing speed.

【００３０】解析が失敗する可能性のある解析箇所を認
定する他の方法としては、処理プログラムとは独立して
解析ルールを宣言的に定義し、実際の解析において、そ
れらの解析ルールを適用し、ある解析ルールの適用が成
功した場合、あるいは、どの解析ルールの適用も成功し
なかった場合に、その解析が失敗している可能性がある
と認定する方法がある。この方法は、処理速度は若干遅
くなるものの、解析ルールの追加、削除、修正といった
データ保守作業が容易である。As another method of recognizing the analysis points where the analysis may fail, the analysis rules are declaratively defined independently of the processing program, and those analysis rules are applied in the actual analysis. There is a method of recognizing that the analysis may have failed if the application of a certain analysis rule is successful or if the application of any analysis rule is not successful. Although this method slightly slows down the processing speed, it is easy to perform data maintenance work such as addition, deletion, and correction of analysis rules.

【００３１】さらに他の方法としては、解析に失敗する
ための条件をテーブル等に予め宣言的に記述しておき、
解析において、それらの条件を一つ一つ照合させ、条件
が満たされている場合に、その条件に相当する解析が失
敗している可能性があると認定する方法がある。この方
法もデータの保守作業は容易である。As another method, conditions for failing the analysis are declaratively described in a table in advance,
In the analysis, there is a method in which those conditions are collated one by one, and when the conditions are satisfied, it is determined that the analysis corresponding to the conditions may have failed. This method also facilitates data maintenance work.

【００３２】次に、解析結果表示部３において、中間表
現２ａに基づいて、表示装置に解析結果を表示する。本
実施例では、利用者に解析結果を理解しやすいものにす
るために、中間表現２ａを表形式に展開した「中間表現
展開テーブル」を表示する。もちろん、利用者への解析
結果の表示方法は、この方法の他にも、木構造形式によ
る表示方法、ネットワーク形式による表示方法、述語論
理形式を用いた表示方法、あるいは自然語形式による表
示方法でも容易に代替可能である。自然語形式による解
析結果の生成や表示については、特開平２−１２０９７
６号公報に記載されているようにすることが一方法であ
る。Next, the analysis result display unit 3 displays the analysis result on the display device based on the intermediate representation 2a. In the present embodiment, in order to make it easier for the user to understand the analysis result, an "intermediate expression expansion table" in which the intermediate expression 2a is expanded in a table format is displayed. Of course, other than this method, the display method of the analysis result to the user may be a display method in a tree structure format, a display method in a network format, a display method in a predicate logic format, or a display method in a natural language format. It can be easily replaced. Regarding the generation and display of the analysis result in the natural language format, JP-A-2-12097.
One method is to make it as described in Japanese Patent Publication No.

【００３３】利用者は、表示された解析結果を参照し、
自分の意図した通りにシステムが解析したかを確認す
る。解析結果が利用者の意図するものでない場合、即
ち、自然語の解析に失敗した場合、利用者は、解析が失
敗する可能性のある解析箇所における「解析失敗原
因」，解析箇所における「解析方法」，ならびに解析失
敗への「対処方法」（以下、まとめて「自然語解析状
況」と呼ぶ。）を必要に応じて参照することができる
（４）。もちろん、解析結果が利用者の意図するもので
あった場合でもこれらの自然語解析状況を参照しても構
わない。The user refers to the displayed analysis result,
Make sure the system parsed as you intended. If the analysis result is not intended by the user, that is, if the analysis of the natural language fails, the user may select "Cause of analysis failure" at the analysis point where analysis may fail and "Analysis method" at the analysis point. , And “handling method” for the analysis failure (collectively referred to as “natural language analysis status”) can be referred to as necessary (4). Of course, even if the analysis result is intended by the user, these natural language analysis situations may be referred to.

【００３４】解析が失敗する可能性のある各解析箇所に
対応させて、その解析失敗原因，その解析箇所における
解析方法（解析方針），解析失敗への対処方法に関する
出力データを予めメッセージデータテーブル１５に格納
しておく。出力メッセージは、後述するように、解析状
況データ１４に記述された、解析が失敗した解析箇所を
表す解析コード１４０１に対応するメッセージデータテ
ーブル１５内のメッセージ情報を抽出し、メッセージ情
報の可変部分にデータを代入することによって生成す
る。The message data table 15 is provided in advance with output data relating to the cause of the analysis failure, the analysis method (analysis policy) at the analysis location, and the coping method for the analysis failure in association with each analysis location where the analysis may fail. Stored in. As will be described later, the output message extracts the message information in the message data table 15 corresponding to the analysis code 1401 described in the analysis status data 14 and representing the analysis location where the analysis has failed, and extracts it in the variable portion of the message information. It is generated by substituting data.

【００３５】ここで、「解析失敗原因」とは、利用者の
意図と異なる解析をする直接の要素と考えられるもので
ある。本実施例では、（１）入力文を構成する単語に多
義が存在する、（２）入力文の中に、辞書に未登録の未
知語が出現する、ということに相当する。Here, the "cause of analysis failure" is considered to be a direct element that performs an analysis different from the user's intention. In the present embodiment, (1) the words that make up the input sentence have polysemy, and (2) the unknown words that have not been registered in the dictionary appear in the input sentence.

【００３６】また、「解析方法（解析方針）」とは、シ
ステムが実際にどのような解析をするのかという具体的
な手法を表す。本実施例では、（１）単語辞書に先に記
述されている方の意味として解釈する、（２）辞書に未
登録の未知語は、無視して解析する、などといった解析
方法に相当する。"Analysis method (analysis policy)" represents a concrete method of what kind of analysis the system actually performs. This embodiment corresponds to an analysis method such as (1) interpreting as the meaning previously described in the word dictionary, and (2) ignoring and analyzing an unknown word not registered in the dictionary.

【００３７】さらに、「対処方法」とは、仮にその解析
箇所が原因で解析が失敗したと仮定した場合に、次回以
降の解析で、同じ解析失敗をさせないために利用者がシ
ステムに対して行うべき具体的な処置ガイダンスであ
る。本実施例では、（１）単語辞書の意味の記述順序を
変更するか、単語の意味を一義に登録するか、デー
タベースのカラム名に相当する単語とカラム値に相当す
る単語を対にして記述する、（２）未知語を辞書に
登録する、などといった処置に相当する。Further, the "handling method" means that, if it is assumed that the analysis fails due to the analysis location, the user performs the same analysis in the next and subsequent analyzes to prevent the same analysis from failing. It should be concrete treatment guidance. In the present embodiment, (1) the description order of the meanings in the word dictionary is changed, the meanings of the words are uniquely registered, or the word corresponding to the column name of the database and the word corresponding to the column value are described as a pair. Yes, and (2) registering an unknown word in the dictionary.

【００３８】利用者が自然語解析状況の表示を指示した
場合は、さらに、その中で、解析失敗原因に関する情報
を表示するか（５）、解析失敗箇所における解析方法に
関する情報を表示するか（６）、解析失敗への対処方法
に関する情報を表示するか（７）を選択する。ステップ
５、６、７のいずれかにおいて、自然語解析状況を表示
することを選択した場合は、解析状況メッセージ生成・
表示処理部８において、解析が失敗する可能性のある解
析箇所における解析で使用した語データとその解析箇所
に対応する解析コード１４０１を格納した解析状況デー
タ１４と、表示すべき自然語メッセージ文字列情報を格
納したメッセージデータテーブル１５を参照して、自然
語解析状況を表す自然語文字列を生成し、利用者の表示
要求内容に応じて表示する。即ち、解析状況データ１４
に記述された解析コード１４０１に対応するメッセージ
データテーブル１５内のメッセージ情報を抽出し、メッ
セージ情報の可変部分にデータを代入することによって
生成する。本実施例では、利用者への表示は表示装置の
画面表示としているが、他に音声出力などによっても容
易に代替可能である。When the user gives an instruction to display the natural language analysis status, whether to display the information about the cause of the analysis failure (5) or the information about the analysis method at the analysis failure portion in that ((5) 6) Select whether to display information on how to deal with the analysis failure (7). If you have selected to display the natural language analysis status in any of steps 5, 6 and 7, generate an analysis status message.
In the display processing unit 8, the word data used in the analysis at the analysis location where the analysis may possibly fail and the analysis status data 14 storing the analysis code 1401 corresponding to the analysis location, and the natural language message character string to be displayed. With reference to the message data table 15 storing the information, a natural language character string representing the natural language analysis status is generated and displayed according to the display request content of the user. That is, the analysis status data 14
It is generated by extracting the message information in the message data table 15 corresponding to the analysis code 1401 described in 1. and substituting the data into the variable part of the message information. In the present embodiment, the display to the user is the screen display of the display device, but it can be easily replaced by voice output or the like.

【００３９】本実施例では、自然語解析状況に関する自
然語メッセージデータはメッセージデータテーブル１５
に予め定義する。従って、解析状況データ１４に格納し
た語データと、メッセージデータテーブル１５に定義さ
れたメッセージ情報に基づいて、自然語解析状況に関す
る自然語メッセージを生成・表示することが可能であ
る。もちろん、後述するように、解析が失敗する可能性
のある解析箇所としては、これらの他にも、文節の係り
受け、検索条件の係り受けなどがあり、これらについて
も、解析で使用したデータを格納し、メッセージデータ
テーブル１５に定義したメッセージ情報に基づいて自然
語解析状況に関する情報を表示することが可能である。In this embodiment, the natural language message data relating to the natural language analysis situation is stored in the message data table 15.
To be defined in advance. Therefore, it is possible to generate and display the natural language message regarding the natural language analysis status based on the word data stored in the analysis status data 14 and the message information defined in the message data table 15. Of course, as will be described later, there are bunsetsu dependency, search condition dependency, etc. in addition to these, as analysis points where analysis may fail. It is possible to store and display the information about the natural language analysis status based on the message information defined in the message data table 15.

【００４０】次に、利用者は表示された解析結果および
自然語メッセージ情報を参照して、解析結果を修正する
か否かを選択する（９）。ここでいう「修正」には、未
知語を単語辞書１２に新たに登録したり、単語辞書１２
に登録されている単語に関する記述内容を更新したりす
る処理をも含むものとする。本実施例では、解析結果そ
のものを修正する場合、解析結果修正処理部１０におい
て、解析結果として表示された中間表現展開テーブルを
そのまま修正エディタとして用いる。即ち、利用者は修
正したい中間表現展開テーブルの要素を指定し、更新、
削除、追加などにより対応するデータに修正する。これ
は表示されるデータと内部に格納されているデータとを
対応付けることにより、容易に実現できる。解析結果の
修正技術は周知であり、本発明の本質ではないので、こ
れ以上詳述する必要はないであろう。もちろん、修正の
方法としては、この他に、木構造形式やネットワーク形
式や述語論理形式を表示してそれを修正させる方法や、
入力日本語文１ａを表示してそれを修正させる方法や、
解析結果を日本語文に変換した文を表示してそれを修正
させる方法などでも容易に代替可能である。Next, the user refers to the displayed analysis result and natural language message information and selects whether or not to correct the analysis result (9). In the "correction" here, an unknown word is newly registered in the word dictionary 12, or the word dictionary 12
It also includes a process of updating the description content regarding the word registered in. In the present embodiment, when the analysis result itself is modified, the analysis result modification processing unit 10 uses the intermediate expression expansion table displayed as the analysis result as it is as a modification editor. That is, the user specifies, updates,
Modify the corresponding data by deleting or adding. This can be easily realized by associating the displayed data with the data stored inside. The technique of modifying the analysis result is well known and is not the essence of the present invention, and thus it may not be necessary to further describe it in detail. Of course, as a modification method, in addition to this, a method of displaying a tree structure form, a network form, a predicate logical form and correcting it,
How to display the input Japanese sentence 1a and correct it,
It is also possible to easily substitute a method of displaying a sentence obtained by converting the analysis result into a Japanese sentence and correcting the sentence.

【００４１】次に、コマンド生成・実行処理部１１にお
いて、中間表現２ａを操作指示命令言語に変換し、デー
タベース１６に対して変換した操作指示命令を実行し、
実行結果を表示する。Next, in the command generation / execution processing unit 11, the intermediate representation 2a is converted into an operation instruction command language, and the converted operation instruction command is executed to the database 16,
Display the execution result.

【００４２】図１から明らかなように、日本語解析処理
部２、解析結果表示処理部３、解析状況メッセージ生成
・表示処理部８、解析結果修正処理部１０、コマンド生
成・実行処理部１１は処理を示し、単語辞書１２、操作
指示対象定義情報格納テーブル１３、解析状況データ１
４、メッセージデータテーブル１５、データベース１６
はファイル（テーブルとも呼ぶ）を示している。As is apparent from FIG. 1, the Japanese analysis processing unit 2, the analysis result display processing unit 3, the analysis status message generation / display processing unit 8, the analysis result correction processing unit 10, and the command generation / execution processing unit 11 are The processing is shown, and the word dictionary 12, the operation instruction target definition information storage table 13, the analysis status data 1
4, message data table 15, database 16
Indicates a file (also called a table).

【００４３】このように、本実施例によれば、各機能は
ブロック化（モジュール化）されており、プログラム
（ソフトウエア）論理により実現する場合のモジュラリ
ティの向上、ハードウエア論理により実現する場合の各
機能毎の回路モジュラリティの向上やＬＳＩ化が可能と
なり、システムの設計、保守などが容易になる。As described above, according to this embodiment, each function is divided into blocks (modularization), and the modularity is improved when it is realized by program (software) logic, and when it is realized by hardware logic. It is possible to improve the circuit modularity of each function and to make it into an LSI, which facilitates system design and maintenance.

【００４４】本実施例で使用する各種ファイル（テーブ
ル）の構成およびその要素について説明する。The structure and elements of various files (tables) used in this embodiment will be described.

【００４５】まず、単語辞書１２の構成について説明す
る。単語辞書１２は、図１２に示すように、「語幹（１
２０１）」、「正規語（１２０２）」、「項目属性（１
２０３）」、および「意味コード（１２０４）」から構
成される。First, the structure of the word dictionary 12 will be described. As shown in FIG. 12, the word dictionary 12 includes “word stem (1
201) ”,“ regular word (1202) ”,“ item attribute (1
203) ”and“ semantic code (1204) ”.

【００４６】「語幹（１２０１）」とは、それ自身で何
らかの概念を表す語である。The "word stem (1201)" is a word that expresses some concept by itself.

【００４７】「正規語（１２０２）」とは、データベー
ス１６のカラム名あるいはカラム値と１対１に対応させ
た自然語文字列を表す。正規語１２０２を定義しておく
ことにより、データベース１６において同じ位置のカラ
ム名あるいはカラム値に対応する異なった言語表現（例
えば、「販売価格」を表すカラム名に対して「値段」、
「価格」といった言語表現）についても、それらの言語
表現を正規語に置き換えて解析することにより、言語表
現とデータベース１６の構成要素とを正確に対応付ける
ことができる。The "regular word (1202)" represents a natural language character string that has a one-to-one correspondence with a column name or column value in the database 16. By defining the regular word 1202, different language expressions corresponding to column names or column values at the same position in the database 16 (for example, “price” for a column name representing “sale price”,
With respect to (language expressions such as “price”), the language expressions can be accurately associated with the constituent elements of the database 16 by replacing the language expressions with regular words and analyzing the expressions.

【００４８】「項目属性（１２０３）」とは、検索命令
の対象世界における性格を分類したものであり、データ
ベース１６のカラム名に相当する「属性項目名」、デー
タベース１６のカラム値に相当する「属性項目デー
タ」、属性項目名と属性項目データとの論理指示関係を
表す「論理指示語」、および論理積または論理和を表す
「論理演算語」などがある。また、日本語解析２におい
て、単語辞書１２に未登録の単語を検出したときには、
その単語の項目属性を「未知語」とする。The "item attribute (1203)" is a classification of the character in the target world of the search command, and is an "attribute item name" corresponding to the column name of the database 16 and a "column value" of the database 16. There are “attribute item data”, “logical instruction word” indicating a logical instruction relationship between the attribute item name and attribute item data, and “logical operation word” indicating a logical product or a logical sum. Further, in the Japanese analysis 2, when an unregistered word is detected in the word dictionary 12,
The item attribute of the word is “unknown word”.

【００４９】「意味コード（１２０４）」とは、検索命
令の対象世界におけるその単語の位置付けを表したもの
である。データベース１６を構成するカラムと意味コー
ド１２０４とは１対１に対応する。データベース１６を
構成する各カラムの位置と意味コード１２０４との対応
関係は、次に述べる操作指示対象定義情報格納テーブル
１３で定義する。The "semantic code (1204)" represents the position of the word in the target world of the search command. The columns constituting the database 16 and the meaning codes 1204 have a one-to-one correspondence. The correspondence between the position of each column forming the database 16 and the meaning code 1204 is defined in the operation instruction target definition information storage table 13 described below.

【００５０】次に、図１３に示す操作指示対象定義情報
格納テーブル１３の構成について説明する。操作指示対
象定義情報格納テーブル１３は、データベース１６の構
成情報を定義したテーブルであり、「属性項目（１３０
１）」、「意味コード（１３０２）」、「所属テーブル
（１３０３）」、および「デフォルト対象指定項目（１
３０４）」から構成される。Next, the structure of the operation instruction target definition information storage table 13 shown in FIG. 13 will be described. The operation instruction target definition information storage table 13 is a table in which the configuration information of the database 16 is defined.
1) ”,“ semantic code (1302) ”,“ affiliation table (1303) ”, and“ default target designation item (1
304) ”.

【００５１】「属性項目（１３０１）」とは、データベ
ース１６を構成するカラム名に相当する正規語文字列を
定義した項目である。The "attribute item (1301)" is an item defining a regular word character string corresponding to a column name forming the database 16.

【００５２】「意味コード（１３０２）」は単語辞書１
２の「意味コード（１２０４）」と同義である。ここで
は、データベース１６の各カラムに１対１に対応する意
味コード１３０２が定義される。ここで注目すべきこと
は、意味コード１３０２の最初の１文字目のアルファベ
ットは、所属するテーブルを表しており、意味コード３
０２の残りの数字列は所属するテーブルにおけるカラム
位置を表しているということである。このように、意味
コード１３０２は、データベース１６における位置情報
を表しており、後述する多義解消処理２０２では、この
位置情報に基づいて単語の意味を一義に決定する。な
お、これらの内容は単語辞書１２の「意味コード（１２
０４）」にも共通する。"Semantic code (1302)" is the word dictionary 1
This is synonymous with “Semantic code (1204)” of 2. Here, a meaning code 1302 corresponding to each column of the database 16 is defined. It should be noted here that the first alphabet of the meaning code 1302 represents the table to which it belongs, and the meaning code 3
It means that the remaining number string of 02 represents the column position in the table to which it belongs. In this way, the meaning code 1302 represents the position information in the database 16, and in the ambiguous resolution processing 202 described later, the meaning of the word is uniquely determined based on this position information. In addition, these contents are “semantic code (12
04) ”.

【００５３】「所属テーブル（１３０３）」には、デー
タベース１６の各カラムの所属するテーブルの名称を表
す文字列が定義されている。A character string representing the name of the table to which each column of the database 16 belongs is defined in the "belonging table (1303)".

【００５４】「デフォルト対象指定項目（１３０４）」
とは、データベース１６の各テーブルにおいてその中心
をなすカラムを定義するフラグであり、各カラムは、そ
れぞれ整数「１」および「０」のどちらかの値を持つ。
入力自然語文１ａの解析において、検索対象を認定でき
なかった場合、デフォルト対象指定項目１３０４が
「１」であるカラムを検索対象とみなすことにより、検
索対象の省略された日本語文をも受理することが可能で
ある。"Default target designation item (1304)"
Is a flag that defines the central column in each table of the database 16, and each column has an integer value of "1" or "0".
In the analysis of the input natural language sentence 1a, when the search target cannot be recognized, the column in which the default target designation item 1304 is “1” is regarded as the search target, and the omitted Japanese sentence of the search target is also accepted. Is possible.

【００５５】次に、解析状況データ１４の構成について
説明する。解析状況データ１４は、解析が失敗する可能
性のある解析箇所に関する語データと、その解析箇所に
対応する解析コード１４０１とを格納するテーブルであ
る。本実施例では、前述の通り、一義に決定できない多
義語の存在、未知語の存在、および検索対象の不在の３
種類に関する自然語解析状況を利用者に表示する。しか
し、自然語解析状況を利用者に報知する際に必要となる
データは解析箇所の解析内容によって異なる。本実施例
においては、解析が失敗する可能性のある解析箇所それ
ぞれについて必要となるデータを格納するテーブルを定
義する。即ち、図３３、図３４、図３５に示すように、
解析状況データ１４を構成するテーブルを各解析箇所に
対して定義する。これらのテーブルの要素についての詳
細な説明は後述する。Next, the structure of the analysis status data 14 will be described. The analysis status data 14 is a table that stores word data related to an analysis point where analysis may fail and an analysis code 1401 corresponding to the analysis point. In the present embodiment, as described above, there are three meanings: existence of polysemous words that cannot be unambiguously determined, existence of unknown words, and absence of search target.
Display the natural language analysis status regarding types to the user. However, the data required for notifying the user of the natural language analysis status differs depending on the analysis content of the analysis location. In the present embodiment, a table that stores data required for each analysis location where analysis may fail is defined. That is, as shown in FIGS. 33, 34, and 35,
A table constituting the analysis status data 14 is defined for each analysis location. A detailed description of the elements of these tables is given below.

【００５６】次に、図３２に示すメッセージデータテー
ブル１５の構成について説明する。メッセージデータテ
ーブル１５は、解析が失敗する可能性のある解析箇所に
おける、自然語解析状況に関する自然語メッセージ情報
を格納したテーブルである。メッセージデータテーブル
１５は、「解析コード（１４０１）」、「メッセージモ
ード（１５０１）」、「メッセージテンプレート（１５
０２）」から構成される。本実施例では、図３２に示す
メッセージデータテーブル１５を用いて利用者に表示す
る自然語メッセージ情報を生成する。Next, the structure of the message data table 15 shown in FIG. 32 will be described. The message data table 15 is a table that stores natural language message information relating to the natural language analysis situation at an analysis location where analysis may fail. The message data table 15 includes “analysis code (1401)”, “message mode (1501)”, and “message template (15)”.
02) ”. In this embodiment, the natural language message information to be displayed to the user is generated using the message data table 15 shown in FIG.

【００５７】メッセージデータテーブル１５において、
「解析コード（１４０１）」とは、解析失敗箇所を識別
するためのコードである。「メッセージモード（１５０
１）」とは、そのメッセージの内容が、解析失敗原因、
解析方法、対処方法の３種類のどれに相当するものかを
識別するためのコードである。「メッセージテンプレー
ト（１５０２）」とは、利用者に自然語解析状況として
表示する自然語メッセージ情報の鋳型であり、解析内容
に依存しない固定文字列部分と、解析内容によって変化
する可変部分（図３２で、記号”＠”、”＄”、”％”
でそれぞれ挟まれた部分）で構成される。メッセージを
生成するときには、この鋳型の可変部分に、解析状況デ
ータ１４に格納されたデータを埋め込むことにより自然
語メッセージ情報を生成する。なお、図３２において、
可変部分に対応する文字列が自然語で定義されている
が、これは本実施例の説明を判り易くするためであり、
実際には、プログラムの実行において都合の良い文字列
（プログラムにおける変数名など）で記述し、代入すべ
きデータの認定を容易に行う。In the message data table 15,
The "analysis code (1401)" is a code for identifying the analysis failure location. "Message mode (150
1) ”means that the content of the message is the cause of the analysis failure,
It is a code for identifying which of the three methods, the analysis method and the coping method, corresponds. The "message template (1502)" is a template of natural language message information displayed to the user as the natural language analysis status, and includes a fixed character string portion that does not depend on the analysis content and a variable portion that changes depending on the analysis content (FIG. 32). And the symbols "@", "$", "%"
Part sandwiched between each). When a message is generated, the natural language message information is generated by embedding the data stored in the analysis situation data 14 in the variable part of this template. In addition, in FIG.
The character string corresponding to the variable part is defined in natural language, but this is to make the description of this embodiment easier to understand.
In actuality, a character string (such as a variable name in the program) that is convenient for executing the program is described, and the data to be substituted is easily identified.

【００５８】図２以下の処理の説明においては、必要に
応じて検索指示を表す具体的な日本語文を用いる。本実
施例で用いる具体的日本語文は図１４、図１５、図１６
に示す３文であり、図１４の日本語文は、多義語、即
ち、意味コード１２０４を複数持つ単語「日立」を含む
日本語文であり、図１５の日本語文は、未知語「高給取
りの」が存在する日本語文であり、図１６の日本語文
は、検索対象が明示されていない日本語文である。In the description of the processing in FIG. 2 and subsequent figures, specific Japanese sentences representing search instructions will be used as necessary. Specific Japanese sentences used in this embodiment are shown in FIGS. 14, 15 and 16.
14 is a Japanese sentence including a polysemous word, that is, a word “Hitachi” having a plurality of meaning codes 1204, and the Japanese sentence in FIG. 15 is an unknown word “Highly paid”. Is a Japanese sentence that exists, and the Japanese sentence in FIG. 16 is a Japanese sentence in which the search target is not specified.

【００５９】図２は、本実施例のハードウエア構成を示
すものであり、３０は入出力装置、４０はプロセッサ、
５０は記憶装置である。記憶装置５０は、以下の記憶部
からなる。５１は日本語解析処理部格納エリア、５２は
解析結果表示処理部格納エリア、５３は解析状況メッセ
ージ生成・表示処理部格納エリア、５４は解析結果修正
処理部格納エリア、５５はコマンド生成・実行処理部格
納エリア、５６は単語辞書格納エリア、５７は操作指示
対象定義情報格納テーブル格納エリア、５８は解析状況
データ格納エリア、５９はメッセージデータ格納エリ
ア、６０はデータベース格納エリア、６１は各処理実行
用のメモリであるワーキングエリアを表している。FIG. 2 shows the hardware configuration of this embodiment, where 30 is an input / output device, 40 is a processor,
Reference numeral 50 is a storage device. The storage device 50 includes the following storage units. Reference numeral 51 is a Japanese analysis processing unit storage area, 52 is an analysis result display processing unit storage area, 53 is an analysis status message generation / display processing unit storage area, 54 is an analysis result correction processing unit storage area, and 55 is command generation / execution processing. A copy storage area, 56 is a word dictionary storage area, 57 is an operation instruction target definition information storage table storage area, 58 is an analysis status data storage area, 59 is a message data storage area, 60 is a database storage area, and 61 is for each processing execution. Represents a working area, which is the memory of.

【００６０】記憶装置５０は、各種プログラムおよび単
語辞書１２等のデータを格納する。プロセッサ４０は、
記憶装置５０に格納されているプログラムを実行し、必
要に応じて入出力装置３０を通じてデータを入出力しな
がら処理する。The storage device 50 stores various programs and data such as the word dictionary 12. The processor 40 is
The program stored in the storage device 50 is executed, and data is processed through the input / output device 30 as necessary.

【００６１】図３は、日本語解析処理部２の概要ＰＡＤ
図（Problem Analysis Diagram）であり、入力日本語文
１ａを解析して中間表現２ａを生成するまでの処理を示
す。それらは、用語認定処理２０１、多義解消処理２０
２、修飾表現認定処理２０３、および、中間表現生成処
理２０４の各処理である。FIG. 3 is a schematic PAD of the Japanese analysis processing unit 2.
It is a figure (Problem Analysis Diagram), and shows the process of analyzing the input Japanese sentence 1a and generating the intermediate expression 2a. They are term recognition processing 201 and ambiguous resolution processing 20.
2, modified expression recognition processing 203, and intermediate expression generation processing 204.

【００６２】図４は、日本語解析処理部２における用語
認定処理２０１のＰＡＤ図である。入力装置から入力さ
れた文節分かち書き日本語文字列１ａから、単語辞書１
２を参照して、図１７等に示すような「文節テーブル
（ワーキングエリア５１に格納される）」に格納するま
での処理を示したものである。文節テーブルは、入力日
本語文を構成する単語の文法情報や、それらの単語とデ
ータベースの要素との対応情報を記述したテーブルであ
り、これらの情報は単語辞書１２から取得する。以下、
この処理をＰＡＤ図に従って説明する。FIG. 4 is a PAD diagram of the term recognition process 201 in the Japanese analysis processing unit 2. The word dictionary 1 from the Japanese character string 1a written in the phrase segment input from the input device
FIG. 2 shows the processing up to storing in the “bunsetsu table (stored in the working area 51)” as shown in FIG. The phrase table is a table in which grammatical information of words forming an input Japanese sentence and correspondence information between those words and elements of the database are described, and these pieces of information are acquired from the word dictionary 12. Less than,
This process will be described with reference to the PAD diagram.

【００６３】先ず、文節分かち書き入力日本語文１ａを
格納する文字列変数ＩＮＰを初期化する（ステップ２０
１０１）。変数ＩＮＰは、単に文字列変数を表すもので
あり、特定の意味を持つものではない。次に、文節分か
ち書き入力日本語文１ａを変数ＩＮＰに代入する（ステ
ップ２０１０２）。First, the character string variable INP for storing the phrase segmentation input Japanese sentence 1a is initialized (step 20).
101). The variable INP merely represents a character string variable and does not have any particular meaning. Next, the phrase segmentation input Japanese sentence 1a is substituted for the variable INP (step 20102).

【００６４】次に、変数ＩＮＰに文節が含まれている
間、各文節ごとに以下の処理を繰り返す。（ステップ２
０１０３）。単語辞書１２を参照し、当該文節中に単語
辞書１２の語幹１２０１が含まれるか否かを判別し（ス
テップ２０１０４）、含まれている場合は、当該文節中
の語幹１２０１に相当する文字列の部分を文節テーブル
の「語幹（１２０１）」として登録し、残りの文字列の
部分を文節テーブルの「付属語（１２０５）」として登
録する（ステップ２０１０５）。Next, while the variable INP contains a clause, the following processing is repeated for each clause. (Step 2
0103). By referring to the word dictionary 12, it is determined whether or not the stem 1201 of the word dictionary 12 is included in the phrase (step 20104). If it is included, the character string corresponding to the stem 1201 of the phrase is extracted. The part is registered as the "word stem (1201)" of the phrase table, and the remaining character string part is registered as the "adjunct word (1205)" of the phrase table (step 20105).

【００６５】次に、単語辞書１２を参照し、文節テーブ
ルに当該語幹１２０１の「正規語（１２０２）」、「項
目属性（１２０３）」、および「意味コード（１２０
４）」をそれぞれ登録する（ステップ２０１０６）。Next, referring to the word dictionary 12, "regular word (1202)", "item attribute (1203)", and "semantic code (120) of the stem 1201 in the phrase table.
4) ”are registered (step 20106).

【００６６】ステップ２０１０４で語幹１２０１が含ま
れていない場合は、当該文節中に数字列が存在するかを
判別し（ステップ２０１０７）、存在する場合は、文節
テーブルに数字列として登録し（ステップ２０１０
８）、残りの文字列を付属語１２０５として文節テーブ
ルに登録する（ステップ２０１０９）。ステップ２０１
０７で存在しない場合は、当該文節は「未知語」である
とみなし、文節テーブルの「項目属性（１２０３）」と
して「未知語」を格納する。前述したように、本実施例
では、未知語の存在によって解析が失敗する可能性があ
ると認定しているので、解析状況データ１４に解析コー
ド１４０１として整数「１」を格納し、また、対応する
文節番号１４０２と、未知語文字列１４０３を格納する
（ステップ２０１１０）。本実施例では解析コード
「１」は、用語認定処理２０１において未知語が存在し
たということを表している。If the word stem 1201 is not included in step 20104, it is determined whether or not a numeric string exists in the phrase (step 20107). If it exists, it is registered as a numeric string in the phrase table (step 2010).
8) Then, the remaining character string is registered in the clause table as an adjunct word 1205 (step 20109). Step 201
If it does not exist in 07, the phrase is regarded as an “unknown word”, and “unknown word” is stored as the “item attribute (1203)” of the phrase table. As described above, in the present embodiment, since it is determined that the analysis may fail due to the existence of the unknown word, the integer “1” is stored as the analysis code 1401 in the analysis situation data 14, and The phrase number 1402 to be executed and the unknown word character string 1403 are stored (step 20110). In this embodiment, the analysis code “1” indicates that an unknown word was present in the term recognizing process 201.

【００６７】次に、ステップ２０１０４の後、変数ＩＮ
Ｐより当該文節を削除する（ステップ２０１１１）。Next, after step 20104, the variable IN
The phrase is deleted from P (step 20111).

【００６８】用語認定処理２０１により、図１２に示す
単語辞書１２を参照して、図１４、図１５、図１６に示
す文節分かち書き日本語文を認定し、図１７、図１８、
図１９に示す文節テーブルにそれぞれ格納する。ここ
で、図１５の日本語文において、文節「高給取りの」は
単語辞書１２の語幹１２０１に含まれておらず、かつ、
数字列でもないので、未知語として認定する。一方、図
３３に示す解析状況データ１４内の、未知語に関する単
語データを格納する未知語データ格納テーブルに、解析
コード１４０１として整数「１」を、対応文節番号１４
０２として文節「高給取りの」の文節番号「２」を、未
知語文字列１４０３として文字列「高給取りの」を格納
する。The term recognizing process 201 authenticates the phrase segmentation Japanese sentences shown in FIGS. 14, 15 and 16 by referring to the word dictionary 12 shown in FIG. 12, and FIGS.
It stores in each clause table shown in FIG. Here, in the Japanese sentence of FIG. 15, the phrase “high paid” is not included in the stem 1201 of the word dictionary 12, and
Since it is not a numerical string, it is recognized as an unknown word. On the other hand, in the unknown word data storage table that stores word data relating to unknown words in the analysis situation data 14 shown in FIG. 33, the integer “1” is set as the analysis code 1401 and the corresponding clause number 14
The phrase number “2” of the phrase “high paid” is stored as 02, and the character string “high paid” is stored as the unknown word character string 1403.

【００６９】このように、本実施例では、未知語が存在
するか否かを、各単語が単語辞書１２に登録されている
か否かを判定することによって認定する。As described above, in this embodiment, it is determined whether or not an unknown word is present by determining whether or not each word is registered in the word dictionary 12.

【００７０】図５は、日本語解析処理部２における多義
解消処理２０２のＰＡＤ図である。用語認定処理２０１
で認定された語幹１２０１のうち、複数の意味を持つ単
語の意味を一義に決定する処理を示したものである。こ
こでいう「意味」とは、日本語１ａを構成する単語と、
データベース１６を構成する要素の対応関係を表す。以
下、この処理をＰＡＤ図に従って説明する。FIG. 5 is a PAD diagram of the ambiguous resolution processing 202 in the Japanese analysis processing unit 2. Term recognition process 201
This shows a process of uniquely determining the meaning of a word having a plurality of meanings in the stem 1201 recognized in (1). "Meaning" here means the words that make up Japanese 1a,
The correspondence relationship between the elements forming the database 16 is shown. Hereinafter, this processing will be described with reference to the PAD diagram.

【００７１】文節テーブルにデータが存在する間、以下
の処理を行う（ステップ２０２０１）。当該文節の語幹
１２０１が多義であるか、即ち、当該語幹の意味コード
１２０４が一義に決定されていないかを判別する（ステ
ップ２０２０２）。多義である場合は、当該文節が付属
語「が」または論理演算関係を表す付属語を伴うかどう
かを判別する（ステップ２０２０３）。ここで、「論理
演算語」とは、前述したように、論理積あるいは論理和
を表す自然語（「または」、「かつ」、「〜と」等）で
ある。While data is present in the clause table, the following processing is performed (step 20201). It is determined whether the word stem 1201 of the phrase has a polysemous meaning, that is, whether the meaning code 1204 of the word stem is not uniquely determined (step 20202). In the case of polysemy, it is determined whether or not the clause is accompanied by an adjunct "ga" or an adjunct representing a logical operation relation (step 20203). Here, as described above, the “logical operation word” is a natural word (“or”, “katsu”, “to”, etc.) representing a logical product or a logical sum.

【００７２】ステップ２０２０３で付属語「が」あるい
は論理演算関係を表す付属語を伴う場合、当該文節の意
味コード１２０４と共通の意味コード１２０４を直後の
文節が持つか否かを判別し（ステップ２０２０４）、共
通の意味コード１２０４を持つ場合は、一義に決定され
ていない文節について、当該共通の意味コード１２０４
を両文節の意味コード１２０４とする（ステップ２０２
０５）。なお、共通の意味コード１２０４が複数組存在
する場合には、すべての組を両文節の意味コード１２０
４とする。つまり、この場合、この時点では意味コード
１２０４は一義に決定されないことになる。ステップ２
０２０５の後、いったんステップ２０２０１のループの
外へ抜け、再びステップ２０２０１から処理を行う。こ
れは、ある語幹１２０１の意味コード１２０４が一義に
決定したことによって、それまで一義に決定できなかっ
た語幹１２０１の意味コード１２０４を一義に決定でき
る可能性があるからであり、この処理手順により正確に
意味コード１２０４の多義を解消することができる。When an adjunct word "ga" or an adjunct word representing a logical operation relation is accompanied in step 20203, it is determined whether or not the immediately following clause has a meaning code 1204 common to the meaning code 1204 of the clause (step 20204). ), If there is a common meaning code 1204, the common meaning code 1204 for the clauses that are not uniquely determined.
As the semantic code 1204 of both clauses (step 202)
05). If there are a plurality of sets of common meaning codes 1204, all sets are set to the meaning code 120 of both clauses.
Set to 4. That is, in this case, the meaning code 1204 is not uniquely determined at this point. Step two
After 0205, the process once exits the loop of step 20201, and the process is repeated from step 20201. This is because the meaning code 1204 of a certain stem 1201 is uniquely determined, so that the meaning code 1204 of the stem 1201 that could not be uniquely determined up to that point may be uniquely determined. In addition, the ambiguous meaning code 1204 can be resolved.

【００７３】ステップ２０２０３で、付属語「が」ある
いは論理演算関係を表す付属語を伴わない場合は、当該
文節が連体修飾文節であるか否かを判別し（ステップ２
０２０６）、連体修飾文節である場合は、当該文節意味
コード１２０４と同一テーブル内の意味コード１２０４
を直後の文節が持つかどうかを判別し（ステップ２０２
０７）、持つ場合は、一義に決定されていない文節につ
いて、当該同一テーブル内の意味コード１２０４を両文
節の意味コード１２０４とする（ステップ２０２０
８）。なお、同一テーブル内の意味コード１２０４の組
が複数組存在する場合には、すべての組を両文節の意味
コード１２０４とする。ステップ２０２０８の後、ステ
ップ２０２０５の後の処理と同様の理由により、再びス
テップ２０２０１に戻って処理を始めから行う。In step 20203, if the adjunct word "ga" or the adjunct word representing the logical operation relation is not involved, it is judged whether or not the clause is a adnominal modifier clause (step 2).
0206), the meaning code 1204 in the same table as the clause meaning code 1204 is
It is determined whether or not the clause immediately after has (step 202
07), in the case of having a clause that is not uniquely determined, the meaning code 1204 in the same table is set as the meaning code 1204 of both clauses (step 2020).
8). When there are a plurality of sets of meaning code 1204 in the same table, all the sets are set as the meaning code 1204 of both clauses. For the same reason as the process after step 20205 after step 20208, the process returns to step 20201 and the process is performed again from the beginning.

【００７４】ステップ２０２０６において、連体修飾文
節でない場合は、当該文節の直後の文節の語幹１２０１
が論理演算語あるいは連体修飾の論理指示語であるか否
かを判別する（ステップ２０２０９）。ここで、「論理
指示語」とは、前述したように、属性項目名と属性項目
データとの間の論理指示関係を表す自然語（「以上」、
「以下」、「以外」等）である。ステップ２０２０９で
応の場合は、当該文節の意味コード１２０４と共通の意
味コード１２０４を論理演算語あるいは論理指示語の直
後の文節が持つかどうかを判別し（ステップ２０２１
０）、持つ場合は、意味コード１２０４が一義に決定さ
れていない文節について共通の意味コード１２０４を両
文節の意味コード１２０４とする（ステップ２０２０
５）。なお、共通の意味コード１２０４が複数組存在す
る場合には、すべての組を両文節の意味コード１２０４
とする。その後、再びステップ２０２０１に戻って処理
を始めから行う。ステップ２０２１０で持たない場合
は、当該文節の意味コード１２０４と同一テーブル内の
意味コード１２０４を論理指示語の直後の文節が持つか
否かを判別し（ステップ２０２１１）、持つ場合は、一
義に決定されていない文節について、同一テーブル内の
意味コード１２０４を両文節の意味コード１２０４とす
る（ステップ２０２０８）。なお、同一テーブル内の意
味コード１２０４の組が複数組存在する場合には、すべ
ての組を両文節の意味コード１２０４とする。その後、
再びステップ２０２０１に戻って処理を始めから行う。In step 20206, if it is not the adnominal modifier phrase, the stem 1201 of the phrase immediately after the phrase.
Is a logical operation word or a logical instruction word for adnominal modification (step 20209). Here, as described above, the “logical instruction word” is a natural language (“or more”, which is a logical word) indicating a logical instruction relationship between the attribute item name and the attribute item data.
"Below", "other than", etc.). If YES in step 20209, it is determined whether or not the clause immediately after the logical operation word or the logical directive has the meaning code 1204 common to the meaning code 1204 of the clause (step 2021).
0), if there is, the common meaning code 1204 for the clauses whose meaning code 1204 is not uniquely determined is the meaning code 1204 of both clauses (step 2020).
5). If there are a plurality of sets of common meaning codes 1204, all sets are set to the meaning code 1204 of both clauses.
And After that, the process returns to step 20201 again to perform the process from the beginning. If not in step 20210, it is determined whether or not the clause immediately after the logical directive has the meaning code 1204 in the same table as the meaning code 1204 of the clause (step 20211). For the clauses that have not been set, the meaning code 1204 in the same table is set as the meaning code 1204 of both clauses (step 20208). When there are a plurality of sets of meaning code 1204 in the same table, all the sets are set as the meaning code 1204 of both clauses. afterwards,
The process returns to step 20201 and the process is performed from the beginning.

【００７５】ステップ２０２０１のループで行う処理
は、以上の説明から判るように、隣合う文節の接続関係
に規定された制約の強い多義解消処理である。ここまで
の処理で一義に決定できない語幹１２０１の意味コード
１２０４は、次のステップ２０２１２のループ処理にお
いて、やや制約を弱めた多義解消処理によって一義に決
定する。As can be seen from the above description, the processing executed in the loop of step 20201 is the ambiguous disambiguation processing with a strong restriction defined in the connection relation between adjacent clauses. The meaning code 1204 of the stem 1201 that cannot be unambiguously determined by the processing so far is unambiguously determined by the ambiguous resolution processing in which the constraint is slightly weakened in the loop processing of the next step 20212.

【００７６】文節テーブルにデータが存在する間、以下
の処理を行う（ステップ２０２１２）。当該文節の語幹
１２０１が複数の意味コード１２０４を持つか否かを判
別し（ステップ２０２１３）、持つ場合は、当該文節の
意味コード１２０４と同一テーブル内の意味コード１２
０４を隣接する文節が持つか否かを判別し（ステップ２
０２１４）、持つ場合は、一義に決定されていない文節
について同一テーブル内の意味コード１２０４を両文節
の意味コード１２０４とする（ステップ２０２０８）。
なお、同一テーブル内の意味コード１２０４の組が複数
組存在する場合には、すべての組を両文節の意味コード
１２０４とする。その後、再びステップ２０２０１に戻
って処理を始めから行う。While data is present in the clause table, the following processing is performed (step 20212). It is determined whether or not the stem 1201 of the phrase has a plurality of meaning codes 1204 (step 20213). If there is, the meaning code 1204 of the phrase and the meaning code 12 in the same table are determined.
It is determined whether or not the adjacent clause has 04 (step 2
0214), if it has, the meaning code 1204 in the same table for the clauses that are not uniquely determined is set as the meaning code 1204 of both clauses (step 20208).
When there are a plurality of sets of meaning code 1204 in the same table, all the sets are set as the meaning code 1204 of both clauses. After that, the process returns to step 20201 again to perform the process from the beginning.

【００７７】ここまでの処理で一義に決定できない単語
については、その意味コード１２０４を単語辞書１２に
おける記述順序に基づいて決定する。本実施例で使用す
る単語辞書１２に記述される各語幹１２０１の意味コー
ド１２０４は、記述順序が意味の優先順序と一致させて
ある。意味コード１２０４の優先順序は、その意味の使
用頻度等を基にして辞書作成者が予め決定しておく。も
ちろん、この優先順序を利用者が変更することは容易に
実現可能である。For words that cannot be uniquely determined by the processing up to this point, the meaning code 1204 is determined based on the description order in the word dictionary 12. In the meaning code 1204 of each stem 1201 described in the word dictionary 12 used in this embodiment, the description order matches the priority order of meaning. The priority order of the meaning code 1204 is determined in advance by the dictionary creator based on the usage frequency of the meaning. Of course, it is easy for the user to change this priority order.

【００７８】文節テーブルにデータが存在する間、以下
の処理を行う（ステップ２０２１５）。当該文節の語幹
１２０１が意味コード１２０４を複数持つか否かを判別
し（ステップ２０２１６）、持つ場合は、当該文節の語
幹１２０１が現在持つ意味コード１２０４のうち、単語
辞書１２に、より早く記述されている意味コード１２０
４を当該文節の意味コード１２０４として決定する（ス
テップ２０２１７）。そして、ステップ２０２１７で意
味コード１２０４を半ば強制的に決定することによって
解析が失敗する可能性があるので、図３４に示す解析状
況データ１４内の、多義語に関する単語データを格納す
る多義語データ格納テーブルに、解析コード１４０１と
して整数「２」を、対応文節番号１４０２として当該文
節番号を、決定意味コード１４０５としてステップ２０
２１７で決定した意味コード１２０４を、他候補意味コ
ード１４０６としてステップ２０２１７で棄却された意
味コード１２０４を、それぞれ格納する。本実施例で
は、解析コード「２」は、前後の語幹１２０１の意味か
ら当該語幹１２０１の意味コード１２０４を一義に決定
できず、その結果として意味コード１２０４を半ば強制
的に決定したことを表す。While data is present in the clause table, the following processing is performed (step 20215). It is determined whether or not the stem 1201 of the phrase has a plurality of meaning codes 1204 (step 20216). If the stem 1201 of the phrase has the meaning code 1204 currently held by the stem 1201 of the phrase, the word is described earlier in the word dictionary 12. Meaning code 120
4 is determined as the meaning code 1204 of the clause (step 20217). Then, since the analysis may fail by forcibly determining the meaning code 1204 in step 20217, the polysemous word data storage for storing the word data relating to the polysemous words in the analysis situation data 14 shown in FIG. In the table, the integer “2” is set as the analysis code 1401, the corresponding clause number is set as the corresponding clause number 1402, and the decision meaning code 1405 is set in step 20.
The meaning code 1204 determined in 217 is stored as the other candidate meaning code 1406, and the meaning code 1204 rejected in step 20217 is stored. In the present embodiment, the analysis code “2” indicates that the meaning code 1204 of the stem 1201 cannot be uniquely determined from the meanings of the preceding and subsequent stems 1201, and as a result, the meaning code 1204 is forcibly determined halfway.

【００７９】多義解消処理２０２により、図１４、図１
５、図１６の各入力日本語文１ａに含まれる、複数の意
味コード１２０４を持つ単語についてその意味コード１
２０４を一義に決定する。図１４の入力日本語文１ａに
ついての用語認定処理２０１の結果である図１７の文節
テーブルに対して多義解消処理２０２を行うことによ
り、図２０に示す、各語幹１２０１の意味コード１２０
４が一義に決定された文節テーブルを生成する。14 and 1 by the ambiguous resolution processing 202.
5, the meaning code 1 of a word having a plurality of meaning codes 1204 included in each input Japanese sentence 1a in FIG.
204 is uniquely determined. The meaning code 120 of each stem 1201 shown in FIG. 20 is obtained by performing the ambiguity elimination process 202 on the phrase table of FIG. 17, which is the result of the term recognition process 201 for the input Japanese sentence 1a of FIG.
4 generates a clause table that is uniquely determined.

【００８０】図１７の文節テーブルにおいて、語幹「日
立」は４種類の意味コードＰ５、Ｃ１、Ｐ４、Ｃ２を持
つ。文節「日立の」は直後の語幹「人」に連体修飾し、
かつ、「人」の持つ意味コード１２０４と同一テーブル
内の意味コード１２０４を語幹「日立」が持つので、ス
テップ２０２０８により、語幹「人」の持つ意味コード
Ｐ１と同一テーブルに属する意味コードＰ５、Ｐ４を語
幹「日立」の意味コード１２０４として残し、残りの二
つの意味コードＣ１、Ｃ２を棄却する。その後、ステッ
プ２０２１７により、語幹「日立」の意味コードＰ５、
Ｐ４のうち、単語辞書１２（即ち、図１７の文節テーブ
ル）に、より早く記述されている意味コードＰ５を語幹
「日立」の意味コード１２０４として決定する。さら
に、ステップ２０２１８により、図３４に示す解析状況
データ１４の多義語データ格納テーブルに、解析コード
１４０１として整数「２」を、対応文節番号１４０２と
して整数「１」を、多義語文字列１４０４として文字列
「日立」を、決定意味コード１４０５として意味コード
文字列Ｐ５を、他候補意味コード１４０６としてステッ
プ２０２１７で棄却した意味コードＰ４を格納する。In the clause table of FIG. 17, the word stem "Hitachi" has four kinds of meaning codes P5, C1, P4 and C2. The phrase "Hitachi's" is adjunctly modified to the immediately following stem "person",
Moreover, since the stem "Hitachi" has the meaning code 1204 in the same table as the meaning code 1204 of "person", in step 20208, the meaning code P5 and P4 belonging to the same table as the meaning code P1 of the stem "person" Is left as the meaning code 1204 of the stem "Hitachi", and the remaining two meaning codes C1 and C2 are rejected. Then, in step 20217, the meaning code P5 of the stem "Hitachi",
Of P4, the meaning code P5 described earlier in the word dictionary 12 (that is, the clause table of FIG. 17) is determined as the meaning code 1204 of the stem “Hitachi”. Further, at step 20218, the integer “2” is set as the analysis code 1401, the integer “1” is set as the corresponding clause number 1402, and the polysemous word character string 1404 is set as the character in the polysemous word data storage table of the analysis situation data 14 shown in FIG. The string “Hitachi” is stored as the determined meaning code 1405, the meaning code character string P5, and the other candidate meaning code 1406 is stored as the meaning code P4 rejected in step 20217.

【００８１】図１５の入力日本語文１ａを構成する語幹
１２０１の中には、図１８の文節テーブルに示すように
複数の意味コード１２０４を持つものが存在しないの
で、図２１の多義解消処理２０２の直後の文節テーブル
に示すように、多義解消処理２０２の前後で文節テーブ
ルは変わらない。Since there is no word stem 1201 forming the input Japanese sentence 1a shown in FIG. 15 having a plurality of meaning codes 1204 as shown in the clause table of FIG. 18, the ambiguous resolution processing 202 of FIG. As shown in the clause table immediately after, the clause table does not change before and after the ambiguous resolution processing 202.

【００８２】図１６の入力日本語文１ａの用語認定処理
２０１の結果である図１９の文節テーブルにおいて、語
幹「川崎」は２種類の意味コードＰ４、Ｃ２を持つ。こ
の場合、ステップ２０２１４の条件を満たすので、ステ
ップ２０２０８により、語幹「住む」の意味コードＰ４
と同一テーブル内の意味コードＰ４を語幹「川崎」の意
味コード１２０４として決定し、残りの意味コードＣ２
を棄却する。図２２に、図１９の文節テーブルに対する
多義解消処理２０２の結果を表す文節テーブルを示す。In the phrase table of FIG. 19 which is the result of the term recognition processing 201 of the input Japanese sentence 1a of FIG. 16, the stem "Kawasaki" has two kinds of meaning codes P4 and C2. In this case, since the condition of step 20214 is satisfied, the meaning code P4 of the word "live" is determined in step 20208.
The meaning code P4 in the same table as is determined as the meaning code 1204 of the stem "Kawasaki", and the remaining meaning code C2.
Reject. FIG. 22 shows a clause table representing the result of the ambiguous resolution processing 202 for the clause table of FIG.

【００８３】このように、本実施例では、多義を解消す
るための条件をプログラム中に手続き的に記述（図５に
おける、ステップ２０２０３、２０２０４、２０２１
４、２０２０６、２０２０７、２０２０９、２０２１
０、２０２１１）して、それらを順番に適用することに
よって多義を解消する。これらの多義解消処理は、上述
の条件部分および、図５のステップ２０２０５、２０２
０８のような帰結部分（アクション部分）を処理プログ
ラムと独立させて記述することでも実現可能である。As described above, in this embodiment, the condition for eliminating the ambiguous condition is procedurally described in the program (steps 20203, 20204, 2021 in FIG. 5).
4, 20206, 20207, 20209, 2021
0, 20211) and apply them in order to resolve the ambiguity. These ambiguous resolution processing is performed by the above-mentioned condition part and steps 20205 and 202 of FIG.
It can also be realized by describing a consequent part (action part) such as 08 separately from the processing program.

【００８４】図６は、日本語解析処理部２における修飾
表現認定処理２０３のＰＡＤ図である。文節テーブル情
報を用いて入力日本語文１ａを構成する文節の修飾関係
を認定するまでの処理を示したものであり、以下、この
処理をＰＡＤ図に従って説明する。FIG. 6 is a PAD diagram of the qualified expression recognition process 203 in the Japanese analysis processing unit 2. It shows the processing until the modification relation of the clauses forming the input Japanese sentence 1a is recognized using the clause table information, and this processing will be described below with reference to the PAD diagram.

【００８５】文節テーブルに未知語でないデータが存在
する間、以下の処理を行う（ステップ２０３０１）。当
該文節が論理指示語文節であるか否かを判別し（ステッ
プ２０３０２）、論理指示語文節である場合は、その直
前の文節が属性項目データであるか否かを判別し（ステ
ップ２０３０３）、属性項目データである場合は、当該
論理指示語に対応する論理指示子を属性項目データの直
前に連結し、当該２文節をまとめて「Ｄ項」として決定
する（ステップ２０３０４）。Ｄ項の「Ｄ」は、「Ｄａ
ｔａ」の頭文字を表している。また、ここで、「論理指
示子」は、属性項目と属性項目データとの間の論理指示
関係を表す記号（＜、＞、＝＝、！＝など）であり、論
理指示語と１対１に対応する。本実施例では、論理指示
語の意味コード１２０４として、論理指示子を表す記号
を定義し、単語辞書１２に格納する。これにより、論理
指示語と論理指示子との対応関係は、語幹１２０１とそ
の意味コード１２０４を参照することにより認定でき
る。論理指示語と論理指示子との対応関係は、単語辞書
１２とは別のテーブルで定義しても良い。While the phrase table contains data that is not an unknown word, the following processing is performed (step 20301). It is determined whether or not the phrase is a logical directive word phrase (step 20302). If it is a logical directive word phrase, it is determined whether or not the preceding phrase is attribute item data (step 20303). In the case of attribute item data, the logical indicator corresponding to the logical instruction word is connected immediately before the attribute item data, and the two clauses are collectively determined as "D term" (step 20304). “D” in the D term is “Da
It represents the initials of "ta". Further, here, the “logical designator” is a symbol (<,>, ==,! =, Etc.) representing a logical designating relationship between the attribute item and the attribute item data, and is one-to-one with the logical designating word. Corresponding to. In this embodiment, a symbol representing a logical indicator is defined as the meaning code 1204 of the logical indicator and stored in the word dictionary 12. Accordingly, the correspondence between the logical designator and the logical designator can be identified by referring to the stem 1201 and its meaning code 1204. The correspondence between the logical indicator and the logical indicator may be defined in a table different from the word dictionary 12.

【００８６】ステップ２０３０２の後、当該文節が論理
積または論理和を表す語幹１２０１であるか否かを判別
し（ステップ２０３０５）、応である場合は、当該語幹
１２０１の前後の語幹１２０１の項目属性１２０３が同
一であるか否かを判別し（ステップ２０３０６）、同一
である場合は、論理積または論理和を表す語幹１２０１
の前後の文節の項目属性１２０３がともに属性項目名で
あるか否かを判別（ステップ２０３０７）し、属性項目
名である場合は、論理積または論理和を表す語幹１２０
１と前後の文節をまとめて「Ａ項」として決定し（ステ
ップ２０３０８）、ともに属性項目名でない場合は、
「Ｄ項」として決定する（ステップ２０３０９）。Ａ項
の「Ａ」は、Ａｔｔｒｉｂｕｔｅの頭文字を表してい
る。After step 20302, it is judged whether or not the relevant clause is a stem 1201 representing a logical product or a logical sum (step 20305). If yes, the item attributes of the stem 1201 before and after the relevant stem 1201. It is determined whether or not 1203 are the same (step 20306). If they are the same, a stem 1201 representing a logical product or a logical sum.
It is determined whether or not the item attributes 1203 of the clauses before and after are both attribute item names (step 20307). If the item attributes 1203 are attribute item names, the word stem 120 representing a logical product or a logical sum.
1 and the preceding and following clauses are collectively determined as “A term” (step 20308), and when both are not attribute item names,
It is determined as "D term" (step 20309). The "A" in the section A represents the acronym for Attribute.

【００８７】ステップ２０３０１の後、文節テーブルに
未知語でないデータが存在する間、以下の処理を行う
（ステップ２０３１０）。当該文節がＡ項またはＤ項と
して決定していないかどうかを判別し（ステップ２０３
１１）、決定していない場合は、当該文節が属性項目名
であるか否かを判別し（ステップ２０３１２）、属性項
目名である場合は、当該文節を「Ａ項」として決定し
（ステップ２０３１３）、属性項目名でない場合は、当
該文節を「Ｄ項」として決定する（ステップ２０３１
４）。After step 20301, the following processing is carried out while data that is not an unknown word exists in the clause table (step 20310). It is determined whether or not the clause is determined as the A term or the D term (step 203).
11) If not determined, it is determined whether or not the phrase is an attribute item name (step 20312). If the phrase is an attribute item name, the phrase is determined as "A term" (step 20313). ), If it is not an attribute item name, the clause is determined as “D term” (step 2031).
4).

【００８８】図７は、日本語解析処理部２における中間
表現生成処理２０４のＰＡＤ図である。中間表現２ａを
生成するまでの処理を示したものであり、以下、この処
理ＰＡＤ図に従って説明する。FIG. 7 is a PAD diagram of the intermediate expression generation processing 204 in the Japanese analysis processing unit 2. The processing up to the generation of the intermediate representation 2a is shown, and will be described below with reference to this processing PAD diagram.

【００８９】修飾表現認定処理２０３で決定された全て
のＡ項およびＤ項に対して以下の処理を行う（ステップ
２０４０１）。隣り合うＡ項とＤ項で同一意味コード１
２０４をもつものを検索条件の最小単位として決定する
（ステップ２０４０２）。The following processing is performed for all the A terms and D terms determined in the qualified expression recognition processing 203 (step 20401). Same meaning code 1 in adjacent A and D terms
Those having 204 are determined as the minimum unit of the search condition (step 20402).

【００９０】ステップ２０４０１の後、最小単位として
未決定のＤ項が存在するかを判別し（ステップ２０４０
３）、存在する場合は、操作指示対象定義情報格納テー
ブル１３を参照し、当該Ｄ項の持つ意味コード１２０４
に対応する項目属性文字列（Ａ項）を補い、検索条件の
最小単位として決定する（ステップ２０４０４）。ステ
ップ２０４０３の後、最小単位として未決定のＡ項が存
在するかを判別し（ステップ２０４０５）、存在する場
合は、当該Ａ項を検索対象として格納する（ステップ２
０４０６）。存在しない場合は、まず、操作指示対象定
義情報格納テーブル１３を参照して、文末に最も近いＡ
項の属するテーブルにおける「デフォルト対象指定項目
（１３０４）」である属性項目名を検索対象として認定
する（ステップ２０４０７）。文末に最も近いＡ項の属
するテーブルのデフォルト対象指定項目１３０４を検索
対象とするのは、次の理由による。データベース検索を
指示する日本語文１ａにおいては、検索対象が文末にく
る確率が最も高い。また、検索対象を認定できないとい
うことは、検索対象が省略されているものとみなす。さ
らに、利用者が検索対象としたい項目は、文末のＡ項と
意味的にそれほどかけ離れたものではない。よって、検
索対象は文末のＡ項が属するテーブル内に存在すると推
定できる。さらに、そのテーブルにおいて中心をなす項
目以外の項目名を検索対象とする場合には、その項目名
に相当する単語を省略をしないのが普通である。省略す
ると、意味の通じない日本語文となることが多いからで
ある。以上のことから、検索対象が認定できなかった場
合に、文末に最も近いＡ項が属するテーブルの中心とな
る項目（デフォルト対象指定項目１３０４）を検索対象
として認定する。After step 20401, it is determined whether or not there is an undetermined D term as the minimum unit (step 2040).
3) If it exists, refer to the operation instruction target definition information storage table 13 and refer to the meaning code 1204 of the D term.
The item attribute character string (A term) corresponding to is supplemented and determined as the minimum unit of the search condition (step 20404). After step 20403, it is determined whether or not there is an undecided A term as the minimum unit (step 20405), and if there is, the A term is stored as a search target (step 2).
0406). If it does not exist, first, by referring to the operation instruction target definition information storage table 13, the A closest to the end of the sentence is displayed.
The attribute item name that is the "default target designation item (1304)" in the table to which the item belongs is recognized as the search target (step 20407). The default target designation item 1304 of the table to which the term A closest to the end of the sentence belongs is set as the search target for the following reason. In the Japanese sentence 1a instructing the database search, the search target is most likely to come to the end of the sentence. If the search target cannot be certified, it is considered that the search target is omitted. Further, the item that the user wants to search is not significantly different from item A at the end of the sentence. Therefore, it can be estimated that the search target exists in the table to which the A term at the end of the sentence belongs. Furthermore, when an item name other than the central item in the table is to be searched, it is usual not to omit the word corresponding to the item name. This is because, if omitted, it will often be a Japanese sentence that does not make sense. From the above, when the search target cannot be certified, the item (default target designation item 1304) that is the center of the table to which the A term closest to the end of the sentence belongs is certified as the search target.

【００９１】さて、ステップ２０４０７において検索対
象をシステムによって補ったが、この処理によって、解
析が失敗する可能性があるので、図３５に示す解析状況
データの検索対象データ格納テーブルに、解析コード１
４０１として、整数「３」を、対応文節番号１４０２と
して最後の文節の番号に１を加えた番号（即ち、日本語
文の文末に位置することを意味する）を、文末Ａ項文字
列１４０７として、文末Ａ項文字列を、挿入検索対象１
４０８として、文末Ａ項の属するテーブルのデフォルト
対象指定項目１３０４を格納する。本実施例において
は、解析コード「３」は、入力日本語文１ａから検索対
象を認定できなかったことを表す。Now, in step 20407, the search target is supplemented by the system. However, since the analysis may fail due to this processing, the analysis code 1 is stored in the search target data storage table of the analysis situation data shown in FIG.
As an 401, an integer "3", and a number obtained by adding 1 to the last bunsetsu number as the corresponding bunsetsu number 1402 (that means that the sentence is located at the end of the sentence of the Japanese sentence) is set as the end-of-sentence A term character string 1407 Insert sentence A term character string, insert search target 1
As 408, the default target designation item 1304 of the table to which the last sentence A belongs is stored. In the present embodiment, the analysis code “3” indicates that the search target cannot be recognized from the input Japanese sentence 1a.

【００９２】ステップ２０４０５の後、全ての検索条件
の最小単位に対して以下の処理を行う（ステップ２０４
０９）。Ｄ項の先頭に論理指示子が付接しているかを判
別し（ステップ２０４１０）、付接している場合は、Ａ
項の直後にＤ項を連結させ、検索条件中の「条件」とし
て決定する（ステップ２０４１１）。付接していない場
合は、Ａ項の直後に論理指示子”＝＝”を連結し、その
直後にＤ項を連結し、検索条件中の「条件」とする（ス
テップ２０４１２）。After step 20405, the following processing is performed on the minimum unit of all search conditions (step 204).
09). It is determined whether or not the logical indicator is attached to the head of the item D (step 20410), and if it is attached, A
Immediately after the term, the D term is concatenated and determined as the “condition” in the search condition (step 20411). If not attached, the logical indicator “==” is connected immediately after the A term, and the D term is connected immediately after the A term to be the “condition” in the search condition (step 20412).

【００９３】ステップ２０４０９の後、全ての「条件」
に対して以下の処理を行う（ステップ２０４１３）。
「条件」と「条件」との間に論理和を表す語幹が存在す
るかどうかを判別し（ステップ２０４１４）、存在する
場合は、「条件」と「条件」との間に”ｏｒ”を挿入し
（ステップ２０４１５）、存在しない場合は、”ａｎ
ｄ”を挿入する（ステップ２０４１６）。After step 20409, all "conditions"
Then, the following processing is performed (step 20413).
It is determined whether or not there is a stem representing a logical sum between the “condition” and the “condition” (step 20414), and if there is, a “or” is inserted between the “condition” and the “condition”. (Step 20415), if not present, "an
d "is inserted (step 20416).

【００９４】ステップ２０４１３の後、全ての「条件」
に対して以下の処理を行う（ステップ２０４１７）。検
索条件中に”ｏｒ”が存在するかを判別し（ステップ２
０４１８）、存在する場合は、”ｏｒ”の前後の「条
件」のＡ項が同一意味コード１２０４を持つか否かを判
別し（ステップ２０４１９）、持つ場合は、まず、”ｏ
ｒ”の直前の「条件」の前に”｛”を挿入し、”ｏｒ”
の直後の「条件」の直後に”｝”を挿入する（ステップ
２０４２０）。次に、”ｏｒ”の前後の条件の論理指示
子が同一で”＝＝”以外であるか否かを判別し（ステッ
プ２０４２１）、以外である場合は、まず、”ｏｒ”の
前後の「条件」の論理指示子を｛｝の直前に連結させ、
「条件」の論理指示子を”＝＝”と置き換える（ステッ
プ２０４２２）。次に、ステップにおける論理指示子が
否定”＾＝”であるか否かを判別し（ステップ２０４２
３）、否定である場合は、”ｏｒ”を”ａｎｄ”に変更
する（ステップ２０４２４）。After step 20413, all "conditions"
Then, the following processing is performed (step 20417). It is determined whether "or" exists in the search condition (step 2
0418), if it exists, it is determined whether or not the A terms of the “condition” before and after “or” have the same meaning code 1204 (step 20419).
Insert "{" before "condition" immediately before "r", and then "or"
"}" Is inserted immediately after "condition" immediately after (step 20420). Next, it is determined whether or not the logical indicators of the conditions before and after “or” are the same and other than “==” (step 20421). Connect the logical indicator of "condition" just before {},
The logical indicator of "condition" is replaced with "==" (step 20422). Next, it is judged whether or not the logical indicator in the step is negative "^ =" (step 2042).
3) If not, change "or" to "and" (step 20424).

【００９５】修飾表現認定処理２０３および中間表現生
成処理２０４により、意味コード１２０４がすべて一義
に決定した図２０の文節テーブルから図２３の中間表現
２ａを生成する。ここで、語幹「日立」に対応する属性
項目名が入力日本語文１ａの中に存在しないので、操作
指示対象定義情報格納テーブル１３を参照して、「日
立」の意味コードＰ５に対応する属性項目名「勤務先」
を補い、あわせて検索条件２００１の最小単位として認
定する。また、語幹「人」、「年収」に対応する属性項
目データが入力日本語文１ａの中に存在しないので、こ
れらを検索対象２００２として認定する。The modified expression recognition processing 203 and the intermediate expression generation processing 204 generate the intermediate expression 2a of FIG. 23 from the clause table of FIG. 20 in which all the meaning codes 1204 are uniquely determined. Here, since the attribute item name corresponding to the stem "Hitachi" does not exist in the input Japanese sentence 1a, the attribute item corresponding to the meaning code P5 of "Hitachi" is referred to by referring to the operation instruction target definition information storage table 13. First name "workplace"
And is also certified as the minimum unit of the search condition 2001. Further, since the attribute item data corresponding to the stems “person” and “annual income” do not exist in the input Japanese sentence 1a, these are certified as the search target 2002.

【００９６】同様に、図２１の文節テーブルから図２４
の中間表現２ａを生成する。語幹「５００万」について
は、上述の「日立」と同様に、属性項目名「年収」を補
い、検索条件２００１の最小単位として認定する。ま
た、語幹「高給取りの」は、未知語であるので、解析に
おいては無視し、何も解析をしない。Similarly, from the clause table in FIG.
To generate the intermediate representation 2a. The word stem “5 million” is supplemented with the attribute item name “annual income” as in the case of “Hitachi” described above, and is certified as the minimum unit of the search condition 2001. Further, the word stem “high paid” is an unknown word, so it is ignored in the analysis, and nothing is analyzed.

【００９７】同様に、図２２の文節テーブルから図２５
の中間表現２ａを生成する。まず、語幹「川崎」と「住
む（正規語は「住所」）」、語幹「趣味」と「ゴルフ」
をそれぞれまとめて検索条件２００１の最小単位として
それぞれ認定する。また、語幹「男性」に対応する属性
項目名「性別」を補い、検索条件２００１の最小単位と
して認定する。また、この日本語文１ａには最小単位と
して未決定のＡ項が存在しない、即ち、検索対象２００
２を認定できないので、ステップ２０４０７により、図
１２に示す操作指示対象定義情報格納テーブル１３を参
照し、文末Ａ項である語幹「趣味」の属する「個人テー
ブル」のデフォルト対象指定項目「氏名」を検索対象２
００２として認定する。そして、ステップ２０４０８に
より、図３５に示す解析状況データ１４の検索対象デー
タ格納テーブルに、解析コード１４０１として整数
「３」を、対応文節番号１４０２として整数「６」を、
文末Ａ項文字列１４０７として文末Ａ項文字列「趣味」
を、挿入検索対象１４０８としてデフォルト対象指定項
目「人名」をそれぞれ格納する。Similarly, from the clause table in FIG.
To generate the intermediate representation 2a. First, the stems "Kawasaki" and "live" (the regular word is "address"), the stems "hobby" and "golf"
Are collectively recognized as the minimum unit of the search condition 2001. In addition, the attribute item name “sex” corresponding to the word stem “male” is supplemented, and the minimum unit of the search condition 2001 is recognized. In addition, this Japanese sentence 1a has no undetermined A term as the minimum unit, that is, the search target 200
Since 2 cannot be certified, the operation instruction target definition information storage table 13 shown in FIG. 12 is referenced in step 20407, and the default target designation item “name” of the “personal table” to which the stem “hobby” to which the word end A belongs belongs. Search target 2
Certified as 002. Then, in step 20408, the integer “3” is set as the analysis code 1401 and the integer “6” is set as the corresponding clause number 1402 in the search target data storage table of the analysis situation data 14 shown in FIG.
End-of-sentence A character string "407" as end-of-sentence A character string 1407
The default target designation item “person name” is stored as the insertion search target 1408.

【００９８】このように、本実施例では、自然語１ａを
解析して中間表現２ａを生成する際に、検索対象２００
２が認定できないということをＤ項と対をなさないＡ項
が存在するか否かを判定することによって認定する。こ
のような条件をルールとしてプログラムと独立のテーブ
ル等に予め記述しておき、解析においてそれらを一つ一
つ照合することによって同様の処理を実現してもよい。As described above, in this embodiment, when the natural language 1a is analyzed and the intermediate expression 2a is generated, the search target 200 is searched.
The fact that 2 cannot be recognized is confirmed by determining whether or not there is an A term that does not pair with the D term. Such conditions may be described in advance in a table or the like independent of the program as rules, and the same processing may be realized by collating them one by one in the analysis.

【００９９】なお、ここでは、検索対象が存在しない場
合、自動的にデフォルト対象指定項目を検索対象として
格納するが、後述するように、検索対象が存在しない場
合、検索対象がないから検索できないことを表すメッセ
ージを生成、表示し、利用者に修正エディタ等を用いて
検索対象を定義させるという方法もある。また、この場
合、利用者の入力した検索対象と入力日本語文を対応さ
せて記憶しておき、次回以降で、意味的に近い日本語文
が入力された場合に、記憶しておいた情報とマッチング
をとることによって、検索対象を自動的に認定すること
もできる。Here, if there is no search target, the default target designation item is automatically stored as the search target. However, as will be described later, if the search target does not exist, it cannot be searched because there is no search target. There is also a method of generating and displaying a message indicating "," and letting the user define a search target using a correction editor or the like. In this case, the search target entered by the user and the input Japanese sentence are stored in association with each other, and when a semantically similar Japanese sentence is input, the stored information is matched with the stored information. By taking, it is possible to automatically qualify the search target.

【０１００】図８は、解析結果表示処理部３の概要ＰＡ
Ｄ図であり、以下の処理からなる。まず、中間表現２ａ
を表形式に展開した「中間表現展開テーブル」を生成す
る中間表現展開テーブル生成処理３０１、次に、中間表
現展開テーブルを出力手段により表示する中間表現展開
テーブル表示処理からなる。FIG. 8 shows an outline PA of the analysis result display processing section 3.
It is a D figure and consists of the following processes. First, the intermediate representation 2a
Is formed into a table format to generate an "intermediate expression expansion table" 301, and then an intermediate expression expansion table display processing for displaying the intermediate expression expansion table by the output means.

【０１０１】図９は、中間表現展開テーブル生成処理の
ＰＡＤ図であり、中間表現２ａを表形式に整形した中間
表現展開テーブルを生成するまでの処理であり、以下、
この処理の説明をＰＡＤ図に従って説明する。FIG. 9 is a PAD diagram of the intermediate expression expansion table generation processing, which is the processing until the intermediate expression expansion table in which the intermediate expression 2a is shaped into a table format is generated.
This processing will be described with reference to the PAD diagram.

【０１０２】中間表現２ａにおける検索条件２００１の
条件に対して以下の処理を行う（ステップ３０１０
１）。まず、中間表現展開テーブルを構成する「属性項
目名（２００３）」として、Ａ項を表す文節の正規語文
字列１２０２を登録する（ステップ３０１０２）。次
に、中間表現展開テーブルを構成する「論理指示子（２
００４）」として、Ａ項に対応する論理指示子を登録す
る（ステップ３０１０３）。その次に、中間表現展開テ
ーブルを構成する「属性項目データ（２００５）」とし
て、Ａ項に対応するＤ項を表す文節の正規語文字列１２
０２を登録する（ステップ３０１０４）。The following processing is performed on the condition of the search condition 2001 in the intermediate representation 2a (step 3010).
1). First, the regular word character string 1202 of the clause representing the A term is registered as the "attribute item name (2003)" that constitutes the intermediate expression development table (step 30102). Next, the "logical designator (2
004) ”, the logical indicator corresponding to the item A is registered (step 30103). Then, as the "attribute item data (2005)" that constitutes the intermediate expression expansion table, the regular word character string 12 of the clause representing the D term corresponding to the A term.
02 is registered (step 30104).

【０１０３】次に、当該条件が｛｝内に存在し、かつ、
｛｝内の最後の条件であるか否かを判別し（ステップ３
０１０５）、最後の条件である場合は、中間表現展開テ
ーブルの「条件修飾の範囲（２００７）」として、整数
「０」を登録する（ステップ３０１０６）。最後の条件
でない場合は、整数「１」を登録する（ステップ３０１
０７）。Next, the condition exists in {}, and
It is determined whether or not it is the last condition in {} (step 3
0105), if it is the last condition, the integer “0” is registered as the “condition modification range (2007)” of the intermediate expression expansion table (step 30106). If it is not the last condition, the integer "1" is registered (step 301).
07).

【０１０４】ステップ３０１０５の後、当該条件の直後
に論理演算子が存在するか否かを判別し（ステップ３０
１０８）、存在する場合は、当該論理演算子が「ａｎ
ｄ」であるか否かを判別し（ステップ３０１０９）、応
の場合は、中間表現テーブルの「論理演算子（２００
６）」として文字列「＆」を登録する。否の場合は、文
字列「！」を登録する。ステップ３０１０８において存
在しない場合は、条件の直後が「｛」であるか否かを判
別し（ステップ３０１１２）、否の場合は、中間表現展
開テーブルの「論理演算子（２００６）」として文字列
「Ｅ」を登録する（ステップ３０１１３）。After step 30105, it is determined whether or not a logical operator exists immediately after the condition (step 30
108), if present, the logical operator is "an
It is determined whether or not it is "d" (step 30109), and if yes, the "logical operator (200
6) ”is registered as the character string“ & ”. If not, the character string “!” Is registered. If it does not exist in step 30108, it is judged whether or not the condition immediately after is "{" (step 30112). If not, the character string "as the" logical operator (2006) "in the intermediate representation expansion table is determined. "E" is registered (step 30113).

【０１０５】中間表現展開テーブル生成処理３０１によ
り、図２３の中間表現２ａから図２６に示す中間表現展
開テーブルを生成する。同様に、図２４、図２５の中間
表現２ａから図２７、図２８に示す中間表現展開テーブ
ルを生成する。By the intermediate expression development table generation processing 301, the intermediate expression development table shown in FIG. 26 is generated from the intermediate expressions 2a in FIG. Similarly, the intermediate expression development tables shown in FIGS. 27 and 28 are generated from the intermediate expressions 2a shown in FIGS.

【０１０６】図１０は中間表現展開テーブル表示処理３
０２のＰＡＤ図であり、中間表現展開テーブルに基づい
て利用者に日本語解析結果を表示するまでの処理であ
り、以下、この処理の説明をＰＡＤ図に従って行う。FIG. 10 shows the intermediate representation expansion table display processing 3
It is a PAD diagram of No. 02, and is a process until a Japanese analysis result is displayed to a user based on an intermediate expression expansion table. Hereinafter, this process will be described according to the PAD diagram.

【０１０７】まず、検索条件２００１の条件に対して
（ステップ３０２０１）、当該条件に相当する中間表現
展開テーブルのデータを出力手段より表示する（ステッ
プ３０２０２）。ステップ３０２０１の後、検索対象２
００２を表すＡ項を出力手段より出力する（ステップ３
０２０３）。First, with respect to the condition of the search condition 2001 (step 30201), the data of the intermediate expression expansion table corresponding to the condition is displayed by the output means (step 30202). Search target 2 after step 30201
The A term representing 002 is output from the output means (step 3
0203).

【０１０８】中間表現展開テーブル表示処理３０２によ
り、図２６、図２７、および図２８に示す中間表現展開
テーブルに基づいて、それぞれ図２９、図３０、図３１
に示すように解析結果を利用者へ表示する。By the intermediate expression development table display processing 302, based on the intermediate expression development tables shown in FIGS. 26, 27, and 28, FIGS. 29, 30, and 31, respectively.
The analysis result is displayed to the user as shown in.

【０１０９】図１１は、解析状況メッセージ生成・表示
処理８のＰＡＤ図であり、利用者からの表示指定に応じ
て自然語メッセージ情報を生成し、出力手段より利用者
に表示するまでの処理であり、以下、この処理の説明を
ＰＡＤ図に従って行う。FIG. 11 is a PAD diagram of the analysis status message generation / display process 8, which is a process in which natural language message information is generated according to the display designation from the user and is displayed to the user by the output means. Yes, the process will be described below with reference to the PAD diagram.

【０１１０】まず、解析失敗の可能性のある解析箇所に
おける解析失敗原因を表示するのか、解析箇所における
解析方法を表示するのか、解析失敗への対処方法を表示
するのかという「メッセージモード（１５０１）」を表
す変数ＭＭを初期化する（ステップ８０１）。ＭＭは、
単に変数を表す文字列であり、特別な意味を持つもので
はない。次に、利用者からの表示指示内容を判別し（ス
テップ８０２）、解析失敗原因の表示指示である場合
は、メッセージモード１５０１を表す変数ＭＭに整数
「１」を代入する（ステップ８０３）。解析方法の表示
指示である場合は、整数「２」を代入する。対処方法の
表示指示である場合は、整数「３」を代入する（ステッ
プ８０４）。First, the "message mode (1501)" is displayed indicating whether the cause of analysis failure is displayed at the analysis point where there is a possibility of analysis failure, whether the analysis method at the analysis point is displayed, or how to deal with the analysis failure. A variable MM representing "" is initialized (step 801). MM is
It is just a string that represents a variable and has no special meaning. Next, the content of the display instruction from the user is discriminated (step 802), and if it is the display instruction of the cause of the analysis failure, the integer "1" is substituted for the variable MM representing the message mode 1501 (step 803). If it is a display instruction of the analysis method, the integer “2” is substituted. If it is the display instruction of the coping method, the integer "3" is substituted (step 804).

【０１１１】ステップ８０２の後、解析状況データ１４
内の単語データを格納するテーブルそれぞれについて、
以下の処理を行う（ステップ８０６）。テーブルそれぞ
れについてデータが格納されている間、以下の処理を行
う（ステップ８０７）。After step 802, the analysis status data 14
For each table that stores the word data in
The following processing is performed (step 806). While data is stored for each table, the following processing is performed (step 807).

【０１１２】まず、メッセージデータ１５を参照して、
メッセージモード１５０１を表す変数ＭＭおよび当該テ
ーブルに格納されている解析コード１４０１に対応する
メッセージテンプレート１５０２を抽出する（ステップ
８０８）。First, referring to the message data 15,
The variable MM representing the message mode 1501 and the message template 1502 corresponding to the analysis code 1401 stored in the table are extracted (step 808).

【０１１３】次に、抽出したメッセージテンプレート１
５０２の中に可変部分を表す記号「＠」、「＄」、ある
いは「％」が存在する間、以下の処理を行う（ステップ
８０９）。Next, the extracted message template 1
While the symbol "@", "$", or "%" representing the variable portion is present in 502, the following processing is performed (step 809).

【０１１４】メッセージテンプレート１５０２の中に記
号「＠」で囲まれた部分が存在し、かつ、その部分内に
記号「＠」、「＄」、「％」が存在しない間（ステップ
８１０）、解析状況データ１４を参照して、記号「＠」
で囲まれた部分の表す項目に対応するデータに置き換え
る（ステップ８１１）。While the message template 1502 has a portion surrounded by the symbol "@" and the symbols "@", "$" and "%" are not present in the portion (step 810), the analysis is performed. Referring to the situation data 14, the symbol "@"
It is replaced with the data corresponding to the item represented by the part surrounded by (step 811).

【０１１５】メッセージテンプレート１５０２の中に記
号「＄」で囲まれた部分が存在し、かつ、その部分内に
記号「＠」、「＄」、「％」が存在しない間（ステップ
８１２）、操作指示対象定義情報格納テーブル１３を参
照して、記号「＠」で囲まれた部分の表す意味コード１
２０４に対応する属性項目名１３０１に置き換える（ス
テップ８１３）。While the message template 1502 has a portion surrounded by the symbol "$" and the symbols "@", "$", and "%" do not exist within the portion (step 812), the operation is performed. By referring to the instruction target definition information storage table 13, the meaning code 1 represented by the part enclosed by the symbol "@"
The attribute item name 1301 corresponding to 204 is replaced (step 813).

【０１１６】メッセージテンプレート１５０２の中に記
号「％」で囲まれた部分が存在し、かつ、その部分内に
記号「＠」、「＄」、「％」が存在しない間（ステップ
８１４）、操作指示対象定義情報格納テーブル１３を参
照し、「＠」で囲まれた部分の表す意味コード１２０４
に対応する所属テーブル名１３０３に置き換える（ステ
ップ８１５）。以上の説明で判るように、記号「＠」、
「＄」、「％」の違いは、置き換える文字列データを抽
出するために参照するテーブルの違いおよび抽出するデ
ータそのものの違いによる。While the message template 1502 has a portion surrounded by the symbol "%" and the symbols "@", "$" and "%" do not exist within the portion (step 814), the operation is performed. By referring to the instruction target definition information storage table 13, the meaning code 1204 represented by the portion surrounded by "@"
Is replaced with the belonging table name 1303 corresponding to (step 815). As can be seen from the above explanation, the symbol "@",
The difference between “$” and “%” is due to the difference in the table referred to for extracting the character string data to be replaced and the difference in the extracted data itself.

【０１１７】ここで、図１４に示す入力日本語文１ａの
解析結果として、図２９に示す表形式の中間表現展開テ
ーブルを利用者に表示した際に、それが利用者の意図と
異なる場合、あるいは、単なる利用者の興味本位であっ
ても良いが、いずれにせよ、利用者から解析状況の表示
指示を受けた場合を考える。Here, as a result of analysis of the input Japanese sentence 1a shown in FIG. 14, when the intermediate expression expansion table in the table format shown in FIG. 29 is displayed to the user, if it is different from the user's intention, or Although it may be just the interest of the user, in any case, consider a case where the user receives an instruction to display the analysis status.

【０１１８】利用者から指示された表示内容が「解析失
敗原因」である場合は、ステップ８０３において、メッ
セージモード変数ＭＭを「１」とし、ステップ８０７に
おいて、解析状況データ１４内のすべてのテーブルを参
照し、データが存在するテーブルが存在するか否かを調
べる。図１４の入力日本語１ａの場合は、図３４に示す
ように多義語データ格納テーブルにデータが格納されて
いることが判る。次に、図３２に示すメッセージデータ
１５を参照し、メッセージモード１５０１がメッセージ
モード変数ＭＭの値「１」であり、かつ、解析コード１
４０１が多義語データ格納テーブルの解析コード「２」
であるフィールドのメッセージテンプレート”「＠多義
語文字列＠」は、意味の記述順位に基づいて、「％＠決
定意味コード＠％」の「＄＠決定意味コード＠＄」と解
釈しました。”を抽出する。When the display content instructed by the user is "cause of analysis failure", the message mode variable MM is set to "1" in step 803, and all tables in the analysis status data 14 are set in step 807. Reference is made to check whether or not there is a table in which data exists. In the case of the input Japanese 1a shown in FIG. 14, it is understood that the data is stored in the polysemous word data storage table as shown in FIG. Next, referring to the message data 15 shown in FIG. 32, the message mode 1501 is the value “1” of the message mode variable MM, and the analysis code 1
401 is the analysis code “2” of the polysemous word data storage table
The message template ““ @ polymorphic character string @ ”of the field that is“, ”was interpreted as“ $ @ decision meaning code @ $ ”of“% @ decision meaning code @% ”based on the description order of meaning. "Is extracted.

【０１１９】次に、抽出した前述のテンプレート１５０
２において、記号「＠」で囲まれ、それらに囲まれた内
部に記号「＠」、「＄」、あるいは「％」が存在しない
部分が存在（３ヵ所）するので、ステップ８１１によ
り、解析状況データ１４の多義語データ格納テーブルを
参照して、「＠」で囲まれた、「多義語文字列」、「決
定意味コード」に対応するデータ「日立」、「Ｐ５」を
それぞれ抽出し、「＠多義語文字列＠」、「＠決定意味
コード＠」の部分とそれぞれ置き換える。この時点でテ
ンプレート１５０２は、”「日立」は、意味の記述順位
に基づいて、「％Ｐ５％」の「＄Ｐ５＄」と解釈しまし
た。”となる。同様に、記号「＄」、「％」が存在する
ので、それぞれステップ８１３、ステップ８１５によ
り、図１３に示す操作指示対象定義情報格納テーブル１
３を参照し、Ｐ５に対応する属性項目名文字列および所
属テーブル文字列を抽出し、「＄Ｐ５＄」、「％Ｐ５
％」の部分とそれぞれ置き換える。その結果、テンプレ
ート１５０２は、”「日立」は、意味の優先順位によっ
て、「個人テーブル」の「勤務先」と解釈しました。”
となり、完全な文を生成できたので、図３６に示すよう
にこれを利用者に表示する。Next, the above-mentioned extracted template 150
In FIG. 2, there are portions (3 places) surrounded by the symbol “@”, and the symbol “@”, “$”, or “%” does not exist inside them. By referring to the polysemous word data storage table of the data 14, the data "Hitachi" and "P5", which are surrounded by "@" and correspond to the "polysemous word character string" and the "determination meaning code", are extracted, respectively. Replace with the parts of "@ polymorphic character string @" and "@ decision meaning code @". At this point, the template 1502 has interpreted ““ Hitachi ”as“ $ P5 $ ”of“% P5% ”based on the order of description of meaning. Similarly, since the symbols “$” and “%” are present, the operation instruction target definition information storage table 1 shown in FIG. 13 is obtained in step 813 and step 815, respectively.
3, the attribute item name character string and the belonging table character string corresponding to P5 are extracted, and "$ P5 $" and "% P5" are extracted.
Replace with "%". As a result, the template 1502 interprets ““ Hitachi ”as“ workplace ”in the“ personal table ”according to the priority of meaning. ”
Since the complete sentence has been generated, it is displayed to the user as shown in FIG.

【０１２０】利用者からの表示指定が「解析方法」であ
る場合も同様にして処理を行うことにより、まず、メッ
セージテンプレート１５０２である、”多義語の意味
は、・・・意味コードを優先します。”を得る。このテ
ンプレート１５０２には記号「＠」、「＄」、あるいは
「％」が存在しないので、そのままメッセージとして、
図３７に示すようにこれを利用者に表示する。Even when the display designation from the user is the "analysis method", the same process is performed, and the message template 1502, "The meaning of polysemous words ... Get. Since the template "1502" does not have the symbols "@", "$", or "%", the message as it is,
This is displayed to the user as shown in FIG.

【０１２１】利用者からの表示指定が「対処方法」であ
る場合も同様にして処理を行うことにより、図３８に示
すように利用者に表示する。Even when the display designation from the user is the "handling method", similar processing is performed to display it to the user as shown in FIG.

【０１２２】図１５、図１６の入力日本語文１ａについ
ても同様にして、利用者からの表示指定に応じたメッセ
ージを生成し、表示する。図１５の入力日本語１ａは、
未知語「高給取りの」が存在する。この日本語文１ａの
自然語解析状況について、解析状況データ１４、メッセ
ージデータ１５を参照してメッセージを生成し、利用者
からの表示指示に応じて、図３９、図４０、図４１に示
すように自然語解析状況を利用者に表示する。図１６の
入力日本語１ａは、検索対象が認定できないが、この日
本語文１ａの解析状況について、利用者からの表示指定
が「解析失敗原因」、「解析方法」、「対処方法」であ
る場合に、解析状況データ１４、メッセージデータ１５
を参照してそれぞれメッセージを生成し、それぞれ図４
２、図４３、図４４に示すように利用者に表示する。Similarly for the input Japanese sentence 1a shown in FIGS. 15 and 16, a message corresponding to the display designation from the user is generated and displayed. The input Japanese 1a in FIG. 15 is
There is an unknown word, "high paid". Regarding the natural language analysis situation of the Japanese sentence 1a, a message is generated by referring to the analysis situation data 14 and the message data 15, and according to the display instruction from the user, as shown in FIGS. 39, 40 and 41. Display the natural language analysis status to the user. When the input Japanese 1a shown in FIG. 16 cannot be identified as a search target, the display designation from the user regarding the analysis status of this Japanese sentence 1a is “analysis failure cause”, “analysis method”, and “handling method” In addition, analysis status data 14 and message data 15
Message is generated by referring to FIG.
2, displayed to the user as shown in FIGS. 43 and 44.

【０１２３】図１４に示す入力日本語文「日立の人の年
収は」という文について、利用者が「勤務先が日立製作
所の人の年収は」という意味で入力した場合、図２９に
示すような解析結果を表示された場合に、正しい解析結
果であるとみなすであろう。しかし、「住所が日立市の
人の年収は」という意味で入力した場合は、自分の意図
と異なる解析結果であると認識するであろう。そこで、
解析失敗原因を表示する指示を与えることにより、図３
６に示す情報を得る。即ち、「日立」が多義語であるこ
と、「日立」の意味として「勤務先」の意味が「住所」
の意味に優先していることを把握し、多義語の誤解釈が
原因で解析に失敗したことを認識する。When the user inputs the Japanese sentence "Hitachi's person's annual income is" as shown in FIG. 14 in the sense of "the employee's annual income is that of Hitachi," as shown in FIG. If the analysis result is displayed, it will be regarded as the correct analysis result. However, if you enter "meaning that the address is Hitachi's annual income", you will recognize that the analysis result is different from your intention. Therefore,
By giving an instruction to display the cause of analysis failure,
The information shown in 6 is obtained. That is, "Hitachi" is a polysemous word, and "Hitachi" means "work" as "address".
Recognize that the analysis fails due to misinterpretation of polysemous words.

【０１２４】また、多義語について、どのようにして意
味を決定しているのかを知りたいと利用者が思った場合
には、解析方法を表示する指示を与えることにより、図
３７に示す情報を得る。即ち、「日立」の意味は隣接す
る単語「人」の意味から決定すること、「人」の意味か
ら決定できない場合は、辞書の記述順序により意味を決
定していることを把握する。When the user wants to know how to determine the meaning of a polysemous word, the information shown in FIG. 37 is displayed by giving an instruction to display the analysis method. obtain. That is, it is understood that the meaning of "Hitachi" is determined from the meaning of the adjacent word "person", and if the meaning of "person" cannot be determined, the meaning is determined according to the description order of the dictionary.

【０１２５】さらに、利用者が「日立」を「日立市」の
意味として扱いたいのだがどうすれば良いのかと疑問に
思った場合に、対処方法を表示する指示を与えることに
より、図３８に示す情報を獲得することができる。利用
者は状況に応じてこれらの対処方法の中から選択して処
理操作を行うことにより、次回以降は、より正確にシス
テムに解析をさせることができるようになる。Further, when the user wants to treat “Hitachi” as the meaning of “Hitachi city” and wonders what should be done, by giving an instruction to display a coping method, the information shown in FIG. Can be earned. The user selects one of these coping methods according to the situation and performs a processing operation, so that the system can be made to analyze the system more accurately from the next time onward.

【０１２６】図１５、図１６に示す日本語文についても
図１４の場合と同様のことが言える。The same can be said for the Japanese sentences shown in FIGS. 15 and 16 as in the case of FIG.

【０１２７】ここで、誤解してはならないのは、本実施
例においては、システムは解析において、利用者の意図
を理解している訳ではないので、システムの表示する解
析結果が利用者の意図に合ったものであるか否かは判定
できない。よって、システムが利用者に表示する解析失
敗原因は、あくまで候補であり、よって、冗長な情報が
含まれてしまうことがある。しかしながら、解析に失敗
する箇所をある程度絞ることは可能であり、ある程度の
割合の日本語文については、利用者の意図に外れる原因
となる解析箇所に関する情報を利用者に報知できる。It should be understood that, in this embodiment, the system does not understand the user's intention in the analysis, so the analysis result displayed by the system is the user's intention. It is not possible to determine whether or not it matches Therefore, the cause of the analysis failure displayed by the system to the user is merely a candidate, and thus redundant information may be included. However, it is possible to narrow down the part where the analysis fails to some extent, and it is possible to notify the user of information about the analyzed part that causes the user's intention to be deviated for a certain percentage of Japanese sentences.

【０１２８】以上のように、本実施例によれば、主とし
てデータベース１６の検索を指示する入力日本語文１ａ
を解析し、解析結果を表示した際に、解析結果が利用者
の意図にそぐわない場合に、利用者は、システムによる
日本語解析２において、利用者の意図から外れる可能性
のある解析箇所における、解析方法および解析失敗への
対処方法に関する情報を取得できるので、利用者は、ど
この解析箇所でどのような解析をしたために結果的に自
分の意図にそぐわない解析をしてしまったのかを把握で
き、その結果として、解析結果の修正を効率良く行うこ
とができる。また、次回以降、利用者は前回の解析で習
得した自然語解析情報を活かして自分の意図通りに正確
にシステムが解析できる自然語を入力することができ
る。また、メッセージデータ１５は、プログラムとは独
立であるので、必要に応じて、また、利用者の習熟度に
応じてメッセージを容易に変更できる。As described above, according to this embodiment, the input Japanese sentence 1a mainly instructing the search of the database 16 is input.
When the analysis result is displayed and the analysis result does not meet the user's intention, in the Japanese parsing 2 by the system, the user is Since information about the analysis method and how to deal with the analysis failure can be obtained, the user can understand what kind of analysis was performed at what analysis location, and as a result, the analysis did not match the user's intention. As a result, the analysis result can be corrected efficiently. From the next time onward, the user can input the natural language that the system can analyze exactly as intended by utilizing the natural language analysis information acquired in the previous analysis. Further, since the message data 15 is independent of the program, it is possible to easily change the message as needed and according to the proficiency level of the user.

【０１２９】以下、上記実施例の拡充、変形について述
べる。The expansion and modification of the above embodiment will be described below.

【０１３０】（１）解析失敗原因、解析失敗箇所におけ
る解析方法、解析失敗への対処方法のうちのどれか一種
類が表示されており、かつ、その表示内容が複数種類の
メッセージからなる場合に、複数のメッセージのどれか
一つを利用者が指定し、さらに、現在表示されている表
示内容以外の内容のメッセージを表示指示した場合に、
利用者が指定したメッセージに対応する解析失敗箇所に
関するメッセージ情報のみを利用者に報知する。(1) When any one of the cause of the analysis failure, the analysis method at the analysis failure location, and the method of coping with the analysis failure is displayed, and the display content includes a plurality of types of messages. , If the user specifies one of multiple messages and further instructs to display a message other than the currently displayed display content,
The user is notified of only the message information related to the analysis failure location corresponding to the message specified by the user.

【０１３１】図４５に示す日本語文１ａが入力されたと
する。この入力日本語文１ａには、多義語「日立」が存
在し、かつ、検索対象２００２が省略されている。前述
の日本語解析処理２および解析結果表示処理３によっ
て、図４６に示す解析結果を表示する。ここで、利用者
が解析失敗原因を表示指示した場合、解析状況メッセー
ジ生成・表示処理８によって、図４７に示すメッセージ
を表示する。ここで、図４７に表示された２種類のメッ
セージのうち、１番目のメッセージについて、その解析
方法が知りたい場合に、まず、１番目のメッセージを選
択し、次に、解析方法の表示を指示することによって、
図４８に示すように、解析失敗原因メッセージの１番目
の内容に対応する解析方法メッセージのみを利用者に表
示する。これは、利用者の選択したメッセージに対応す
る解析コード１４０１および解析状況データ１４内の単
語データを参照することで容易に実現可能である。これ
によって、利用者にとって必要な情報のみを表示するこ
とができ、利用者が該当する情報をさがす手間が省け
る。It is assumed that the Japanese sentence 1a shown in FIG. 45 is input. In this input Japanese sentence 1a, the polysemous word “Hitachi” exists, and the search target 2002 is omitted. The analysis result shown in FIG. 46 is displayed by the Japanese analysis process 2 and the analysis result display process 3 described above. Here, when the user gives an instruction to display the cause of the analysis failure, the analysis status message generation / display process 8 displays the message shown in FIG. Here, of the two types of messages displayed in FIG. 47, if you want to know the analysis method for the first message, first select the first message, and then instruct to display the analysis method. By,
As shown in FIG. 48, only the analysis method message corresponding to the first content of the analysis failure cause message is displayed to the user. This can be easily realized by referring to the analysis code 1401 corresponding to the message selected by the user and the word data in the analysis situation data 14. As a result, only the information necessary for the user can be displayed, and the user can save the trouble of finding the relevant information.

【０１３２】あるいは、解析失敗原因、解析方法、対処
方法を同時に表示し、ある一つの表示内容について、あ
るメッセージを選択すると、それに対応する他の表示内
容のメッセージが他と異なる態様で表示されるという方
法でも同様の効果を得ることができる。Alternatively, when the cause of analysis failure, the analysis method, and the coping method are displayed at the same time, and when a certain message is selected for one display content, the message of the other display content corresponding to that is displayed in a different mode from the other. The same effect can be obtained by this method.

【０１３３】（２）表示した解析結果に対して、利用者
が自分の意図と異なる部分を指定し、次に、解析状況メ
ッセージを表示指示すると、利用者の指定した部分に関
する解析状況メッセージのみを利用者に表示する。ま
た、解析結果を表示する際には、利用者に表示可能な解
析状況メッセージに対応する解析結果構成要素を他と異
なる態様で表示する。また、解析結果の代わりに入力日
本語文そのものを表示して同様のことを行っても良い。(2) In the displayed analysis result, when the user specifies a part different from his intention and then instructs to display an analysis status message, only the analysis status message concerning the part specified by the user is displayed. Display to the user. Further, when displaying the analysis result, the analysis result component corresponding to the analysis status message that can be displayed to the user is displayed in a manner different from the others. Also, instead of the analysis result, the input Japanese sentence itself may be displayed and the same operation may be performed.

【０１３４】表示した解析結果において、解析失敗箇所
に相当する部分は解析結果を構成する各要素の文節番号
と、解析状況データに格納される「対応文節番号（１４
０２）」とのマッチングによって認定できる。ただし、
本実施例で、未知語については、解析結果にその情報が
反映されないので、解析状況データ１４において未知語
文節の「対応文節番号（１４０２）」に対応する解析結
果構成要素の番号として、日本語文において当該未知語
の前後に位置する文節の文節番号を採用する。図４５に
示す日本語文１ａについて、図４９に示すように解析結
果を表示する。図４５の日本語文１ａは、多義語「日
立」が存在し、かつ、検索対象が省略されている。図５
０に示す解析状況データ１４内の多義語データ格納テー
ブルにおいて、「日立」の「対応文節番号」が「１」で
あるので、図４９に示すように、入力日本語文において
１番目の文節である「日立の」に対応する解析結果構成
要素である、「日立製作所」を他と異なる態様で表示す
る。図５１に示す解析状況データ１４内の検索対象デー
タ格納テーブルにおいて、文末に追加された検索対象の
対応文節番号とが「３」であるので、図４９に示すよう
に、入力日本語文において３番目の文節（解析途中で挿
入された文節）である「人名」に対応する解析結果構成
要素である、「人名」を他と異なる態様で表示する。In the displayed analysis result, the portion corresponding to the analysis failure portion is the clause number of each element constituting the analysis result and the "corresponding clause number (14
02) ”. However,
In this embodiment, the unknown word is not reflected in the analysis result, so that the Japanese sentence is used as the number of the analysis result component corresponding to the “corresponding phrase number (1402)” of the unknown word phrase in the analysis situation data 14. In, the phrase numbers of the phrases located before and after the unknown word are adopted. For the Japanese sentence 1a shown in FIG. 45, the analysis result is displayed as shown in FIG. In the Japanese sentence 1a of FIG. 45, the polysemous word “Hitachi” exists and the search target is omitted. Figure 5
In the polysemous word data storage table in the analysis situation data 14 shown in 0, since the "corresponding phrase number" of "Hitachi" is "1", it is the first phrase in the input Japanese sentence, as shown in FIG. "Hitachi," which is an analysis result component corresponding to "Hitachi's", is displayed in a manner different from the others. In the search target data storage table in the analysis situation data 14 shown in FIG. 51, the corresponding phrase number of the search target added at the end of the sentence is “3”, so as shown in FIG. "Person name" which is an analysis result constituent element corresponding to "Person name" which is the phrase (phrase inserted during analysis) is displayed in a manner different from the others.

【０１３５】図４９を表示することによって、利用者
は、「日立製作所」、「人名」の部分に何らかの解析失
敗を起こしている可能性があるということを把握でき
る。そして、その箇所が自分の意図にそぐわない解析箇
所と一致する場合は、その箇所を選択することによっ
て、その箇所に関する解析状況メッセージを参照するこ
とができる。これによって、利用者の参照したい解析結
果要素に関する解析状況メッセージのみを表示すること
ができ、利用者の処理効率を向上させることができる。By displaying FIG. 49, the user can understand that there is a possibility that some sort of analysis failure has occurred in the "Hitachi Ltd." and "person name" parts. Then, if the location matches an analysis location that does not match the user's intention, by selecting the location, the analysis status message regarding the location can be referenced. As a result, only the analysis status message relating to the analysis result element that the user wants to refer to can be displayed, and the processing efficiency of the user can be improved.

【０１３６】（３）意味解析結果がシステムに許容され
ない場合に、メッセージを出力するが、そのメッセージ
に対応して施された処置に対応する動作と、入力した自
然語を記憶し、新たに入力した自然語の意味が以前に記
憶した自然語の意味と等価である場合に、以前に記憶し
た自然語に対応して記憶した動作を実行する。(3) When the result of the semantic analysis is not acceptable to the system, a message is output, but the action corresponding to the action performed corresponding to the message and the input natural language are stored and newly input. If the meaning of the stored natural language is equivalent to the previously stored meaning of the natural language, the stored operation is executed corresponding to the previously stored natural language.

【０１３７】例えば、図１６の日本語文を入力とした場
合に、検索対象が存在しないので、このままでは、シス
テムで許容されない。そこで、図５２に示すメッセージ
を生成、表示し、システムで許容されない旨を報知す
る。利用者は、修正エディタを用いて、検索対象を定義
する。その際に、定義された検索対象と、図１６の自然
語を対応させて図５３のようなテーブルに格納する。For example, when the Japanese sentence of FIG. 16 is input, there is no search target, so the system is not allowed as it is. Therefore, the message shown in FIG. 52 is generated and displayed to inform that the system does not allow it. The user uses the modification editor to define the search target. At that time, the defined search target and the natural language shown in FIG. 16 are stored in the table as shown in FIG.

【０１３８】次に、新たな入力として、図１６が再び入
力されたときには、検索対象が存在しないことが判った
（ステップ２０４０７）後で、図５３のテーブルを参照
して入力された自然語とテーブルに格納された自然語と
のマッチングを取っていき、マッチしたならば、その自
然語に対応している検索対象「氏名」を新たな入力にお
ける検索対象として補う。Next, when FIG. 16 is input again as a new input, it is found that the search target does not exist (step 20407), and then the natural language input by referring to the table of FIG. The matching with the natural language stored in the table is performed, and if there is a match, the search target “name” corresponding to the natural language is supplemented as the search target in the new input.

【０１３９】ここでは、自然語と自然語とのマッチング
を用いたが、テーブルに格納する際に、自然語の解析結
果を自然ごとともに格納しておけば、さらに柔軟なマッ
チングが実現できる。例えば、図１６の日本語文の解析
結果を図５４に示すように格納する。次に新たな入力と
して、図５５に示す自然語が入力されたとする。図５５
の文も検索対象が存在しないため、このままでは、シス
テムで許容できない。そこで、ステップ２０４０７の
後、図５４のテーブルを参照する。図１６の日本語で参
照しているデータベーステーブルと、図５５の日本語で
参照しているデータベーステーブルが同一である（個人
テーブル）ことから、省略されている検索対象は、以前
に利用者が定義したもの（図５４の「氏名」）と同一で
あると推定することができ、検索対象「氏名」を新たな
入力における検索対象として補う。Here, the matching between the natural language and the natural language is used, but when storing the analysis result of the natural language together with the natural language when storing in the table, more flexible matching can be realized. For example, the analysis result of the Japanese sentence in FIG. 16 is stored as shown in FIG. Next, it is assumed that the natural language shown in FIG. 55 is input as a new input. FIG. 55
Since there is no search target for the sentence of, the system cannot accept it as it is. Therefore, after step 20407, the table of FIG. 54 is referred to. Since the database table referred to in Japanese in FIG. 16 is the same as the database table referred to in Japanese in FIG. 55 (personal table), the omitted search target is the one previously searched by the user. It can be estimated that it is the same as the defined one (“name” in FIG. 54), and the search target “name” is supplemented as the search target in a new input.

【０１４０】このように、利用者が一度処置した内容を
記憶し、次回以降で利用することによって、利用者の処
置を反映した解析ができ、システムの使い勝手が向上す
る。As described above, by storing the contents of the treatment once performed by the user and using it after the next time, the analysis reflecting the treatment of the user can be performed and the usability of the system is improved.

【０１４１】（４）上記実施例では主として情報検索シ
ステムに適用した場合について述べたが、これに限ら
ず、他の自然語解析システムや自然語インタフェースに
も適用することができる。(4) In the above embodiment, the case where the invention is mainly applied to the information retrieval system has been described, but the invention is not limited to this and can be applied to other natural language analysis systems and natural language interfaces.

【０１４２】[0142]

【発明の効果】本発明では、入力された自然語の意味解
析結果で特定される部分の自然語を含むメッセージを出
力することにより、利用者は、意味解析結果に対応する
動作を実行する前に、意味解析結果の確かさを確認する
ことができ、誤った意味解析結果に応じた不要な動作の
実行を防ぐことができるので、利用者の作業効率が向上
する。According to the present invention, by outputting a message containing a natural language of a part specified by the input semantic analysis result of the natural language, the user can execute the operation corresponding to the semantic analysis result. In addition, since the accuracy of the semantic analysis result can be confirmed and the unnecessary operation according to the incorrect semantic analysis result can be prevented, the work efficiency of the user is improved.

【０１４３】また、本発明では、意味解析結果に対応す
る動作がシステムで許容されない場合には、その旨を動
作前に知ることができ、また、それに対応する処置に関
するメッセージを得ることができるので、システムにと
っての不正入力を防止することができる。Further, according to the present invention, when the system does not allow the action corresponding to the result of the semantic analysis, it is possible to know that fact before the action, and the message regarding the corresponding action can be obtained. It is possible to prevent illegal input for the system.

【０１４４】さらに、本発明では、自然語の意味解析が
失敗する可能性のある解析箇所を認定し、利用者の要求
に応じて、その解析箇所における解析失敗原因、その解
析箇所における解析方法（解析方針）、その解析失敗へ
の対処方法に関する情報を報知できるので、利用者は、
解析結果と自分の意図とが食い違っている場合に、何故
システムがそのような解析をするのかを容易に把握でき
る。また、これらの情報を用いることによって、利用者
は解析結果の修正を効率良く行うことができ、さらに、
次回以降、システムが正確に解析できる日本語文を作成
することができる。Further, according to the present invention, the analysis location where the semantic analysis of the natural language may fail is identified, and the cause of the analysis failure at the analysis location and the analysis method at the analysis location (according to the user's request) (Analysis policy) and information on how to deal with the analysis failure can be notified, so the user can
When the analysis result and the user's intention are inconsistent, it is easy to understand why the system performs such analysis. Also, by using these information, the user can efficiently correct the analysis result, and
From the next time onward, you can create Japanese sentences that the system can analyze accurately.

[Brief description of drawings]

【図１】実施例の処理概要を表す図である。FIG. 1 is a diagram illustrating a process outline of an embodiment.

【図２】実施例のハードウエア構成を示すブロック図で
ある。FIG. 2 is a block diagram showing a hardware configuration of an embodiment.

【図３】日本語解析処理の概要ＰＡＤ図である。FIG. 3 is a schematic PAD diagram of Japanese analysis processing.

【図４】用語認定処理のＰＡＤ図である。FIG. 4 is a PAD diagram of term recognition processing.

【図５】多義解消処理のＰＡＤ図である。FIG. 5 is a PAD diagram of ambiguous resolution processing.

【図６】修飾表現認定処理のＰＡＤ図である。FIG. 6 is a PAD diagram of qualified expression recognition processing.

【図７】中間表現生成処理のＰＡＤ図である。FIG. 7 is a PAD diagram of an intermediate expression generation process.

【図８】解析結果表示処理のＰＡＤ図である。FIG. 8 is a PAD diagram of analysis result display processing.

【図９】中間表現展開テーブル生成処理のＰＡＤ図であ
る。FIG. 9 is a PAD diagram of intermediate expression development table generation processing.

【図１０】中間表現展開テーブル表示処理のＰＡＤ図で
ある。FIG. 10 is a PAD diagram of an intermediate expression development table display process.

【図１１】解析状況メッセージ生成・表示処理のＰＡＤ
図である。FIG. 11: PAD of analysis status message generation / display processing
It is a figure.

【図１２】単語辞書の一例の構成図である。FIG. 12 is a configuration diagram of an example of a word dictionary.

【図１３】操作指示対象定義情報格納テーブルの一例の
構成図である。FIG. 13 is a configuration diagram of an example of an operation instruction target definition information storage table.

【図１４】情報検索のための操作指示を表した入力日本
語文の一例である。FIG. 14 is an example of an input Japanese sentence representing an operation instruction for information retrieval.

【図１５】情報検索のための操作指示を表した入力日本
語文の他の一例である。FIG. 15 is another example of an input Japanese sentence representing an operation instruction for information retrieval.

【図１６】情報検索のための操作指示を表した入力日本
語文のさらに他の一例である。FIG. 16 is still another example of an input Japanese sentence representing an operation instruction for information retrieval.

【図１７】入力自然語文の認定結果を表した文節テーブ
ルの一例の構成図である。FIG. 17 is a block diagram of an example of a clause table showing the recognition result of an input natural language sentence.

【図１８】入力自然語文の認定結果を表した文節テーブ
ルの他の一例の構成図である。FIG. 18 is a configuration diagram of another example of a clause table showing the recognition result of an input natural language sentence.

【図１９】入力自然語文の認定結果を表した文節テーブ
ルのさらに他の一例の構成図である。FIG. 19 is a configuration diagram of still another example of the clause table showing the recognition result of the input natural language sentence.

【図２０】多義解消処理後の文節テーブルの一例の構成
図である。FIG. 20 is a configuration diagram of an example of a clause table after the ambiguous resolution processing.

【図２１】多義解消処理後の文節テーブルの他の一例の
構成図である。FIG. 21 is a configuration diagram of another example of the clause table after the ambiguous resolution processing.

【図２２】多義解消処理後の文節テーブルのさらに他の
一例の構成図である。[Fig. 22] Fig. 22 is a configuration diagram of still another example of the clause table after the ambiguous resolution processing.

【図２３】中間表現の一例である。FIG. 23 is an example of an intermediate representation.

【図２４】中間表現の他の一例である。FIG. 24 is another example of the intermediate representation.

【図２５】中間表現のさらに他の一例である。FIG. 25 is still another example of the intermediate representation.

【図２６】中間表現展開テーブルの一例の構成図であ
る。FIG. 26 is a configuration diagram of an example of an intermediate expression development table.

【図２７】中間表現展開テーブルの他の一例の構成図で
ある。FIG. 27 is a configuration diagram of another example of the intermediate expression development table.

【図２８】中間表現展開テーブルのさらに他の一例の構
成図である。FIG. 28 is a configuration diagram of still another example of the intermediate expression development table.

【図２９】利用者への解析結果表示の一例である。FIG. 29 is an example of an analysis result display to the user.

【図３０】利用者への解析結果表示の他の一例である。FIG. 30 is another example of displaying the analysis result to the user.

【図３１】利用者への解析結果表示のさらに他の一例で
ある。FIG. 31 is still another example of displaying the analysis result to the user.

【図３２】メッセージデータテーブルの一例の構成図で
ある。FIG. 32 is a configuration diagram of an example of a message data table.

【図３３】解析状況データを構成する未知語データ格納
テーブルの一例の構成図である。FIG. 33 is a configuration diagram of an example of an unknown word data storage table that constitutes analysis situation data.

【図３４】解析状況データを構成する多義語データ格納
テーブルの一例の構成図である。FIG. 34 is a configuration diagram of an example of a polysemous word data storage table that constitutes analysis situation data.

【図３５】解析状況データを構成する検索対象データ格
納テーブルの一例の構成図である。FIG. 35 is a configuration diagram of an example of a search target data storage table that constitutes analysis status data.

【図３６】入力日本語文１０１の解析失敗原因を表すメ
ッセージ表示例である。FIG. 36 is an example of a message display showing the cause of analysis failure of the input Japanese sentence 101.

【図３７】入力日本語文１０１の解析失敗箇所における
解析方法を表すメッセージ表示例である。FIG. 37 is a message display example showing an analysis method at an analysis failure portion of the input Japanese sentence 101.

【図３８】入力日本語文１０１の解析失敗箇所に対する
対処方法を表すメッセージ表示例である。FIG. 38 is a message display example showing a coping method for an analysis failure part of the input Japanese sentence 101.

【図３９】入力日本語文１０１の解析失敗原因を表すメ
ッセージ表示例である。FIG. 39 is an example of a message display showing the cause of analysis failure of the input Japanese sentence 101.

【図４０】入力日本語文１０１の解析失敗箇所における
解析方法を表すメッセージ表示例である。FIG. 40 is a message display example showing an analysis method at an analysis failure portion of the input Japanese sentence 101.

【図４１】入力日本語文１０１の解析失敗箇所に対する
対処方法を表すメッセージ表示例である。FIG. 41 is a message display example showing a method of coping with a failed analysis portion of the input Japanese sentence 101.

【図４２】入力日本語文１０１の解析失敗原因を表すメ
ッセージ表示例である。FIG. 42 is a message display example showing the cause of failure in analysis of the input Japanese sentence 101.

【図４３】入力日本語文１０１の解析失敗箇所における
解析方法を表すメッセージ表示例である。FIG. 43 is a message display example showing an analysis method at an analysis failure portion of the input Japanese sentence 101.

【図４４】入力日本語文１０１の解析失敗箇所に対する
対処方法を表すメッセージ表示例である。FIG. 44 is a message display example showing a coping method for an analysis failure portion of the input Japanese sentence 101.

【図４５】情報検索のための操作指示を表した入力日本
語文のさらに他の例である。FIG. 45 is still another example of an input Japanese sentence representing an operation instruction for information retrieval.

【図４６】利用者への解析結果表示の他の例である。FIG. 46 is another example of displaying the analysis result to the user.

【図４７】入力日本語文１０４の解析失敗原因を表すメ
ッセージ表示例である。FIG. 47 is a message display example showing the cause of failure in analysis of the input Japanese sentence 104.

【図４８】入力日本語文１０４の解析失敗箇所における
解析方法を表すメッセージ表示例である。FIG. 48 is a message display example showing an analysis method at an analysis failure portion of the input Japanese sentence 104.

【図４９】利用者への解析結果表示の他の例である。FIG. 49 is another example of displaying the analysis result to the user.

【図５０】解析状況データを構成する多義語データ格納
テーブルのさらに他の例の構成図である。[Fig. 50] Fig. 50 is a configuration diagram of still another example of the polysemous word data storage table forming the analysis situation data.

【図５１】解析状況データを構成する検索対象データ格
納テーブルのさらに他の例の構成図である。[Fig. 51] Fig. 51 is a configuration diagram of still another example of the search target data storage table forming the analysis situation data.

【図５２】メッセージデータテーブルの他の例の構成図
である。FIG. 52 is a configuration diagram of another example of the message data table.

【図５３】日本語と検索対象とを対応させて格納するテ
ーブルの一例の構成図である。[Fig. 53] Fig. 53 is a configuration diagram of an example of a table that stores Japanese and search targets in association with each other.

【図５４】日本語の解析結果と検索対象とを対応させて
格納するテーブルの一例の構成図である。[Fig. 54] Fig. 54 is a configuration diagram of an example of a table that stores Japanese analysis results and search targets in association with each other.

【図５５】情報検索のための操作指示を表した入力日本
語文のさらに他の一例である。[Fig. 55] Fig. 55 is still another example of the input Japanese sentence representing an operation instruction for information retrieval.

[Explanation of symbols]

２：日本語解析処理部、３：解析結果表示処理部、８：
解析状況メッセージ生成・表示処理部、１０：解析結果
修正処理部、１１：コマンド生成・実行処理部2: Japanese analysis processing unit, 3: Analysis result display processing unit, 8:
Analysis status message generation / display processing unit, 10: analysis result correction processing unit, 11: command generation / execution processing unit

───────────────────────────────────────────────────── フロントページの続き (72)発明者和歌山哲神奈川県横浜市戸塚区戸塚町5030番地株式会社日立製作所ソフトウェア開発本部内 ─────────────────────────────────────────────────── ─── Continuation of the front page (72) Inventor Satoshi Wakayama 5030 Totsuka-cho, Totsuka-ku, Yokohama-shi, Kanagawa Prefecture Hitachi Ltd. Software Development Division

Claims

[Claims]

1. In a natural language processing system that operates in accordance with the meaning of an input natural language, regarding the part identified as poorly analyzed by the semantic analysis of the input natural language, the nature of the identified part A natural language processing system, comprising: output means for outputting a message including a word, and operating in response to a response input to the output message.

2. The natural language processing system according to claim 1, wherein said output means outputs said message in response to an output request from an input device.

3. The natural language processing system according to claim 1, wherein the output means outputs the input natural language and the message in association with each other.

4. The natural language processing system according to claim 3, wherein the output means outputs the natural language of the specified part included in each of the input natural language and the message in a mode different from the others. A natural language processing system characterized by:

5. The natural language processing system according to claim 1, wherein the specified portion is an unknown word not registered in the dictionary of the natural language processing system.

6. The natural language processing system according to claim 5, further comprising means for registering the unknown word in the dictionary in response to a request from an input device.

7. A natural language processing system that operates in accordance with the meaning of an input natural language, and detects that the operation corresponding to the analysis result of the input natural language is not allowed in the natural language processing system. A natural language processing system, comprising: means and, in response to the detection result, means for notifying that an operation corresponding to the meaning of the input natural language is not permitted.

8. The natural language processing system according to claim 7, wherein the informing means includes means for outputting information on a treatment corresponding to the detection result.

9. The natural language processing system according to claim 7, wherein the operation of the natural language processing system corresponding to the treatment performed in response to the notification and the input natural language are stored in association with each other. And a newly input natural language meaning is equivalent to the stored natural language meaning, the natural language processing system executes the stored operation corresponding to the stored natural language. .

10. A means for inputting a natural language consisting of at least one word representing an operation instruction, a means for analyzing the natural language, a means for converting the natural language analysis result into an operation instruction command language, and the operation. In a natural language processing system including a means for executing an instruction command and an output means, a means for recognizing an analysis location where the natural language analysis may fail, an analysis failure cause at the analysis location, and the analysis location A natural language processing system, comprising: an output method for notifying information about at least one of the analysis method and the method for coping with the failure in analysis.