JPH0322140A

JPH0322140A - Knowledge refinement device

Info

Publication number: JPH0322140A
Application number: JP1157154A
Authority: JP
Inventors: Masumi Nomura; 真澄野村
Original assignee: Mitsubishi Heavy Industries Ltd
Current assignee: Mitsubishi Heavy Industries Ltd
Priority date: 1989-06-20
Filing date: 1989-06-20
Publication date: 1991-01-30

Abstract

PURPOSE:To reduce time required for refining knowledge by analyzing and understanding knowledge data represented by natural language by using a dictionary, retrieving a knowledge base based on this result, and executing the confirmation of the rightness of the knowledge base and the supplement of insufficient knowledge. CONSTITUTION:A word, an idiom or a part of the word (for instance, the root of a word), etc., in an object area are registered in the dictionary 4 together with the kind of the word and relation to other words, and further, restriction to the order of the kind of the word is prescribed flexibly, and the analysis and the apprehension of the data represented by the natural language are executed by using these, and the knowledge base 8 is refined based on this result. Accordingly, the meaning of the data represented by the natural language and stored in the knowledge base 8 is understood. Thus, possibility to store the data wrong in viewpoint of its contents in the knowledge base 8 is reduced, and the necessity to confirm the rightness of the knowledge base 8 decreases, and the time required for refining the knowledge base 8 can be reduced.

Description

【発明の詳細な説明】［産業上の利用分野］本発明は自然言語表現記述のデータを格納した知識ベー
ス内の知識データの誤記述の訂正、不足知識データの追
加に適用される知識洗練装置に関する。[Detailed Description of the Invention] [Field of Industrial Application] The present invention is a knowledge refinement device that is applied to correct misdescriptions of knowledge data and add missing knowledge data in a knowledge base that stores data of natural language expression descriptions. Regarding.

［従来の技術］従来、この種の知識データの誤りの発見、不足した知識
データの充足は、すべてナレッジエンジニアなる人間の
手に委ねられていた。[Prior Art] Conventionally, the discovery of errors in this type of knowledge data and the filling of missing knowledge data have all been left to the hands of knowledge engineers.

［発明が解決しようとする課題］エキスパートより引き出した知識をナレッジエンジニア
が理解し、構築した知識ベースに関して、内容的に誤っ
たデータを知識ベースに登録する可能性がある。このた
め知識ベースの正当性を直接的に、あるいは推論結果を
見て、確認する必要があり、知識ベースの洗練に多大の
時間を要していた。[Problem to be solved by the invention] When a knowledge engineer understands the knowledge extracted from an expert and builds a knowledge base, there is a possibility that data with incorrect contents may be registered in the knowledge base. Therefore, it is necessary to confirm the validity of the knowledge base directly or by looking at the inference results, and it takes a lot of time to refine the knowledge base.

本発明は上記のような点に鑑みなされたもので、自然言
語表現記述のデータを格納した知識ベースにおいて、誤
登録の可能性を減らし、知識ベースの正当性の確認を低
減して、知識ベースの洗練に要する時間を削減可能とす
る知識洗練装置を提供することを目的とする。The present invention has been made in view of the above points, and it reduces the possibility of erroneous registration in a knowledge base storing data of natural language expression descriptions, reduces the need to confirm the validity of the knowledge base, and improves the knowledge base. An object of the present invention is to provide a knowledge refinement device that can reduce the time required for refinement.

［課題を解決するための手段］すなわち、本発明に係る知識洗練装置は、対象領域にお
ける単語、熟語、あるいは単語の一部（例えば語幹）等
を、単語の種類、他の単語等との関係とともに辞書に登
録し、さらに単語の種類の順序に関する制約を柔軟に規
定し、これらを用いて自然言語表現のデータの解析、理
解を行ない、この結果を踏まえて知識ベースを洗練する
ものである。[Means for Solving the Problems] In other words, the knowledge refinement device according to the present invention analyzes words, idioms, or part of words (for example, word stems) in a target domain based on the type of word, the relationship with other words, etc. Furthermore, constraints on the order of word types can be flexibly defined, data on natural language expressions can be analyzed and understood using these constraints, and the knowledge base can be refined based on the results.

［作用］上記の構成によれば、知識ベースに格納されている自然
言語表現データの意味理解が行なわれる。これにより内
容的に誤っ゛たデータを知識ベースに登録する可能性が
減少し、知識ベースの正当性の確認の必要性が低減し、
知識ベースの洗練に要する時間を削減できる。[Operation] According to the above configuration, the meaning of the natural language expression data stored in the knowledge base is understood. This reduces the possibility of registering data with incorrect content in the knowledge base, reduces the need to confirm the validity of the knowledge base, and
Reduces the time required to refine the knowledge base.

［実施例］以下、図面を参照して本発明の一実施例を説明する。[Example] Hereinafter, one embodiment of the present invention will be described with reference to the drawings.

第１図に本発明の実施例を示す。知識洗練装置は、単語
抽出装置１、構文解析装置２、知識データ生成装置３、
第１のメモリ４、第２のメモリ５、第３のメモリ６、ワ
ーキングメモリ７、知識ベース８、入力装置９、及び出
力装置１０から構成されている。FIG. 1 shows an embodiment of the present invention. The knowledge refinement device includes a word extraction device 1, a syntax analysis device 2, a knowledge data generation device 3,
It is composed of a first memory 4, a second memory 5, a third memory 6, a working memory 7, a knowledge base 8, an input device 9, and an output device 10.

第２、第３、第４図には、一実施例としての第１のメモ
リ４、第２のメモリ５、第３のメモリ６の構成図を示す
。また、第５、第６図には、一実施例としての、ワーキ
ングメモリ７、知識ベース８の構戊図を示す。FIGS. 2, 3, and 4 show configuration diagrams of a first memory 4, a second memory 5, and a third memory 6 as one embodiment. Further, FIGS. 5 and 6 show configuration diagrams of the working memory 7 and the knowledge base 8 as one embodiment.

第１のメモリ４に格納されている辞書には、単語データ
４０を第２図に示した形式で記述されている。以下に具
体的な記述内容を説明する。In the dictionary stored in the first memory 4, word data 40 is described in the format shown in FIG. The specific contents of the description will be explained below.

単語名部４１は、単語を記述する。同義語名部４２は、
当該単語の同義語を記述する。反意語名部４３は、当該
単語の反意語を記述する。上位単語名部４４は、当該単
語の意味するものが物理的な物であればこれを物理的に
包含する物理的な物に対応する単語を、当該単語の意味
するものが抽象的概念であれば、これを意味的に包含す
る抽象的概念に対応する単語を記述する。下位単語名部
４５は、当該単語の意味するものが物理的な物であれば
これが物理的に包含する物理的な物に対応する単語を、
当該単語の意味するものが抽象的概念であれば、これが
意味的に包含する抽象的概念に対応する単語を、その単
語区分名とともに記述する。ルール名部４６は、当該単
語が出現したときに評価されるルール名を記述する。単
語区分名部４７は、当該単語の単語区分名を記述する。The word name section 41 describes a word. The synonym name section 42 is
Write a synonym for the word. The antonym name section 43 describes the antonym of the word. The higher-level word name section 44 includes a word corresponding to a physical object that physically includes the word if the word means a physical object, or a word corresponding to a physical object that physically includes the word, even if the word means an abstract concept. For example, write a word that corresponds to an abstract concept that semantically includes this concept. If the meaning of the word is a physical thing, the lower word name section 45 contains the word corresponding to the physical thing that this word physically includes.
If the meaning of the word is an abstract concept, write the word corresponding to the abstract concept that this word semantically includes, along with its word category name. The rule name section 46 describes the rule name to be evaluated when the word appears. The word category name section 47 describes the word category name of the word.

第２のメモリ５に格納されているテンブレートノード５
０は、第３図に示した形式で記述されている。以下に具
体的な記述内容を説明する。Template node 5 stored in second memory 5
0 is written in the format shown in FIG. The specific contents of the description will be explained below.

ノード名部５１は、当該テンブレートノードのノード名
を記述する。通常、これには単語区分名を用いる。上流
ノード名部５２は、当該テンブレートノードのノード名
に対応する単語区分名を属性として有する単語より前に
出現する単語の単語区分名に対応するテンブレートノー
ドのノード名を記述する。下流ノード名部５３は、当該
テンブレートノードのノード名に対応する単語区分名を
属性として有する単語より後に出現する単語の単語区分
名に対応するテンブレートノードのノード名を記述する
。The node name section 51 describes the node name of the template node. Usually, word category names are used for this. The upstream node name section 52 describes the node name of the template node corresponding to the word category name of the word that appears before the word that has the word category name corresponding to the node name of the template node as an attribute. The downstream node name section 53 describes the node name of the template node corresponding to the word category name of the word that appears after the word that has the word category name corresponding to the node name of the template node as an attribute.

第３のメモリ６に格納されているルールは、第４図に示
したルールセットと称する４つのグループに分割して格
納されている。以下、各ルールセットに格納されている
ルールの一例を示す。The rules stored in the third memory 6 are divided into four groups called rule sets shown in FIG. An example of the rules stored in each rule set is shown below.

第１のルールセット６１ Φ　　ノレ　ー　ノレ　Ａ接続詞的な単語を有する文は、その接続詞的な単語の前
後でテンプレートノードにより制約される語順規定を犯
すが、これを黙認する。First Rule Set 61 Φ Nore Nore A Sentences with conjunction-like words violate the word order regulations constrained by template nodes before and after the conjunction-like word, but this is tacitly accepted.

●ルールＢ数量に関する単語は、文の最初か、あるいは主語より後
で副詞、動詞、形容動詞より前に位置せねばならない。●Rule B: Quantity words must appear at the beginning of a sentence, or after the subject and before adverbs, verbs, and adjectives.

第２のルールセット６２ ●ルールＣ “及び”や”または”等の接続詞的に使用する単語は、
その前後の単語区分名を参考にして、自然言語表現デー
タから省略されていると見做される単語を補完する。Second rule set 62 ●Rule C Words used as conjunctions such as “and”, “or”, etc.
Words that are considered to be omitted from the natural language expression data are complemented by referring to the word category names before and after the words.

・ルールＤ本来２つの単語が接続詞的な単語により結合され、さら
にそれを簡略化したと見做せる単語（例えば１出入口”
は“出口及び入口”と見做してよい）は、その元の簡略
化していない表現に戻す。・Rule D: Two words are originally connected by a conjunction, and a word that can be seen as a simplification (for example, 1 entrance/exit)
may be considered "exits and inlets") are returned to their original unsimplified representations.

●ルールＥ “単語１＋否定語”は単語１の反意語に置換する。●Rule E “Word 1 + negative word” is replaced with an antonym of word 1.

第３のルールセット６３ ●ルールＦ単語区分名１の単語は、テンプレートノード名１に位置
する単語として単語１あるいは単語２を持たねばならな
い。Third rule set 63 ●Rule F The word of word category name 1 must have word 1 or word 2 as the word located in template node name 1.

争ノレーノレＧ単語区分名１の単語は、テンプレートノード名１に位置
する単語として単語区分名２の単語を持ってはならない
。Conflict Nore Nore G A word with word category name 1 must not have a word with word category name 2 as a word located in template node name 1.

第４のルールセット６４ ◆ルールＨ単語１（例えば“あるいは”）′を含むデータは、その
前後に位置し、かつ語順テンプレートを満たす最も長い
単語列データを交換しても、自然言語上の意味は何等変
わらない。Fourth rule set 64 ◆Rule H Data containing word 1 (for example, "or")' has no meaning in natural language even if the longest word string data that is located before and after it and satisfies the word order template is exchanged. is no different.

●ルール！数量に関する単語は、文の最初にあっても、あるいは主
語より後で副詞、動詞、形容動詞より前にあっても、自
然言語上の意味は何等変わらない。●Rules! Quantitative words have the same meaning in natural language whether they appear at the beginning of a sentence, or after the subject but before adverbs, verbs, or adjectives.

次に、本発明の実施例の処理手順を、一具体例としての
ＰＷＲプラントで用いられる用語を用いて、プロダクシ
ョンルールに適用したケースを解説する。Next, a case will be explained in which the processing procedure of the embodiment of the present invention is applied to a production rule using terminology used in a PWR plant as a specific example.

既に、知識ベース８に自然言語表現の条件部と帰結部か
らなるプロダクションルール「ルール１」及び「ルール
２」が格納されているとする（第７図参照）。このとき
、本発明の実施例は以下の手順で処理がなされる。It is assumed that the knowledge base 8 has already stored production rules "Rule 1" and "Rule 2" consisting of a condition part and a consequence part of natural language expression (see FIG. 7). At this time, in the embodiment of the present invention, processing is performed in the following steps.

知識洗練装置を起動すると、（１）　　知識データ生成装置３は、知識ベース８に格
納されている知識データ８０のうち、最初の知識データ
８０の条件部８２に格納されている自然言語表現のデー
タ「加圧器圧力及び温度の急上昇」を取り出し、ワーキ
ングメモリ７の自然言語データ部７１に格納する（第８
図参照）。When the knowledge refinement device is activated, (1) the knowledge data generation device 3 generates natural language expression data stored in the condition section 82 of the first knowledge data 80 among the knowledge data 80 stored in the knowledge base 8; The “rapid increase in pressurizer pressure and temperature” is extracted and stored in the natural language data section 71 of the working memory 7 (No. 8
(see figure).

（２）　　単語抽出装置１は、第１のメモリ４（第９図
参照）に格納されている単語名部４１より単語を順次取
り出し、これがワーキングメモリ７の自然言語データ部
７１に格納されているデータ「加圧器圧力及び温度の急
上昇」内に含まれるかを評価する。含まれていれば、こ
の単語をワーキングメモリ７のキーワード部７２の最初
に格納する。(2) The word extracting device 1 sequentially extracts words from the word name section 41 stored in the first memory 4 (see FIG. 9), and these words are stored in the natural language data section 71 of the working memory 7. Evaluate whether it is included in the data "rapid increase in pressurizer pressure and temperature". If it is included, this word is stored at the beginning of the keyword section 72 of the working memory 7.

含まれていなければ、当該単語データ４０の同義語名部
４２に格納されている同義語を抽出し、これがワーキン
グメモリ７の自然言語データ部７１に格納されているデ
ータ内に含まれるかを評価する。この同義語が含まれて
いれば、当該単語（この同義語が同義語名部４２に登録
してあった単語データ４０の単語名部４１に格納されて
いる単語）をワーキングメモリ７のキーワード部７２の
最初に格納する。この場合、単語「急激」の同義語「急
」が含まれているので、ワーキングメモリは第１０図の
ようになる。If not included, extract the synonym stored in the synonym name field 42 of the word data 40 and evaluate whether it is included in the data stored in the natural language data field 71 of the working memory 7. do. If this synonym is included, the word (the word stored in the word name field 41 of the word data 40 in which this synonym was registered in the synonym name field 42) is stored in the keyword field of the working memory 7. Stored at the beginning of 72. In this case, since the word "sudden" includes the synonym "sudden", the working memory becomes as shown in FIG. 10.

（３）　　単語抽出装置１は、（２）項の処理により単
語を１つも抽出できなければ、（６）に飛ぶ。１つでも
抽出できれば（４）の処理を行なう。この場合は、（４
》の処理を行なうことになる。(3) If the word extraction device 1 is unable to extract any word through the process in (2), it jumps to (6). If even one can be extracted, process (4) is performed. In this case, (4
] will be processed.

（４）構文解析装置２は、まず、ワーキングメモリ７の
自然言語データ部７１に格納されているデータ「加圧器
圧力及び温度の急上昇」において、（２）項の処理で抽
出した単語「温度」、「及び」、「上昇」、「急激」、
「圧力」、「加圧器」の出現する順序に対応して、ワー
キングメモリ７のキーワード部７２に格納されている単
語の順序を変更する。（第１１図参照）。次に、ワーキ
ングメモリ７のキーワード部７２に格納されている単語
「加圧器」、「圧力」、「及び」、「温度」、「急激」
、「上昇」に対し、順次、第１のメモリ４に格納されて
いる対応する単語データ４ｏの有しているルール名部４
６を調べ、第３のメモリ６の第２のルールセット６２（
第１２図参照）に格納されているルールのルール名が記
述されていれば、当該ルールを抽出、発火させる。この
場合、単語データ「及び」の有している。ルール「ルー
ルＣ」が発火し、第１３図のように単語が補完される。(4) The syntax analysis device 2 first extracts the word "temperature" in the data "pressurizer pressure and sudden rise in temperature" stored in the natural language data section 71 of the working memory 7 in the process of section (2). , “and”, “rise”, “suddenly”,
The order of words stored in the keyword section 72 of the working memory 7 is changed in accordance with the order in which "pressure" and "pressurizer" appear. (See Figure 11). Next, the words “pressurizer”, “pressure”, “and”, “temperature”, and “sudden” stored in the keyword section 72 of the working memory 7 are
, for "rise", the rule name part 4 of the corresponding word data 4o stored in the first memory 4 is sequentially changed.
6 and the second ruleset 62 (
If the rule name of a rule stored in (see FIG. 12) is written, the rule is extracted and fired. In this case, the word data "and" is included. The rule "Rule C" is fired, and the word is completed as shown in FIG.

さらに、ワーキングメモリ７のワーキング部７２に格納
されている単語に対し、順次、第１のメモリ４に格納さ
れている対応する単語データ４０の有している単語区分
名部４７を調べて、この順序が、第２のメモリ５に格納
されているテンプレートノード５ｏに記述されているリ
ンク情報により実現可能かを評価する（第１４図参照）
。Furthermore, for the words stored in the working section 72 of the working memory 7, the word category name section 47 of the corresponding word data 40 stored in the first memory 4 is checked, and Evaluate whether the order can be realized by the link information described in the template node 5o stored in the second memory 5 (see FIG. 14).
.

この評価の際、第３のメモリ６の第１のルールセット６
１に格納されているルールの適用される単語があれば当
該ルールを考慮する。実現可能であれば、構文解析装置
２は、解析結果を出カ装置１０より出力し、（５）の処
理を行なう。実現可能でなければ、（６）へ飛ぶ。この
場合、単語「及び」に対してルール「ルールＡ」が考慮
され、実現可能と判定される。During this evaluation, the first rule set 6 of the third memory 6
If there is a word to which a rule stored in 1 is applied, that rule is taken into consideration. If it is possible, the syntax analysis device 2 outputs the analysis result from the output device 10 and performs the process (5). If it is not feasible, jump to (6). In this case, the rule "Rule A" is considered for the word "and" and determined to be feasible.

《５》構文解析装置２は、ワーキングメモリ７のキーワ
ード部７２に格納されている単語に対し、第３のメモリ
６の第３のルールセット６３に格納されているルールの
適用される単語があれば当該ルールを評価する。ルール
を満足しない単語が１つもなければ（８）へ飛ぶ。ルー
ルを満足しない単語が１つでも存在すれば、（６）の処
理を行なう。この場合、「ルールＦ」と「ルールＧ」の
具体例としてのルール“単語区分名「計測量」の単語は
テンプレートノード名「動詞」に所属する単語として「
変動」、「上昇」、「低下」、「正常」等の単語を持た
ねばならない。“と”単語区分名「計測量」の単語はテ
ンプレートノード名ｒ動詞ＪＩ：所属する単語として「
開」、「閉」等の単語を持ってはならない。゜が評価さ
れるが、ともに満たされているので、（８）へ飛ぶこと
になる。<<5>> The syntactic analysis device 2 determines whether there is a word to which the rule stored in the third rule set 63 of the third memory 6 is applied to the word stored in the keyword section 72 of the working memory 7. If so, evaluate the rule. If there is no word that does not satisfy the rule, jump to (8). If there is even one word that does not satisfy the rules, the process (6) is performed. In this case, as a specific example of "Rule F" and "Rule G", the word with the word category name "Measurement" is treated as a word belonging to the template node name "Verb".
It must have words such as "fluctuation", "increase", "decrease", "normal", etc. The word “and” word category name “Measurement amount” is the template node name r verb JI: The word to which it belongs is “
Must not have words such as "open" or "closed." ° is evaluated, but since both are satisfied, we will jump to (8).

（６）出力装置１ｏは、当該知識データの条件部に格納
されている自然言語表現のデータの語順・用いられてい
る単語が正しくない旨を出カし、データを入力装置９よ
り入カ仕直すようメッセージを出力する。(6) The output device 1o outputs that the word order and words used in the natural language expression data stored in the condition part of the knowledge data are incorrect, and inputs the data from the input device 9. Outputs a message to correct it.

（７）入力装置９よりデータが入力されたら、知識デー
タ生成装置３は、これを知識ベースの当該知識データ８
ｏの条件部８２、およびワーキングメモリ７の自然言語
データ部７１に格納し、さらにワーキングメモリ７のキ
ーワード部７２をクリアした後、（２）へ飛ぶ。(7) When data is input from the input device 9, the knowledge data generation device 3 converts the data into the corresponding knowledge data 8 of the knowledge base.
After storing in the condition section 82 of o and the natural language data section 71 of the working memory 7, and clearing the keyword section 72 of the working memory 7, the process jumps to (2).

（８）　　知識データ生成装置３は、ワーキングメモリ
７のキーワード部７２に格納されている単語列データを
、あるいはワーキングメモリ７のキーワード部７２に格
納されている単語列データ内の一部の単語を第３のメモ
リ６の第３のルールセット６３に格納されているルール
を考慮して当該単語データ４０の有している上位単語名
部４４に格納されている単語、あるいは下位単語名部４
５に格納されている単語に置換した単語列データを、帰
結部８３にこの順に含んでいる自然言語表現記述の知識
データ８ｏを、第３のメモリ６の第４のルールセット６
４に格納されているルールを考慮して知識ベース８より
すべて検索する。(8) The knowledge data generation device 3 generates the word string data stored in the keyword section 72 of the working memory 7 or some words in the word string data stored in the keyword section 72 of the working memory 7. Considering the rules stored in the third rule set 63 of the third memory 6, the word stored in the upper word name section 44 of the word data 40 or the lower word name section 4
The knowledge data 8o of the natural language expression description, which contains the word string data replaced with the words stored in 5 in this order in the consequent section 83, is stored in the fourth rule set 6 of the third memory 6.
All knowledge bases 8 are searched taking into account the rules stored in 4.

このような自然言語表現記述の知識データが存在すれば
、ワーキングメモリ７の自然言語データ部７１に格納さ
れているデータを帰結部に、検索された知識データ８ｏ
の帰結部８３に格納されているデータを条件部に持つ知
識データを新たに生成し、これを出力装置１０より出力
するとともに、知識ベース８の最後に格納し、ワーキン
グメモリ７をクリアする。この場合、単語「上昇」のそ
の上位単語「変動」への変換が「ルールＦ」、「ルール
Ｇ」の評価の結果、認められ、さらに単語「及び」に対
しルール「ルールＨ」を考慮した結果、「加圧器温度及
び圧力の急激な変動」が検索され、知識ベース８は第１
５図のように更新される。If knowledge data of such a natural language expression description exists, the searched knowledge data 8o is created using the data stored in the natural language data section 71 of the working memory 7 as a consequent section.
New knowledge data having the data stored in the consequent section 83 in the condition section is generated, and this is outputted from the output device 10, stored at the end of the knowledge base 8, and the working memory 7 is cleared. In this case, the conversion of the word "rise" to its higher-ranking word "fluctuation" was recognized as a result of the evaluation of "Rule F" and "Rule G", and furthermore, the rule "Rule H" was considered for the word "and". As a result, "rapid fluctuations in pressurizer temperature and pressure" was searched, and knowledge base 8
It is updated as shown in Figure 5.

（９）　　各装置は知識ベース８に格納されている知識
データのうち、第２番目以降の知識データ８０に関して
（１）〜（８）の処理を行なう。すべての知識データ８
０に関して上記処理が終了すれば、知識洗練装置は停止
する。(9) Each device performs the processes (1) to (8) regarding the second and subsequent knowledge data 80 among the knowledge data stored in the knowledge base 8. All knowledge data 8
When the above processing is completed for 0, the knowledge refinement device stops.

［発明の効果］以上のように本発明によれば、対象領域における単語、
熟語、あるいは単語の一部（例えば語幹）等を、単語の
種類、他の単語等との関係とともに辞書に登録し、さら
に単語の種類の順序に関する制約を柔軟に規定し、これ
らを用いて入力した自然言語表現の知識データの解析、
理解を行ない、この結果を踏まえて知識ベースを検索し
、知識データの正当性の確認、不足知識の追加を行なう
ことにより、内容的に誤ったデータを知識べ−スに登録
する可能性が減少し、知識ベースの正当性の確認の必要
性が低減し、知識ベースの洗練に要する時間を削減でき
るものである。[Effects of the Invention] As described above, according to the present invention, words in the target area,
Idioms or parts of words (e.g. stems) can be registered in a dictionary along with word types and relationships with other words, etc., and constraints regarding the order of word types can be flexibly defined and input using these. analysis of natural language expression knowledge data,
By understanding the information, searching the knowledge base based on the results, confirming the validity of the knowledge data, and adding missing knowledge, the possibility of registering incorrect data in the knowledge base is reduced. This reduces the need to confirm the validity of the knowledge base and reduces the time required to refine the knowledge base.

[Brief explanation of the drawing]

図面は本発明の一実施例を説明するための図であり、第
１図は知識洗練装置の構成を示すブロック図、第２図は
第１のメモリの構或を示す図、第３図は第２のメモリの
構成を示す図、第４図は第３のメモリの構成を示す図、
第５図はワーキングメモリの構成を示す図、第６図は知
識ベースの構成を示す図、第７図乃至第１５図はそれぞ
れ同実施例の処理手順を説明するための図である。１・・・単語抽出装置、２・・・構文解析装置、３・・
・知識データ生成装置、４・・・第１のメモリ、５・・
・第２のメモリ、６・・・第３のメモリ、７・・・ワー
キングメモリ、８・・・知識ベース、９・・・入力装置
、１０・・・出力装置、４０・・・単語データ、４５・
・・下位単語名部、５０・・・テンプレートノード、８
０・・・知識デー夕。The drawings are diagrams for explaining one embodiment of the present invention; FIG. 1 is a block diagram showing the configuration of the knowledge refinement device, FIG. 2 is a diagram showing the configuration of the first memory, and FIG. 3 is a block diagram showing the configuration of the knowledge refinement device. A diagram showing the configuration of the second memory, FIG. 4 is a diagram showing the configuration of the third memory,
FIG. 5 is a diagram showing the structure of the working memory, FIG. 6 is a diagram showing the structure of the knowledge base, and FIGS. 7 to 15 are diagrams for explaining the processing procedure of the same embodiment. 1...Word extraction device, 2...Syntax analysis device, 3...
・Knowledge data generation device, 4...first memory, 5...
- Second memory, 6... Third memory, 7... Working memory, 8... Knowledge base, 9... Input device, 10... Output device, 40... Word data, 45・
...Lower word name part, 50...Template node, 8
0...Knowledge day evening.

Claims

[Claims]

In a device having a memory for storing data described in natural language expression, a first memory for storing a dictionary in which important words in a target area are registered, and a template node for flexibly specifying the order of words. a second memory for storing words, rules for allowing exceptional situations such as word order regulations, rules for completing, replacing, and changing the order of words, and rules for restricting word substitutions. a third memory for storing rules or rules for flexibly searching already existing knowledge data;
A knowledge base for storing knowledge data that describes information in the target domain using natural language expressions, and words that are extracted from the data that are described using natural language expressions.
a word extraction device that replaces words with synonyms as necessary;
Check the order of words extracted by this word extraction device.
A syntactic analysis device that corrects and replaces words with the same meaning and completes omitted words, and collates the output of this syntactic analysis device with knowledge data of natural language expressions stored in the knowledge base, It is characterized by comprising a knowledge data generation device for generating missing knowledge data, an input device for inputting from the outside using natural language, etc., and an output device for displaying the generated knowledge data. A knowledge refinement device.