JPH11203325A

JPH11203325A - Database generating method, record medium in which its program is stored and record medium recording group of data generated by the generating method

Info

Publication number: JPH11203325A
Application number: JP10018323A
Authority: JP
Inventors: Tadamitsu Ryu; 忠光龍
Original assignee: Individual
Current assignee: Individual
Priority date: 1998-01-16
Filing date: 1998-01-16
Publication date: 1999-07-30

Abstract

PROBLEM TO BE SOLVED: To make contents of database to be stored sufficient and to obtain accurately labelled data by labelling the data inputted later based on data which are labelled and already registered and flexibly labelling the data further as learning the data on an interactive basis. SOLUTION: When inputted information from an information input means 1 is analog, it is converted into digital data in an inputted information processing means 2. When the inputted information is the one which is treated by a decomposition element decomposing means 3, the information is transmitted as it is to the decomposition element decomposing means 3. The inputted information is decomposed into the decomposition element formed by assembling it into a minimum unit to be decomposed by referring to the database 8 by the decomposition element decomposing means 3. And an inquiry whether the decomposition element obtained by the decomposition element decomposing means 3 is proper or not is made by a decomposition element interactive means 4 and when it is proper, it is defined as the decomposition element as it is. In addition, an inquiry whether a label obtained by a labelling means 5 is proper or not is made by a labelling interactive means 6 and when it is proper, it is defined as the label as it is.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は情報を入力し前記情報に
ラベリングを行いこれをデータベースにデータとして保
存するデータベース作成方法、そのプログラムを格納し
た記憶媒体及びその作成方法で作成したデータ群を記録
した記録媒体に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method for creating a database for inputting information, labeling the information, and storing the information as data in a database, a storage medium storing the program, and a data group created by the creating method. Related to a recorded medium.

【０００２】[0002]

【従来の技術】従来、コンピュータに入力された情報の
ラベリングは、入力情報をそのまま最小単位である分解
素に分解し、その分解素にラベル等を付与してデータを
データベースに保持するようにしている。2. Description of the Related Art Conventionally, labeling of information input to a computer is performed by decomposing input information as it is into a minimum unit, a decomposed element, attaching a label or the like to the decomposed element, and storing data in a database. I have.

【０００３】[0003]

【発明が解決しようとする課題】従来のラベリングの方
法は分解素に対して固定的にラベリングするだけであり
ラベリングされたデータは固定的なものであった。その
ため従来の保存データはラベリングされたデータをもと
にさらに、別のデータに対してラベリングしたり、或い
は、後のデータを元に既に登録されている保存データを
書き替えて内容を充実させることなど入力情報に柔軟に
対応できるようにはなっていなかった。In the conventional labeling method, only the labeling is fixedly performed on the decomposed element, and the labeled data is fixed. For this reason, existing stored data should be labeled with different data based on the labeled data, or the registered data should be rewritten based on the later data to enhance the content. It was not possible to respond flexibly to input information.

【０００４】本発明はラベリングされて既に登録された
データを元に、後から入力されたデータに対してラベリ
ングしたり、さらにデータに応じてもとのデータを書き
替えることにより、対話形式で学習しながら柔軟にラベ
リングすることによりデータを有機的に互いに関連付け
可能な形式でデータベースを拡張できるデータベースの
作成方法、データベース作成プログラムを格納した記録
媒体及び前記ステップにより作成されたたことを特徴と
するデータ群を記録した記録媒体を提供することを目的
とするものであるAccording to the present invention, learning is performed in an interactive manner by labeling data inputted later and rewriting original data in accordance with the data based on the data which has been labeled and registered. Method for creating a database capable of expanding a database in a format in which data can be organically associated with each other by flexibly labeling data, a recording medium storing a database creation program, and data created by the above steps It is intended to provide a recording medium on which a group is recorded

【０００５】[0005]

【課題を解決するための手段】本発明は情報を入力し前
記情報をデータベースにデータとして保存するデータベ
ース作成方法、情報を入力し前記情報をデータベースに
データとして保存するデータベースを作成するプログラ
ムを保持する記録媒体及び情報を入力し前記情報に処理
を行い加工されるデータ群を記録した記録媒体においてＡ：情報入力手段Ｂ：入力情報処理手段Ｃ：情報処理手段により得られたデータを分解素に分解
する分解素分解手段Ｄ：前記分解素のラベリング処理をするラベリング手段Ｅ：ラベリング対話処理をするラベリング対話手段Ｆ：ラベリングされたデータを情報保持手段に登録する
登録手段の各ステップから構成されるAccording to the present invention, there is provided a database creation method for inputting information and storing the information as data in a database, and holding a program for inputting information and creating a database for storing the information as data in the database. In a recording medium and a recording medium in which information is input, data is processed, and a data group to be processed is recorded. A: information input means B: input information processing means C: data obtained by the information processing means is decomposed into decomposed elements D: labeling means for performing labeling processing of the decomposition element E: labeling dialogue means for performing labeling dialogue processing F: registration means for registering the labeled data in the information holding means

【０００６】その情報入力手段は情報の形態がデジタル
かアナログであるかを判別する判別手段を必要に応じて
具備し、前記情報がアナログである場合は次のステップ
の入力情報処理手段においてデジタルに変換するればよ
い。The information input means includes, if necessary, a discriminating means for discriminating whether the form of the information is digital or analog. Just convert it.

【０００７】前記情報入力手段は必要に応じて情報の種
類（メディア種）を判別する判別手段を具備しており、
その情報の種類は文字、音声、音、画像（静止画、動
画）、ベクトル及び事例から選ばれた少なくとも1つの
情報であり、それぞれの情報の種類に対応した入力デバ
イスを指定すればよい。[0007] The information input means includes a discriminating means for discriminating the type of information (media type) as necessary.
The type of the information is at least one information selected from characters, voices, sounds, images (still images and moving images), vectors and cases, and an input device corresponding to each type of information may be specified.

【０００８】情報処理手段は分解素に分解する分解素分
解手段で処理可能なデータになっていない情報は処理可
能なデータに変換し分解素分解手段に送り既に処理可能
なデータであるものはそのままのデータを分解素分解手
段に送ればよい。[0008] The information processing means converts the information that is not data that can be processed by the decomposer into data that can be processed by the decomposer and sends it to the decomposer and leaves the data that is already processable as it is. May be sent to the decomposer.

【０００９】分解素分解手段は分解可能な最小単位或い
は前記最小単位の集合で符号付与が可能な単位に分解す
る。そしてこの分解素分解手段は前記データベースに登
録済みの分解素を参照する参照手段をを有しその登録済
みの分解素参照して分解すればよい。The decomposing element decomposing means decomposes into a minimum unit which can be decomposed or a unit which can be assigned a sign by a set of the minimum units. The decomposing element decomposing means has reference means for referring to the decomposing elements registered in the database, and the decomposing element may be decomposed with reference to the registered decomposing elements.

【００１０】この分解素分解手段は必要に応じてユーザ
ーとの分解素対話手段を設けておく。そしてこの分解素
対話手段は得られた分解素に対してＡ：前記分解素を該フォーマットを比較し適切に分解さ
れているかいないかを判定し、適切に分解されていれば
そのまま分解素とし適切でなければを出力手段において
その旨をユーザーに示すＢ：ユーザーに前記分解素の正否を問うＣ：正しければそのまま分解素とするＤ：適切な分解素でない場合にはそれを出力手段に出力
しユーザーに正しい分解素の入力を促し入力されたモノ
を正しい分解素とするの各ステップからなる構成すればよい。This decomposition element decomposition means is provided with decomposition element dialogue means with the user as necessary. Then, the decomposing element dialogue means compares the obtained decomposing element with the format of the obtained decomposing element to determine whether or not the decomposing element has been properly decomposed. If not, indicate to the user in the output means to that effect. B: Ask the user whether the decomposition factor is correct. C: If it is correct, use the decomposition factor as it is. D: If it is not an appropriate decomposition factor, output it to the output unit. It may be configured to include the steps of prompting the user to input a correct decomposition element and making the input object a correct decomposition element.

【００１１】ラベリング手段は得られた前記分解素に対
して既に登録済みのデータベースを参照してラベルを求
めラベルを付与すればよい。付与すべきラベルが見つか
らない時にはその旨を出力手段に出力しユーザーに正し
い分解素の入力を促し入力されたモノを正しい新しいラ
ベルそしてラベリング対話手段は得られた前記分解素に
対して付与されたラベルに対してＡ：ラベルが適切にラベリングされているかの判別手段
を有し適切にラベリングされていなければ出力手段にお
いてその旨を出力するＢ：ユーザーに前記ラベリングの正否を問うＣ：正しければそのままラベルとするＤ：適切なラベリングでない場合にはその旨を出力手段
に出力しユーザーに正しいラベルの入力を促し入力され
たモノを正しい新しいラベリングとするＥ：参照データベースのラベルと前記ユーザーにより入
力されたラベルが異なる場合、新たに入力されたラベル
をＸとし、既に登録済みのラベルをＹとすると、両者に
共通のラベルにＸを追加した構成のデータを出力手段に
表示し、正否をユーザーに問い、正しければさらにラベ
ルＸを追加されたデータをデータベースの既にに登録済
みのデータに追加登録し、ラベルＹを追加したデータが
正しくなければデータベースの既にに登録済みの部分を
そのまま保持する以上の各ステップからなるラベリング対話手段を設けて
おく。The labeling means may obtain a label for the obtained decomposed element by referring to an already registered database and add the label. When the label to be given is not found, the fact is output to the output means, and the user is prompted to input a correct decomposing element, and the inputted thing is given a new new label and the labeling dialogue means is applied to the obtained decomposing element. For the label: A: A means for determining whether the label is appropriately labeled is provided. If the label is not properly labeled, the output means outputs the fact. B: Ask the user whether the labeling is correct. Label: D: If the labeling is not appropriate, the fact is output to the output means to prompt the user to input the correct label, and the input thing is set to the correct new labeling. E: The label of the reference database and the input by the user. If the label is different, the newly entered label is set to X, and the label already registered Is set to Y, data having a configuration in which X is added to a label common to both is displayed on the output means, and the user is asked for correctness. If correct, the data to which the label X has been added is already registered in the database If the data to which the label Y is added is not correct, the part already registered in the database is kept as it is. A labeling dialogue means comprising the above steps is provided.

【００１２】このラベリング手段は前記分解素に対して
に登録済みのデータベースを参照し適切なラベルがない
時はユーザーにその旨を出力手段に出力しユーザーにラ
ベルの入力を促し入力された結果をラベルとすればよ
い。This labeling means refers to a database registered for the decomposed element, and if there is no appropriate label, outputs the fact to the output means to the user, prompts the user to input a label, and checks the input result. It can be a label.

【００１３】入力情報が音声あるいはテキストで構成さ
れる言語である場合最小分解素は文字とすればよい。そ
して前記入力情報は文書であるからラベルは助詞，助動
詞，接続詞，副詞，句読点等の品詞およびスペース等の
記号を含む言い回し記号である属性とする。複数の最小
分解素（文字）により構成される構文を１つの構文分解
素とし，自身の構文分解素に含まれる属性の種類と他の
構文分解素に含まれる属性の種類を比較し，自身の属性
の種類を全て持つ他の構文分解素を求め，他の構文分解
素にあって自身にない属性を出力手段に表示し，ユーザ
ーにその属性の付与の正否を訊ねる。言い回し記号に，
誰が，いつ，どこで，なぜ，どのようにして等の疑問詞
を含み，入力データの分解素にそのいずれかが不足して
いる場合には，ユーザーに不足している疑問詞を出力手
段に表示し，その属性の入力を促せばよい。When the input information is a language composed of speech or text, the minimum decomposed element may be a character. Since the input information is a document, the label is an attribute that is a phrasing symbol including particles such as particles, auxiliary verbs, conjunctions, adverbs, parts of speech such as punctuation marks, and symbols such as spaces. A syntax composed of a plurality of minimum decomposers (characters) is regarded as one parser, and the types of attributes included in its own parser and those included in other parser are compared. It finds another parser that has all the attribute types, displays the attributes that are not in itself in the other parser, and displays them on the output means, and asks the user whether the attribute is correct. In the wording symbol,
If the question contains words such as who, when, where, why, and how, and if any of the decomposed elements of the input data is missing, the user will display the missing words on the output means. Then, input of the attribute may be prompted.

【００１４】また類似語辞書を備え、分解素のデータが
不完全な場合に、ユーザーの正否の判定結果に従って、
分解素の不完全データを正しいデータにするようにすれ
ばよい。In addition, a similar word dictionary is provided, and when the data of the decomposed element is incomplete, according to the judgment result of the user's correctness,
What is necessary is just to make incomplete data of a decomposition element into correct data.

【００１５】対話手段においては分解素、或いはラベリ
ングに必要なデータを編集してユーザーに理解可能な形
式にして出力手段に表示する編集部と、前記出力手段に
示されたデータを基にユーザーが選択した或いは入力し
た結果を正式な分解素或いはラベルとするとすればよ
い。[0015] In the interactive means, an editing unit for editing the disassembly or data necessary for labeling into a format understandable to the user and displaying on the output means, The selected or input result may be used as a formal decomposition factor or label.

【００１６】データベースにデータを保持するデータ保
持手段はオブジェクトデータベース手段であって，前記
データはオブシェクトデータでり、オブジェクト同士に
複数の共通のラベルがある場合に共通部分Ｘのみ取り出
して新しいオブジェクトを生成してＩＤを付与し，共通
部分Ｘを取り出したオブジェクトに対してはその部分を
該ＩＤにより表せばよい。The data holding means for holding data in the database is an object database means. The data is object data. When there are a plurality of common labels between objects, only the common portion X is taken out to generate a new object. Then, an ID may be assigned to the object from which the common part X is extracted, and that part may be represented by the ID.

【００１７】データベースへの登録手段はデータの分野
別に分類し分野別に登録すし、必要に応じて前記データ
ベースの登録済みのデータ処理手段を設けておく。そし
てこのデータの処理手段はデータベースを参照してデー
タのラベル及びラベリング処理で参照された登録済みの
データの内容の変更をユーザーとの対話処理により行い
ラベリングの過程でデータの分野を求め、判定された分
野のデータを参照し、前記データに対してラベリングを
行い同時に登録済みのデータの内容を更新する。The means for registering in the database is classified according to the field of data and registered for each field. If necessary, data processing means registered in the database is provided. Then, the data processing means refers to the database, changes the contents of the registered data referred to in the labeling and labeling processing of the data by interactive processing with the user, obtains the data field in the labeling process, and determines the data field. Referring to the data in the field, the data is labeled and the contents of the registered data are updated at the same time.

【００１８】利用者自身の知識データを保持する自己の
データベースを最初に参照し，自己のデータベースによ
りラベリングできない場合には，他人のデータベースを
参照すればよい。他人のデータベースを参照してもラベ
リングできない場合にはラベリングを可能にする知識を
有する有知識者のリストを求め，表示する。First, the user's own database holding the knowledge data of the user is referred to. If labeling cannot be performed by the user's own database, the database of another user may be referred to. If labeling cannot be performed by referring to another person's database, a list of intellectuals having knowledge that enables labeling is obtained and displayed.

【００１９】取り扱い情報が言語である場合情報処理手
段は入力された言葉タイプを認識し，入力された言葉タ
イプを異なる言葉タイプに変換可能であるようにする。
そして自己の入力情報のパターンと他人のデータベース
のパターンの相違を識別し，自己の入力情報のパターン
を他人のデータベースのパターンに読替えて，他人のデ
ータベースを検索する。When the handling information is a language, the information processing means recognizes the input word type, and converts the input word type into a different word type.
Then, the difference between the pattern of the input information of the user and the pattern of the database of the other person is identified, the pattern of the input information of the user is replaced with the pattern of the database of the other person, and the database of the other person is searched.

【００２０】データを分野別に登録する場合入力情報
が，どの分野の情報と類似しているかを判断するため，
それぞれの分野との類似度を計算し，一番類似度の近い
分野を使用し，類似度があるしきい値に達しない場合
は，直前に使用されたディレクトリ名もしくはファイル
名により分野を判定するようにすればよい。When registering data for each field, in order to determine which field the input information is similar to,
Calculate the similarity with each field, use the field with the closest similarity, and if the similarity does not reach a certain threshold, determine the field based on the directory name or file name used immediately before What should I do?

【００２１】本発明の基本的な流れの概略をもう1度、
図を用いて説明する。その流れを図１に示す。まず情報
入力手段１において情報を入力するが、入力情報がアナ
ログである場合、２の入力情報処理手段において、デジ
タルデータに変換する。入力情報が３の分解素分解手段
で取り扱羽事が可能な情報である場合にはそのまま分解
素分解手段にデータである情報を送る。The outline of the basic flow of the present invention will be described once again.
This will be described with reference to the drawings. The flow is shown in FIG. First, information is input by the information input means 1. If the input information is analog, the input information processing means 2 converts the information into digital data. If the input information is information that can be handled by the decomposing element decomposing means 3, the information as data is sent to the decomposing element decomposing means as it is.

【００２２】入力情報の種類（メディア種）ががわから
ない場合があるため、１の情報入力手段には言語、音
声、音、文字、画像（静止画、動画）ベクトル情報など
のがあるためこの入力情報の種類（メディア種）を判
別する判別手段をもうけておけばよい。Since the type of input information (media type) may not be known, one information input means includes language, voice, sound, character, image (still image, moving image) vector information, and the like. What is necessary is just to provide a determination means for determining the type of information (media type).

【００２３】３の分解素分解手段においては、データベ
ース８を参照して分解可能な最小単位或いはこの最小単
位の分解素を集めて再構築した集合でラベル付与が可能
な単位にまとめた分解素に分解する。そして必要に応じ
て４の分解素対話手段で分解素分解手段で得られた分解
素が適切か、或いは分解不可能であった場合にはユーザ
ーに適切かどうか問い、適切であった場合にはそのまま
分解素とし、不適切あるいは分解素が不足の場合には入
力を促し、ユーザーに入力してもらいその結果を正しい
分解素とする。In the decomposing means of (3), a decomposable minimum unit which can be decomposed with reference to the database 8 or a decomposed element which is collected into a decomposable unit by collecting and reconstructing the decomposed elements of this minimum unit is obtained. Decompose. If necessary, the user asks the user whether the decomposed element obtained by the decomposed element decomposer is appropriate or if it cannot be decomposed by the decomposed element decomposer, and if appropriate, If it is unsuitable or the number of decomposed elements is insufficient, the user is prompted to input the result, and the result is used as the correct decomposed element.

【００２４】５のラベリング手段においては、データベ
ース８を参照して得られた分解素に対するラベルを付与
する。そして６のラベリング対話手段においてはラベリ
ング手段によって得られたラベルが適切か、或いはラベ
ル付与が不可能であった場合にはユーザーに適切かどう
か問い、適切であった場合にはそのままラベルとし、不
適切あるいはラベルが不足の場合には入力を促し、ユー
ザーに入力してもらいその結果を正しいラベルとする。
そして７の登録手段において上記で選られた結果を８の
データベースに登録する。In the labeling means 5, a label is given to the decomposed element obtained by referring to the database 8. Then, in the labeling dialogue means 6, it is asked whether the label obtained by the labeling means is appropriate, or if labeling is not possible, asks the user whether the labeling is appropriate. If the label is appropriate or the label is insufficient, it prompts the user to input it and asks the user to input it. The result is the correct label.
Then, the result selected as described above is registered in the database 8 by the registering means 7.

【００２５】[0025]

【実施例１】以下実際の例に基にして更に説明してい
く。図２において，１０はデータ処理手段であって，入
力データに対して意味付けをするものである。１１はデ
ータ入力手段であって，文章等のテキストデータ，画
像，音声等のマルチメディアデータを入力するものであ
る。１２は入力データ保持部であって，入力された知識
データを一時保持するものである。[Embodiment 1] A further description will be given below based on an actual example. In FIG. 2, reference numeral 10 denotes data processing means for giving meaning to input data. Numeral 11 denotes data input means for inputting text data such as text, multimedia data such as images and sounds. Reference numeral 12 denotes an input data holding unit for temporarily holding input knowledge data.

【００２６】データ処理手段１０において、１３は知識
データ作成部であって，入力データを分解し，知識デー
タ保持部２０を参照して意味付けをするものである。
１４は分解素分解部であって，入力データを意味付けの
最小単位の分解素に分解するものである。１５は格認識
部であって，入力データの分解素にその意味を表す属性
を付与するものである（属性は，主格，目的格等の属性
名あるいは人の名前であれば，名前であること等の一般
名称により属性名，もしくはその具体的内容を表す属性
値である）。In the data processing means 10, reference numeral 13 denotes a knowledge data creation unit which decomposes input data and makes a meaning by referring to the knowledge data holding unit 20.
Reference numeral 14 denotes a decomposer for decomposing input data into decomposers of the smallest unit of meaning. Numeral 15 denotes a case recognition unit for assigning an attribute representing the meaning to the decomposed element of the input data. Is an attribute name by a general name such as, or an attribute value representing the specific content thereof).

【００２７】１６は知識データ参照部であって，知識デ
ータ保持部に保持されている知識データを参照するもの
である。１７は登録部であって，属性により意味付けさ
れたデータ保持手段に登録するものである。２０はデー
タ保持手段（以後，知識データ保持部と称する）であっ
て，知識データ（格認識データ，辞書部のデータ）を保
持するものである。Reference numeral 16 denotes a knowledge data reference unit for referring to the knowledge data held in the knowledge data holding unit. Reference numeral 17 denotes a registration unit for registering the data in the data holding means designated by the attribute. Reference numeral 20 denotes a data holding unit (hereinafter, referred to as a knowledge data holding unit), which holds knowledge data (case recognition data and data of a dictionary unit).

【００２８】２１は格認識データ保持部であって，意味
付けした入力データを保持するものである。例えば，
「私は学校に行く。」という入力データの場合，分解素
は「私」，「は」，「学校」，「に」，「行
く」，「。」である。そして，「私は学校に行く。」と
いう文章に対してＩＤが付与され，さらに「私」に対し
て主格，「は」の格を表す属性名と属性値として「主格」，「学校」に対して
「に」の表す属性名と属性値として目的格，「行く」に
対して行動格，「。」の格として句読点を表す属性名と
属性値を保持するものてある。Reference numeral 21 denotes a case recognition data holding unit for holding meaningful input data. For example,
In the case of the input data "I go to school.", The decomposition factors are "I", "Ha", "School", "Nii", "Go", "." An ID is assigned to the sentence "I go to school." In addition, "I" goes to Nominative, and "No" to "Nominative" and "School" as attribute names and attribute values representing the case of "ha". On the other hand, an attribute name and an attribute value representing "Ni" are held as an object case, an action case is displayed for "Go", and an attribute name and an attribute value representing punctuation are stored as a case of ".".

【００２９】３２は辞書部であって，属性名と属性値を
保持するものであり，入力データに設定された属性名
（主格，目的格，名前，年令等）に対応する属性値（主
格を表す分解素の「は」，目的格を表す分解素の
「を」，名前を表す分解素の「ＸＸＸ」，年令を表す分
解素の「４０歳」等）をもとに学習しながら，その内容
が充実されるものである。４１は出力手段であって，画
像を表示するディスプレイもしく音声を出力するスピー
カ，音声増幅器等の音声出力装置である。図１の構成の
動作を説明する。Reference numeral 32 denotes a dictionary unit for holding attribute names and attribute values, and attribute values (nominative, objective, name, age, etc.) corresponding to the attribute names set in the input data (nominative nominative). , A decomposer representing the object case, and a decomposer representing the name, "XXX", and a decomposer representing age, such as "40 years old." , The contents are enriched. Reference numeral 41 denotes an output means, which is a display for displaying an image or a sound output device such as a speaker or a sound amplifier for outputting sound. The operation of the configuration of FIG. 1 will be described.

【００３０】データ入力手段１１に文章のテキストデー
タ，マルチメディアデータ等のデータが入力される。入
力データは入力データ保持部１２に一時保持される。知
識データ作成部１３において分解素分解部１４は入力デ
ータを分解素に分解する。例えば，「私は学校に行
く。」（構文Ａとする）という入力データの場合，分解
素は「私」，「は」，「学校」，「に」，「行
く」，「。」である。入力データの分解素への分解は，
例えば，分解素分解部が分解したデータを表示部に表
示，オペレータか確認する。そして，不適切な分解はオ
ペレータが表示画面に上で訂正する等でオペレータと表
示部とが対話しながら行うものである。あるいは，音声
出力装置ち音声でオペレータに分解素の正否を訊ね，正
しい分解データの入力を促す。Data such as text data of a sentence and multimedia data are input to the data input means 11. The input data is temporarily stored in the input data storage unit 12. In the knowledge data creating unit 13, a decomposition element decomposition unit 14 decomposes input data into decomposition elements. For example, in the case of the input data “I go to school.” (Assumed to be syntax A), the decomposition factors are “I”, “H”, “School”, “N”, “Go”, “.”. . Decomposition of input data into decomposers is
For example, the data decomposed by the decomposer is displayed on the display unit, and the operator confirms the data. Inappropriate disassembly is performed while the operator and the display unit interact with each other, for example, by making corrections on the display screen by the operator. Alternatively, a voice output device asks the operator by voice whether the decomposition element is correct, and prompts the operator to input correct decomposition data.

【００３１】分解素に分解されたデータは出力手段４１
に表示される。辞書部３２になにも登録されていない状
態においては，分解素に分解された「私」，「は」，
「学校」，「に」，「行く」，「。」が表示されるの
で，オペレータは，「私」に対して主格，「は」の格を
表す属性値として「主格」，「学校」に対して目的格，
「に」に対してその格を表す属性値として目的格，「行
く」に対して行動格，「。」の格として句読点を表す属
性名と属性値を設定する。そして，登録部１７は属性名
「主格」に対して属性値「は」，属性名「目的格」に対
して属性値「を」辞書３１に登録する。また，「私」に
対しても，ＩＤを付して登録し，その属性値として主格
を登録する。また，「学校」に対してもそのＩＤを付与
し，その属性値として「目的格」を辞書部３２に登録す
る。そして，「私は学校に行く。」という入力データの
全体にＩＤを付与し，分解素毎に，その属性名と属性値
を付与し，格認識データ保持部２１に保存する。The data decomposed into decomposed elements is output by an output unit 41.
Will be displayed. When nothing is registered in the dictionary unit 32, “I”, “ha”,
Since “school”, “ni”, “go”, and “.” Are displayed, the operator assigns “Nominative” and “School” to “I” as attribute values indicating the nominative case and “ha” case. On the other hand,
For "Ni", an objective case is set as an attribute value representing the case, "Go" is an action case, and an attribute name and an attribute value representing punctuation are set as a case of ".". Then, the registration unit 17 registers the attribute value “ha” for the attribute name “Nominative” and the attribute value “O” for the attribute name “Objective” in the dictionary 31. Also, for "I", an ID is added and registered, and the nominative is registered as the attribute value. The ID is also assigned to “school”, and “object” is registered in the dictionary unit 32 as the attribute value. Then, an ID is assigned to the whole input data “I am going to school.”, An attribute name and an attribute value are assigned to each decomposition element, and stored in the case recognition data holding unit 21.

【００３２】さらに，辞書部３２に属性名「主格」に対
して属性値「は」，属性名「目的格」に対して属性値
「に」が登録されている状態において，「私は学校に行
く。」がデータ入力されると，分解素分解部１４はその
入力データを分解素に分解し，知識データ参照部１６は
知識データ保持部２０を検索して，「私」に対して主
格，「学校」に対して目的格，「行く」に対して行動格
を求め，格認識部１５はそのようにして求めた属性名
（格認識データ）をそれぞれの分解素に付与することに
より，入力データに対して自動的に意味付けをする。そ
して，意味付けされた知識データは知識データ保持部２
０に保存される。Further, in the state where the attribute value “ha” is registered for the attribute name “Nominative” and the attribute value “Ni” is registered for the attribute name “Objective Case” in the dictionary unit 32, Is input, the decomposing element decomposing unit 14 decomposes the input data into decomposing elements, and the knowledge data referencing unit 16 searches the knowledge data holding unit 20 to determine that nominative The goal case for "school" and the behavioral case for "go" are obtained, and the case recognition unit 15 assigns the attribute names (case recognition data) obtained in this way to each of the decomposed elements, thereby inputting. Automatically give meaning to data. The meaning-added knowledge data is stored in the knowledge data holding unit 2.
0 is stored.

【００３３】さらに，このような状態において，「山田
君は行く。」（構文Ｂとする）という文章が入力される
と，構文Ｂは分解素に分解され，自動的に格認識されて
「山田君」主格，「行く」行動格が出力手段４１に表示
される。そして，すでに登録されている構文Ａと比較し
て，目的格がないので，格認識部１５は，目的格の不足
を表す質問「どこに？」を出力手段に表示する，あるい
は質問を音声テーブルとして音声出力装置に出力するこ
とにより入力データの不足を指摘するとともに，不足す
るデータの入力を促す。Further, in this state, when a sentence "Yamada-kun is going." The “kun” nominative case and the “going” behavioral case are displayed on the output means 41. Then, as compared with the already registered syntax A, there is no purpose case, so the case recognition unit 15 displays a question "Where?" By outputting to an audio output device, it points out the lack of input data and prompts the input of the missing data.

【００３４】そして，オペレータが「学校に」をキーボ
ード入力もしくはマイクを利用して音声入力すると，
「山田君は学校に行く。」という文章が分解素に分解さ
れて確認識されて（属性名を付与されて）知識データ保
持部２０に保存される。あるいは，知識データ保持部
２０に「徒歩で」，「自転車で」等の手段を表す手段
格，「８時半」等の時刻を表す時刻格が設定されている
と，上記の構文Ｂに対して，さらに，手段格の不足を表
す「なにで？」，時間格の不足を表す「何時？」等を出
力手段に表示してオペレータにデータ入力を促す。そし
て，「自転車で」，「８時半に」等が入力されると，
「山田君は８時半に学校に自転車で行く。」を表す格認
識された知識データが知識データ保持部２０に保持され
る。この時の格認識データの構成は，「山田君」主格，
「８時半」時間格，「学校」目的格，「自転車」手段格
で登録される。このように構文Ｂに基づいて内容を充実
して作成された文章はＩＤを付与されて登録される。Then, when the operator inputs "to school" by voice input using a keyboard or a microphone,
The sentence “Yamada goes to school.” Is disassembled into decomposed elements, confirmed and identified (with an attribute name given), and stored in the knowledge data holding unit 20. Alternatively, when the knowledge data holding unit 20 is set with a measure indicating a means such as “on foot” or “on a bicycle” and a time scale indicating time such as “8:30”, the syntax B Further, "what?" Indicating shortage of measure, "what time?" Indicating shortage of time, etc. are displayed on the output means to prompt the operator to input data. Then, if "by bicycle" or "at 8:30" is entered,
The knowledge data with case recognition indicating that "Yamada will go to school at 8:30 by bicycle" is held in the knowledge data holding unit 20. The structure of the case recognition data at this time is "Yamada-kun"
Registered as "8:30" time, "school" purpose, "bicycle" means. In this way, a sentence created with a rich content based on the syntax B is assigned an ID and registered.

【００３５】また，本発明は，事例データの属性名とし
て「名前」，「年令」等を設定できるので，「山田君と
いう名称のデータ（もしくはＩＤ）」に対して，事例デ
ータの属性名として「名前」，属性値として「山田」，
属性名「年令」に対して属性値「４０歳」等を入力し，
辞書部３１に登録する。本発明は，さらに，主格として
使用される「の」，目的格として使用される「を」等の
格が出現すると，属性名「主格」に「の」，属性名「目
的格」に「を」が追加される。また，「の」，「を」に
対してもそれぞれ属性名を付与し，ＩＤを付けて登録す
る。このように登録された「の」，「を」は後に新たな
用法が出現した時にその用法が追加され，データの内容
が充実される。Further, according to the present invention, "name", "age", etc. can be set as attribute names of case data, so that "data (or ID) named Yamada-kun" As “name”, attribute value as “Yamada”,
Enter attribute value “40” for attribute name “age”
Register in the dictionary unit 31. Further, when a case such as "no" used as the nominative case and "wo" used as the nominative case appears, the attribute name "nominative" is used for the attribute name "nominative" and the " Is added. Also, attribute names are given to “no” and “wo”, respectively, and registered with IDs. The registered "no" and "wo" are added later when a new usage appears, and the content of the data is enriched.

【００３６】本発明において分解素に属性名を設定する
方法は，例えば，「は」に対して主格であるかどうか決
定する場合，「は」が使用される属性名の統計をとって
おき，頻度の高いものを選択して設定する。その他，確
率を求め確率の高いものを選択する等その決定方法は様
々な方法で可能である。このように，本発明は，入力
データに対してすでに意味付けされて保存されている辞
書，文章等の知識データを参照し，意味付けするもので
ある。また，すでに作成されている辞書の内容，文章等
の知識データも，後から入力された知識データをもと
に，その内容を書き替え，情報変化に応じてたえずその
内容を有意義に変更するものである。In the present invention, a method of setting an attribute name in a decomposed element is, for example, when determining whether or not “ha” is a nominative case, statistics of the attribute name in which “ha” is used are taken, and the frequency is calculated. Select the higher one and set it. In addition, various methods can be used for determining the probability, such as obtaining a probability and selecting a high probability. As described above, the present invention refers to the meaning of the input data by referring to the knowledge data such as dictionaries and sentences which are already stored with the meaning. Also, knowledge data such as dictionary contents and sentences that have already been created are rewritten based on knowledge data entered later, and the contents are constantly changed in a meaningful manner in response to information changes. It is.

【００３７】[0037]

【実施例２】図３は本発明のシステム構成の実施例を示
す。図３において、１１はデータ入力手段である。４
１は出力手段であって，ディスプレイ，プリンタ，音声
出力装置，ビテオ装置等である。５１１はコンピュータ
である。５２は入出力インタフェースであって，データ
入力手段１１，出力手段，外部記憶装置７１とのインタ
フェースである。Embodiment 2 FIG. 3 shows an embodiment of the system configuration of the present invention. In FIG. 3, reference numeral 11 denotes data input means. 4
Reference numeral 1 denotes an output unit such as a display, a printer, an audio output device, a video device, and the like. 511 is a computer. Reference numeral 52 denotes an input / output interface, which is an interface with the data input means 11, the output means, and the external storage device 71.

【００３８】５３はデータ処理部であって，入力データ
を処理するものである（図１のデータ処理手段１０およ
びデータ保持手段２０を含む）。６１はＣＰＵである。
６２はＲＡＭである。６３はＲＯＭである。６４は内部
磁気ディスクである。（以上，データ処理部５３）７１は外部記憶装置である。データ入力手段１１におい
て，８２はキーボード，マウスである。８３は音声入力
装置である。A data processing unit 53 processes input data (including the data processing means 10 and the data holding means 20 shown in FIG. 1). 61 is a CPU.
62 is a RAM. 63 is a ROM. 64 is an internal magnetic disk. (The above is the data processing unit 53) 71 is an external storage device. In the data input means 11, reference numeral 82 denotes a keyboard and a mouse. 83 is a voice input device.

【００３９】８４は図形データ入力装置であって，イメ
ージリーダ等の図形データを入力するものである。８５
は電子カメラである。８６はＣＤ−ＲＯＭドライブであ
る。８７はビデオカメラ等のビデオ入力装置である。１
１０は記憶媒体であって，磁気ディスクもしくはＣＤＲ
ＯＭ等である。A graphic data input device 84 inputs graphic data from an image reader or the like. 85
Is an electronic camera. 86 is a CD-ROM drive. Reference numeral 87 denotes a video input device such as a video camera. 1
10 is a storage medium, a magnetic disk or CDR
OM and the like.

【００４０】２１１は記憶媒体１１０に記録されている
プログラムを内部磁気ディスク６４５３にインストール
するためのプログラムである。１２１は知識データ作成
プログラムであって，知識データ作成部のプログラムで
ある。１２２は分解素分解プログラムであって，分解素
分解部のプログラムである。１２３は格認識プログラム
であって，格認識部のプログラムである。１２４は知識
データ参照プログラムであって，知識データ参照部のプ
ログラムである。Reference numeral 211 denotes a program for installing the program recorded on the storage medium 110 on the internal magnetic disk 6453. Reference numeral 121 denotes a knowledge data creation program, which is a program of a knowledge data creation unit. A decomposition element decomposition program 122 is a program for a decomposition element decomposition unit. Reference numeral 123 denotes a case recognition program, which is a program of a case recognition unit. Reference numeral 124 denotes a knowledge data reference program, which is a program of a knowledge data reference unit.

【００４２】１３１は編集プログラムであって，編集部
のプログラムである。１３２は登録プログラムであっ
て，登録部のプログラムである。なお，図４は本発明の
装置構成の実施例を示すものであって，例えば，データ
入力部の全てを必要とするものではない。Reference numeral 131 denotes an editing program, which is a program of an editing unit. 132 is a registration program, which is a program of the registration unit. FIG. 4 shows an embodiment of the apparatus configuration of the present invention, and does not require, for example, all of the data input units.

【００４３】図４において、１１はデータ入力手段であ
る。１２は入力データ保持部であって，入力データを一
時保持するものである（図２の内部磁気ディスク６４に
含まれる部分である）。１３’は知識データ作成プログ
ラムである（図２の内部磁気ディスク６４に記憶される
ものである）。１４’は分解素分解プログラムである。
１５’は格認識プログラムである。１６’は知識データ
参照プログラムである。In FIG. 4, reference numeral 11 denotes data input means. Reference numeral 12 denotes an input data holding unit which temporarily holds input data (a portion included in the internal magnetic disk 64 in FIG. 2). 13 'is a knowledge data creation program (stored on the internal magnetic disk 64 in FIG. 2). 14 'is a decomposition element decomposition program.
15 'is a case recognition program. 16 'is a knowledge data reference program.

【００４４】２０は知識データ保持部である（図２は内
部磁気ディスク６４に含まれるものとしてあるが，外部
磁気ディスク装置に含まれるものでも良い）。４１は出
力手段である。５１は編集プログラムであって，知識デ
ータをディスプレイに表示するために編集するものであ
る（図２の内部磁気ディスクに記憶されるプログラムで
ある）。５５は登録プログラムであって，知識データ保
持部２０に知識データを登録するものである（図２の内
部磁気ディスク６４に記憶されるプログラムである）。Reference numeral 20 denotes a knowledge data holding unit (FIG. 2 is included in the internal magnetic disk 64, but may be included in the external magnetic disk device). 41 is an output means. Reference numeral 51 denotes an editing program for editing knowledge data to be displayed on a display (a program stored in the internal magnetic disk of FIG. 2). Reference numeral 55 denotes a registration program for registering knowledge data in the knowledge data holding unit 20 (a program stored in the internal magnetic disk 64 in FIG. 2).

【００４５】５６はインデックス作成プログラムであっ
て，知識データ保持手段２０に登録するデータのインデ
ックスを自動生成するものである（図２の内部磁気ディ
スク６４に記憶されるプログラムである）。図４の構成
の動作は後述する。図５は本発明の格認識部と編集部の
構成を示す。図５において，１５は格認識部である
（格認識プログラムとＣＰＵ，ＲＡＭ等により構文の格
認識をするものである）。Reference numeral 56 denotes an index creation program for automatically creating an index of data to be registered in the knowledge data holding means 20 (a program stored in the internal magnetic disk 64 in FIG. 2). The operation of the configuration in FIG. 4 will be described later. FIG. 5 shows the configuration of the case recognition unit and the editing unit of the present invention. In FIG. 5, reference numeral 15 denotes a case recognition unit (which recognizes a case of a syntax using a case recognition program and a CPU, a RAM, and the like).

【００４６】９１は事例設定部あって，例えば，人物に
対するオブジェクトを生成する場合に，その名前，年
令，身長，性格等のデータを作成するものである（事例
設定プログラムとＣＰＵ，ＲＡＭ等により構成されるも
のである）。９２は言い回し記号設定部であって，例え
ば，文章データ等の分解素に格を設定するものである
（言い回し設定プログラムとＣＰＵ，ＲＡＭ等により構
成されるものである）。９３は構文言い回し記号設定部
であって，複数の分解素を含む構文に対して言い回し記
号（格等の属性名）を設定するものである（構文言い回
し記号設定プログラムととＣＰＵ，ＲＡＭ等により構成
されるものである）。Reference numeral 91 denotes a case setting unit which creates data such as name, age, height, and personality when an object is created for a person (for example, by a case setting program and a CPU, a RAM, etc.). Is composed). Reference numeral 92 denotes a wording symbol setting unit, which sets a case in a decomposed element such as sentence data, for example (comprising a wording setting program, a CPU, a RAM, and the like). Reference numeral 93 denotes a syntax wording symbol setting unit that sets a wording symbol (attribute name such as a case) for a syntax including a plurality of decomposable elements (consisting of a syntax wording symbol setting program, a CPU, a RAM, and the like). Is what is done).

【００４７】９４は共通部抽出オブジェクト作成部であ
って，複数のオブジェクトから共通部を取り出した新た
なオブジェクトを生成するものである（共通部抽出オブ
ジェクト作成プログラムとＣＰＵ，ＲＡＭ等により構成
されるものである）。５１’は編集部である（編集プロ
グラムとＣＰＵ，ＲＡＭ等により構成されるものであ
る）。９７は質問作成部であって，格認識部で付与され
た格を含むデータをもとに，知識データを編集するため
の質問を生成するものである。例えば，「学校に行
く。」というデータが入力された場合に，「誰が？」等
の質問を作成するものである（質問作成プログラムとＣ
ＰＵ，ＲＡＭ等により構成されるものである）。Reference numeral 94 denotes a common part extraction object creating unit which creates a new object by extracting a common part from a plurality of objects (a common part extracting object creation program and a CPU, RAM, etc. Is). Reference numeral 51 'denotes an editing unit (configured by an editing program, a CPU, a RAM, and the like). Reference numeral 97 denotes a question creating unit which creates a question for editing knowledge data based on data including the case given by the case recognition unit. For example, when data such as "go to school." Is input, a question such as "who is?"
It is composed of a PU, a RAM, etc.).

【００４８】９８は回答入力部であって，質問に対して
なされたオペレータの回答を入力して知識データを編集
するものである（質問回答プログラムとＣＰＵ，ＲＡＭ
等により構成されるものである）。図６，図７は本発明
の知識データのデータ構成の例を示す（ＯＯＤＢに保存
するオブジェクトの例）。事例データは属性名の説明的
内容を表すものであり，説明部等の場合もある。図６，
図７において，１１１は事例データを付与されたオブジ
ェクトの例である。名前がＡ，年令が４０歳，身長が１
６８ｃｍ，体重が６８Ｋｇ，人格が円満である人物を表
すオブジェクトである。Reference numeral 98 denotes an answer input unit for editing knowledge data by inputting an operator's answer to a question (question answer program, CPU, RAM).
Etc.). 6 and 7 show examples of the data configuration of the knowledge data of the present invention (examples of objects stored in OODB). The case data represents the descriptive content of the attribute name, and may be a description part or the like. Figure 6
In FIG. 7, reference numeral 111 denotes an example of an object to which case data is added. Name A, age 40, height 1
This is an object representing a person who is 68 cm, weighs 68 kg, and has a perfect personality.

【００４９】１１２は言い回し記号のオブジェクトの例
であって，格を表す言い回し記号の「は」の例である。
オブジェクト名が「は」，属性名が「表す格」であり，
属性値が「主格」である。１１３は言い回し記号のオブ
ジェクトの例であって，格を表す言い回し記号の「に」
の例である。オブジェクト名が「に」，属性名が「表す
格」であり，属性値が「目的格」である。１１４は言い
回し記号を付与されたオブジェクトの例である。「私は
学校に行く。」という文章データに対して分解素に分解
されて「私」に対して主格を設定されたことのオブジェ
クトである。格として主格が設定される。その後，例え
ば，「その人は私に本をくれた。」とういう文章が入力
されると，分解素に分解された「私」に対して，目的格
が設定されて登録される。Reference numeral 112 denotes an example of a phrase symbol object, which is an example of a phrase symbol "ha" representing a case.
The object name is "ha" and the attribute name is "representative case".
The attribute value is “Nominative”. Reference numeral 113 denotes an example of a wording symbol object, which is a wording symbol “Ni” representing a case.
This is an example. The object name is “ni”, the attribute name is “representative case”, and the attribute value is “objective case”. Reference numeral 114 denotes an example of an object provided with a wording symbol. It is an object that has been decomposed into sentence data of sentence data "I go to school." The nominative case is set as the case. After that, for example, when a sentence "The person gave me a book." Is input, a purpose is set and registered for "I" decomposed into decomposed elements.

【００５０】１１５は言い回し記号を付与したオブジェ
クトの例である。「私は学校に行く。」という文章デー
タにおいて「学校」に対して目的格を設定したオブジェ
クトである。１１６は文章データに格を付与したオブジ
ェクトの例である。「私は学校に行く。」に対して属性
名「主格」に対して属性値「私」，属性名「目的格」に
対して属性値「学校」，属性名「行動格」に対して属性
値「行く」が設定され，オブジェクトのＩＤが設定され
る。１１７は行動格を表すオブジェクトの例である。
「出勤する」というデータに対して属性値とて行動格を
設定し，属性名として「誰が」，「いつ」，「どこか
ら」，「どこへ」，「どのように」等の疑問詞が予め設
定されている。そして，例えば，「私は８時に出勤す
る」。という文章が入力されると，「何処に？」，「ど
のようにして？」等がディスプレイに表示される。Reference numeral 115 denotes an example of an object provided with a wording symbol. This is an object that sets a purpose for “school” in the sentence data “I go to school.” Reference numeral 116 denotes an example of an object in which case is assigned to sentence data. Attribute value "I" for attribute name "Nominative" for attribute name "Nominative", attribute value "School" for attribute name "Objective case", and attribute value for attribute name "Action case" The value “go” is set, and the ID of the object is set. Reference numeral 117 denotes an example of an object representing a behavioral case.
The behavioral case is set as the attribute value for the data of “coming to work”. Is set. And, for example, "I am going to work at 8am." Is input, "Where?" And "How?" Are displayed on the display.

【００５１】図８はマルチメディアデータによる場合の
例である。コーヒーメーカのオブジェクトである。図８
において，２１８はマルチメディアデータのオブジェク
トであり，コーヒーメーカのオブジェクトである。オブ
ジェクトの名前がコーヒーメーカ，使用方法を説明して
あるビテオのＩＤ（ビデオ１），形状を表す形名が円筒
形，形のイメージデータのＩＤが図１，考案者（作者）
の写真のイメージデータのＩＤが写真１であることを保
持する。FIG. 8 shows an example in the case of multimedia data. It is a coffee maker object. FIG.
, An object 218 of the multimedia data is an object of the coffee maker. The name of the object is the coffee maker, the ID of the video that describes how to use it (video 1), the model name representing the shape is cylindrical, and the ID of the image data of the shape is Figure 1, the inventor (author)
It holds that the ID of the image data of the photograph is photograph 1.

【００５２】２１９はコーヒーメーカのイメージデータ
であり，図１というＩＤを持つものである。２２０は
コーヒーメーカの作成者の写真イメージデータであり，
写真１というＩＤを持つことを表すものである。２２１
はコーヒーメーカの使用方法のビデオを表し，ビデオ１
というＩＤにコーヒーメーカの使用方法の映像が記録さ
れていることを表す。Reference numeral 219 denotes image data of a coffee maker, which has an ID shown in FIG. 220 is a photograph image data of the maker of the coffee maker,
This indicates that the photograph has an ID of photograph 1. 221
Shows a video of how to use a coffee maker. Video 1
ID indicates that a video of a method of using a coffee maker is recorded.

【００５３】図４の構成の動作を説明する。データ入力
手段１１より文章データ，マルチメディアデータ等のデ
ータが入力される。入力データ保持部１２は入力データ
を一時保持する。知識データ作成プログラム１３’は入
力データを分解素に分解する。分解素への分解は，分解
素分解プログラムが分解した入力データを編集プログラ
ムが編集してディスプレイに表示し，オペレータによる
その正否の判断を促し，オペレータによる正否の確認デ
ータ，あるいは正しい分解データの入力を促す。あるい
は，音声により分解されたデータの正否を訊ね，正しい
分解素の音声による入力を促す。そして，オペレータ
はその内容を確認し，訂正する必要がある場合には，オ
ペレータがディスプレイ上で訂正する。その内容は編集
プログラムを介して分解素分解プログラムに反映され，
正確に分解されたデータが各認識プログラム１５’に渡
される。The operation of the configuration shown in FIG. 4 will be described. Data such as text data and multimedia data is input from the data input means 11. The input data holding unit 12 temporarily holds input data. The knowledge data creation program 13 'decomposes input data into decomposers. Decomposition into decomposed elements is performed by the editing program editing the input data decomposed by the decomposed element decomposition program and displaying it on the display, prompting the operator to determine whether the data is correct or not, and inputting confirmation data of correctness or correct decomposition data by the operator. Prompt. Alternatively, the user is asked whether the data decomposed by voice is correct or not, and is prompted to input a correct decomposed element by voice. Then, the operator checks the contents and, if necessary, makes corrections on the display. The contents are reflected in the decomposition element decomposition program through the editing program,
The correctly decomposed data is passed to each recognition program 15 '.

【００５４】そして，知識データ参照プログラム１６’
は知識データ保持部２０を参照し，すでに作成されてい
る知識データのオブジェクトの構文，オブジェクトに付
与されている事例データ等を参照し，分解素に付加する
属性を求める。格認識プログラム１５’は分解素に求め
た格（属性名）を設定する。そして，例えば，格が見当
たらない場合（例えば，目的格の「に」がオブジェクト
に未登録の場合等）には，格が未登録であることを編集
プログラム５１により出力手段４１に表示して，格の入
力を促す。そして，格に入力されたら，その格のオブジ
ェクト（もしくはテーブル等）にその格を登録する。あ
るいは，入力データに不足する格がある場合（入力デー
タの分解素より求めたオブジェクトに記録されている格
数（属性名の数）の方が多い場合等）には，編集プログ
ラム５１により，知識データ保持部のオブジェクトの多
い部分の属性名の表す内容の質問を作成し，出力手段４
１に表示する。あるいは音声で出力する。例えば，入力
データに手段格がなく，知識データの選択したオブジェ
クトに手段格として「自転車」があれば，「自転車です
か？」をディスプレイに表示し，オペレータに訊ねる。
あるいは質問を音声データとして音声出力装置より出力
する。Then, the knowledge data reference program 16 '
Refers to the knowledge data holding unit 20, refers to the syntax of the object of the knowledge data already created, the case data assigned to the object, and the like, and obtains the attribute to be added to the decomposed element. The case recognition program 15 'sets the case (attribute name) obtained for the decomposition element. Then, for example, when no case is found (for example, when the target case “ni” is not registered in the object, etc.), the fact that the case is not registered is displayed on the output unit 41 by the editing program 51, and Prompt for case. When the case is entered, the case is registered in the case object (or table or the like). Alternatively, if the input data has a missing case (for example, if the number of cases (the number of attribute names) recorded in the object obtained from the disassembly of the input data is larger), the editing program 51 outputs the knowledge. A question of the content represented by the attribute name of the object-rich portion of the data holding unit is created, and output means 4
1 is displayed. Alternatively, the sound is output. For example, if the input data has no measure and the object selected in the knowledge data has a measure of "bicycle", "is it a bicycle?" Is displayed on the display and the operator is asked.
Alternatively, the question is output from the voice output device as voice data.

【００５５】そして，オペレータの回答に従って，キー
ボード入力もしくはマイクによる音声入力によりオペレ
ータが「ハイ」，もしくは別の属性値を入力すればそれ
ぞれの属性値を補った知識データを作成し，登録プログ
ラム５５はその知識データにインデックスを生成して知
識データ保持部２０に保持させる。また，オペレータが
「イイエ」を入力した場合には，その格を補うことなく
インデックスを生成して知識データ保持部２０に登録す
る。According to the operator's answer, if the operator inputs "high" or another attribute value by keyboard input or voice input by a microphone, knowledge data supplementing each attribute value is created, and the registration program 55 An index is generated for the knowledge data, and the knowledge data holding unit 20 holds the index. When the operator inputs “no”, an index is generated without supplementing the case and registered in the knowledge data holding unit 20.

【００５６】図９は本発明の知識データ作成プログラム
の格認識のフローチャートである。Ｓ１分解素分解部は入力データを分解素に分解する。Ｓ２言い回し（格）の分解素を判定する（「は」，
「に」等）。Ｓ３知識データ保持部を検索し，格を探す。Ｓ４格があるか判定し，格があれば，Ｓ５の処理を行
い，なければＳ６の処理を行う。Ｓ５言い回し記号の前にある分解素に格を与える（例
えば，「は」の前の分解素には主格，「に」の前の分解
素には「目的格」を与える）。Ｓ６格がないのでオペレータに格を質問する（ディス
プレイに表示する）。Ｓ７オペプレータが入力した格をその格のオブジェク
ト（もしくはデータのテーブル等）に追加する。FIG. 9 is a flowchart of case recognition of the knowledge data creation program of the present invention. S1 The decomposition element decomposition unit decomposes input data into decomposition elements. S2 Determine the disassembly of the phrase (case) (“ha”,
"Ni" etc.). S3 Search the knowledge data holding unit to find a case. S4 It is determined whether there is a case. If there is a case, the process of S5 is performed, and if not, the process of S6 is performed. S5: A case is given to the decomposition element before the wording symbol (for example, the nominative case is given to the decomposition element before "wa" and the "objective case" is given to the decomposition element before "ni"). S6 Since there is no rating, a question is asked to the operator (displayed on the display). S7 The case input by the operator is added to the case object (or data table or the like).

【００５７】図１０は主たる振る舞いを表す記号を付与
するプログラムのフローチャートである。「行く」，
「見る」等の振る舞いを表すオブジェクト（もしくはデ
ータテーブル）にあらかじめ，「誰が」，「「いつ」，
「なにを」，「どこで」，「どうして」等を表す格を有
するオブジェクト（もしくはデータテーブル等）を備え
ておき，入力データにその格がない場合には，オペレー
タに問い合わせる。Ｓ１主たる振る舞い記号を知識データ保持部から引き
出す。Ｓ２，Ｓ３知識データのプリミティブ（分解素に分解
されたデータ）があるか判定する。Ｓ４主たる振る舞いを表す記号（行く，見る等）の保
持する格（「誰が」，「いつ」，「なにを」，「どこ
で」，「どうして」等および，それぞれに属性値がある
場合には「８時半に？」，「自動車で？」等）がないも
のがあれば，ディスプレイに表示して質問し，プリミテ
ィブデータ（「８時半に」，「自動車で」等）を作り出
す。FIG. 10 is a flowchart of a program for giving a symbol representing a main behavior. "go",
"Who", "When", and "Who"
An object (or a data table or the like) having a case representing "what", "where", "why" or the like is provided. If the input data does not have the case, an inquiry is made to the operator. S1 The main behavior symbol is extracted from the knowledge data holding unit. S2, S3 It is determined whether there is a primitive of the knowledge data (data decomposed into decomposed elements). S4 The case (“Who”, “When”, “What”, “Where”, “Why”, etc.) of the symbol (“Go”, “See,” etc.) that represents the main behavior, and if each has an attribute value, If there is no such thing as "at 8:30?" Or "by car", display it on the display and ask a question to create primitive data ("at 8:30", "by car", etc.).

【００５８】図１１は本発明の構文言い回し記号の説明
図である。「私は学校に行く。」（構文Ａ）には，主
格，目的格，行動格が含まれているので，構文言い回し
記号として「主格，目的格，行動格」を設定する。３２
１はＡ構文の構文言い回し記号である。３２２は他の構
文（Ｂ構文）の構文言い回し記号の例である。Ａ構文が
入力された時，Ａ構文に含まれる格の構成と類似の構成
のＢ構文３２２の構文言い回し記号を参照し，Ａ構文に
は同行者格（「誰と？」がないことを検出すると，「誰
と？」を表示して質問し，オペレーに入力をうながす。
この時，同行者格に属性値「山田君」等が設定されてい
たら，「山田君と？」等を合わせて質問する。複数の構
文のオブジェクトが入力データの構文に類似する場合に
は，頻度もしくは確率等に従って選択する。例えば，属
性名を１，２，３，４，５で表した時，入力データの属
性の構成が〔１，２，３〕で，保存されている知識デー
タのうち入力データと類似する知識データの属性の構成
が〔１，２，３，４〕と〔１，２，３，５〕であったと
する。このような時，それまでにそれそれが選択された
回数もしくは確率を求めておき，選択された回数もしく
は確率の高い方を最初に選択するようにする。FIG. 11 is an explanatory diagram of the syntax wording symbol of the present invention. Since "I go to school." (Syntax A) includes a nominative case, a purpose case, and a behavioral case, "symantic case, purpose case, and behavioral case" are set as syntax wording symbols. 32
1 is a syntax wording symbol of the A syntax. 322 is an example of a syntax wording symbol of another syntax (B syntax). When the A syntax is input, reference is made to the syntactic phrasing symbol of the B syntax 322 having a configuration similar to that of the case included in the A syntax, and it is detected that the A syntax does not include a companion case ("who and?" Then, "Who is it?" Is displayed, a question is asked, and the operator is prompted for input.
At this time, if an attribute value such as “Yamada-kun” is set in the accompaniment, a question is also asked together with “Yamada-kun?”. If the objects of a plurality of syntaxes are similar to the syntax of the input data, the objects are selected according to the frequency or the probability. For example, when the attribute names are represented as 1, 2, 3, 4, and 5, the attribute configuration of the input data is [1, 2, 3], and knowledge data similar to the input data among the stored knowledge data. Are assumed to be [1, 2, 3, 4] and [1, 2, 3, 5]. In such a case, the number of times or probability at which the number of times has been selected is obtained beforehand, and the one with the higher number of times or probability is selected first.

【００５８】図１２は構文言い回し記号により属性を付
与するプログラムのフローチャートである。Ｓ１構文言い回し記号を決定する。Ｓ２オブジェクトの構文もしくは構文言い回しテーブ
ル（構文毎に構文言い回し記号（格）をもつテーブル）
から類似の構文を探す。Ｓ３不一致であって，入力データの格がオブジェクト
もしくはテーブルになければＳ４に進み，不一致であっ
て，入力データの方の格が少なければＳ５に進む。不一
致でなければＳ６に進む。一致，不一致の判定は含まれ
る格の内容のみを比較し，順番は問わない。Ｓ４オブジェクトもしくはテーブルがなければ，格の
データの入力を促す質問をする。Ｓ５入力データに新しい格を追加するか，入力データ
の構文を新しい構文として登録する。Ｓ６順番が違っているか判定する。違っていなければ
（ＮＯ）処理を終了する。Ｓ７順番が違っていれば（ＹＥＳ），先頭の分解素を
協調として理解する。FIG. 12 is a flowchart of a program for assigning an attribute using a syntax wording symbol. S1 Determine the syntax wording symbol. S2 Object syntax or syntax phrasing table (table with syntax phrasing symbol (case) for each syntax)
For similar syntax. S3 If there is no match and the case of the input data is not in the object or table, the process proceeds to S4. If there is no match and the case of the input data is small, the process proceeds to S5. If not, the process proceeds to S6. The match / mismatch is determined by comparing only the contents of the included cases, and the order is not important. S4 If there is no object or table, ask a question to enter case data. S5: Add a new case to the input data or register the syntax of the input data as a new syntax. S6 It is determined whether the order is different. If not (NO), the process ends. S7 If the order is different (YES), the leading disassembly is understood as coordination.

【００５９】図１３は，本発明の知識データのオブジェ
クトの共通部をまとめ新しい知識データのオブジェクト
を作成する場合の説明図，およびオブジェクトの階層化
の説明図である。図１３ (a)は説明図である。図１３
(a)において，４２１は共通部分を括り出す前の状態で
あり，図１３ (b)は共通部分を括り出した後の状態であ
る。オブジェクト１に属性１，２，３，４があり，オブ
ジェクト２に属性１，２，５，６があったとする。そこ
て，共通の属性１，２を持つ新たなオブジェクト３を生
成し，そのインデックス（ＩＤ）をＫとする。そして，
オブジェクト１の属性１，２およびオブジェクト２の属
性１，２に対してはオブジェクト３のＩＤであるＫによ
り表すようにする。FIG. 13 is an explanatory diagram of a case where the common parts of the knowledge data objects according to the present invention are combined to create a new knowledge data object, and an explanatory diagram of the hierarchical structure of the objects. FIG. 13A is an explanatory diagram. FIG.
In FIG. 13A, reference numeral 421 denotes a state before the common part is extracted, and FIG. 13B shows a state after the common part is extracted. It is assumed that object 1 has attributes 1, 2, 3, and 4, and object 2 has attributes 1, 2, 5, and 6. Then, a new object 3 having the common attributes 1 and 2 is generated, and its index (ID) is set to K. And
The attributes 1 and 2 of the object 1 and the attributes 1 and 2 of the object 2 are represented by K which is the ID of the object 3.

【００６０】図１３ (b)は本発明のオブジェクトの階層
化の説明図である。本発明では，図１３ (a)のように
して新しいオブジェクトが生成される毎に共通の属性を
持つオブジェクトを新たに生成し，オブジェクトをイン
デックスにより階層化して管理する。図１３ (b)はこの
ようにインデックスにより階層化して構成されたオブジ
ェクトの階層構造を示す。４３１は最上位の階層のオブ
ジェクトである。４３２，４３３，４３４はその次の階
層のオブジェクト（メタデータ）である。４３５，４３
６，４３７は最下位の階層のオブジェクトであって，プ
リミティブデータを持つものである。図１４は，図１３
(a)の動作をするためのフローチャートである。FIG. 13B is an explanatory diagram of hierarchization of objects according to the present invention. In the present invention, every time a new object is generated as shown in FIG. 13A, an object having a common attribute is newly generated, and the objects are hierarchically managed by an index. FIG. 13 (b) shows the hierarchical structure of an object structured in a hierarchical manner by the index. 431 is an object of the highest hierarchy. 432, 433, and 434 are objects (metadata) of the next layer. 435,43
Reference numeral 6,437 denotes an object of the lowest hierarchy, which has primitive data. FIG.
6 is a flowchart for performing the operation of FIG.

【００６１】Ｓ１複数の知識データのオブジェクトで
共通となっている部分を探す。Ｓ２共通部分を括りだし，新しいオブジェクトを作
る。Ｓ３もとのオブジェクトから共通部分の属性を外し，
共通部分のみのオブジェクトを作成し，新しいＩＤを付
与する。元のオブジェクトの共通部分には生成した新し
いオブジェクトのＩＤでそのデータの内容がわかるよう
にする。S1 Search for a part common to a plurality of knowledge data objects. S2 Create a new object by grouping out common parts. S3 Remove the attribute of the common part from the original object,
Create an object with only the common part and assign a new ID. In the common part of the original object, the content of the data is identified by the ID of the generated new object.

【００６２】図１５，図１６は本発明の知識データのオ
ブジェクトのインデックス作成のフローチャートであ
る。新しい知識データが登録されると次のフローチャー
トのプログラムが動作し，インデックスを自動生成する
（図１３のフローチャートと同じ内容を一部含む）。Ｓ１新しいデータの登録があるか判定する。なければ
処理を終了し，あればＳ１’の処理をする。Ｓ１’新たに登録されたデータと類似するデータを探
す。Ｓ２新たに登録されたデータは分解素もしくは格認識
データのみか判定する。分解素もしくは格認識データ
（属性）のみのデータであればＳ３に進み，そうでなけ
れば（分解素に格認識データの付与されたデータであれ
ば）Ｓ７に進む。Ｓ３新しく登録されたデータと同じデータのインデッ
クスがすでにあるか判定し，あればＳ４に進み，なけれ
ばＳ５に進む。FIGS. 15 and 16 are flow charts for creating an index of an object of knowledge data according to the present invention. When new knowledge data is registered, the program of the next flowchart operates to automatically generate an index (including some of the same contents as the flowchart of FIG. 13). S1 Determines whether new data is registered. If not, the process ends. If there is, the process of S1 'is performed. S1 'Search for data similar to the newly registered data. S2 It is determined whether the newly registered data is only the decomposition element or the case recognition data. If the data is only the decomposition element or the case recognition data (attribute), the process proceeds to S3. S3: It is determined whether there is an index of the same data as the newly registered data. If there is, the process proceeds to S4.

【００６３】Ｓ４新しいデータにそのインデックスを
登録する。Ｓ５新しく登録されたデータに新しいインデックスを
作成する。Ｓ６すべてのデータ（知識）を探し，新しく登録され
たデータと同じものがあれば，それを新しいインデック
スで置き換える。Ｓ７Ｓ２で分解素もしくは格認識データのみでなけれ
ば，すでに同じ構成（同じ属性名等を持つ）で格認識さ
れた構成のデータのインデックスはあるか判定する。イ
ンデックスがあればＳ８に進み，なければＳ９に進む。S4 The index is registered in the new data. S5 Create a new index for the newly registered data. S6 Search all data (knowledge), and if there is the same data as newly registered data, replace it with a new index. S7 If it is not only the decomposition element or the case recognition data in S2, it is determined whether or not there is an index of data of the case already recognized with the same structure (having the same attribute name or the like). If there is an index, the process proceeds to S8; otherwise, the process proceeds to S9.

【００６４】Ｓ８そのインデックスに登録する。Ｓ９類似の格認識データ（類似の属性の構成を持つオ
ブジェクト）をさがす。Ｓ１０類似の格認識データをさがし，あればＳ１１に
進み，なければＳ１２に進む。Ｓ１１共通部分に新しいインデックスを作成し，その
格認識データの共通部分を新たに作成したインデックス
で登録し，類似するオブジェクトデータを圧縮する。Ｓ１２新しいインデックスを作成する。本発明ではオブジェクトデータのインデックスは階層構
造で表し，検索は最上位のインデックスのオブジェクト
から開始する。S8: Register in the index. S9 Search for similar case recognition data (objects having a similar attribute configuration). S10: Search for similar case recognition data; if there is, go to S11; otherwise go to S12. S11 A new index is created in the common part, the common part of the case recognition data is registered with the newly created index, and similar object data is compressed. S12 Create a new index. In the present invention, the index of the object data is represented in a hierarchical structure, and the search starts from the object of the highest index.

【００６５】図１７は本発明の検索処理のフローチャー
トである。検索は，図１３ (b)のインデックスで階層化
されたオブジェクトの最上位のオブジェクトから検索を
開始，順次下位の階層のオブジェクトを検索する。Ｓ１検索要求がある。Ｓ２最上位データ（インデックス）を検索する。Ｓ３インデックスはあるか判定する。みつかれば処理
を終了し，あればＳ４に進む。Ｓ４次の階層のデータを検索する。Ｓ５次のデータがあればＳ４でさらに次のデータを検
索する。なければ処理を終了する。FIG. 17 is a flowchart of the retrieval process according to the present invention. In the search, the search is started from the highest object of the object hierarchized by the index in FIG. S1 There is a search request. S2 Search for the highest-order data (index). S3 Determine whether there is an index. If found, the process ends, and if there is, the process proceeds to S4. S4 Search for data of the next hierarchy. S5 If there is next data, the next data is searched for in S4. If not, the process ends.

【００６６】[0066]

【実施例３】図１８は本発明の他の実施例の基本構成を
示す。図１８において，１０はデータ処理手段であっ
て，入力データに対して意味付けをするものである。１
１はデータ入力手段であって，文章等のテキストデー
タ，画像，音声等のマルチメディアデータを入力するも
のである。１２は入力データ保持部であって，入力され
た知識データを一時保持するものである。Embodiment 3 FIG. 18 shows a basic configuration of another embodiment of the present invention. In FIG. 18, reference numeral 10 denotes data processing means for giving meaning to input data. 1
Reference numeral 1 denotes data input means for inputting text data such as text, multimedia data such as images and voices. Reference numeral 12 denotes an input data holding unit for temporarily holding input knowledge data.

【００６７】データ処理手段１０において，１３は知識
データ作成部であって，入力データを分解し，知識デー
タ保持部２１を参照して必要な属性を求め，分解素に付
与して入力データの意味付けをするものである（属性
は，文章の格（目的格，主格等）等，人物の名前，性格
等の一般的名称等を表す属性名もしくは，主格の助詞の
「は」，「山田」等の具体的名前等の属性値を意味す
る）。知識データ作成部は，ＣＰＵ，ＲＡＭ，知識デー
タ作成プログラム等により構成されるものである。In the data processing means 10, reference numeral 13 denotes a knowledge data generating unit which decomposes input data, obtains necessary attributes by referring to the knowledge data holding unit 21, assigns the necessary attributes to the decomposed elements, and obtains the meaning of the input data. (Attributes are attribute names that represent general names such as the name of a person, personality, etc., such as the case of a sentence (object case, nominative case, etc.), or the particles “ha”, “Yamada” of the nominative case Etc. means the attribute value such as a specific name). The knowledge data creation unit is configured by a CPU, a RAM, a knowledge data creation program, and the like.

【００６８】１４は分解素分解部であって，入力データ
を意味付けの最小単位の分解素に分解するものである。
分解素分解部は，ＣＰＵ，ＲＡＭ，分解素分解プログラ
ム等により構成されるものである。１５は格認識部であ
って，入力データの分解素に主格，目的格あるいは人の
名前であれば，名前等の属性を付与し，意味付けをする
ものである。格認識部はＣＰＵ，ＲＡＭ，格認識プログ
ラム等により構成されるものである。１６は知識データ
参照部であって，知識データ保持部に保持されている知
識データを参照するものである。知識データ参照部は，
ＣＰＵ，ＲＡＭ，知識データ参照プログラム等により構
成されるものである。Reference numeral 14 denotes a decomposing element decomposition unit that decomposes input data into decomposing elements having the smallest meaning.
The decomposition element decomposition unit includes a CPU, a RAM, a decomposition element decomposition program, and the like. Reference numeral 15 denotes a case recognizing unit that adds attributes such as a name to the decomposed element of the input data if it is a nominative case, a purpose case, or a person's name and gives meaning. The case recognition unit includes a CPU, a RAM, a case recognition program, and the like. Reference numeral 16 denotes a knowledge data reference unit for referring to the knowledge data held in the knowledge data holding unit. The knowledge data reference section
It comprises a CPU, a RAM, a knowledge data reference program, and the like.

【００６９】１８は再度入力要求部であって，分解素に
分解された入力データの分解素の属性（格等）の作るフ
ォーマットについて，すでに作成済の知識データの属性
の作るフッーマットと比較し，類似のフォーマットを見
つけて入力データと比較し，不足する属性について，オ
ペレータに入力要求をするものである。再度入力要求部
は，ＣＰＵ，ＲＡＭ，再度要求プログラム等により構成
されるものである。また，再度入力要求部は，分解され
た分解素の正否をオペレータに等ものである。１９はフ
ォーマット判定部であって，入力データの分解素の属性
のフォーマットと同じかもしくは類似のフォーマットを
見つけるものである。フォーマット判定部は，ＣＰＵ，
ＲＡＭ，フォーマット判定プログラム等により構成され
るものである。また，フォーマット判定部は，分解素分
解部により分解された分解素についてもフォーマットと
比較し，その正否を判定するものである。Reference numeral 18 denotes an input requesting section again, which compares the format created by the attribute (case etc.) of the disassembly element of the input data decomposed into the disassembly element with the format created by the attribute of the already created knowledge data. A similar format is found, compared with input data, and an operator is requested to input a missing attribute. The input requesting unit is constituted by a CPU, a RAM, a requesting program again, and the like. Again, the input requesting unit makes the operator determine whether the decomposed element is correct or not. Reference numeral 19 denotes a format determination unit which finds a format that is the same as or similar to the format of the attribute of the disassembly element of the input data. The format determination unit is a CPU,
It is composed of a RAM, a format determination program, and the like. The format determining unit also compares the decomposed element decomposed by the decomposed element decomposing unit with the format and judges whether the decomposed element is correct or not.

【００７０】２１はデータ保持手段（以下，知識データ
保持部と称する）であって，意味付けした入力データを
保持するものである。例えば，「私は学校に行く。」と
いう入力データの場合，分解素は「私」，「は」，「学
校」，「に」，「行く」，「。」である。そして，「私
は学校に行く。」という文章（構文Ａとする）に対して
ＩＤが付与され，さらに「私」に対して主格，「は」の
格を表す属性名と属性値として「主格」，「学校」に対
して「に」の表す属性名と属性値として目的格，「行
く」に対して行動格，「。」の格として句読点を表す属
性名と属性値を保持するものである。Reference numeral 21 denotes a data holding unit (hereinafter, referred to as a knowledge data holding unit) which holds input data with meaning. For example, in the case of the input data “I go to school.”, The decomposition factors are “I”, “Ha”, “School”, “Nii”, “Go”, “.”. Then, an ID is assigned to the sentence "I will go to school." (Syntax A), and "I" is a nominative case. "" And "School" for the attribute name and attribute value "Ni", "Go" for the action case, and "." For the punctuation mark attribute name and attribute value. is there.

【００７１】３１’は格認識データの例であって，属性
名と属性値を保持するものであり，入力データに設定さ
れた属性名（主格，目的格，名前，年令等）に対応する
属性値（主格を表す分解素の「は」，目的格を表す分解
素の「を」等）である。３２は事例データの例であっ
て，属性名として「名前」，「年令」等を有するもので
あり，属性名「名前」の属性値として「ＸＸＸ」，属性
名「年令」の属性値として「４０才」等を有するもので
ある。事例データは属性名の説明的内容を表すものであ
り，説明文等も含まれる。Reference numeral 31 'is an example of case recognition data, which holds an attribute name and an attribute value, and corresponds to the attribute name (primary case, purpose case, name, age, etc.) set in the input data. Attribute value (decomposition element “ha” representing the nominative case, “wo” of the decomposition case representing the objective case, etc.). Reference numeral 32 denotes an example of case data having attribute names such as “name” and “age”, and the attribute value of the attribute name “name” is “XXX” and the attribute value of the attribute name “age” "40 years old" and so on. The case data represents the descriptive contents of the attribute name, and includes a description and the like.

【００７２】３１は出力手段であって，ディスプレイも
しくは音声を出力するスピーカ，音声アンプ等により構
成される音声出力装置である。編集部５１は意味付けに
必要なデータを出力手段に表示するためのデータを編集
するものである。編集部はＣＰＵ，ＲＡＭ，編集プログ
ラム等により構成されるものである。５５は登録部であ
って，分解素に設定した格を辞書部に登録するものであ
る。登録部はＣＰＵ，ＲＡＭ，登録プログラム等により
構成されるものである。Reference numeral 31 denotes an output means, which is an audio output device including a display or a speaker for outputting audio, an audio amplifier, and the like. The editing unit 51 edits data for displaying data necessary for meaning on an output unit. The editing unit includes a CPU, a RAM, an editing program, and the like. Reference numeral 55 denotes a registration unit for registering a case set as a decomposition element in the dictionary unit. The registration unit includes a CPU, a RAM, a registration program, and the like.

【００７３】図１８の構成の動作を説明する（図１９を
参照する）。データ入力手段１１に文章のテキストデ
ータ，マルチメディアデータ等のデータが入力される。
入力データは入力データ保持部１２に一時保持される。
知識データ作成部１３において分解素分解部１４は入力
データを分解素に分解する。例えば，「私は行く。」と
いう入力データの場合，分解素は「私」，「は」，「行
く」，「。」である。分解された分解素は，編集部によ
り表示部に表示する表示データとして編集され，表示部
に編集される。そして，オペレータはその正否を判定
し，訂正を必要とする場合には，分解素を訂正する。ま
た，分解された分解素は，フォ−マット判定部でフォー
マットと比較され，異なる場合には，正しいと判断され
るフォーマットを表示し，オペレータに正しく分解され
た分解素の入力を促す。The operation of the configuration shown in FIG. 18 will be described (see FIG. 19). Data such as text data of a sentence and multimedia data is input to the data input means 11.
The input data is temporarily stored in the input data storage unit 12.
In the knowledge data creating unit 13, a decomposition element decomposition unit 14 decomposes input data into decomposition elements. For example, in the case of the input data "I am going." The decomposed element is edited by the editing unit as display data to be displayed on the display unit, and is edited on the display unit. Then, the operator judges the correctness and corrects the decomposed element when correction is necessary. The decomposed decomposed element is compared with the format by a format determination unit. If the decomposed element is different, a format determined to be correct is displayed, and the operator is prompted to input the correctly decomposed element.

【００７４】分解素に分解されたデータは，格認識部１
５において格認識される（最初はオペレータにより指定
される。すでに，知識データ保持部に保持されているも
のについては，それが参照されて自動設定される）。そ
して，知識データ保持部に格（主格，行動格等）に対応
して属性値（私，行く等）が設定される。そこで，図１
９に示すように，例えば，「私は学校に行く。」（構文
Ａとする）というデータが入力されたとする。分解素分
解部は構文Ａ（４１）を「私」，「は」，「学校」，
「に」，「行く」，「。」のように分解素に分解し，
「私」に対して主格，「は」の格を表す属性値として
「主格」，「学校」に対して目的格，「に」に対してそ
の格を表す属性値として目的格，「行く」に対して行動
格，「。」の格として句読点を表す属性名と属性値を設
定する。属性の設定は統計的に処理し，頻度の高いもの
を選択するようにする。例えば，「は」を意味認識する
場合，「は」が属性名とて使用される頻度の統計をと
り，使用される頻度の高いものもしくは確率の高いもの
を属性とする。The data decomposed into decomposed elements is sent to the case recognition unit 1
In step 5, the case is recognized (initially specified by the operator. If the information is already held in the knowledge data holding unit, it is referred to and automatically set). Then, an attribute value (me, go, etc.) is set in the knowledge data holding unit corresponding to the case (nominative case, action case, etc.). Therefore, FIG.
As shown in FIG. 9, for example, it is assumed that data "I go to school." The decomposition element decomposition unit converts the syntax A (41) into “I”, “ha”, “school”,
Decompose into decomposed elements like "ni", "go", "."
The attribute values that represent the nominative case for "I" and the case of "wa" are "nominative case", the purpose case for "school", the purpose case for "ni" as the attribute value representing that case, and "go" , An attribute name and an attribute value representing punctuation are set as the case of action and the case of “.”. Attribute settings are processed statistically and the one with the highest frequency is selected. For example, when recognizing “ha”, statistics of the frequency at which “wa” is used as an attribute name are obtained, and those having a high use frequency or a high probability are set as attributes.

【００７５】そこで，知識データ参照部１６は，構文デ
ータＡのフォーマット（属性値の作るる構成（「私」，
「学校」，「行く」等））もしくは属性名の作る構成
（「主格」，「目的格」，「行動格」等）と類似のフォ
ーマットを持つ知識データを知識データ保持部から探
す。その結果，図示のような知識データ４２を見つけ
る。そして，例えば，両者の属性値を比較し，既存の知
識データ４３では構文Ａに含まれる目的格「学校」がな
いので，再度入力要求部１８および編集部５１は「学校
に行くのですか？」を質問するデータを作成し，出力手
段３１に表示する。オペレータをその表示を見て，正し
ければ，その旨をデータ入力手段１１により入力する。
そして，既存の知識データ４３を知識データ４２に書き
替える。また，例えば，「学校に」が正しくなければ，
オペレータはその旨をデータ入力部より入力すると，知
識データ４３はそのまま残す。あるいは，例えば，知識
データ４３の構文に入力データの構文の認識格を付与し
たデータが正しいとオペレータにより認識され，その状
態において，さらに属性値を付与したデータが正しくな
いと判定された場合には，知識データ４３と知識データ
４３に属性値を付加した知識データの双方を保存する
（図２２の実施例の説明参照）。Therefore, the knowledge data reference unit 16 converts the format of the syntax data A (the structure of the attribute value (“I”,
A search is made from the knowledge data holding unit for knowledge data having a format similar to the structure of the attribute name ("school", "go", etc.)) or the attribute name ("nominative", "objective", "behavioral", etc.). As a result, knowledge data 42 as shown is found. Then, for example, the attribute values of the two are compared, and there is no purpose case “school” included in the syntax A in the existing knowledge data 43. Therefore, the input requesting unit 18 and the editing unit 51 again ask “Do you go to school? Is created and displayed on the output means 31. The operator looks at the display and, if correct, inputs that fact through the data input means 11.
Then, the existing knowledge data 43 is rewritten to the knowledge data 42. For example, if "at school" is not correct,
When the operator inputs that fact through the data input unit, the knowledge data 43 is left as it is. Alternatively, for example, if the operator recognizes that the data in which the syntax of the input data is added to the syntax of the knowledge data 43 is correct, and in that state, it is determined that the data to which the attribute value is further added is not correct, Then, both the knowledge data 43 and the knowledge data 43 to which the attribute value is added are stored (see the description of the embodiment in FIG. 22).

【００７６】図２０は，本発明の再度入力要求部のフロ
ーチャートである。Ｓ１データ入力がなされる。
Ｓ２データ入力されたデータを分解素に分解し，知識
データ保持部のフォーマットに当てはめる。Ｓ３格
認識データ，事例データ（分解素，属性等）に不足があ
るか判定する。不足がなければ，処理を終了する。あれ
ば，Ｓ４に進む。Ｓ４フォーマットの知識データの不足を画面表示し，
知識データの追加を確認する。Ｓ５フォーマットに知識データ（格認識データ，事例
データ）を追加する。Ｓ６新しいフォーマットを生成する。FIG. 20 is a flowchart of the re-input request unit of the present invention. S1 Data input is performed.
S2 The input data is decomposed into decomposed elements and applied to the format of the knowledge data holding unit. S3 It is determined whether the case recognition data and the case data (decomposition elements, attributes, etc.) are insufficient. If there is no shortage, the process ends. If there is, go to S4. The screen displays the lack of knowledge data in S4 format,
Confirm the addition of knowledge data. Knowledge data (case recognition data, case data) is added to the S5 format. S6 Generate a new format.

【００７７】上記は格認識データについて説明したが，
本発明は，事例データについても適用できるものである
（図２３の実施例の説明参照）。このように，本発明
は，入力データの意味を保存されている知識データをも
とに意味付けし，知識データのフォーマットに不足があ
り，意味付けできない場合には，不足を補った構文を出
力手段に表示し，オペレータに正否を判定させるもので
ある。そのように，知識データの内容を充実させ，有意
義な知識データのデータベースを対話形式で構築するこ
とができるものである。上記において，分解素への分
解，もしくは格認識において，その結果をディスプレイ
に表示し，オペレータと対話する形式について説明した
が，出力部は，音声出力装置を備え，音声によりオペレ
ータへの正否の判断を問いかけるようにすることも可能
である。Although the above has described case recognition data,
The present invention is also applicable to case data (see the description of the embodiment in FIG. 23). As described above, according to the present invention, the meaning of the input data is given based on the stored knowledge data, and if the format of the knowledge data is insufficient, and the meaning cannot be provided, a syntax that compensates the lack is output. The information is displayed on the means and the operator judges whether it is correct or not. In this way, the content of the knowledge data can be enhanced and a database of meaningful knowledge data can be constructed in an interactive manner. In the above, in the case of decomposition into decomposed elements or case recognition, the result was displayed on the display and the format for interacting with the operator was explained. However, the output unit is equipped with a voice output device, and the voice is used to judge whether the operator is correct or not. It is also possible to ask a question.

【００７７】[0077]

【実施例３】図２１は本発明の実施例３であって，属
性名（格等）とその属性値により知識データと入力デー
タを比較することにより知識データを充実するものであ
る。構文Ａ（４１）が入力されたとする。知識データ
作成部１３において，構文Ａが分解素に分解する。分解
素分解部１４は入力データを分解素に分解する。このと
き，フォーマット判定部１９はその複数の分解素の作る
フォーマットを，自身がもつフォーマットと比較する。
編集部５１は分解素分解部１４で分解された分解素，お
よびフォーマット判定部１９の比較結果に基づくオペレ
ータへの質問事項等を編集し，出力手段に表示する。あ
るいは，質問内容を音声データに編集して，音声出力装
置より出力する。オペレータはその結果により，データ
入力手段１１より，正しく分解されるようにキーボー
ド，マウス等によりデータ入力する。あるいは，音声に
より入力し，入力データを意味がとおるように正しく分
解する。Embodiment 3 FIG. 21 shows Embodiment 3 of the present invention, in which knowledge data is enriched by comparing knowledge data with input data based on an attribute name (case, etc.) and its attribute value. Assume that syntax A (41) is input. In the knowledge data creating unit 13, the syntax A is decomposed into decomposed elements. The decomposition element decomposition unit 14 decomposes input data into decomposition elements. At this time, the format determination unit 19 compares the format created by the plurality of decompositions with its own format.
The editing unit 51 edits the decomposed element decomposed by the decomposed element decomposing unit 14 and the questions to the operator based on the comparison result of the format judging unit 19, and displays them on the output unit. Alternatively, the content of the question is edited into audio data and output from the audio output device. Based on the result, the operator inputs data from the data input means 11 using a keyboard, a mouse, or the like so as to be correctly disassembled. Alternatively, input is made by voice, and the input data is correctly decomposed so as to make sense.

【００７８】このように分解された分解素に対し，格認
識をして，意味付けを行う。例えば，「私」，「は」，
「学校」，「へ」，「行く」，「。」に分解されたデー
タの場合，「私」に対して主格，「行く」に対して行動
格，「学校」に対して目的格が付与され，知識データ４
２が作成される。このとき，知識データ参照部は，知識
データベース保持部（図示せす）を検索して，その属性
値（私は，学校，行く）および属性名（主格，目的格，
行動格）で，共通的なパターンをもつ知識データを求め
る。図４の場合，全く同じデータの構成（私は学校に行
く）を表すデータはないが，主格の「私」，行動格の
「行く」，「手段格の自転車」の知識データ４３があ
り，それが一番入力データのフォーマットに近いもであ
るとする。再度入力要求部１８は，それを求めて構文Ａ
の属性名，属性値と比較する。その結果，知識データ４
２には目的格の「学校に」がないので，編集部５１は
「どこに？」，「学校に？」を出力手段３１に表示し
て，オペレータに質問する。あるいは，質問を音声デー
タとして音声出力装置に出力し，オペレータに正しいデ
ータの入力を促す。今の場合，その質問は正しいので，
その格認識は正しいものとされる。そして，次に知識デ
ータ４２には手段格「自転車」があるので，知識データ
４２に目的格「学校に主格「学校に」を追加したデータ
を考慮し，そこにある手段格「自転車」について，編集
部５１は，「なにで？」，「自転車で？」という質問を
出力手段３１に表示する。The decomposed element thus decomposed is recognized as a case and given meaning. For example, "I", "ha",
In the case of data decomposed into "school", "he", "go", and ".", Nominative case is given to "me", action case is given to "go", and purpose case is given to "school". And knowledge data 4
2 is created. At this time, the knowledge data reference unit searches the knowledge database holding unit (not shown), and finds the attribute value (I, school, going) and the attribute name (Nominative, Objective,
Behavioral data), and obtain knowledge data having a common pattern. In the case of FIG. 4, there is no data representing the exact same data configuration (I go to school), but there is the knowledge data 43 of "I" of the nominative, "go" of the behavioral, and "bicycle of the means". It is assumed that it is closest to the format of the input data. Again, the input requesting unit 18 obtains the
Is compared with the attribute name and attribute value. As a result, knowledge data 4
2 does not have the purpose “at school”, the editing unit 51 displays “where?” And “at school” on the output means 31 and asks the operator a question. Alternatively, a question is output to a voice output device as voice data, and the operator is prompted to input correct data. In this case, the question is correct,
That case recognition is correct. Then, since the knowledge data 42 has the measure "bicycle", the data in which the objective case "school has a nomination of" school "is added to the knowledge data 42. The editing unit 51 displays the question “what?” And “by bicycle?” On the output unit 31.

【００７９】そこで，オペレータはその問いかけを正し
いとした場合には，知識データ４３に目的格「学校に」
を追加した知識データ４２を新規データとして知識デー
タ４３を置き換える。この時，正しくないとされた場合
には，知識データ４３がそのまま残され，知識データ４
３に目的格「学校」を追加した知識データが新規データ
として追加される（この知識データの構成は図示さてい
ない）。質問された内容に対して応答するオペレータ
によるデータ入力は，キーボード等によるデータ入力の
他に，マイク等を利用した音声入力でも良い。Therefore, if the operator determines that the question is correct, the knowledge data 43 indicates that the purpose is “school”.
Is replaced with the knowledge data 43 as new data. At this time, if incorrect, the knowledge data 43 is left as it is,
Knowledge data obtained by adding the target case “school” to 3 is added as new data (the configuration of this knowledge data is not shown). The data input by the operator responding to the questioned content may be a voice input using a microphone or the like in addition to the data input using the keyboard or the like.

【００８０】[0080]

【実施例４】図２２は本発明の実施例４であって，格認
識データ（主格，目的格等の格の属性名）により，入力
データと知識データのフォーマットを比較するものであ
る。知識データ保持部に既存の知識データ５３’と知識
データ５４’が存在している状態において，構文Ａ（４
１）が入力されたとする。知識データ作成部は構文Ａ
（４１）を，分解素に分解し，主格「私」，目的格「学
校」，行動格「行く」を格認識する。次に，知識データ
保持部を参照し，知識データの持つ格認識データのフォ
ーマットを参照する。知識データ５３’と構文Ａ（４
１）とは二種類の格が一致し，知識データ５４’と構文
Ａ（４１）とは一種類の格が一致するだけであるので，
知識データ５３’を類似するフォーマットの知識データ
として選択する。そこで，知識データ５３’と構文Ａ
（４１）を比較し，知識データ５３’では構文Ａの行動
格「行く」がないので，両方に一致する主格「私」，目
的格「学校」，行動格「行く」を追加したデータＢ（図
示せず）を作成する。そして，知識データ５３’にある
手段格「徒歩」により，編集部５１は「なにで？」，
「徒歩で？」を出力手段３１に表示して，オペレータに
質問する。オペレータは，その質問を正しいと入力すれ
ば，新規の知識データ５５が知識データ５３’に置き替
わって登録される。また，フォーマットの類似する知識
データが複数ある場合には，それまでその使用において
選択された回数，あるいは選択された確率の高い方を選
択する。Embodiment 4 FIG. 22 shows Embodiment 4 of the present invention, in which the format of input data and the format of knowledge data are compared based on case recognition data (attribute names of cases such as nominative case and objective case). In a state where the existing knowledge data 53 'and the knowledge data 54' exist in the knowledge data holding unit, the syntax A (4
Assume that 1) is input. The knowledge data creation unit uses syntax A
(41) is decomposed into decomposed elements to recognize the nominative case “I”, the purpose case “school”, and the action case “go”. Next, the format of the case recognition data of the knowledge data is referred to by referring to the knowledge data holding unit. Knowledge data 53 'and syntax A (4
1) matches two kinds of cases, and the knowledge data 54 'and the syntax A (41) only match one kind of case.
The knowledge data 53 'is selected as knowledge data of a similar format. Therefore, the knowledge data 53 'and the syntax A
By comparing (41), there is no behavioral case “go” of the syntax A in the knowledge data 53 ′, so the data B (“B”) added with the nominative case “I”, the purpose case “school”, and the behavioral case “go” that match both. (Not shown). Then, based on the measure “walk” in the knowledge data 53 ′, the editing unit 51 determines “what?”
"On foot?" Is displayed on the output means 31 to ask the operator a question. If the operator inputs that question is correct, the new knowledge data 55 is registered in place of the knowledge data 53 '. When there are a plurality of pieces of knowledge data having similar formats, the number of times of use or the higher probability of selection is selected.

【００８１】また，「なにで？」，「徒歩で？」に対し
て，「自転車で」と回答すれば，図示の知識データ５５
の手段格の属性値が「自転車」とされた新規の知識デー
タが作成される。また，この時，「ノー」とのみ回答さ
れた場合には，知識データ５３’はそのまま保存され，
知識データ５３’に行動格「行く」を追加したデータＢ
が新規の知識データとして登録される。If "by bicycle" is answered in response to "what?" And "on foot?"
The new knowledge data in which the attribute value of the measure is "bicycle" is created. At this time, if only "no" is answered, the knowledge data 53 'is stored as it is,
Data B in which behavioral status “go” is added to knowledge data 53 ′
Is registered as new knowledge data.

【００８２】図２３は本発明の実施例３であって，事例
データについてフォーマットを当てはめるものである。
図２３は，知識データ保持部に適切なフォーマットがな
い場合に，その知識データ保持部の不足を補った新しい
フォーマットを生成し，知識データを充実させ，入力デ
ータの意味付けを行うものである。図２３は，事例デー
タとして図示のような知識データ６１，知識データ６２
が存在しているとする。このような時に，事例データＡ
（６０）がデータ入力されたとする。知識データ作成部
１３は，名前「ＸＸＸ」に対して属性「名前」を付与
し，年令の属性値「４０才」に対して属性「年令」を付
与し，属性値「円満」に対して属性「性格」を付与する
（この付与は知識データ保持部を検索し，例えば，知識
データ６１を基にして与えられる）。FIG. 23 shows a third embodiment of the present invention, in which a format is applied to case data.
FIG. 23 shows a case where, when there is no appropriate format in the knowledge data holding unit, a new format that compensates for the lack of the knowledge data holding unit is generated, the knowledge data is enriched, and the meaning of the input data is given. FIG. 23 shows knowledge data 61 and knowledge data 62 as illustrated as case data.
Assume that exists. In such a case, case data A
It is assumed that (60) has been input. The knowledge data creating unit 13 assigns the attribute “name” to the name “XXX”, assigns the attribute “age” to the age attribute value “40 years old”, and assigns the attribute “age” to the attribute value “Enma”. The attribute “character” is assigned to the attribute (this assignment is performed by searching the knowledge data holding unit and, for example, based on the knowledge data 61).

【００８３】そして，再度入力要求部１８は知識データ
６１，知識データ６２と事例データＡ（６０）を比較
し，知識データ６１が属性「名前」，「年令」が共通で
あって，共通属性が多いので，類似事例として抽出す
る。そして，その属性「性格」が「攻撃的」であること
から，編集部は，「性格は攻撃的ですか？」をディスプ
レイに表示して，オペレータにデータの訂正の有無を促
す。そして，事例データＡ（６０）の性格の属性値「円
満」を「攻撃的」に変更するのであれば，「ハイ」をデ
ータ入力手段により入力し，変更しないのあれば「イイ
エ」を入力する。ハイの時，事例データＡ（６０）の性
格「円満」を性格「攻撃的」に変更された知識データ６
１が保存される。また，「イイエ」の時，性格「円満」
とされた新規データ６３が作成される。また，その質問
の回答として，他の属性値「正直」等をデータ入力した
場合には性格「正直」等で書き換えられたデータが新規
データ（図示せず）として保持される。Then, the input requesting unit 18 again compares the knowledge data 61 and the knowledge data 62 with the case data A (60), and if the knowledge data 61 has the common attributes “name” and “age”, Because there are many cases, they are extracted as similar cases. Then, since the attribute “character” is “aggressive”, the editing unit displays “Is character aggressive?” On the display to urge the operator to correct the data. Then, if the attribute value of the character of the case data A (60) is to be changed to "aggressive" to "aggressive", "high" is inputted by the data input means, and if not, "no" is inputted. . At the time of the high, the knowledge data 6 in which the character “Enjoy” of the case data A (60) is changed to the character “Aggressive”
1 is saved. In the case of "No", the character "Enmitsu"
New data 63 is created. If another attribute value “honest” or the like is input as an answer to the question, the data rewritten with the character “honest” or the like is held as new data (not shown).

【００８４】[0084]

【実施例５】図２４は本発明のシステム構成の実施例を
示す。図２４において，１１はデータ入力手段であ
る。３１は出力手段であって，ディスプレイ，スピー
カ，音声増幅器等により構成される音声出力装置，ビデ
オ装置等の動画を出力するである。５０はコンピュータ
である。５２は入出力インタフェースであって，データ
入力手段１１，データ出力手段（ディスプレイ，プリン
タ，音声出力装置，ビデオ装置等），外部記憶装置７１
とのインタフェースである。Embodiment 5 FIG. 24 shows an embodiment of the system configuration of the present invention. In FIG. 24, reference numeral 11 denotes data input means. Reference numeral 31 denotes an output means for outputting a moving image such as an audio output device including a display, a speaker, and an audio amplifier, and a video device. 50 is a computer. Reference numeral 52 denotes an input / output interface, which includes data input means 11, data output means (display, printer, audio output device, video device, etc.), and external storage device 71.
Interface.

【００８５】５３はデータ処理部であって，入力データ
を処理するものである（図１８のデータ処理手段１０お
よびデータ保持手段２０を含む）。６１はＣＰＵであ
る。６２はＲＡＭである。６３はＲＯＭである。６４は
内部磁気ディスクである。（以上，データ処理部５３）データ入力手段１１において，８１はキーボード，マウ
スである。８３は音声入力装置である。８４は図形デー
タ入力装置であって，イメージリーダ等の図形データを
入力するものである。Reference numeral 53 denotes a data processing section for processing input data (including the data processing means 10 and the data holding means 20 in FIG. 18). 61 is a CPU. 62 is a RAM. 63 is a ROM. 64 is an internal magnetic disk. (The above is the data processing unit 53) In the data input unit 11, reference numeral 81 denotes a keyboard and a mouse. 83 is a voice input device. Numeral 84 denotes a graphic data input device for inputting graphic data from an image reader or the like.

【００８６】８５は電子カメラである。８６はＣＤ−Ｒ
ＯＭドライブである。８７はビデオカメラであって，動
画等の画像データを入力するものてある。マイクであっ
て，音声を入力するものである。９１は記憶媒体であっ
て，磁気ディスクもしくはＣＤＲＯＭ等である。９２は
インストールプログラムであって，記憶媒体９１に記録
されているプログラムを内部磁気ディスク６４５３にイ
ンストールするためのプログラムである。１２１は知識
データ作成プログラムであって，知識データ作成部のプ
ログラムである。１２２は分解素分解プログラムであっ
て，分解素分解部のプログラムである。１２３は格認識
プログラムであって，格認識部のプログラムである。
１２４は知識データ参照プログラムであって，知識デー
タ参照部のプログラムである。Reference numeral 85 denotes an electronic camera. 86 is CD-R
OM drive. Reference numeral 87 denotes a video camera for inputting image data such as a moving image. A microphone for inputting voice. Reference numeral 91 denotes a storage medium, such as a magnetic disk or a CDROM. An installation program 92 is a program for installing the program recorded on the storage medium 91 on the internal magnetic disk 6453. Reference numeral 121 denotes a knowledge data creation program, which is a program of a knowledge data creation unit. A decomposition element decomposition program 122 is a program for a decomposition element decomposition unit. Reference numeral 123 denotes a case recognition program, which is a program of a case recognition unit.
Reference numeral 124 denotes a knowledge data reference program, which is a program of a knowledge data reference unit.

【００８７】１２５は再度入力要求プログラムである。
１３１は編集プログラムであって，編集部のプログラム
である。１３２は登録プログラムであって，登録部の
プログラムである。なお，図２４７は本発明の装置構成
の例を示すものであって，例えば，データ入力部の全て
を必要とするものではない。Reference numeral 125 denotes an input request program again.
Reference numeral 131 denotes an editing program, which is a program of the editing unit. 132 is a registration program, which is a program of the registration unit. FIG. 247 shows an example of the device configuration of the present invention, and does not require, for example, all of the data input units.

【００８９】図２５は，本発明の分解素の不完全デー
タを補う方法の例である。本発明は，上記のように知識
データを作成するものであるが，分解素のデータに不完
全がある場合には，図２５のようにデータを完全にする
ものである。構文Ａが入力されたとする。格認識データ
作成部は図示のように分解素に分解して，格認識する。
この時，分解素の入力データ「コンポータ」に対して，
知識データ保持部の類似辞書７２を参照し，再度入力要
求部１８および編集部５１により「コンポータ」に類似
する単語「コンピュータ」，「コンクリート」，「コメ
ンテート」等を表示部に表示する。そして、オペレータ
はコンピュータを選択し，分解素の入力データ「コンポ
ータ」を「コンピュータ」に訂正する。FIG. 25 shows an example of a method for compensating for incomplete data of a decomposition element according to the present invention. According to the present invention, the knowledge data is created as described above. If the data of the decomposed element is incomplete, the data is completed as shown in FIG. Assume that syntax A has been entered. The case recognition data creating unit decomposes the case as shown in FIG.
At this time, for the input data “comporter” of the decomposition element,
With reference to the similar dictionary 72 of the knowledge data holding unit, the input request unit 18 and the editing unit 51 again display words “computer”, “concrete”, “comment”, etc. similar to “component” on the display unit. Then, the operator selects a computer and corrects the input data “comporter” of the decomposition element to “computer”.

【００９０】[0090]

【実施例６】図２６は本発明の実施例のフローチャート
である。Ｓ１入力データの格認識データをする。Ｓ２格認
識データに入力データ（知識データ）を当てはめる。Ｓ３プリミティブデータが見当たらないか判定する。
見当たれば（ＮＯ）Ｓ１１に進み，見当たらなければ
（ＹＥＳ）Ｓ４に進む。Ｓ４類似データを見つけ，問い合わせる。Ｓ５類似データに正しいデータはあるか判定する。あ
れば（ＹＥＳ）Ｓ６に進み，なければＳ７に進む。Ｓ６類似データとして認識する。Ｓ７再度入力要求をする。Ｓ８類似データに正しいデータはあるか判定する。あ
れば（ＹＥＳ）Ｓ１０に進み，なければＳ９に進む。Embodiment 6 FIG. 26 is a flowchart of an embodiment of the present invention. S1 The case recognition data of the input data is obtained. S2 The input data (knowledge data) is applied to the case recognition data. S3 Determine whether the primitive data is missing.
If found (NO), proceed to S11; if not found (YES), proceed to S4. S4 Find similar data and make an inquiry. S5 It is determined whether there is correct data in the similar data. If there is (YES), proceed to S6, otherwise proceed to S7. S6 Recognize as similar data. S7: Request for input again. S8 It is determined whether there is correct data in the similar data. If there is (YES), proceed to S10, otherwise proceed to S9.

【００９１】Ｓ１１プリミティブデータが不足してい
るか判定する。不足していれば（ＹＥＳ）Ｓ１２に進
み，不足していなければ（ＮＯ）Ｓ１３に進む。Ｓ１
２プリミティブデータの入力をしてもらう。Ｓ１３多過ぎるか判定する。多過ぎれば（ＹＥＳ）Ｓ
１０に進み，多過ぎなければ（ＮＯ）であれば，処理を
終了する。S11 It is determined whether primitive data is insufficient. If there is not enough (YES), the process proceeds to S12, and if not, the process proceeds to S13 (NO). S1
2 Have the user enter primitive data. S13 It is determined whether there is too much. If too much (YES) S
Proceed to 10, and if not too much (NO), end the process.

【００９２】[0092]

【実施例７】図２７は本発明の基本構成を示す。本発
明において，意味付けは入力データを分解素に分解して
格認識する等の構文解析をすること，構文パターンに分
類付けをすること（肯定文，否定文，疑問文等の文意理
解のためのものや，個人のクセや上司と話すときや，話
し言葉，書き言葉など状況で変化するものの分類等），
あるいデータの内容に具体的意味を持たせること（例え
ば，固有名詞「山田」に対して，趣味野球，正確温厚等
を属性として持たせる）等のように入力データに意味を
与えることである。Embodiment 7 FIG. 27 shows a basic configuration of the present invention. In the present invention, the meaning is to analyze the input data by decomposing it into decomposed elements to perform syntactic analysis such as case recognition, and to classify into syntactic patterns (to understand sentence understanding such as positive sentences, negative sentences, and question sentences). Or when talking with personal habits or bosses, or classifying things that change with the situation, such as spoken or written language),
To give meaning to the input data, such as giving specific meaning to the content of the data (for example, giving the proper noun "Yamada" an attribute of hobby baseball, accurate warmth, etc.). .

【００９３】図２７において，１はデータ入力手段で
あって，テキストデータ入力手段，音声データ入力手
段，図形データ入力手段等を備え，テキストデータ，図
形データ，音声データ等の自然言語，マルチメディアデ
ータを入力するものである。２はテキストデータ入力手
段であって，文字コード等のテキストデータを入力する
ものであり，キーボード等である。３は音声データ入
力手段であって，マイクロフォン等である。４は図形
データ入力手段であって，イメージスキャナ，デジタル
カメラ，ビデオ等の文字，図形等の静止画データもしく
は動画データを入力するものである。あるいは，ポイン
ティング装置等の図形を描くためにベクトルデータを入
力する手段である。In FIG. 27, reference numeral 1 denotes data input means, which is provided with text data input means, voice data input means, graphic data input means, etc., and includes natural data such as text data, graphic data, voice data, and multimedia data. Is entered. Reference numeral 2 denotes text data input means for inputting text data such as a character code, such as a keyboard. Reference numeral 3 denotes voice data input means, such as a microphone. Reference numeral 4 denotes a graphic data input means for inputting still image data or moving image data such as characters and figures such as an image scanner, a digital camera and a video. Alternatively, it is a means for inputting vector data for drawing a figure such as a pointing device.

【００９４】１０はデータ入力処理手段であって，デー
タ入力手段１により入力されるデータの種別を判別し，
入力データに種別（テキストデータ，音声データ，イメ
ージデータ等）を表すデータを付与して知識データ処理
手段に入力するものである。１１はデータ種別判定部で
あって，入力データの種別を判定するものである。１４
はデータ出力処理手段であって，出力するデータの種類
を判定し，出力装置を選択するものである。１５は出力
手段判定部である。２０は知識データ処理手段であっ
て，入力データに対して利用者と対話しながら意味付け
をするものである。２２は意味付け部であって，知識デ
ータ保持手段３０を参照して入力データを構文解析し，
入力データに意味付けをするものである。Reference numeral 10 denotes data input processing means for determining the type of data input by the data input means 1;
Data representing the type (text data, voice data, image data, etc.) is added to the input data and input to the knowledge data processing means. Reference numeral 11 denotes a data type determination unit that determines the type of input data. 14
Denotes data output processing means for determining the type of data to be output and selecting an output device. Reference numeral 15 denotes an output means determination unit. Reference numeral 20 denotes a knowledge data processing means for giving meaning to input data while interacting with a user. 22 is a semantic unit which parses the input data with reference to the knowledge data holding means 30,
It gives meaning to the input data.

【００９５】２４は知識ドメイン判定部であって，関連
知識データへのインデックスを作成する部分であり，入
力データの技術分野はなにか，あるいは一般知識に属す
るものであるか等の知識ドメインの判定や，どの知識デ
ータとどのように関係があるか等を判定し，インデック
スを作成するものである。２７は対話処理部であって，
入力データの意味付けの過程において，知識処理部の判
定結果に基づいて，入力データの補足入力を促す，構文
解析の結果もしくは意味付けの結果，あるいは知識デー
タ保持手段３０の検索結果等について音声出力するかも
しくはディスプレイ表示等により利用者と対話処理をす
るものである。Reference numeral 24 denotes a knowledge domain determination unit which creates an index to related knowledge data, and determines a knowledge domain such as what the technical field of input data belongs to or general knowledge. , Which knowledge data and how they are related to each other, and create an index. 27 is a dialogue processing unit,
In the process of giving meaning to the input data, based on the determination result of the knowledge processing unit, the input of the input data is urged, the result of the syntax analysis or the result of the meaning, or the voice output of the search result of the knowledge data holding means 30 is output. Or perform interactive processing with the user by display or the like.

【００９６】２８は再入力要求部であって，知識処理部
２１からの指示によりデータの補足的な入力を要求した
り，利用者に質問等をしたりするものである。２９は登
録部であって，知識処理部２１で意味付けした知識デー
タを自己の知識データ保持手段３０に登録するものであ
る。３０は自己の知識データ保持手段であって，自己
（現時点での利用者）の知識データ保持手段である。３
１は知識ドメインインデックス保持部であって，知識デ
ータ領域における個人情報を示す個人情報インデック
ス，知識データ領域における専門的技術分野の知識デー
タを示す専門分野ドメインインデックス，知識データ領
域における一般的知識を示す一般知識インデックスを保
持するものである。Reference numeral 28 denotes a re-input requesting unit for requesting a supplementary input of data in response to an instruction from the knowledge processing unit 21 or asking a user a question. Reference numeral 29 denotes a registration unit for registering the knowledge data given by the knowledge processing unit 21 in its own knowledge data holding means 30. Numeral 30 denotes own knowledge data holding means, which is the own (current user) knowledge data holding means. 3
Reference numeral 1 denotes a knowledge domain index holding unit, which indicates a personal information index indicating personal information in the knowledge data area, a specialty domain index indicating knowledge data in a technical field in the knowledge data area, and general knowledge in the knowledge data area. This holds the general knowledge index.

【００９７】３２は個人情報インデックスであって，個
人の好みクセ等の個人情報へのインデックスを表す。
３３は専門分野のインデックスである。３４は一般知
識のインデックスである。３５は知識データ領域であ
って，意味付けされた知識データを保持する領域であ
る。５５はデータ出力手段である。５６は画像出力
手段であって，ディスプレイ等である。５７は印刷出
力手段であって，プリンタ等である。５８は音声出力
手段であって，スピーカ等である。Reference numeral 32 denotes a personal information index, which represents an index to personal information such as personal preference.
33 is an index of a specialty field. 34 is an index of general knowledge. Numeral 35 is a knowledge data area, which is an area for retaining meaningful knowledge data. 55 is a data output means. Reference numeral 56 denotes an image output means such as a display. Reference numeral 57 denotes a print output unit, such as a printer. Reference numeral 58 denotes an audio output unit such as a speaker.

【００９８】図２７の本発明の基本構成の動作を説明す
る。データ入力手段１からデータが入力される。入力
データ処理手段１０において，データ種別判定部１１は
入力データのメディア種を判定し，メディア種を表すデ
ータを入力データに付加し，知識データ処理手段２０に
送る。知識データ処理手段２０において，知識データ
処理手段２０は入力データを構文解析し，知識データ保
持手段３０の知識データベースを参照して，入力データ
に意味付けする。構文解析は，例えば，入力データをデ
ータの意味のあるかたまりの最小単位である分解素に分
解し，知識データ参照部２６により知識データ保持手段
３０を参照する。そして，知識データ保持手段３０に保
持されているすでに意味付けされている知識データのフ
ォーマットをもとに入力データの構文パターンと一致す
るフォーマットを求め，意味付けをする。The operation of the basic configuration of the present invention shown in FIG. 27 will be described. Data is input from the data input unit 1. In the input data processing means 10, the data type determination unit 11 determines the media type of the input data, adds data representing the media type to the input data, and sends the data to the knowledge data processing means 20. In the knowledge data processing means 20, the knowledge data processing means 20 parses the input data and refers to the knowledge database of the knowledge data holding means 30 to give meaning to the input data. In the syntax analysis, for example, the input data is decomposed into decomposers which are the minimum units of a meaningful chunk of data, and the knowledge data reference unit 26 refers to the knowledge data holding unit 30. Then, a format that matches the syntax pattern of the input data is determined based on the format of the knowledge data that has already been given and held in the knowledge data holding means 30, and is given a meaning.

【００９９】例えば，「私は学校に行く。」というデー
タが入力された時，分解素に分解する。この場合
「私」，「は」，「学校」，「に」，「行く」，「。」
に分解する。そして「私」と「は」の並びの構文フォー
マットのパターンと比較する。「私は」という構文フォ
ーマットはすでに「私は」の「私」は主語，「は」は主
格を表す助詞であるように意味付けされているので，デ
ータ入力された「私」は主語，「は」は主格を表す助詞
であると判断する。さらに，知識データ保持手段３０を
参照し，すでに登録されている構文パターンもしくは単
語の定義情報等に基づいて，「学校」に対して目的格，
「に」に対して目的を表す助詞格，「行く」に対して
「行動格」，「。」に対して終了格等のように意味付け
をする。For example, when data “I go to school.” Is input, the data is decomposed into decomposers. In this case, "I", "Ha", "School", "Nii", "Go", "."
Decompose into Then, it is compared with the pattern of the syntax format of "I" and "ha". The syntactical format "I am" is already meaning that "I" in "I" is the subject and "ha" is a particle representing the nominative case, so the data input "I" is the subject, "I" Is determined to be a particle representing the nominative case. Further, referring to the knowledge data holding means 30, based on the syntactic pattern or the definition information of the words which have been already registered, the purpose case and the
"Ni" is given a meaning such as a particle case representing the purpose, "go" is given an "action case", and "." Is given a final case.

【０１００】意味付けされた結果は対話処理部２７によ
り対話データとされ，データ出力手段５５に出力され
る。例えば，「今入力されたデータの意味はこれでよろ
しいですか？」等音声出力および／もしくはディスプレ
イ表示することにより利用者の判断を求める。また，例
えば，「私は８時に行きます。」とデータ入力された
時，知識データ処理部２０は構文解析した結果，どこに
行くのかを表す目的格が不足していることを判断し，対
話処理部２７を介して，音声出力および／もくしはディ
スプレイ表示により「どこに行くのですか？」等を利用
者に質問し，入力データの補足入力を促す。The result with the meaning is converted into dialog data by the dialog processing unit 27 and output to the data output unit 55. For example, the user's judgment is requested by audio output and / or display on a display such as "Is the meaning of the data just input what is it now?" Also, for example, when the data “I am going at 8 o'clock” is input, the knowledge data processing unit 20 determines that there is not enough purpose to indicate where to go as a result of parsing, and performs interactive processing. Through the unit 27, the user is asked to ask "Where are you going?" Or the like by voice display and / or display on the display to urge the user to supplement the input data.

【０１０１】また，このような対話処理による意味付け
の過程において，その人の言葉の使い方，例えば，「え
−っと」，「そしてね」等のクセがあれば，それ言葉の
クセとしてその人の名前とともに知識データ保持部に登
録する。その後，「えーっと」と話す人を探して下さ
い」等のデータ入力があれば，そのクセをインデックス
として，その人を高速に検索して探すことができる。あ
るいは，その人が「えーっと，私は学校に行く。」と音
声によりデータ入力した時に，その入力データを書き言
葉に変換する等の時，「えーっと，」はその人の話し方
のクセであるとして，「私は学校に行く。」等のように
変換する。Further, in the process of giving meaning through such dialogue processing, if there is a habit of using the word of the person, for example, a habit such as “Ehto” or “And Ne”, the habit of the word is used as the habit of the word. It is registered in the knowledge data holding unit together with the person's name. Then, if there is a data entry such as "Look for a person who speaks," you can search for that person at high speed using that habit as an index. Or, when the person inputs data by voice saying "Um, I go to school." And converts the input data into written words, etc., "Um," is a habit of the person's speech. "I go to school."

【０１０２】あるいは，意味付けの過程において，知識
ドメイン判定部２４は，入力データの内容が医学あるい
は法律等のような専門領域であることを判定し，知識ド
メインを求める。そして，その知識ドメインのインデッ
クスにより知識データを参照することにより高速に検索
処理を行うことができる。また，知識データ処理手段
２０において，知識処理部２１は，登録されている知識
データの使用のされ方を判断し（例えば，使用のされ方
についての統計データをとる等），専門知識としてイン
デックスされているが，一般知識として利用のされ方が
多ければ一般知識にインデックスを作成し，こちらの方
を優先的に利用する。あるいはその反対に専門知識にイ
ンデックスされていたものを一般知識にインデックスし
なおす。Alternatively, in the process of assigning meanings, the knowledge domain determination unit 24 determines that the contents of the input data are a specialized area such as medicine or law, and obtains a knowledge domain. Then, the search process can be performed at high speed by referring to the knowledge data using the index of the knowledge domain. Further, in the knowledge data processing means 20, the knowledge processing unit 21 determines how the registered knowledge data is used (for example, obtains statistical data on the usage) and is indexed as expert knowledge. However, if it is used as general knowledge, an index is created for general knowledge, and this is used preferentially. Or, on the contrary, what was indexed to specialized knowledge is re-indexed to general knowledge.

【０１０３】また，本発明は，言葉タイプを判断し，他
の言葉タイプに変換する。例えば，話し言葉でデータ入
力されたが，知識データには話し言葉で登録されていな
い時，そのような時，入力データが話し言葉であること
は判断し，書き言葉に変換する。言葉タイプは，書き言
葉，ビジネス言葉，仲間言葉，幼児言葉等である。ま
た，本発明では，知識データ保持手段の参照は，利用者
個人の知識データだけでなく，他人の知識データベース
も参照する。その際，例えば，相手の知識データが方言
をベースに作成されているようなことがある。そのよう
な場合には，入力データをその方言に翻訳し，高速に検
索できるようにする。Further, according to the present invention, a word type is determined and converted to another word type. For example, when data is input in a spoken language but not registered in the knowledge data in a spoken language, in such a case, it is determined that the input data is a spoken language, and the input data is converted to a written language. The word types are written words, business words, fellow words, infant words, and the like. Further, in the present invention, the reference to the knowledge data holding means refers not only to the knowledge data of the individual user but also to the knowledge database of another person. At that time, for example, there is a case where the knowledge data of the other party is created based on a dialect. In such a case, the input data is translated into the dialect so that the data can be searched at high speed.

【０１０４】さらに，他人の知識データベースでも意味
付け，検索等をできない場合には，課題の知識を理解で
きる有知識者を求め，その有知識者情報を利用者に提供
することができる。あるいは，その有知識者に自動的に
電話する，あるいはインタ−ネットで質問内容を自動的
に送信する等を行うようにすることもできる。上記のよ
うに，本発明によれば，入力データに対して，システム
の方から利用者に質問し，その場でシステムと対話的に
解決し，さらにそれでも知識が不足する場合には他人の
知識データを借用したり，有知識者への問い合わせなど
を行い，即時に知識を充足できる。そのため，本発明に
よれば，利用者の個性に応じた適切な意味付けを利用者
により容易に行うことができる。そのため，利用者毎に
最適な知識データベースを容易に作成することができ
る。Further, if it is not possible to make a meaning, search, etc. in the knowledge database of another person, a knowledgeable person who can understand the knowledge of the problem can be obtained and the knowledgeable person information can be provided to the user. Alternatively, the intellectual can be automatically called, or the contents of the question can be automatically transmitted over the Internet. As described above, according to the present invention, the system asks the user about the input data, solves it interactively with the system on the spot, and furthermore, if the knowledge is still insufficient, the knowledge of another person is obtained. By borrowing data and making inquiries to intellectuals, the knowledge can be filled immediately. Therefore, according to the present invention, it is possible for the user to easily give an appropriate meaning according to the personality of the user. Therefore, it is possible to easily create an optimal knowledge database for each user.

【０１０５】また，その際，大量の知識データの中から
目的の知識を効率良く見つけることができるように，個
人情報，専門分野，一般知識等の知識ドメインを求め，
検索領域を絞り込めるので，能率的に知識データの作成
を行うことができる。また，知識の欠落を的確に判断し
て正確な構文解析を，対話処理により行うことができる
ので，知識データベースの作成を利用者自身が簡単な操
作で高速に行うことができる。At this time, knowledge domains such as personal information, specialized fields, and general knowledge are obtained so that the target knowledge can be efficiently found from a large amount of knowledge data.
Since search areas can be narrowed down, knowledge data can be created efficiently. In addition, since accurate parsing can be performed by interactive processing by accurately determining the lack of knowledge, the user can create the knowledge database at a high speed with a simple operation.

【０１０６】[0106]

【実施例８】図２８は本発明の実施例８の構成を示
す。図２８において，１はデータ入力手段である。
データ入力手段の詳細は図３に示す。５は入出力インタ
フェースである。入出力インタフェース５において，１
０はデータ入力処理手段である。１１はデータ種別判定
部である。１４はデータ出力処理手段である。１５は出
力手段判定部である。１６はネットワークインタフェー
スである。２０は知識データ処理手段である。２１は知
識処理部である。２６は知識データ参照部である。Eighth Embodiment FIG. 28 shows a configuration of an eighth embodiment of the present invention. In FIG. 28, reference numeral 1 denotes data input means.
Details of the data input means are shown in FIG. Reference numeral 5 denotes an input / output interface. In input / output interface 5, 1
0 is a data input processing means. Reference numeral 11 denotes a data type determination unit. 14 is a data output processing means. Reference numeral 15 denotes an output means determination unit. Reference numeral 16 denotes a network interface. 20 is a knowledge data processing means. 21 is a knowledge processing unit. 26 is a knowledge data reference unit.

【０１０７】知識データ参照部２６において，４１は
自己の知識データ参照の処理であって，利用者の自己の
知識データ保持手段３０’を参照する手段である。４２
は他人の知識データ参照の処理であって，他人の知識デ
ータ保持手段Ａ（４０），他人の知識データ保持手段Ｂ
（４１）を参照する手段である。４３は，有知識者の問
い合わせの処理であり，有知識者のリスト作成もしく
は，有知識者へ電話する，インターネット等で質問する
等をするものである。In the knowledge data reference section 26, reference numeral 41 denotes a process of referring to own knowledge data, which is a means for referring to the user's own knowledge data holding means 30 '. 42
Is a process of referring to another person's knowledge data. The other person's knowledge data holding means A (40), the other person's knowledge data holding means B
This is means for referring to (41). Reference numeral 43 denotes processing of inquiries of intellectuals, for example, making a list of intellectuals, making a telephone call to intellectuals, making a question on the Internet or the like.

【０１０８】２７は対話処理部である。２８は再入力
要求部である。２９は登録部である。３０は知識デ
ータ保持手段である。３０’は自己の知識データ保持
手段である。３２’は個人の好み，クセ等の個人情報の
知識ドメインのインデックスである。３３’は専門分野
の知識ドメインのインデックスである。３４’は一般常
識の知識ドメインのインデックスである。３５’は知識
データファイルであって，自己の知識データを保持する
もばＩ用する。あるいはその反対に専門知識にインデッ
クスされていたものを一般知識にインデックスしなお
す。Reference numeral 27 denotes a dialog processing unit. 28 is a re-input request unit. 29 is a registration unit. Reference numeral 30 denotes a knowledge data holding unit. 30 'is own knowledge data holding means. 32 'is an index of a knowledge domain of personal information such as personal preference and habit. 33 'is an index of the knowledge domain of the specialized field. 34 'is an index of a knowledge domain of general common sense. Numeral 35 'is a knowledge data file which is used for holding its own knowledge data. Or, on the contrary, what was indexed to specialized knowledge is re-indexed to general knowledge.

【０１０９】また，本発明は，言葉タイプを判断し，他
の言葉タイプに変換する。例えば，話し言葉でデータ入
力されたが，知識データには話し言葉で登録されていな
い時，そのような時，入力データが話し言葉であること
は判断し，書き言葉に変換する。言葉タイプは，書き言
葉，ビジネス言葉，仲間言葉，幼児言葉等である。ま
た，本発明では，知識データ保持手段の参照は，利用者
個人の知識データだけでなく，他人の知識データベース
も参照する。その際，例えば，相手の知識データが方言
をベースに作成されているようなことがある。そのよう
な場合には，入力データをその方言に翻訳し，高速に検
索できるようにする。Further, according to the present invention, a word type is determined and converted to another word type. For example, when data is input in a spoken language but not registered in the knowledge data in a spoken language, in such a case, it is determined that the input data is a spoken language, and the input data is converted to a written language. The word types are written words, business words, fellow words, infant words, and the like. Further, in the present invention, the reference to the knowledge data holding means refers not only to the knowledge data of the individual user but also to the knowledge database of another person. At that time, for example, there is a case where the knowledge data of the other party is created based on a dialect. In such a case, the input data is translated into the dialect so that the data can be searched at high speed.

【０１１０】さらに，他人の知識データベースでも意味
付け，検索等をできない場合には，課題の知識を理解で
きる有知識者を求め，その有知識者情報を利用者に提供
する５３は有知識者にアクセスするネットワークであ
る。５５はデータ出力手段である（詳細は図４に示
す）。６１はアプリケーションプログラム認識部であっ
て，他のシステムとの連携手段であり，他のアプリケー
ションプログラムを認識し，接続をとるものである。６
２，６３，６４，６５は他のアプリケーションプログラ
ムである。図２８の構成の動作は後述する。Further, if it is not possible to make a meaning or search in another person's knowledge database, a knowledgeable person who can understand the knowledge of the problem is sought, and the knowledgeable person information 53 is provided to the user. The network to access. 55 is a data output means (details are shown in FIG. 4). Reference numeral 61 denotes an application program recognizing unit, which is a means for cooperating with another system, for recognizing another application program and establishing a connection. 6
2, 63, 64 and 65 are other application programs. The operation of the configuration in FIG. 28 will be described later.

【０１１１】図２９は本発明の知識処理部の構成の実施
例の構成を示す。図２９において，２１は知識処理部で
ある。２２は意味付け部である。意味付け部２２におい
て，７１は分解素分解部である。７２は構文解析部であ
る。７３はフォーマット判定部であって，入力データの
分解素の並びのパターンと知識データ保持部に保持され
ている構文フォーマットを比較するものである。FIG. 29 shows the configuration of the embodiment of the configuration of the knowledge processing section of the present invention. In FIG. 29, reference numeral 21 denotes a knowledge processing unit. Reference numeral 22 denotes a meaning assigning unit. In the meaning assignment unit 22, reference numeral 71 denotes a decomposition element decomposition unit. Reference numeral 72 denotes a syntax analysis unit. Reference numeral 73 denotes a format determination unit which compares a pattern of arrangement of decomposed elements of the input data with a syntax format stored in the knowledge data storage unit.

【０１１２】７４は個人情報判定部であって，入力デー
タから個人情報を判定するものである。７５は言葉の
分類部であって，利用者の言葉のタイプ（話し言葉，書
き言葉，ビジネス言葉等）を判定するものである。７
５’はフォーマット判定部であって，入力された言葉の
タイプを知識データ保持部に保持された言葉タイプのパ
ターンを比較するものである。７５”は言葉タイプ変換
部であって，例えば，話し言葉を書き言葉に変換する，
等の変換処理を行うものである。Reference numeral 74 denotes a personal information determination unit for determining personal information from input data. Reference numeral 75 denotes a language classification unit for determining the type of the user's language (spoken language, written language, business language, and the like). 7
Reference numeral 5 'denotes a format determination unit for comparing the input word type with the word type pattern held in the knowledge data holding unit. 75 ”is a word type conversion unit that converts spoken words into written words,
And the like.

【０１１３】７６は自分に特有のパターンと他人に特有
のパターンを比較し，必要に応じて変換するものであ
る。例えば，利用者に関西なまりがあり，参照しようと
する他人の知識データが標準語である時に標準語に変換
する等を行うものである。７６’はフォーマット判定部
であって，自己に特有のパターンと他人に特有のパター
ンを比較するものである。７６”はパターン変換部であ
って，例えば，方言を標準語等に変換するものである。
７８は専門分野判定部であって，入力データの専門分野
を判定するものである。７９は一般知識判定部であっ
て，入力データの一般知識を判定するものである。Reference numeral 76 compares a pattern peculiar to oneself with a pattern peculiar to another person, and converts the pattern if necessary. For example, when there is Kansai accent in the user and the knowledge data of another person to be referred to is a standard language, it is converted into a standard language. Reference numeral 76 'denotes a format determination unit which compares a pattern unique to the user with a pattern unique to another person. Reference numeral 76 "denotes a pattern conversion unit which converts a dialect into a standard language or the like.
Reference numeral 78 denotes a specialized field determining unit for determining the specialized field of the input data. Reference numeral 79 denotes a general knowledge determination unit that determines general knowledge of input data.

【０１１４】個人情報判定部７４，専門分野判定部７
８，一般知識判定部７９は知識ドメイン判定部２４（図
２７）参照に含まれるものである。８０は統計部であ
って，単語の使われ方の統計をとるものである。一般知
識判定部７９は統計部８０の統計を基に，例えば，専門
分野にインデックスされていた知識データを一般知識に
変更する，あるいはその反対に一般知識にインデックス
されていたものを専門語に変更する等の処理を行うもの
である。Personal information judging section 74, specialized field judging section 7
8. The general knowledge determination unit 79 is included in the knowledge domain determination unit 24 (see FIG. 27). Reference numeral 80 denotes a statistics unit for calculating statistics on how words are used. Based on the statistics of the statistics section 80, the general knowledge determination section 79 changes, for example, the knowledge data indexed to the specialized field to general knowledge or vice versa. It performs processing such as performing

【０１１５】[0115]

【実施例９】図３０は本発明のシステム構成の実施例を
示す。図３０において，１はデータ入力手段であ
る。データ入力手段１において，１１１はキーボー
ド，マウスである。１１２はペン入力装置である。１
１３は音声入力装置である。１１４はＯＣＲであって，
光学的に原稿の文字等を読み取るものである。１１５は
電子カメラであって，文字，図形等の原稿を撮影し，記
憶媒体に記録するものである。Embodiment 9 FIG. 30 shows an embodiment of the system configuration of the present invention. In FIG. 30, reference numeral 1 denotes data input means. In the data input means 1, reference numeral 111 denotes a keyboard and a mouse. 112 is a pen input device. 1
Reference numeral 13 denotes a voice input device. 114 is an OCR,
This is for reading characters or the like of a document optically. Reference numeral 115 denotes an electronic camera which captures a document such as a character or a figure and records the document on a storage medium.

【０１１６】１１６はビテオカメラであって，動く被写
体を撮影し，記憶媒体に動画データ記録するものであ
る。１１７は図形データ入力装置であって，イメージ
スキャナ等であり，原稿の文字，図形等を，読み取り画
像データを得るものである。５５はデータ出力手段であ
る。データ出力手段５５において，１２１はディスプ
レイである。１２２はプリンタである。１２３はファッ
クスである。Reference numeral 116 denotes a video camera for photographing a moving subject and recording moving image data on a storage medium. Reference numeral 117 denotes a graphic data input device, which is an image scanner or the like for reading characters, figures, etc. of a document to obtain image data. 55 is a data output means. In the data output means 55, reference numeral 121 denotes a display. Reference numeral 122 denotes a printer. 123 is a fax.

【０１１７】１２５はビデオ装置である。５は入出力
インタフェースである。入出力インタフェース５にお
いて，１６はネットワークインタフェースである。
１３６は周辺装置インタフェースであって，データ入力
処理手段，データ出力処理手段等を含むものである。
１３０は外部記憶装置であって，知識データを保持する
記憶装置である。１４０と記憶媒体であって，本発明
を実行するためのプログラムを保持するフロッピーディ
スク等の記憶媒体である。Reference numeral 125 denotes a video device. Reference numeral 5 denotes an input / output interface. In the input / output interface 5, reference numeral 16 denotes a network interface.
A peripheral device interface 136 includes a data input processing unit, a data output processing unit, and the like.
Reference numeral 130 denotes an external storage device, which is a storage device that holds knowledge data. 140 and a storage medium, such as a floppy disk that stores a program for executing the present invention.

【０１１８】１４２はインストールプログラムであっ
て，本発明のシステムを実行する各種プログラムをイン
ストールするものである。１４３は入出力処理プログ
ラムであって，データ入力手段，データ出力手段を制御
するプログラムである。１４５は知識データ作成プロ
グラムであって知識データ処理をするプログラムであ
る。１４６は分解素分解プログラムであって，入力デ
ータを分解素に分解するプログラムである。１４７は
構文解析プログラムであって，構文解析等を行い分解素
の格の認識等を行うプログラムである。Reference numeral 142 denotes an installation program for installing various programs for executing the system of the present invention. An input / output processing program 143 is a program for controlling data input means and data output means. Reference numeral 145 denotes a knowledge data creation program which performs knowledge data processing. A decomposition element decomposition program 146 is a program for decomposing input data into decomposition elements. Reference numeral 147 denotes a syntax analysis program which performs syntax analysis and the like to recognize the case of a disassembly element.

【０１１９】１４８は個人情報判定プログラムであっ
て，個人情報を判定するプログラムである。個人の言葉
タイプを判定するプログラム，個人の異なる言葉タイプ
間で相互変換するプログラムを含むものである。１４
９は専門分野判定プログラムであって，知識ドメインを
判定するプログラムである。１５０は一般知識判定プ
ログラムであって，一般知識であるか，あるいは専門知
識あるかを判定するプログラムである。１５１知識デ
ータ参照プログラムであって，知識データ保持手段を検
索するプログラムである。Reference numeral 148 denotes a personal information determination program, which is a program for determining personal information. The program includes a program for determining individual word types and a program for mutually converting between different individual word types. 14
Reference numeral 9 denotes a specialized field determination program, which is a program for determining a knowledge domain. Reference numeral 150 denotes a general knowledge determination program, which is a program for determining whether general knowledge or specialized knowledge exists. 151 is a knowledge data reference program for searching for knowledge data holding means.

【０１２０】１５２は登録プログラムであって，意味付
けした知識データを知識データ保持手段に登録するプロ
グラムである。１５３は自己と他人のパターンの翻訳
プログラムであって，自己と他人のフォーマットの相違
を判断し，相互の変換を行うものである。１６０はデ
ータ処理部である。データ処理部１６０において，１
６１はＣＰＵである。１６２はＲＡＭである。１６
３はＲＯＭである。１６４は内部磁気ディスクである。A registration program 152 is a program for registering meaningful knowledge data in the knowledge data holding means. Reference numeral 153 denotes a translation program for the pattern of the self and that of another person, which determines the difference between the format of the self and that of the other person and performs mutual conversion. 160 is a data processing unit. In the data processing unit 160, 1
61 is a CPU. 162 is a RAM. 16
3 is a ROM. 164 is an internal magnetic disk.

【０１２０】１６５は知識データ作成プログラムてあっ
て，内部磁気ディスク１６４に保持された知識データ作
成プログラムを表す。１７０は個人または共通または他
人の知識データ保持手段である。図３１は図２８の実施
例構成の動作説明図である。データ入力手段１によりデ
ータが入力される。入力データはデータ種別が判定さ
れ，データ種別を付されて知識データ処理手段２０に入
力される。知識処理部２１において、知識ドメイン判定
部２４により、入力データの知識ドメインが判定され
る。Reference numeral 165 denotes a knowledge data creation program, which is a knowledge data creation program held on the internal magnetic disk 164. 170 is an individual or common or other person's knowledge data holding means. FIG. 31 is a diagram for explaining the operation of the embodiment of FIG. Data is input by the data input unit 1. The data type of the input data is determined, the data type is added, and the data is input to the knowledge data processing means 20. In the knowledge processing section 21, the knowledge domain determination section 24 determines the knowledge domain of the input data.

【０１２１】入力データは分解素分解部７１により分解
素に分解され、自己の知識データ保持手段３０′を参照
し、構文解析部７２において構文解析される。ここで、
じこの知識データ保持手段３０′もしくは他人のデータ
保持手段４０′がさんしょうされ、構文解析される。例
えば、「私は学校に自転車で行く。」と入力された時、
「私」、「は」、「学校」、「に」、「自転車」、
「で」、「行く」「。」等に分解する。そして、既に登
録されている構文フォーマットと深くされ、「私」に対
して「主語」、「は」に対して主格の助詞、「学校」に
対して目的格、「に」に対して目的の助詞格、「自転
車」に対して手段格、「で」に対して手段を表す助詞
格、「行く」に対して「行動格」、「。」に対して終了
格を付与し、意味付けをする。The input data is decomposed into decomposed elements by a decomposed element decomposing unit 71, and the parsing unit 72 parses the input data with reference to its own knowledge data holding unit 30 '. here,
The knowledge data holding means 30 'or another person's data holding means 40' is intercepted and parsed. For example, when "I go to school by bicycle" is entered,
"I", "Ha", "School", "Ni", "Bicycle",
Decompose into "in", "go", ".", Etc. Then, the syntactical format that has already been registered is deepened, and the subject is the subject for "I", the noun particle for "ha", the objective case for "school", The particle case, the device case for "bicycle", the particle case for "de", the action case for "go", and the end case for "." I do.

【０１２２】そして、構文解析により入力データに欠落
格が有るかどうかをはんていする（Ｓ１の処理）。例え
ば、入力データが「私は自転車で行く。」等のように入
力データに欠落格があれば、対話処理手段（図３１では
図示せず、図２８参照）により「どこに？」をデータ出
力手段５５に転送する。データ出力により音声で、「ど
こに？」と利用者に問い掛ける。或いはディスプレイ２
２２に「どこに？」を表示して、利用者にデータの細く
入力を促す。Then, it is checked whether or not there is a missing case in the input data by the syntax analysis (processing of S1). For example, if the input data has a missing part such as "I go by bicycle" or the like, "where?" Transfer to 55. Ask the user "Where?" Or display 2
22, "Where?" Is displayed to prompt the user to input data thinly.

【０１２３】そして利用者が、音声入力もしくはキーボ
ード入力等により「学校」が入力されると，分解素分解
部７１は，補足された入力データが付け加えられた入力
データを分解素に分解し，構文解析部７２は前述のよう
に構文解析して，意味付けをする。そしてＳ１の判定処
理で欠落格なし，あるいは欠落格を無視することの指示
の入力が利用者からなされたことを判定したら，その構
文解析結果をディスプレイ２２２に表示し，利用者の確
認をとる。When the user inputs “school” by voice input or keyboard input, the decomposition element decomposition section 71 decomposes the input data to which the supplementary input data is added into decomposition elements, and The analysis unit 72 performs syntax analysis as described above to give meaning. When it is determined in the determination process of S1 that there is no missing case or that an instruction to ignore the missing case has been input by the user, the syntax analysis result is displayed on the display 222, and the user is confirmed.

【０１２４】Ｓ２の判定処理において，利用者が装置の
解析して求めた構文解析結果に満足であることの入力が
なされたら，登録部２９は意味付けされた文章データを
自己の知識データ保持手段３０’に登録し，次のフォー
マット判定の処理（Ｓ３）をする。Ｓ３の処理において
自己の知識データ保持手段３０’を検索し，対話処理し
ながら入力データのフォーマットと知識データ保持手段
３０’に保持されているフォーマットを比較し，フォー
マット判定の処理をする。例えば，入力データの言葉タ
イプと登録されている言葉タイプを比較し，変換処理す
る等の処理を行う。そして，その処理の結果を対話処理
部２７を介してディスプレイ２２２に表示する。In the determination process of S2, when the user inputs that the user is satisfied with the syntax analysis result obtained by analyzing the apparatus, the registration unit 29 stores the meaningful sentence data in its own knowledge data holding unit. 30 'and perform the following format determination processing (S3). In the processing of S3, the own knowledge data holding means 30 'is searched, the format of the input data is compared with the format held in the knowledge data holding means 30' while performing interactive processing, and the format determination processing is performed. For example, processing such as comparing the word type of the input data with the registered word type and performing conversion processing is performed. Then, the result of the processing is displayed on the display 222 via the interactive processing unit 27.

【０１２５】Ｓ４の判定処理で利用者が満足であれば，
登録部２９により自己の知識データ保持手段３０にフォ
ーマット判定の結果を登録する。なお，図３１におい
て，分解素分解部７１，構文解析部７２，自己の知識デ
ータ保持手段３０’の参照により構文を求める処理は利
用者と対話しながら複数回繰り返してなされるものであ
る。また，構文解析された結果に対して，自己の知識デ
ータ保持手段３０’を参照してフォーマット判定するＳ
３の処理も利用者と対話しながら複数回行われるもので
ある。また，知識ドメインが最善であるかどうかの判
定，知識ドメインの変更等もＳ３の処理に含まれるもの
である。さらに，専門知識から一般知識へのインデック
スの変更，あるいは一般知識から専門知識への変更等も
Ｓ３の処理に含まれるものである。If the user is satisfied in the judgment processing of S4,
The registration unit 29 registers the format determination result in its own knowledge data holding unit 30. In FIG. 31, the processing for obtaining a syntax by referring to the decomposition element decomposition unit 71, the syntax analysis unit 72, and the own knowledge data holding unit 30 'is repeatedly performed a plurality of times while interacting with the user. In addition, the format determination is performed on the result of the syntax analysis with reference to the own knowledge data holding means 30 '.
The process 3 is also performed a plurality of times while interacting with the user. Further, the determination of whether the knowledge domain is the best, the change of the knowledge domain, and the like are also included in the processing of S3. Further, the change of the index from the specialized knowledge to the general knowledge or the change from the general knowledge to the specialized knowledge is also included in the processing of S3.

【０１２６】Ｓ４の処理で，利用者が満足でないことを
判定したら，あるいは自己のパターンを他人のパターン
に翻訳するような場合には，Ｓ５で他人の知識データ保
持手段４０’を検索して，入力データのパターンと他人
の知識データ保持手段４０’に保持されているフォーマ
ットを比較する。そして，比較結果を対話処理部２７を
介してディスプレイ２２２に表示する。ここでもＳ５の
処理は利用者と対話しながら複数回行われるものであ
る。また，知識ドメインの判定および変更もＳ５の処理
に含まれるものである。さらに，専門語から一般知識へ
のインデックスの変更，および一般知識から専門知識へ
の変更等もＳ３の処理に含まれるものである。If it is determined in step S4 that the user is not satisfied, or if the user's own pattern is to be translated into another person's pattern, another person's knowledge data holding means 40 'is searched in step S5. The pattern of the input data is compared with the format held in the knowledge data holding means 40 'of another person. Then, the comparison result is displayed on the display 222 via the interactive processing unit 27. Here also, the process of S5 is performed a plurality of times while interacting with the user. The determination and change of the knowledge domain are also included in the processing of S5. Further, the change of the index from the specialized word to the general knowledge, the change from the general knowledge to the specialized knowledge, and the like are also included in the processing of S3.

【０１２７】Ｓ６の処理により，利用者が満足であるこ
とを判定したら，登録部２９により意味付けの結果を自
己の知識データ保持手段３０’に登録する。Ｓ６の処
理で，利用者が満足でないと判定したら，Ｓ７の処理お
よび９３の処理により，有知識者のリストを作成してデ
ィスプレイに表示する。そして，利用者はディスプレイ
に表示された有知識者の電話番号等により有知識者と連
絡をとる。もしくは有知識者に装置から自動電話する，
あるいはネットワークを介してインターネット等のコン
ピュータ通信により有知識者に質問する。If it is determined in step S6 that the user is satisfied, the registration unit 29 registers the result of the meaning in the own knowledge data holding unit 30 '. If it is determined in step S6 that the user is not satisfied, a list of intellectuals is created and displayed on the display in steps S7 and 93. Then, the user contacts the intellectual by the telephone number or the like of the intellectual displayed on the display. Or automatically call an intellectual from the device,
Alternatively, a question is asked to the intellectual by computer communication such as the Internet via a network.

【０１２８】そこで，有知識者から回答が得られれば，
その回答をもとに意味付け（構文解析およびフォーマッ
ト判定）し，ディスプレイ２２２に表示する。利用者
が満足であれば，登録部２９により意味付けした結果を
自己の知識データ保持手段３０’に登録する。利用者が
満足でなければ，利用者に提示する利用者が自身で判断
もしくは有知識者から得た知識をもとに意味付けを作成
し，自己の知識データ保持手段３０’に登録する。Therefore, if an answer can be obtained from a knowledgeable person,
Based on the answer, meaning is given (syntax analysis and format determination), and displayed on the display 222. If the user is satisfied, the result given by the registration unit 29 is registered in the own knowledge data holding means 30 '. If the user is not satisfied, the user presented to the user creates a meaning based on his / her own judgment or knowledge obtained from the intellectual, and registers it in his / her own knowledge data holding means 30 '.

【０１２９】図３２は本発明の入力データ処理部のフロ
ーチャートである。図示のステップの順番に従って動
作を説明する。Ｓ１データ入力をする。Ｓ２メ
ディア属性があるか判定する。メディア属性があれば処
理を終了する。なければ，入力装置（入力デバイス）を
チェックする。まず，テキスト系デバイス（キーボー
ド等）であるか判定する。テキスト系デバイスであれば
テキスト系メディアＩＤを付与し，処理を終了する。テ
キスト系デバイスでなければ音声系デバイス（音声入力
装置）であるかないかを判定する。FIG. 32 is a flowchart of the input data processing section of the present invention. The operation will be described according to the order of the illustrated steps. S1 Input data. S2 Determine whether there is a media attribute. If there is a media attribute, the process ends. If not, check the input device (input device). First, it is determined whether the device is a text device (keyboard or the like). If the device is a text-based device, a text-based media ID is assigned, and the process ends. If it is not a text device, it is determined whether or not the device is a voice device (voice input device).

【０１３０】音声系デバイスであれば，音声系メディア
ＩＤを付与して処理を終了する。音声系デバイスでなけ
ればイメージ系デバイスであるかないかを判定する。イ
メージ系デバイス（スキャナー等）であればイメージ系
メディアＩＤを付与し，処理を終了する。イメージ系デ
バイスでなければベクトル系デバイス（マウス，ポイン
ティングデバイス等）であるかないかを判定する。If the device is an audio device, an audio media ID is assigned, and the process is terminated. If it is not an audio device, it is determined whether or not it is an image device. If the device is an image-based device (such as a scanner), an image-based media ID is assigned, and the process ends. If it is not an image device, it is determined whether it is a vector device (mouse, pointing device, etc.).

【０１４０】ベクトル系デバイスであればベクトル系メ
ディアＩＤを付与し，処理を終了する。ベクトル系デバ
イスでなければ動画系デバイスか判定する。動画系デ
バイス（ビデオカメラ等）であれば動画系メディアＩＤ
を付与し，処理を終了する。動画系デバイスでなければ
次の処理（Ｓ３）をする。If the device is a vector device, a vector media ID is assigned, and the process is terminated. If it is not a vector device, it is determined that the device is a moving image device. Video-based media ID for video-based devices (such as video cameras)
Is added, and the process ends. If it is not a moving image device, the following processing (S3) is performed.

【０１４１】Ｓ３登録されているいずれのデバイスで
もないので利用者に問い合わせる。Ｓ４，Ｓ４’，Ｓ５
利用者がメディアタイプを入力したか判定する。メデ
ィアの入力があればＳ５で指示されたメディアＩＤを付
与する。メディアタイプの指示がなければ処理を終了す
る。S3 Since the device is not any registered device, an inquiry is made to the user. S4, S4 ', S5
Determine whether the user has entered a media type. If there is a media input, the media ID specified in S5 is assigned. If there is no instruction of the media type, the process ends.

【０１４２】図３３は出力データ処理部のフローチャー
トである。Ｓ１出力データを受け取る。Ｓ２テ
キスト系データか判定する。テキスト系データであれば
テキスト系デバイス（ディスプレイ等）を選択し，処理
を終了する。テキスト系データでなければ音声系データ
であるかないかを判定する。FIG. 33 is a flowchart of the output data processing section. S1 Receive output data. S2 Determine whether the data is text data. If the data is text data, a text device (display or the like) is selected, and the process is terminated. If the data is not text data, it is determined whether the data is audio data.

【０１４３】音声系データであれば，音声系デバイス
（音声出力装置等）を選択して処理を終了する。音声系
データでなければイメージ系データであるかないかを判
定する。イメージ系データであればイメージ系デバイス
（ディスプレイ等）を選択し，処理を終了する。イメー
ジ系データでなければベクトル系データであるかないか
を判定する。If the data is audio data, an audio device (audio output device or the like) is selected, and the process is terminated. If it is not audio data, it is determined whether or not it is image data. If it is image data, an image device (display or the like) is selected, and the process is terminated. If it is not image data, it is determined whether or not it is vector data.

【０１４４】ベクトル系データであればベクトル系デバ
イス（ディスプレイ等）を選択し，処理を終了する。ベ
クトル系データでなければ動画系データか判定する。
動画系データであれば動画系デバイス（ビデオ装置等）
を選択し，処理を終了する。動画系データでなければ次
の処理（Ｓ３）をする。Ｓ３登録されているいずれのデバイスでもないので利
用者に問い合わせる。Ｓ４，Ｓ４’，Ｓ５利用者がメディアタイプを指示し
たか判定する。メディアタイプの入力があればＳ５で指
示されたデバイスを選択する。メディアタイプの指示が
なければ処理を終了する。If it is vector data, a vector device (display or the like) is selected, and the process is terminated. If it is not vector data, it is determined that the data is moving image data.
For moving image data, moving image devices (video equipment, etc.)
Is selected to end the processing. If it is not moving image data, the following processing (S3) is performed. S3 Inquires the user because it is not any registered device. S4, S4 ', S5 It is determined whether the user has designated the media type. If there is an input of the media type, the device designated in S5 is selected. If there is no instruction of the media type, the process ends.

【０１４５】図３４は本発明の言葉タイプの分類のフロ
ーチャートである。Ｓ１判定対象の言葉を取り込
む。Ｓ２話し言葉か判定する。話しことばであれば
話し言葉系のフォーマットデータの参照をする。話し言
葉でなければ書き言葉であるか判定する。FIG. 34 is a flow chart for classifying word types according to the present invention. S1 A word to be determined is captured. S2 It is determined whether it is a spoken language. For spoken language, refer to spoken language format data. If it is not a spoken language, determine if it is a written language.

【０１４６】書き言葉であれば書き言葉系のフォーマッ
トデータを参照する。書き言葉でなければビジネス言葉
か判定する。ビジネス言葉であればビジネス系言葉の
フォーマットデータを参照する。ビジネス言葉でなけれ
ば仲間系の言葉か判定する。仲間系の言葉のフォーマッ
トデータであれば仲間系の言葉のフォーマットデータを
参照する。仲間系の言葉でなければ幼児言葉が判定す
る。幼児言葉であれば幼児言葉系のフォーマットデー
タを参照する。幼児言葉でなければ次の処理（Ｓ３）を
する。In the case of a written language, format data of a written language is referred to. If it is not a written word, judge it as a business word. If it is a business language, the format data of the business language is referred to. If it is not a business language, judge it as a fellow language. If the format data is for fellow words, the format data of fellow words is referred to. If it is not a fellow language, the infant language is judged. If it is an infant language, refer to the format data of the infant language system. If it is not an infant language, the following processing (S3) is performed.

【０１４７】Ｓ３新規言葉グループのため利用者に言
葉を問い合わせ!Ｓ４，Ｓ４’，Ｓ５利用者が言葉を
指示すればＳ５で指示された言葉タイプを差言いようす
る。利用者からの言葉の指示がなければ処理を終了す
る。S3 Inquire the user for words for the new word group! S4, S4 ', S5 If the user indicates a word, the user will try to use the word type specified in S5. If there is no instruction from the user, the process is terminated.

【０１４８】図３６は言葉タイプの相互変換のフロチャ
ートである。Ｓ１変換対象の言葉を取り込むＳ２話言葉か判定する。話言葉であれば話し言葉とし
て認識し、処理を終了する。話ことばでなければ書き言
葉かを判定する。ビジネス言葉であればビジネスことば
として認識し、Ｓ６の処理を行う。ビジネス言葉でなけ
れば仲間言葉か判定する。FIG. 36 is a flowchart of the mutual conversion of word types. S1 Import the word to be converted S2 Determine whether it is a spoken word. If it is a spoken language, it is recognized as a spoken language, and the process ends. If it is not a spoken word, determine whether it is a written word. If it is a business word, it is recognized as a business word, and the process of S6 is performed. If it is not a business language, judge it as a companion language.

【０１４９】仲間言葉であれば仲間言葉と認識し、Ｓ６
の処理を行う。仲間言葉でなければ幼児言葉か判定す
る。幼児言葉であれば幼児言葉と認識し、Ｓ６の処理を
行う。幼児言葉でなければ次ぎの処理（Ｓ３）の処理を
行う。If it is a fellow language, it is recognized as a fellow language, and S6
Is performed. If it is not a fellow language, it is judged as an infant language. If it is an infant language, it is recognized as an infant language, and the process of S6 is performed. If it is not an infant language, the process of the next process (S3) is performed.

【０１５０】Ｓ３新規言葉タイプのため利用者に言葉
タイプを問い合わせる。Ｓ４，Ｓ４’，Ｓ５利用者が言葉タイプを指示すれば
Ｓ５の指示された言葉タイプを採用する。利用者からの
言葉タイプの指示がなければＳ６の処理を行う。Ｓ６変換先の言葉タイプの中で該当するフォーマット
を選択する。S3 The user is inquired of the word type for the new word type. S4, S4 ', S5 If the user indicates the word type, the specified word type in S5 is adopted. If there is no instruction of the word type from the user, the process of S6 is performed. S6 Select a corresponding format among the word types to be converted.

【０１５１】図３６は本発明の自分の表現パターンと
他人の表現パターンとの間の翻訳のフローチャートであ
る。Ｓ１翻訳すべき言葉を取り込む。Ｓ２過去の翻訳例を探す。Ｓ３該当する知識デー
タがあるか判定し，あればＳ６の処理を行い，なければ
Ｓ４の処理を行う。FIG. 36 is a flowchart of the translation between one's own expression pattern and another's expression pattern according to the present invention. S1 Incorporate words to be translated. S2 Search for past translation examples. S3: It is determined whether there is the corresponding knowledge data. If there is, the process of S6 is performed.

【０１５２】Ｓ４，Ｓ４’ 翻訳方法を利用者に問い合
わせる。Ｓ５，Ｓ５’ 利用者が翻訳方法の指示の入
力を判断する。指示の入力があればＳ６の処理を行い，
なければ終了する。S4, S4 'Inquire the user about the translation method. S5, S5 'The user judges the input of the instruction of the translation method. If an instruction is input, the process of S6 is performed,
If not, end.

【０１５３】Ｓ６当該（指示された翻訳例）を参照す
る。例えば，「私はリンゴが好きです。」を翻訳する
場合を例として図１０のフローチャートを具体的に説明
する。Ｓ１において，「私はリンゴが好きです」を取り
込む。Ｓ２において，Ｂさんは，「私はリンゴが好きで
す」をどのように表現するか検索する。Ｓ３において，
該当知識があり，Ｂさんがアメリカ人であって「Ｉｌ
ｉｋｅａｎａｐｐｌｅ．」という知識とリンク付け
されていれば，それを使用する。Ｓ３において，該当知
識がなければ，「私」とはＢさんの言葉では「Ｉ」，
「好きです」とはＢさんの言葉では「ｌｉｋｅ」，「リ
ンゴ」とはＢさんの言葉では「ａｐｐｌｅ」，並べ方は
「Ｉｌｉｋｅａｎａｐｐｌｅ．」のような構文に
なるなどを指示する。S6 References the relevant (instructed translation example). For example, the flowchart of FIG. 10 will be specifically described with reference to a case where “I like apples” is translated. In S1, "I like apples" is imported. In S2, Mr. B searches for how to express “I like apples”. In S3,
He has the relevant knowledge, and Mr. B is American and says “I l
ike an apple. If it is linked to the knowledge ", use that. In S3, if there is no relevant knowledge, "I" is "I" in Mr. B's words,
"I like" instructs "like" in the word of Mr. B, "apple" instructs "apple" in the word of Mr. B, and the arrangement method is such as "I like an apple."

【０１５４】図３７は本発明の知識ドメインの自動設定
のフローチャートである。Ｓ１，Ｓ１’ 新規作業か
仕掛り中の作業か判断する。新規作業であればＳ２の処
理を行い，仕掛り中の作業であればＳ３の処理を行う。
Ｓ２直前のディレクトリ名やファイル名などから一
番関係の深い知識ドメインを選定する。Ｓ３仕掛り
中の知識ドメインを選定する。Ｓ４，Ｓ４’ 利用者
による明示的なドメイン指示があるかチェックする。ド
メイン指示があればＳ５の処理を行い，なければＳ９の
処理を行う。FIG. 37 is a flowchart of automatic setting of a knowledge domain according to the present invention. S1, S1 'It is determined whether the work is a new work or a work in progress. If it is a new work, the process of S2 is performed, and if it is a work in progress, the process of S3 is performed.
S2: The most relevant knowledge domain is selected from the directory name and file name immediately before. S3 Select a knowledge domain in process. S4, S4 'Check if there is an explicit domain instruction by the user. If there is a domain instruction, the process of S5 is performed, and if not, the process of S9 is performed.

【０１５６】Ｓ５，Ｓ５’ 与えられた知識ドメインと
装置が判断した知識ドメインと比較する。一致すればＳ
６の処理を行い。一致しなければＳ７の処理を行う。
Ｓ６当該知識ドメインを採用する。Ｓ７利用者に問
い合わせる。Ｓ７’，Ｓ７” 知識ドメインを入力す
る。Ｓ８利用者が使用する統計処理をダイナミック
に行い，この言葉が属する知識ドメインとして一番適切
な知識ドメインを監視する。ほかに最適な知識ドメイン
がみつかれば利用者に提示し，利用者の指示を待つ。S5, S5 'The given knowledge domain is compared with the knowledge domain determined by the device. If they match, S
6 is performed. If they do not match, the process of S7 is performed.
S6: adopt the knowledge domain. S7 Inquire the user. S7 ', S7 "Input the knowledge domain. S8 Dynamically perform the statistical processing used by the user, and monitor the most appropriate knowledge domain as the knowledge domain to which this word belongs. If it is, present it to the user and wait for the user's instruction.

【０１５７】Ｓ９利用者が使用する言葉の統計処理を
ダイナミックに行い，この言葉が属する知識ドメインと
して一番適切な知識ドメインを監視する。ほかに最適な
知識ドメインが見つかれば利用者に提示し利用者の指示
を待つ。S9 The statistical processing of the words used by the user is dynamically performed, and the most appropriate knowledge domain as the knowledge domain to which the words belong is monitored. If another optimal knowledge domain is found, it is presented to the user and waits for the user's instruction.

【０１５８】図３８は知識データの参照処理と有知識者
の知識獲得方法のフローチャートである。Ｓ１要求
知識と関係の深い知識ドメインを把握する。Ｓ２自
己の知識データを参照し，この知識ドメインの中から一
番関連の深い知識を表示する。Ｓ３表示された知識が満足であるかないかの利用者の
判断の入力を待ち，満足であれば，Ｓ３の処理を行い，
満足でなければＳ５の処理を行う。Ｓ４，Ｓ５利用者から満足でないの入力があれば他
人の知識データを参照し，この知識ドメインの中から一
番関連の深い知識を表示する。FIG. 38 is a flowchart of a process of referring to knowledge data and a method of acquiring knowledge of a knowledgeable person. S1 Grasp a knowledge domain closely related to the required knowledge. S2 Refers to its own knowledge data and displays the most relevant knowledge in this knowledge domain. S3 Waits for the user to input whether the displayed knowledge is satisfactory or not, and if satisfied, performs the processing of S3.
If not satisfied, the process of S5 is performed. S4, S5 If there is an input that the user is not satisfied, the knowledge data of another person is referred to, and the knowledge most closely related to this knowledge domain is displayed.

【０１５９】Ｓ４’，Ｓ３満足であることの入力があ
れば，Ｓ３で当該知識を採用する。Ｓ５有知識者テー
ブルを参照し，適切な専門家の電話やメール番号を提示
し，問い合わせを行うか利用者に確認する。Ｓ５’，Ｓ６利用者は問い合わせを行わず，利用者が
オフラインで関連資料を調査する。Ｓ５’，Ｓ６装置が自動的に調査し，複数の回答の中
から選定して表示する。Ｓ６’，Ｓ７利用者は，表示された選定結果に満足で
なければ，利用者がオフラインで関連資料を調査する。S4 ', S3 If there is an input of satisfaction, the knowledge is adopted in S3. S5: Refer to the intellectual knowledge table, present the telephone and e-mail number of an appropriate expert, and confirm with the user whether to make an inquiry. S5 ', S6 The user does not make an inquiry, and the user investigates the related data offline. S5 ', S6 The device automatically checks and selects and displays from among a plurality of answers. S6 ', S7 If the user is not satisfied with the displayed selection result, the user investigates the related data offline.

【０１６０】Ｓ６’，Ｓ３利用者から表示された選定
結果に満足であるという回答の入力があれば，Ｓ３で当
該知識を採用する。S6 ', S3 If there is an input from the user that the user is satisfied with the displayed selection result, the knowledge is adopted in S3.

【０１６１】[0161]

【発明の効果】本発明によれば，入力情報に対してすで
にラベリングされて保存されている辞書，文章等の各種
データが格納されているデータベースを参照し、必要に
応じてユーザー（オペレーター）にその結果を知らせ、
ユーザーはその結果を基にキーボード等による入力もし
くはマイク等を利用した音声入力により容易に対話形式
でユーザーの意図する正確なラベリングが可能となる。
そのため，入力情報に対して正確なラベルが付与された
（格認識、属性の設定）データをデータベースに登録す
ることが可能となる。また、そのラベリングの過程で，
すでに作成されている辞書の内容，文章等の各種データ
も，後から入力された情報をもとに，その内容を書き替
え，情報変化に応じて，学習しながらたえずその内容を
有意義に変更することができる。そのため，本発明によ
れば蓄積されるデータベースの内容が豊富になる。According to the present invention, a user (operator) is referred to a database in which various data such as dictionaries and sentences which are already labeled and stored for input information are stored. Let them know the result,
Based on the result, the user can easily and interactively perform accurate labeling intended by the user through input using a keyboard or the like or voice input using a microphone or the like.
Therefore, it is possible to register data in which an accurate label is assigned to the input information (case recognition, attribute setting) in the database. Also, during the labeling process,
Rewrites the contents of already created dictionaries and various data, such as sentences, based on the information entered later, and constantly changes the contents in a meaningful way while learning according to changes in information. be able to. Therefore, according to the present invention, the content of the database to be stored is abundant.

【０１６２】また、入力情報に対してすでにラベリング
され登録保存されている辞書，文章等の各種データを参
照し、対話形式でラベリングしていく際既に登録保存さ
れているデータのフォーマットと入力情報から作成した
データのフォーマットを比較し，当てはめながら，デー
タのラベルの不足を判定するので，能率的にラベリング
することができる。入力情報から作成したデータに必要
な情報が不足してラベリングできないような場合には，
ディスプレイ又は音声出力装置に不足する情報を表示も
しくは音声により示し，ユーザー（オペレータ）に不足
する情報入力を促すので，登録保存されているデータの
内容を豊富にすることができる。また、そのように補わ
れたデータに基づいてさらに入力情報から得られたデー
タのデータの不足を補うことができるので，情報変化に
応じて，蓄積されるデータの内容および入力情報から作
成されたデータは正確にラベリングされたデータとな
る。Further, referring to various data such as dictionaries and sentences which are already labeled and registered with respect to the input information, and performing the labeling in an interactive manner, the format and the input information of the data which has already been registered and stored are referred to. Since the format of the created data is compared and applied to determine the lack of data labels, labeling can be performed efficiently. If labeling cannot be performed due to lack of necessary information in data created from input information,
Insufficient information is displayed or displayed on a display or a sound output device, and the user (operator) is prompted to input the missing information, thereby enriching the contents of the registered and stored data. In addition, the lack of data of the data obtained from the input information can be further compensated based on the data supplemented in such a manner. Therefore, according to the information change, the data created based on the contents of the accumulated data and the input information is prepared. The data is correctly labeled data.

【０１６３】同様にデータの作成や更新，もしくは検索
に専門的技術者を必要とすることなく，利用者自身で容
易に行なえる。また、入力情報から選られたデータにデ
ータが不足してラベリングができない時，あるいは検索
において用語の意味，専門分野等を理解できない時等に
おいて，装置から利用者に質問して入力情報のデータの
補足を促し，利用者と装置が対話しながら入力情報から
得られたデータのラベリングができる。また，知識デー
タベースの検索を高速に能率的に行うことのできる。さ
らに，本発明は，意味付けされて登録されている知識デ
ータに対しても入力データの意味付けの過程において，
最新の内容に利用者と装置が対話しながら，内容を学習
し，知識データを更新することができる。そのため，使
用しながら知識データの内容が豊かになり，最新の状況
に対応できる知識データを常に維持できる。従って，本
発明を情報検索システム等に適用すれば，必要な情報を
対話形式で容易に検索することが可能になり，蓄積され
る情報も通常の使用で自動的に豊富にすることができ、
かつ最新の状況に対応できるデータベースを常に維持す
ることができる。Similarly, the user can easily perform the data creation, update, or search without requiring a specialized engineer. Also, when labeling cannot be performed due to lack of data in the data selected from the input information, or when it is difficult to understand the meaning of the term, specialized field, or the like in the search, the device asks the user a question, and the data of the input information is deleted. It encourages supplements and allows the user and the device to interact and label the data obtained from the input information. In addition, the knowledge database can be searched quickly and efficiently. Further, the present invention provides a method for inputting meaning data to knowledge data that has been registered with meaning.
While the user and the device interact with the latest contents, the contents can be learned and the knowledge data can be updated. Therefore, the content of the knowledge data becomes rich while being used, and the knowledge data that can respond to the latest situation can be always maintained. Therefore, if the present invention is applied to an information retrieval system or the like, necessary information can be easily retrieved in an interactive manner, and information to be stored can be automatically enriched by ordinary use.
And the database which can respond to the latest situation can always be maintained.

[Brief description of the drawings]

【図１】本発明の基本的な流れを示す図であるFIG. 1 is a diagram showing a basic flow of the present invention.

【図２】本発明の実施例１基本構成を示す図である。FIG. 2 is a diagram illustrating a basic configuration of a first embodiment of the present invention.

【図３】本発明の実施例１システム構成を示す図であ
る。FIG. 3 is a diagram illustrating a system configuration according to a first embodiment of the present invention.

【図４】本発明の装置構成の実施例２を示す図である。FIG. 4 is a diagram showing a second embodiment of the device configuration of the present invention.

【図５】本発明の格認識部および編集部の構成を示す図
である。FIG. 5 is a diagram illustrating a configuration of a case recognition unit and an editing unit according to the present invention.

【図６】本発明のデータ構成の例を示す図である。FIG. 6 is a diagram showing an example of a data configuration of the present invention.

【図７】本発明のデータ構成の例を示す図である。FIG. 7 is a diagram showing an example of a data configuration of the present invention.

【図８】本発明をマルチメディアデータによる場合の例
を示す図である。FIG. 8 is a diagram showing an example in which the present invention is based on multimedia data.

【図９】本発明の格認識のフローチャートを示す図であ
る。FIG. 9 is a diagram showing a flowchart of case recognition according to the present invention.

【図１０】本発明の主たる振る舞いを表す記号を付与す
るフローチャートを示す図である。FIG. 10 is a diagram showing a flowchart for assigning a symbol representing a main behavior of the present invention.

【図１１】本発明の構文言い回し記号の説明図である。FIG. 11 is an explanatory diagram of a syntax wording symbol of the present invention.

【図１２】本発明の構文言い回し記号により属性を付与
するプログラムのフローチャートを示す図である。FIG. 12 is a diagram showing a flowchart of a program for giving an attribute by a syntax wording symbol of the present invention.

【図１３】本発明の知識データのオブジェクトをまと
め，新しい知識データのオブジェクトを生成することお
よびオブジェクトの階層化を示す説明図である。FIG. 13 is an explanatory diagram showing the generation of new knowledge data objects by grouping the objects of knowledge data according to the present invention, and the layering of the objects.

【図１４】本発明の知識データのオブジェクトをまと
め，新しい知識データのオブジェクトを生成するための
フローチャートを示す図てある。FIG. 14 is a diagram showing a flowchart for collecting knowledge data objects and generating new knowledge data objects according to the present invention.

【図１５】本発明の知識データのオブジェクトのインデ
ックス作成のフローチャートを示す図である。FIG. 15 is a diagram showing a flowchart of index creation of an object of knowledge data according to the present invention.

【図１６】本発明の知識データのオブジェクトのインデ
ックス作成のフローチャートを示す図である。FIG. 16 is a diagram showing a flowchart of index creation of an object of knowledge data according to the present invention.

【図１７】本発明の検索処理のフローチャートを示す図
である。FIG. 17 is a diagram showing a flowchart of a search process of the present invention.

【図１８】本発明の実施例３の基本構成を示す図であ
る。FIG. 18 is a diagram illustrating a basic configuration of a third embodiment of the present invention.

【図１９】本発明の実施例３の基本構成の動作説明図で
ある。FIG. 19 is an explanatory diagram of the operation of the basic configuration according to the third embodiment of the present invention.

【図２０】本発明の実施例３の再度入力要求部のフロー
チャートである。FIG. 20 is a flowchart of the input requesting unit according to the third embodiment of the present invention.

【図２１】本発明の実施例４を示す図である。FIG. 21 is a diagram showing a fourth embodiment of the present invention.

【図２２】本発明の実施例５を示す図である。FIG. 22 is a diagram showing a fifth embodiment of the present invention.

【図２３】本発明の実施例６を示す図である。FIG. 23 is a diagram showing a sixth embodiment of the present invention.

【図２４】本発明のシステム構成を示す図である。FIG. 24 is a diagram showing a system configuration of the present invention.

【図２５】本発明の分解素のデータ設定方法の例を示す
図である。FIG. 25 is a diagram showing an example of a method for setting data of a decomposition element according to the present invention.

【図２６】本発明の実施例のフローチャートを示す図で
あるFIG. 26 is a diagram showing a flowchart of the embodiment of the present invention.

【図２７】本発明の実施例７基本構成を示す図である。FIG. 27 is a diagram illustrating a basic configuration of a seventh embodiment of the present invention.

【図２８】本発明の実施例の構成を示す図である。FIG. 28 is a diagram showing a configuration of an example of the present invention.

【図２９】本発明の知識処理部の実施例を示す図であ
る。FIG. 29 is a diagram showing an embodiment of a knowledge processing unit of the present invention.

【図３０】本発明のシステム構成実施例を示す図であ
る。FIG. 30 is a diagram showing a system configuration example of the present invention.

【図３１】本発明の基本構成の動作説明図である。FIG. 31 is an explanatory diagram of the operation of the basic configuration of the present invention.

【図３２】入力データ処理部のフローチャートを示す図
である。FIG. 32 is a diagram showing a flowchart of an input data processing unit.

【図３３】出力データ処理部のフローチャートを示す図
である。FIG. 33 is a view illustrating a flowchart of the output data processing unit.

【図３４】言葉タイプの分類フローチャートを示す図で
ある。FIG. 34 is a diagram showing a word type classification flowchart.

【図３５】言葉タイプの相互変換フローチャートを示す
図である。FIG. 35 is a diagram showing a flowchart for mutually converting word types.

【図３６】自分のパターンと他人のパターンとの間の翻
訳フローチャートを示す図である。FIG. 36 is a diagram showing a translation flowchart between a user's own pattern and another person's pattern.

【図３７】知識ドメインの自動設定フローチャートを示
す図である。FIG. 37 is a diagram showing a flowchart for automatically setting a knowledge domain.

【図３８】知識データの参照と有知識者の知識獲得方法
のフローチャートである。FIG. 38 is a flowchart of a method of referring to knowledge data and acquiring knowledge of a knowledgeable person.

Claims

[Claims]

1. A database creation method for inputting information and saving the information as data in a database, a recording medium holding a program for creating a database for inputting information and saving the information as data in a database, and inputting information. In a recording medium in which a data group processed by processing the information is recorded A: Information input means B: Input information processing means C: Decomposition element decomposition means for decomposing data obtained by the information processing means into decomposition elements D: A labeling means for performing a labeling process of the decomposition element; E: a labeling dialogue means for performing a labeling dialogue process; F: a registration means for registering the labeled data in an information holding device; A storage medium storing a database creation program and A recording medium on which a data group characterized by being formed is recorded.

2. The information input means includes a discriminating means for discriminating whether the form of information is digital or analog, and when the information is analog, the information is converted into digital by the input information processing means. 2. A method for creating a database according to claim 1, a recording medium storing a database creation program, and a recording medium recording a data group created by the steps.

3. The database creation method according to claim 1, wherein said information input means includes a discrimination means for discriminating a type of information (media type).
A recording medium storing a database creation program and a recording medium recording a data group created by the steps.

4. The information type is at least one information selected from characters, voices, sounds, images (still images, moving images), vectors and cases, and specifies an input device corresponding to each type of information. A method for creating a database according to claims 1 and 2, a recording medium storing a database creation program, and a recording recording a data group created by the steps. Medium.

5. The information processing means converts information which is not data which can be processed by the decomposing means for decomposing into decomposable data into data which can be processed and sends it to the decomposing means for decomposing means. The method for creating a database according to claim 1, wherein the data is sent as it is to the decomposing means, and the recording medium storing the database creating program and the data created by the steps are used. A recording medium having recorded thereon a data group.

6. The database creation method according to claim 1, wherein said decomposing element decomposing means decomposes into a minimum decomposable unit or a unit that can be labeled with a set of said minimum units. A recording medium storing a database creation program and a recording medium recording a data group created by the steps.

7. A method according to claim 1, wherein said decomposing means has reference means for referring to the decomposed elements registered in said database. A recording medium and a recording medium on which a data group created by the step is recorded.

8. A method according to claim 1, wherein said decomposing means has means for interacting with a user.
Item 8, a method for creating a database according to Item 6 and Item 7, a recording medium storing a database creation program, and a recording medium recording a data group created by the steps.

9. The decomposing element dialogue means: A: The decomposing element is compared with the format to determine whether or not the decomposing element is appropriately decomposed. If it is not appropriate, the output means indicates to the user that it is not appropriate. B: Ask the user whether the decomposition is correct. C: If it is correct, use the decomposition as it is. D: If it is not a correct decomposition, output it. 9. A method for creating a database according to claim 8, further comprising: a decomposition element dialogue means comprising the steps of: (a) prompting a user to input a correct decomposition element to the user and making the inputted object a correct decomposition element. , A recording medium storing a database creation program, and a recording medium recording a data group created by the steps.

10. The labeling means obtains a label for the obtained decomposed element by referring to an already registered database, and assigns a label. When a label to be assigned is not found, the fact is output to an output means. The method for creating a database according to claim 1, wherein the user is prompted to input a correct label and the input object is labeled, a recording medium storing a database creation program, and a program created by the steps. A recording medium having recorded thereon a data group.

11. The labeling dialogue means includes: A: a means for determining whether a label is appropriately labeled with respect to a label given to the obtained decomposed element, and output if the label is not appropriately labeled. Means is output in the means. B: Ask the user whether the labeling is correct. C: If it is correct, the label is used as is. D: If the labeling is not appropriate, the fact is output to the output means to prompt the user to input a correct label. Let the input thing be a correct new label E: If the label in the reference database and the label input by the user are different, let X be the newly input label and Y be the already registered label, The data having a configuration in which X is added to the common label is displayed on the output means, and the user is asked for correctness. The data to which the data X has been added is additionally registered to the data already registered in the database, and if the data to which the label Y has been added is not correct, the part already registered in the database is kept as it is. The labeling comprising the above steps 11. The database creation method according to claim 10, further comprising a dialogue means, a recording medium storing a database creation program, and a recording medium recording a data group created by said step.

12. The labeling means refers to a database registered for the decomposed element and, if there is no appropriate label, outputs the fact to the output means to the user and prompts the user to input a label. 12. The method for creating a database according to claim 1, wherein the object is a label, a recording medium storing a database creation program, and the method created by the steps. A recording medium on which a data group to be recorded is recorded.

13. The method according to claim 1, wherein the minimum decomposed element is a character when the information is a language.
2. A method for creating a database according to claim 1, a recording medium storing a database creation program, and a recording medium recording a data group created by the steps.

14. A syntax composed of a plurality of decomposers is defined as 1
Two types of parsers, compare the types of attributes included in their own parsers with the types of attributes included in other parsers, and find other parsers that have all the types of their own parser. The method for creating a database and the database creation program according to claim 13, characterized in that an attribute which is not present in the parsing element which is not present is displayed on the output means, and the user is asked whether the label is assigned correctly or not. A recording medium storing the stored recording medium and a data group created by the steps.

15. Attributes include particles, auxiliary verbs, conjunctions, adverbs,
15. The method for creating a database according to claim 14, wherein the phrase is a phrase including a part of speech such as a punctuation mark and a symbol such as a space, a recording medium storing a database creation program, and a fact that the database is created by the steps. A recording medium on which a characteristic data group is recorded.

16. The wording symbols include who, when, where,
If the question contains a question word such as why and how, and if any of the decomposed parts of the input data is missing, the user displays the missing question word on the output means and inputs the attribute. A method for creating a database according to claim 15, a recording medium storing a database creation program, and a recording medium recording a data group created by said step.

17. A system according to claim 17, further comprising a similar word dictionary, wherein when the data of the decomposed element is incomplete, the incomplete data of the decomposed element is converted into correct data according to the result of the user's judgment. 14. A method for creating a database according to claim 13, a recording medium storing a database creation program, and a recording medium recording a data group created by the steps.

18. An interactive means for editing a decomposition element or data necessary for labeling and displaying the result on an output means, and a user selecting or newly inputting the data based on the data indicated on the output means. Claims 1, 8, characterized in that the result is a formal decomposition or label
Item 13. A database creation method according to Item 9, Item 11, and Item 12, a recording medium storing a database creation program, and a recording medium recording a data group created by the steps.

19. A method according to claim 1, wherein the data holding means for holding data in the database is an object database means, and the data is object data. , A recording medium storing a database creation program, and a recording medium recording a data group created by the above steps

20. When there are a plurality of common labels among objects, only a common part X is extracted to generate a new object and an ID is assigned to the object. 10. The database creation method according to claim 19, a recording medium storing a database creation program, and a recording medium recording a data group created by the steps.

21. The database creation method according to claim 1, wherein said data registration means classifies the data according to the field and registers the data according to the field. And a recording medium recording a data group created by the steps.

22. The database creation method according to claim 1, wherein said registration means has a function of classifying the data according to a field and registering the data according to a field, and further comprising a data processing means registered in said database. , A recording medium storing a database creation program, and a recording medium recording a data group created by the steps.

23. The data processing means of the database refers to the data holding means to change the contents of the registered data referred to in the labeling and labeling processing of the data by the interactive processing with the user, and in the labeling process. 23. Creation of a database according to claim 22, wherein a field of data is obtained, data of the determined field is referred to, labeling is performed on the data, and contents of registered data are updated at the same time. A method, a recording medium storing a database creation program, and a recording medium recording a data group created by the steps.

24. The method according to claim 1, wherein the user first refers to his / her own database holding his / her own data, and refers to another person's data / base when labeling cannot be performed by his / her own database. 24. A method for creating a database according to items 23 to 23, a recording medium storing a database creation program, and a recording medium recording a data group created by the steps.

25. A database according to claim 24, wherein if labeling is not possible even by referring to another person's database, a list of intellectuals having knowledge enabling labeling is obtained and displayed. , A recording medium storing a database creation program, and a recording medium recording a data group created by the steps.

26. The method according to claim 13, wherein said processing means recognizes the input word type and converts the input word type into a different word type. A recording medium storing a database creation program and a recording medium recording a data group created by the steps.

27. A difference between a pattern of own input information and a pattern of another person's database is identified, and the pattern of own input information is replaced with a pattern of another person's database.
The method for creating a database according to claims 21, 24 and 25, wherein the database created by said step is created by searching a database of another person. A recording medium on which a characteristic data group is recorded.

28. To determine in which field the input information is similar, calculate the similarity with each field, use the field with the closest similarity, 26. A database creation method according to claim 21, wherein the field is determined based on a directory name or a file name used immediately before the threshold value is not reached. A recording medium storing a creation program and a recording medium recording a data group created by the steps.