JPH0793364A

JPH0793364A - Sentence form converting device

Info

Publication number: JPH0793364A
Application number: JP5238502A
Authority: JP
Inventors: Toshiaki Makino; 俊朗牧野; Toshiyuki Iida; 敏幸飯田; Masakatsu Ota; 昌克太田; Shigeo Shimada; 茂夫島田
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1993-09-24
Filing date: 1993-09-24
Publication date: 1995-04-07

Abstract

PURPOSE:To provide the sentence form converting device which converts an input sentence in a natural language accompanied with omissions of words into forms that various information processing systems can handle. CONSTITUTION:This device is provided with a word dividing process part 1 which divides the input sentence into words, a request kind decision part 2 which decides the types of the indication and question that the input sentence shows, a concept dictionary 4 wherein information showing what system's concept the words belong to is described, an unnecessary word dictionary 5 wherein unnecessary words such as particles and auxiliary verbs which are required to constitute a sentence, but not required by a DBMS 12 are described, a concept decision part 3 which judges what information each word in the input sentence means on the basis of the concept dictionary 4 and removes unnecessary words in the input sentences on the basis of the unnecessary word dictionary 5, a complementing process part 7 which complements omitted words in the input sentence according to a rule corresponding to the type of the input sentence, and a format conversion part and a main control part 11 which convert the input sentence into the representation form that the DBMS 12 can interprets on the basis of the process results of the concept decision part 3 and complementing process part 7.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用文野】この発明は、知的マンマシンイン
タフェースに係り、種々の情報処理システムに与える指
示や質問を自然言語で行うことを可能とする文形式変換
装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an intelligent man-machine interface, and relates to a sentence format conversion device capable of issuing instructions and questions to various information processing systems in natural language.

【従来の技術】従来の情報処理システムにおいて、コン
ピュータに対する指示や質問は、各システム毎に定めら
れた言語を用いて行われていた。例えば、情報処理シス
テムの１つであるデータベースシステムにおいては、リ
レーショナルデータベース操作言語であるＳＱＬ（構造
化検索言語）を用いて、 select名称from自動車where車体重量＞２０００のような命令をシステムに与えることによりデータベー
ス中の『自動車』テーブルから、項目『車体重量』の値
が２０００より大きいものを検索する。2. Description of the Related Art In a conventional information processing system, instructions and inquiries to a computer are made using a language defined for each system. For example, in a database system which is one of information processing systems, a command such as select name from automobile where body weight> 2000 is given to the system by using SQL (structured search language) which is a relational database operation language. Then, the "automobile" table in the database is searched for an item having a value of the item "body weight" larger than 2000.

【０００２】また、マンマシンインタフェースを備えて
いるものでは、メニューから必要な項目を選択したり、
あらかじめ与えられたテンプレートに従って文を入力し
たりすることによって、コンピュータに対する指示や質
問を行っていた。In the case of a man-machine interface, necessary items can be selected from a menu,
By inputting a sentence in accordance with a template given in advance, a computer was instructed or asked a question.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、従来の
情報処理システムにおいては、システム毎に入力の方法
が異なるために、利用者は各システム毎の入力フォーマ
ットに習熟する必要がある。したがって、今後の情報社
会において利用者が使用するシステムの種類が多くなっ
た場合には、利用者の負担が大きくなる。また、マンマ
シンインタフェースの設計者は、各システム毎にその利
用者が有している知識を想定してマンマシンインタフェ
ースの設計を行わなければならず、利用者のモデルを決
定するのに多大の労力が必要となる。これらの問題点
は、普通の人が誰でも使うことができる自然言語文をあ
らゆる情報処理システムに対する入力言語として使用す
ることができれば解決可能である。However, in the conventional information processing system, since the input method is different for each system, the user needs to be familiar with the input format for each system. Therefore, when the number of types of systems used by users in the future information society increases, the burden on the users will increase. In addition, the man-machine interface designer must design the man-machine interface by assuming the knowledge that the user has for each system, and it is very difficult to decide the model of the user. Labor is required. These problems can be solved if a natural language sentence that can be used by any ordinary person can be used as an input language for any information processing system.

【０００４】ところが、人が日常使っている自然言語文
は、人間が有する文法等の知識を前提としているので、
しばしば語の省略が行われている。このため、自然言語
文を情報処理システムの入力言語とした場合、語の省略
による情報の欠落によって処理が不可能となる場合があ
る。However, since the natural language sentence that a person uses everyday is premised on the knowledge of the grammar etc. that a person has,
Words are often omitted. Therefore, when the natural language sentence is used as the input language of the information processing system, the processing may be impossible due to lack of information due to omission of the word.

【０００５】この発明は、このような背景の下になされ
たもので、語の省略が伴う自然言語による入力文を各種
情報処理システムが処理可能な形式に変換することがで
きる文形式変換装置を提供することを目的としている。The present invention has been made under such a background, and a sentence format conversion device capable of converting an input sentence in natural language accompanied by omission of words into a format which can be processed by various information processing systems. It is intended to be provided.

【０００６】[0006]

【課題を解決するための手段】この発明は、上述した課
題を解決するために、自然言語で表現された指示文、質
問文を情報処理システムが解釈可能な形式に変換する文
形式変換装置であって、入力文を単語に分割する単語分
け処理部と、前記入力文中の各語のうち指示や質問のタ
イプを表すキーワードに基づき、該タイプを判定する要
求種別判定部と、語がそれぞれいかなる系統の概念に属
するかを表す情報を記述した概念辞書と、文を構成する
ために必要であるが前記情報処理システムが必要とする
情報を含まない助詞、助動詞などの不要語を記述した不
要語辞書と、前記入力文中の各語がいかなる情報を意味
しているかを前記概念辞書に基づいて判断すると共に、
該入力文中の不要語を前記不要語辞書に基づいて除去す
る概念判定部と、前記入力文において省略された語を前
記入力文のタイプに応じた所定の補完規則に従って補う
補完処理部と、前記概念判定部によって得られる情報と
前記補完処理部によって得られる情報とに基づき、前記
入力文を前記情報処理システムが解釈可能な表現形式に
変換する変換手段とを具備することを特徴としている。In order to solve the above-mentioned problems, the present invention is a sentence format conversion device for converting a directive sentence and a question sentence expressed in natural language into a format interpretable by an information processing system. There is a word division processing unit that divides the input sentence into words, a request type determination unit that determines the type of each word in the input sentence based on a keyword that indicates the type of the instruction or question, and A concept dictionary that describes information indicating whether it belongs to the concept of system, and unnecessary words that describe unnecessary words such as particles and auxiliary verbs that are necessary for constructing a sentence but do not include the information required by the information processing system. A dictionary and a judgment based on the concept dictionary as to what information each word in the input sentence means,
A concept determining unit that removes unnecessary words in the input sentence based on the unnecessary word dictionary; a complementing processing unit that complements words omitted in the input sentence according to a predetermined complementing rule according to the type of the input sentence; It is characterized by comprising a conversion means for converting the input sentence into an expression format interpretable by the information processing system on the basis of the information obtained by the concept determination section and the information obtained by the complementation processing section.

【０００７】[0007]

【作用】この発明によれば、単語分け処理部が、自然言
語による入力文を形態素解析手法等によって単語に分割
し、要求種別判定部が、入力文中の各語のうち指示や質
問のタイプを表すキーワードに基づいて該タイプを判定
し、概念判定部が、入力文中の各語がいかなる情報を意
味しているか（例えば、『自動車』は対象を表す対象概
念語であり、『重い』は量の程度を表す比較概念語であ
るといった情報）を概念辞書に基づいて判断すると共
に、該入力文中の不要語を不要語辞書に基づいて除去す
る。そして、補完処理部が、入力文において省略された
語を前記入力文のタイプに応じた所定の補完規則に従っ
て補い、変換手段が、概念判定部によって得られる情報
と補完処理部によって得られる情報とに基づき、入力文
を情報処理システムが解釈可能な表現形式に変換する。
これにより、情報処理システムの利用者は、コンピュー
タに対する指示や質問を自然言語によって行うことがで
きる。According to the present invention, the word division processing unit divides the input sentence in natural language into words by the morphological analysis method and the like, and the request type determination unit determines the type of instruction or question in each word in the input sentence. The type is judged based on the keyword to be expressed, and the concept judgment unit determines what information each word in the input sentence means (for example, “car” is a target concept word representing a target, “heavy” is a quantity). Information that is a comparative conceptual word indicating the degree of) is determined based on the conceptual dictionary, and unnecessary words in the input sentence are removed based on the unnecessary word dictionary. Then, the complement processing unit complements the omitted word in the input sentence according to a predetermined complement rule according to the type of the input sentence, and the conversion unit combines the information obtained by the concept determination unit and the information obtained by the complement processing unit. Based on, the input sentence is converted into an expression format that can be interpreted by the information processing system.
This allows the user of the information processing system to give instructions and questions to the computer in natural language.

【０００８】[0008]

【実施例】以下、図面を参照して、この発明の実施例に
ついて説明する。Ａ：全体構成図１は、この発明の一実施例による文形式変換装置１０
０を入力インタフェースとして有するデータベースシス
テムの構成を示すブロック図である。Embodiments of the present invention will be described below with reference to the drawings. A: Overall Configuration FIG. 1 shows a sentence format conversion device 10 according to an embodiment of the present invention.
It is a block diagram which shows the structure of the database system which has 0 as an input interface.

【０００９】図１において、１は入力装置９から入力さ
れた自然言語文を形態素解析手法によって単語に分割す
る単語分け処理部である。２は入力文中の語とあらかじ
め登録されたキーワードとのマッチングを取ることによ
って、入力文によって表される指示や質問のタイプを判
定する要求種別判定部である。In FIG. 1, reference numeral 1 is a word division processing section for dividing a natural language sentence input from the input device 9 into words by a morphological analysis method. Reference numeral 2 is a request type determination unit that determines the type of the instruction or question represented by the input sentence by matching a word in the input sentence with a keyword registered in advance.

【００１０】次に、３は概念判定部である。この概念判
定部３は、概念辞書４を参照し、単語分け処理部１によ
って単語分けされた各語がどの概念に属するかを判断す
ると共に、不要語辞書５を参照し、特にシステム（後述
する情報処理システム）が処理を行なうために必要な情
報を含んでいない語を入力文から除去する（すなわち、
情報処理システムにおいて入力文の解釈等の処理を行う
上で不要な助詞、助動詞などの語を入力文から取り除
く）。Next, 3 is a concept determination unit. The concept determination unit 3 refers to the concept dictionary 4 to determine which concept each word word-divided by the word-division processing unit 1 belongs to, and also refers to the unnecessary word dictionary 5, particularly to a system (described later). A word that does not contain information necessary for the information processing system to perform processing is removed from the input sentence (that is,
The words such as particles and auxiliary verbs that are unnecessary for performing processing such as interpretation of the input sentence in the information processing system are removed from the input sentence).

【００１１】ここで、概念辞書４とは、単語がどのよう
な概念に属するかを表す情報が記述されたテーブルであ
って、図示しないメモリ等に記憶されている。例えば図
２に示すように、単語の概念の属性を表す概念種（図
中、「対象概念」、「量属性」、……）毎に、その概念
名（図中、「自動車」、「山」、「重量」、……）と、
幾つかの概念語（図中、「自動車」、「車」、「カ
ー」、……、「山」、「山岳」、……、「重さ」、「重
量」、……）が対応づけて記述されている。この概念辞
書４は、文形式変換装置１００に接続すべき情報処理シ
ステム（本実施例ではデータベースシステム）が定めら
れると、形式変換を行うべき入力文の範囲が限定される
ことから、該範囲内であらかじめ人手によって作成され
る。Here, the concept dictionary 4 is a table in which information indicating which concept a word belongs to is described, and is stored in a memory or the like (not shown). For example, as shown in FIG. 2, for each concept type (in the figure, “target concept”, “quantity attribute”, ...) Representing the attribute of the concept of a word, the concept name (“car”, “mountain” in the figure) ”,“ Weight ”, ……)
Corresponding to some conceptual words ("automobile", "car", "car", ..., "mountain", "mountain", ..., "weight", "weight", ...) in the figure Is described. This concept dictionary 4 is within the range when the information processing system (database system in this embodiment) to be connected to the sentence format conversion device 100 is defined, because the range of the input sentence for which the format conversion is performed is limited. It is created by hand in advance.

【００１２】また、不要語辞書５は、入力文中には含ま
れるが情報処理システムが処理を行なうために必要な情
報を含んでない助詞、助動詞などの語のリストが記述さ
れたテーブルであって、上記概念辞書４と同様、メモリ
等に記憶されている。The unnecessary word dictionary 5 is a table in which a list of words such as particles and auxiliary verbs that are included in the input sentence but do not include information necessary for the information processing system to perform processing is described. Like the concept dictionary 4, it is stored in a memory or the like.

【００１３】次に、６は形式変換部である。この形式変
換部６は、概念判定部３の判定結果に基づき、概念種毎
に各語あるいは各語の属する概念をあらかじめ定められ
た形式のテーブルにセットする。例えば、図３に示すよ
うに、概念種毎に各語あるいは各語の属する概念が格納
される記憶領域（以下、スロットという）を有するＣＳ
Ｔ（concept set）テーブルにセットする。また、入力
文が「または」、「かつ」等を意味する接続語によって
複数の語列単位からなる場合には、この語列単位毎に複
数のｃｓｔテーブルを作成し、各ｃｓｔテーブルの接続
関係を上記接続語および各ＣＳＴテーブルの識別子ＩＤ
によって表すｒｅｌ（relation）テーブルを作成する。
ここで、ｃｓｔテーブルの各項は下記（１）〜（１０）
に示される内容を表している。（１）ＩＤ：ｃｓｔテーブルの識別子（２）対象概念：判断対象を表す概念（３）量属性：量の尺度に関する属性（４）比較概念：量の値の程度、値の変化の傾向を表現
する概念（５）修飾概念：比較概念を修飾する概念（６）時間概念：時間を表す概念（７）時間修飾概念：時制概念を修飾する概念（８）比較対象：比較の対象となるもの（９）比較値：比較の基準となる値（１０）比較種類：比較の種類Next, 6 is a format converter. The format conversion unit 6 sets each word or the concept to which each word belongs in a table of a predetermined format for each concept type based on the determination result of the concept determination unit 3. For example, as shown in FIG. 3, a CS having a storage area (hereinafter, referred to as a slot) in which each word or a concept to which each word belongs is stored for each concept type.
Set in the T (concept set) table. Also, when the input sentence is composed of a plurality of word string units by connecting words that mean “or”, “and”, etc., a plurality of cst tables are created for each word string unit, and the connection relation of each cst table is created. Is the above connection word and the identifier ID of each CST table
Create a rel (relation) table represented by.
Here, each item of the cst table is the following (1) to (10).
Represents the content shown in. (1) ID: identifier of cst table (2) Target concept: concept representing a judgment target (3) Quantity attribute: attribute related to quantity scale (4) Comparison concept: expressing degree of quantity value, tendency of value change (5) Modification concept: Concept that modifies the comparison concept (6) Time concept: Concept that represents time (7) Time modification concept: Concept that modifies the tense concept (8) Comparison target: What is the target of comparison ( 9) Comparison value: Value that serves as a reference for comparison (10) Kind of comparison: Kind of comparison

【００１４】次に、７は補完処理部である。この補完処
理部７は、形式変換部６によって作成されたデータにつ
いて、必要に応じて入力文中で省略された情報の補完を
行なう。この補完処理は、あらかじめ補完処理部７に記
憶保持されたルールに基づいて行われる。Next, 7 is a complement processing section. The complementation processing unit 7 complements the data created by the format conversion unit 6 with information omitted in the input sentence as necessary. This complementing process is performed based on the rule stored and held in the complementing processing unit 7 in advance.

【００１５】図４は補完処理で適用されるルールの一例
を示す図である。同図において、まずルール１は補完処
理の終了条件を規定するものである。この例では、要求
種別がwhatタイプのときに各ｃｓｔテーブルにおける対
象概念、量属性および比較概念のスロットにデータが格
納されている場合に、補完処理を終了することが規定さ
れている。FIG. 4 is a diagram showing an example of rules applied in the complementary processing. In the figure, first, rule 1 defines the termination condition of the complementary processing. In this example, when the request type is what type, it is stipulated that the complementing process is ended when data is stored in the slots of the target concept, the quantity attribute, and the comparison concept in each cst table.

【００１６】また、ルール２は補完処理の処理条件を規
定するものである。この例では、識別子ＩＤが「Ｘ」で
あるｃｓｔテーブルの対象概念スロットにデータが格納
されておらず、かつ、識別子ＩＤが「Ｘ＋１」であるｃ
ｓｔテーブルの対象概念スロットにデータが格納されて
いる場合に、識別子ＩＤが「Ｘ＋１」であるｃｓｔテー
ブルの対象概念スロットのデータを識別子ＩＤが「Ｘ」
であるｃｓｔテーブルの対象概念スロットにセットする
ことが規定されている。Rule 2 defines the processing conditions for the complementary processing. In this example, data is not stored in the target concept slot of the cst table whose identifier ID is "X", and c whose identifier ID is "X + 1"
When data is stored in the target concept slot of the st table, the identifier ID of the data of the target concept slot of the cst table whose identifier ID is “X + 1” is “X”.
Is set in the target concept slot of the cst table.

【００１７】すなわち、ルール１は、whatタイプ、whic
hタイプ、yes/noタイプ等の質問文タイプによってシス
テムが処理を行うために必要とする情報が異なることか
ら、あらかじめ質問文タイプ毎に人手によって作成され
る。また、ルール２は、省略された概念種に応じて適用
されるものであり、これもあらかじめ人手によって作成
される。なお、このルール２は、上記質問文タイプに依
存しないことから、各質問文タイプについて共通に適用
される。That is, the rule 1 is what type, whic
Since the information required for the system to perform processing differs depending on the question sentence type such as h type or yes / no type, it is manually created in advance for each question sentence type. Rule 2 is applied according to the omitted concept type, and this is also created manually in advance. Since this rule 2 does not depend on the above-mentioned question sentence type, it is commonly applied to each question sentence type.

【００１８】ここで、図５は質問文タイプ毎に必要とさ
れる情報と各質問文タイプの例文とを示している。すな
わち、例えばwhichタイプやyes/noタイプのように、wha
tタイプで必要となる情報（図中、必須のスロット）の
他に、比較対象または比較値の情報が必要となる質問文
タイプも存在する。FIG. 5 shows information required for each question sentence type and example sentences of each question sentence type. That is, wha, such as which type or yes / no type
In addition to the information required for the t type (required slot in the figure), there is also a question sentence type for which information on the comparison target or comparison value is required.

【００１９】さて、再び図１に戻って実施例の構成を説
明する。同図において、８はシステムインタフェース部
であり、要求種別判定部２によって与えられる要求種別
と、形式変換部６および補完処理部７によって作成され
た形式のデータを情報処理システムへ伝達する。以上の
各部１〜８によって文形式変換装置１００が構成され
る。Now, returning to FIG. 1, the structure of the embodiment will be described. In the figure, reference numeral 8 denotes a system interface unit, which transmits the request type given by the request type determination unit 2 and the data in the format created by the format conversion unit 6 and the complement processing unit 7 to the information processing system. The sentence format conversion device 100 is configured by the above units 1 to 8.

【００２０】また、入力装置９は、情報処理システムを
操作するために利用者が入力する自然言語文を取り込
み、これを単語分け処理部１へ供給する。出力装置１０
は、利用者が必要とする情報や処理結果等を出力する。The input device 9 also takes in a natural language sentence input by the user to operate the information processing system and supplies it to the word division processing section 1. Output device 10
Outputs the information required by the user, the processing result, and the like.

【００２１】さらに、１１はＣＰＵやメモリ等で構成さ
れた主制御部であり、上述した文形式変換装置１００を
含むシステム全体を制御する。また、１２はＤＢＭＳ
（データベース管理システム）であり、主制御部１１の
指示に応じて、種々のデータが格納されるＤＢ（データ
ベース）１３を操作する。以上が本実施例の全体構成で
ある。Further, 11 is a main control unit composed of a CPU, a memory, etc., and controls the entire system including the sentence format conversion apparatus 100 described above. 12 is a DBMS
It is a (database management system), and operates a DB (database) 13 in which various data are stored according to an instruction from the main control unit 11. The above is the overall configuration of the present embodiment.

【００２２】Ｂ：文形式変換装置１００の動作次に、図６に示すフローチャートを参照し、文形式変換
装置１００の動作を説明する。まず、単語分け処理部１
は、形態素解析手法により入力文を単語に分割する（ス
テップＳ１０１）。次に、要求種別判定部２は、単語分
け処理部１から受け取った単語列と内部に保持している
キーワード辞書とのマッチングを行って入力文の要求種
別を判定し、該判定結果を補完処理部７とシステムイン
タフェース部８へ供給する（ステップＳ１０２）。B: Operation of Sentence Format Converter 100 Next, the operation of the sentence format converter 100 will be described with reference to the flowchart shown in FIG. First, the word division processing unit 1
Divides the input sentence into words by the morphological analysis method (step S101). Next, the request type determination unit 2 matches the word string received from the word division processing unit 1 with the keyword dictionary held therein to determine the request type of the input sentence, and complements the determination result. It is supplied to the unit 7 and the system interface unit 8 (step S102).

【００２３】そして、概念判定部３は、単語分け処理部
１から受けとった単語列の各単語について、概念辞書４
を用いて概念種および概念名の対応付けを行うと共に、
不要語辞書５を参照して、助詞、助動詞などの不要語を
入力文中から除去する（ステップＳ１０３）。Then, the concept determination unit 3 determines the concept dictionary 4 for each word of the word string received from the word division processing unit 1.
And associate the concept type and concept name using
By referring to the unnecessary word dictionary 5, unnecessary words such as particles and auxiliary verbs are removed from the input sentence (step S103).

【００２４】次に、形式変換部６は、上記概念判定部３
の処理結果に基づき、図３に示した形式に従って、ｒｅ
ｌテーブルとｃｓｔテーブルとを作成する（ステップＳ
１０４）。そして、補完処理部７は、形式変換部６が作
成したｃｓｔテーブルについて補完処理が必要か否かを
判定する（ステップＳ１０５）。Next, the format conversion section 6 includes the concept determination section 3 described above.
Based on the processing result of
l table and cst table are created (step S
104). Then, the complementary processing unit 7 determines whether the complementary processing is necessary for the cst table created by the format conversion unit 6 (step S105).

【００２５】ここで、補完処理が必要でない場合、補完
処理部７は、ｃｓｔテーブルが完成しているものとみな
し、該ｃｓｔテーブルとｒｅｌテーブルとを形式変換部
６へ供給する。そして、形式変換部６は、この完成した
ｃｓｔテーブルとｒｅｌテーブルとをシステムインタフ
ェース部８へ供給し、該インタフェース部８は、これら
のテーブルと要求種別判定部２から受けとった要求種別
とを合わせて、主制御部１１へ供給する（ステップＳ１
０９）。Here, when the complementary processing is not necessary, the complementary processing unit 7 regards the cst table as completed and supplies the cst table and the rel table to the format conversion unit 6. Then, the format conversion unit 6 supplies the completed cst table and rel table to the system interface unit 8, and the interface unit 8 combines these tables and the request type received from the request type determination unit 2 together. , To the main controller 11 (step S1
09).

【００２６】一方、上記ステップＳ１０５において、補
完処理が必要な場合、補完処理部７は、要求種別判定部
２から受けとった要求種別と形式変換部６から受けとっ
たｃｓｔテーブルの状態に基づき、内部に保持するルー
ル２（図４参照）のうち適応可能なルール２が存在する
か否かを判断する（ステップＳ１０６）。On the other hand, in step S105, when the complementing process is necessary, the complementing processing unit 7 internally determines based on the request type received from the request type determining unit 2 and the state of the cst table received from the format converting unit 6. It is determined whether or not there is an applicable rule 2 among the held rules 2 (see FIG. 4) (step S106).

【００２７】ここで、適応可能なルール２が存在する場
合、補完処理部７は、該ルール２を適用して補完処理を
実行する（ステップＳ１０７）。そして、再び補完処理
が必要か否かの判定を行なう（ステップＳ１０５）。す
なわち、入力文が複数の接続語によって接続されてお
り、ｃｓｔテーブルが３つ以上存在する場合、ルール１
の条件を満たすまでルール２を繰り返し適用して複数回
補完処理を繰り返す。Here, if there is an applicable rule 2, the complementary processing section 7 applies the rule 2 to execute the complementary processing (step S107). Then, it is again determined whether or not the complementary processing is necessary (step S105). That is, if the input sentences are connected by a plurality of connecting words and there are three or more cst tables, the rule 1
Rule 2 is repeatedly applied until the condition of is satisfied, and the complementing process is repeated a plurality of times.

【００２８】一方、上記ステップＳ１０６において、入
力文が文法的に矛盾している等、適応可能なルール２が
存在せず、補完処理を行うことが出来ない場合には、ｃ
ｓｔテーブルを完成させることができないため、システ
ムインタフェース部８に入力文が理解不能である旨を伝
える。これにより、インタフェース部８は、主制御部１
１に入力文が理解不能であることを伝達する（ステップ
Ｓ１０８）。On the other hand, in step S106, if there is no applicable rule 2 such as the input sentence is grammatically inconsistent and the complementary processing cannot be performed, c
Since the st table cannot be completed, the system interface unit 8 is notified that the input sentence is incomprehensible. As a result, the interface unit 8 becomes the main control unit 1
The fact that the input sentence is incomprehensible is transmitted to 1 (step S108).

【００２９】Ｃ：入力文の具体例による全体動作次に、文形式変換装置１００を入力インタフェースとし
て有するデータベースシステム（図１参照）の動作を、
「重量が重くて価格が非常に安い車は何ですか」という
入力文が与えられた場合を例として説明する。C: Overall Operation by Specific Example of Input Sentence Next, the operation of the database system (see FIG. 1) having the sentence format conversion device 100 as an input interface will be described.
The case where the input sentence "What is a car that is heavy and is very cheap" is given as an example.

【００３０】まず、入力装置９より入力文が与えられる
と、単語分け処理部１は形態素解析手法により単語分け
を行なう。この例では、「重量」「が」「重く」「て」
「価格」「が」「非常に」「安い」「車」「は」「何」
「です」「か」となる。この場合、「重く」のように活
用する語に関しては、「重い」のように終止形に変形さ
れる。First, when an input sentence is given from the input device 9, the word division processing unit 1 divides words by a morphological analysis method. In this example, "weight""ga""heavy""te"
"Price""ga""very""cheap""car""ha""what"
It becomes "is""ka". In this case, a word to be utilized like "heavy" is transformed into an end form like "heavy".

【００３１】次に、要求種別判定部２は、内部に持つキ
ーワード辞書とこの語列の中の語とのマッチングを行
い、「なに」、「か」よりこの文が条件を満たすものの
検索を要求しているwhatタイプの文であることを判別す
る。そして、要求種別をwhatタイプとして補完処理部７
とシステムインタフェース部８へ伝達する。Next, the request type judgment unit 2 matches the keyword dictionary held internally with the words in this word string, and searches for "What" and "Ka" that satisfy this condition. Determine what type of statement you are requesting. Then, the complement processing unit 7 sets the request type to what type.
To the system interface unit 8.

【００３２】次に、概念判定部３は、図２に示した概念
辞書４を参照し、各語について概念種、概念名の対応付
けを行なう。この例では、「重量（量属性、重量）」
「重い（比較概念、重い）」「て（接続語、かつ）」
「価格（量属性、価格）」「非常に（修飾概念、ver
y）」「車（対象概念、自動車）」のように対応付けさ
れ、その他の助詞、助動詞等の語は不要語辞書５に従っ
て不要語として除去される。Next, the concept determination unit 3 refers to the concept dictionary 4 shown in FIG. 2 and associates each word with a concept type and a concept name. In this example, "weight (quantity attribute, weight)"
"Heavy (comparative concept, heavy)""Te (connection word, and)"
"Price (quantity attribute, price)""Very (modification concept, ver
y) ”“ car (target concept, automobile) ”, and other words such as particles and auxiliary verbs are removed as unnecessary words according to the unnecessary word dictionary 5.

【００３３】そして、形式変換部６は、概念判定部３に
よる処理結果に基づき、図３に示した内部のテーブルに
概念名を各スロットの値として埋めていく。このとき、
接続語の部分で語列を区切り、それぞれの語列単位を１
つのｃｓｔテーブルとし、これらｃｓｔテーブルの接続
関係をｒｅｌテーブルによって表す。Then, the format conversion unit 6 fills in the concept name as the value of each slot in the internal table shown in FIG. 3 based on the processing result by the concept determination unit 3. At this time,
Separate the word string at the connecting word part, and set each word string unit to 1
One cst table is used, and the connection relation of these cst tables is represented by the rel table.

【００３４】この例では、「て（接続語、かつ）」の前
の「重量（量属性、重量）」と「重い（比較概念、重
い）」とによって識別子ＩＤ＝１のｃｓｔテーブルが作
成され、「て（接続語、かつ）」の後ろの「価格（量属
性、価格）」、「非常に（修飾概念、very）」および
「車（対象概念、自動車）」によって識別子ＩＤ＝２の
ｃｓｔテーブルが作成される。In this example, the cst table with the identifier ID = 1 is created by "weight (quantity attribute, weight)" and "heavy (comparative concept, heavy)" before "te (connection word, and)". , "Te (connecting word, and)" followed by "price (quantity attribute, price)", "very (modification concept, very)" and "car (target concept, automobile)", the cst of identifier ID = 2 The table is created.

【００３５】次に、補完処理部７は、まず図４に示した
ルール１に従って補完処理が必要か否かを判断する。こ
の場合、識別子ＩＤ＝１のｃｓｔテーブルの対象概念ス
ロットが空欄になっており、補完処理の終了条件を満た
していないため、対象概念スロットの補完処理を行うた
めに適用可能なルール２（図４参照）を探す。そして、
このルール２に従って、図７に示すような補完処理を行
なう。この例では、識別子ＩＤ＝１のｃｓｔテーブルの
対象概念スロットを識別子ＩＤ＝２のｃｓｔテーブルの
対象概念スロットの値『自動車』によって補っている。
この結果、ルール１の終了条件が満たされ、補完処理が
完了する。Next, the complementary processing section 7 first determines whether or not the complementary processing is necessary according to the rule 1 shown in FIG. In this case, the target concept slot of the cst table with the identifier ID = 1 is blank and does not satisfy the completion condition of the complementing process, so that the rule 2 (FIG. Search). And
According to this rule 2, the complementary processing as shown in FIG. 7 is performed. In this example, the target concept slot of the cst table with the identifier ID = 1 is supplemented with the value “automobile” of the target concept slot of the cst table with the identifier ID = 2.
As a result, the termination condition of rule 1 is satisfied, and the complementing process is completed.

【００３６】補完処理が完了すると、システムインタフ
ェース部８は完成した各テーブルの情報を形式変換部６
から受けとり、要求種別判定部２より伝達された要求種
別と合わせて、図８に示す形式によるデータとして主制
御部１１へ渡す。そして、主制御部１１は、各ｃｓｔテ
ーブル毎に必要な検索条件の定量化やデータベーステー
ブル、項目の選定を行い、要求種別に基づき、データベ
ース制御文を作成し、ＤＢＭＳ１２に検索命令を発す
る。When the complementary processing is completed, the system interface section 8 converts the information of each completed table into the format conversion section 6
From the request type determining unit 2 and sends the data to the main control unit 11 as data in the format shown in FIG. Then, the main control unit 11 quantifies necessary search conditions for each cst table, selects a database table and items, creates a database control statement based on the request type, and issues a search command to the DBMS 12.

【００３７】すなわち、この例では、主制御部１１は識
別子ＩＤ＝１のｃｓｔテーブルの比較概念「重い」を定
量化し、重量＞２０００を得る。そして、要求種別がwh
atタイプであることから、条件に適合する自動車をデー
タベースから検索するための下記データベース制御文を
作成する。 select名称from自動車where重量＞２０００That is, in this example, the main control unit 11 quantifies the comparative concept "heavy" in the cst table with the identifier ID = 1, and obtains weight> 2000. And the request type is wh
Since it is an at type, create the following database control statement to search the database for cars that meet the conditions. select name from car where weight> 2000

【００３８】同様に、識別子ＩＤ＝２のｃｓｔテーブル
の情報に基づき、下記のデータベース制御文を作成す
る。 select名称from自動車where価格＜１４０Similarly, the following database control statement is created based on the information in the cst table with the identifier ID = 2. select name from automobile where price <140

【００３９】さらに、ｒｅｌテーブルの情報に従い、上
記２つのデータベース制御文より、下記のデータベース
制御文を作成する。 select名称from自動車where重量＞２０００and価格＜１
４０Further, the following database control statement is created from the above two database control statements according to the information in the rel table. select name from automobile where weight> 2000 and price <1
40

【００４０】次に、ＤＢＭＳ１２は、主制御部１１から
与えられたデータベース制御文に従って、ＤＢ１３を検
索し、該検索結果を主制御部１１に返す。主制御部１１
は、得られた検索結果を出力装置１０に渡す。これによ
り、出力装置１０は検索結果を出力して利用者に知らせ
る。Next, the DBMS 12 searches the DB 13 according to the database control statement given from the main control unit 11 and returns the search result to the main control unit 11. Main controller 11
Passes the obtained search result to the output device 10. As a result, the output device 10 outputs the search result and informs the user.

【００４１】このように、本実施例によれば、情報処理
システムに対して自然言語によって入力された指示文や
質問文は、まず文中に含まれるキーワードに基づいてそ
の要求種別が判定されるとともに、ｃｓｔテーブルおよ
びｒｅｌテーブルの形式で表現される（このとき、助
詞、助動詞等の不要語は除去される）。そして、要求種
別に応じて必要となる情報が省略されているか否かがル
ール１に従って判断され、必要な情報が省略されている
場合には、省略されている概念種に応じたルール２の適
用によって、ｃｓｔテーブルの補完処理が行われる。こ
うして完成されたｃｓｔテーブル、ｒｅｌテーブルおよ
び要求種別の情報は、各々の情報処理システムに適合し
た形式（例えば、ＳＱＬ形式）に変換される。これによ
り、情報処理システムの利用者は、該システムに対する
指示や質問を自然言語文によって行うことができる。As described above, according to the present embodiment, the request type of the directive sentence or question sentence input to the information processing system in natural language is first determined based on the keyword included in the sentence. , Cst table and rel table (in this case, unnecessary words such as particles and auxiliary verbs are removed). Then, it is determined according to the rule 1 whether necessary information is omitted according to the request type, and if the necessary information is omitted, the rule 2 is applied according to the omitted concept type. In this way, the cst table is complemented. The cst table, rel table, and request type information thus completed are converted into a format (for example, SQL format) suitable for each information processing system. This allows the user of the information processing system to give instructions and questions to the system using natural language sentences.

【００４２】なお、この発明は、既述した実施例の態様
に限定されない。例えば、文形式変換装置１００に接続
する情報処理システムはデータベースシステムに限るも
のではなく、そのシステムが動作するために必要な情報
を特定でき、図３に例示したように定まった形式を決定
できるものであれば、いかなるシステムにも適用可能で
ある。The present invention is not limited to the embodiment described above. For example, the information processing system connected to the sentence format conversion device 100 is not limited to the database system, but the information necessary for the system to operate can be specified, and the fixed format as illustrated in FIG. 3 can be determined. So long as it is applicable to any system.

【００４３】また、既述した実施例では、情報処理シス
テムとのインタフェースとしてｃｓｔテーブルを１つず
つ処理することが前提となっているため、上述した形式
変換や補完処理を行っているが、これに限らず、テーブ
ルの形式や補完処理に適用されるルールは対象とする情
報処理システムに応じて適宜変更される。Further, in the above-described embodiment, since it is premised that the cst table is processed one by one as an interface with the information processing system, the above-mentioned format conversion and complementary processing are performed. However, the format of the table and the rules applied to the complementary processing are appropriately changed according to the target information processing system.

【００４４】[0044]

【発明の効果】以上説明したように、この発明によれ
ば、種々の情報処理システムへの指示や質問を自然言語
文で行なうことが可能となるので、システム利用者が各
システムの利用法を習得する必要がなくなり、負担が軽
減される。As described above, according to the present invention, it is possible to give instructions and questions to various information processing systems in natural language sentences, so that the system user can use each system. Eliminates the need to learn and reduces the burden.

[Brief description of drawings]

【図１】この発明の一実施例の全体構成を示すブロック
図である。FIG. 1 is a block diagram showing the overall configuration of an embodiment of the present invention.

【図２】同実施例による概念辞書４の記述例を示す図で
ある。FIG. 2 is a diagram showing a description example of a concept dictionary 4 according to the same embodiment.

【図３】同実施例による形式変換部６において採用され
るデータ形式の一例を示す図である。FIG. 3 is a diagram showing an example of a data format adopted in a format conversion unit 6 according to the same embodiment.

【図４】同実施例による補完処理で適用されるルールの
一例を示す図である。FIG. 4 is a diagram showing an example of rules applied in a complementing process according to the embodiment.

【図５】要求種別に対応する質問文タイプ毎に必要とさ
れる情報と各質問文タイプの例文とを示す図である。FIG. 5 is a diagram showing information required for each question sentence type corresponding to a request type and an example sentence of each question sentence type.

【図６】同実施例による文形式変換装置１００の動作を
示すフローチャートである。FIG. 6 is a flowchart showing an operation of the sentence format conversion device 100 according to the same embodiment.

【図７】同実施例による補完処理の一例を示す図であ
る。FIG. 7 is a diagram showing an example of complementary processing according to the embodiment.

【図８】同実施例による文形式変換装置１００の処理結
果がシステムインタフェース部８から主制御部１１へ渡
されるときの形式例を示す図である。FIG. 8 is a diagram showing a format example when a processing result of the sentence format conversion apparatus 100 according to the embodiment is passed from the system interface section 8 to the main control section 11.

[Explanation of symbols]

１単語分け処理部２要求種別判定部３概念判定部４概念辞書５不要語辞書６形式変換部７補完処理部８システムインタフェース部９入力装置１０出力装置１１主制御部１２ＤＢＭＳ１３ＤＢ 1 word division processing unit 2 request type determination unit 3 concept determination unit 4 concept dictionary 5 unnecessary word dictionary 6 format conversion unit 7 complementation processing unit 8 system interface unit 9 input device 10 output device 11 main control unit 12 DBMS 13 DB

───────────────────────────────────────────────────── フロントページの続き (72)発明者島田茂夫東京都千代田区内幸町一丁目１番６号日本電信電話株式会社内 ─────────────────────────────────────────────────── ─── Continuation of front page (72) Inventor Shigeo Shimada 1-1-6 Uchisaiwaicho, Chiyoda-ku, Tokyo Nihon Telegraph and Telephone Corporation

Claims

[Claims]

1. A sentence format conversion device for converting a directive sentence and a question sentence expressed in natural language into a format that can be interpreted by an information processing system, and a word division processing unit for dividing an input sentence into words. Of the words in the input sentence, a request type determination unit that determines the type of the instruction or question based on the keyword that represents the type, a concept dictionary that describes information indicating which system each word belongs to, and a concept dictionary An unnecessary word dictionary that describes unnecessary words such as auxiliary particles and auxiliary verbs that do not include information required by the information processing system, but what information does each word in the input sentence mean? Based on the concept dictionary and removes unnecessary words in the input sentence based on the unnecessary word dictionary, and a word omitted in the input sentence according to the type of the input sentence. The input sentence is converted into an expression format that can be interpreted by the information processing system, based on a complementary processing unit that complements the same predetermined complementary rule, and information obtained by the concept determination unit and information obtained by the complementary processing unit. A sentence format conversion device, comprising: