JP2001306603A

JP2001306603A - Structured database generating device

Info

Publication number: JP2001306603A
Application number: JP2000118430A
Authority: JP
Inventors: Tadaaki Sakamoto; 忠昭坂本
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2000-04-19
Filing date: 2000-04-19
Publication date: 2001-11-02

Abstract

PROBLEM TO BE SOLVED: To solve such a problem that a conventional structured database generating device requires a word database wherein a sufficient number of words are previously stored and, particularly, a large-scale data base requires considerable operation for word registration. SOLUTION: This structured database generating device is equipped with a name data storage means 3, a node name candidate extracting means 4 which makes the longest-length matching comparison between character strings of pieces of name data from the beginning to the end to extract a node name candidate, a node name selecting means 5 which selects a proper node name, a hierarchical structure generating means 6 which generates a structure database by constituting a hierarchical structure of selected node names, etc., and a hierarchical structure storage means 7 which stores the structured database.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、入力される名称
データに含まれている構造情報から階層別に構成要素を
抽出するとともに当該構成要素間の相互関係を規定して
構造化データベースを作成する構造化データベース作成
装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a structure for extracting constituent elements by hierarchy from structural information included in input name data and defining a mutual relationship between the constituent elements to create a structured database. The present invention relates to a structured database creation device.

【０００２】[0002]

【従来の技術】業務システム等において管理される対象
物については、他の対象物と区別するために、当該対象
物の構造に係る各階層の構成要素名をつなげて当該対象
物を表現する名称として与えることが多い。このような
名称表現を用いる典型的な例として、プラントの設備デ
ータが挙げられる。プラントの設備管理において用いら
れる名称データとしては、例えば「○○タンク××ポン
プ△△弁」という表現が用いられる。すなわち、データ
の対象は「△△弁」という弁であるが、他の△△弁と区
別するために、［「○○タンク」に設けられた「××ポ
ンプ」に付設された「△△弁」である］という構造的な
情報を付加して命名したものである。したがって、名称
データに含まれる対象物の構造情報から各階層の構成要
素情報を抽出することにより、処理対象である設備の構
造を反映した構造化データベースを作成することができ
る。2. Description of the Related Art In order to distinguish an object managed in a business system or the like from other objects, a name expressing the object by connecting the component names of respective layers relating to the structure of the object. Often given as. A typical example of using such a name expression is facility data of a plant. As the name data used in the facility management of the plant, for example, the expression “XX tank XX pump / valve” is used. In other words, although the target of the data is a valve called “△△ valve”, in order to distinguish it from other valves, [［pump attached to “xx pump” provided in “「 tank ” The name is added with structural information such as "valve". Therefore, by extracting the component information of each layer from the structure information of the object included in the name data, it is possible to create a structured database that reflects the structure of the equipment to be processed.

【０００３】構造化データベースは、対象物に係る構成
要素と当該構成要素間の相互関係とから表現される対象
物の構造を容易に把握することができるために、単なる
名称データの集合からは得られない以下に記載するよう
な長所を有する。第１に、効率の良い検索を提供するこ
とができる。例えば、ある設備に付属する設備全体の検
索等についての構成要素間の相互関係に関わる検索は、
相互関係に関わるリンク（この場合には所属関係を表す
リンク）を辿るのみで過不足なく高速に実施することが
できる。[0003] The structured database can be easily obtained from a mere set of name data because the structure of the object represented by the components related to the object and the mutual relation between the components can be easily grasped. It has the following advantages that cannot be obtained. First, an efficient search can be provided. For example, a search related to the interrelation between components, such as a search for the entire facility attached to a certain facility,
Only by following the link related to the mutual relationship (in this case, the link indicating the affiliation relationship), it is possible to execute the process at high speed without any excess or shortage.

【０００４】第２に、データベースの整備を容易とす
る。既存の大規模データベースは、電子化の際に、管理
番号体系、名称の命名規則等の見直しを実施することが
多い。このような場合に、対象物を構成する構成要素と
当該構成要素間の関係とを表現した構造化データベース
が作成されていれば、対象物の全体構造を容易に把握す
ることができるから、管理番号の割り当て、名称の命名
等に係る規則をトップダウン的に効率良く決定すること
ができる。[0004] Second, maintenance of a database is facilitated. When digitizing an existing large-scale database, the management number system, naming rules, and the like are often reviewed. In such a case, if a structured database expressing the constituent elements of the object and the relationship between the elements is created, the entire structure of the object can be easily grasped. Rules concerning number assignment, name naming, and the like can be efficiently determined from the top down.

【０００５】第３に、データベースの統合を容易とす
る。例えば、設備データの場合では、設備管理業務の効
率化を目的として、類似のプラントのデータベースを統
合して複数の設備を統括的に管理する試みがなされる。
この際、各設備について構造化データベースが作成され
ていれば、データベースの統合について、構造的に共通
化できる部分とローカルデータとして維持する部分との
識別に係る判断を容易に行うことができる。Third, database integration is facilitated. For example, in the case of equipment data, an attempt is made to integrate a database of similar plants and manage a plurality of equipments in an integrated manner in order to improve the efficiency of equipment management work.
At this time, if a structured database is created for each facility, it is possible to easily determine the integration of the databases with respect to identification of a part that can be structurally shared and a part that is maintained as local data.

【０００６】以上のような理由から、特に設備管理シス
テムの構築に係り、入力される名称データから構造化デ
ータベースを作成する方法が提案されている。図１４
は、例えば特開平６−５０７７８号公報に記載された従
来の構造化データベース作成装置の構成を示す図であ
る。図１４において、１０１ａ，１０１ｂはユーザから
入力される名称データ、１０２はプラント設備の構造、
機能、故障モードに関する名称を予めカテゴリー毎に記
憶する単語データベース、１０３は予防保全情報を蓄積
する構造化データベースとしての予防保全データベー
ス、１０４は入力される名称データ１０１ａ，１０１ｂ
に対して単語データベース１０２を参照して分割処理を
行い、予防保全データベースに構造的に記憶させる情報
処理手段である。なお、単語データベース１０２には、
同義語や類義語を集めたシソーラスが用意され、このシ
ソーラスを参照することで、例えば“プラントＡ”と
“プラントＡ’”とは同じ対象物を表すことを認識する
ことが可能となる。For the above reasons, a method of creating a structured database from input name data has been proposed, particularly in connection with the construction of an equipment management system. FIG.
FIG. 1 is a diagram showing a configuration of a conventional structured database creation device described in, for example, Japanese Patent Application Laid-Open No. 6-50778. In FIG. 14, 101a and 101b are name data input by the user, 102 is the structure of the plant equipment,
A word database for storing names relating to functions and failure modes for each category in advance, 103 is a preventive maintenance database as a structured database for storing preventive maintenance information, and 104 is input name data 101a, 101b.
This is an information processing means for performing a division process with respect to the word database 102 and storing the divided data structurally in the preventive maintenance database. Note that the word database 102 includes
A thesaurus in which synonyms and synonyms are collected is prepared. By referring to this thesaurus, it is possible to recognize that “plant A” and “plant A ′” represent the same object, for example.

【０００７】次に動作について説明する。ユーザから入
力された名称データ１０１ａ，１０１ｂに対して、情報
処理手段１０４が単語データベース１０２を参照して検
索を実施し、マッチした単語によって名称データ１０１
ａ，１０１ｂを分割し、階層的な構造に変換して予防保
全データベース１０３に蓄積する。Next, the operation will be described. The information processing unit 104 performs a search on the name data 101a and 101b input by the user with reference to the word database 102, and uses the matched words to search the name data 101a and 101b.
a, 101 b are divided, converted into a hierarchical structure, and stored in the preventive maintenance database 103.

【０００８】例えば、単語データベース１０２に、プラ
ントレベルの単語データとして「原子力発電所ｄｅｍｏ
プラント」、系統レベルの単語データとして「原子炉再
循環系」、機器レベルの単語データとして「ポンプ」、
部品レベルの単語データとして「ケーシング」を予め記
憶させておくことにより、入力データ「原子力発電所ｄ
ｅｍｏプラント原子炉再循環系ポンプケーシング」を、
「原子力発電所ｄｅｍｏプラント」−「原子炉再循環
系」−「ポンプ」−「ケーシング」という階層構造に変
換して蓄積することができる。また、単語データベース
１０２内のシソーラスに、「原子炉再循環系」と「ＰＬ
Ｒ系」とが同じ意味を示すものとして登録されていれ
ば、入力データ「原子力発電所ｄｅｍｏプラントＰＬＲ
系ポンプケーシング」を、「原子力発電所ｄｅｍｏプラ
ント」−「原子炉再循環系」−「ポンプ」−「ケーシン
グ」という階層構造に等価的に変換して蓄積することが
できる。For example, in the word database 102, "nuclear power plant demo
Plant, system-level word data "reactor recirculation system", equipment-level word data "pump"
By storing “casing” in advance as word data at the part level, the input data “nuclear power plant d
emo plant reactor recirculation pump casing "
It can be converted and stored in a hierarchical structure of "nuclear power plant demo plant"-"reactor recirculation system"-"pump"-"casing". In addition, the thesaurus in the word database 102 includes “reactor recirculation system” and “PL
If "R system" is registered as having the same meaning, the input data "nuclear power plant demo plant PLR
The "system pump casing" can be equivalently converted into a hierarchical structure of "nuclear power plant demo plant"-"reactor recirculation system"-"pump"-"casing" and accumulated.

【０００９】[0009]

【発明が解決しようとする課題】従来の構造化データベ
ース作成装置は以上のように構成されているので、単語
データベース１０２に予め充分な量の単語を登録してお
くか、あるいは名称データの入力と並行してユーザが単
語データベース１０２に単語を対話的に追加していく必
要がある。しかし、単語データベース１０２を予め作成
するためには、入力される可能性のある全ての名称デー
タを予見して、当該名称データを構成する各単語を人手
により調査して事前に登録する必要があり、特に大規模
データベースでは多大な作業が発生するという課題があ
った。Since the conventional structured database creating apparatus is configured as described above, it is necessary to register a sufficient amount of words in the word database 102 in advance, or to input name data. At the same time, the user needs to interactively add words to the word database 102. However, in order to create the word database 102 in advance, it is necessary to foresee all the name data that may be input, manually investigate each word constituting the name data, and register them in advance. In particular, there is a problem that a large amount of work occurs in a large-scale database.

【００１０】一方、名称データの入力と並行して単語を
対話的に追加していく場合には、単語データを一つ一つ
入力する必要があり、特に大規模なデータベースではユ
ーザにとって大きな負担になるという課題があった。On the other hand, when words are added interactively in parallel with the input of the name data, it is necessary to input the word data one by one. There was a problem of becoming.

【００１１】また、一般的に設備に係る名称はプラント
毎に異なる場合が多いので、上記の単語データベース作
成作業はプラント毎に実施する必要があり、さらに多く
の作業が必要になるという課題があった。[0011] In addition, since the names of facilities generally differ from plant to plant in many cases, the above-mentioned word database creation work must be performed for each plant, and there is a problem that more work is required. Was.

【００１２】この発明は上記のような課題を解決するた
めになされたもので、予め単語データベースを準備する
必要がなく、名称データに含まれている構造情報から階
層別に構成要素を抽出して構造化データベースを自動的
に作成する構造化データベース作成装置を得ることを目
的とする。SUMMARY OF THE INVENTION The present invention has been made to solve the above-described problems, and it is not necessary to prepare a word database in advance, and a component is extracted for each layer from the structure information included in the name data. It is an object of the present invention to obtain a structured database creation device for automatically creating a structured database.

【００１３】[0013]

【課題を解決するための手段】この発明に係る構造化デ
ータベース作成装置は、入力される名称データを蓄積す
る名称データ記憶手段と、構成要素に相当するノードを
階層的に配列するとともに、各ノード間の相互関係を規
定して構成される構造化データベースを格納する階層構
造記憶手段と、名称データ記憶手段に蓄積されている名
称データを読み込んで、各名称データ毎に他の全ての名
称データとの間で名称データを成す文字列に対する先頭
からの最長一致比較および末尾からの最長一致比較をそ
れぞれ実施して、構造化データベースを構成するノード
の候補を表現する文字列として与えられるノード名候補
を抽出するノード名候補抽出手段と、ノード名候補抽出
手段により抽出されたノード名候補を提示して、これら
ノード名候補から構造化データベースを構成する構成要
素として適正なノードのノード名をユーザに選択させる
ノード名選択手段と、ノード名選択手段により選択され
たノード名を有するノードおよび名称データ記憶手段に
蓄積されている名称データをノード名として有するノー
ドを階層的に配列するとともに、各ノード間の相互関係
を規定して構造化データベースを作成する階層構造作成
手段とを備えるようにしたものである。According to the present invention, there is provided a structured database creating apparatus comprising: a name data storage unit for storing input name data; nodes corresponding to constituent elements arranged hierarchically; A hierarchical structure storing means for storing a structured database configured by defining the interrelationship between them, and reading the name data stored in the name data storing means, and storing all other name data for each name data. A longest match comparison from the beginning and a longest match comparison from the end are performed on the character string forming the name data between the two, and a node name candidate given as a character string representing a node candidate constituting the structured database is obtained. The node name candidate extracting means to be extracted and the node name candidates extracted by the node name candidate extracting means are presented, and from these node name candidates, Node name selection means for allowing a user to select a node name of an appropriate node as a component constituting the structured database, and a node having the node name selected by the node name selection means and name data stored in name data storage means And a hierarchical structure creating means for creating a structured database by defining the interrelationship between the nodes in a hierarchical manner.

【００１４】この発明に係る構造化データベース作成装
置は、階層構造記憶手段に格納された構造化データベー
スをユーザが編集可能とする階層構造編集手段を備える
ようにしたものである。The structured database creating apparatus according to the present invention is provided with a hierarchical structure editing means for enabling a user to edit the structured database stored in the hierarchical structure storage means.

【００１５】この発明に係る構造化データベース作成装
置は、ユーザによる同義語の定義を可能とする同義語定
義手段と、同義語定義手段を用いて定義された同義語を
蓄積する同義語情報記憶手段と、同義語情報記憶手段に
蓄積された同義語情報を参照して名称データおよびノー
ド名を同じ意義を有する代表的な用語に変換する同義語
変換手段とを備えるようにしたものである。[0015] A structured database creating apparatus according to the present invention comprises a synonym definition means for enabling a user to define a synonym, and a synonym information storage means for storing synonyms defined using the synonym definition means. And synonym conversion means for referring to the synonym information stored in the synonym information storage means, and converting the name data and the node name into representative terms having the same meaning.

【００１６】この発明に係る構造化データベース作成装
置は、ノード名候補抽出の際に適用される抽出補助規則
を蓄積する抽出補助規則記憶手段と、名称データ記憶手
段に蓄積されている名称データを読み込んで、各名称デ
ータ毎に他の全ての名称データとの間で名称データを成
す文字列に対する先頭からの最長一致比較および末尾か
らの最長一致比較をそれぞれ実施して、構造化データベ
ースを構成するノードの候補を表現する文字列として与
えられるノード名候補を抽出するとともに、得られたノ
ード名候補の抽出過程が抽出補助規則記憶手段に蓄積さ
れているいずれかの規則に該当する場合には当該ノード
名候補を破棄するノード名候補抽出手段とを備えるよう
にしたものである。An apparatus for creating a structured database according to the present invention reads an extraction auxiliary rule storing means for storing an extraction auxiliary rule applied when extracting a node name candidate, and reads name data stored in a name data storage means. In each of the name data, the longest match comparison from the beginning and the longest match comparison from the end with respect to the character string forming the name data with all other name data are performed, and the nodes constituting the structured database A node name candidate given as a character string representing the candidate of the extracted node name candidate, and if the extraction process of the obtained node name candidate corresponds to any of the rules stored in the extraction auxiliary rule storage means, Node name candidate extracting means for discarding name candidates.

【００１７】この発明に係る構造化データベース作成装
置は、名称データに含まれる用語に対する補正方法を定
義した名称補正規則を蓄積する名称補正規則記憶手段
と、入力データを構成する各名称データに対して名称補
正規則記憶手段に格納されている名称補正規則を適用し
て、当該名称補正規則に基づいて変換された名称データ
を名称データ記憶手段に蓄積する名称補正手段とを備え
るようにしたものである。A structured database creation device according to the present invention includes a name correction rule storage unit for storing a name correction rule defining a correction method for a term included in name data, and a name correction rule storage unit for storing name data constituting input data. A name correction unit that applies a name correction rule stored in a name correction rule storage unit and accumulates name data converted based on the name correction rule in the name data storage unit. .

【００１８】[0018]

【発明の実施の形態】以下、この発明の実施の一形態を
説明する。実施の形態１．図１は、この発明の実施の形態１による
構造化データベース作成装置の構成を示す図である。１
は１または複数の名称データから成る入力データ、２は
ユーザ、３は入力される名称データを記憶する名称デー
タ記憶手段、４は名称データ記憶手段３に記憶された各
名称データを検定して構成要素に相当するノードの候補
を表現する文字列を名称データから抽出するノード名候
補抽出手段、５はノード名候補抽出手段４で得られたノ
ード名をユーザ２に開示して各ノードの構成要素として
の適否をユーザ２に選択させるノード名選択手段、６は
名称データと当該名称データについて選択された適正な
ノード名とを基にして構造化データベースを作成する階
層構造作成手段、７は階層構造作成手段６により作成さ
れる構造化データベースを格納する階層構造記憶手段、
８は名称データ記憶手段３、ノード名候補抽出手段４、
ノード名選択手段５、階層構造作成手段６および階層構
造記憶手段７を有して構成される構造化データベース作
成装置である。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS One embodiment of the present invention will be described below. Embodiment 1 FIG. FIG. 1 is a diagram showing a configuration of a structured database creation device according to Embodiment 1 of the present invention. 1
Is input data composed of one or a plurality of name data, 2 is a user, 3 is name data storage means for storing input name data, and 4 is a configuration which tests each name data stored in the name data storage means 3. A node name candidate extracting means for extracting a character string representing a node candidate corresponding to an element from the name data, 5 discloses the node name obtained by the node name candidate extracting means 4 to the user 2, and configures the components of each node. Node name selecting means for allowing the user 2 to select whether or not the name is appropriate, 6 is a hierarchical structure creating means for creating a structured database based on name data and an appropriate node name selected for the name data, 7 is a hierarchical structure Hierarchical structure storage means for storing the structured database created by the creation means 6,
8 is name data storage means 3, node name candidate extraction means 4,
This is a structured database creation device including a node name selection means 5, a hierarchy structure creation means 6, and a hierarchy structure storage means 7.

【００１９】図２は、複数の名称データから成る入力デ
ータの一例を示す図である。図２において、Ｌ１〜Ｌ１
０はそれぞれ発電プラントにおける設備名称を示すもの
である。なお、入力データの入力方法については、例え
ば紙の帳票に記載された情報を入力担当者が対話的に入
力してもよく、また既存のデータベースに蓄積されてい
る情報をプログラムを用いて一括送信して入力してもよ
い。いずれの場合にも、名称データ記憶手段３には、名
称データが図２に示された形式を保持して蓄積される。FIG. 2 is a diagram showing an example of input data composed of a plurality of name data. In FIG. 2, L1 to L1
0 indicates a facility name in the power plant. Regarding the input method of the input data, for example, the input person may interactively input information described on a paper form, or collectively transmit information stored in an existing database using a program. May be entered. In any case, the name data is stored in the name data storage means 3 while retaining the format shown in FIG.

【００２０】図３は、構造化データベースにおける階層
構造を示す図である。図２に示される入力データ１に係
る各名称データをノード名に分割することで、図３に示
される階層構造が作成される。図３に示されるように、
階層構造は、四角枠で示されるノードと称される構成要
素と、これらノード間の種々の関係を表現する線（リン
ク）とから構成される。FIG. 3 is a diagram showing a hierarchical structure in the structured database. The hierarchical structure shown in FIG. 3 is created by dividing each name data of the input data 1 shown in FIG. 2 into node names. As shown in FIG.
The hierarchical structure is composed of components called nodes indicated by square frames and lines (links) expressing various relationships between these nodes.

【００２１】図３において、１１は概念的なモノを表す
クラスノード、１２は実際に存在する実体的なモノを表
すインスタンスノードである。図２に示される名称デー
タＬ１〜Ｌ１０は、最終的にはこのインスタンスノード
として階層構造を構成することになる。１３はクラスノ
ード１１間の相互関係を表すｉｓ＿ａリンクであり、例
えば［「入口弁」は「弁」である］という関係を表現す
る。図３では、ｉｓ＿ａリンクの左側のノードの方が右
側のノードよりも抽象化の程度が高くなっている。１４
は最も左側に位置していて、全てのノードの中で最も抽
象化された仮想的なノードであるルートノードであり、
構造化データベースが初期値として準備している特殊な
ノードである。１５はクラスノード１１とインスタンス
ノード１２との間の「概念−実体」関係を表すｉｎｓｔ
ａｎｃｅ＿ｏｆリンクである。１６はインスタンスノー
ド１２間の所有関係を表すｈａｓ＿ａリンクであり、例
えば［「Ａ復水器」は「Ａ復水器真空ポンプ」を有して
いる］という関係を表す。また、ｈａｓ＿ａリンク１６
の矢印は、矢印の基端側のノードが矢印の先端側のノー
ドを所有することを意味している。In FIG. 3, reference numeral 11 denotes a class node representing a conceptual object, and reference numeral 12 denotes an instance node representing an actual physical object. The name data L1 to L10 shown in FIG. 2 will eventually form a hierarchical structure as this instance node. Reference numeral 13 denotes an is_a link indicating a mutual relationship between the class nodes 11, and expresses, for example, a relationship [“the inlet valve” is a “valve”]. In FIG. 3, the node on the left side of the is_a link has a higher degree of abstraction than the node on the right side. 14
Is the root node which is located on the leftmost side and is the most abstract virtual node among all the nodes,
This is a special node prepared by the structured database as an initial value. Reference numeral 15 denotes an inst representing a "concept-entity" relationship between the class node 11 and the instance node 12.
ance_of link. Reference numeral 16 denotes a has_a link indicating the ownership relationship between the instance nodes 12, and represents, for example, a relationship "[A condenser] has" A condenser vacuum pump "]. Also, has_a link 16
The arrow means that the node on the proximal end side of the arrow owns the node on the distal end side of the arrow.

【００２２】なお、「Ａ復水器」と「Ｂ復水器」とのよ
うに同種のインスタンスが複数存在する場合には、これ
らのインスタンスの集合としてのクラス「復水器」が存
在する。しかし、「復水回収タンク」のように同種のイ
ンスタンスがそれ自体のみの１つしか存在しない場合に
は、クラス「復水回収タンク」は作成しないものとす
る。When there are a plurality of instances of the same type, such as "A condenser" and "B condenser", there is a class "condenser" as a set of these instances. However, if there is only one instance of the same type as the "condensed water recovery tank", such as the "condensed water recovery tank", the class "condensed water recovery tank" is not created.

【００２３】次に、構造化データベース作成処理全体の
流れを説明する。図４は、構造化データベース作成方法
を示すフローチャートである。第１に、名称データ記憶
手段３が、ユーザから対話的にデータを読み込むか、あ
るいは既存のデータベースから一括して名称データを読
み込んで、当該入力された名称データを蓄積する（ステ
ップＳＴ１）。次に、ノード名候補抽出手段４が、名称
データ記憶手段３に格納されているデータを読み込ん
で、階層構造におけるクラスノードの候補を表現する文
字列を名称データから抽出する（ステップＳＴ２）。次
に、ノード名選択手段５が、ノード名候補抽出手段４に
より抽出された文字列を長さの短い順にユーザに提示し
て、各文字列がそれぞれクラスノード名として適正か否
かをユーザに判断させ、当該判断結果をマウス等で入力
させる（ステップＳＴ３）。Next, the flow of the entire structured database creation process will be described. FIG. 4 is a flowchart showing a structured database creation method. First, the name data storage means 3 reads data interactively from a user or collectively reads name data from an existing database and accumulates the input name data (step ST1). Next, the node name candidate extraction means 4 reads the data stored in the name data storage means 3 and extracts a character string representing a class node candidate in the hierarchical structure from the name data (step ST2). Next, the node name selecting means 5 presents the character strings extracted by the node name candidate extracting means 4 to the user in ascending order of length, and asks the user whether each character string is appropriate as a class node name. The user is made to make a determination, and the result of the determination is input using a mouse or the like (step ST3).

【００２４】次に、階層構造作成手段６は、ステップＳ
Ｔ３においてユーザにより適正であると判断されたクラ
スノード名に基づいてクラスノード１１を生成するとと
もに、クラスノード間のｉｓ＿ａリンクを自動的に張る
ことで階層構造を作成し、当該階層構造を階層構造記憶
手段７に蓄積する（ステップＳＴ４）。なお、このステ
ップＳＴ４に係る処理は、ステップＳＴ３においてユー
ザによりノード名選択が行われる毎に実行してもよく、
ステップＳＴ３において全てのノード名選択が終了した
後にバッチ処理的に実行してもよい。但し、いずれの場
合にも選択されたクラスノード名は文字列の長さの短い
順に入力されてくるものとする。Next, the hierarchical structure creating means 6 executes step S
At T3, the class node 11 is generated based on the class node name determined to be appropriate by the user, and a hierarchical structure is created by automatically establishing an is_a link between the class nodes. The data is stored in the storage means 7 (step ST4). Note that the process relating to step ST4 may be executed each time a node name is selected by the user in step ST3.
It may be executed in a batch process after all the node names are selected in step ST3. However, in each case, the selected class node names are input in ascending order of the character string length.

【００２５】ステップＳＴ４において、クラスノードに
係る階層構造の作成が終了すると、階層構造作成手段６
は、名称データ記憶手段３に蓄積されている名称データ
を長さの短い順に読み込んで、インスタンスノード１２
を生成して階層構造に追加する（ステップＳＴ５）。こ
の際、階層構造記憶手段７に蓄積されている階層構造の
クラスノード１１との間にｉｎｓｔａｎｃｅ＿ｏｆリン
クを張るとともに、インスタンスノード１２間にｈａｓ
＿ａリンクを張ることで、構造化データベースの作成を
終了する。In step ST4, when the creation of the hierarchical structure related to the class nodes is completed, the hierarchical structure creating means 6
Reads the name data stored in the name data storage means 3 in ascending order of length, and reads the instance node 12
Is generated and added to the hierarchical structure (step ST5). At this time, an instance_of link is established between the class node 11 of the hierarchical structure stored in the hierarchical structure storage means 7 and a has
The creation of the structured database is completed by linking the __a link.

【００２６】次に、ステップＳＴ２で実行されるノード
名候補の抽出処理について説明する。図５は、ノード名
候補抽出処理の処理工程を示すフローチャートである。
第１に、ノード名候補リストＮＬを作成して、当該ノー
ド名候補リストＮＬを空リストにする（ステップＳＴ１
１）。次に、カウンタ変数ｉを用いて、以降の処理を全
名称データ数だけ繰り返し実行する（ステップＳＴ１
２）。そして、この繰り返し処理においては、まずｉ番
目の名称データＤｉを取り出す（ステップＳＴ１３）。
次に、局所変数Ｌｆ，Ｌｂ，ＳＬ，Ｓの初期化を実行す
る（ステップＳＴ１４）。なお、Ｌｆ，Ｌｂは文字数、
ＳＬは文字列リスト、Ｓは文字列をそれぞれ格納する変
数であり、それぞれ０、空リスト、空文字列に初期化さ
れる。Next, a description will be given of a node name candidate extraction process executed in step ST2. FIG. 5 is a flowchart illustrating the processing steps of the node name candidate extraction processing.
First, a node name candidate list NL is created and the node name candidate list NL is made an empty list (step ST1).
1). Next, using the counter variable i, the subsequent processing is repeatedly executed by the number of all name data (step ST1).
2). Then, in this repetition processing, first, the i-th name data Di is extracted (step ST13).
Next, the local variables Lf, Lb, SL, and S are initialized (step ST14). Note that Lf and Lb are the number of characters,
SL is a character string list, and S is a variable for storing a character string, and is initialized to 0, an empty list, and an empty character string, respectively.

【００２７】次に、カウンタ変数ｊを用いて、以降の処
理を全名称データ数だけ繰り返し実行する（ステップＳ
Ｔ１５）。そして、この繰り返し処理においては、まず
同じ名称データを比較しないように、ｉとｊが同じ場合
には即座にステップＳＴ１５に再帰する（ステップＳＴ
１６）。ｉとｊとが異なる場合には、ｊ番目の名称デー
タＤｊを取り出す（ステップＳＴ１７）。そして、名称
データＤｉと名称データＤｊとの間で以下の２つの文字
列比較処理を実行する。Next, using the counter variable j, the subsequent processing is repeatedly executed by the number of all name data (step S).
T15). In this repetition processing, first, if i and j are the same, the process immediately returns to step ST15 so as not to compare the same name data (step ST15).
16). If i and j are different, the j-th name data Dj is extracted (step ST17). Then, the following two character string comparison processes are performed between the name data Di and the name data Dj.

【００２８】第１の文字列比較処理として、まず名称デ
ータＤｉの文字列と名称データＤｊの文字列とを先頭か
ら比較して、先頭から連続して一致した文字数をＬ
ｆ’、名称データＤｉ，Ｄｊから一致した文字列を除い
た残りの文字列をそれぞれＳｉ，Ｓｊとする（ステップ
ＳＴ１８）。次に、文字数Ｌｆ’と、名称データＤｉに
係る第１の文字列比較処理においてその時点までに最も
長く一致した文字数（以下、第１最長一致文字数と称す
る）として与えられるＬｆとを比較する（ステップＳＴ
１９）。ステップＳＴ１９における比較において、Ｌ
ｆ’＝Ｌｆの場合、すなわち今回の一致文字数が第１最
長一致文字数と等しい場合には、残りの文字列Ｓｉ，Ｓ
ｊを文字列リストＳＬに追加する。また、ステップＳＴ
１９における比較において、Ｌｆ’＞Ｌｆの場合、すな
わち今回の一致文字数が第１最長一致文字数より多い場
合には、残りの文字列Ｓｉ，Ｓｊから成るリストを新た
な文字列リストＳＬに置き換えるとともに、今回の一致
文字数Ｌｆ’を第１最長一致文字数Ｌｆに置き換える。
なお、ステップＳＴ１９における比較において、Ｌｆ’
＜Ｌｆの場合には、何ら処理を実行しない。以上のステ
ップＳＴ１８からステップＳＴ２１に係る第１の文字列
比較処理により、名称データＤｉに対して全名称データ
のなかで文字列の比較について先頭から最も文字が一致
するデータが全て検索されて、名称データＤｉを含むこ
れらの名称データから一致文字列を除いた文字列が全て
ノード名候補として抽出される。As a first character string comparison process, first, the character string of the name data Di and the character string of the name data Dj are compared from the beginning, and the number of characters that are continuously matched from the beginning is L.
f ′ and the remaining character strings obtained by removing the matched character strings from the name data Di and Dj are set to Si and Sj, respectively (step ST18). Next, the number of characters Lf 'is compared with Lf given as the number of characters that have longest matched up to that point in the first character string comparison process for the name data Di (hereinafter, referred to as the first longest matching character number) ( Step ST
19). In the comparison in step ST19, L
If f ′ = Lf, that is, if the current number of matching characters is equal to the first longest matching character number, the remaining character strings Si, S
j is added to the character string list SL. Step ST
In the comparison in 19, if Lf ′> Lf, that is, if the current number of matching characters is greater than the first longest matching character number, the list including the remaining character strings Si and Sj is replaced with a new character string list SL, The current matching character number Lf ′ is replaced with the first longest matching character number Lf.
Note that in the comparison in step ST19, Lf ′
If <Lf, no processing is performed. In the first character string comparison process from step ST18 to step ST21, all data having the same character from the beginning in character string comparison among all the name data are searched for the name data Di, and the name is searched. All character strings obtained by removing the matching character strings from these name data including the data Di are extracted as node name candidates.

【００２９】次に、第２の文字列比較処理として、まず
名称データＤｉの文字列と名称データＤｊの文字列とを
末尾から比較して、末尾から連続して一致した文字数を
Ｌｂ’、一致した文字列をＳ’とする（ステップＳＴ２
２）。次に、文字数Ｌｂ’と、名称データＤｉに係る第
２の文字列比較処理においてその時点までに最も長く一
致した文字数（以下、第２最長一致文字数と称する）と
して与えられるＬｂとを比較する（ステップＳＴ２
３）。ステップＳＴ２３における比較において、Ｌｂ’
＞Ｌｂの場合、すなわち今回の一致文字数が第２最長一
致文字数より多い場合には、文字列Ｓ’を文字列Ｓに置
き換えるとともに、今回の一致文字数Ｌｂ’を第２最長
一致文字数Ｌｂに置き換える（ステップＳＴ２４）。な
お、ステップＳＴ２３における比較において、上記以外
の場合には、何ら処理を実行しない。以上のステップＳ
Ｔ２２からステップＳＴ２４に係る第２の文字列比較処
理により、名称データＤｉに対して全名称データのなか
で文字列の比較について末尾から最も文字が一致するデ
ータが全て検索されて、一致文字列がノード名候補とし
て抽出される。Next, as a second character string comparison process, first, the character string of the name data Di and the character string of the name data Dj are compared from the end, and the number of characters that match continuously from the end is Lb '. The resulting character string is defined as S ′ (step ST2
2). Next, the number of characters Lb ′ is compared with Lb given as the number of characters that have longest matched up to that point in the second character string comparison process for the name data Di (hereinafter, referred to as the second longest matching character number) ( Step ST2
3). In the comparison in step ST23, Lb ′
If> Lb, that is, if the current number of matching characters is greater than the second longest matching character number, the character string S ′ is replaced with the character string S, and the current matching character number Lb ′ is replaced with the second longest matching character number Lb ( Step ST24). In the comparison in step ST23, in the cases other than the above, no processing is performed. Step S above
By the second character string comparison process from T22 to step ST24, all data having the same character from the end in character string comparison among all the name data are searched for the name data Di, and the matching character string is found. It is extracted as a node name candidate.

【００３０】次に、ステップＳＴ１３からステップＳＴ
２４で実行された名称データＤｉについてのノード名候
補抽出処理の結果として得られた文字列リストＳＬおよ
び文字列Ｓを全体のノード名候補リストＮＬに追加す
る。なお、この実施の形態では、文字列の先頭からの最
長一致比較を第１に実行して、文字列の末尾からの最長
一致比較を第２に実行しているが、この実行順序は逆に
してもよい。Next, from step ST13 to step ST
The character string list SL and the character string S obtained as a result of the node name candidate extraction processing for the name data Di executed in 24 are added to the entire node name candidate list NL. In this embodiment, the longest match comparison from the beginning of the character string is executed first, and the longest match comparison from the end of the character string is executed second, but this execution order is reversed. You may.

【００３１】図６は、ノード名候補抽出処理の実行結果
を示す図である。図２の複数の名称データから成る入力
データ１に対して、図５に示されたフローチャートに基
づくノード名候補抽出処理を実行した結果が示されてい
る。図６において、２１は図５に示されるステップＳＴ
１８からステップＳＴ２１の第１の文字列比較処理、す
なわち文字列の先頭からの最長一致比較に係る処理を適
用した結果抽出された文字列を示すものである。例え
ば、名称データＬ１「Ａ復水器真空ポンプ入口弁」につ
いては、文字列の先頭からの文字の一致が最長となる名
称データはＬ６「Ａ復水器真空ポンプ入口元弁」であっ
て、一致文字列は「Ａ復水器真空ポンプ入口」となるか
ら、抽出文字列は「弁」と「元弁」とになる。また、名
称データＬ４「Ａ復水器」については、文字列の先頭か
らの文字の一致が最長となる名称データとしてはＬ１
「Ａ復水器真空ポンプ入口弁」、Ｌ６「Ａ復水器真空ポ
ンプ入口元弁」、Ｌ８「Ａ復水器真空ポンプ」の３つが
該当し、一致文字列は「Ａ復水器」となるから、抽出文
字列は「真空ポンプ入口弁」、「真空ポンプ入口元弁」
および「真空ポンプ」となる。FIG. 6 is a diagram showing an execution result of the node name candidate extraction processing. The result of executing the node name candidate extraction processing based on the flowchart shown in FIG. 5 for the input data 1 including the plurality of name data in FIG. 2 is shown. 6, reference numeral 21 denotes a step ST shown in FIG.
18 shows the character string extracted as a result of applying the first character string comparison processing of step ST21, that is, the processing relating to the longest match comparison from the head of the character string. For example, for the name data L1 "A condenser vacuum pump inlet valve", the name data with the longest match of characters from the beginning of the character string is L6 "A condenser vacuum pump inlet valve", Since the matching character string is “A condenser vacuum pump inlet”, the extracted character strings are “valve” and “base valve”. In addition, as for the name data L4 “A condenser”, the name data having the longest character match from the head of the character string is L1.
"A condenser vacuum pump inlet valve", L6 "A condenser vacuum pump inlet main valve", L8 "A condenser vacuum pump", and the matching character string is "A condenser" Therefore, the extracted strings are "Vacuum pump inlet valve" and "Vacuum pump inlet main valve".
And a "vacuum pump".

【００３２】また、２２は図５に示されるステップＳＴ
２２からステップＳＴ２４の第２の文字列比較処理、す
なわち文字列の末尾からの最長一致比較に係る処理を適
用した結果抽出された文字列を示すものである。例え
ば、名称データＬ１「Ａ復水器真空ポンプ入口弁」につ
いては、文字列の末尾からの文字の一致が最長となる名
称データはＬ２「Ｂ復水器真空ポンプ入口弁」であっ
て、一致文字列すなわち抽出文字列は「復水器真空ポン
プ入口弁」となる。Step ST22 shown in FIG.
22 shows a character string extracted as a result of applying the second character string comparison processing from step 22 to step ST24, that is, the processing relating to the longest match comparison from the end of the character string. For example, as for the name data L1 "A condenser vacuum pump inlet valve", the name data with the longest matching of characters from the end of the character string is L2 "B condenser vacuum pump inlet valve". The character string, that is, the extracted character string, is “condenser vacuum pump inlet valve”.

【００３３】そして、２３は抽出文字列２１と抽出文字
列２２とを統合して重複を取り除くことで得られた、入
力データ１に対する全てのノード名候補に係るノード名
候補文字列リストである。文字列リスト２３のなかに
は、図３に示されたクラスノード１１の名称がすべて含
まれている。したがって、図５に示されたフローチャー
トに基づくノード名候補抽出処理を実行することで、ユ
ーザが階層構造を作成するための単語辞書を予め準備し
ておかなくても、名称データからクラスノード名を自動
的に抽出することができる。なお、文字列リスト２３に
は、図３に示されるクラスノードには含まれていない
「元弁」という文字列が含まれているが、これは、ステ
ップＳＴ３におけるユーザ選択により適正でないクラス
ノードを表現する文字列として除外されるので問題はな
い。Reference numeral 23 denotes a node name candidate character string list relating to all node name candidates for the input data 1 obtained by integrating the extracted character strings 21 and 22 to remove duplication. The character string list 23 contains all the names of the class nodes 11 shown in FIG. Therefore, by executing the node name candidate extraction process based on the flowchart shown in FIG. 5, the class node name can be obtained from the name data without the user having to prepare a word dictionary for creating a hierarchical structure in advance. Can be extracted automatically. Note that the character string list 23 includes a character string “Genben” that is not included in the class node illustrated in FIG. 3. There is no problem because it is excluded as a character string to be expressed.

【００３４】次に、図４に示されるステップＳＴ４にお
けるクラス階層作成処理について説明する。図７は、ク
ラス階層作成処理の処理工程を示すフローチャートであ
る。図４に示されるステップＳＴ３においてユーザによ
り選択された個々のクラスノード名について、当該クラ
スノード名を有するクラスノードを作成し、各クラスノ
ード毎に図７に示されるクラス階層作成処理を適用して
当該クラスノードを階層構造に順次追加していく。な
お、この際、クラスノードとしてはクラスノード名の文
字数が短いものから順に処理する。Next, the class hierarchy creation processing in step ST4 shown in FIG. 4 will be described. FIG. 7 is a flowchart showing the processing steps of the class hierarchy creation processing. For each class node name selected by the user in step ST3 shown in FIG. 4, a class node having the class node name is created, and the class hierarchy creation process shown in FIG. 7 is applied to each class node. The class nodes are sequentially added to the hierarchical structure. At this time, the class nodes are processed in ascending order of the number of characters of the class node name.

【００３５】このクラス階層作成処理においては、第１
にルートノードを含むクラスノードを格納する局所変数
であるＮｐを用意して、Ｎｐにルートノード１４を格納
する（ステップＳＴ３１）。次に、ノードＮｐにｉｓ＿
ａリンクで接続された子ノードが存在するか否かを判定
する（ステップＳＴ３２）。子ノードが存在する場合に
は、カウンタ変数ｉを用いて、以降の処理を子ノードの
数だけ繰り返し実行する（ステップＳＴ３３）。そし
て、この繰り返し処理においては、子ノードをＮｉとお
いて、当該子ノードＮｉのクラスノード名が今回の追加
対象クラスノードのクラスノード名の末尾に含まれてい
るか否かを判定する（ステップＳＴ３４）。例えば子ノ
ードＮｉのクラスノード名が「入口弁」である場合に
は、追加対象ノードのクラスノード名が「真空ポンプ入
口弁」であればｙｅｓであり、追加対象ノードのクラス
ノード名が「真空ポンプ出口弁」であればｎｏである。In this class hierarchy creation processing, the first
, A local variable Np for storing a class node including a root node is prepared, and the root node 14 is stored in Np (step ST31). Next, is_ is added to the node Np.
It is determined whether there is a child node connected by link a (step ST32). If there are child nodes, the subsequent processing is repeatedly executed by the number of child nodes using the counter variable i (step ST33). In this repetition process, the child node is set to Ni, and it is determined whether or not the class node name of the child node Ni is included at the end of the class node name of the current addition target class node (step ST34). . For example, when the class node name of the child node Ni is “entrance valve”, if the class node name of the addition target node is “vacuum pump inlet valve”, “yes”, and the class node name of the addition target node is “vacuum” No for pump outlet valve.

【００３６】ステップＳＴ３４における判定がｙｅｓの
場合には、子ノードＮｉをＮｐに置き換えて（ステップ
ＳＴ３５）、ステップＳＴ３２へ再帰する。一方、ステ
ップＳＴ３４における判定がｎｏの場合には、次の子ノ
ードについてステップＳＴ３４における判定処理を実行
して、全ての子ノードに対してステップＳＴ３４の判定
がｎｏである場合には、その時点におけるノードＮｐの
子ノードとして追加対象ノードを追加する（ステップＳ
Ｔ３６）。具体的には、追加対象ノードとノードＮｐと
をｉｓ＿ａリンクで接続する。また、ステップＳＴ３２
における判定がｎｏの場合にも、ステップＳＴ３６にお
ける処理を実行する。図６に示される例においては、文
字列リスト２３の「元弁」以外のクラスノード名につい
て、クラスノード名の文字数が短い順に図７に示される
クラス階層作成処理を適用することで、図３に示される
階層構造のなかでルートノード１４、クラスノード１１
およびｉｓ＿ａリンク１３から成る部分的な階層構造が
作成される。If the determination in step ST34 is yes, the child node Ni is replaced with Np (step ST35), and the process returns to step ST32. On the other hand, if the determination in step ST34 is no, the determination process in step ST34 is executed for the next child node, and if the determination in step ST34 is no for all child nodes, An addition target node is added as a child node of the node Np (step S
T36). Specifically, the addition target node and the node Np are connected by an is_a link. Step ST32
If the determination in step is no, the processing in step ST36 is also performed. In the example shown in FIG. 6, by applying the class hierarchy creation processing shown in FIG. The root node 14 and the class node 11 in the hierarchical structure shown in FIG.
And a partial hierarchical structure consisting of is_a link 13 is created.

【００３７】次に、図４に示されるステップＳＴ５にお
けるインスタンスノード追加処理について説明する。図
４に示されるステップＳＴ５では、ステップＳＴ４にお
いて作成されたクラス階層構造に対して、インスタンス
ノードを追加するとともに、インスタンスノード間にｈ
ａｓ＿ａリンクを張る。Next, the process of adding an instance node in step ST5 shown in FIG. 4 will be described. In step ST5 shown in FIG. 4, an instance node is added to the class hierarchical structure created in step ST4, and h is placed between the instance nodes.
Create an as_a link.

【００３８】第１に、インスタンスノードの追加に関し
ては、名称データ記憶手段３に蓄積されている名称デー
タを文字列の短い順に取り出して、インスタンスノード
を作成した後に、インスタンスノード名を追加対象ノー
ド名として、図７のフローチャートにより示されたアル
ゴリズムを再度適用する。但し、図７に示されるステッ
プＳＴ３６における追加対象ノードの追加に関しては、
ノードＮｐと追加対象ノードとをｉｎｓｔａｎｃｅ＿ｏ
ｆリンクで接続するものとする。そして、１つのインス
タンスノードを追加する毎に、図８に示されるフローチ
ャートに基づいて、追加される当該インスタンスノード
に係るｈａｓ＿ａリンクの接続処理を実行する。First, with regard to the addition of an instance node, the name data stored in the name data storage means 3 is taken out in the order of the shortest character string, an instance node is created, and the instance node name is changed to the name of the node to be added. The algorithm shown by the flowchart in FIG. 7 is applied again. However, regarding the addition of the addition target node in step ST36 shown in FIG.
Instance_o is defined as the node Np and the node to be added.
It is assumed that they are connected by f link. Then, every time one instance node is added, based on the flowchart shown in FIG. 8, the connection process of the has_a link related to the added instance node is executed.

【００３９】図８は、ｈａｓ＿ａリンク接続処理の処理
工程を示すフローチャートである。このｈａｓ＿ａリン
ク接続処理においては、第１にインスタンスノードを格
納する局所変数Ｎｔをｎｕｌｌで初期化する（ステップ
ＳＴ４１）。次に、カウンタ変数ｉを用いて、以降の処
理を全インスタンスノード数だけ繰り返し実行する（ス
テップＳＴ４２）。そして、この繰り返し処理において
は、まず取り出されたインスタンスノードＮｉが、追加
対象インスタンスノードと同一であるか否かを判定する
（ステップＳＴ４３）。同一であれば、処理をステップ
ＳＴ４２へ移行する。FIG. 8 is a flowchart showing the processing steps of the has_a link connection processing. In this has_a link connection processing, first, a local variable Nt for storing an instance node is initialized with null (step ST41). Next, using the counter variable i, the subsequent processing is repeatedly executed for all instance nodes (step ST42). In this repetitive processing, first, it is determined whether or not the extracted instance node Ni is the same as the addition target instance node (step ST43). If they are the same, the process moves to step ST42.

【００４０】取り出されたインスタンスノードＮｉが追
加対象インスタンスノードと同一でない場合には、イン
スタンスノードＮｉのインスタンスノード名が追加対象
インスタンスノードのインスタンスノード名の先頭に含
まれているか否かを判定する（ステップＳＴ４４）。例
えば、インスタンスノードＮｉのインスタンス名が「Ａ
復水器」の場合に、追加対象インスタンスノードのイン
スタンス名が「Ａ復水器真空ポンプ」であればｙｅｓと
判定され、「Ｂ復水器真空ポンプ」であればｎｏと判定
される。ステップＳＴ４４においてｎｏと判定された場
合には、処理をステップＳＴ４２へ移行する。If the extracted instance node Ni is not the same as the instance node to be added, it is determined whether or not the instance node name of the instance node Ni is included at the beginning of the instance node name of the instance node to be added ( Step ST44). For example, if the instance name of the instance node Ni is “A
In the case of "condenser", if the instance name of the addition target instance node is "A condenser vacuum pump", it is determined as yes, and if it is "B condenser vacuum pump", it is determined as no. If it is determined as no in step ST44, the process proceeds to step ST42.

【００４１】ステップＳＴ４４においてｙｅｓと判定さ
れた場合には、インスタンスノードＮｉのインスタンス
名の文字数が局所変数Ｎｔに格納されているインスタン
スノードのインスタンス名の文字数より長いか否かを判
定する（ステップＳＴ４５）。長い場合には、ＮｉをＮ
ｔに置き換える（ステップＳＴ４６）。また、短い場合
には、何ら処理を実行しない。これにより、追加対象イ
ンスタンスノードのインスタンス名の先頭に、インスタ
ンス名が含まれているインスタンスノードＮｉのなかで
インスタンスノード名の文字数が最も長いインスタンス
ノードが検出される。If it is determined "yes" in step ST44, it is determined whether or not the number of characters of the instance name of the instance node Ni is longer than the number of characters of the instance name of the instance node stored in the local variable Nt (step ST45). ). In the case of long, Ni
Replace with t (step ST46). If it is short, no processing is performed. As a result, the instance node having the longest number of characters of the instance node name among the instance nodes Ni including the instance name is detected at the head of the instance name of the addition target instance node.

【００４２】以上のステップＳＴ４３からステップＳＴ
４６の処理を全インスタンスノードに対して実行した後
に、変数Ｎｔにインスタンスノードが格納されているか
否かを判定する（ステップＳＴ４７）。Ｎｔにインスタ
ンスノードが格納されている場合、すなわちＮｔがｎｕ
ｌｌでない場合には、Ｎｔに格納されているインスタン
スノードから追加対象インスタンスノードへｈａｓ＿ａ
リンクを張る（ステップＳＴ４８）。また、Ｎｔにイン
スタンスノードが格納されていない場合には、リンクは
張らない。The above steps ST43 to ST
After executing the process of 46 for all the instance nodes, it is determined whether or not the instance nodes are stored in the variable Nt (step ST47). When an instance node is stored in Nt, that is, when Nt is nu
If it is not 11_has_a from the instance node stored in Nt to the instance node to be added
A link is established (step ST48). If no instance node is stored in Nt, no link is established.

【００４３】図２に示される例において、入力データ１
に含まれる全名称データすなわち全インスタンスノード
に対して、図４に示されるステップＳＴ５に係り図７に
示される階層作成処理および図８に示されるｈａｓ＿ａ
リンク接続処理を適用することで、図３に示される階層
構造のなかでステップＳＴ４で作成されたクラス階層構
造を除く残りの全てのノードおよびリンクを作成するこ
とができる。なお、図３では、スペースの関係上ｈａｓ
＿ａリンクは「Ａ復水器」に関わるものしか表示されて
いないが、実際には「Ｂ復水器」に関わるものも同様に
作成される。In the example shown in FIG.
Are related to step ST5 shown in FIG. 4 with respect to all the name data included in, and the hierarchy creation processing shown in FIG. 7 and has_a shown in FIG.
By applying the link connection processing, all the nodes and links other than the class hierarchical structure created in step ST4 in the hierarchical structure shown in FIG. 3 can be created. In addition, in FIG.
Only the link related to “A condenser” is displayed as the _a link, but the link related to “B condenser” is actually created similarly.

【００４４】以上のように、この実施の形態１によれ
ば、入力される名称データを蓄積する名称データ記憶手
段３と、構造化データベースを格納する階層構造記憶手
段７と、名称データ記憶手段３に蓄積されている名称デ
ータを読み込んで、名称データ間において名称データを
成す文字列に対する先頭からの最長一致比較および末尾
からの最長一致比較を実施してノード名候補を抽出する
ノード名候補抽出手段４と、適正なノード名をユーザに
選択させるノード名選択手段５と、ノード名選択手段５
により選択されたノード名を有するノードおよび名称デ
ータ記憶手段３に蓄積されている名称データをノード名
として有するノードを階層的に配列するとともに、各ノ
ード間の相互関係を規定して構造化データベースを作成
する階層構造作成手段６とを備えるように構成したの
で、入力された名称データから階層構造の構成要素とな
るノードの候補を表現する文字列として与えられるノー
ド名候補を自動的に抽出して、ユーザが選択したノード
名および名称データを用いて自動的に階層構造を作成す
ることができるから、ユーザが予め単語データベースを
用意する必要がなく少ない労力で効率的に構造化データ
ベースを作成することができるという効果を奏する。As described above, according to the first embodiment, name data storage means 3 for storing input name data, hierarchical structure storage means 7 for storing a structured database, and name data storage means 3 Node name candidate extracting means for reading the name data stored in the storage device and performing the longest match comparison from the beginning and the longest match comparison from the end with respect to the character strings constituting the name data between the name data to extract the node name candidates 4, node name selecting means 5 for allowing a user to select an appropriate node name, and node name selecting means 5
The node having the node name selected by the above and the node having the name data stored in the name data storage means 3 as the node name are arranged hierarchically, and the mutual relation between the nodes is defined to form the structured database. Since it is configured to include the hierarchical structure creating means 6 for creating, a node name candidate given as a character string representing a candidate of a node to be a component of the hierarchical structure is automatically extracted from the input name data. Since it is possible to automatically create a hierarchical structure using node names and name data selected by the user, it is not necessary for the user to prepare a word database in advance, and to efficiently create a structured database with a small amount of labor. This has the effect that it can be performed.

【００４５】実施の形態２．この発明の実施の形態１に
よる構造化データベース作成装置においては、名称デー
タ同士の文字列比較によりノード名候補の抽出を行って
いるが、名称データに含まれている入力ミス等に起因し
て、正しいノード名を抽出できない場合がある。例え
ば、図２に示される入力データ１内の名称データについ
て、「Ｂ復水器」が「Ａ復水器」と間違って入力されて
いた（すなわち「Ａ復水器」が２つ存在する）場合に
は、末尾からの最長一致比較処理を実行すると、「Ａ復
水器」という名称が抽出されて「復水器」という名称は
抽出されない。このような場合には、ノード名選択手段
５を用いてノード名候補「Ａ復水器」を破棄することは
できるが、ノード名「復水器」を作成することはできな
い。Embodiment 2 In the structured database creation device according to the first embodiment of the present invention, node name candidates are extracted by comparing character strings between name data. However, due to an input error or the like included in the name data, The correct node name may not be extracted. For example, regarding the name data in the input data 1 shown in FIG. 2, “B condenser” was incorrectly inputted as “A condenser” (that is, two “A condensers” existed). In this case, when the longest match comparison process is performed from the end, the name “A condenser” is extracted and the name “condenser” is not extracted. In such a case, the node name candidate "A condenser" can be discarded using the node name selecting means 5, but the node name "condenser" cannot be created.

【００４６】図９は、このような問題に対処するために
なされた本願発明の実施の形態２による構造化データベ
ース作成装置の構成を示す図である。図９において、図
１と同一符号は同一または相当部分を示すのでその説明
を省略する。３１は、階層構造記憶手段７に格納された
構造化データベースをユーザが編集可能とする階層構造
編集手段である。また、３２は名称データ記憶手段３、
ノード名候補抽出手段４、ノード名選択手段５、階層構
造作成手段６、階層構造記憶手段７および階層構造編集
手段３１を有して構成される構造化データベース作成装
置である。FIG. 9 is a diagram showing a configuration of a structured database creating apparatus according to a second embodiment of the present invention which has been made to address such a problem. 9, the same reference numerals as those in FIG. 1 denote the same or corresponding parts, and a description thereof will not be repeated. Reference numeral 31 denotes a hierarchical structure editing unit that enables a user to edit the structured database stored in the hierarchical structure storage unit 7. 32 is a name data storage means 3,
This is a structured database creating apparatus including a node name candidate extracting unit 4, a node name selecting unit 5, a hierarchical structure creating unit 6, a hierarchical structure storing unit 7, and a hierarchical structure editing unit 31.

【００４７】階層構造編集手段３１は、ＧＵＩ（Ｇｒａ
ｐｈｉｃａｌＵｓｅｒＩｎｔｅｒｆａｃｅ）等を用
いてユーザにより対話的に与えられる階層構造のノード
およびリンクの作成、変更、削除等に係るコマンドにし
たがって階層構造記憶手段７に格納された階層構造を変
更する。これにより、上記のような場合にも、「Ａ復水
器」を選択して階層構造に追加した後に「復水器」に変
更するか、あるいは「Ａ復水器」を破棄して階層構造に
新たに「復水器」を作成することにより、目的の階層構
造を作成することができる。The hierarchical structure editing means 31 provides a GUI (Gra)
The hierarchical structure stored in the hierarchical structure storage unit 7 is changed in accordance with commands relating to creation, change, deletion, etc. of nodes and links having a hierarchical structure interactively given by the user using a physical user interface). As a result, even in the case described above, the “A condenser” is selected and added to the hierarchical structure and then changed to the “condenser”, or the “A condenser” is discarded and the hierarchical structure is discarded. By creating a new “condenser”, a desired hierarchical structure can be created.

【００４８】以上のように、この実施の形態２によれ
ば、階層構造記憶手段７に格納された構造化データベー
スをユーザが編集可能とする階層構造編集手段３１を備
えるように構成したので、ユーザが階層構造についてノ
ードおよびリンクの作成、変更、削除等の編集を行うこ
とができるから、名称データの入力ミス等によって正し
いノード名候補の抽出が阻害された場合にも、正しい階
層構造を作成することができるという効果を奏する。As described above, according to the second embodiment, the structured database stored in the hierarchical structure storage means 7 is provided with the hierarchical structure editing means 31 which enables the user to edit the structured database. Can edit, create, change, delete, etc. nodes and links in the hierarchical structure. Therefore, even if extraction of a correct node name candidate is hindered due to input error of name data, etc., a correct hierarchical structure is created. It has the effect of being able to do so.

【００４９】実施の形態３．大規模な設備データにおい
ては、複数の人間によって大量のデータが作成されるた
めに、用語の統一が取れていない場合が多い。また、入
力文字数制限等の理由により、部分的に省略名が用いら
れることもある。例えば、「水張りポンプ」の「水張
り」が「水張」、「水漲り」、「水漲」と表記されるこ
とや、「蒸気発生器」が「Ｓ／Ｇ」、「ＳＧ」と表記さ
れることがある。このような同義語が用いられる場合に
は、この発明の実施の形態１による構造化データベース
８では、本来同じ設備を示す「蒸気発生器」、「Ｓ／
Ｇ」、「ＳＧ」はそれぞれ別の設備と判断され、階層構
造には別個のノードとして追加されてしまう。Embodiment 3 In large-scale facility data, since a large amount of data is created by a plurality of persons, terms are often not unified. Further, an abbreviated name may be partially used for reasons such as limitation of the number of input characters. For example, "water filling" of "water pump" is described as "water filling", "water filling", "water filling", and "steam generator" is written as "S / G", "SG". Sometimes. When such synonyms are used, in the structured database 8 according to the first embodiment of the present invention, “steam generator”, “S /
G ”and“ SG ”are determined to be separate facilities, and are added as separate nodes to the hierarchical structure.

【００５０】図１０は、このような問題に対処するため
になされた本願発明の実施の形態３による構造化データ
ベース作成装置の構成を示す図である。図１０におい
て、４１は種々のユーザインタフェース機能を用いてユ
ーザによる同義語の定義を可能とする同義語定義手段、
４２は同義語定義手段４１を用いて定義された同義語を
蓄積する同義語情報記憶手段、４３は同義語情報記憶手
段４２に蓄積された同義語情報を参照して名称データお
よびノード名を同じ意義を有する代表的な用語に変換す
る同義語変換手段である。また、４４は名称データ記憶
手段３、ノード名候補抽出手段４、ノード名選択手段
５、階層構造作成手段６、階層構造記憶手段７、同義語
定義手段４１、同義語情報記憶手段４２および同義語変
換手段４３を有して構成される構造化データベース作成
装置である。FIG. 10 is a diagram showing a configuration of a structured database creating apparatus according to a third embodiment of the present invention which has been made to address such a problem. In FIG. 10, reference numeral 41 denotes a synonym definition unit that enables a user to define a synonym using various user interface functions;
Reference numeral 42 denotes a synonym information storage unit that stores synonyms defined using the synonym definition unit 41. Reference numeral 43 denotes the same name data and node name by referring to the synonym information stored in the synonym information storage unit 42. This is a synonym conversion means for converting into a meaningful representative term. Reference numeral 44 denotes name data storage means 3, node name candidate extraction means 4, node name selection means 5, hierarchical structure creation means 6, hierarchical structure storage means 7, synonym definition means 41, synonym information storage means 42, and synonyms. This is a structured database creation device having a conversion unit 43.

【００５１】同義語定義手段４１は、ユーザが予めまた
は構造化データベースを作成する過程において対話的に
同義語情報を定義できる機能を有する。図１１は、同義
語情報の例を示す図である。これら同義語情報は、同義
語定義手段４１を用いて定義されて、同義語情報記憶手
段４２に蓄積されるものである。The synonym definition means 41 has a function that allows a user to define synonym information interactively in advance or in the process of creating a structured database. FIG. 11 is a diagram illustrating an example of the synonym information. These synonym information is defined using the synonym definition means 41 and is stored in the synonym information storage means 42.

【００５２】図１１において、４５は同義語の集合毎に
付与される同義語情報の識別番号、４６は当該同義語の
集合を構成して互いに同じ意義を有する個々の同義語を
羅列することで与えられる定義内容を示すものである。
定義内容としては、同義語情報Ｌ１０１では［「蒸気発
生器」は「Ｓ／Ｇ」、「ＳＧ」と同義である］ことを意
味し、同義語情報Ｌ１０２では［「水張り」は「水
張」、「水漲り」、「水漲」と同義である］ことを意味
する。なお、定義内容４６において用いられている下線
については、当該下線が引かれている「蒸気発生器」、
「水張り」がそれぞれ同義語の集合を代表する用語であ
ることを示している。In FIG. 11, reference numeral 45 denotes an identification number of synonym information assigned to each set of synonyms, and reference numeral 46 denotes a list of synonyms constituting the set of synonyms and listing individual synonyms having the same meaning. It shows the definition contents given.
The definition content means that “steam generator” is synonymous with “S / G” and “SG” in the synonym information L101, and “[water-filled] is“ water-filled ”in the synonym information L102. Synonymous with "water filling" and "water filling". As for the underline used in the definition 46, the underlined "steam generator",
"Water-filled" indicates that each is a term representing a set of synonyms.

【００５３】次に動作について説明する。同義語変換手
段４３は、ノード名選択手段５を用いて選択されたノー
ド名および名称データ記憶手段３から読み込まれる名称
データについて、同義語情報記憶手段４２を参照して、
当該ノード名および名称データ、またはノード名および
名称データに部分的に含まれる用語が同義語として登録
されている場合には、該当する用語を同義語を代表する
用語に統一的に変換する。Next, the operation will be described. The synonym conversion unit 43 refers to the synonym information storage unit 42 for the node name selected using the node name selection unit 5 and the name data read from the name data storage unit 3,
If the node name and the name data or a term partially included in the node name and the name data is registered as a synonym, the corresponding term is uniformly converted to a term representing the synonym.

【００５４】以上のように、この実施の形態３によれ
ば、ユーザによる同義語の定義を可能とする同義語定義
手段４１と、同義語定義手段４１を用いて定義された同
義語を蓄積する同義語情報記憶手段４２と、同義語情報
記憶手段４２に蓄積された同義語情報を参照して名称デ
ータおよびノード名を同じ意義を有する代表的な用語に
変換する同義語変換手段４３とを備えるように構成した
ので、入力データ１内に同じ意義を有する複数の用語が
混在していたとしても、これら同義語を代表する１つの
用語に統一することができ、同一の機能または仕様を有
する設備等を階層構造上において同じノードとしてまと
めることができるから、より整備された構造化データベ
ースを作成することができるという効果を奏する。As described above, according to the third embodiment, the synonym definition means 41 that enables the user to define synonyms, and the synonyms defined by using the synonym definition means 41 are stored. A synonym information storage unit 42; and a synonym conversion unit 43 that converts the name data and the node name into representative terms having the same meaning by referring to the synonym information stored in the synonym information storage unit 42. With such a configuration, even if a plurality of terms having the same meaning are mixed in the input data 1, they can be unified into one term representing these synonyms, and facilities having the same function or specification can be used. And the like can be grouped as the same node on the hierarchical structure, so that an improved structured database can be created.

【００５５】実施の形態４．大規模なデータベースにお
いては、様々な名称データが扱われており、それらの中
には例えば「復水器側○○ポンプ」という表現のものも
ある。これは、復水器の付属設備ではなく、単に復水器
の側にあるという意味であるが、このような名称が与え
られた場合に、この発明の実施の形態１による構造化デ
ータベース作成装置８では、「復水器」との先頭からの
最長一致比較により、「側○○ポンプ」というノード名
候補が抽出される。通常は、ユーザがこれはノード名
（すなわち設備名）としては適正ではないと判断して破
棄するために階層構造作成に影響はないが、このような
ノード名候補が数多く選択候補として現れると、ユーザ
の選択作業に係る負担が大きくなる。なお、同様の表現
としては、「〜用〜」といったものもある。Embodiment 4 In a large-scale database, various name data are handled, and among them, for example, there is one having the expression “condenser side OO pump”. This means that it is not the accessory equipment of the condenser, but simply the condenser side. If such a name is given, the structured database creating apparatus according to the first embodiment of the present invention will be described. In 8, the longest match comparison from the top with "condenser" extracts a node name candidate "side xx pump". Usually, the user determines that this is not appropriate as a node name (namely, equipment name) and discards it, so that there is no influence on the hierarchical structure creation. However, when many such node name candidates appear as selection candidates, The burden associated with the user's selection work increases. In addition, as a similar expression, there is a phrase such as “for”.

【００５６】図１２は、このような問題に対処するため
になされた本願発明の実施の形態４による構造化データ
ベース作成装置の構成を示す図である。図１２におい
て、５１はノード名候補抽出の際に適用される抽出補助
規則を格納する抽出補助規則記憶手段、５２は名称デー
タ記憶手段３に蓄積されている名称データを読み込ん
で、各名称データ毎に他の全ての名称データとの間で名
称データを成す文字列に対する最長一致比較を実施して
ノード名候補を抽出するとともに、得られたノード名候
補の抽出過程が抽出補助規則記憶手段５１に蓄積されて
いるいずれかの規則に該当する場合には当該ノード名候
補を破棄するノード名候補抽出手段である。また、５３
は名称データ記憶手段３、ノード名選択手段５、階層構
造作成手段６、階層構造記憶手段７、抽出補助規則記憶
手段５１およびノード名候補抽出手段５２を有して構成
される構造化データベース作成装置である。FIG. 12 is a diagram showing a configuration of a structured database creating apparatus according to a fourth embodiment of the present invention which has been made to address such a problem. In FIG. 12, reference numeral 51 denotes an extraction auxiliary rule storage unit for storing an extraction auxiliary rule applied at the time of extracting a node name candidate, and 52 reads the name data stored in the name data storage unit 3 and reads each name data. The node name candidate is extracted by performing the longest match comparison on the character string constituting the name data with all the other name data, and the extraction process of the obtained node name candidate is stored in the extraction auxiliary rule storage means 51. A node name candidate extracting means for discarding the node name candidate when any of the stored rules is satisfied. Also, 53
Is a structured database creating apparatus comprising name data storing means 3, node name selecting means 5, hierarchical structure creating means 6, hierarchical structure storing means 7, extraction auxiliary rule storing means 51 and node name candidate extracting means 52 It is.

【００５７】抽出補助規則記憶手段５１に蓄積されてい
る補助規則としては、例えば［先頭からの最長一致比較
において、残りの文字列が「側〜」や「用〜」となる場
合には、その部位で文字列を切ってはならない］という
ものを定義する。実際には、ノード名候補抽出手段５２
は、図５に示されるステップＳＴ１８の処理の際に、抽
出補助規則記憶手段５１に蓄積された抽出補助規則に該
当した場合には、一致文字数を０にするとともに残りの
文字列をＤｉ，Ｄｊそのものとする。これにより、抽出
補助規則に該当した場合には、その文字列が最長一致条
件を満たす場合でもこの段階で棄却されることになる。
その他、設備名としてその部位で切れるのが不自然であ
るという部位が予め判明している場合には、当該部位で
文字列を切ることを禁止する内容の抽出補助規則を抽出
補助規則記憶手段５１に格納しておくとよい。The auxiliary rules stored in the extraction auxiliary rule storage means 51 include, for example, [in the longest match comparison from the beginning, when the remaining character strings are “side-” or “use- Do not cut the character string at the part]. Actually, the node name candidate extracting means 52
In the processing of step ST18 shown in FIG. 5, if the extraction auxiliary rule stored in the extraction auxiliary rule storage unit 51 is satisfied, the number of matching characters is set to 0 and the remaining character strings are set to Di, Dj. It is the same. As a result, when the character string matches the extraction assisting rule, even if the character string satisfies the longest matching condition, the character string is rejected at this stage.
In addition, if it is known in advance that the part that is cut off at that part is unnatural as the equipment name, the extraction auxiliary rule storage unit 51 stores an extraction auxiliary rule that prohibits the character string from being cut at that part. It is good to store in.

【００５８】以上のように、この実施の形態４によれ
ば、ノード名候補抽出の際に適用される抽出補助規則を
格納する抽出補助規則記憶手段５１と、名称データ記憶
手段３に蓄積されている名称データを読み込んで、各名
称データ毎に他の全ての名称データとの間で名称データ
を成す文字列に対する最長一致比較を実施してノード名
候補を抽出するとともに、得られたノード名候補の抽出
過程が抽出補助規則記憶手段５１に蓄積されているいず
れかの規則に該当する場合には当該ノード名候補を破棄
するノード名候補抽出手段５２とを備えるように構成し
たので、ノード名として不適正なノード名候補をユーザ
による選択以前に排除できるから、ユーザの選択作業に
係る負担を軽減できて、構造化データベースの作成作業
をより効率化することができるという効果を奏する。As described above, according to the fourth embodiment, the extraction auxiliary rule storage means 51 for storing the extraction auxiliary rules applied when extracting the node name candidates and the name data storage means 3 Read name data, perform a longest match comparison on a character string forming name data with all other name data for each name data, extract node name candidates, and obtain the obtained node name candidates. If the extraction process corresponds to any of the rules stored in the extraction auxiliary rule storage means 51, the node name candidate extraction means 52 for discarding the node name candidate is provided. Inappropriate node name candidates can be eliminated before selection by the user, so that the burden of user selection can be reduced and the efficiency of creating a structured database can be increased. There is an effect that it is.

【００５９】実施の形態５．大規模なデータベースにお
いては、実施の形態３について述べた同義語に似た問題
として、表記のブレの問題がある。これは、同じカタカ
ナ表記による用語について一方では全角で表記され他方
では半角で表記されている場合、あるいは例えば「ター
ビン」という用語について一方では「ー」という長音記
号を用いて表記され他方では「−」というハイフンを用
いて表記される場合等で、同一の用語に対して異なる表
記が対応する問題として与えられるものである。Embodiment 5 FIG. In a large-scale database, as a problem similar to the synonym described in the third embodiment, there is a problem of notation blur. This is the case when the terms in the same katakana notation are written on the one hand in full-width and on the other hand in half-width, or, for example, the term "turbine" is written on the one hand using the long sign "-" and on the other hand "- In this case, a different notation is given as a problem corresponding to the same term.

【００６０】上記のような場合に、この発明の実施の形
態１による構造化データベース作成装置８では、ノード
名候補抽出手段４による文字列比較を実行するに際し
て、半角カタカナと全角カタカナ、または「タービン」
と「タ−ビン」とは同一のノード名候補とは判定され
ず、それぞれ別個のノード名候補として抽出される。In such a case, the structured database creating apparatus 8 according to the first embodiment of the present invention performs a half-width katakana and a full-width katakana or a "turbine""
And “turbin” are not determined to be the same node name candidate, but are extracted as separate node name candidates.

【００６１】このような問題に対しては、実施の形態３
で述べた同義語情報として「タービン」と「タ−ビン」
とを同義語として定義することにより対処することが可
能である。しかし、この方法によると、長音記号「ー」
を含む全ての用語に対して同様の定義を行う必要があ
り、同義語定義に係るユーザの負担が増大する。To solve such a problem, Embodiment 3
"Turbine" and "turbine" as synonym information described in
Can be dealt with by defining as a synonym. However, according to this method, the long clef "-"
It is necessary to perform the same definition for all terms including, and the burden on the user for synonym definition increases.

【００６２】図１３は、このような問題に対処するため
になされた本願発明の実施の形態５による構造化データ
ベース作成装置の構成を示す図である。図１３におい
て、６１は名称データに含まれる用語に対する補正方法
を定義した名称補正規則を蓄積する名称補正規則記憶手
段、６２は入力データ１に含まれる各名称データに対し
て名称補正規則記憶手段６１に格納されている名称補正
規則を適用して、当該規則に基づいて変換された名称デ
ータを名称データ記憶手段３に蓄積する名称補正手段で
ある。また、６３は名称データ記憶手段３、ノード名候
補抽出手段４、ノード名選択手段５、階層構造作成手段
６、階層構造記憶手段７、名称補正規則記憶手段６１お
よび名称補正手段６２を有して構成される構造化データ
ベース作成装置である。FIG. 13 is a diagram showing a configuration of a structured database creating apparatus according to a fifth embodiment of the present invention which has been made to address such a problem. In FIG. 13, reference numeral 61 denotes a name correction rule storage unit that stores a name correction rule that defines a correction method for a term included in name data, and 62 denotes a name correction rule storage unit 61 for each name data included in the input data 1. Is a name correction unit that applies a name correction rule stored in the name data storage unit and accumulates name data converted based on the rule in the name data storage unit 3. Reference numeral 63 denotes a name data storage unit 3, a node name candidate extraction unit 4, a node name selection unit 5, a hierarchical structure creation unit 6, a hierarchical structure storage unit 7, a name correction rule storage unit 61, and a name correction unit 62. This is a structured database creation device to be configured.

【００６３】名称補正規則記憶手段６１に蓄積されてい
る名称補正規則としては、例えば「半角カタカナは全角
カタカナに変更する」、「長音記号は全角ハイフンに変
更する」といったものが定義して与えられる。名称補正
手段６２は、上記名称補正規則を名称データに適用し
て、例えば半角カタカナを全て全角カタカナに変更し、
また長音記号を全て全角ハイフンに変換する。As the name correction rules stored in the name correction rule storage means 61, for example, "half-width katakana is changed to full-width katakana" and "long-tone symbols are changed to full-width hyphens" are defined and given. . The name correction unit 62 applies the above-described name correction rule to the name data, for example, changes all half-width katakana to full-width katakana,
In addition, all long sounds are converted to full-width hyphens.

【００６４】以上のように、この実施の形態５によれ
ば、名称データに含まれる用語に対する補正方法を定義
した名称補正規則を蓄積する名称補正規則記憶手段６１
と、入力データ１に含まれる各名称データに対して名称
補正規則記憶手段６１に格納されている名称補正規則を
適用して、当該規則に基づいて変換された名称データを
名称データ記憶手段３に蓄積する名称補正手段６２とを
備えるように構成したので、名称データに表記のブレが
存在する場合にも、名称補正規則を適用することで名称
データの表記を統一的に補正することができるから、よ
り整備された構造化データベースを作成することができ
るという効果を奏する。また、ユーザが名称データに係
る表記を補正する作業を行う必要がないので、ユーザの
負担を軽減することができて、構造化データベースの作
成作業をより効率化することができるという効果を奏す
る。As described above, according to the fifth embodiment, the name correction rule storage means 61 for storing the name correction rule defining the correction method for the term included in the name data.
And applying the name correction rule stored in the name correction rule storage unit 61 to each name data included in the input data 1 and storing the name data converted based on the rule in the name data storage unit 3. Since the configuration is provided with the accumulating name correction means 62, even when the name data has blurring of the notation, it is possible to uniformly correct the notation of the name data by applying the name correction rule. Thus, there is an effect that a more organized structured database can be created. Further, since it is not necessary for the user to perform the work of correcting the notation related to the name data, the burden on the user can be reduced, and the effect of creating the structured database more efficiently can be achieved.

【００６５】[0065]

【発明の効果】以上のように、この発明によれば、入力
される名称データを蓄積する名称データ記憶手段と、構
造化データベースを格納する階層構造記憶手段と、名称
データ記憶手段に蓄積されている名称データを読み込ん
で、各名称データ毎に他の全ての名称データとの間で名
称データを成す文字列に対する先頭からの最長一致比較
および末尾からの最長一致比較をそれぞれ実施して、ノ
ード名候補を抽出するノード名候補抽出手段と、ノード
名候補から構造化データベースを構成する構成要素とし
て適正なノードのノード名をユーザに選択させるノード
名選択手段と、ノード名選択手段により選択されたノー
ド名を有するノードおよび名称データ記憶手段に蓄積さ
れている名称データをノード名として有するノードを階
層的に配列するとともに、各ノード間の相互関係を規定
して構造化データベースを作成する階層構造作成手段と
を備えるように構成したので、入力された名称データか
ら階層構造の構成要素となるノードの候補を表現する文
字列として与えられるノード名候補を自動的に抽出し
て、ユーザが選択したノード名および名称データを用い
て自動的に階層構造を作成することができるから、ユー
ザが予め単語データベースを用意する必要がなく少ない
労力で効率的に構造化データベースを作成することがで
きるという効果を奏する。As described above, according to the present invention, the name data storage means for storing the input name data, the hierarchical structure storage means for storing the structured database, and the name data storage means Read the name data, and perform the longest match comparison from the beginning and the longest match comparison from the end with respect to the character string forming the name data with all other name data for each name data, and execute the node name. Node name candidate extracting means for extracting candidates, node name selecting means for allowing a user to select a node name of an appropriate node as a constituent element of the structured database from the node name candidates, and a node selected by the node name selecting means When nodes having names and nodes having name data stored in the name data storage means as node names are hierarchically arranged. In addition, since it is configured to include a hierarchical structure creating means for creating a structured database by defining the interrelationship between the nodes, a candidate of a node to be a component of the hierarchical structure is represented from the input name data. A node name candidate given as a character string to be extracted is automatically extracted, and a hierarchical structure can be automatically created using the node name and name data selected by the user. Therefore, the user prepares a word database in advance. There is an effect that a structured database can be efficiently created with no need and with little labor.

【００６６】この発明によれば、階層構造記憶手段に格
納された構造化データベースをユーザが編集可能とする
階層構造編集手段を備えるように構成したので、ユーザ
が階層構造についてノードおよびリンクの作成、変更、
削除等の編集を行うことができるから、名称データの入
力ミス等によって正しいノード名候補の抽出が阻害され
た場合にも、正しい階層構造を作成することができると
いう効果を奏する。According to the present invention, since the structured database stored in the hierarchical structure storage means is provided with the hierarchical structure editing means for enabling the user to edit, the user can create nodes and links for the hierarchical structure, Change,
Since editing such as deletion can be performed, there is an effect that a correct hierarchical structure can be created even when extraction of a correct node name candidate is hindered due to an input error of name data or the like.

【００６７】この発明によれば、ユーザによる同義語の
定義を可能とする同義語定義手段と、同義語定義手段を
用いて定義された同義語を蓄積する同義語情報記憶手段
と、同義語情報記憶手段に蓄積された同義語情報を参照
して名称データおよびノード名を同じ意義を有する代表
的な用語に変換する同義語変換手段とを備えるように構
成したので、入力データを構成する複数の名称データ内
に同じ意義を有する複数の用語が混在していたとして
も、これら同義語を代表する１つの用語に統一すること
ができ、同一の機能または仕様を有する設備等を階層構
造上において同じノードとしてまとめることができるか
ら、より整備された構造化データベースを作成すること
ができるという効果を奏する。According to the present invention, synonym definition means for enabling a user to define a synonym, synonym information storage means for storing synonyms defined using the synonym definition means, and synonym information And synonym conversion means for converting the name data and the node name into representative terms having the same meaning by referring to the synonym information stored in the storage means. Even if a plurality of terms having the same meaning are mixed in the name data, they can be unified into one term representing these synonyms, and facilities having the same function or specification are the same on the hierarchical structure. Since the nodes can be grouped together, there is an effect that a more organized structured database can be created.

【００６８】この発明によれば、ノード名候補抽出の際
に適用される抽出補助規則を蓄積する抽出補助規則記憶
手段と、名称データ記憶手段に蓄積されている名称デー
タを読み込んで、各名称データ毎に他の全ての名称デー
タとの間で名称データを成す文字列に対する最長一致比
較を実施してノード名候補を抽出するとともに、得られ
たノード名候補の抽出過程が抽出補助規則記憶手段に蓄
積されているいずれかの規則に該当する場合には当該ノ
ード名候補を破棄するノード名候補抽出手段とを備える
ように構成したので、ノード名として不適正なノード名
候補をユーザによる選択以前に排除できるから、ユーザ
の選択作業に係る負担を軽減できて、構造化データベー
スの作成作業をより効率化することができるという効果
を奏する。According to the present invention, the extraction auxiliary rule storage means for storing the extraction auxiliary rules applied when extracting the node name candidates, and the name data stored in the name data storage means are read, and each name data is read. In each case, the longest match comparison is performed on a character string constituting name data with all other name data to extract node name candidates, and the extraction process of the obtained node name candidates is stored in the extraction auxiliary rule storage means. A node name candidate extracting means for discarding the node name candidate when any of the stored rules is applicable, so that an incorrect node name candidate as a node name is selected before the user selects it. Since the elimination can be eliminated, it is possible to reduce the burden of the user on the selection operation, and to achieve an effect of making the operation of creating the structured database more efficient.

【００６９】この発明によれば、名称データに含まれる
用語に対する補正方法を定義した名称補正規則を蓄積す
る名称補正規則記憶手段と、入力データを構成する各名
称データに対して名称補正規則記憶手段に格納されてい
る名称補正規則を適用して、当該名称補正規則に基づい
て変換された名称データを名称データ記憶手段に蓄積す
る名称補正手段とを備えるように構成したので、名称デ
ータに表記のブレが存在する場合にも、名称補正規則を
適用することで名称データの表記を統一的に補正するこ
とができるから、より整備された構造化データベースを
作成することができるという効果を奏する。また、ユー
ザが名称データに係る表記を補正する作業を行う必要が
ないので、ユーザの負担を軽減することができて、構造
化データベースの作成作業をより効率化することができ
るという効果を奏する。According to the present invention, a name correction rule storing means for storing a name correction rule defining a correction method for a term included in name data, and a name correction rule storing means for each name data constituting input data And a name correction means for accumulating the name data converted based on the name correction rule in the name data storage means by applying the name correction rule stored in the name data. Even if there is blur, the notation of the name data can be uniformly corrected by applying the name correction rule, so that a more structured database can be created. Further, since it is not necessary for the user to perform the work of correcting the notation related to the name data, the burden on the user can be reduced, and the effect of creating the structured database more efficiently can be achieved.

[Brief description of the drawings]

【図１】この発明の実施の形態１による構造化データ
ベース作成装置の構成を示す図である。FIG. 1 is a diagram showing a configuration of a structured database creation device according to a first embodiment of the present invention.

【図２】複数の名称データから成る入力データの一例
を示す図である。FIG. 2 is a diagram illustrating an example of input data including a plurality of name data.

【図３】構造化データベースにおける階層構造を示す
図である。FIG. 3 is a diagram showing a hierarchical structure in a structured database.

【図４】構造化データベース作成方法を示すフローチ
ャートである。FIG. 4 is a flowchart showing a structured database creation method.

【図５】ノード名候補抽出処理の処理工程を示すフロ
ーチャートである。FIG. 5 is a flowchart illustrating processing steps of a node name candidate extraction processing.

【図６】ノード名候補抽出処理の実行結果を示す図で
ある。FIG. 6 is a diagram illustrating an execution result of a node name candidate extraction process.

【図７】クラス階層作成処理の処理工程を示すフロー
チャートである。FIG. 7 is a flowchart illustrating processing steps of a class hierarchy creation processing.

【図８】ｈａｓ＿ａリンク接続処理の処理工程を示す
フローチャートである。FIG. 8 is a flowchart showing processing steps of a has_a link connection process.

【図９】この発明の実施の形態２による構造化データ
ベース作成装置の構成を示す図である。FIG. 9 is a diagram showing a configuration of a structured database creation device according to a second embodiment of the present invention.

【図１０】この発明の実施の形態３による構造化デー
タベース作成装置の構成を示す図である。FIG. 10 is a diagram showing a configuration of a structured database creation device according to a third embodiment of the present invention.

【図１１】同義語情報の例を示す図である。FIG. 11 is a diagram illustrating an example of synonym information.

【図１２】この発明の実施の形態４による構造化デー
タベース作成装置の構成を示す図である。FIG. 12 is a diagram showing a configuration of a structured database creation device according to a fourth embodiment of the present invention.

【図１３】この発明の実施の形態５による構造化デー
タベース作成装置の構成を示す図である。FIG. 13 is a diagram showing a configuration of a structured database creation device according to a fifth embodiment of the present invention.

【図１４】従来の構造化データベース作成装置の構成
を示す図である。FIG. 14 is a diagram showing a configuration of a conventional structured database creation device.

【符号の説明】１入力データ、２ユーザ、３名称データ記憶手
段、４ノード名候補抽出手段、５ノード名選択手
段、６階層構造作成手段、７階層構造記憶手段、
８，３２，４４，５３，６３構造化データベース作成
装置、１１クラスノード、１２インスタンスノー
ド、１３ｉｓ＿ａリンク、１４ルートノード、１５
ｉｎｓｔａｎｃｅ＿ｏｆリンク、１６ｈａｓ＿ａリ
ンク、２１先頭からの最長一致比較結果、２２末尾
からの最長一致比較結果、２３ノード名候補文字列リ
スト、３１階層構造編集手段、４１同義語定義手
段、４２同義語情報記憶手段、４３同義語変換手
段、４５同義語情報識別番号、４６同義語定義内容、
５１抽出補助規則記憶手段、５２ノード名候補抽出
手段、６１名称補正規則記憶手段、６２名称補正手
段。[Description of Signs] 1 input data, 2 users, 3 name data storage means, 4 node name candidate extraction means, 5 node name selection means, 6 hierarchical structure creation means, 7 hierarchical structure storage means,
8, 32, 44, 53, 63 Structured database creation device, 11 class node, 12 instance node, 13 is_a link, 14 root node, 15
instance_of link, 16 has_a link, 21 longest match comparison result from the beginning, 22 longest match comparison result from the end, 23 node name candidate character string list, 31 hierarchical structure editing means, 41 synonym definition means, 42 synonym information storage Means, 43 synonym conversion means, 45 synonym information identification number, 46 synonym definition contents,
51 extraction auxiliary rule storage means, 52 node name candidate extraction means, 61 name correction rule storage means, 62 name correction means.

Claims

[Claims]

1. Name data storage means for storing input name data, and a structured database configured by hierarchically arranging nodes corresponding to constituent elements and defining mutual relations between the nodes. A hierarchical structure storing means for storing, and reading the name data stored in the name data storing means, and for each name data, the longest character string from the beginning with respect to a character string forming name data with all other name data Node name candidate extracting means for performing a match comparison and a longest match comparison from the end to extract a node name candidate given as a character string representing a candidate for a node constituting the structured database; The node name candidates extracted by the extraction means are presented, and the node name candidates are used as components constituting the structured database. Node name selecting means for allowing a user to select a node name of a correct node; a node having the node name selected by the node name selecting means; and a node having name data stored in the name data storage means as a node name. And a hierarchical structure creating means for creating a structured database by arranging hierarchically and defining a mutual relationship between the nodes.

2. The structured database creating device according to claim 1, further comprising a hierarchical structure editing unit that allows a user to edit the structured database stored in the hierarchical structure storage unit.

3. A synonym definition means for enabling a user to define a synonym, a synonym information storage means for storing synonyms defined using the synonym definition means, and a synonym information storage means. 2. A structured database creating apparatus according to claim 1, further comprising: synonym conversion means for converting the name data and the node name into representative terms having the same meaning by referring to the synonym information stored in the database. .

4. An extraction assistance rule storage means for storing extraction assistance rules applied at the time of node name candidate extraction, and name data stored in the name data storage means,
For each name data, the longest match comparison from the beginning and the longest match comparison from the end are performed for the character strings constituting the name data with all the other name data, and candidates for nodes constituting the structured database are performed. Node name candidates given as a character string representing the character string. If the extraction process of the obtained node name candidates corresponds to one of the rules stored in the extraction auxiliary rule storage means, the node name candidate is extracted. 2. The structured database creating apparatus according to claim 1, further comprising a node name candidate extracting unit for discarding candidates.

5. A name correction rule storage means for storing a name correction rule defining a correction method for a term included in name data, and a name correction rule storage means for each name data constituting input data. 2. A structured database creation apparatus according to claim 1, further comprising: a name correction unit that applies a name correction rule, and stores name data converted based on the name correction rule in a name data storage unit. apparatus.