JP2000122903A

JP2000122903A - Method for processing information and computer system

Info

Publication number: JP2000122903A
Application number: JP11283048A
Authority: JP
Inventors: Guepner Juan Roldan; フアン・ロルダン・グェプナー
Original assignee: International Business Machines Corp
Current assignee: International Business Machines Corp
Priority date: 1998-10-10
Filing date: 1999-10-04
Publication date: 2000-04-28
Also published as: DE19948030A1; TW571203B; SG85680A1

Abstract

PROBLEM TO BE SOLVED: To provide a method for processing structured information which is to be stored in the data base of a computer system. SOLUTION: In the method, grammar 14 describing the structure of information is decided and a data base schemer 10 is derived from the grammar 14. A mapping rule is generated from the grammar 14 and structured information 16 is stored in a data base 12 in accordance with the schemer and the mapping rule. The method can be used for transferring data to the data base 12 and reading data from the data base 12.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、一般に、情報を処
理するための方法に関し、特に、データベースに記憶す
べき構造化情報を処理するための方法およびこのような
情報をデータベースに転送するための方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates generally to a method for processing information, and more particularly, to a method for processing structured information to be stored in a database and a method for transferring such information to a database. About the method.

【０００２】[0002]

【従来の技術】当事者Ａから発信される所与の量の情報
を当事者Ｂが読み取って処理しなければならない場合は
非常に多い。このような情報は膨大な種類の形式で存在
する可能性があり、数字、単語、文章、ならびにフリー
・テキスト、テーブルなどの様々な変形を含む可能性が
ある。現代では、このような情報は大量に発生し、受取
り側が迅速かつ確実に処理し理解する必要がある。ま
た、このような情報は、データ転送の受取り側のデータ
ベースに供給される場合が多い。このような情報を複数
の異なるサイトから受け取り、このような情報を大規模
データベース・システムに記憶している人は、まず、デ
ータベースからその情報を読み取らなければならず、そ
の後、記憶した「パッケージ」またはより一般的には大
量の情報を解析しなければならない。次にその人は、そ
の情報を理解することができる。情報を解析するこのよ
うなステップがなければ、記憶した情報には、データ・
キャリアに記憶されること以外の真の価値はない。この
ような情報パッケージはＢＬＯＢＳ（＝２進大型オブジ
ェクト, Binary Large Object）として知られている。
このようなＢＬＯＢＳは解析しなければならず、したが
って、その構造は判明済みであると判定されなければ、
誰もそれを読み取って、ＢＬＯＢＳに含まれる情報を理
解することはできない。BACKGROUND OF THE INVENTION Very often, a given amount of information originating from party A must be read and processed by party B. Such information can exist in a huge variety of formats and can include numbers, words, sentences, and various variations of free text, tables, and the like. In modern times, such information is generated in large quantities and needs to be quickly and reliably processed and understood by the recipient. In addition, such information is often supplied to a database on the data transfer receiving side. A person receiving such information from a number of different sites and storing such information in a large database system must first read the information from the database, and then store the "package" Or, more generally, a large amount of information must be analyzed. The person can then understand the information. Without such a step of analyzing the information, the stored information would contain data,
There is no real value other than being remembered in the carrier. Such an information package is known as BLOBS (= Binary Large Object).
Such a BLOBS must be analyzed and, therefore, if its structure is not determined to be known,
No one can read it and understand the information contained in BLOBS.

【０００３】多くのＢＬＯＢＳの内容を読み取り、デー
タベースの内容について行ったのと同様の方法でそれを
評価することが必要な場合に問題が発生する。このよう
な評価は、データベースの所与の内容に選択的にアクセ
スし、その結果が詳細な形式の照会済み情報のリストに
なり、個々の基準によって記憶されるような照会を含
む。複数のＢＬＯＢＳをデータベースに記憶する場合、
従来技術の技法では、各ＢＬＯＢＳごとに個々の解析プ
ロセスを実行しなければならず、それを事前に解析せず
に複数のＢＬＯＢＳに対して直ちに照会を向けることは
不可能である。したがって、大量の記憶情報に関する評
価はどのようなものでも多くの作業になり、大量の時間
を要するが、その情報は電子的に読取り可能な形式で記
憶される。A problem arises when it is necessary to read the contents of many BLOBS and evaluate them in the same way as was done for the contents of the database. Such evaluations include queries that selectively access a given content of a database, the result of which is a detailed list of queried information that is stored by individual criteria. When storing multiple BLOBS in a database,
With prior art techniques, a separate parsing process must be performed for each BLOBS, and it is not possible to immediately direct queries to multiple BLOBS without pre-parsing it. Thus, any assessment of a large amount of stored information can be a lot of work and time consuming, but the information is stored in an electronically readable form.

【０００４】上記を考慮すると、より快適かつより迅速
にコンピュータ・システムのデータベースに記憶すべき
大量の構造化情報を処理するための方法に対する必要性
が存在することは明らかなはずである。In view of the above, it should be apparent that a need exists for a method for processing large amounts of structured information that should be stored in a database of a computer system more comfortably and more quickly.

【０００５】[0005]

【発明が解決しようとする課題】したがって、本発明の
一目的は、コンピュータ・システムのデータベースに記
憶すべき構造化情報を処理するための改良された方法を
提供することにある。SUMMARY OF THE INVENTION It is, therefore, one object of the present invention to provide an improved method for processing structured information to be stored in a computer system database.

【０００６】本発明の他の目的は、外部サイトからデー
タベース内に情報を転送するための改良された方法を提
供することにある。It is another object of the present invention to provide an improved method for transferring information from an external site into a database.

【０００７】本発明のさらに他の目的は、本発明による
処理方法を実現するコンピュータ・プログラムを実行す
るように構成されたデータベース管理システムがインス
トールされたコンピュータ・システムを提供することに
ある。It is still another object of the present invention to provide a computer system in which a database management system configured to execute a computer program for realizing the processing method according to the present invention is installed.

【０００８】[0008]

【課題を解決するための手段】上記の目的を達成するた
めの主な考え方は、転送すべきファイル内容の構造を特
徴付ける所与の文法（grammar）からデータベース・ス
キーマ情報を直接導き出すことと、その文法からデータ
ベース・スキーマを直接作成することである。前記スキ
ーマ情報は、文法の助けを借りて解析する間に導き出さ
れる。マッピング規則は、所与の文法を固守する内容を
マッピングする方法を示すものであり、スキーマ情報を
導き出すのと同時に導き出される。The main idea to achieve the above object is to directly derive database schema information from a given grammar that characterizes the structure of the file contents to be transferred. Creating a database schema directly from the grammar. The schema information is derived during parsing with the help of grammar. The mapping rules indicate how to map content that adheres to a given grammar, and are derived at the same time as the schema information is derived.

【０００９】[0009]

【発明の実施の形態】次に添付図面を全般的に参照し、
特に図１を参照すると、所与の量の構造化情報１００に
ついては、本発明の処理方法への対象になるものとして
説明する。前記量の情報を処理するために記述すべきス
テップはこの方法の一部であり、有利なことに一部は送
り側のコンピュータ・システムで実行され、一部は受取
り側のコンピュータ・システムで実行される。これらの
コンピュータ・システムは、前記ステップを実行できる
ようにするためのプログラムとツールを備えている。BRIEF DESCRIPTION OF THE DRAWINGS FIG.
With particular reference to FIG. 1, a given amount of structured information 100 will be described as being subject to the processing method of the present invention. The steps to be described for processing said quantity of information are part of the method, advantageously partly performed on the sending computer system and partly performed on the receiving computer system Is done. These computer systems include programs and tools that enable the above steps to be performed.

【００１０】情報１００は、電子（コンピュータ可読）
ファイルに含まれている。The information 100 is electronic (computer readable)
Included in the file.

【００１１】文法は、たとえば、所与のファイル拡張子
などの命名規則を使用するかまたは側面情報により、情
報量１００に応じて決定される（ステップ１１０）。The grammar is determined according to the amount of information 100, using, for example, a naming convention such as a given file extension or by side information (step 110).

【００１２】文法に対応するデータベース・スキーマ
（ここではリレーショナル・データベース用）は、ステ
ップ１２０に示すようにこの文法から導き出される。A database schema corresponding to the grammar (here for a relational database) is derived from this grammar as shown in step 120.

【００１３】次に、追加ステップ１３０では、文法によ
って記述された情報の構造化内容をデータベース・スキ
ーマからなる複数のテーブルにマッピングする方法を決
定するマッピング規則が生成される。Next, in an additional step 130, a mapping rule is generated which determines how to map the structured content of the information described by the grammar to a plurality of tables comprising a database schema.

【００１４】最後に、ステップ１４０では、データベー
ス・スキーマならびにあらかじめ確立した文法規則に対
応する情報をデータベースに充填することができる。Finally, at step 140, the database can be filled with information corresponding to the database schema and pre-established grammar rules.

【００１５】その結果、この「文法ベース・マッピン
グ」の場合、情報はリレーショナル・データベース・シ
ステムのテーブル内に構造化形式で記憶される。本発明
による上記の方法の利点として、データベースのユーザ
は、データベースの内容を照会し、所望のすべての情報
をソートされた形式で持ち出すような詳細な照会を作成
することができ、どのようなデータベースのテーブルも
一般に意味的単一体（semantic unities）なので理解し
やすいものである。したがって、情報の解析は、従来技
術の技法の場合と同様に、そのデータベースへの読取り
アクセスの一部である必要はない。As a result, in the case of this "grammar-based mapping", the information is stored in a structured format in a table of a relational database system. An advantage of the above method according to the present invention is that the database user can query the contents of the database and create a detailed query that brings up all the desired information in a sorted format, Tables are also easy to understand because they are generally semantic unities. Thus, parsing the information need not be part of the read access to the database, as in the prior art techniques.

【００１６】次に図２を参照すると、この処理方法の第
１の単純な実施の形態には、いわゆる「単純文法ベース
・マッピング」が記載されている。Referring now to FIG. 2, a first simple embodiment of the processing method describes a so-called “simple grammar-based mapping”.

【００１７】単純文法ベース・マッピングでは、構造化
情報ファイル内容１６を記述する文法１４からデータベ
ース１２の完全なスキーマ１０が導き出され、生成され
る。マッピング規則１８は、この文法を固守する内容
を、導き出されたスキーマが記述するデータベースにマ
ッピングする方法を示すものであり、同じくこの文法に
よって静的に導き出される。In simple grammar-based mapping, the complete schema 10 of the database 12 is derived and generated from a grammar 14 describing the structured information file contents 16. The mapping rule 18 indicates a method of mapping the content that adheres to the grammar to the database described by the derived schema, and is also derived statically by the grammar.

【００１８】以下の文法例は、この技法の数多くの応用
例の可能性の１つにすぎない。これは、たとえば、特殊
なコンピュータ言語用のネスト可能なレコード構造を記
述するための単純な文法である。The following grammar example is just one of many possible applications of this technique. This is, for example, a simple grammar for describing nestable record structures for special computer languages.

【００１９】[0019]

【表１】 [Table 1]

【００２０】この例証的な文法は、レコードがその構造
の任意選択名から始まり、次にキーワード「ＳＴＲＵＣ
Ｔ」と０個またはそれ以上のＭＥＭＢＥＲが続き、キー
ワード「ＥＮＤ」で終わることを示している。ＭＥＭＢ
ＥＲは、ＩＤＥＮＴＩＦＩＥＲとそれに続くＴＹＰＥか
らなる。次にＩＤＥＮＴＩＦＩＥＲは１つまたは複数の
英字の連結であり、ＴＹＰＥはキーワード「ＳＴＲＩＮ
Ｇ」または「ＩＮＴＥＧＥＲ」のいずれか一方であるか
あるいはネストされたレコードである。This illustrative grammar shows that a record starts with an optional name of its structure, and then the keyword "STRUC"
"T" followed by zero or more MEMBERs, ending with the keyword "END". MEMB
ER consists of IDENTIFIER followed by TYPE. Next, IDENTIFIER is a concatenation of one or more alphabetic characters, and TYPE is the keyword "STRIN".
G "or" INTEGER "or a nested record.

【００２１】この文法から導き出されるリレーショナル
・スキーマとしては、以下のものが考えられる。The following relational schema can be derived from this grammar.

【００２２】[0022]

【表２】 [Table 2]

【００２３】対応するマッピング規則は以下のようにな
るだろう。The corresponding mapping rule would be as follows:

【００２４】この文法の「ＲＥＣＯＲＤ」規則が適用さ
れた場合、テーブルＲＥＣＯＲＤ＿ＴＡＢＬＥに新しい
行を挿入し、そのレコードを識別する固有の番号を生成
し、この番号をその行のrecord_keyフィールドに入れ
る。次に、レコードの名前があればその名前と、実際の
ＭＥＭＢＥＲの数をその行の列record_nameおよびnumbe
r_of_membersに入れる。すべてのＭＥＭＢＥＲについ
て、このレコードのrecord_key値によってフィールドpa
rent_recore_keyを更新することにより、対応する行を
更新する。ＥＮＤＩＦIf the "RECORD" rule of this grammar is applied, a new row is inserted into the table RECORD_TABLE, generating a unique number identifying the record, and placing this number in the record_key field of that row. Next, the name of the record, if any, and the actual number of MEMBERs are entered in the columns record_name and numbe of that row.
Put in r_of_members. For all MEMBERs, field pa by record_key value of this record
Update the corresponding row by updating rent_recore_key. ENDIF

【００２５】この文法のＭＥＭＢＥＲ規則が適用された
場合、テーブルＭＥＭＢＥＲ＿ＴＡＢＬＥに新しい行を
挿入し、このテーブル内で固有の番号を生成し、それを
その行のmember_keyフィールドに入れる。「ＩＤＥＮＴ
ＩＦＩＥＲ」の値をその行のmember_nameに入れる。こ
のメンバの型が「ＳＴＲＩＮＧ」である場合、simple_t
ypeフィールドに０を入れ、complex_typeフィールドに
ＮＵＬＬを入れる。型が「ＩＮＴＥＧＥＲ」である場
合、simple_typeフィールドに１を入れ、complex_type
フィールドにＮＵＬＬを入れる。次にその型がレコード
構造である場合、それに対応するデータベース項目のre
cord_key値をcomplex_typeフィールドに入れ、simple_t
ypeフィールドにＮＵＬＬを入れる。ＥＮＤＩＦIf the MEMBER rule of this grammar is applied, insert a new row in the table MEMBER_TABLE, generate a unique number in this table and put it in the member_key field of that row. "IDENT
Put the value of "IFIER" in the member_name of that row. If this member is of type "STRING", simple_t
Put 0 in the ype field and NULL in the complex_type field. If the type is "INTEGER", enter 1 in the simple_type field and enter complex_type
Put NULL in the field. Next, if the type is a record structure, the re
Put the cord_key value in the complex_type field and use simple_t
Put NULL in the ype field. ENDIF

【００２６】次の例は、上記のマッピングを実証するも
のである。以下の内容は個人データを記述するものであ
り、その構造は所与の文法を固守している。The following example demonstrates the above mapping. The following describes personal data, the structure of which adheres to a given grammar.

【００２７】[0027]

【表３】 [Table 3]

【００２８】上記によるマッピング規則の場合、文法ベ
ース・マッピングの結果は以下のようになるだろう。For a mapping rule according to the above, the result of the grammar-based mapping would be:

【００２９】[0029]

【表４】 [Table 4]

【００３０】したがって、上記による文法１４によって
一般化し記述することができる構造を有する複数の異な
る量またはパッケージの情報をデータベースに充填する
場合、まず従来技術に対応すると思われる同じ複数のＢ
ＬＯＢを解析する必要なしに、完全な最新の照会を含む
データベースからの快適な読取りまたは照会を実行する
ことができる。Thus, when filling a database with information of a plurality of different quantities or packages having a structure that can be generalized and described by the grammar 14 according to the above, first the same plurality of Bs, which would correspond to the prior art,
A comfortable read or query from a database containing a complete up-to-date query can be performed without having to analyze the LOB.

【００３１】図３に概要を示す本発明の処理方法の第２
の実施の形態には、いわゆる「複合文法ベース・マッピ
ング」が記載されている。FIG. 3 shows a second embodiment of the processing method of the present invention.
In this embodiment, a so-called “compound grammar-based mapping” is described.

【００３２】複合文法ベース・マッピングでは、スキー
マ１０は必ずしも文法１４によって完全に導き出される
わけではなく、その文法の所与の規則が内容に適用され
る場合は文法といわゆるメタ文脈（meta-context）２０
の両方によって導き出される。スキーマ１０は一部は文
法から導き出すことができ、解析中はスキーマ拡張とデ
ータベースへの情報のマッピングの両方を行うことがで
きる。In a compound grammar-based mapping, the schema 10 is not always completely derived by the grammar 14, but if a given rule of the grammar applies to the content, the grammar and the so-called meta-context 20
Derived by both. The schema 10 can be derived, in part, from the grammar, and can both perform schema extension and map information to a database during parsing.

【００３３】それをより詳細に説明するため、上記の例
である「record」では、文脈によって各レコードごとに
それ自体のテーブルを生成し、生成した各テーブルを登
録テーブルに登録することが望ましいだろう。この文脈
は、たとえば、所与のユーザまたはアプリケーションの
要求を反映することになるだろう。In order to explain this in more detail, in the above-mentioned "record", it is desirable to generate its own table for each record depending on the context and register each generated table in the registration table. Would. This context would reflect, for example, the needs of a given user or application.

【００３４】どのテーブルが作成されるかは解析時にし
か分からない。この例では、以下の文法から１つのテー
ブルだけが静的に導き出される。RECORD_REGISTRY_TABL
Eは、そのテーブルが存在するすべてのレコードを含
む。テーブル「RECORD_REGISTRY_TABLE」は列「record_
table_name」を有し、これはこのテーブルの主キーであ
る。The table to be created can be known only at the time of analysis. In this example, only one table is statically derived from the following grammar. RECORD_REGISTRY_TABL
E includes all records where the table exists. Table "RECORD_REGISTRY_TABLE" has column "record_
table_name ", which is the primary key for this table.

【００３５】したがって、この規則は、「単純文法ベー
ス・マッピング」に比べ、スキーマ拡張情報、たとえ
ば、追加のデータベース・テーブルを生成するかあるい
はすでに作成されたテーブル内の追加の行またはフィー
ルドを作成するなどの規則も含むことができる。Thus, this rule creates schema extension information, eg, additional database tables, or creates additional rows or fields in already created tables, as compared to “simple grammar-based mapping”. And other rules.

【００３６】この例の場合、１組の規則は以下のように
なる。In this example, a set of rules is as follows.

【００３７】この文法の「ＲＥＣＯＲＤ」規則が適用さ
れた場合、スキーマ内に新しいテーブルを作成する。そ
のテーブルの名前は解析されたレコードの名前によって
示される。そのテーブルの列は（varcharとしてのmembe
r_name、integerとしてのsimple_type、varcharとしての
complex_type）になる。member_nameは１次キーであ
る。レコード名（ＩＤＥＮＴＩＦＩＥＲの値）をＲＥＣ
ＯＲＤ＿ＲＥＧＩＳＴＲＹ＿ＴＡＢＬＥテーブルに入れ
る。レコード名が一切示されない場合、固有のものを作
成する。ＥＮＤＩＦこの文法のＭＥＭＢＥＲ規則が適用された場合、このメ
ンバが属すレコードに対応するテーブルに新しい行を挿
入する。ＩＤＥＮＴＩＦＩＥＲの値をその行のmember_n
ameに入れる。このメンバの型が「ＳＴＲＩＮＧ」であ
る場合、simple_typeフィールドに０を入れ、complex_t
ypeフィールドにＮＵＬＬを入れる。型が「ＩＮＴＥＧ
ＥＲ」である場合、simple_typeフィールドに１を入
れ、complex_typeフィールドにＮＵＬＬを入れる。次に
その型がレコード構造である場合、そのレコード名をco
mplex_typeフィールドに入れ、simple_typeフィールド
にＮＵＬＬを入れる。ＥＮＤＩＦWhen the "RECORD" rule of this grammar is applied, a new table is created in the schema. The name of the table is indicated by the name of the parsed record. The columns of that table are (membe as varchar
r_name, simple_type as integer, varchar
complex_type). member_name is a primary key. Record name (IDENTIFIER value) to REC
Put in ORD_REGISTRY_TABLE table. If no record name is given, create a unique one. ENDIF When the MEMBER rule of this grammar is applied, a new row is inserted into the table corresponding to the record to which this member belongs. Set the value of IDENTIFIER to member_n of the row
Put in ame. If the type of this member is "STRING", put 0 in the simple_type field, and
Put NULL in the ype field. If the type is "INTEG
If "ER", enter 1 in the simple_type field and NULL in the complex_type field. Next, if the type is a record structure, change the record name to co
Put in mplex_type field and put NULL in simple_type field. ENDIF

【００３８】以下の例を解析する。The following example is analyzed.

【００３９】[0039]

【表５】 [Table 5]

【００４０】このスキーマは、テーブル「PersonData」
および「SubRecord1」によって拡張することができる。
すなわち、ネストされたレコードについてレコード名が
一切示されていないので、その名前として「SubRecord
1」が作成されることになる。This schema has a table "PersonData"
And can be extended by "SubRecord1".
In other words, no record name is shown for nested records, so the name "SubRecord
1 "will be created.

【００４１】[0041]

【表６】 [Table 6]

【００４２】解析後、テーブルは以下のデータを含むこ
とになる。After analysis, the table will contain the following data:

【００４３】[0043]

【表７】 [Table 7]

【００４４】上記の第１の実施の形態で説明したよう
に、事前にＢＬＯＢを解析せずにデータベースからの快
適な読取りまたは照会を実行することができる。As described in the first embodiment, comfortable reading or querying from the database can be performed without analyzing the BLOB in advance.

【００４５】次に図４および図５を参照すると、事前に
情報を解析せずにデータベース１２への直接読取り／照
会アクセスを実行できる、本発明による処理方法の応用
に関する例が示されている。Referring now to FIGS. 4 and 5, there is shown an example relating to the application of the processing method according to the invention, in which direct read / query access to the database 12 can be performed without prior analysis of the information.

【００４６】受取り側当事者Ｂ（図５の右側）は、複数
の同じ型の情報パッケージ、すなわち、共通の一般的な
構造を有する情報パッケージを様々な当事者から受け取
ることになっており、そのうちの１つ（当事者Ａ）は図
５の左側に示されている。The receiving party B (right side of FIG. 5) is to receive a plurality of information packages of the same type, ie, information packages having a common general structure, from various parties, one of which. One (party A) is shown on the left side of FIG.

【００４７】Ｂは、所与の量の患者データを医師Ａの診
療所のコンピュータから受け取る病院のデータベースに
することができる。Ａの患者のそれぞれについて、同じ
型のデータ、すなわち、情報パッケージが転送されると
予想される。B can be a hospital database that receives a given amount of patient data from physician A's clinic computer. It is expected that the same type of data, ie, the information package, will be transferred for each of the patients in A.

【００４８】当事者Ａは、転送すべき情報量の内容に関
し、それを特徴付ける文法規則を含む１つまたは１組の
いわゆるＤＴＤ（文書型宣言）を設計または使用する
（４００）。Party A designs or uses (400) one or a set of so-called DTDs (Document Type Declarations) containing the grammatical rules that characterize the content of the information to be transferred.

【００４９】次に、ステップ４１０では、ＤＴＤファイ
ル３０の１つがＢに送られる。これは、いずれかの電子
形式またはＡからＢに書状として送られる書類の形式に
なる可能性がある。Next, at step 410, one of the DTD files 30 is sent to B. This can be in any electronic form or in the form of a document sent as a letter from A to B.

【００５０】次に、ステップ４２０では、ＢがＤＴＤフ
ァイル３０を受け取り、そのＤＴＤファイルに含まれる
情報に応じてデータベース・スキーマ１０を生成する。Next, in step 420, B receives the DTD file 30, and generates the database schema 10 according to the information contained in the DTD file.

【００５１】図５の上部にも示されているように、Ａは
図４のステップ４１０でＤＴＤファイルをＢに送る。デ
ータベース記号１２から分かるように、データベース・
スキーマ１０はＤＴＤファイル内容から導き出され、デ
ータベース１２内の４つの「ダミー」ポイントによって
示されるようにＤＴＤファイルを受け取ると動的に生成
される。データベース・スキーマ１０が完成し、データ
ベース１２が常駐するコンピュータ・システムがデータ
を受け取れる状態になると、データ転送を開始すること
ができる。As shown at the top of FIG. 5, A sends the DTD file to B in step 410 of FIG. As can be seen from the database symbol 12, the database
Schema 10 is derived from the DTD file contents and is dynamically generated upon receipt of the DTD file as indicated by the four “dummy” points in database 12. Once the database schema 10 is complete and the computer system on which the database 12 resides is ready to receive data, data transfer can begin.

【００５２】図５の次の行では、情報パッケージがＡか
らＢに送られる。この情報は、受取り側Ｂの例証的なデ
ータベース・スキーマを表す４つのテーブル内に分割さ
れる。したがって、すべての情報はデータベース内のそ
の正確な場所に記憶される。この送出プロセスは所与の
回数繰り返され、Ｂのデータベースに記憶された情報の
量はますます大きくなる。第三者ＣもＢのデータベース
に同じ型の情報を送るように示されている。In the next line of FIG. 5, the information package is sent from A to B. This information is split into four tables representing the recipient B's illustrative database schema. Thus, all information is stored at its exact location in the database. This sending process is repeated a given number of times, and the amount of information stored in B's database is growing. Third party C is also shown to send the same type of information to B's database.

【００５３】図５の下部右側に示されているように、Ｂ
のデータベースから所望のテーブル内容を選択する最新
の照会によって、記憶された情報全体にアクセスするこ
とができる。現況技術の場合のように、ＢＬＯＢＳの解
析は不要である。As shown in the lower right part of FIG.
The entire stored information can be accessed by an up-to-date query that selects the desired table contents from the database. BLOBS analysis is not required as in the state of the art.

【００５４】図４に戻ると、ステップ４３０では、Ａ
は、Ｂのコンピュータ・システム内にセットアップされ
る文法規則およびデータベース・スキーマに応じて所与
の量の情報（ファイル、電子メールなどの複数の情報パ
ッケージ）を送る。これは、図５の第２の「行」として
示されている。Returning to FIG. 4, in step 430, A
Sends a given amount of information (files, multiple information packages such as emails) depending on the grammar rules and database schema set up in B's computer system. This is shown as the second "row" in FIG.

【００５５】次のステップ４４０では、Ｂのコンピュー
タ・システムがその情報量を受け取り、データベース・
スキーマ１０に応じてそれをデータベース・テーブル３
２に自動的に記憶する。図４に示すループから分かるよ
うに、Ａの送出プロセスは必要なだけ頻繁に繰り返すこ
とができる。あるいは、他の当事者Ｃ（他の医師の診療
所）は、図５の第３の行に示す同じ型の追加量の情報を
送ることができる。次に、後者の情報はＡの情報と同じ
ように構造化されているので、同じように記憶される。In the next step 440, B's computer system receives the amount of information and
Database table 3 according to schema 10
2 automatically stored. As can be seen from the loop shown in FIG. 4, the sending process of A can be repeated as often as necessary. Alternatively, the other party C (another doctor's clinic) can send an additional amount of the same type of information shown in the third row of FIG. Next, since the latter information is structured in the same way as the information of A, it is stored in the same manner.

【００５６】その結果、受取り側のデータベース１２
は、正しく構造化され、対応するデータベース・テーブ
ル３２に記憶された情報で充填される。したがって、個
々の照会は転送された情報パッケージの全体量にわたっ
てＢのスタッフによって直ちにかつ快適にセットアップ
することができ、データベースからデータを読み取った
ときにいかなるＢＬＯＢ解析も不要である。したがっ
て、データベースからデータを読み取って理解するため
に必要な時間量は最小限になる。As a result, the receiving side database 12
Are correctly structured and filled with the information stored in the corresponding database table 32. Thus, individual queries can be set up immediately and comfortably by B's staff over the entire volume of information package transferred, without the need for any BLOB analysis when reading data from the database. Thus, the amount of time required to read and understand data from the database is minimized.

【００５７】次に、本発明の方法の主な応用部分を形成
するいわゆる文脈自由文法を特に参照し、図１のステッ
プ１２０およびステップ１３０について、より詳細に説
明する。ファイル内容の構造、あるいはより一般的に表
すと本出願の用語における情報パッケージまたは量の構
造は、文脈自由文法によって記述される場合が多い。こ
の場合、その文法の非端末（non-terminals）はリレー
ショナル・データベース・システムのテーブルにマッピ
ングされる。非端末は通常、意味的エンティティ・グル
ープを構文上記述するために使用するので、これは自然
なマッピングである。次に、このようなエンティティ・
グループをテーブルにマッピングすることができる。Steps 120 and 130 of FIG. 1 will now be described in more detail, with particular reference to the so-called context-free grammar which forms the main application part of the method of the invention. The structure of the file contents, or more generally, the structure of the information package or quantity in the terms of the present application, is often described by a context-free grammar. In this case, the non-terminals of the grammar are mapped to tables in a relational database system. This is a natural mapping, since non-terminals are typically used to describe a semantic entity group syntactically. Next, such an entity
Groups can be mapped to tables.

【００５８】普遍性を失わずに、文脈自由文法の規則は
以下の形式になると想定される。Without loss of universality, the rules of a context-free grammar are assumed to be of the form

【００５９】 [1] ls ==> rs1 rs2 ... rsN または [2] ls ==> rs1 rs2 ... rsN または [3] ls ==> (rs)+ または [4] ls ==> (rs)*[1] ls ==> rs1 rs2 ... rsN or [2] ls ==> rs1 rs2 ... rsN or [3] ls ==> (rs) + or [4] ls ==> (rs) *

【００６０】ｌｓは「左側」を表し、ｒｓは「右側」を
表す。ｌｓは常に非端末を表し、ｒｓは端末、空端末
（empty-terminal）、非端末のいずれかを表す。Ls represents “left side”, and rs represents “right side”. ls always represents a non-terminal, and rs represents any of a terminal, an empty-terminal, and a non-terminal.

【００６１】［１］は構造の連結と呼ばれ、左側が右側
の１つまたは複数の端末または非端末によって連結によ
って構成されることを意味する。[1] is referred to as a structure connection, and means that the left side is formed by connection of one or more terminals or non-terminals on the right side.

【００６２】［２］は構造の選択と呼ばれ、左側が右側
の１つの端末または非端末から正確に構成されることを
意味する。[2] is referred to as structure selection and means that the left side is correctly composed of one terminal or non-terminal on the right side.

【００６３】［３］は構造の１繰返しと呼ばれ、左側が
ｒｓの１回または複数回の繰返しによって構成されるこ
とを意味する。[3] is called one repetition of the structure, and means that the left side is constituted by one or more repetitions of rs.

【００６４】［４］は構造の０繰返しと呼ばれ、左側が
ｒｓの０回またはそれ以上の繰返しによって構成される
ことを意味する。[4] is called zero repetition of the structure and means that the left side is constituted by zero or more repetitions of rs.

【００６５】各文脈自由文法は、それぞれが上記４通り
のパターンの１つを固守する規則の有限セットによって
記述することができる。さらに、普遍性を失わずに、各
非端末が規則の左側に正確に１回ずつ現れると想定する
ことができる。正規表現は、特殊なサブクラスの文脈自
由文法であり、したがって、これらのパターンを固守す
る１組の規則によって記述することができる。実際に
は、非端末は、トークンクラスと実非端末とに分割され
る。前者は字句解析プログラムによって認識され、後者
は構文解析プログラムによって認識される。上記の例で
は、これらのグループは、'名前'、'Ａ'、'０'、'：'の
ように端末用の引用符と、＜ＳＴＲＩＮＧ＞、＜ＩＤＥ
ＮＴＩＦＩＥＲ＞、＜ＮＵＭＢＥＲ＞のようにトークン
クラス用の＜＞と、ＡＤＤＲＥＳＳ、ＥＭＰＬＯＹＥＲ
＿ＮＵＭＢＥＲのように非端末用の接頭辞または接尾辞
のない大文字単語によって区別される。Each context-free grammar can be described by a finite set of rules, each of which adheres to one of the above four patterns. Further, without loss of universality, it can be assumed that each non-terminal appears exactly once on the left side of the rule. Regular expressions are a special subclass of context-free grammar, and can therefore be described by a set of rules that adhere to these patterns. In practice, non-terminals are divided into token classes and real non-terminals. The former is recognized by the lexical analyzer, and the latter is recognized by the parser. In the above example, these groups are quoted for the terminal, such as 'name', 'A', '0', ':', and <STRING>, <IDE
<> For token class such as NTIFIER>, <NUMBER>, ADDRESS, EMPLOYER
It is distinguished by an uppercase word without a prefix or suffix for non-terminals, such as _NUMBER.

【００６６】「実非端末（Real non-terminal）」は非
端末という。"Real non-terminal" is called a non-terminal.

【００６７】このような規則の「単純文法ベース・マッ
ピング」は以下のようになるだろう。The "simple grammar-based mapping" of such a rule would be as follows:

【００６８】各非端末ごとにテーブルが作成される。さ
らに、各テーブルごとに自動的に１次キーが作成される
ことになる。これは行を明確に識別する数字にすぎな
い。このキーは、テーブルに挿入される新しい行ごとに
自動的に増分される。A table is created for each non-terminal. Further, a primary key is automatically created for each table. This is just a number that clearly identifies the line. This key is automatically incremented for each new row inserted into the table.

【００６９】パターン［１］に一致する各規則ごとに、
規則の右側の各非端末の外部キーを表す列が規則の左側
の非端末に対応するテーブルに追加される。規則の右側
の各トークンクラスごとに、varchar型の列が追加され
る。解析時に規則が一致した場合、その規則の右側の各
非端末ごとに適切な列に外部キーが入れられ、各トーク
ンクラスごとに適切な列にその具体的な値が入れられ
る。For each rule that matches pattern [1],
A column representing the foreign key of each non-terminal on the right of the rule is added to the table corresponding to the non-terminal on the left of the rule. A varchar column is added for each token class on the right side of the rule. If the rule matches during parsing, the foreign key is placed in the appropriate column for each non-terminal to the right of the rule, and the specific value is placed in the appropriate column for each token class.

【００７０】たとえば、文法規則は以下の通りとする。For example, the grammar rules are as follows.

【００７１】[0071]

【表８】 [Table 8]

【００７２】その結果、以下のスキーマ拡張が行われ
る。As a result, the following schema extension is performed.

【００７３】[0073]

【表９】 [Table 9]

【００７４】文字列「Name: John Miller No: 00000
7」、「Name: Liz Parker No: 000011」、「Name: Cind
y Smith No: 000003」の解析後、以下の項目が生成され
ることになる。The character string "Name: John Miller No: 00000"
7 "," Name: Liz Parker No: 000011 "," Name: Cind
After parsing "y Smith No: 000003", the following items will be generated.

【００７５】[0075]

【表１０】 [Table 10]

【００７６】テーブルＥＭＰＬＯＹＥＥの列「name」の
値はテーブルＮＡＭＥの行を参照することに留意された
い。Note that the value of column "name" of table EMPLOEEEE refers to a row of table NAME.

【００７７】パターン［２］に一致する各規則ごとに、
規則の右側の各非端末の外部キーを表す列が規則の左側
の非端末に対応するテーブルに追加される。各トークン
クラスごとに、varchar型の列が生成され、右側に端末
がある場合は「terminal_value」という名前のvarchar
型の列が生成される。解析時に規則が非端末と一致した
場合、その規則の右側の一致した非端末について適切な
列に外部キーが入れられ、他のすべての列はＮＵＬＬに
設定される。規則がトークンクラスと一致した場合、そ
の具体的な値が適切な列に入れられ、他のすべての列は
ＮＵＬＬに設定される。端末が右側で一致する場合、そ
の端末の値が「terminal」列に入れられ、他のすべての
列はＮＵＬＬに設定される。For each rule that matches pattern [2],
A column representing the foreign key of each non-terminal on the right of the rule is added to the table corresponding to the non-terminal on the left of the rule. For each token class, a varchar column is generated, and if there is a terminal on the right, a varchar named "terminal_value"
A type column is generated. If the rule matches a non-terminal during parsing, the foreign key is placed in the appropriate column for the matching non-terminal on the right of the rule, and all other columns are set to NULL. If the rule matches the token class, its specific value is placed in the appropriate column, and all other columns are set to NULL. If a terminal matches on the right side, the value of that terminal is placed in the "terminal" column and all other columns are set to NULL.

【００７８】他の例証的な潜在的ファイル内容では、文
法規則は以下の通りとする。For another illustrative potential file content, the grammar rules are as follows:

【００７９】[0079]

【表１１】 [Table 11]

【００８０】その結果、以下のスキーマ拡張が行われ
る。As a result, the following schema extension is performed.

【００８１】[0081]

【表１２】 [Table 12]

【００８２】文字列「A: Dog」、「P: Oak」、「P: Ald
er」、「A: Squirrel」の解析時に以下の項目が生成さ
れることになる。Character strings "A: Dog", "P: Oak", "P: Ald
The following items will be generated when analyzing "er" and "A: Squirrel".

【００８３】[0083]

【表１３】 [Table 13]

【００８４】完全に理解するために他の例を示す。この
例では、顧客は名前または顧客番号によって識別するこ
とができる。注文の対象は、テーブル、イス、またはＳ
ＵＢＪＥＣＴテーブルのany_other_subject列に対応す
る他のものにすることができる。文法規則は以下の通り
とする。Another example is provided for a thorough understanding. In this example, customers can be identified by name or customer number. Order can be placed on a table, chair, or S
It can be another one corresponding to the any_other_subject column of the UBJECT table. The grammar rules are as follows.

【００８５】[0085]

【表１４】 [Table 14]

【００８６】その結果、以下のスキーマ拡張が行われ
る。As a result, the following schema extension is performed.

【００８７】[0087]

【表１５】 [Table 15]

【００８８】文字列「Order from: John Miller Subjec
t is: Table」、「Order from: 000918 Subject is: Wa
ter」、「Order from: Liz Parker Subject is: Ca
r」、「Order from: Cindy Smith Subject is: Chair」
の解析時に以下の項目が生成されることになる。The character string "Order from: John Miller Subjec"
t is: Table '', `` Order from: 000918 Subject is: Wa
ter '', `` Order from: Liz Parker Subject is: Ca
r "," Order from: Cindy Smith Subject is: Chair "
The following items will be generated when analyzing.

【００８９】[0089]

【表１６】 [Table 16]

【００９０】パターン［３］に一致し、右側に非端末を
有する各規則ごとに、１次キー目的の列が規則の左側の
非端末に対応するテーブルに追加される。また、カウン
ト目的の列もこのテーブルに追加され、１次キーの一部
としても機能する。右側の非端末について、外部キーの
列がテーブルに追加される。解析時に規則が一致した場
合、右側の各インスタンス化ごとに、その項目の順序番
号および外部キーを含む行が作成され、テーブルに挿入
される。For each rule that matches pattern [3] and has a non-terminal on the right, a column for primary key purposes is added to the table corresponding to the non-terminal on the left of the rule. Also, a column for counting purposes is added to this table and functions as a part of the primary key. For the right non-terminal, a foreign key column is added to the table. If the rule matches during parsing, a row containing the item's sequence number and foreign key is created and inserted into the table for each instantiation on the right.

【００９１】たとえば、文法規則は以下の通りとする。For example, the grammar rules are as follows.

【００９２】[0092]

【表１７】 [Table 17]

【００９３】その結果、以下のスキーマ拡張が行われ
る。As a result, the following schema extension is performed.

【００９４】[0094]

【表１８】 [Table 18]

【００９５】文字列「John Miller Liz Parker Cindy S
mith」および「Elvis Presley Sandra Brown」の解析時
に以下の項目が生成されることになる。The string "John Miller Liz Parker Cindy S"
The following items will be generated when analyzing "mith" and "Elvis Presley Sandra Brown".

【００９６】[0096]

【表１９】 [Table 19]

【００９７】パターン［３］に一致し、右側にトークン
クラスを有する各規則ごとに、１次キー目的の列が規則
の左側の非端末に対応するテーブルに追加される。ま
た、カウント目的の列もこのテーブルに追加され、それ
が１次キーの一部としても機能する。右側のトークンク
ラスについて、varchar型の列がテーブルに追加され
る。解析時にその規則が適用された場合、右側の各イン
スタンス化ごとに、その項目の順序番号および具体的な
トークンクラス値を含む行が作成され、テーブルに挿入
される。For each rule that matches pattern [3] and has a token class on the right, a column for primary key purposes is added to the table corresponding to the non-terminal on the left of the rule. A column for counting purposes is also added to this table, which also functions as a part of the primary key. For the token class on the right, a varchar column is added to the table. If the rule was applied during parsing, for each instantiation on the right, a row containing the item's sequence number and specific token class value is created and inserted into the table.

【００９８】たとえば、文法規則は以下の通りとする。For example, the grammar rules are as follows.

【００９９】[0099]

【表２０】 [Table 20]

【０１００】その結果、以下のスキーマ拡張が行われ
る。As a result, the following schema extension is performed.

【０１０１】[0101]

【表２１】 [Table 21]

【０１０２】文字列「John Liz Cindy」および「Elvis
Sandra」の解析時に以下の項目が生成されることにな
る。The strings "John Liz Cindy" and "Elvis
The following items will be generated when analyzing "Sandra".

【０１０３】[0103]

【表２２】 [Table 22]

【０１０４】パターン［３］に一致し、右側に端末を有
する各規則ごとに、１次キー目的の列が規則の左側の非
端末に対応するテーブルに追加される。また、カウント
目的の列もこのテーブルに追加される。解析時にその規
則が適用された場合、端末の出現数を含む行が作成さ
れ、テーブルに挿入される。For each rule that matches pattern [3] and has a terminal on the right, a column for primary key purposes is added to the table corresponding to the non-terminal on the left of the rule. A column for counting purposes is also added to this table. If the rule is applied during parsing, a row containing the number of occurrences of the terminal is created and inserted into the table.

【０１０５】たとえば、文法規則は以下の通りとする。For example, the grammar rules are as follows.

【０１０６】[0106]

【表２３】 [Table 23]

【０１０７】その結果、以下のスキーマ拡張が行われ
る。As a result, the following schema extension is performed.

【０１０８】[0108]

【表２４】 [Table 24]

【０１０９】文字列「００００００００００」、「００
０」、「００００００」の解析時に以下の項目が生成さ
れることになる。Character strings "000000000000", "00"
The following items are generated when “0” and “000000” are analyzed.

【０１１０】[0110]

【表２５】 [Table 25]

【０１１１】パターン［４］に一致する各規則について
は、［３］の場合と同じものが適用される。For each rule that matches pattern [4], the same rule as in case of [3] is applied.

【０１１２】本発明による処理および転送方法は、有利
なことに、ＸＭＬ（拡張可能マークアップ言語, Extens
ible Markup Language）文書とともに適用することがで
きる。というのは、各ＸＭＬ文書は任意選択で文書型宣
言ファイル（ＤＴＤ）を参照するからである。当技術分
野で既知の通り、ＤＴＤは、他の構成体の中でも主に正
規表現を使用して、ＸＭＬファイルの構造を記述する。
以下のサンプルＤＴＤは、ＸＭＬで作成されたＤＴＤを
本発明による処理および転送方法に適用できることを立
証するためのものである。このＤＴＤは、従業員のリス
トの構造を含む。The processing and forwarding method according to the invention is advantageously implemented in XML (Extensible Markup Language, Extens
ible Markup Language) documents. This is because each XML document optionally references a document type declaration file (DTD). As is known in the art, DTD describes the structure of an XML file using primarily regular expressions, among other constructs.
The following sample DTD demonstrates that a DTD created in XML can be applied to the processing and forwarding method according to the present invention. This DTD contains the structure of the list of employees.

【０１１３】[0113]

【表２６】 [Table 26]

【０１１４】上記のＤＴＤを固守するＸＭＬ文書は以下
のようになる可能性がある。An XML document that adheres to the above DTD may be as follows.

【０１１５】[0115]

【表２７】 [Table 27]

【０１１６】このＸＭＬ文書は、それぞれの名前と相互
の関係を含む従業員のリストを記述するものである。こ
のＸＭＬ文書の構造がＤＴＤによって示され、ＤＴＤが
正規表現によって文法のようにこの構造を記述している
ことを強調しなければならない。This XML document describes a list of employees including their names and mutual relationships. It must be emphasized that the structure of this XML document is indicated by the DTD and that the DTD describes this structure like a grammar by regular expressions.

【０１１７】ＤＴＤの規則は以下のように解釈しなけれ
ばならない。The DTD rules must be interpreted as follows.

【０１１８】[0118]

【表２８】 [Table 28]

【０１１９】この規則では、従業員のリストが１人また
は複数の従業員を含み、このようなリストはタグ<list_
of_employees>および</list_of_employees>によって囲
まなければならないことを示している。According to this rule, the list of employees includes one or more employees, and such a list has the tag <list_
of_employees> and </ list_of_employees>.

【０１２０】[0120]

【表２９】 [Table 29]

【０１２１】これらの規則では、１人の従業員が１つの
名前を含まなければならず、複数の電子メール・アドレ
スを含むことができ、任意選択で指定のリンクを有する
こともできることを示している。さらに、従業員は固有
のＩＤを持っていなければならない。内容ファイル内の
従業員項目は<employee>および</employee>によってタ
グを付けなければならない。These rules indicate that one employee must include one name, may include multiple e-mail addresses, and may optionally have a designated link. I have. In addition, employees must have unique IDs. Employee items in the content file must be tagged with <employee> and </ employee>.

【０１２２】[0122]

【表３０】 [Table 30]

【０１２３】これらの規則では、名前が姓と洗礼名から
なり、それが任意の文字列からなることを示している。
内容ファイル内の名前項目は<name>および</name>によ
ってタグを付けなければならない。また、姓および洗礼
名は<family>、</family>、および<given>、</given>に
よってタグを付けなければならない。These rules indicate that the name is composed of a surname and a baptismal name, and that it is composed of an arbitrary character string.
Name entries in the content file must be tagged with <name> and </ name>. Last names and baptisms must also be tagged with <family>, </ family>, and <given>, </ given>.

【０１２４】[0124]

【表３１】 [Table 31]

【０１２５】この規則では、電子メールが任意の文字列
からなることを示している。電子メール項目は<email>
および</email>によってタグを付けなければならない。This rule indicates that an electronic mail is composed of an arbitrary character string. Email item is <email>
And </ email>.

【０１２６】[0126]

【表３２】 [Table 32]

【０１２７】これらの規則では、１つのリンクが２つの
任意選択属性を有し、一方が管理者を参照し、もう一方
が部下のリストを参照することを示している。These rules indicate that one link has two optional attributes, one referring to the manager and the other referring to the list of subordinates.

【０１２８】各<!ELEMENT...>項目は要素型と呼ばれ
る。上記の例は、list_of_employees、employee、emai
l、name、family、given、linkという７通りの要素型を
示している。Each <! ELEMENT ...> item is called an element type. In the above example, list_of_employees, employee, emai
It shows seven element types, l, name, family, given, and link.

【０１２９】文書要素は所与の要素型の構造を固守し、
属性（同じくＤＴＤに定義されている）を有することが
できる。これらの属性は<!ATTLIST...>項目によって記
述される。A document element adheres to the structure of a given element type,
It can have attributes (also defined in the DTD). These attributes are described by <! ATTLIST ...> items.

【０１３０】ここで、本発明の方法は、要素型定義を固
守する要素のマッピングを記述することにより、リレー
ショナル・データベースに適用できることが分かる。Here, it can be seen that the method of the present invention can be applied to a relational database by describing element mappings that adhere to element type definitions.

【０１３１】このマッピング技術の単純な実施態様の１
つは、文脈自由文法の場合とまったく同じ手法であるデ
ータベース・スキーマで各要素型ごとにテーブルを作成
することになるだろう。One of the simple embodiments of this mapping technique
One would create a table for each element type in a database schema, exactly the same way as in context-free grammar.

【０１３２】したがって、以下のテーブルを生成するこ
とができる。Accordingly, the following table can be generated.

【０１３３】[0133]

【表３３】 [Table 33]

【０１３４】上記のテーブルの列は、要素型自体の定義
およびその属性に対応する。この例では、文脈自由文法
の場合とまったく同じ規則が使用され、したがって、以
下のテーブル定義が得られる。The columns in the above table correspond to the definition of the element type itself and its attributes. In this example, exactly the same rules are used as in the context-free grammar, so we get the following table definition:

【０１３５】[0135]

【表３４】 [Table 34]

【０１３６】上記の明細書では、特定の例証的な実施の
形態に関連して本発明を説明してきた。しかし、特許請
求の範囲に記載した本発明のより広範囲の趣旨および範
囲を逸脱せずに様々な修正および変更を加えることがで
きることは明らかになるだろう。したがって、明細書お
よび添付図面は、限定的意味ではなく、例証と見なすべ
きものである。In the foregoing specification, the invention has been described with reference to specific illustrative embodiments. It will be apparent, however, that various modifications and changes may be made without departing from the broader spirit and scope of the invention as set forth in the appended claims. Accordingly, the specification and accompanying drawings are to be regarded in an illustrative, rather than a restrictive, sense.

【０１３７】また、本発明による処理方法は、構造化情
報を記憶するためのオブジェクト指向データベースにも
適合可能である。The processing method according to the present invention is also applicable to an object-oriented database for storing structured information.

【０１３８】まとめとして、本発明の構成に関して以下
の事項を開示する。In summary, the following items are disclosed regarding the configuration of the present invention.

【０１３９】（１）コンピュータ・システムのデータベ
ースに記憶すべき構造化情報（１００）を処理するため
の方法であって、情報の構造を記述する文法を決定する
ステップ（１１０）と、前記文法からデータベース・ス
キーマを導き出すステップ（１２０）と、前記文法から
マッピング規則を生成するステップ（１３０）と、スキ
ーマおよびマッピング規則に応じて前記データベースに
構造化情報を記憶するステップ（１４０）とを含む方
法。（２）リレーショナル・データベース（１２）のテーブ
ル（３２）に文法の非端末をマッピングするステップを
特徴とする、上記（１）に記載の方法。（３）文脈自由文法を使用して構造化情報（１００）を
記述するステップを特徴とする、上記（２）に記載の方
法。（４）文脈自由文法を使用して構造化情報（１００）を
記述し、複数の情報パッケージ（１００）の構造化内容
を反映する文書記述ファイル３０を生成し、前記文書記
述ファイル（３０）から前記データベース・スキーマ
（１２）を導き出すステップを特徴とする、上記（３）
に記載の方法。（５）ＸＭＬを使用して構造化情報（１００）を記述す
るステップを特徴とする、上記（４）に記載の方法。（６）オブジェクト指向データベース・システムの構造
要素に文法の非端末をマッピングするステップを特徴と
する、上記（５）に記載の方法。（７）ある量の情報を外部サイトからデータベース内に
転送する方法であって、その量の情報が、共通の一般的
な情報構造によって一般化可能な複数の少なくとも同様
に構造化された情報パッケージ（１００）を含み、前記
方法が、転送すべき情報パッケージ（１００）の一般的
な構造を特徴付ける文書記述ファイル（３０）（ＤＴ
Ｄ）を１回生成するステップと、文書記述ファイル（３
０）に応じて受取り側データベース（１０）内にデータ
ベース・スキーマ（１０）を１回生成するステップと、
前記複数の情報パッケージ（１００）を受取り側データ
ベース（１２）に送るステップと、スキーマ（１０）に
対応するデータベース（１２）に情報を書き込むステッ
プとを含む方法。（８）上記（１）ないし上記（６）に記載の方法を実現
するコンピュータ・プログラムを実行できるかまたは上
記（７）に記載の方法の実行を支援できるように構成さ
れた少なくとも１つのデータベース管理システムがイン
ストールされたコンピュータ・システム。（９）上記（１）に記載の方法を実現するコンピュータ
・プログラムをインストールするため、または上記
（７）に記載の方法の実行を支援するために使用可能な
データを記憶する電子データ・キャリア。(1) A method for processing structured information (100) to be stored in a database of a computer system, comprising the steps of: (110) determining a grammar describing the structure of the information; A method comprising: deriving a database schema (120); generating mapping rules from the grammar (130); and storing structured information in the database according to the schema and mapping rules (140). (2) The method according to the above (1), characterized by mapping a grammar non-terminal to the table (32) of the relational database (12). (3) The method according to (2), wherein the structured information (100) is described using a context-free grammar. (4) The structured information (100) is described using a context-free grammar, and a document description file 30 reflecting the structured content of the plurality of information packages (100) is generated. (3) characterized in that the database schema (12) is derived.
The method described in. (5) The method according to (4), wherein the structured information (100) is described using XML. (6) The method according to the above (5), wherein a grammar non-terminal is mapped to a structural element of the object-oriented database system. (7) A method of transferring a certain amount of information from an external site into a database, wherein the amount of information is a plurality of at least similarly structured information packages that can be generalized by a common general information structure. (100), wherein the method characterizes the general structure of the information package (100) to be transferred.
D) once, and a document description file (3)
Generating a database schema (10) once in the receiving database (10) in response to 0);
A method comprising: sending the plurality of information packages (100) to a receiving database (12); and writing information to a database (12) corresponding to a schema (10). (8) At least one database management configured to execute a computer program for implementing the method described in (1) to (6) or to support the execution of the method described in (7). Computer system on which the system was installed. (9) An electronic data carrier for storing data usable for installing a computer program for implementing the method according to (1) or for supporting the execution of the method according to (7).

[Brief description of the drawings]

【図１】本発明の処理方法の概略流れ図である。FIG. 1 is a schematic flowchart of a processing method of the present invention.

【図２】本発明の処理方法の第１の実施の形態の基本態
様を示す包括的な概略図である。FIG. 2 is a comprehensive schematic diagram showing a basic mode of the first embodiment of the processing method of the present invention.

【図３】本発明の処理方法の第２の実施の形態の基本態
様を示す包括的な概略図である。FIG. 3 is a comprehensive schematic diagram showing a basic mode of a second embodiment of the processing method of the present invention.

【図４】本発明により情報を転送するための方法の概略
ブロック図である。FIG. 4 is a schematic block diagram of a method for transferring information according to the present invention.

【図５】本発明の方法による情報転送を示す包括的な概
略図である。FIG. 5 is a comprehensive schematic diagram illustrating information transfer according to the method of the present invention.

[Explanation of symbols]

１０スキーマ１２データベース（ｄｂ）１４文法１６ファイル内容１８マッピング規則２０メタ文脈３０ＤＴＤファイル３２ｄｂテーブル１００情報パッケージ１１０文法を決定する１２０文法からｄｂスキーマを導き出す１３０文法からマッピング規則を生成する１４０スキーマおよびマッピング規則に応じてデータ
ベースに情報を記憶する４００プロトタイプ情報４１０情報を解析する４２０ＤＴＤを生成する４３０ＢにＤＴＤを送る４４０ＤＴＤを受け取り、ｄｂスキーマを生成する４５０複数の情報パッケージを送る４６０情報を受け取るReference Signs List 10 schema 12 database (db) 14 grammar 16 file contents 18 mapping rule 20 metacontext 30 DTD file 32 db table 100 information package 110 determining grammar 120 deriving db schema from grammar 130 generating mapping rules from grammar 140 schema and Store information in database according to mapping rules 400 Prototype information 410 Analyze information 420 Generate DTD 430 Send DTD to B 440 Receive DTD and generate db schema 450 Send multiple information packages 460 Send information receive

Claims

[Claims]

A method for processing structured information (100) to be stored in a database of a computer system, the method comprising: determining a grammar describing the structure of the information (11).
0), deriving a database schema from the grammar (120), and generating mapping rules from the grammar (13)
0) and storing (140) structured information in the database according to the schema and the mapping rules.

2. The method according to claim 1, characterized in that grammar non-terminals are mapped to tables (32) of the relational database (12).

3. Structured information (10) using a context-free grammar.
Method according to claim 2, characterized by the step of describing 0).

4. Structured information (10) using a context-free grammar.
0), generating a document description file 30 reflecting the structured contents of the plurality of information packages (100), and deriving the database schema (12) from the document description file (30). The method of claim 3.

5. Method according to claim 4, characterized in that the structured information (100) is described using XML.

6. The method according to claim 5, characterized in that grammar non-terminals are mapped to structural elements of the object-oriented database system.

7. A method for transferring a quantity of information from an external site into a database, said quantity of information being at least a plurality of at least similarly structured generalizable by a common general information structure. An information package (100), said method comprising: once generating a document description file (30) (DTD) characterizing the general structure of the information package (100) to be transferred; Generating a database schema (10) once in a receiving database (10) in response to: sending the plurality of information packages (100) to a receiving database (12); Writing information to a corresponding database (12).

8. At least one database management system configured to execute a computer program for realizing the method according to claim 1 or to support the execution of the method according to claim 7. Computer system on which is installed.

9. An electronic data carrier storing data usable for installing a computer program for implementing the method according to claim 1 or for supporting the execution of the method according to claim 7.