JP2015094988A

JP2015094988A - Data structure, data generation device, data generation method, and data generation program

Info

Publication number: JP2015094988A
Application number: JP2013232374A
Authority: JP
Inventors: 進藤井; Susumu Fujii
Original assignee: Saga University NUC
Current assignee: Saga University NUC
Priority date: 2013-11-08
Filing date: 2013-11-08
Publication date: 2015-05-18
Anticipated expiration: 2033-11-08
Also published as: JP6245571B2

Abstract

PROBLEM TO BE SOLVED: To provide a data structure and the like capable of ensuring robust data management and efficient operation without the need of complicated table designing and management even for data of no relation and managed by a plurality of systems.SOLUTION: A data structure includes: a universal table 43 for managing data collected from a plurality of ordinary systems 41 using different definitions by item IDs without defining item names; a mask table 45 storing therein mask information for each row unit for necessary item IDs that are minimum necessary item IDs for identifying the data from among the item IDs, the mask information being for enabling only the data stored in the necessary item IDs to be extracted; and a master table 46 for managing a plurality of extracted identical data as one data if the data is extracted from all the data in the universal table 43 according to the mask information stored for every row unit in the mask table 45 and the identical extracted data is extracted from a plurality of rows among the extracted data.

Description

本発明は、大量で複雑なデータを簡単なテーブル設計で効率よく管理するデータ構造等に関する。 The present invention relates to a data structure for efficiently managing a large amount of complex data with a simple table design.

一般的に知られているデータベース技術として、ＲＤＢ（Relational Data Base）やＫＶＳ（Key-Value Store）がある。ＲＤＢは関係データベースと訳され、現在最も多く利用されている。データは表形式のテーブルで管理され、テーブル間の関係を定義付ける。複数のテーブルからデータの取得、変更、削除などが行われる。一方、ＫＶＳは分散ファイルシステムやデータベースとして商品化されており、一意のＫＥＹに対して値（Ｖａｌｕｅ）をセットで保存する。近年のビッグデータに対応した技術であり、データはＫＥＹを指定することで取得することができる。 Commonly known database technologies include RDB (Relational Data Base) and KVS (Key-Value Store). RDB is translated as a relational database and is currently most frequently used. Data is managed in tabular tables and defines relationships between tables. Data is acquired, changed, deleted, etc. from multiple tables. On the other hand, KVS is commercialized as a distributed file system or database, and stores a value (Value) as a set for a unique KEY. This is a technology corresponding to recent big data, and data can be acquired by specifying KEY.

ＲＤＢはテーブルが表形式であるため、エンジニアが直感的にテーブル内のデータ設計を行うという利点があるものの、システムごとにテーブル設計が異なり、テーブル間の連携情報もその都度設計する必要があり、理論上の設計が必要とされ複雑性が否めない。また、こうした特性からメンテナンス性、拡張性、柔軟性が弱く、関係性においてマスタによる項目の分類は１つのカテゴリで管理されることが多く、設計者の意図と利用者の感性が違う場合はそれを取り出すことに困難性が残る。 Since RDB has a table format, there is an advantage that the engineer intuitively designs the data in the table, but the table design is different for each system, and the linkage information between the tables must be designed each time. Theoretical design is required and complexity cannot be denied. In addition, because of these characteristics, maintenance, expandability, and flexibility are weak, and in the relationship, the classification of items by the master is often managed in one category, and if the intention of the designer is different from the sensitivity of the user Difficulty in taking out

一方でＫＶＳ技術は、データ自体にＫＥＹ（フィールド情報）を持つことで、ＲＤＢに対してテーブル設計の柔軟性を解決している。また、テーブル構造に縛られないことから拡張性も改善している。ただし、単一のフィールドに複数の異なる情報が格納されたり、テーブル内のデータ設計が直感的に行えず、俯瞰的に把握しづらくメンテナンス性も含めて弱い。 On the other hand, the KVS technology solves the flexibility of table design for RDB by having KEY (field information) in the data itself. Also, extensibility is improved because it is not tied to the table structure. However, a plurality of different information is stored in a single field, and the data design in the table cannot be intuitively performed, and it is difficult to grasp from a bird's-eye view.

このようなＲＤＢやＫＶＳを複合的に利用する技術として、例えば特許文献１に示す技術が開示されている。特許文献１に示す技術は、アクセスパターンが存在する場合に、アクセス対象データを第１のデータベース及び第２のデータベースに配置することを決定するデータ保存先決定部と、データ保存先決定部によりアクセス対象データを第１のデータベース及び第２のデータベースに配置する場合に、書込みアクセスが所定回数以上となる期間を判別し、判別した期間に対応する期間のアクセス対象データを第１のデータベースに配置するように移行スケジュールを決定すると共に、読出しアクセスが所定回数以上となる期間を判別し、判別した期間に対応する期間のアクセス対象データを第２のデータベースに配置するように移行スケジュールを決定する移行スケジュール決定部と、を備えるものである。 As a technique of using such RDB and KVS in combination, for example, a technique disclosed in Patent Document 1 is disclosed. The technique shown in Patent Document 1 is accessed by a data storage destination determination unit that determines to arrange access target data in the first database and the second database when an access pattern exists, and a data storage destination determination unit. When the target data is arranged in the first database and the second database, the period in which the write access is a predetermined number of times or more is determined, and the access target data in the period corresponding to the determined period is arranged in the first database. In this way, the migration schedule is determined, the period in which the read access is a predetermined number of times or more is determined, and the migration schedule is determined so that the access target data in the period corresponding to the determined period is arranged in the second database. And a determination unit.

特開２０１２−１０３８４７号公報JP 2012-103847 A

特許文献１に示す技術は、更新に必要な時間に応じてＲＤＢとＫＶＳの使い分け等を行うことで、負荷を分散して運用効率を上げるものであるが、データ移行などの処理が発生するため、管理が複雑になってしまうと共に、メンテナンス性が悪いものになってしまう。 The technique disclosed in Patent Document 1 uses RDB and KVS separately according to the time required for updating, thereby distributing the load and improving operational efficiency. However, processing such as data migration occurs. As a result, the management becomes complicated and the maintainability becomes poor.

本発明は、複数のシステムで管理されている関連性のないデータであっても、複雑なテーブル設計や管理を必要とせずに堅牢なデータ管理を行うと共に、効率的な運用を行うことができるデータ構造等を提供する。 The present invention can perform robust operation and efficient operation without requiring complicated table design and management even for irrelevant data managed by a plurality of systems. Provide data structure etc.

本発明に係るデータ構造は、異なる定義がなされている複数のテーブルから収集したデータを項目名を定義せずに項目ＩＤで管理する万能テーブルと、前記項目ＩＤの中からデータを識別するために最低限必要となる必要項目ＩＤに対して、当該必要項目ＩＤに格納されているデータのみを抽出可能とするためのマスク情報が各行単位で格納されたマスクテーブルと、前記万能テーブルの全データから前記マスクテーブルの各行単位に格納されたマスク情報にしたがってデータを抽出し、当該抽出された抽出データのうち同一の抽出データが複数行抽出された場合には、当該抽出された複数の同一データを一つのデータとして管理するマスタテーブルとを備えるものである。 The data structure according to the present invention is a universal table for managing data collected from a plurality of tables having different definitions by item IDs without defining item names, and for identifying data from the item IDs. From the mask table in which mask information for enabling extraction of only the data stored in the necessary item ID is stored for each necessary item ID, and all data in the universal table. When data is extracted according to the mask information stored in units of rows of the mask table, and the same extracted data is extracted from the extracted data in a plurality of rows, the extracted plurality of the same data is And a master table managed as one piece of data.

このように、本発明に係るデータ構造においては、様々に定義されているデータを項目名が定義されていない１つの万能テーブルで管理し、データを識別するために最低限必要となる項目に対して、当該項目に格納されているデータのみを抽出可能とするためのマスク情報（例えば、「０」又は「１」のマスク情報）が各行単位で格納されたマスクテーブルを用意し、万能テーブルの全データからマスクテーブルの各行単位に格納されたマスク情報にしたがってデータを抽出し、当該抽出された同一データを一つのデータとして管理するマスタテーブルを備えるため、マスタテーブルには万能テーブルで管理する全てのデータのマスタとなる情報のみが集約されて管理され、検索等の処理を非常に効率よく行うことが可能になるという効果を奏する。 As described above, in the data structure according to the present invention, variously defined data are managed by a single universal table in which item names are not defined. A mask table in which mask information (for example, mask information of “0” or “1”) for enabling extraction of only the data stored in the item is stored for each row is prepared. Since all the data is provided with a master table that extracts data according to the mask information stored in units of each row of the mask table and manages the extracted same data as one data, the master table is all managed with a universal table Only the information that becomes the master of the data is centralized and managed, and the processing such as search can be performed very efficiently. To.

また、必要に応じてマスタテーブルのデータを検索条件として、万能テーブルを詳細に検索することが可能になり、極めて膨大なビッグデータであっても必要とするデータを効率よく抽出することができるという効果を奏する。 In addition, it is possible to search the universal table in detail using the data in the master table as a search condition as necessary, and the required data can be efficiently extracted even with extremely large amounts of big data. There is an effect.

本発明に係るデータ構造は、前記マスタテーブルのデータに付与されるパターンＩＤと、前記万能テーブルのデータに付与されるローカルＩＤと、前記パターンＩＤ及び前記ローカルＩＤを統合した統合ＩＤを管理するＩＤ管理テーブルを備え、共通の前記パターンＩＤを有するデータを共通の前記統合ＩＤとして管理しつつ、共通の前記ローカルＩＤを有するデータを共通の前記統合ＩＤとして管理するものである。 The data structure according to the present invention includes a pattern ID assigned to the data of the master table, a local ID assigned to the data of the universal table, and an ID for managing an integrated ID obtained by integrating the pattern ID and the local ID. A management table is provided, and data having a common pattern ID is managed as a common integrated ID, while data having a common local ID is managed as a common integrated ID.

このように、本発明に係るデータ構造においては、共通の前記パターンＩＤを有するデータを共通の前記統合ＩＤとして管理しつつ、共通の前記ローカルＩＤを有するデータを共通の前記統合ＩＤとして管理することで、ＩＤのみの管理で名寄せ処理を効率よく行うことができるという効果を奏する。 As described above, in the data structure according to the present invention, the data having the common local ID is managed as the common integrated ID while the data having the common pattern ID is managed as the common integrated ID. Thus, there is an effect that the name identification process can be efficiently performed by managing only the ID.

本発明に係るデータ構造は、前記マスクテーブルのマスク情報に対応する項目ＩＤの項目名を管理する項目定義テーブルを備えるものである。 The data structure according to the present invention includes an item definition table for managing item names of item IDs corresponding to the mask information of the mask table.

このように、本発明に係るデータ構造においては、マスクテーブルのマスク情報に対応する項目ＩＤの項目名を管理する項目定義テーブルを備えるため、万能テーブルのようにあらゆるデータが項目ＩＤのみで管理されて格納されているような場合であっても、管理者や利用者が項目定義テーブルの項目名を参照して、どのようなデータであるかを認識することが可能になるという効果を奏する。 As described above, in the data structure according to the present invention, since the item definition table for managing the item name of the item ID corresponding to the mask information of the mask table is provided, all data is managed only by the item ID as in the universal table. Even in such a case, an administrator or a user can refer to the item name of the item definition table and recognize what kind of data it is.

本発明に係るデータ構造は、前記万能テーブルが、前記異なる定義がなされている複数のテーブルにて格納されているデータ内容の実施に関連する情報を固定項目として各データごとに有するものである。 In the data structure according to the present invention, the universal table has information related to the implementation of the data contents stored in the plurality of tables having the different definitions as a fixed item for each data.

このように、本発明に係るデータ構造においては、万能テーブルが、異なる定義がなされている複数のテーブルにて格納されているデータ内容の実施に関連する情報（例えば、５Ｗ１Ｈ情報のうち１又は複数の情報）を固定項目として各データごとに有することで、運用面で重要な情報として活用することができるという効果を奏する。 As described above, in the data structure according to the present invention, the universal table is information related to the implementation of data contents stored in a plurality of tables having different definitions (for example, one or more of 5W1H information). Information) as a fixed item for each data, there is an effect that it can be used as important information in terms of operation.

本発明に係るデータ構造は、前記マスタテーブルが、前記万能テーブルが有していない情報であって、前記抽出されたデータに関連する情報を付加的に格納するものである。 In the data structure according to the present invention, the master table is information that the universal table does not have, and additionally stores information related to the extracted data.

このように、本発明に係るデータ構造においては、マスタテーブルが、前記万能テーブルが有していない情報であって、前記抽出されたデータに関連する情報を付加的に格納するため、管理者側が付加的な要素を付けて情報を提供することができるという効果を奏する。 As described above, in the data structure according to the present invention, the master table is information that the universal table does not have, and additionally stores information related to the extracted data. There is an effect that information can be provided with additional elements.

本発明に係るデータ生成装置は、前記データ構造を生成するデータ生成装置であって、異なる定義がなされている複数のテーブルからのデータを入力し、入力された入力データを前記万能テーブルに展開する万能テーブル生成手段と、前記マスクテーブルの前記マスク情報にしたがって、前記万能テーブルの全データから必要なデータを抽出する抽出手段と、抽出された抽出データのうち同一の前記抽出データが複数行抽出された場合に、当該抽出された複数の同一データを一つのデータとして前記マスタテーブルに格納するマスタテーブル生成手段とを備えるものである。 A data generation device according to the present invention is a data generation device that generates the data structure, inputs data from a plurality of tables having different definitions, and expands the input data that has been input into the universal table Universal table generation means, extraction means for extracting necessary data from all data of the universal table according to the mask information of the mask table, and a plurality of lines of the same extracted data among the extracted extracted data are extracted And a master table generating means for storing the extracted plurality of the same data as one data in the master table.

このように、本発明に係るデータ生成装置においては、万能テーブルを生成し、マスクテーブルのマスク情報にしたがって必要なデータを抽出し、抽出されたデータのうち複数の同一データを一つのデータとしてマスタテーブルに格納するため、万能テーブルで管理する全てのデータのマスタとなる情報のみをマスタテーブルに集約して管理することができ、検索等の処理を非常に効率よく行うことが可能になるという効果を奏する。 As described above, in the data generation device according to the present invention, the universal table is generated, the necessary data is extracted according to the mask information of the mask table, and a plurality of identical data among the extracted data is mastered as one data. Since it is stored in the table, only the information that becomes the master of all data managed by the universal table can be collected and managed in the master table, and the processing such as search can be performed very efficiently Play.

本発明に係るデータ生成装置は、前記マスタテーブルのデータに付与されるパターンＩＤと、前記万能テーブルのデータに付与されるローカルＩＤと、前記パターンＩＤ及び前記ローカルＩＤを統合した統合ＩＤを管理するＩＤ管理テーブルを生成するＩＤ管理テーブル生成手段と、共通の前記パターンＩＤを有するデータに共通の前記統合ＩＤを付与し、共通の前記ローカルＩＤを有するデータに共通の前記統合ＩＤを付与して統合ＩＤの管理を行う統合ＩＤ管理手段とを備えるものである。 The data generation device according to the present invention manages a pattern ID assigned to the data of the master table, a local ID assigned to the data of the universal table, and an integrated ID obtained by integrating the pattern ID and the local ID. ID management table generating means for generating an ID management table, and adding the common integrated ID to the data having the common pattern ID, and adding the common integrated ID to the data having the common local ID Integrated ID management means for managing IDs.

このように、本発明に係るデータ生成装置においては、共通の前記パターンＩＤを有するデータを共通の前記統合ＩＤとして管理しつつ、共通の前記ローカルＩＤを有するデータを共通の前記統合ＩＤとして管理することで、ＩＤのみの管理で名寄せ処理を効率よく行うことができるという効果を奏する。 As described above, in the data generation device according to the present invention, the data having the common local ID is managed as the common integrated ID while the data having the common pattern ID is managed as the common integrated ID. Thus, there is an effect that the name identification process can be efficiently performed by managing only the ID.

本発明に係るデータ生成装置は、前記マスクテーブルのマスク情報に対応する項目ＩＤの項目名を管理する項目定義テーブルに基づいて、前記万能テーブルから抽出されたデータの対応する項目名を出力する出力手段を備えるものである。 The data generation device according to the present invention outputs an output corresponding item name of data extracted from the universal table based on an item definition table that manages an item name of an item ID corresponding to the mask information of the mask table Means are provided.

このように、本発明に係るデータ生成装置においては、マスクテーブルのマスク情報に対応する項目ＩＤの項目名を管理する項目定義テーブルに基づいて、万能テーブルから抽出されたデータの対応する項目名を出力するため、万能テーブルのようにあらゆるデータが項目ＩＤのみで管理されて格納されているような場合であっても、管理者や利用者が項目定義テーブルの項目名を参照して、どのようなデータであるかを認識することが可能になるという効果を奏する。 As described above, in the data generation device according to the present invention, based on the item definition table that manages the item name of the item ID corresponding to the mask information of the mask table, the corresponding item name of the data extracted from the universal table is obtained. In order to output, even if all data is managed and stored only by item ID as in the universal table, how the administrator or user refers to the item name in the item definition table It is possible to recognize whether the data is correct.

本発明に係るデータ生成装置は、前記万能テーブル生成手段が、前記異なる定義がなされている複数のテーブルにて格納されているデータ内容の実施に関連する情報を固定項目として各データごとに付加するものである。 In the data generation device according to the present invention, the universal table generation means adds information related to the implementation of the data contents stored in the plurality of tables having the different definitions as a fixed item for each data. Is.

このように、本発明に係るデータ生成装置においては、万能テーブルが、異なる定義がなされている複数のテーブルにて格納されているデータ内容の実施に関連する情報（例えば、５Ｗ１Ｈ情報のうち１又は複数の情報）を固定項目として各データごとに有することで、運用面で重要な情報として活用することができるという効果を奏する。 As described above, in the data generation device according to the present invention, the universal table is information related to the implementation of data contents stored in a plurality of tables having different definitions (for example, one or more of 5W1H information) By having each piece of data as a fixed item for each data, there is an effect that it can be used as important information in terms of operation.

本発明に係るデータ生成装置は、前記マスタテーブルに格納され、前記万能テーブルが有していない情報であって、前記抽出されたデータに関連する付加的な情報を出力する出力手段を備えるものである。 The data generation device according to the present invention comprises output means for outputting additional information related to the extracted data that is stored in the master table and is not included in the universal table. is there.

このように、本発明に係るデータ生成装置においては、マスタテーブルが、前記万能テーブルが有していない情報であって、前記抽出されたデータに関連する情報を付加的に格納するため、管理者側が付加的な要素を付けて情報を提供することができるという効果を奏する。 As described above, in the data generation device according to the present invention, the master table is information that the universal table does not have, and additionally stores information related to the extracted data. There is an effect that the side can provide information with additional elements.

本発明に係るデータ検索装置は、前記データ構造を用いたデータ検索装置であって、前記マスタテーブルに格納されたデータを検索条件データとして入力する入力手段と、入力された前記検索条件データに基づいて、当該検索条件データと一致するデータを前記万能テーブルから抽出する抽出手段と、抽出されたデータを出力する出力手段とを備えるものである。 The data search device according to the present invention is a data search device using the data structure, based on input means for inputting data stored in the master table as search condition data, and the input search condition data And extracting means for extracting data that matches the search condition data from the universal table, and output means for outputting the extracted data.

このように、本発明に係るデータ検索装置においては、マスタテーブルに格納されたデータを検索条件データとして入力し、入力された前記検索条件データに基づいて、当該検索条件データと一致するデータを前記万能テーブルから抽出し出力するため、膨大な量のビッグデータであっても、必ずヒットする検索条件が設定され効率的な検索を行うことができるという効果を奏する。 As described above, in the data search device according to the present invention, the data stored in the master table is input as search condition data, and based on the input search condition data, the data that matches the search condition data is Since it is extracted from the universal table and output, even if a huge amount of big data is found, there is an effect that a search condition that always hits is set and an efficient search can be performed.

第１の実施形態に係るデータ生成装置のハードウェア構成図である。It is a hardware block diagram of the data generation apparatus which concerns on 1st Embodiment. 第１の実施形態に係るデータ生成装置の構成を示す機能ブロック図である。It is a functional block diagram which shows the structure of the data generation apparatus which concerns on 1st Embodiment. 第１の実施形態に係る万能テーブル、マスクテーブル及びマスタテーブルのデータ構造を示す図である。It is a figure which shows the data structure of the universal table which concerns on 1st Embodiment, a mask table, and a master table. 第１の実施形態に係るパターンマップ及びＩＤ管理テーブルのデータ構造を示す図である。It is a figure which shows the data structure of the pattern map and ID management table which concern on 1st Embodiment. 万能テーブル及びマスクテーブルの一例を示す図である。It is a figure which shows an example of a universal table and a mask table. 図５のデータを用いて生成されるマスタテーブルの一例を示す図である。It is a figure which shows an example of the master table produced | generated using the data of FIG. マスタデータを用いたデータの検索を示す図である。It is a figure which shows the search of the data using master data. 名寄せにおける第１の処理を示す図である。It is a figure which shows the 1st process in name identification. 名寄せにおける第２の処理を示す図である。It is a figure which shows the 2nd process in name identification. 名寄せにおける第３の処理を示す図である。It is a figure which shows the 3rd process in name identification. 名寄せにおける第４の処理を示す図である。It is a figure which shows the 4th process in name identification. 第１の実施形態に係るテーブル構成を示す図である。It is a figure which shows the table structure which concerns on 1st Embodiment. 第１の実施形態に係るデータ生成装置の動作を示すフローチャートである。It is a flowchart which shows operation | movement of the data generation apparatus which concerns on 1st Embodiment. 第２の実施形態に係るデータ検索装置の構成を示す機能ブロック図である。It is a functional block diagram which shows the structure of the data search device which concerns on 2nd Embodiment. 第２の実施形態に係るデータ検索装置に用いられるデータのテーブル構成を示す図である。It is a figure which shows the table structure of the data used for the data search device which concerns on 2nd Embodiment. 第３の実施形態に係るデータ生成装置の構成を示す機能ブロック図である。It is a functional block diagram which shows the structure of the data generation apparatus which concerns on 3rd Embodiment. 第３の実施形態に係るテーブル構成を示す図である。It is a figure which shows the table structure which concerns on 3rd Embodiment. 項目定義テーブル及びマスタテーブルを結合して項目名を表示する場合の処理を示す図である。It is a figure which shows the process in the case of combining an item definition table and a master table, and displaying an item name. 項目定義テーブルの利用方法を示す図である。It is a figure which shows the utilization method of an item definition table. 第４の実施形態に係るデータ生成装置の構成を示す機能ブロック図である。It is a functional block diagram which shows the structure of the data generation apparatus which concerns on 4th Embodiment. 第４の実施形態に係るテーブル構成を示す図である。It is a figure which shows the table structure which concerns on 4th Embodiment. 固定項目を含む万能テーブルの情報の一例を示す図である。It is a figure which shows an example of the information of the universal table containing a fixed item. 第５の実施形態に係るデータ生成装置の構成を示す機能ブロック図である。It is a functional block diagram which shows the structure of the data generation apparatus which concerns on 5th Embodiment. 第５の実施形態に係るテーブル構成を示す図である。It is a figure which shows the table structure which concerns on 5th Embodiment. 付加情報の具体例を示す図である。It is a figure which shows the specific example of additional information.

以下、本発明の実施の形態を説明する。本実施形態の全体を通して同じ要素には同じ符号を付けている。 Embodiments of the present invention will be described below. Throughout the present embodiment, the same elements are denoted by the same reference numerals.

（本発明の第１の実施形態）
本実施形態に係るデータ構造及びデータ生成装置について、図１ないし図１３を用いて説明する。本実施形態に係るデータ構造は、異なるシステムにおいて個々に定義された複数のデータを一つのテーブルで一元管理すると共に、インデックスにより各データへのアクセスを制御することで、大量のビッグデータであっても小さい容量で効率よくデータ管理ができるものである。 (First embodiment of the present invention)
The data structure and data generation apparatus according to this embodiment will be described with reference to FIGS. The data structure according to the present embodiment is a large amount of big data by centrally managing a plurality of data individually defined in different systems in one table and controlling access to each data by an index. Can manage data efficiently with a small capacity.

図１は、本実施形態に係るデータ生成装置のハードウェア構成図である。図１において、データ生成装置１は、ＣＰＵ１１、ＲＡＭ１２、ＲＯＭ１３、ハードディスク（ＨＤとする）１４、通信Ｉ／Ｆ１５、及び入出力Ｉ／Ｆ１６を備える。ＲＯＭ１３やＨＤ１４には、オペレーティングシステムや各種プログラムが格納されており、必要に応じてＲＡＭ１２に読み出され、ＣＰＵ１１により各プログラムが実行される。生成されたデータやその大元のデータはＨＤ１４に格納され、必要に応じてプログラムによりアクセスすることができる。 FIG. 1 is a hardware configuration diagram of a data generation apparatus according to the present embodiment. In FIG. 1, the data generation device 1 includes a CPU 11, a RAM 12, a ROM 13, a hard disk (HD) 14, a communication I / F 15, and an input / output I / F 16. The ROM 13 and the HD 14 store an operating system and various programs, which are read into the RAM 12 as necessary, and each program is executed by the CPU 11. The generated data and its original data are stored in the HD 14 and can be accessed by a program as necessary.

通信Ｉ／Ｆ１５は、装置間の通信を行うためのインタフェースである。入出力Ｉ／Ｆ１６は、タッチパネル、キーボード、マウス等の入力機器からの入力を受け付けたり、プリンタや画面等にデータを出力するためのインタフェースである。この入出力Ｉ／Ｆ１６は、必要に応じて光磁気ディスク、フロッピーディスク、ＣＤ−Ｒ、ＤＶＤ−Ｒ等のリムーバブルディスク等に対応したドライブを接続することができる。各処理部はバスを介して接続され、情報のやり取りを行う。なお、上記ハードウェアの構成はあくまで一例であり、必要に応じて変更可能である。 The communication I / F 15 is an interface for performing communication between apparatuses. The input / output I / F 16 is an interface for receiving input from an input device such as a touch panel, a keyboard, and a mouse, and outputting data to a printer, a screen, and the like. The input / output I / F 16 can be connected to a drive corresponding to a removable disk such as a magneto-optical disk, a floppy disk, a CD-R, a DVD-R, or the like, if necessary. Each processing unit is connected via a bus and exchanges information. The hardware configuration is merely an example, and can be changed as necessary.

図２は、本実施形態に係るデータ生成装置の構成を示す機能ブロック図である。データ生成装置１は、複数の異なるシステム（１０ａ〜１０ｄ）で管理されているデータを入力するデータ入力部２１を備える。システム１０ａ〜１０ｄは、例えば、医療管理システムであり、システム１０ａが医療機関Ａのシステムデータ、システム１０ｂが医療機関Ｂのシステムデータという言ったように、独立した異なるシステムで利用され、それぞれに異なる定義がなされているシステムデータである。 FIG. 2 is a functional block diagram showing the configuration of the data generation apparatus according to this embodiment. The data generation device 1 includes a data input unit 21 that inputs data managed by a plurality of different systems (10a to 10d). The systems 10a to 10d are, for example, medical management systems, and the system 10a is used in different and independent systems such as the system data of the medical institution A and the system 10b is the system data of the medical institution B. System data that has been defined.

また、データ生成装置１は、データ入力部２１で入力されたデータを万能テーブル記憶部２３に格納する万能テーブル生成部２２と、当該万能テーブル記憶部２３に記憶されたデータを、予め管理者により作成されたマスクテーブル記憶部２４のマスク情報に基づいて抽出するデータ抽出部２５と、抽出されたデータのうち同一のデータについては１つのデータとして纏めてマスタデータを生成し、マスタテーブル記憶部２７に格納するマスタテーブル生成部２６とを備える。ここで、万能テーブル、マスクテーブル、マスタテーブルについて説明する。 In addition, the data generation device 1 includes a universal table generation unit 22 that stores data input by the data input unit 21 in the universal table storage unit 23 and data stored in the universal table storage unit 23 in advance by an administrator. The data extraction unit 25 that extracts based on the mask information of the created mask table storage unit 24 and the same data among the extracted data are collected as one data to generate master data, and the master table storage unit 27 And a master table generating unit 26 for storing the data. Here, the universal table, the mask table, and the master table will be described.

図３は、本実施形態に係る万能テーブル、マスクテーブル及びマスタテーブルのデータ構造を示す図である。図３（Ａ）は万能テーブル、図３（Ｂ）はマスクテーブル、図３（Ｃ）はマスタテーブルの構造を示す。通常フィールド名には、そのフィールドに入れるデータの種類（例えば、氏名、年齢、生年月日等）を示す名称が定義されるが、図３（Ａ）の万能テーブルの場合、どのフィールドにどのデータを入れてもいいように、フィールド名がＳＬＯＴ番号で管理される。つまり、例えば、システム１０ａで利用されていたデータの場合は、ＳＬＯＴ１に「ローカルＩＤ」、ＳＬＯＴ２に「性別」、ＳＬＯＴ３に「氏名」、ＳＬＯＴ４に「住所」が格納されるが、システム１０ｂで利用されていたデータの場合は、ＳＬＯＴ１に「ローカルＩＤ」、ＳＬＯＴ２に「氏名」、ＳＬＯＴ３に「電話番号」、ＳＬＯＴ４に「郵便番号」が格納されるといった形で万能テーブルが形成される。 FIG. 3 is a diagram showing a data structure of the universal table, the mask table, and the master table according to the present embodiment. 3A shows the structure of the universal table, FIG. 3B shows the structure of the mask table, and FIG. 3C shows the structure of the master table. Usually, the field name defines a name indicating the type of data (for example, name, age, date of birth, etc.) to be entered in the field. In the case of the universal table in FIG. The field name is managed by the SLOT number so that it can be entered. That is, for example, in the case of data used in the system 10a, “local ID” is stored in SLOT1, “sex” is stored in SLOT2, “name” is stored in SLOT3, and “address” is stored in SLOT4. In the case of the stored data, the universal table is formed in such a manner that “local ID” is stored in SLOT1, “name” is stored in SLOT2, “telephone number” is stored in SLOT3, and “zip code” is stored in SLOT4.

図３（Ｂ）のマスクテーブルは、データとしては「０」又は「１」の情報のみを有しており、データを識別するために必要となるフィールドに「１」が設定され、万能テーブルの各レコードと掛け合わせることで、必要な情報のみを抽出することができる。例えば、図３（Ｂ）の２レコード目（ＰＰ−ＩＤ＝２）は、ＳＬＯＴ１、５及び６に「１」が設定され他は「０」がマスクデータとして設定されている。つまり、万能テーブルのデータと図３（Ｂ）のＰＰ−ＩＤ＝２のマスクデータを掛け合わせることで、万能テーブルからＳＬＯＴ１、５及び６の情報のみを抽出できる。ただし、このとき、万能テーブルにはあらゆるデータがランダムに格納されているため、図３（Ｂ）のＰＰ−ＩＤ＝２を掛け合わせて抽出されたデータが全て意味のあるデータとはならない。つまり、ＰＰ−ＩＤ＝２のマスク情報によりデータを識別するために必要なデータが必ず含まれているものの、全く関係のないデータ（利用者が必要としていないデータ）も含まれている。 The mask table of FIG. 3B has only information “0” or “1” as data, and “1” is set in a field necessary for identifying the data. By multiplying with each record, only necessary information can be extracted. For example, in the second record (PP-ID = 2) in FIG. 3B, “1” is set in SLOT1, 5 and 6, and “0” is set as mask data in the other records. That is, by multiplying the data of the universal table by the mask data of PP-ID = 2 in FIG. 3B, only the information of SLOT1, 5, and 6 can be extracted from the universal table. However, at this time, since all data is randomly stored in the universal table, the data extracted by multiplying PP-ID = 2 in FIG. 3B is not all meaningful data. That is, although data necessary for identifying the data is always included by the mask information of PP-ID = 2, data irrelevant (data not required by the user) is also included.

図３（Ｃ）のマスタテーブルは、マスクテーブルのマスク情報により抽出されたデータのうち、同一のデータについては１つのデータに纏めてマスタデータとして情報を格納する。例えば、ＰＰ−ＩＤ＝２のマスク情報により、ＳＬＯＴ１＝１１、ＳＬＯＴ５＝患者太郎、ＳＬＯＴ６＝１９７６／１０／１０というデータが複数抽出された場合には、それらのデータは１件のデータとし、１つのＩＤ（以下、ＰＴＮ−ＩＤという）が付与される。なお、万能テーブル、マスクテーブル及びマスタテーブルのより具体的なデータ構造については後述する。 The master table in FIG. 3C stores information as master data by combining the same data into one data among the data extracted by the mask information of the mask table. For example, when a plurality of data such as SLOT1 = 11, SLOT5 = patient Taro, and SLOT6 = 1976/10/10 are extracted by the mask information of PP-ID = 2, those data are regarded as one data. One ID (hereinafter referred to as PTN-ID) is assigned. More specific data structures of the universal table, mask table, and master table will be described later.

このように、万能テーブルに格納されたデータのうち、マスクテーブルのマスク情報で抽出された識別性のある情報を１つのデータに括ることで万能テーブルのマスタデータが生成される。つまり、何千万件に上るような膨大な量のデータを扱う際の検索においては、検索条件の設定が非常に重要となるが、本実施形態のようにマスタデータが作成されることにより、必ずヒットする検索条件としてマスタデータを提供することができ、作業効率を格段に向上させることができる。 In this way, the master data of the universal table is generated by combining the discriminating information extracted from the mask information of the mask table into one data among the data stored in the universal table. In other words, in the search when handling a huge amount of data such as tens of millions, setting of search conditions is very important, but by creating master data as in this embodiment, Master data can be provided as a search condition that always makes a hit, and work efficiency can be greatly improved.

図２に戻って、マスタテーブル生成部２６によりマスタテーブルが生成されると、元のシステム１０ａ〜１０ｄで利用されていたユニーク性を持った識別コードであるＬＯＣＡＬ＿ＩＤ（以下、Ｌ−ＩＤとする）を万能テーブルから抽出し、マスタテーブル生成時に付与されたマスタデータごとのＰＴＮ＿ＩＤをマスタテーブルから抽出し、それらを付け合せたパターンマップをパターンマップ記憶部２９に記憶するパターンマップ生成部２８と、生成されたパターンマップからＩＤを管理するためのＩＤ管理テーブルを作成してＩＤ管理テーブル記憶部３１に格納する統合ＩＤ管理部３０とを備える。 Returning to FIG. 2, when the master table is generated by the master table generation unit 26, LOCAL_ID (hereinafter referred to as L-ID) which is a unique identification code used in the original systems 10a to 10d. Is extracted from the universal table, the PTN_ID for each master data given at the time of generating the master table is extracted from the master table, and a pattern map generated by adding them together is stored in the pattern map storage unit 29. And an integrated ID management unit 30 that creates an ID management table for managing IDs from the pattern map and stores the ID management table in the ID management table storage unit 31.

図４は、本実施形態に係るパターンマップ及びＩＤ管理テーブルのデータ構造を示す図である。図４（Ａ）はパターンマップを示す図、図４（Ｂ）はＩＤ管理テーブルを示す図である。図４（Ａ）に示すＬ−ＩＤはＬＯＣＡＬ＿ＩＤのことであり、元々のシステムで利用されていたユニーク性を持つＩＤである。例えば、医療管理システムで言えば、医療機関コード（医療機関ごとのコード）と診察券コード（患者ごとのコード）を繋げた一意のＩＤである。ＰＴＮ−ＩＤは、マスタテーブルを生成する際に付与された一意のＩＤである。パターンマップでは、Ｌ−ＩＤとＰＴＮ−ＩＤの組み合わせが格納される。 FIG. 4 is a diagram showing the data structure of the pattern map and ID management table according to the present embodiment. 4A shows a pattern map, and FIG. 4B shows an ID management table. L-ID shown in FIG. 4 (A) is LOCAL_ID, which is an ID having uniqueness used in the original system. For example, in the case of a medical management system, it is a unique ID that connects a medical institution code (code for each medical institution) and an examination ticket code (code for each patient). The PTN-ID is a unique ID assigned when the master table is generated. In the pattern map, a combination of L-ID and PTN-ID is stored.

ここで、Ｌ−ＩＤは、システムの上では一意性を持っているが、例えば、医療管理システムにおいて、実際にＬ−ＩＤと患者の関係が１対１とは限らない。つまり、同じ患者が異なる医療機関で診察した場合には、２つのＬ−ＩＤを持っていることとなる。一方、ＰＴＮ−ＩＤについても、システムの上では一意性を持っているが、例えば、医療管理システムにおいて、１つのマスタデータ（例えば、氏名、郵便番号、性別、生年月日）が１人の患者を示しているとは限らない。つまり、結婚して性が変わった人や引越しなどで郵便番号が変わった人は、同じ人物であっても複数の異なるマスタデータを有することとなる。 Here, the L-ID has uniqueness on the system, but, for example, in a medical management system, the relationship between the L-ID and the patient is not always one-to-one. That is, when the same patient is examined at different medical institutions, the patient has two L-IDs. On the other hand, PTN-ID also has uniqueness in the system. For example, in a medical management system, one master data (for example, name, zip code, gender, date of birth) is one patient. It does not always indicate. That is, a person whose sex has changed after marriage or a person whose postal code has changed due to moving or the like has a plurality of different master data even if the person is the same person.

このような、データを同一人物に統合する（以下、名寄せという）ために、図４（Ｂ）のＩＤ管理テーブルが生成される。図４（Ｂ）のＩＤ管理テーブルにおいては、同一のパターンＩＤを有するデータには共通の統合ＩＤが付与され、同一のＰＴＮ−ＩＤを有するデータには共通の統合ＩＤが付与される。つまり、統合ＩＤと、当該統合ＩＤにより識別される人（システムの種類によっては物）は、必ず１対１の関係となる。したがって、同じ統合ＩＤで抽出されたデータは、全て同じ人（又は物）に関連する情報として抽出することが可能となる。なお、統合ＩＤを付与する際の具体的な処理については、詳細を後述する。 In order to integrate such data into the same person (hereinafter referred to as name identification), the ID management table in FIG. 4B is generated. In the ID management table of FIG. 4B, a common integrated ID is assigned to data having the same pattern ID, and a common integrated ID is assigned to data having the same PTN-ID. That is, there is always a one-to-one relationship between the integrated ID and the person (thing depending on the type of system) identified by the integrated ID. Therefore, all the data extracted with the same integrated ID can be extracted as information related to the same person (or thing). Details of the specific process for assigning the integrated ID will be described later.

次に、マスタテーブルを生成するまでの処理とその利用方法を具体例を挙げて説明する。図５は、万能テーブル及びマスクテーブルの一例を示す図である。図５（Ａ）は万能テーブル、図５（Ｂ）はマスクテーブルの一例を示す。万能テーブルには、１〜３行に患者情報、４〜６行に薬剤情報、７〜９行に検査情報が格納されている。マスクテーブルのＰＰ−ＩＤ＝１にはＳＬＯＴ２〜６まで「１」が設定されている。これは、患者情報のマスタデータを作成するために予め管理者により設定されたものであり、ＳＬＯＴ２〜６の情報（具体的には、アカウント、氏名、生年月日、性別、郵便番号）があれば識別可能な患者情報のマスタデータを作成することができる。同様に、ＰＰ−ＩＤ＝２にはＳＬＯＴ２〜５まで「１」が設定され、ＰＰ−ＩＤ＝３にはＳＬＯＴ２、３、５〜７まで「１」が設定されている。 Next, a process until the master table is generated and a method for using the process will be described with a specific example. FIG. 5 is a diagram illustrating an example of the universal table and the mask table. FIG. 5A shows an example of a universal table, and FIG. 5B shows an example of a mask table. In the universal table, patient information is stored in lines 1 to 3, drug information is stored in lines 4 to 6, and examination information is stored in lines 7 to 9. In PP-ID = 1 of the mask table, “1” is set for SLOT 2 to 6. This is set in advance by the administrator in order to create master data of patient information, and includes information on SLOT 2 to 6 (specifically, account, name, date of birth, gender, zip code). Therefore, master data of identifiable patient information can be created. Similarly, “1” is set to SLOT2 to 5 in PP-ID = 2, and “1” is set to SLOT2, 3, and 5-7 in PP-ID = 3.

図６は、図５のデータを用いて生成されるマスタテーブルの一例を示す図である。図５（Ａ）の万能テーブルと図５（Ｂ）のマスクテーブルを掛け合わせ、複数の同一のデータは１つのデータ（１つのＰＴＮ−ＩＤ）として取り扱う。図６において、ＰＰ−ＩＤ＝１のデータは、図５（Ｂ）におけるＰＰ−ＩＤ＝１のマスク情報により作成されたデータである。ここで、上から３行目の「患者花子」は上から２行目の「患者花子」と完全に一致しているので、マスタデータには含まれない。なお、このような場合は、図６に示すようにＰＴＮ−ＩＤを付与しないようにしてもよいし、データ自体を削除してもよい。同様に、１２行目の「患者花子」、１５行目の「目薬Ｂ」、２１行目の「患者花子」、２７行目の「尿検査（タンパク）」は、同一のデータが存在するためＰＴＮ−ＩＤが付与されずデータとしては存在しない。このように、マスタテーブルは、必要最小限のフィールドで抽出された最小のデータが集まったマスタデータとなる。 FIG. 6 is a diagram illustrating an example of a master table generated using the data of FIG. By multiplying the universal table of FIG. 5A and the mask table of FIG. 5B, a plurality of identical data is handled as one data (one PTN-ID). In FIG. 6, data with PP-ID = 1 is data created with the mask information with PP-ID = 1 in FIG. Here, “patient Hanako” in the third row from the top completely matches “patient Hanako” in the second row from the top, and thus is not included in the master data. In such a case, the PTN-ID may not be assigned as shown in FIG. 6, or the data itself may be deleted. Similarly, “patient Hanako” on the 12th line, “eye drops B” on the 15th line, “patient Hanako” on the 21st line, and “urinalysis (protein)” on the 27th line have the same data. No PTN-ID is assigned and there is no data. Thus, the master table is master data in which the minimum data extracted in the minimum necessary fields is collected.

上記のようなマスタデータが作成されることで、データの検索を効率よく行うことが可能となる。図７は、マスタデータを用いたデータの検索を示す図である。ここでは、例えば、利用者が万能テーブルから薬剤情報のデータを検索したいとする。薬剤情報に関するデータのマスク情報はＰＰ−ＩＤ＝２であるため（詳細は他の実施形態で後述するが、マスクテーブルに対応付けて項目名を定義している項目定義テーブルを用いることで、薬剤情報を取得するためのマスク情報がＰＰ−ＩＤ＝２であることを利用者が把握できる）、ＰＰ−ＩＤ＝２のマスク情報で抽出された図７（Ａ）のマスタデータが検索する際の選択肢の対象となる。利用者はこれらの選択肢から図７（Ｂ）に示すように検索条件を指定することで、図７（Ｃ）の行がヒットする。ヒットした行のＰＴＮ−ＩＤから統合ＩＤを特定し、その統合ＩＤのデータを万能テーブルから検索することで、必要なデータを万能テーブルから得ることができる。なお、図７（Ｂ）のＳＬＯＴごとの括弧書きの項目名は、マスクテーブルに対応付けて項目名を定義している項目定義テーブルに定義されている項目名である。 By creating the master data as described above, it is possible to efficiently search for data. FIG. 7 is a diagram showing data search using master data. Here, for example, it is assumed that the user wants to retrieve drug information data from the universal table. Since the mask information of the data related to the drug information is PP-ID = 2 (details will be described later in another embodiment, by using an item definition table in which item names are defined in association with the mask table, the drug information is used. The user can grasp that the mask information for acquiring the information is PP-ID = 2), and the master data of FIG. 7A extracted with the mask information of PP-ID = 2 is searched. Subject to choice. The user designates a search condition from these options as shown in FIG. 7B, and hits the line in FIG. 7C. Necessary data can be obtained from the universal table by specifying the integrated ID from the PTN-ID of the hit line and searching the data of the integrated ID from the universal table. The item name in parentheses for each SLOT in FIG. 7B is an item name defined in the item definition table that defines the item name in association with the mask table.

次に、名寄せの具体的な処理について４つの例を挙げて説明する。図８は、名寄せにおける第１の処理を示す図である。ここでは、万能テーブルの３件目に新たなデータが追加されたものとする（図８（Ａ））。追加されたデータＣは、他の患者情報と異なるため新規にＰＴＮ−ＩＤ（＝３００）が付与される（図８（Ｂ））。Ｌ−ＩＤ（ここでは、Ｌ−ＩＤはＨＯＳＰ＿ＣＤとＬＯＣＡＬ＿ＣＤを繋げたものとする）とＰＴＮ−ＩＤを管理するために、パターンマップに新たにデータＣが追加される（図８（Ｃ））。パターンマップへの追加に伴って、新たに統合ＩＤが付与されてＩＤ管理テーブルにデータＣが追加される（図８（Ｄ））。ＩＤ管理テーブルにおけるデータＡ、Ｂ及びＣは、いずれも異なるＰＴＮ−ＩＤと異なるＬ−ＩＤであるため、それぞれが異なる人物の情報として管理される。 Next, specific processing for name identification will be described with four examples. FIG. 8 is a diagram illustrating a first process in name identification. Here, it is assumed that new data is added to the third item in the universal table (FIG. 8A). Since the added data C is different from other patient information, a new PTN-ID (= 300) is assigned (FIG. 8B). Data C is newly added to the pattern map in order to manage the L-ID (here, L-ID is a connection between HOSP_CD and LOCAL_CD) and PTN-ID (FIG. 8C). Along with the addition to the pattern map, a new integrated ID is assigned and data C is added to the ID management table (FIG. 8D). Since the data A, B, and C in the ID management table are all different PTN-IDs and different L-IDs, they are managed as different person information.

図９は、名寄せにおける第２の処理を示す図である。ここでは、万能テーブルの４件目に新たなデータが追加されたものとする（図９（Ａ））。追加されたデータＤは、データＡの患者情報と完全に一致しているため、新たなＰＴＮ−ＩＤを採番しない（図９（Ｂ））。Ｌ−ＩＤとＰＴＮ−ＩＤを管理するために、パターンマップに新たにデータＣが追加される（図９（Ｃ））。データＡとデータＤは、患者情報が同じであるため、同じＰＴＮ−ＩＤ＝１００としてパターンマップに追加し、それぞれを別管理とする。パターンマップへの追加に伴って、新たに統合ＩＤが付与されてＩＤ管理テーブルにデータＤが追加される（図９（Ｄ））。このとき、既に同一のＰＴＮ−ＩＤ（＝１００）であるデータＡが存在しているため、データＤの統合ＩＤはデータＡと同一のＩＤに取り直しされる（図９（Ｅ））。ＩＤ管理テーブルにおけるデータＡ、Ｄは、それぞれ異なるＬ−ＩＤが付与されているものの、統合ＩＤが同じであるため、それぞれが同一人物の情報として管理される。 FIG. 9 is a diagram illustrating a second process in name identification. Here, it is assumed that new data is added to the fourth case in the universal table (FIG. 9A). Since the added data D completely coincides with the patient information of data A, a new PTN-ID is not assigned (FIG. 9B). In order to manage the L-ID and the PTN-ID, data C is newly added to the pattern map (FIG. 9C). Since data A and data D have the same patient information, they are added to the pattern map as the same PTN-ID = 100, and are separately managed. Along with the addition to the pattern map, a new integrated ID is assigned and data D is added to the ID management table (FIG. 9D). At this time, since data A having the same PTN-ID (= 100) already exists, the integrated ID of data D is re-assigned to the same ID as data A (FIG. 9E). Although the data A and D in the ID management table are assigned different L-IDs but have the same integrated ID, they are managed as information on the same person.

図１０は、名寄せにおける第３の処理を示す図である。ここでは、万能テーブルの５件目に新たなデータが追加されたものとする（図１０（Ａ））。追加されたデータＥは、他の患者情報と異なるため新規にＰＴＮ−ＩＤ（＝４００）が付与される（図１０（Ｂ））。Ｌ−ＩＤとＰＴＮ−ＩＤを管理するために、パターンマップに新たにデータＥが追加される（図１０（Ｃ））。パターンマップへの追加に伴って、新たに統合ＩＤが付与されてＩＤ管理テーブルにデータＥが追加される（図１０（Ｄ））。このとき、既に同一のＰ−ＩＤ（＝４１９９−９９９００１０）であるデータＢが存在しているため、データＥの統合ＩＤはデータＢと同一のＩＤに取り直しされる（図１０（Ｅ））。ＩＤ管理テーブルにおけるデータＢ、Ｅは、それぞれ異なるＰＴＮ−ＩＤが付与されているものの、統合ＩＤが同じであるため、それぞれが同一人物の情報として管理される。 FIG. 10 is a diagram illustrating a third process in name identification. Here, it is assumed that new data is added to the fifth case of the universal table (FIG. 10A). Since the added data E is different from other patient information, a new PTN-ID (= 400) is assigned (FIG. 10B). In order to manage the L-ID and the PTN-ID, data E is newly added to the pattern map (FIG. 10C). Along with the addition to the pattern map, a new integrated ID is assigned and data E is added to the ID management table (FIG. 10D). At this time, since the data B having the same P-ID (= 4199-999010) already exists, the integrated ID of the data E is changed to the same ID as the data B (FIG. 10E). Although the data B and E in the ID management table are assigned different PTN-IDs but have the same integrated ID, they are managed as information on the same person.

図１１は、名寄せにおける第４の処理を示す図である。ここでは、万能テーブルの３件目に新たなデータが追加されたものとする（図１１（Ａ））。追加されたデータＣは、データＢの患者情報と完全に一致しているため、新たなＰＴＮ−ＩＤを採番しない（図１１（Ｂ））。Ｌ−ＩＤとＰＴＮ−ＩＤを管理するために、パターンマップに新たにデータＣが追加される（図１１（Ｃ））。データＢとデータＣは、患者情報が同じであるため、同じＰＴＮ−ＩＤ＝４００となり、また、データＡとデータＣは、Ｌ−ＩＤが同じであるため、同じＬ−ＩＤとしてパターンマップに追加される。パターンマップへの追加に伴って、新たに統合ＩＤが付与されてＩＤ管理テーブルにデータＣが追加される（図１１（Ｄ））。このとき、既に同一のＰＴＮ−ＩＤ（＝４００）であるデータＢが存在しているため、データＣの統合ＩＤはデータＢと同一のＩＤに取り直しされる（図１１（Ｅ））。次に、Ｌ−ＩＤの一致を見た場合に、データＡとデータＣは同じＬ−ＩＤであるため（図１１（Ｆ））、データＣの統合ＩＤをデータＡの統合ＩＤに取り直す（図１１（Ｇ））。さらに、ＰＴＮ−ＩＤの一致を見た場合に、データＢとデータＣのＰＴＮ−ＩＤが一致しているにも関わらず、統合ＩＤが異なっているため（図１１（Ｈ））、データＢの統合ＩＤをデータＣの統合ＩＤに合わせて取り直す（図１１（Ｉ））。 FIG. 11 is a diagram illustrating a fourth process in name identification. Here, it is assumed that new data is added to the third item in the universal table (FIG. 11A). Since the added data C completely matches the patient information of the data B, a new PTN-ID is not assigned (FIG. 11B). In order to manage the L-ID and the PTN-ID, data C is newly added to the pattern map (FIG. 11C). Since data B and data C have the same patient information, the same PTN-ID = 400. Since data L and data C have the same L-ID, they are added to the pattern map as the same L-ID. Is done. Along with the addition to the pattern map, a new integrated ID is assigned and data C is added to the ID management table (FIG. 11D). At this time, since data B having the same PTN-ID (= 400) already exists, the integrated ID of data C is re-assigned to the same ID as data B (FIG. 11E). Next, when the L-ID matches are found, since the data A and the data C are the same L-ID (FIG. 11F), the integrated ID of the data C is changed to the integrated ID of the data A (FIG. 11). 11 (G)). Furthermore, when the PTN-ID matches are found, the integrated IDs are different even though the PTN-IDs of the data B and the data C match (FIG. 11 (H)). The integrated ID is reacquired according to the integrated ID of data C (FIG. 11 (I)).

なお、ここでは、統合ＩＤの取り直しする際に、若いコードを優先して行うこととしている。つまり、図１１（Ｉ）においてデータＢの統合ＩＤをデータＣの統合ＩＤに合わせて取り直したのは、若いコードを優先したからである。ただし、必ずしも若いコードを優先する必要はなく、予め決められたルールに基づく任意の一のコードを優先するようにしてもよい。 Here, when the integrated ID is re-acquired, the young code is preferentially performed. In other words, the reason why the integrated ID of data B is re-acquired in accordance with the integrated ID of data C in FIG. However, it is not always necessary to give priority to the young code, and any one code based on a predetermined rule may be given priority.

次に、本実施形態に係るデータ構造について説明する。図１２は、本実施形態に係るテーブル構成を示す図である。本実施形態において、複数の一般システム４１をＣＳＶ４２に形式に変換し、万能テーブル４３の各ＳＬＯＴに展開する。このとき、一般システム４１におけるどのフィールドがどのＳＬＯＴに対応するかを示した１^ＳＴＭＡＰ４４に基づいて展開処理が行われる。万能テーブル４３を元に、予め管理者が作成したマスクテーブル４５に基づいて、マスタテーブル４６が自動生成される。万能テーブル４３のＬ−ＩＤとマスタテーブル４６のＰＴＮ−ＩＤとから、パターンマップ４７が生成され、このパターンマップ４７に基づいて、名寄せ処理を行った場合の統合ＩＤを管理するＩＤ管理テーブル４８が生成される。 Next, the data structure according to this embodiment will be described. FIG. 12 is a diagram showing a table configuration according to the present embodiment. In this embodiment, a plurality of general systems 41 are converted into a CSV 42 format and developed in each SLOT of the universal table 43. At this time, the expansion process is performed based on the 1 ^ST MAP 44 indicating which field in the general system 41 corresponds to which SLOT. Based on the universal table 43, a master table 46 is automatically generated based on a mask table 45 created in advance by an administrator. A pattern map 47 is generated from the L-ID of the universal table 43 and the PTN-ID of the master table 46. Based on this pattern map 47, an ID management table 48 for managing an integrated ID when name identification processing is performed. Generated.

次に、本実施形態に係るデータ生成装置の動作について説明する。図１３は、本実施形態に係るデータ生成装置の動作を示すフローチャートである。まず、データ入力部２１が、複数のシステム１０ａ〜１０ｄからシステムデータを入力し（Ｓ１）、万能テーブル生成部２２が、入力されたシステムデータをＣＳＶに変換し、１^ＳＴＭＡＰ４４に基づいて万能テーブル４３を生成し、万能テーブル記憶部２３に格納する（Ｓ２）。データ抽出部２５が、予め作成されたマスクテーブル４５をマスクテーブル記憶部２４から読み出し、マスク情報にしたがって万能テーブル４３の情報を抽出する（Ｓ３）。マスタテーブル生成部２６は、抽出された情報のうち、同一のデータについては同一のＰＴＮ−ＣＤで管理するようにマスタテーブル４６を生成し、マスタテーブル記憶部２７に格納する（Ｓ４）。 Next, the operation of the data generation apparatus according to this embodiment will be described. FIG. 13 is a flowchart showing the operation of the data generation apparatus according to this embodiment. First, the data input unit 21 inputs system data from a plurality of systems 10a to 10d (S1), and the universal table generation unit 22 converts the input system data into CSV, and the universal table based on 1 ^ST MAP 44 43 is generated and stored in the universal table storage unit 23 (S2). The data extraction unit 25 reads the mask table 45 created in advance from the mask table storage unit 24, and extracts information from the universal table 43 according to the mask information (S3). The master table generation unit 26 generates a master table 46 so that the same data among the extracted information is managed by the same PTN-CD, and stores the master table 46 in the master table storage unit 27 (S4).

パターンマップ生成部２８が、万能テーブル記憶部２４に記憶された万能テーブル４３のＬ−ＩＤとマスタテーブル記憶部２７に記憶されたマスタテーブル４６のＰＴＮ−ＩＤとを組み合わせて対応付けたパターンマップ４７を生成し、パターンマップ記憶部２９に格納する（Ｓ５）。統合ＩＤ管理部３０が、パターンマップ４７に基づいて上記で説明した名寄せ処理を行い、統合ＩＤを付与したＩＤ管理テーブル４８を作成し、ＩＤ管理テーブル記憶部３１に格納して（Ｓ６）、処理を終了する。 The pattern map generating unit 28 combines the L-ID of the universal table 43 stored in the universal table storage unit 24 with the PTN-ID of the master table 46 stored in the master table storage unit 27 in association with each other. Is generated and stored in the pattern map storage unit 29 (S5). The integrated ID management unit 30 performs the name identification process described above based on the pattern map 47, creates an ID management table 48 to which an integrated ID is assigned, stores the ID management table 48 in the ID management table storage unit 31 (S6), and performs processing Exit.

このように、本実施形態に係るデータ生成装置及びデータ構造においては、様々に定義されているデータを項目名が定義されていない１つの万能テーブルで管理し、データを識別するために最低限必要となる項目に対して、当該項目に格納されているデータのみを抽出可能とするためのマスク情報が各行単位で格納されたマスクテーブルを用意し、万能テーブルの全データからマスクテーブルの各行単位に格納されたマスク情報にしたがってデータを抽出し、当該抽出された同一データを一つのデータとして管理するマスタテーブルを備えるため、マスタテーブルには万能テーブルで管理する全てのデータのマスタとなる情報のみが集約されて管理され、検索等の処理を非常に効率よく行うことが可能になる。 As described above, in the data generation device and the data structure according to the present embodiment, variously defined data are managed by a single universal table in which item names are not defined, and are at least necessary for identifying the data. For each item, prepare a mask table that stores the mask information for each row so that only the data stored in the item can be extracted. Since the master table includes a master table that extracts data according to the stored mask information and manages the extracted same data as one data, the master table has only information that becomes a master of all data managed by the universal table. It is managed in a centralized manner, and processing such as search can be performed very efficiently.

また、必要に応じてマスタテーブルのデータを検索条件として、万能テーブルを詳細に検索することが可能になり、極めて膨大なビッグデータであっても必要とするデータを効率よく抽出することができる。 Moreover, it becomes possible to search the universal table in detail using the data in the master table as a search condition as required, and the required data can be efficiently extracted even with extremely large amounts of big data.

さらに、共通のＰＴＮ−ＩＤを有するデータを共通の統合ＩＤとして管理しつつ、共通のＬ−ＩＤを有するデータを共通の統合ＩＤとして管理することで、ＩＤのみの管理で名寄せ処理を効率よく行うことができる。 Further, by managing data having a common PTN-ID as a common integrated ID and managing data having a common L-ID as a common integrated ID, name identification processing can be efficiently performed by managing only the ID. be able to.

（本発明の第２の実施形態）
本実施形態に係るデータ検索装置について、図１４及び図１５を用いて説明する。本実施形態に係るデータ検索装置は、上記第１の実施形態におけるデータ構造及び生成されたデータから所定のデータを検索するものである。
なお、本実施形態において、前記第１の実施形態と重複する説明は省略する。 (Second embodiment of the present invention)
A data search apparatus according to this embodiment will be described with reference to FIGS. 14 and 15. The data search device according to the present embodiment searches for predetermined data from the data structure and generated data in the first embodiment.
In addition, in this embodiment, the description which overlaps with the said 1st Embodiment is abbreviate | omitted.

図１４は、本実施形態に係るデータ検索装置の構成を示す機能ブロック図である。本実施形態に係るデータ検索装置５０は、前記第１の実施形態に係るデータ生成装置１のマスタテーブル４６に記憶されたマスタ情報に基づく検索条件情報５１を入力する条件情報入力部５２と、入力された検索条件に一致するデータの統合ＩＤをＩＤ管理テーブル４８から抽出し、抽出された統合ＩＤのデータを万能テーブル４３から抽出する検索部５３と、検索部５３が抽出した結果を例えばディスプレイに出力する出力部５４とを備える。 FIG. 14 is a functional block diagram showing the configuration of the data search apparatus according to this embodiment. The data search device 50 according to the present embodiment includes a condition information input unit 52 that inputs search condition information 51 based on master information stored in the master table 46 of the data generation device 1 according to the first embodiment, and an input The integrated ID of the data that matches the retrieved search condition is extracted from the ID management table 48, the retrieved unit 53 that extracts the extracted integrated ID data from the universal table 43, and the result extracted by the retrieved unit 53 is displayed on a display, for example. And an output unit 54 for outputting.

マスタテーブル４６には、検索対象となる万能テーブル４３のマスタ情報（必要最小限の情報）が記憶されているため、このマスタ情報を検索条件情報として利用することで、膨大なデータから必要な情報の絞り込みを行い易くなる。そして、マスタ情報に基づく検索条件から統合ＩＤを割り出し、当該統合ＩＤに該当するデータを万能テーブル４３から抽出することで、絞り込んだデータからより詳細なデータを効率よく検索することができる。 The master table 46 stores master information (minimum necessary information) of the universal table 43 to be searched. By using this master information as search condition information, necessary information can be obtained from a large amount of data. It becomes easy to narrow down. Then, by determining the integrated ID from the search condition based on the master information and extracting the data corresponding to the integrated ID from the universal table 43, more detailed data can be efficiently searched from the narrowed down data.

図１５は、本実施形態に係るデータ検索装置に用いられるデータのテーブル構成を示す図である。設定された検索条件情報に基づいて、ＩＤ管理テーブル４８から統合ＩＤが抽出され、抽出された統合ＩＤにより結合された万能テーブル４３のデータが検索結果情報として出力される。 FIG. 15 is a diagram illustrating a data table configuration used in the data search apparatus according to the present embodiment. Based on the set search condition information, the integrated ID is extracted from the ID management table 48, and the data of the universal table 43 combined by the extracted integrated ID is output as the search result information.

このように、本実施形態に係るデータ検索装置においては、マスタテーブルに格納されたデータを検索条件データとして入力し、入力された前記検索条件データに基づいて、当該検索条件データと一致するデータを前記万能テーブルから抽出し出力するため、膨大な量のビッグデータであっても、必ずヒットする検索条件が設定され効率的な検索を行うことができる。 As described above, in the data search device according to the present embodiment, data stored in the master table is input as search condition data, and data matching the search condition data is input based on the input search condition data. Since the universal table is extracted and output, even if a large amount of big data is found, a search condition that always hits is set and an efficient search can be performed.

（本発明の第３の実施形態）
本実施形態に係るデータ構造及びデータ生成装置について、図１６ないし図１９を用いて説明する。本実施形態に係るデータ生成装置は、マスクテーブル４５のマスク情報に対応するＳＬＯＴのＳＬＯＴ名を管理する項目定義テーブルを備えるものである。
なお、本実施形態において、前記各実施形態と重複する説明は省略する。 (Third embodiment of the present invention)
The data structure and data generation apparatus according to this embodiment will be described with reference to FIGS. The data generation apparatus according to the present embodiment includes an item definition table that manages the SLOT name of the SLOT corresponding to the mask information of the mask table 45.
In addition, in this embodiment, the description which overlaps with each said embodiment is abbreviate | omitted.

図１６は、本実施形態に係るデータ生成装置の構成を示す機能ブロック図、図１７は、本実施形態に係るテーブル構成を示す図である。図１６において、第２の実施形態における図１４と異なるのは、マスクテーブル記憶部２４に対応付けられた項目定義テーブルを記憶する項目定義テーブル記憶部３２を備えることである。また、図１７において、第２の実施形態における図１５と異なるのは、マスクテーブル４５に対応する項目定義テーブル４９を備えることである。 FIG. 16 is a functional block diagram illustrating a configuration of the data generation apparatus according to the present embodiment, and FIG. 17 is a diagram illustrating a table configuration according to the present embodiment. 16 is different from FIG. 14 in the second embodiment in that an item definition table storage unit 32 that stores an item definition table associated with the mask table storage unit 24 is provided. 17 is different from FIG. 15 in the second embodiment in that an item definition table 49 corresponding to the mask table 45 is provided.

項目定義テーブル４９は、万能テーブル４３のＳＬＯＴ番号に意味のある情報を対応付けるためのテーブルであり、マスクテーブル４５のマスク情報により対応付けがなされている。以下に、項目定義テーブル４９を用いた項目名の管理方法について具体的に説明する。図１８は、項目定義テーブル及びマスタテーブルを結合して項目名を表示する場合の処理を示す図である。図１８（Ａ）は項目定義テーブルの一例を示し、図１８（Ｂ）はマスタ情報を抽出する際の汎用画面の一例を示す。図１８（Ａ）に示すように、マスクテーブル４５のＰＰ−ＩＤに応じて項目名が各ＳＬＯＴごとに定義されている。ここでは、ＰＰ−ＩＤ＝１は患者情報、ＰＰ−ＩＤ＝２は薬剤情報、ＰＰ−ＩＤ＝３は検査情報を示しており、患者情報について検索する場合には、図１８（Ｂ）に示すように、ＰＰ−ＩＤ＝１の情報を列名に表示することで、利用者は、マスタテーブルにおける患者情報のＳＬＯＴの中身の意味を認識することができ、使い勝手がよくなる。 The item definition table 49 is a table for associating meaningful information with the SLOT number of the universal table 43, and is associated with the mask information of the mask table 45. The item name management method using the item definition table 49 will be specifically described below. FIG. 18 is a diagram illustrating a process when the item definition table and the master table are combined to display the item name. FIG. 18A shows an example of an item definition table, and FIG. 18B shows an example of a general-purpose screen when extracting master information. As shown in FIG. 18A, item names are defined for each SLOT in accordance with the PP-ID of the mask table 45. Here, PP-ID = 1 indicates patient information, PP-ID = 2 indicates drug information, and PP-ID = 3 indicates examination information. When searching for patient information, FIG. 18B shows. Thus, by displaying the information of PP-ID = 1 in the column name, the user can recognize the meaning of the SLOT contents of the patient information in the master table, and the usability is improved.

図１９は、項目定義テーブルの利用方法を示す図である。図１９（Ａ）は項目定義テーブル４９の項目名が列名として表示された汎用画面を示し、図１９（Ｂ）は検索条件の設定及び検索結果を示している。図１９に示すように、利用者は、項目定義テーブルに記憶されている項目名を意識した検索条件を設定することで、必要な情報を抽出することができる。すなわち、利用者が項目名を意識することができることで、様々な情報が混在する一元化されたテーブルから必要な情報を抽出するための適切な抽出条件を効率よく設定することができる。 FIG. 19 is a diagram illustrating a method of using the item definition table. FIG. 19A shows a general-purpose screen on which item names in the item definition table 49 are displayed as column names, and FIG. 19B shows setting of search conditions and search results. As shown in FIG. 19, the user can extract necessary information by setting a search condition in consideration of the item names stored in the item definition table. That is, since the user can be aware of the item name, it is possible to efficiently set an appropriate extraction condition for extracting necessary information from a unified table in which various pieces of information are mixed.

このように、本発明に実施形態に係るデータ生成装置においては、マスクテーブルのマスク情報に対応する項目ＩＤの項目名を管理する項目定義テーブルを備えるため、万能テーブルのようにあらゆるデータが項目ＩＤのみで管理されて格納されているような場合であっても、管理者や利用者が項目定義テーブルの項目名を参照して、どのようなデータであるかを認識することが可能になり、操作性や作業効率を上げることができる。 As described above, in the data generation device according to the embodiment of the present invention, since the item definition table for managing the item name of the item ID corresponding to the mask information of the mask table is provided, all data is stored in the item ID as in the universal table. Even if it is managed and stored only by the administrator, users and users can refer to the item name in the item definition table and recognize what kind of data it is, Operability and work efficiency can be improved.

（本発明の第４の実施形態）
本実施形態に係るデータ構造及びデータ生成装置について、図２０ないし図２２を用いて説明する。本実施形態に係るデータ構造は、万能テーブルが、固定項目として各データごとに、例えば５Ｗ１Ｈの情報を有するものである。
なお、本実施形態において、前記各実施形態と重複する説明は省略する。 (Fourth embodiment of the present invention)
The data structure and data generation apparatus according to this embodiment will be described with reference to FIGS. In the data structure according to the present embodiment, the universal table has, for example, 5W1H information for each data as a fixed item.
In addition, in this embodiment, the description which overlaps with each said embodiment is abbreviate | omitted.

図２０は、本実施形態に係るデータ生成装置の構成を示す機能ブロック図、図２１は、本実施形態に係るテーブル構成を示す図である。図２０において、第３の実施形態における図１６と異なるのは、データ入力部２１が、システム１０ａ〜１０ｄにて格納されている情報内容の実施に関連する５Ｗ１Ｈ情報（いつ、何処で、誰が（誰に）、何を、なぜ、どのように）のうち１又は複数の情報を入力し、万能テーブルを生成する際に、万能テーブル生成部２２が各データごとに５Ｗ１Ｈ情報のうち１又は複数の情報を固定項目として付加することである。また、図２１において、第３の実施形態における図１７と異なるのは、一般システム４１からＣＳＶ４２を生成する際に、ＣＳＶ４２に各データごとの５Ｗ１Ｈの情報を固定項目として追加されていることである。 FIG. 20 is a functional block diagram showing the configuration of the data generation apparatus according to this embodiment, and FIG. 21 is a diagram showing the table configuration according to this embodiment. In FIG. 20, the data input unit 21 is different from FIG. 16 in the third embodiment in that the 5W1H information (when, where, who ( When one or a plurality of information is input to (who), what, why, how) and a universal table is generated, the universal table generator 22 generates one or more of 5W1H information for each data. It is to add information as a fixed item. 21 is different from FIG. 17 in the third embodiment in that, when the CSV 42 is generated from the general system 41, 5W1H information for each data is added to the CSV 42 as a fixed item. .

図２２は、固定項目を含む万能テーブルの情報の一例を示す図である。図２２（Ａ）に示すように、一般システム４０に格納されている「日時」〜「パターン」の情報を５Ｗ１Ｈ情報として抽出し、図２２（Ｂ）に示すように、固定項目として万能テーブル４３に付加する。このような固定項目は、各ＳＬＯＴの項目に関係なく、常に固定的に万能テーブルに付加される情報であり、このような情報を保持することで検索処理の幅を広げることができると共に、何か問題が起こった場合に、その原因の究明等に利用することができる。 FIG. 22 is a diagram illustrating an example of information in the universal table including fixed items. As shown in FIG. 22A, the information of “date and time” to “pattern” stored in the general system 40 is extracted as 5W1H information, and as shown in FIG. Append to Such a fixed item is information that is always fixedly added to the universal table regardless of each SLOT item. By holding such information, the range of search processing can be expanded, and what If a problem occurs, it can be used to investigate the cause.

より具体的には、例えば、「実施日時」、「場所（排他的な要素のない場所）」、「対象者（万能テーブルに格納される情報の内容が実施される者）」、「実施者（万能テーブルに格納される情報の内容を実施する者）」、「管理者（万能テーブルに格納される情報の内容の実施を指示した者）」が固定項目として付加される。そして、情報の内容については、各ＳＬＯＴに展開された情報であり、これは行為情報と言うことができる。つまり、５Ｗ１Ｈ情報のような固定項目が同じであれば（例えば、数日に亘って同じ実施行為を繰り返して行ったような場合）、実際の行為（例えば、具体的な日付）を指定することが可能となり、データの上書きや削除の検索対象とすることができる。 More specifically, for example, “implementation date / time”, “place (place without an exclusive element)”, “target person (person who carries out the contents of information stored in the universal table)”, “implementer” (Person who executes the contents of the information stored in the universal table) ”and“ Manager (person who instructed the execution of the contents of the information stored in the universal table) ”are added as fixed items. The contents of the information are information developed in each SLOT, which can be called action information. That is, if the fixed item such as 5W1H information is the same (for example, when the same implementation action is repeated for several days), the actual action (for example, a specific date) is designated. It is possible to search for overwriting or deleting data.

このように、本実施形態に係るデータ構造においては、万能テーブルが、固定項目として各データごとに５Ｗ１Ｈのうち１又は複数の情報を有するため、運用面で重要な情報として活用することができる。 Thus, in the data structure according to the present embodiment, the universal table has one or a plurality of pieces of information out of 5W1H for each piece of data as a fixed item, so that it can be used as important information in terms of operation.

（本発明の第５の実施形態）
本実施形態に係るデータ構造及びデータ生成装置について、図２３ないし図２５を用いて説明する。本実施形態に係るデータ構造は、マスタテーブルのマスタ情報に、必要に応じて付加情報を追加したものである。
なお、本実施形態において、前記各実施形態と重複する説明は省略する。 (Fifth embodiment of the present invention)
The data structure and data generation apparatus according to this embodiment will be described with reference to FIGS. The data structure according to the present embodiment is obtained by adding additional information to the master information of the master table as necessary.
In addition, in this embodiment, the description which overlaps with each said embodiment is abbreviate | omitted.

図２３は、本実施形態に係るデータ生成装置の構成を示す機能ブロック図、図２４は、本実施形態に係るテーブル構成を示す図である。図２３において、第１の実施形態における図２と異なるのは、マスタテーブル記憶部２７に記憶されたマスタテーブル４６に、マスタテーブル生成部２６が生成したマスタ情報以外の付加情報を追加されていることである。また、図２４において、第１の実施形態における図１２と異なるのは、マスタテーブル４６に付加情報が追加され、検索情報としてそれらの情報が活用可能となっていることである。 FIG. 23 is a functional block diagram showing a configuration of the data generation apparatus according to the present embodiment, and FIG. 24 is a diagram showing a table configuration according to the present embodiment. 23 is different from FIG. 2 in the first embodiment in that additional information other than the master information generated by the master table generating unit 26 is added to the master table 46 stored in the master table storing unit 27. That is. 24 is different from FIG. 12 in the first embodiment in that additional information is added to the master table 46 and the information can be used as search information.

図２５は、付加情報の具体例を示す図である。図２５（Ａ）は、一般システム４１から提供されたデータであり、万能テーブル４３に格納される情報の一例を示し、図２５（Ｂ）は図２５（Ａ）で提供されたデータについて既知の情報を格納したテーブルの一例を示し、図２５（Ｃ）は最終的に提供される情報の項目を示している。図２５（Ｂ）の既知の情報を格納したテーブルが付加情報に相当し、中身は１つのマスタ情報に特有の情報である。例えば、胃薬Ａに関して既に一般的に知られていて容易に取得することができる情報として、「薬効」、「禁忌」、「販売元」といった情報があり、これらの情報は一般システム４１において管理されていなくても、管理者が別途付加情報として追加することで、ユーザへの提供をより充実化することができる。 FIG. 25 is a diagram illustrating a specific example of additional information. FIG. 25A shows data provided from the general system 41, and shows an example of information stored in the universal table 43. FIG. 25B shows the data provided in FIG. An example of a table storing information is shown, and FIG. 25C shows information items finally provided. The table storing known information in FIG. 25B corresponds to additional information, and the contents are information unique to one piece of master information. For example, there are information such as “medicinal effect”, “inhibition”, and “sales source” as information that is already generally known and can be easily acquired with respect to the stomach medicine A, and these information are managed in the general system 41. Even if it is not, the administrator can add the additional information separately so that the provision to the user can be further enhanced.

付加情報を追加した場合の各テーブルにおける最終的な管理項目は、図２５（Ｃ）に示す通りであり、万能テーブル４３に格納される情報は、図２５（Ａ）に示すように「種別」、「薬剤名称」、「薬効１」、「薬効２」、「処方量１」、「処方量２」、「処方量３」である。マスタテーブル４６に格納される情報は、一意性を示す最小のパターンであるため、「種別」、「薬剤名称」、「薬効１」、「薬効２」の４項目である。そして、最終的に検索情報として提供されるテーブルは、万能テーブル４３に付加情報の項目が追加されたものであり、図２５（Ｃ）の「種別」〜「販売」までの全ての項目を有しているものとなる。このとき、各項目の項目名は、項目定義テーブル４９による定義付けられるようにしてもよい。 The final management items in each table when additional information is added are as shown in FIG. 25C, and the information stored in the universal table 43 is “type” as shown in FIG. “Medicine name”, “medicine effect 1”, “medicine effect 2”, “prescription amount 1”, “prescription amount 2”, “prescription amount 3”. Since the information stored in the master table 46 is a minimum pattern indicating uniqueness, there are four items of “type”, “drug name”, “medicine effect 1”, and “medicine effect 2”. The table finally provided as search information is obtained by adding additional information items to the universal table 43, and has all items from “type” to “sales” in FIG. It will be what you are doing. At this time, the item name of each item may be defined by the item definition table 49.

このように、本実施形態に係るデータ構造においては、マスタテーブルが、万能テーブルが有していない情報であって、抽出されたデータに関連する情報を付加的に格納するため、管理者側が付加的な要素を付けて情報を提供することができる。 As described above, in the data structure according to the present embodiment, the master table is information that the universal table does not have and additionally stores information related to the extracted data. Information can be provided with specific elements.

なお、上記各実施形態において、万能テーブル４３に格納されるデータが完全に揃っていることを前提として、マスタテーブルが作成される処理を説明したが、元々の一般システム４１において、必要なデータが不足していることが多々ある。その場合は、Ｌ−ＩＤとＰＴＮ−ＩＤとの関係を示すパターンマップ４７を手動で生成するようにしてもよい。そうすることで、適正なマスタテーブル４６を生成することができる。 In each of the above embodiments, the process of creating the master table has been described on the assumption that the data stored in the universal table 43 is completely prepared. However, in the original general system 41, the necessary data is There are many things that are lacking. In that case, the pattern map 47 indicating the relationship between the L-ID and the PTN-ID may be manually generated. By doing so, an appropriate master table 46 can be generated.

また、上記各実施形態における各構成要件は、任意に組み合わせることが可能であり、利用環境に応じたシステム構築が可能である。 Moreover, each component in each said embodiment can be combined arbitrarily, and the system construction according to a use environment is possible.

１データ生成装置
１０ａ〜１０ｄシステム
１１ＣＰＵ
１２ＲＡＭ
１３ＲＯＭ
１４ＨＤ
１５通信Ｉ／Ｆ
１６入出力Ｉ／Ｆ
２１データ入力部
２２万能テーブル生成部
２３万能テーブル記憶部
２４マスクテーブル記憶部
２５データ抽出部
２６マスタテーブル生成部
２７マスタテーブル記憶部
２８パターンマップ生成部
２９パターンマップ記憶部
３０統合ＩＤ管理部
３１ＩＤ管理テーブル記憶部
３２項目定義テーブル記憶部
４１一般システム
４２ＣＳＶ
４３万能テーブル
４４１^ＳＴＭＡＰ
４５マスクテーブル
４６マスタテーブル
４７パターンマップ
４８ＩＤ管理テーブル
４９項目定義テーブル
５０データ検索装置
５１検索条件情報
５２条件情報入力部
５３検索部
５４出力部 1 data generation device 10a to 10d system 11 CPU
12 RAM
13 ROM
14 HD
15 Communication I / F
16 Input / output I / F
21 Data Input Unit 22 Universal Table Generation Unit 23 Universal Table Storage Unit 24 Mask Table Storage Unit 25 Data Extraction Unit 26 Master Table Generation Unit 27 Master Table Storage Unit 28 Pattern Map Generation Unit 29 Pattern Map Storage Unit 30 Integrated ID Management Unit 31 ID Management table storage unit 32 Item definition table storage unit 41 General system 42 CSV
43 Universal table 44 1 ^ST MAP
45 Mask Table 46 Master Table 47 Pattern Map 48 ID Management Table 49 Item Definition Table 50 Data Retrieval Device 51 Search Condition Information 52 Condition Information Input Unit 53 Search Unit 54 Output Unit

Claims

A universal table that manages data collected from multiple tables with different definitions by item ID without defining item names;
For each necessary item ID that is at least necessary for identifying data from among the item IDs, mask information for enabling extraction of only data stored in the necessary item ID is stored for each row. Mask table,
When data is extracted from all data of the universal table according to mask information stored in units of rows of the mask table, and when the same extracted data is extracted from the extracted data, a plurality of rows are extracted. And a master table for managing a plurality of identical data as one data.

The data structure of claim 1, wherein
An ID management table for managing a pattern ID given to the data of the master table, a local ID given to the data of the universal table, and an integrated ID obtained by integrating the pattern ID and the local ID;
A data structure characterized by managing data having a common local ID as a common integrated ID while managing data having a common pattern ID as the common integrated ID.

The data structure according to claim 1 or 2,
A data structure comprising an item definition table for managing item names of item IDs corresponding to mask information of the mask table.

The data structure according to any one of claims 1 to 3,
A data structure, wherein the universal table has information related to the implementation of data contents stored in the plurality of tables having the different definitions as fixed items for each data.

The data structure according to any one of claims 1 to 4,
A data structure, wherein the master table additionally stores information related to the extracted data, which is information that the universal table does not have.

A data generation device for generating the data structure according to claim 1,
Universal table generation means for inputting data from a plurality of tables having different definitions and expanding the input data into the universal table;
Extracting means for extracting necessary data from all data of the universal table according to the mask information of the mask table;
Master table generating means for storing the extracted plurality of the same data as one data in the master table when the same extracted data is extracted from a plurality of lines of the extracted data. A data generation device.

The data generation device according to claim 6, wherein a pattern ID assigned to the data of the master table, a local ID assigned to the data of the universal table, and an integrated ID obtained by integrating the pattern ID and the local ID are managed. ID management table generation means for generating an ID management table to be performed;
An integrated ID management unit that assigns the common integrated ID to the data having the common pattern ID, and assigns the common integrated ID to the data having the common local ID to manage the integrated ID; A data generation device characterized by the above.

In the data generation device according to claim 6 or 7,
Data comprising output means for outputting a corresponding item name of data extracted from the universal table based on an item definition table for managing an item name of an item ID corresponding to the mask information of the mask table Generator.

The data generation device according to any one of claims 6 to 8,
The universal table generation means adds information related to the implementation of data contents stored in a plurality of tables having different definitions as a fixed item for each data.

The data generation device according to any one of claims 6 to 9,
A data generation apparatus comprising: output means for outputting additional information related to the extracted data that is stored in the master table and is not included in the universal table.

A data search device using the data structure according to claim 1,
Input means for inputting the data stored in the master table as search condition data;
Based on the input search condition data, extraction means for extracting data that matches the search condition data from the universal table;
A data search apparatus comprising: output means for outputting the extracted data.

A data generation method for generating the data structure according to claim 1,
Computer
A universal table generation step of inputting data from a plurality of tables having different definitions and expanding the input data into the universal table;
An extraction step of extracting necessary data from all data of the universal table according to the mask information of the mask table;
A master table generating step of storing the extracted plurality of the same data as one data in the master table when the same extracted data is extracted from a plurality of lines of the extracted data. Characteristic data generation method.

A data generation program for causing a computer to function to generate the data structure according to claim 1,
A universal table generating means for inputting data from a plurality of tables having different definitions and expanding the input data into the universal table;
Extraction means for extracting necessary data from all data of the universal table according to the mask information of the mask table;
When a plurality of the same extracted data is extracted from the extracted data, the computer is caused to function as a master table generating means for storing the extracted plurality of the same data as one data in the master table. A data generation program characterized by