JP6328768B2

JP6328768B2 - Metadata automation system

Info

Publication number: JP6328768B2
Application number: JP2016540299A
Authority: JP
Inventors: ブライアン・バーンズ; クリストファー・マツァンティ; リチャード・ムカンバー; ジェレミー・ミラー
Original assignee: トランスメドシステムズ，インコーポレイテッド
Priority date: 2013-09-06
Filing date: 2014-09-01
Publication date: 2018-05-23
Anticipated expiration: 2034-09-01
Also published as: HK1225826A1; EP3042354B1; EP3042354A1; US20150074149A1; EP3979091A1; WO2015034795A1; US9626388B2; CN105706084B; AU2014101659A4; AU2014315432A1; ES2905081T3; JP2016530646A; CN105706084A

Description

本発明の実施の形態は、一般的にコンピュータデータベースに関し、特に、定義付け、創作、変更、変換、およびポピュレーションを含むデータ構造の管理の容易化に関する。 Embodiments of the present invention relate generally to computer databases, and more particularly to facilitating management of data structures including definition, creation, modification, transformation, and population.

情報の自動処理は、ビジネスに多大な利益をもたらしてきた。なぜなら、意志決定者が判断をする際のあらゆる過程で、それは有効性および効率を大いに改善してきたからだ。政府、商業的ビジネス、非営利組織を問わず全ての事業は、情報を管理する業務上の必要がある。 Automatic processing of information has provided tremendous benefits to the business. Because every process of decision-making by decision-makers has greatly improved effectiveness and efficiency. All businesses, whether government, commercial business, or non-profit organizations, have a business need to manage information.

情報は、例えば商業的ビジネスの場合、従業員、顧客、供給業者間でおこなわれる、患者の処置、顧客の獲得、注文の入力、製品の出荷、顧客への請求、請求書の受領、従業員および仕入先への支払い、製品の発注、在庫監査、取引の記録の維持に用いられる。 The information may be, for example, in the case of commercial business, between the employee, customer and supplier, patient treatment, customer acquisition, order entry, product shipment, customer billing, invoice receipt, employee Used to pay suppliers, place orders for products, inventory audits, and maintain transaction records.

出来事の自然な成り行きにおいて、情報は、各組織の内部オペレーションモデルにしたがって、ソフトウェア、コンピュータハードウェア、およびデジタルネットワークを活用して獲得され、処理され、統合される。あいにく、情報の自動処理は、利用可能で、タイムリーかつコスト対効果が高いデータの統合、標準化および報告を妨げて事業を弱体化させる多くの問題をはらんでいる。 In the natural course of events, information is acquired, processed and integrated utilizing software, computer hardware and digital networks according to each organization's internal operational model. Unfortunately, automated processing of information presents a number of problems that undermine business by hindering the integration, standardization and reporting of available, timely and cost-effective data.

ある従来の手法では、統合され標準化されたデータを組織全体から収集するためのエンタープライズデータウェアハウスを構築することに注目した。典型的なエンタープライズデータウェアハウスは、多くのソースから、抽出され、変換され、第３の通常形態のオペレーショナルデータストアデータベースにロードされるオペレーショナルデータを必要とする。それは、再度抽出され、変換され、スタースノーフレーク・データボルトデータベースにロードされる。データボルトデータベースは、その後データマートにロードされ、それぞれ特定の部門または機能にあてられる。 One traditional approach focused on building an enterprise data warehouse to collect integrated and standardized data from across the organization. A typical enterprise data warehouse requires operational data extracted from many sources, converted, and loaded into a third normal form operational data store database. It is extracted again, converted, and loaded into the Star Snowflake data vault database. The data vault database is then loaded into the data mart and each assigned to a specific department or function.

エンタープライズデータウェアハウスの形成および機能プロセスにおける各データベースは、カスタム抽出変換ロード（ＥＴＬ）関数とともに設計され、維持され、ポピュレートされなければならない。さらに、組織がリポートを生成できるようになる前に、開発利用の全段階が何らかの形で完成されなければならず、かつエンタープライズデータウェアハウスから利益を得始めなければならない。 Each database in the enterprise data warehouse formation and functional process must be designed, maintained, and populated with custom extract transform load (ETL) functions. In addition, before an organization can generate reports, all phases of development and use must be completed in some way and begin to benefit from the enterprise data warehouse.

エンタープライズデータウェアハウスが、全組織のために集中管理されている標準化データを獲得するが、これはコストが非常に高くつく。金銭的コストが天文学的になりうるため、包括的なエンタープライズデータウェアハウスの実現のために要求されるリソース（開発費）は、非常に限られた小数以外の全てにとって法外に高くなりうる。金銭的なリソースが限定要因とならない場合でも、エンタープライズデータウェアハウスを構築し実現するために要する時間は、通常年単位になる。 An enterprise data warehouse acquires standardized data that is centrally managed for the entire organization, which is very expensive. Because monetary costs can be astronomical, the resources (development costs) required to implement a comprehensive enterprise data warehouse can be prohibitively high for all but a very limited number of decimals. Even when monetary resources are not a limiting factor, the time required to build and implement an enterprise data warehouse is usually on a yearly basis.

エンタープライズデータウェアハウスに起因するエンタープライズデータウェアハウス化の他の欠点は、要約された情報を強調する判断支援アプリケーションに集中する。これらのシステムの本質的な欠点は、顧客の身元に関する取引詳細が失われることである。エンタープライズデータウェアハウスは、顧客データの分析のようなアプリケーションに適用される場合に欠点をあらわにする。顧客データの分析は、データを顧客の活動、イベント、取引、状態その他に関連付ける判断を支援する分析である。要約された情報は、顧客の身元に関する詳細レベルの情報を通常失っており、これらのアプリケーションにおけるエンタープライズデータウェアハウスの有用性は限られる。 Other shortcomings of enterprise data warehousing due to enterprise data warehousing focus on decision support applications that highlight summarized information. An essential drawback of these systems is that transaction details regarding the customer's identity are lost. Enterprise data warehouses present shortcomings when applied to applications such as customer data analysis. Customer data analysis is an analysis that assists in the determination of associating data with customer activity, events, transactions, status, and the like. The summarized information usually loses a level of detail about the customer's identity, and the usefulness of the enterprise data warehouse in these applications is limited.

その他の手法には、組織のオペレーショナルデータから直接、部門別データマートを創作することに注目するものがある。部門別データマートの場合、単一の部門に関するデータを取り込むだけでよい。そのため、部門別データマートは、はるかに小さくできる。 Other approaches focus on creating a departmental data mart directly from an organization's operational data. In the case of a departmental data mart, it is only necessary to capture data relating to a single department. Therefore, the departmental data mart can be much smaller.

部門別データマートは、サイズが小さいことから、一般的に構築に要する時間および金銭の観点からのリソースも少なくて済むが、これらの利益もコストが高くつく。部門別データマートは集中管理されず、質またはデータ形式の観点から整合した標準を有さない。 Departmental data marts are small in size and generally require fewer resources in terms of time and money to build, but these benefits are also expensive. Departmental data marts are not centrally managed and do not have standards that are consistent in terms of quality or data format.

部門別データマートが創作された場合、標準が整合しないために組織全体で統合できない。また、各部門別データマートは包括的な計画がないまま創作され、各部門がデータマートを保有すべく投資されたリソースを合計すると、よく計画された単一のエンタープライズデータウェアハウスを創作するよりも実質的に高くなりうる。整合しない部門別データマートを維持するためのリソースは、単一のエンタープライズデータウェアハウスを維持するよりもはるかに高くつくこともある。 When a departmental data mart is created, it cannot be integrated across the organization due to inconsistent standards. In addition, each departmental data mart is created without a comprehensive plan, and adding up the resources that each department has invested to hold a data mart is more than creating a single, well-planned enterprise data warehouse. Can also be substantially higher. Resources for maintaining inconsistent departmental data marts can be much more expensive than maintaining a single enterprise data warehouse.

現在、利用可能で、タイムリーかつコスト対効果が高いデータの統合、標準化および報告を提供するという課題の包括的な解決策はない。この必要性は業界内で長らく感じられてきた。 There is currently no comprehensive solution to the challenge of providing available, timely and cost-effective data integration, standardization and reporting. This need has long been felt within the industry.

従来の開発は、上述の全ての限定を克服するための解決策を何ら教示も提案もしてこなかった。よって、これらの限定を克服するための解決策は長らく当業者に発見されなかった。 Prior developments have not taught or suggested any solution to overcome all the above limitations. Thus, no solution to overcome these limitations has long been discovered by those skilled in the art.

クレームされた発明は、エンティティにリンクされ、かつモジュールにおいて共にグループ分けされた特性を定義するメタデータを有するスキーマ定義言語を活用する方法、製品、およびシステムを主題とする。スキーマ定義言語は、特性に基づいて、互いにリンクを有するモジュールとエンティティとのための物理テーブルを生成するために活用される。物理テーブルは、メタデータに整合するデータでポピュレートされることができる。 The claimed invention is directed to methods, products, and systems that utilize a schema definition language that has metadata defining properties that are linked to entities and grouped together in a module. The schema definition language is utilized to generate physical tables for modules and entities that are linked to each other based on characteristics. The physical table can be populated with data that matches the metadata.

スキーマ定義言語を参照して物理テーブルの位置を特定し、かつモジュールまたはエンティティのための物理テーブルが選択された特性を含んでいるか否かを判断することができるとさらに考えられる。同じ方法で、スキーマ定義言語を参照して物理テーブルの位置を特定し、物理テーブルが選択された特性を含んでいるときのみ物理テーブルが次の選択された特性を含んでいるか否かを判断することができる。 It is further believed that the schema definition language can be referenced to locate the physical table and determine whether the physical table for the module or entity includes the selected property. In the same way, refer to the schema definition language to locate the physical table and determine whether the physical table contains the next selected characteristic only if the physical table contains the selected characteristic. be able to.

特性が選択された特性として同じモジュール内にグループ分けされているときは、スキーマ定義言語からワンホップリンケージが特定されることができるとさらに考えられる。臨床パターンがエンティティに関連付けられた特性と一致するか否かに基づいて、エンティティは、一致コホートと不一致コホートとに分類されるとさらに考えられる。 It is further considered that the one-hop linkage can be specified from the schema definition language when the properties are grouped within the same module as the selected property. It is further believed that entities are classified into matched and unmatched cohorts based on whether the clinical pattern matches the characteristics associated with the entity.

リポートデータの定義は、モジュールに関連付けられた特性をリポートデータの定義に含んでいれば、スキーマ定義言語を参照してモジュールを含めることができるとさらに考えられる。物理テーブルは、それぞれモジュールとエンティティとに関連付けられた経年的データと非経年的データとを含むことができるとさらに考えられる。 It is further considered that the report data definition can include the module with reference to the schema definition language if the report data definition includes characteristics associated with the module. It is further contemplated that the physical table can include aged data and aged data associated with modules and entities, respectively.

本発明の上記特徴、利点および目的が達成され、かつ詳細に理解されることができる方法で、上記簡潔に要約された発明を、添付図面に図示された実施の形態を参照してより詳細に説明する。 The above briefly summarized invention will now be described in more detail with reference to the embodiments illustrated in the accompanying drawings in a manner that allows the above features, advantages and objects of the invention to be achieved and understood in detail. explain.

なお、添付図面は、本発明の典型的な実施の形態のみを説明するものであり、発明の範囲を限定するものと考えられるべきものではない。本発明は同等に効果的な他の実施の形態を許容することがあるからである。以下に示す添付図面において、同様の参照符号は同種または対応する部分に言及することを意図している。 It should be noted that the accompanying drawings illustrate only typical embodiments of the present invention and should not be considered as limiting the scope of the invention. This is because the present invention may allow other embodiments that are equally effective. In the accompanying drawings shown below, like reference numerals are intended to refer to the same or corresponding parts.

図１は、本発明の実施の形態に係る、例示的な分散型コンピュータシステムを表す。FIG. 1 represents an exemplary distributed computer system according to an embodiment of the present invention. 図２は、本発明の実施の形態に係る、データ処理システムの例示的なブロック図を表す。FIG. 2 depicts an exemplary block diagram of a data processing system according to an embodiment of the present invention. 図３は、本発明の実施の形態に係る、例示的なメタデータテーブルを表す。FIG. 3 illustrates an exemplary metadata table according to an embodiment of the present invention. 図４は、本発明の実施の形態に係る、例示的なスキーマ定義モデルを表す。FIG. 4 represents an exemplary schema definition model according to an embodiment of the present invention. 図５は、本発明の実施の形態に係る、例示的なスキーマ定義モデルテーブルを表す。FIG. 5 represents an exemplary schema definition model table according to an embodiment of the present invention. 図６は、本発明の実施の形態に係る、例示的な物理的なスキーマテーブルを表す。FIG. 6 depicts an exemplary physical schema table according to an embodiment of the present invention. 図７は、本発明の実施の形態に係る、オペレーショナルデータストアを実現し、ポピュレートするための例示的な制御フローを表す。FIG. 7 depicts an exemplary control flow for implementing and populating an operational data store according to an embodiment of the present invention. 図８は、本発明の実施の形態に係る、抽出変換ロード関数を生成するための例示的な制御フローを表す。FIG. 8 depicts an exemplary control flow for generating an extracted transformation load function according to an embodiment of the present invention. 図９は、本発明の実施の形態に係る、コホートのフィルタリングのための例示的な制御フローを表す。FIG. 9 depicts an exemplary control flow for cohort filtering according to an embodiment of the present invention. 図１０は、ビジネスインテリジェンスツールにおいて実現されるフィルタのスクリーンショットを表す。FIG. 10 represents a screen shot of a filter implemented in a business intelligence tool. 図１１は、本発明の実施の形態に係る、パターンを活用したコホートのフィルタリングのための例示的な制御フローを表す。FIG. 11 illustrates an exemplary control flow for cohort filtering utilizing patterns according to an embodiment of the present invention. 図１２は、ビジネスインテリジェンスツールにおいて実現されるフィルタのスクリーンショットを表す。FIG. 12 represents a screen shot of a filter implemented in a business intelligence tool. 図１３は、本発明の実施の形態に係る、リポートを生成するための例示的な制御フローを表す。FIG. 13 depicts an exemplary control flow for generating a report, according to an embodiment of the present invention. 図１４は、ビジネスインテリジェンスツールにおいて実現されるリポートデータの定義のスクリーンショットを表す。FIG. 14 represents a screenshot of the definition of report data implemented in a business intelligence tool. 図１５は、本発明の実施の形態に係る、リポートを分析するための制御フローを表す。FIG. 15 shows a control flow for analyzing a report according to the embodiment of the present invention.

本発明の次に説明する実施の形態において、本発明の一部を構成し、かつ本発明が実施されることがある例示的な実施の形態を説明する目的で示される添付図面を参照する。本発明の範囲から逸脱することなく、他の実施の形態も利用してもよく、構造上の変更をおこなってもよいと理解されるべきである。 In the following description of the invention, reference is made to the accompanying drawings that form a part hereof, and in which are shown for the purpose of illustrating exemplary embodiments in which the invention may be practiced. It should be understood that other embodiments may be utilized and structural changes may be made without departing from the scope of the present invention.

次に述べる実施の形態は、当業者が本発明を実施し利用できる程度に十分詳細に説明される。他の実施の形態は、本開示に基づき明らかとなるであろうこと、また、本発明の範囲から逸脱することなく、システム、プロセス、機械的な変更をおこなってもよいことが理解されるべきである。 The following embodiments are described in sufficient detail to enable those skilled in the art to make and use the invention. It is to be understood that other embodiments will be apparent based on the present disclosure, and that system, process, and mechanical changes may be made without departing from the scope of the invention. It is.

以下に述べる説明において、本発明の完全な理解のために数多くの具体的な詳細が与えられているが、これらの具体的な詳細なしに本発明を実施してもよいことが明らかとなる。本発明が不明瞭になるのを避けるため、いくつかの周知の回路、システム構成、プロセスステップは詳細に開示されない。 In the following description, numerous specific details are given to provide a thorough understanding of the present invention, but it will be apparent that the invention may be practiced without these specific details. In order to avoid obscuring the present invention, some well-known circuits, system configurations, and process steps are not disclosed in detail.

また、いくつかの特徴を共通にする複数の実施の形態が開示され説明されている部分において、説明、記載、理解を明確かつ容易にするために、互いに類似または同種の特徴は、同種の参照符号付きで通常説明される。これらの実施の形態は、説明の便宜上、第１実施形態、第２実施形態等というように番号が付されているが、他の意義があり本発明を限定するものと意図していない。 In addition, in a part in which a plurality of embodiments sharing some features are disclosed and described, in order to clarify and facilitate description, description, and understanding, features similar or similar to each other are referred to by the same reference. Usually described with a sign. For convenience of explanation, these embodiments are numbered as the first embodiment, the second embodiment, etc., but have other significance and are not intended to limit the present invention.

解説的な目的で、ここで用いられる「メタデータ」という語は、データについてのデータとして定義される。ここで用いられる「システム」という語は、この語が用いられるコンテキストにしたがって、本発明の方法および装置を意味し指し示す。 For explanatory purposes, the term “metadata” as used herein is defined as data about data. As used herein, the term “system” means and refers to the method and apparatus of the present invention according to the context in which the word is used.

本発明の実施の形態は、独自のスキーマ定義言語を採用するため、かつデータベースまたはデータストアを構築し、生成し、ポピュレートする際のスキーマ定義言語を活用するための技術を提供し、スキーマ定義言語を参照して抽出−変換―ロード関数を自動的に生成し、かつデータベースまたはデータストアからデータを引き出すビジネスインテリジェンスツールを実現可能にするための技術を提供する。ここで用いられるように、ビジネスインテリジェンスツールとは一般的に、データを報告し、分析し、提示するよう構成されたソフトウェアアプリケーションを指す。このデータは、データウェアハウス、データベース、データストア、データマート、またはそれらの組み合わせにおいて格納される。 Embodiments of the present invention provide a technique for adopting a unique schema definition language and utilizing a schema definition language when constructing, generating, and populating a database or a data store. Provides a technique for enabling a business intelligence tool that automatically generates an extract-transform-load function and retrieves data from a database or data store. As used herein, a business intelligence tool generally refers to a software application configured to report, analyze and present data. This data is stored in a data warehouse, database, data store, data mart, or a combination thereof.

ある態様によれば、スキーマ定義言語は、スキーマ定義言語によって定義された特性、エンティティ、およびモジュールの関係を示すスキーマ定義モデルにおいて実現されることができ、またはスキーマ定義モデルによって提示されることができる。スキーマ定義言語は、オペレーショナルデータストアの物理テーブルのための構造およびフレームワークを定義する。スキーマ定義言語は、スキーマ定義モデルを物理データの１以上の物理エンティティにマッピングするための関係と位置とを含んでもよい。したがって、スキーマ定義言語は、物理データの特定のセットを含んでいる物理データのフィールドへのアクセスを定義し、アクセスするために用いられることができる。 According to certain aspects, the schema definition language can be implemented in or presented by a schema definition model that indicates the relationships between properties, entities, and modules defined by the schema definition language. . The schema definition language defines the structure and framework for the physical tables of the operational data store. The schema definition language may include relationships and locations for mapping the schema definition model to one or more physical entities of physical data. Thus, the schema definition language can be used to define and access access to fields of physical data that contain a specific set of physical data.

有利なことに、本発明の実施の形態は、高い抽象化レベルでスキーマ定義言語を提供し、オペレーショナルデータストアを低コストかつ短いタイムフレームで活用するためのビジネスインテリジェンスツールを実現可能にするためのスキーマ定義言語とのインターフェースとなるプラットフォームを提供するための技術を提供する。スキーマ定義言語は、基礎となるデータベースが発展したときでも、これらのツールがもはや変更される必要がないように、高い抽象化レベルで動作するビジネスインテリジェンスツールを実現可能にする。データベースは、本発明のスキーマ定義言語を活用する際、容易にかつ妨害されることなく発展する。 Advantageously, embodiments of the present invention provide a schema definition language at a high level of abstraction to enable a business intelligence tool for leveraging operational data stores at low cost and in a short time frame. Provides technology to provide a platform that interfaces with the schema definition language. The schema definition language makes it possible to implement business intelligence tools that operate at a high level of abstraction so that even when the underlying database evolves, these tools no longer need to be changed. Databases evolve easily and unimpeded when utilizing the schema definition language of the present invention.

有利なことに、物理テーブルにアクセスしてクエリを実行するためのスキーマ定義言語を活用するビジネスインテリジェンスツールは、前のステップで選択された特性のタイプに関連するオプションを提供することによって、ユーザに直感的な体験を提供する。スキーマ定義言語は、オペレーショナルデータストアをモデル化し設計するための容易かつ効果的な解決策を実現可能にする物理テーブルにシンプルな構造を与えるためにも用いられることができる。さらに、スキーマ定義言語を活用すると、スキーマ定義モデル内に含まれているスキーマ定義言語構造とメタデータとに基づいてオペレーショナルデータストアに素早くポピュレートするための関数を自動的に抽出し、変換し、ロードすることが可能になる。 Advantageously, business intelligence tools that take advantage of the schema definition language to access physical tables and execute queries provide the user with options related to the type of characteristic selected in the previous step. Provide an intuitive experience. The schema definition language can also be used to give a simple structure to a physical table that enables an easy and effective solution for modeling and designing an operational data store. In addition, leveraging the schema definition language automatically extracts, transforms, and loads functions to quickly populate the operational data store based on the schema definition language structure and metadata contained within the schema definition model. It becomes possible to do.

以下において、本発明の実施の形態および具体的な例を参照するが、本発明は具体的に説明される実施の形態または実施例に限定されないと解釈されるべきである。代わりに、異なる実施の形態に関連するか否かに関わらず、以下の特徴および要素のいかなる組み合わせも、本発明を実現し実施するために考慮される。さらに、本発明の実施の形態は、他の可能な解決策を超え、かつ／または先行技術を超える有利な効果を達成することがあるが、ある実施の形態によって特定の効果が達成されるか否かは本発明を限定するものではない。よって、以下の態様、特徴、実施の形態、および有利な効果は、説明目的に過ぎず、添付された請求項において明示的に記載されている場合を除き、請求項の特徴または限定とは解釈されない。同様に、「本発明」への言及は、ここに開示されたいずれかの発明の主題を一般化するものと解釈されてはならず、添付された請求項において明示的に記載されている場合を除き、請求項の特徴または限定と解釈されてはならない。 In the following, reference is made to embodiments and specific examples of the present invention, but the present invention should not be construed as being limited to the specifically described embodiments or examples. Instead, any combination of the following features and elements, whether related to different embodiments or not, is contemplated for implementing and implementing the present invention. Furthermore, although embodiments of the present invention may achieve advantageous effects over other possible solutions and / or over the prior art, are certain embodiments achieved with certain embodiments? It does not limit the present invention. Thus, the following aspects, features, embodiments, and advantages are for illustrative purposes only and are not to be construed as the features or limitations of the claims, except as explicitly recited in the appended claims. Not. Similarly, references to “the present invention” should not be construed as generalizing any subject matter disclosed herein, but are explicitly recited in the appended claims. Should not be construed as the features or limitations of the claims.

添付された請求項のいずれかが純粋なソフトウェアおよび／またはファームウェアによる実現を権利範囲に含むように読めるときは、これによって少なくとも一つの実施例の少なくとも一つの要素は、当該ソフトウェアおよび／またはファームウェアを格納した有形のコンピュータで読み取り可能なメディアを含むことが明示的に定義される。 When any of the appended claims can be read to include a pure software and / or firmware implementation, this allows at least one element of at least one embodiment to include the software and / or firmware. It is explicitly defined to include stored tangible computer readable media.

本発明のある実施の形態は、コンピュータシステムで用いられるプログラム製品として実現される。プログラム製品のプログラムは、実施の形態の機能（そこに記載された方法を含む）を定義し、さまざまなコンピュータで読み取り可能なストレージメディアに保存可能である。ここに定義するコンピュータで読み取り可能なストレージメディアは、製品である。説明目的であり限定されないコンピュータで読み取り可能なストレージメディアには、次にあげるものが含まれる。すなわち、（ｉ）情報が永久に格納される書き込み不可のストレージメディア（例えば、ＣＤ−ＲＯＭドライブで読み取り可能なＣＤ−ＲＯＭディスクのような、コンピュータ内の読み取り専用メモリデバイス）や、（ｉｉ）書き換え可能な情報が格納される書き込み可能なストレージメディア（例えば、ディスケットドライブまたはハードディスクドライブ内のフレキシブルディスク）である。そのようなコンピュータで読み取り可能なストレージメディアは、本発明の機能を指令するコンピュータで読み取り可能な命令を備えるときは、本発明の実施の形態である。他のメディアには、無線通信ネットワークを含むコンピュータまたは電話ネットワークなどの、情報をコンピュータに伝達する通信メディアが含まれる。後者の実施の形態は、特に、情報をインターネットおよび他のネットワークへ／から送受信することを含む。そのような通信メディアは、本発明の機能を指令するコンピュータで読み取り可能な命令を備えるときは、本発明の実施の形態である。広義には、コンピュータで読み取り可能なストレージメディアおよび通信メディアは、ここではコンピュータで読み取り可能なメディアと言及されることがある。 An embodiment of the present invention is realized as a program product used in a computer system. Programs of the program product define the functions of the embodiments (including the methods described therein) and can be stored on various computer readable storage media. A computer-readable storage medium as defined herein is a product. Computer readable storage media for illustrative purposes and not limited include the following. (I) a non-writable storage medium in which information is permanently stored (for example, a read-only memory device in a computer such as a CD-ROM disc readable by a CD-ROM drive), or (ii) rewriting A writable storage medium (eg, a flexible disk in a diskette drive or hard disk drive) where possible information is stored. Such a computer readable storage medium is an embodiment of the present invention when it includes computer readable instructions that direct the functions of the present invention. Other media include communication media that conveys information to the computer, such as a computer including a wireless communication network or a telephone network. The latter embodiment specifically includes sending and receiving information to / from the Internet and other networks. Such a communication medium is an embodiment of the present invention when it comprises computer readable instructions that direct the functions of the present invention. In a broad sense, computer readable storage media and communication media may be referred to herein as computer readable media.

一般的に、本発明の実施の形態を実現するために実行されるルーティンは、オペレーションシステムまたは特定のアプリケーション、部品、プログラム、モジュール、オブジェクト、あるいは命令シーケンスの一部であってもよい。本発明のコンピュータプログラムは、典型的には、ネィティブコンピュータによって機械で読み取り可能であり実行可能な命令に翻訳されることとなる多数の命令からなる。また、プログラムは、プログラムにローカルに存在するかメモリまたはストレージメディアの中で発見される変数およびデータ構造からなる。また、以下説明されるさまざまなプログラムは、本発明の特定の実施の形態においてプログラムが実現されるためのアプリケーションに基づいて特定されてもよい。しかしながら、後に続く特定のプログラムのいかなる名称も便宜上用いられるものに過ぎないと解釈されるべきである。よって、本発明はそのような名称によって特定され、かつ／または暗示される特定のアプリケーションにおいてのみ用いられると限定されるべきではない。 In general, a routine executed to implement an embodiment of the present invention may be part of an operating system or a specific application, part, program, module, object, or instruction sequence. The computer program of the present invention typically consists of a number of instructions that will be translated into executable instructions that can be read and executed by a native computer. A program also consists of variables and data structures that are local to the program or found in memory or storage media. Various programs described below may be specified based on an application for realizing the program in the specific embodiment of the present invention. However, any name of the particular program that follows should be construed as merely being used for convenience. Thus, the present invention should not be limited to be used only in the specific applications identified and / or implied by such names.

ここで図１を参照すると、そこには、本発明の実施の形態に係る、例示的な分散型コンピュータシステム１００が示されている。一般的に、分散型コンピュータシステム１００は、分散された環境として示され、コンピュータシステム１０２と複数のネットワーク接続された装置１０４とを含む。コンピュータシステム１０２は、次に述べるいかなるタイプのコンピュータ、コンピュータシステム、または他のプログラム可能な電子機器を示してもよい。すなわち、本発明の方法、装置、および製品に対応するよう適応させたクライアントコンピュータ、サーバーコンピュータ、ポータブルコンピュータ、組み込みコントローラ、ＰＣベースのサーバー、ミニコンピュータ、ミッドレンジコンピュータ、メインフレームコンピュータ、および他のコンピュータなどである。 Referring now to FIG. 1, there is shown an exemplary distributed computer system 100 according to an embodiment of the present invention. In general, the distributed computer system 100 is shown as a distributed environment and includes a computer system 102 and a plurality of networked devices 104. Computer system 102 may represent any type of computer, computer system, or other programmable electronic device described below. That is, client computers, server computers, portable computers, embedded controllers, PC-based servers, minicomputers, midrange computers, mainframe computers, and other computers adapted to accommodate the methods, apparatus, and products of the present invention Etc.

説明目的で、コンピュータシステム１０２は、ネットワーク接続されたシステムからなる。しかしながら、コンピュータシステム１０２は、独立した装置からなってもよい。いずれにせよ、図１はコンピュータシステム１００の一構成に過ぎないと理解される。本発明の実施の形態は、匹敵するいかなる構成にも適用することができる。コンピュータシステム１０２が複雑な複数ユーザ用装置、単一ユーザ用ワークステーション、または自身の不揮発性ストレージを有さないネットワーク機器であるか否かを問わない。 For illustrative purposes, computer system 102 comprises a networked system. However, the computer system 102 may consist of independent devices. In any case, it is understood that FIG. 1 is only one configuration of computer system 100. Embodiments of the present invention can be applied to any comparable configuration. It does not matter whether the computer system 102 is a complex multi-user device, a single-user workstation, or a network device that does not have its own non-volatile storage.

本発明の実施の形態は、通信ネットワークを通してリンクされたリモート処理装置によってタスクが実施される分散型コンピューティング環境において実施されてもよい。分散型コンピューティング環境において、プログラムモジュールは、ローカルおよびリモート両方のメモリストレージデバイスに置かれてもよい。これに関し、コンピュータシステム１０２および／または１以上のネットワーク接続された装置１０４は、ほとんどまたはまったく処理をおこなわないシンクライアントであってもよい。 Embodiments of the invention may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote memory storage devices. In this regard, the computer system 102 and / or one or more networked devices 104 may be thin clients that perform little or no processing.

コンピュータシステム１０２は、次にあげるものに示されるような数多くのオペレータや周辺システムを含みうる。例えば、直接アクセスストレージデバイス１０８に操作可能に接続された大型ストレージインターフェース１０６、ディスプレイ１１２に操作可能に接続されたビデオインターフェース１１０、および、複数のネットワーク接続された装置１０４に操作可能に接続されたネットワークインターフェース１１４である。ディスプレイ１１２は、視ることが可能な情報を出力するためのビデオ出力装置であればよい。 Computer system 102 may include a number of operators and peripheral systems such as those listed below. For example, a large storage interface 106 operably connected to a direct access storage device 108, a video interface 110 operably connected to a display 112, and a network interface operably connected to a plurality of networked devices 104 114. The display 112 may be a video output device for outputting information that can be viewed.

コンピュータシステム１０２は、メインメモリ１２０からバス１１８を経由して命令およびデータを取得する少なくとも１つのプロセッサ１１６からなるものとして示される。プロセッサ１１６は、本発明の方法に対応するよう適応させた何らかのプロセッサでありえる。 Computer system 102 is shown as comprising at least one processor 116 that obtains instructions and data from main memory 120 via bus 118. The processor 116 can be any processor adapted to accommodate the method of the present invention.

メインメモリ１２０は、必要なプログラムおよびデータ構造を保持するには十分な大きさの何らかのメモリである。メインメモリ１２０は、ランダムアクセスメモリや不揮発性またはバックアップメモリ（例えばプログラム可能なまたはフラッシュメモリや読み取り専用メモリなど）を含む単独のメモリデバイス、またはこれらのメモリデバイスの組み合わせでありうる。また、メモリ１２０は、コンピュータシステム１０２内のどこかに物理的に位置するメモリを含んでいると考えてもよい。例えば、バーチャルメモリ、大型ストレージデバイス（例えば、直接アクセスストレージデバイス１０８）、またはコンピュータシステム１０２にバス１１８経由で接続された他のコンピュータとして用いられる記憶装置であればよい。 Main memory 120 is any memory large enough to hold the necessary programs and data structures. Main memory 120 may be a single memory device, including random access memory, non-volatile or backup memory (eg, programmable or flash memory, read-only memory, etc.), or a combination of these memory devices. Memory 120 may also be considered to include memory that is physically located somewhere within computer system 102. For example, any storage device may be used as a virtual memory, a large storage device (eg, direct access storage device 108), or other computer connected to the computer system 102 via the bus 118.

メモリ１２０は、オペレーティングシステム１２２と構成されるものとして示される。オペレーティングシステム１２２は、コンピュータシステム１０２のオペレーションを管理するために用いられるソフトウェアである。 Memory 120 is shown as configured with operating system 122. The operating system 122 is software that is used to manage the operation of the computer system 102.

メモリ１２０はさらに、アクセス層１２４と、スキーマ定義言語１２６と、フィルタ１２８と、リポートデータの定義１３０と、１以上のアプリケーション１３２と、複数のビジネスインテリジェンスツール１３４とをさらに備える。アプリケーション１３２、ビジネスインテリジェンスツール１３４、およびアクセス層１２４は、コンピュータシステム１０２におけるさまざまなメモリおよびストレージ装置においてさまざまなときに存在する複数の命令からなるソフトウェア製品である。コンピュータシステム１０２において１以上のプロセッサ１１６によって読み取られ実行されるとき、アプリケーション１３２、アクセス層１２４、およびスキーマ定義言語１２６は、個別にまたは組み合わせとして、コンピュータシステム１０２に本発明のさまざまな態様を実行するために必要なステップを実施させる。 The memory 120 further includes an access layer 124, a schema definition language 126, a filter 128, a report data definition 130, one or more applications 132, and a plurality of business intelligence tools 134. Application 132, business intelligence tool 134, and access layer 124 are software products comprised of multiple instructions that exist at various times in various memory and storage devices in computer system 102. When read and executed by one or more processors 116 at computer system 102, application 132, access layer 124, and schema definition language 126 perform various aspects of the invention on computer system 102, either individually or in combination. In order to carry out the necessary steps.

アクセス層１２４（より一般的には、オペレーティングシステム１２２を含む要求するエンティティ）は、データベース１３６にクエリを発行するよう構成される。説明目的で、データベース１３６は、ストレージ１０８におけるデータベース管理システム（ＤＢＭＳ）の一部として示される。簡単のためひとつのデータベースのみ示されているが、ＤＢＭＳ１３８は複数のデータベースを含んでもよい。 Access layer 124 (more generally, a requesting entity that includes operating system 122) is configured to issue a query to database 136. For illustrative purposes, the database 136 is shown as part of a database management system (DBMS) in the storage 108. Although only one database is shown for simplicity, the DBMS 138 may include multiple databases.

さらに、これらのデータベースは互いに分散されてもよい。さらに、１以上のデータベースは、１以上のネットワーク接続された装置１０４に分散されることができる。説明目的で、ネットワーク接続されたシステム１４０は、データベース１４４を含むＤＢＭＳ１４２を備えるものとして示される。簡単のためひとつのデータベース１４４のみ示されるが、ＤＢＭＳ１４２は複数のデータベースを含んでもよい。さらに、これらのデータベース１４２は互いに分散されてもよい。このような異なる実施態様の全てが広く考慮される。ストレージ１０８またはネットワーク接続された装置１０４は、フィルタを構成するため、ユーザにフィードバックするため、およびビジネスインテリジェンスツール１３４のためのリポートデータの定義を構成するためのアプリケーション１３２によって用いられる、メタデータテーブル、物理テーブル、スキーマ定義言語１２６、またはスキーマ定義モデルをも含んでもよい。 Furthermore, these databases may be distributed with each other. Further, one or more databases can be distributed across one or more networked devices 104. For illustrative purposes, the networked system 140 is shown as comprising a DBMS 142 that includes a database 144. Although only one database 144 is shown for simplicity, the DBMS 142 may include multiple databases. Further, these databases 142 may be distributed with each other. All such different embodiments are widely considered. A storage table or networked device 104 is a metadata table used by the application 132 to configure filters, feed back to users, and configure report data definitions for business intelligence tools 134. A physical table, schema definition language 126, or schema definition model may also be included.

データベース１３６および１４４は、データの特定の物理表現に関わらずデータの集合の代表的なものである。データの物理的な表現は、当該データの構造上のスキーマを定義する。 Databases 136 and 144 are representative of a collection of data regardless of the specific physical representation of the data. The physical representation of data defines the structural schema of the data.

ある実施の形態において、データベース１３６はオペレーショナルデータベースを含んでおり、データベース１４４はオペレーショナルデータストアを含んでいる。オペレーショナルデータベースは、データストア内に含まれる物理データの少なくとも一部を含んでいる。ある態様によれば、データストアは、オペレーショナルデータベースにおける物理データから導出されるクエリを実行可能なデータを含んでいる。したがって、データストアにおけるクエリを実行可能なデータは、オペレーショナルデータベースにおける物理データのサブセットを含んでいる。オペレーショナルデータベースからのデータのサブセットに加え、データストアは他のデータを含んでもよい。 In one embodiment, database 136 includes an operational database and database 144 includes an operational data store. The operational database includes at least a portion of the physical data contained within the data store. According to an aspect, the data store includes data that can execute queries derived from physical data in the operational database. Thus, the data that can be queried in the data store includes a subset of the physical data in the operational database. In addition to the subset of data from the operational database, the data store may contain other data.

ある実施の形態において、クエリは、入力（例えばユーザ入力）に応答して生成されてもよい。クエリは、スキーマ定義言語１２６に定義された論理フィールドを用いて構成されることができる。クエリは、データベース１４４内の物理データによって表される図４のエンティティ４０４のクラスを分離するか定義するためのスキーマ定義言語１２６を参照することができるフィルタ１２８を用いて、データベース１４４に対して実行される。フィルタ１２８のオペレーションは、図９〜１２を参照して以下に詳細に説明される。 In certain embodiments, the query may be generated in response to input (eg, user input). The query can be constructed using logical fields defined in the schema definition language 126. The query is performed against the database 144 using a filter 128 that can reference the schema definition language 126 for isolating or defining the class of the entity 404 of FIG. 4 represented by the physical data in the database 144. Is done. The operation of filter 128 is described in detail below with reference to FIGS.

クエリは、図４のスキーマ定義言語、エンティティ４０４の構造リポート、モジュール４０６、および特性の観察（オブザベーション）４０８を参照可能なリポートデータの定義１３０を用いてデータベース１４４に対しても実行され、データベース１４４内の物理テーブルからデータを引き出す。リポートデータの定義１３０のオペレーションは、図１３〜１５を参照して以下に詳細に説明される。 The query is also executed against the database 144 using the schema definition language of FIG. 4, the structure report of the entity 404, the module 406, and the report data definition 130 that can reference the observations 408 of the characteristics. Data is extracted from the physical table in 144. The operation of the report data definition 130 is described in detail below with reference to FIGS.

ここで図２を参照すると、そこには、本発明の実施の形態に係る、データ処理システム２００の例示的なブロック図が示されている。データ処理システム２００はデータ入力ブロック２０２を含み、データ入力ブロック２０２は１以上の外部ソース２０４からデータ２０３を抽出して接続しているオペレーショナルデータストア２０６にロードするためのものである。例えば、外部ソース２０４は、図１のストレージデバイス１０８のデータベース１３６でありえる。オペレーショナルデータストア２０６は、図１のネットワーク接続された装置１０４のデータベース１４４内に含まれうる。 Referring now to FIG. 2, there is shown an exemplary block diagram of a data processing system 200 according to an embodiment of the present invention. The data processing system 200 includes a data input block 202 for extracting data 203 from one or more external sources 204 and loading it into a connected operational data store 206. For example, the external source 204 can be the database 136 of the storage device 108 of FIG. The operational data store 206 may be included in the database 144 of the networked device 104 of FIG.

管理エンジン２０８は、オペレーショナルデータストア２０６におけるデータ２０３の少なくとも一部をマッピングし位置を特定するよう構成され、後に続くアロケーション処理、マッピング、フィルタリング、リポーティングなどのためにスキーマ定義言語１２６に基づいてデータ２０３の位置と関係とを計算する。 The management engine 208 is configured to map and locate at least a portion of the data 203 in the operational data store 206 and based on the schema definition language 126 for subsequent allocation processing, mapping, filtering, reporting, etc. Calculate the position and relationship.

データ処理システム２００は、ユーザにデータ２０３を伝達するために、情報伝達ブロック２１０をも備え、オペレーショナルデータストア２０６および管理エンジン２０８に接続されている。伝達機能は、ビジネスインテリジェンスツール１３４によって実施されることができる。ビジネスインテリジェンスツール１３４は、図９および１１においてそれぞれ説明されたトランスメッドシステム・コホート・エクスプローラまたはトランスメッドシステム・臨床パターンマッチャーのようなフィルタリングツールでありえる。ビジネスインテリジェンスツール１３４は、図１３および１５においてそれぞれ説明されたトランスメッドシステム・コホート・リポーターまたはトランスメッドシステム・コホート・アナライザーのようなリポーティングツールでありえる。 The data processing system 200 also includes an information transmission block 210 for communicating data 203 to a user and is connected to an operational data store 206 and a management engine 208. The communication function can be performed by the business intelligence tool 134. The business intelligence tool 134 can be a filtering tool such as the TransMed System Cohort Explorer or TransMed System Clinical Pattern Matcher described in FIGS. 9 and 11, respectively. The business intelligence tool 134 can be a reporting tool such as the TransMed System Cohort Reporter or TransMed System Cohort Analyzer described in FIGS. 13 and 15, respectively.

また、データ処理システム２００の情報伝達ブロック２１０は、ユーザから実行時パラメータを受信するためのユーザインターフェース２１２と、ユーザインターフェース２１２からの入力に応答して管理エンジン２０８を起動するためのタスクコントローラ２１４とを備える。ユーザインターフェース２１２は、ユーザとの双方向通信のためのハードウェアおよびソフトウェアを備えうる。タスクコントローラ２１４は、ユーザインターフェース２１２を介したデータ２０３のグルーピング、フィルタリング、リターンニングのためのパラメータと命令とをユーザが提出できるようにする管理エンジン２０８とも通信する。 The information transmission block 210 of the data processing system 200 includes a user interface 212 for receiving runtime parameters from the user, and a task controller 214 for starting the management engine 208 in response to an input from the user interface 212. Is provided. User interface 212 may comprise hardware and software for two-way communication with a user. The task controller 214 also communicates with a management engine 208 that allows the user to submit parameters and instructions for grouping, filtering, and returning data 203 via the user interface 212.

データ入力ブロック２０２は、１以上の外部ソース２０４からデータ２０３を抽出するための抽出−変換−ロード（ＥＴＬ）ツール２１６を用い、その後抽出されたデータ２０３を少なくともひとつの望ましいデータ形式に変換する。変換されたデータ２０３は、その後データの格納のためにオペレーショナルデータストア２０６において１以上の物理テーブル２１８へロードされることができる。 Data input block 202 uses an extract-transform-load (ETL) tool 216 to extract data 203 from one or more external sources 204 and then converts the extracted data 203 into at least one desired data format. The converted data 203 can then be loaded into one or more physical tables 218 in the operational data store 206 for data storage.

重要なことに、管理エンジン２０８は、ビジネスインテリジェンスツール１３４および物理テーブル２１８のためのフレームワークとしてスキーマ定義言語１２６を提供することができる。管理エンジン２０８は、スキーマ定義言語１２６を活用して外部ソース２０４からデータ２０３内の物理テーブル２１８をポピュレートするためのＥＴＬツール２１６を自動的に生成することができる。 Significantly, the management engine 208 can provide the schema definition language 126 as a framework for the business intelligence tool 134 and the physical table 218. The management engine 208 can automatically generate the ETL tool 216 for populating the physical table 218 in the data 203 from the external source 204 utilizing the schema definition language 126.

ＥＴＬツール２１６を自動化し物理テーブル２１８をポピュレートするオペレーションについては、図７および８に関して以下に説明する。管理エンジン２０８はさらに、ビジネスインテリジェンスツール１３４がリポートとフィルタ検索とを構造化するために動作可能となるフレームワークとして、スキーマ定義言語１２６を提供する。 Operations that automate the ETL tool 216 and populate the physical table 218 are described below with respect to FIGS. The management engine 208 further provides the schema definition language 126 as a framework that enables the business intelligence tool 134 to operate to structure the reports and filter searches.

スキーマ定義言語１２６は、エンティティタイプ２２０と、特性２２２と、モジュールタイプ２２４とを含む。特性２２２は、リンク２２６を有するエンティティタイプ２２０にリンクされる。特性２２２は、特性２２２が共に観察されるか否かに基づいて、モジュールタイプ２２４において共にグループ分けされることができる。 The schema definition language 126 includes an entity type 220, a characteristic 222, and a module type 224. A characteristic 222 is linked to an entity type 220 having a link 226. Properties 222 can be grouped together in module type 224 based on whether properties 222 are observed together.

エンティティタイプ２２０、特性２２２、およびモジュールタイプ２２４は、オペレーショナルデータストア２０６において物理テーブル２１８の構造と関係とを提供する図４に関して以下のように説明されるスキーマ定義モデル４００の構造とフォーメーションを提供するスキーマ定義言語１２６のメタデータ構造でありえる。スキーマ定義言語１２６はさらに、データ２０３がどのようにして物理テーブル２１８にポピュレートすることとなるかについてのフレームワークをさらに提供する。 Entity type 220, property 222, and module type 224 provide the structure and formation of the schema definition model 400 described below with respect to FIG. 4 that provides the structure and relationships of the physical table 218 in the operational data store 206. It can be a metadata structure of the schema definition language 126. The schema definition language 126 further provides a framework for how the data 203 will be populated into the physical table 218.

説明目的と理解を容易にするために、エンティティタイプ２２０、特性２２２、およびモジュールタイプ２２４について、ヘルスケア分野を参照して以下説明する。当業者であれば、説明目的のためにそのようにされ、本発明を限定することを意図しないことを認識するであろう。 For ease of explanation and understanding, entity types 220, properties 222, and module types 224 are described below with reference to the healthcare field. Those skilled in the art will recognize that this is done for illustrative purposes and is not intended to limit the invention.

エンティティタイプ２２０は物理オブジェクトのタイプを定義することができると考えられる。図４の実際のエンティティ４０４のエンティティタイプ２２０は、データ構造であるアブストラクションおよび図３のメタデータ３０２である。実際のエンティティ４０４は、図４の特性の観察４０８とライフタイムを有するエンティティタイプ２２０の具体的な例である。ここで用いられるライフタイムは、妥当なオブザーバがエンティティタイプ２２０の一つとして物理オブジェクトを認識するであろう状態において物理オブジェクトが存在するタイムスパンとして定義される。ここで特性の観察４０８は、エンティティ４０４について観察可能な事実として定義される。 It is contemplated that the entity type 220 can define the type of physical object. The entity type 220 of the actual entity 404 in FIG. 4 is an abstract that is a data structure and the metadata 302 in FIG. The actual entity 404 is a specific example of the entity type 220 having the property observation 408 and lifetime of FIG. The lifetime used here is defined as the time span in which a physical object exists in a state where a valid observer will recognize the physical object as one of the entity types 220. Here, property observation 408 is defined as an observable fact for entity 404.

ヘルスケア分野に関連して、エンティティタイプ２２０は、患者、外科医、または施設として例示されることができる。エンティティ４０４は、特定の病院、患者、外科医、または支払い者として例示されることができる。つまり、エンティティ４０４は、ジョーンズ医師やサバーバン病院である。 In the context of the healthcare field, entity type 220 can be exemplified as a patient, a surgeon, or a facility. Entity 404 can be exemplified as a particular hospital, patient, surgeon, or payer. In other words, the entity 404 is Dr. Jones or Suburban Hospital.

モジュールタイプ２２４は、特性の観察４０８の観察結果となる特定のレコードやモーメントを定義することができると考えられる。モジュールタイプ２２４は、データ構造である、図４の実際のモジュール４０６のアブストラクションおよびメタデータ３０２である。実際のモジュール４０６は、モジュールタイプ２２４の具体的な例である。モジュール４０６は、通常共に観察されるエンティティ４０４のために観察される特性の観察４０８をリスト化することができる。ここで用いられるように、モジュール４０６は、しばしば共に観察されるエンティティ４０４の関連した特性の観察４０８の集合として定義される。モジュールタイプ２２４は、特性２２２の観察の開始および終了日程を含むことができる。モジュールタイプ２２４は、モジュールタイプ２２４と関連付けられた特性２２２をリスト化し、モジュールタイプ２２４の開始および終了日程を特定する。モジュールタイプ２２４は、期間のないイベントの開始および終了タイムスタンプと同じ単一の日または時間のスタンプのみを含むことができるとさらに考えられる。 The module type 224 is considered to be able to define a specific record or moment that is the observation result of the characteristic observation 408. Module type 224 is the abstract and metadata 302 of the actual module 406 of FIG. The actual module 406 is a specific example of the module type 224. Module 406 may list observations 408 of characteristics observed for entities 404 that are normally observed together. As used herein, module 406 is defined as a collection of observations 408 of related properties of entities 404 that are often observed together. Module type 224 may include start and end dates for observation of characteristic 222. Module type 224 lists characteristics 222 associated with module type 224 and identifies the start and end dates for module type 224. It is further contemplated that module type 224 may only include a single date or time stamp that is the same as the start and end time stamps of the event without a duration.

モジュールタイプ２２４は、入院（入院日、退院日）、診断（診断日）、処置（処置開始日および終了日）、病理検査（病理検査日）として例示されることができる。モジュール４０６それ自身は、特定の入院、診断、処置、および病理検査として例示されることができる。つまり、モジュール４０６は、２０１２年２月５日にサバーバン病院の緊急処置室に受け入れられたマイク・ジョンソンでありえ、マイク・ジョンソンは２０１２年２月５日に肺炎と診断され、ジョーンズ医師は、マイク・ジョンソンの肺炎を処置するために２０１２年２月５日にマイク・ジョンソンにベンジルペニシリンを投与し、またはジョーンズ医師は、２０１２年２月５日にマイク・ジョンソンのために分類なしの全血球計算を命じる。 The module type 224 can be exemplified as hospitalization (hospital date, discharge date), diagnosis (diagnosis date), treatment (treatment start date and end date), and pathological examination (pathological examination date). Module 406 itself can be illustrated as a specific hospitalization, diagnosis, treatment, and pathology examination. That is, module 406 may be Mike Johnson, who was accepted into the Suburb Hospital emergency room on February 5, 2012, Mike Johnson was diagnosed with pneumonia on February 5, 2012, and Dr. Jones -Benzylpenicillin was administered to Mike Johnson on Feb. 5, 2012 to treat Johnson's pneumonia, or Dr. Jones did a complete blood count for Mike Johnson on Feb. 5, 2012 Order.

特性２２２は、エンティティ４０４のため、またはエンティティ４０４の単一の観察可能なプロパティまたは特性の観察４０８を定義することができると考えられる。特性２２２は、名称を有し、整数、実際、日時、テキスト、選択、モジュールポインタまたはエンティティポインタのような、図４の特定のデータタイプ４１０を指定する。特性２２２は、観察された特性の観察４０８のデータ構造であるアブストラクションおよびメタデータ３０２である。特性の観察４０８は、エンティティ４０４についての事実の観察の具体的な例である。特性の観察４０８は、特性２２２のデータタイプ４１０に一致する。 It is contemplated that characteristic 222 may define a single observable property or characteristic observation 408 for entity 404 or entity 404. The characteristic 222 has a name and specifies the particular data type 410 of FIG. 4, such as integer, actual, date / time, text, selection, module pointer or entity pointer. Property 222 is abstraction and metadata 302 which is the data structure of observed property observation 408. Property observation 408 is a specific example of fact observation about entity 404. Characteristic observation 408 is consistent with data type 410 of characteristic 222.

特性２２２は、時間のデータタイプ４１０付きの診断日、選択のデータタイプ４１０付きのＩＣＤ９選択、テキストのデータタイプ４１０付きの外科医ノート、または選択のデータタイプ４１０付きの重症度選択としての診断モジュールタイプのために例示されることができる。特性２２２はさらに、入院モジュールのタイプとして、日時のデータタイプ４１０付きの入院日、日時のデータタイプ４１０付きの退院日、選択のデータタイプ４１０付きの主たる支払い者、または選択のデータタイプ４１０付きの退院措置として例示されることができる。 The characteristic 222 is a diagnostic module type as a diagnosis date with a time data type 410, an ICD9 selection with a selection data type 410, a surgeon note with a text data type 410, or a severity selection with a selection data type 410. Can be exemplified for. Characteristic 222 further includes the type of hospitalization module as hospitalization date with date / time data type 410, discharge date with date / time data type 410, primary payer with selection data type 410, or with selection data type 410. It can be illustrated as a discharge measure.

特性の観察４０８は、診断日：２０１２年２月５日午後１時３０分、ＩＣＤ９：「４８０．１」、または外科医ノート「患者は呼吸困難」として診断モジュールのために例示されることができる。特性の観察４０８は、入院モジュールとして、診断日：２０１２年２月５日午前１１時３０分、退院日：２０１２年２月７日午後１時３０分、主たる支払い者「Ａｅｔｎａ」、退院措置：「自宅」として例示されることができる。 Characteristic observation 408 can be illustrated for the diagnostic module as diagnosis date: February 5, 2012, 1:30 pm, ICD9: “480.1”, or surgeon note “Patients have difficulty breathing” . Characteristic observation 408 includes, as an inpatient module, diagnosis date: February 5, 2012, 11:30 am, discharge date: February 7, 2012, 1:30 pm, primary payer “Aetna”, discharge measures: It can be exemplified as “home”.

特性２２２は、経年的および非経年的な特性の観察４０８を含むことができる。経年的な特性の観察４０８は、ここで用いられるように、エンティティ４０４の生涯に生じるものとして扱われるイベントに対応する特性の観察４０８を意味するよう定義される。経年的な特性の観察４０８は、通常、それらを所有するために観察されるエンティティ４０４の生涯よりも短い期間を有する。非経年的な特性の観察４０８は、ここで用いられるように、氏名、血液型、患者ＩＤのようなエンティティ４０４の生涯継続するものとして取り扱われる特性の観察４０８を意味するよう定義される。非経年的な特性は、変化することもあるが、特定のイベントに必ずしも関連付けられない。エンティティタイプ２２０は、エンティティ４０４の非経年的な特性を獲得するために、モジュールタイプ２２４のうちの特別なひとつを付与されることができる。一方、経年的な特性は、エンティティ４０４の発生日を付与されることとなり、標準モジュールタイプ２２４と共にグループ分けされることとなる。 Characteristics 222 can include observations 408 of aged and non-aged characteristics. Aged characteristic observation 408 is defined to mean characteristic observation 408 corresponding to an event that is treated as occurring during the lifetime of entity 404, as used herein. Aging trait observations 408 typically have a shorter duration than the lifetime of the entity 404 observed to own them. Non-aged property observations 408 are defined to mean property observations 408 that are treated as life-long entities 404 such as name, blood type, patient ID, as used herein. Non-aged characteristics may change but are not necessarily associated with a particular event. The entity type 220 can be given a special one of the module types 224 to acquire the non-aged characteristics of the entity 404. On the other hand, the aging characteristics are given the date of occurrence of the entity 404 and are grouped together with the standard module type 224.

メタデータ３０２として実現され、図３のメタデータテーブル３００によって表されるスキーマ定義言語１２６を活用することにより、モジュール４０６によってグループ分けされる特性の観察にリンクされたエンティティ４０４を意味論的に表すこと、それにより物理テーブル２１８のためのシンプルかつ効果的なフレームワークを提供することを発見した。このフレームワークは、メタデータテーブル３００のスキーマ定義言語１２６と結び付けられるものであるが、メタデータテーブル３００を参照することによって物理データテーブル２１８を素早く、シンプルに、かつ効果的に利用可能にする。物理データテーブル２１８は、メタデータテーブル３００においてスキーマ定義言語１２６を参照することによりリポートを生成するためにフィルタリングされ用いられることができる。 Semantic representation of entities 404 linked to the observation of properties grouped by module 406 by leveraging schema definition language 126 implemented as metadata 302 and represented by metadata table 300 of FIG. And thereby found to provide a simple and effective framework for the physical table 218. This framework is tied to the schema definition language 126 of the metadata table 300, but by referring to the metadata table 300, the physical data table 218 can be used quickly, simply and effectively. The physical data table 218 can be filtered and used to generate a report by referring to the schema definition language 126 in the metadata table 300.

メタデータテーブル３００においてメタデータ３０２として獲得されたスキーマ定義言語１２６は、実施コストとタイムラインを同時に削減しながら直感的な方法でリポートを生成するために、物理データテーブル２１８がフィルタリングされ用いられることを可能にする。実施タイムラインは、スキーマ定義言語１２６を活用することによって大幅に削減されることができる。なぜなら、典型的なデータウェアハウスにおいて、データ構造は正規化されて次元テーブルに置かれる必要があるからである。しかしながら、スキーマ定義言語１２６を活用することによって、データ２０３はメタデータ構造およびデータ構造とシンプルに関連付けられることができる。一度関連付けられれば、ＥＴＬツール２１６は自動的に生成されることができ、物理データテーブル２１８はポピュレートされることができる。オペレーショナルデータストア２０６の物理データテーブル２１８がポピュレートされ次第、ビジネスインテリジェンスツール１３４は、追加的に複数の正規化ステップを要することなく、また、各ステップ間で抽出、変換およびロード動作を有する次元テーブルをポピュレートすることなく、スキーマ定義言語１２６を活用してオペレーショナルデータストア２０６におけるデータをフィルタリングし、リポートし、クエリを実行することができる。 The schema definition language 126 acquired as metadata 302 in the metadata table 300 is used by filtering the physical data table 218 in order to generate reports in an intuitive manner while simultaneously reducing implementation costs and timelines. Enable. Implementation timelines can be significantly reduced by leveraging the schema definition language 126. This is because in a typical data warehouse, the data structure needs to be normalized and placed in a dimension table. However, by utilizing the schema definition language 126, the data 203 can be simply associated with metadata structures and data structures. Once associated, the ETL tool 216 can be automatically generated and the physical data table 218 can be populated. As soon as the physical data table 218 of the operational data store 206 is populated, the business intelligence tool 134 does not require additional normalization steps and creates a dimension table with extraction, transformation and loading operations between each step. Without populating, the schema definition language 126 can be utilized to filter, report, and execute data in the operational data store 206.

予め定義されたデータタイプ４１０を有する特性２２２を提供するスキーマ定義言語１２６は、ＥＴＬツール２１６の自動生成を可能にし、これにより実施コストとタイムラインを大幅に削減することもさらに発見した。 It has further been discovered that a schema definition language 126 that provides a characteristic 222 having a predefined data type 410 allows automatic generation of the ETL tool 216, thereby significantly reducing implementation costs and timelines.

ビジネスインテリジェンスツール１３４が効果的に動作してユーザに直感的な体験を与えることができるように、情報伝達ブロック２１０は、スキーマ定義言語１２６を参照するための管理エンジン２０８に接続される。管理エンジン２０８は、フィルタ１２８とリポートデータの定義１３０とをさらに有することができる。フィルタ１２８は、エンティティ４０４の特性の観察４０８に基づいて、エンティティ４０４のクラスを生成するために用いられる一次元フィルタでありえる。フィルタは、図９〜１２を参照して以下に詳細に説明される。リポートデータの定義１３０は、リポートのために要求され望まれる、かつオペレーショナルデータストア２０６からデータ２０３をリトリーブするために活用されることとなる、エンティティタイプ２２０と、モジュールタイプ２２４と、特性２２２とのセットでありえる。リポートデータの定義１３０は、図１３および１４を参照して以下に詳細に説明される。 The information delivery block 210 is connected to a management engine 208 for referencing the schema definition language 126 so that the business intelligence tool 134 can operate effectively to give the user an intuitive experience. The management engine 208 can further include a filter 128 and a report data definition 130. Filter 128 may be a one-dimensional filter used to generate a class of entity 404 based on observation 408 of the characteristics of entity 404. The filter is described in detail below with reference to FIGS. The report data definition 130 includes the entity type 220, module type 224, and characteristic 222 that are required and desired for the report and that will be utilized to retrieve the data 203 from the operational data store 206. Can be a set. The report data definition 130 is described in detail below with reference to FIGS.

ここで図３を参照すると、そこには、本発明の実施の形態に係る例示的なメタデータテーブル３００が示されている。メタデータテーブル３００は、図１のスキーマ定義言語１２６を表わす。スキーマ定義言語１２６は、図２の物理データテーブル２１８のデータ２０３を説明するためのメタデータ３０２として用いられることができる。メタデータテーブル３００間で、複数の関係リンク３０４を有することができる。メタデータテーブル３００は、特性テーブル３０８にリンクされたエンティティタイプテーブル３０６を有することができる。エンティティタイプテーブル３０６と特性テーブル３０８とは、モジュールタイプテーブル３１０にリンクされることができる。 Referring now to FIG. 3, there is shown an exemplary metadata table 300 according to an embodiment of the present invention. The metadata table 300 represents the schema definition language 126 of FIG. The schema definition language 126 can be used as metadata 302 for explaining the data 203 of the physical data table 218 of FIG. A plurality of relationship links 304 can be provided between the metadata tables 300. The metadata table 300 can have an entity type table 306 linked to the characteristics table 308. The entity type table 306 and the characteristic table 308 can be linked to the module type table 310.

エンティティタイプテーブル３０６と、特性テーブル３０８と、モジュールタイプテーブル３１０とは、図２のエンティティタイプ２２０と、特性２２２と、モジュールタイプ２２４とそれぞれ関連付けられることができる。メタデータテーブル３００は、エンティティタイプテーブル３０６、モジュールタイプテーブル３１０、および特性テーブル３０８内で行として、図４のエンティティ４０４、モジュール４０６、および特性の観察４０８をそれぞれ含むことができる。 The entity type table 306, the characteristic table 308, and the module type table 310 can be associated with the entity type 220, the characteristic 222, and the module type 224 of FIG. The metadata table 300 may include the entity 404, module 406, and property observation 408 of FIG. 4 as rows in the entity type table 306, module type table 310, and property table 308, respectively.

メタデータテーブル３００は、物理データテーブル２１８を意味論的表現によってマッピングするメタデータ３０２を含んでいる。一例として、特性テーブル３０８は、特性の観察４０８の対応する行を含むことができる。一例として、ＩＣＤ９選択は、特性テーブル３０８内の行として含まれることができる特性の観察４０８である。特性テーブル３０８のＩＣＤ９選択は、モジュール４０６への、かつ／または特性の観察４０８に関連付けられることができるエンティティ４０４へのポインタまたはレファレンスを含むことができる。例えば、ＩＣＤ９の特性の観察４０８は、診断モジュール４０６と患者エンティティ４０４とを指し示すことがある。 The metadata table 300 includes metadata 302 that maps the physical data table 218 with a semantic representation. As an example, the characteristics table 308 may include corresponding rows of the characteristics observation 408. As an example, the ICD 9 selection is a property observation 408 that can be included as a row in the property table 308. The ICD 9 selection of the property table 308 can include a pointer or reference to the module 404 and / or to the entity 404 that can be associated with the property observation 408. For example, the observation 408 of the characteristics of the ICD 9 may point to the diagnostic module 406 and the patient entity 404.

メタデータテーブル３００は意味論的表現または物理データテーブル２１８の構造のマップを含むので、メタデータテーブル３００は物理データテーブル２１８内に含まれるデータ２０３の位置を特定するために用いられることができる。メタデータテーブル３００は、物理データテーブル２１８のデータ２０３間の関係リンク３０４を特定するためにも用いられる。 Since the metadata table 300 includes a semantic representation or a map of the structure of the physical data table 218, the metadata table 300 can be used to locate the data 203 contained within the physical data table 218. The metadata table 300 is also used to specify the relationship link 304 between the data 203 of the physical data table 218.

モジュールタイプテーブル３０６と、エンティティタイプテーブル３０６と、特性テーブル３０８との関係は、図２に関しすでに詳細に説明されたエンティティタイプ２２０と、特性２２２と、モジュールタイプ２２４との関係を示す。 The relationship between the module type table 306, the entity type table 306, and the property table 308 shows the relationship between the entity type 220, the property 222, and the module type 224 that have already been described in detail with reference to FIG.

メタデータテーブル３００間の関係リンク３０４は、データ２０３がどのようにして図２のオペレーショナルデータストア２０６の物理テーブル２１８内に結果的に構成されることとなるかについてのマップである。関係リンク３０４は、モジュール４０６、エンティティ４０４あるいは他のデータまたはメタデータ構造を指し示すか参照する特性の観察４０８を絵で表したものである。 The relationship link 304 between the metadata tables 300 is a map of how the data 203 will eventually be organized in the physical table 218 of the operational data store 206 of FIG. Relationship link 304 is a pictorial representation of a characteristic observation 408 that points to or references module 406, entity 404, or other data or metadata structure.

スキーマ定義言語１２６の関係リンク３０４は、データ２０３を結果的に保持することとなる物理テーブル２１８のための関係構造を提供する。スキーマ定義言語１２６は、物理テーブル２１８の位置を特定するためのマップと物理テーブル間の関係とを与える。 The relationship link 304 of the schema definition language 126 provides a relationship structure for the physical table 218 that will eventually hold the data 203. The schema definition language 126 gives a map for specifying the position of the physical table 218 and the relationship between the physical tables.

一例として、図１のビジネスインテリジェンスツール１３４が、スキーマ定義言語１２６を参照して特性の観察４０８の位置を特定し、特性の観察４０８がどのようにしてエンティティ４０４とモジュール４０６とにリンクされているかを判断することにより、ユーザはエンティティ４０４またはモジュール４０６をフィルタリングするための特性の観察４０８を選択する。特性の観察４０８にリンクされたモジュール４０６とエンティティ４０４とがメタデータテーブル３００によって一度特定されると、他の特徴の観察４０８が、ユーザが選択した特徴の観察４０８として同じモジュール４０６またはエンティティ４０４に属する。 As an example, the business intelligence tool 134 of FIG. 1 refers to the schema definition language 126 to locate the characteristic observation 408 and how the characteristic observation 408 is linked to the entity 404 and the module 406. , The user selects the characteristic observation 408 for filtering the entity 404 or module 406. Once the module 406 and the entity 404 linked to the property observation 408 are identified by the metadata table 300, the other feature observation 408 is transferred to the same module 406 or entity 404 as the user selected feature observation 408. Belongs.

エンティティ４０４とモジュール４０６とについての特性の観察４０８の関係を提供することにより、図２の管理エンジン２０８は、選択された第１の特性の観察４０８に関連する他の特性の観察４０８を選択するオプションをユーザに与えることができ、または、他のモジュール４０６を選択するオプションをユーザに与えることができる。管理エンジン２０８は、ユーザのための選択オプションとして他の特性の観察４０８をユーザへ同じモジュール４０６からリターンするためのスキーマ定義言語１２６を活用することができる。このようにして、スキーマ定義言語１２６は、スキーマ定義言語１２６内に要求されたか含まれたものとして物理テーブル２１８内のデータ２０３の構造のみに基づいて、関連するフィルタまたはリポートをユーザに提供するために活用されることができる。 By providing the property observation 408 relationship for the entity 404 and the module 406, the management engine 208 of FIG. 2 selects the other property observation 408 associated with the selected first property observation 408. Options can be given to the user or the user can be given the option of selecting another module 406. The management engine 208 can leverage the schema definition language 126 to return other characteristic observations 408 to the user from the same module 406 as a selection option for the user. In this way, the schema definition language 126 provides the user with relevant filters or reports based solely on the structure of the data 203 in the physical table 218 as required or included in the schema definition language 126. Can be utilized.

メタデータ３００間の関係リンク３０４は、多対１リンク３１２を示す。エンティティタイプテーブル３０６とモジュールタイプテーブル３１０とは、エンティティ４０４間で階層的な関係を可能にするリカーシブリンク３１４をも含んでいる。 The relationship link 304 between the metadata 300 indicates a many-to-one link 312. The entity type table 306 and the module type table 310 also include a recursive link 314 that enables a hierarchical relationship between the entities 404.

他のテーブルは、データタイプテーブル３１６、選択タイプテーブル３１８、選択テーブル３２０、および承認テーブル３２２を含むメタデータテーブル３００を構成することができる。メタデータテーブル３００は、本発明から逸脱することなくより多くのテーブルを含むことができると考えられる。一例として、メタデータテーブル３００は、モジュールタイプメンバーテーブルまたは測定単位テーブルを含んでもよい。本発明から逸脱することなくテーブルは組み合わされまたは削除されることができるとさらに理解される。 Other tables can constitute a metadata table 300 that includes a data type table 316, a selection type table 318, a selection table 320, and an approval table 322. It is contemplated that the metadata table 300 can include more tables without departing from the present invention. As an example, the metadata table 300 may include a module type member table or a unit of measurement table. It is further understood that the tables can be combined or deleted without departing from the invention.

ここで図４を参照すると、そこには、本発明の実施の形態に係る例示的なスキーマ定義モデル４００が示されている。スキーマ定義モデル４００は、図１のスキーマ定義言語１２６をデータコンテキストに適用することができる。説明目的のためのみのヘルスケアの例を続けると、エンティティタイプ２２０と、モジュールタイプ２２４と、特性２２２とは、エンティティ４０４と、モジュール４０６と、特性の観察４０８のためのメタデータ３０２にマッピングされた。 Referring now to FIG. 4, there is shown an exemplary schema definition model 400 according to an embodiment of the present invention. The schema definition model 400 can apply the schema definition language 126 of FIG. 1 to a data context. Continuing with the healthcare example for illustrative purposes only, entity type 220, module type 224, and characteristic 222 are mapped to entity 404, module 406, and metadata 302 for characteristic observation 408. It was.

メタデータ３０２は、エンティティタイプ２２０と、モジュールタイプ２２４と、特性２２２と同様に、エンティティ４０４と、モジュール４０６と、特性の観察４０８とに関連付けられることができる。以下に説明的な例を通じて示されるように、スキーマ定義モデル４００内に含まれるメタデータ３０２は、特性の観察４０８がどのようにしてエンティティ４０４にリンクされるかについて、および、特性の観察４０８がどのようにしてモジュール４０６と共にグループ分けされるかについて定義することができる。 The metadata 302 can be associated with the entity 404, the module 406, and the property observation 408 as well as the entity type 220, the module type 224, and the property 222. As shown through the illustrative examples below, the metadata 302 contained within the schema definition model 400 includes information about how the property observation 408 is linked to the entity 404 and the property observation 408. It can be defined how it is grouped with the module 406.

説明的な一例として、エンティティ４０４は、患者、サンプル、実験または遺伝的変種でありえる。エンティティ４０４は、共にグループ分けされた非経年的な特性の観察４０８を有するものとして示される。説明的な一例として、患者として特定されたエンティティ４０４と共にグループ分けされた特性の観察４０８は、ＩＤ、誕生日、死亡日、性別、民族性、主たる支払い者、および現在の生活機能状態を含むことができる。 As an illustrative example, entity 404 can be a patient, sample, experiment, or genetic variant. Entity 404 is shown as having an aged characteristic observation 408 grouped together. As an illustrative example, the observations 408 of characteristics grouped with the entity 404 identified as a patient include ID, date of birth, death date, gender, ethnicity, primary payer, and current living functioning status. Can do.

モジュール４０６と共にグループ分けされた特性の観察４０８は、経年的な特性の観察４０８として示される。説明的な一例として、照射のためのメタデータ３０２を有するモジュール４０６と共にグループ分けされた特性は、１回分の追加照射、追加照射処置モダリティ、処置回数、放射線照射対象の解剖学的部位、放射線量、および局所への１回分の照射量を含むことができる。モジュール４０６は、照射に対応するモジュール４０６において処置モジュール４０６のためのポインタのような他のモジュール４０６への外来キーとなるポインタをも含むことができる。 Characteristic observation 408 grouped with module 406 is shown as aged characteristic observation 408. As an illustrative example, the characteristics grouped together with the module 406 having metadata 302 for irradiation include one additional dose, additional irradiation treatment modality, number of treatments, anatomical site to be irradiated, radiation dose , And a local dose. Module 406 can also include a pointer that is a foreign key to other modules 406, such as a pointer for treatment module 406 in module 406 corresponding to irradiation.

モジュール４０６とエンティティ４０４は、スキーマ定義モデル４００において図２の実際のデータ２０３として描かれていないが、代わりにメタデータ３０２を有するものとして示されている。特性の観察４０８は、データタイプ４１０の形式でメタデータ３０２を有するものとしても示される。データタイプ４１０は、図２の物理テーブル２１８を結果的にポピュレートすることとなるデータ２０３のタイプを指定することができる。 Module 406 and entity 404 are not depicted as actual data 203 in FIG. 2 in schema definition model 400, but are shown as having metadata 302 instead. Characteristic observation 408 is also shown as having metadata 302 in the form of data type 410. The data type 410 can specify the type of data 203 that will eventually populate the physical table 218 of FIG.

特性の観察４０８のためのデータタイプ４１０は、整数データ、実データ、テキストデータ、長いテキストデータ、日付、日時、選択データ、モジュールポインタデータ、およびエンティティポインタデータを指定することができる。データタイプ４１０のメタデータ３０２を活用することにより、図２のＥＴＬツール２１６は図２の管理エンジン２０８によって自動的に生成されることとなる。ＥＴＬツール２１６の生成については、図８を参照して以下に詳細に説明する。 The data type 410 for the property observation 408 can specify integer data, real data, text data, long text data, date, date and time, selection data, module pointer data, and entity pointer data. By utilizing the metadata 302 of the data type 410, the ETL tool 216 of FIG. 2 is automatically generated by the management engine 208 of FIG. The generation of the ETL tool 216 will be described in detail below with reference to FIG.

図３のエンティティタイプテーブル３０６とモジュールタイプテーブル３１０とのリカーシブ関係により、エンティティ４０４とモジュール４０６はそれぞれ、サブエンティティおよびサブモジュールを含むことができる。サブエンティティは、特定のエンティティ４０４から生じるエンティティ４０４でありえる。サブモジュールは、特定のモジュール４０６から生じるモジュール４０６でありえる。 Due to the recursive relationship between the entity type table 306 and the module type table 310 of FIG. 3, the entity 404 and the module 406 can include sub-entities and sub-modules, respectively. A sub-entity can be an entity 404 that originates from a particular entity 404. A submodule can be a module 406 that originates from a particular module 406.

ここで図５を参照すると、そこには、本発明の実施の形態に係る例示的なスキーマ定義モデルテーブル５００が示されている。スキーマ定義モデルテーブル５００は、説明を容易にするためにシンプル化したフォーマットにおいて示される。 Referring now to FIG. 5, there is shown an exemplary schema definition model table 500 according to an embodiment of the present invention. The schema definition model table 500 is shown in a simplified format for ease of explanation.

スキーマ定義モデルテーブル５００は、複数の行および列を有するものとして示される。各行は特性の観察４０８に対応する。テーブルのセルは、図２のオペレーショナルデータストアの物理テーブル２１８をポピュレートすることとなる現実のデータ２０３ではなくメタデータ３０２で埋められる。 The schema definition model table 500 is shown as having a plurality of rows and columns. Each row corresponds to a characteristic observation 408. The table cells are filled with metadata 302 instead of the actual data 203 that will populate the physical table 218 of the operational data store of FIG.

図４と同様に、特性の観察４０８は、エンティティタイプ２２０およびモジュールタイプ２２４とリンクされることができる。エンティティタイプ２２０は第１列によって表されることができ、モジュールタイプ２２４は第２列によって表されることができる。同様に、特性の観察４０８の行は、特性の観察４０８の他の特徴を示す列を通過することとなる。特性の観察４０８は、次のものと関連付けられることができる。すなわち、ディスプレイ特性５０２、シーケンス５０４、特性の名称５０６、データタイプ４１０、選択タイプ５０８、垂直インジケータ５１０、対象エンティティタイプ５１２、対象モジュールタイプ５１４、測定単位５１６、多数の選択インジケータ５１８、および保護インジケータ５２０である。 Similar to FIG. 4, characteristic observation 408 can be linked with entity type 220 and module type 224. The entity type 220 can be represented by the first column and the module type 224 can be represented by the second column. Similarly, the row of characteristic observations 408 will pass through a column showing other features of characteristic observations 408. The characteristic observation 408 can be associated with: That is, display characteristic 502, sequence 504, characteristic name 506, data type 410, selection type 508, vertical indicator 510, target entity type 512, target module type 514, unit of measure 516, multiple selection indicators 518, and protection indicator 520 It is.

第１列のエンティティタイプ２２０は、特性の観察４０８が関連付けられるエンティティ４０４でありえる。エンティティタイプ２２０は、先行するヘルスケア分野の例を活用するならば、例えば、患者、サンプル、実験、遺伝的変種、病院、または外科医を含むことができる。第２列のエンティティタイプ２２４は、特性の観察４０８が関連付けられるモジュール４０６でありえる。モジュールタイプ２２４は、患者情報（エンティティ４０４のための非経年的な特性の観察４０８を含むための特別なモジュール）、がん診断、化学療法、薬物療法、照射、外科手術、処置、病理および生活機能、同意、サンプル情報、実験情報、遺伝的変種の観察、その他を含むことができる。 The first column entity type 220 may be the entity 404 with which the property observation 408 is associated. Entity types 220 can include, for example, patients, samples, experiments, genetic variants, hospitals, or surgeons if they take advantage of previous healthcare field examples. The second column of entity types 224 may be a module 406 with which a characteristic observation 408 is associated. Module type 224 includes patient information (special module for including aged characteristics observation 408 for entity 404), cancer diagnosis, chemotherapy, drug therapy, irradiation, surgery, treatment, pathology and life. Functions, consent, sample information, experimental information, observation of genetic variants, etc. can be included.

第３列のディスプレイ特性５０２は、特性の観察４０８がモジュール４０６のためのデフォルトＩＤ特性として表示されるべきが否かを示すことができる。ディスプレイ特性５０２は、特性の観察４０８を特定された特性または特定されていない特性であるとして特定することができる。患者エンティティの患者情報モジュールと関連付けられた、特定された特性の一例は、患者ＩＤでありえる。患者エンティティの診断モジュールと関連付けられた、特定された特性の一例は、ＩＣＤ９値でありえる。患者エンティティの患者情報モジュールと関連付けられた、特定されていない特性の一例は、この特性がエンティティ４０４と共に機能するデフォルト値としてあまり役に立たないであろうことから、誕生日でありえる。 A third column of display characteristics 502 can indicate whether characteristic observation 408 should be displayed as a default ID characteristic for module 406. Display characteristic 502 may identify characteristic observation 408 as being a specified characteristic or an unspecified characteristic. An example of the identified characteristic associated with the patient information module of the patient entity may be a patient ID. An example of an identified characteristic associated with a patient entity diagnostic module may be an ICD9 value. An example of an unspecified characteristic associated with the patient information module of a patient entity may be a birthday because this characteristic would not be very useful as a default value to work with entity 404.

第４列のシーケンス５０４は、特性の観察４０８のモジュール４０６内への表示可能順序を示すことができる。第５列の特性の名称５０６は、名称によって特性を特定することができる。薬物療法モジュールの特性の名称５０６の一例として、特性の観察４０８は、投薬量、投薬頻度、薬剤名、投薬順、薬物療法処置、または薬物療法診断として特定されることができる。各モジュールタイプ２２４は、物理テーブル２１８において特性の観察４０８へマッピングされるデータ２０３に対応する独自の特性の名称５０６を含むことができる。 The fourth column sequence 504 may indicate the order in which the observation of characteristics 408 can be displayed in the module 406. The characteristic name 506 in the fifth column can specify the characteristic by the name. As an example of a drug therapy module property name 506, the property observation 408 can be specified as a dosage, a dosing frequency, a drug name, a dosing order, a drug therapy treatment, or a drug therapy diagnosis. Each module type 224 may include a unique property name 506 corresponding to the data 203 that is mapped to the property observation 408 in the physical table 218.

第６列のデータタイプ４１０は、特性の観察４０８が格納されることとなるデータのタイプを示すことができる。特性の観察４０８は、整数データ、実データ、テキストデータ、長いテキストデータ、日付データ、日時データ、選択データ、モジュールポインタ、およびエンティティポインタを特定することができる。 The data type 410 in the sixth column may indicate the type of data for which the characteristic observation 408 will be stored. Characteristic observation 408 can identify integer data, real data, text data, long text data, date data, date / time data, selection data, module pointers, and entity pointers.

第７列の選択タイプ５０８は、データタイプ４１０が選択データに対応するならば、用いられることとなる。選択タイプ５０８は、選択データに関連付けられた特性の観察４０８のためのすべての有効な選択のリストでありえる。選択タイプ５０８の一例として、はい／いいえや男性／女性を含むことができる。選択タイプ５０８は、処置または化学療法モジュールのための潜在的な化学療法タイプのリストのような長いものを含むこともできる。選択タイプ５０８のこのタイプは、次のものを含むリストであってもよい。すなわち、塩酸メクロレタミン、シクロフォスファミド、クロラムブシル、メルファラン、イフォスファミド、チオテパ、ヘキサメチルメラミン、アルトレートアミン、塩酸プロカルバジン、ダカルバジン、テモゾロミド、カルムスチン、ロムスチン、ストレプトゾシン、カルボプラチン、シスプラチン、オキサリプラチンなどである。 The selection type 508 in the seventh column will be used if the data type 410 corresponds to the selection data. Selection type 508 can be a list of all valid selections for observation 408 of the characteristics associated with the selection data. Examples of selection type 508 can include yes / no and male / female. Selection type 508 may also include a long one such as a list of potential chemotherapy types for a treatment or chemotherapy module. This type of selection type 508 may be a list that includes: That is, mechloretamine hydrochloride, cyclophosphamide, chlorambucil, melphalan, ifosfamide, thiotepa, hexamethylmelamine, altamine, procarbazine hydrochloride, dacarbazine, temozolomide, carmustine, lomustine, streptozocin, carboplatin, cisplatin, oxaliplatin, etc. .

第８列の垂直インジケータ５１０は、特性の観察４０８がどのようにしてオペレーショナルデータストア２０６の物理テーブル２１８に格納されることとなるかを示すことができる。垂直インジケータ５１０が、特性の観察４０８が垂直に格納されていないことを示す「Ｎ」であるなら、特性の観察４０８は、当該特性が関連付けられているモジュール４０６を表すテーブルにおける、ある列に格納されることとなる。垂直インジケータ５１０が、特性の観察４０８が垂直に格納されていることを示す「Ｙ」であるなら、それぞれ別々の行を有する特性の観察４０８が別々のテーブルに格納されることとなる。 The eighth column vertical indicator 510 may indicate how the characteristic observation 408 will be stored in the physical table 218 of the operational data store 206. If the vertical indicator 510 is “N” indicating that the property observation 408 is not stored vertically, the property observation 408 is stored in a column in the table representing the module 406 with which the property is associated. Will be. If vertical indicator 510 is “Y” indicating that characteristic observations 408 are stored vertically, characteristic observations 408, each having a separate row, will be stored in separate tables.

第９列の対象エンティティタイプ５１２は、データタイプ４１０がエンティティポインタであるならば用いられることとなる。対象エンティティタイプ５１２は、特性の観察４０８が関連付けられているモジュール４０６によって参照されるエンティティタイプ２２０を示すために用いられることができる。一例として、エンティティポインタに対応し、がん診断モジュールと関連付けられている特性の観察４０８のひとつは、がん診断が生体組織検査の結果であるならば、「サンプル」のエンティティタイプ２２０を参照してもよい。 The target entity type 512 in the ninth column is used if the data type 410 is an entity pointer. The target entity type 512 can be used to indicate the entity type 220 referenced by the module 406 with which the property observation 408 is associated. As an example, one of the observations 408 of characteristics corresponding to the entity pointer and associated with the cancer diagnostic module refers to the “sample” entity type 220 if the cancer diagnosis is the result of a biopsy. May be.

第１０列の対象モジュールタイプ５１４は、データタイプ４１０がモジュールポインタであるならば用いられることとなる。対象モジュールタイプ５１４は、特性の観察４０８が関連付けられているモジュール４０６によって参照されるモジュールタイプ２２４を示すために用いられることができる。一例として、モジュールポインタに対応し、照射モジュールと関連付けられている特性の観察４０８のひとつは、「処置」のモジュールタイプ２２４を参照してもよい。 The target module type 514 in the tenth column is used if the data type 410 is a module pointer. The target module type 514 can be used to indicate the module type 224 referenced by the module 406 with which the property observation 408 is associated. As an example, one of the observations 408 of the characteristics corresponding to the module pointer and associated with the irradiation module may refer to the “treatment” module type 224.

第１１列の測定単位５１６は、数字である場合の特性の観察４０８について測定単位を示すために用いられることができる。一例として、測定単位５１６はポンドやキログラムでありえる。 The measurement unit 516 in the eleventh column can be used to indicate the unit of measurement for the observation 408 of the characteristic when it is a number. As an example, the unit of measure 516 can be pounds or kilograms.

第１２列の複数選択インジケータ５１８は、データタイプ４１０が、ユーザが複数選択できるか１つだけ選択できるかを示す選択であるとき、活用されることができる。一例として、複数選択インジケータ５１８が「Ｎ」であれば、１つの選択のみが有効であり、複数選択インジケータ５１８が「Ｙ」であれば、複数の選択が有効である。 The twelfth column multi-select indicator 518 can be leveraged when the data type 410 is a selection that indicates whether the user can select multiple or only one. As an example, if multiple selection indicator 518 is “N”, only one selection is valid, and if multiple selection indicator 518 is “Y”, multiple selections are valid.

第１３列の保護インジケータ５２０は、その特性が取り扱いに慎重を要する情報を示すか否かを示すことができる。例えば、保護インジケータ５２０が「Ｎ」であれば、この特性は患者を特定する情報を含んでいることがあり、異なる取り扱いがなされるべきである。 The protection indicator 520 in the thirteenth column can indicate whether the characteristic indicates sensitive information. For example, if the protection indicator 520 is “N”, this characteristic may contain information identifying the patient and should be handled differently.

スキーマ定義モデルテーブル５００は、オペレーショナルデータストア２０６の物理テーブル２１８を生成するために用いられることとなる。各モジュール４０６は、オペレーショナルデータストア２０６内の別々のテーブルとなる。各特性の観察４０８は、それらが関連付けられるモジュール４０６のテーブル内の列となる。データ２０３は、物理テーブル２１８内のセルを占有することとなる。 The schema definition model table 500 will be used to generate the physical table 218 of the operational data store 206. Each module 406 becomes a separate table in the operational data store 206. Each characteristic observation 408 is a column in the table of module 406 to which they are associated. Data 203 will occupy a cell in the physical table 218.

ここで図６を参照すると、そこには、本発明の実施の形態に係る例示的な物理テーブル２１８が示されている。物理テーブル２１８は、図２のオペレーショナルデータストア２０６の物理テーブル２１８でありえる。 Referring now to FIG. 6, there is shown an exemplary physical table 218 according to an embodiment of the present invention. The physical table 218 can be the physical table 218 of the operational data store 206 of FIG.

以下に説明するように、物理テーブル２１８は、図３のメタデータ３０２を参照することによって生成されることができる。メタデータ３０２は、図１のスキーマ定義言語１２６内に、単独で含まれるか、図３のメタデータテーブル３００に表されるスキーマ定義言語１２６と、図４のスキーマ定義モデル４００と、図５のスキーマ定義モデルテーブル５００との組み合わせとして含まれる。物理テーブル２１８は、図４の特性の観察４０８の少なくとも一つに基づいて、図４の互いにリンクを有するモジュール４０６とエンティティ４０４のためのテーブルを含むことができる。 As described below, the physical table 218 can be generated by referring to the metadata 302 of FIG. The metadata 302 is included alone in the schema definition language 126 of FIG. 1 or represented in the metadata table 300 of FIG. 3, the schema definition model 400 of FIG. 4, and the metadata 302 of FIG. It is included as a combination with the schema definition model table 500. The physical table 218 may include a table for the modules 406 and entities 404 having links to each other in FIG. 4 based on at least one of the property observations 408 in FIG.

既に説明したヘルスケアの例を続け、スキーマ定義言語１２６を活用すると、物理テーブル２１８は、モジュールタイプ２２４の列において定義されたモジュール４０６またはスキーマ定義モデルテーブル５００の第２列に対応するものとして示される。スキーマ定義モデルテーブル５００において定義された各モジュール４０６は、物理テーブル２１８の完全なリストにおいて創作される独自の物理テーブル６０２を有する。 Continuing with the health care example already described and utilizing the schema definition language 126, the physical table 218 is shown as corresponding to the module 406 defined in the module type 224 column or the second column of the schema definition model table 500. It is. Each module 406 defined in the schema definition model table 500 has its own physical table 602 that is created in the complete list of physical tables 218.

物理テーブル２１８は、テーブルタイトル６０４と、それから列名称６０６のリストとを有することができる。列名称６０６は、スキーマ定義モデルテーブル５００の行に含まれる特性の観察４０８である。さらに、スキーマ定義モデルテーブル５００の特性の観察４０８がモジュールポインタまたはエンティティポインタであるときは、これらの特性の観察４０８はリンク６０８として物理テーブル２１８に描かれる。一般的に、リンク６０８は、１対１関係または多対１関係を含むことができる。 The physical table 218 can have a table title 604 and then a list of column names 606. A column name 606 is an observation 408 of characteristics included in the row of the schema definition model table 500. Further, when the property observations 408 of the schema definition model table 500 are module pointers or entity pointers, these property observations 408 are depicted in the physical table 218 as links 608. In general, link 608 can include a one-to-one relationship or a many-to-one relationship.

例えば、エンティティテーブル６１０は、ＯＤＳ＿ＰａｔｉｅｎｔＩｎｆｏテーブル６１２と１対１関係を有することができる。このことは、エンティティテーブル６１０は、エンティティごとに、ＯＤＳ＿ＰａｔｉｅｎｔＩｎｆｏテーブル６１２を指し示す一つの外来キーを含むこととなることを意味する。同様に、ＯＤＳ＿ＰａｔｉｅｎｔＩｎｆｏテーブル６１２は、エンティティテーブル６１０を指し示す外来キーを含むこととなる。 For example, the entity table 610 can have a one-to-one relationship with the ODS_PatientInfo table 612. This means that the entity table 610 includes one foreign key indicating the ODS_PatientInfo table 612 for each entity. Similarly, the ODS_PatientInfo table 612 includes a foreign key that points to the entity table 610.

物理テーブル２１８は、多対１リンクをさらに含むことができる。例えば、ＯＤＳ＿ＰａｔｉｅｎｔＩｎｆｏテーブル６１２は、複数のＯＤＳ＿Ｍｅｄｉｃａｔｉｏｎテーブル６１４から多数の外来キーを含むことができるが、各ＯＤＳ＿Ｍｅｄｉｃａｔｉｏｎテーブル６１４は、ＯＤＳ＿ＰａｔｉｅｎｔＩｎｆｏテーブル６１２のために単一の外来キーのみを含むこととなる。 The physical table 218 can further include many-to-one links. For example, the ODS_PatientInfo table 612 can include a number of foreign keys from the multiple ODS_Mediation tables 614, but each ODS_Mediation table 614 will only include a single foreign key for the ODS_PatientInfo table 612.

スキーマ定義言語１２６を利用することによって表される別の解決策は、垂直インジケータ５１０のフラグによってもたらされる。典型的には、垂直インジケータ５１０のフラグが否定的または「Ｎ」であるときは、スキーマ定義モデルテーブル５００における各特性の観察４０８は、物理データテーブル２１８における列となる。このことはＯＤＳ＿ＬａｂｓＶｉｔａｌｓテーブル６１６によって説明される。しかしながら、特性の観察４０８が非常に多いときは、垂直インジケータ５１０は肯定または「Ｙ」に設定されることができる。垂直インジケータ５１０が肯定のときは、特性の観察４０８は物理データテーブル２１８内の列として構成されるべきではないが、ＯＤＳ＿ＬａｂｓＶｉｔａｌｓ＿Ｖｅｒｔｉｃａｌテーブル６１８として説明される別のテーブル内の行として占有すべきである。一例として、白血球数や赤血球数のような特性の観察４０８が、他の数多くの特性に沿ってＯＤＳ＿ＬａｂｓＶｉｔａｌｓ＿Ｖｅｒｔｉｃａｌテーブル６１８を占有しうる。 Another solution expressed by utilizing the schema definition language 126 is provided by the flag of the vertical indicator 510. Typically, when the vertical indicator 510 flag is negative or “N”, each characteristic observation 408 in the schema definition model table 500 is a column in the physical data table 218. This is illustrated by the ODS_LabsVitals table 616. However, when there are very many characteristic observations 408, the vertical indicator 510 can be set to positive or “Y”. When vertical indicator 510 is positive, characteristic observation 408 should not be configured as a column in physical data table 218, but should be occupied as a row in another table described as ODS_LabsVitals_Vertical table 618. As an example, observations 408 of characteristics such as white blood cell count and red blood cell count may occupy the ODS_LabsVitals_Vertical table 618 along with many other characteristics.

図７、８、９、１１、１３、および１５のフローチャートのためのデータ要件と同様に、図１〜６に関し説明されるデータ要件は、非一時的なコンピュータで読み取り可能なメディアに固定されるかセーブされることができる。これらのデータ要件は、ハードウェアの利用および実施なしにはアクセスされ又は変更されることはできず、データ値の変化は、ビット形式でハードウェア記録の物理的かつ非一時的な変換を表すことができる。 Similar to the data requirements for the flowcharts of FIGS. 7, 8, 9, 11, 13, and 15, the data requirements described with respect to FIGS. 1-6 are anchored to non-transitory computer readable media. Can be saved. These data requirements cannot be accessed or changed without hardware utilization and implementation, and changes in data values represent physical and non-temporary conversions of hardware records in bit form. Can do.

図２のオペレーショナルデータストア２０６の実施と利用のためのデータ要件について説明したが、オペレーショナルデータストア２０６の実施と利用のためのプロセスについてこれから説明する。図７、８、９、１１、１３、および１５に関し説明されるフローチャートにおいて、オペレーショナルデータストア２０６の実施と利用のためのプロセスのステップは、ハードウェア、ソフトウェア、ファームウェア、またはそれらの組み合わせにおいて実現されることができる。さらに、ステップの境界は一般に多種多様であり、異なる実施の形態において機能は共に実施されたり別々に実施されたりする。 Having described the data requirements for implementation and use of the operational data store 206 of FIG. 2, the process for implementing and using the operational data store 206 will now be described. In the flowcharts described with respect to FIGS. 7, 8, 9, 11, 13, and 15, the process steps for implementation and utilization of operational data store 206 are implemented in hardware, software, firmware, or a combination thereof. Can. Furthermore, the step boundaries are generally diverse, and the functions may be performed together or separately in different embodiments.

ここで図７を参照すると、そこには、本発明の実施の形態に係る、オペレーショナルデータストアを実施しポピュレートするための例示的な制御フロー７００が示されている。一例として、図２のオペレーショナルデータストア２０６は、同様に実施されポピュレートされてもよい。 Referring now to FIG. 7, there is shown an exemplary control flow 700 for implementing and populating an operational data store according to an embodiment of the present invention. As an example, the operational data store 206 of FIG. 2 may be similarly implemented and populated.

制御フロー７００は、スキーマ定義言語提供ステップ７０２を含む。スキーマ定義言語提供ステップ７０２は、スキーマ定義言語１２６を提供するために呼び出されることができる。スキーマ定義言語１２６を提供することによって、設計者はデータおよびメタデータ構造を活用することができる。 The control flow 700 includes a schema definition language providing step 702. The provide schema definition language step 702 can be invoked to provide the schema definition language 126. By providing a schema definition language 126, designers can take advantage of data and metadata structures.

スキーマ定義言語提供ステップ７０２から続くのは、スキーマ定義モデル化ステップ７０４である。スキーマ定義モデル化ステップ７０４の間、設計者は、存在するファイル、存在するオペレーショナルデータストア、または存在するデータベースに含まれるデータ要素を用いてスキーマ定義モデル４００を定義するためのスキーマ定義言語１２６を活用することとなる。スキーマ定義言語１２６は、図２および図３に関し上記詳述したメタデータおよびデータ構造を活用することにより設計者によって実際に実現される。 Following the schema definition language providing step 702 is a schema definition modeling step 704. During the schema definition modeling step 704, the designer leverages the schema definition language 126 to define the schema definition model 400 using data elements contained in existing files, existing operational data stores, or existing databases. Will be. The schema definition language 126 is actually implemented by the designer by utilizing the metadata and data structures detailed above with respect to FIGS.

スキーマ定義モデル化ステップ７０４は、スキーマ定義モデル獲得ステップ７０６へと続くことができる。スキーマ定義モデル獲得ステップ７０６の間、スキーマ定義モデル４００は、各特性の観察４０８の各行を提供するとともにそれらを図４のエンティティ４０４およびモジュール４０６に付与するか関連付けるスキーマ定義モデルテーブル５００内で獲得されることができる。スキーマ定義モデル獲得ステップ７０６はスキップされてもよく、スキーマ定義言語１２６は、メタデータ３０２を参照しクエリを実行するためのスキーマ定義モデルテーブル５００の代わりに用いられることができる。スキーマ定義言語１２６は、図３のメタデータテーブル３００、スキーマ定義モデル４００、またはそれらの組み合わせとして用いられることができる。 The schema definition modeling step 704 can continue to the schema definition model acquisition step 706. During the schema definition model acquisition step 706, the schema definition model 400 is acquired in the schema definition model table 500 that provides each row of observations 408 of each characteristic and attaches or associates them to the entity 404 and module 406 of FIG. Can. The schema definition model acquisition step 706 may be skipped, and the schema definition language 126 may be used in place of the schema definition model table 500 for referencing the metadata 302 and executing a query. The schema definition language 126 can be used as the metadata table 300 of FIG. 3, the schema definition model 400, or a combination thereof.

スキーマ定義モデル獲得ステップ７０６は、オペレーショナルデータストア（ＯＤＳ）生成ステップ７０８へと続くことができる。ＯＤＳ生成ステップ７０８は、スキーマ定義モデルテーブル５００内に含まれるスキーマ定義言語１２６からオペレーショナルデータストア２０６の物理テーブル２１８を生成し、それらの間に図６のリンク６０８を創作することを含む。 The schema definition model acquisition step 706 can continue to an operational data store (ODS) generation step 708. The ODS generation step 708 includes generating the physical table 218 of the operational data store 206 from the schema definition language 126 contained within the schema definition model table 500 and creating the link 608 of FIG. 6 therebetween.

特に、ＯＤＳ生成ステップ７０８の間、物理テーブル２１８のひとつは、モジュール４０６ごとにまず生成される。スキーマ定義モデルテーブル５００の垂直インジケータ５１０が「Ｎ」であるならば、その後特性の観察４０８に関連付けられたモジュール４０６のために図６の物理テーブル６０２においてひとつの列が創作される。この列が特性の観察４０８のために創作されたときは、図５のデータタイプ４１０のインジケータは、この列が含むこととなるデータのタイプを特定する。 In particular, during the ODS generation step 708, one of the physical tables 218 is first generated for each module 406. If the vertical indicator 510 of the schema definition model table 500 is “N”, then a column is created in the physical table 602 of FIG. 6 for the module 406 associated with the property observation 408. When this column is created for a characteristic observation 408, the data type 410 indicator of FIG. 5 identifies the type of data that this column will contain.

ＯＤＳ生成ステップ７０８の間に、選択テーブルを参照するためのデータタイプ４１０の選択の特性の観察４０８のために、外来キーが生成されることになる。同様に、ＯＤＳ生成ステップ７０８の間に、対象エンティティタイプ５１２におけるインジケータを含むかスキーマ定義モデルテーブル５００の対象モジュールタイプ５１４を含む特性の観察４０８のために、外来キーが生成されることになる。対象エンティティタイプ５１２がインジケータを含むならば、図６のエンティティテーブル６１０を参照する外来キーが生成されることとなる。一方、対象モジュールタイプ５１４がインジケータを含むならば、モジュールテーブルのうちの一つを参照する外来キーが創作されることとなる。 During the ODS generation step 708, a foreign key will be generated for the observation 408 of the selection characteristics of the data type 410 to reference the selection table. Similarly, during the ODS generation step 708, a foreign key will be generated for the observation 408 of characteristics that include an indicator in the target entity type 512 or that includes the target module type 514 of the schema definition model table 500. If the target entity type 512 includes an indicator, a foreign key referring to the entity table 610 of FIG. 6 will be generated. On the other hand, if the target module type 514 includes an indicator, a foreign key referring to one of the module tables will be created.

スキーマ定義モデルテーブル５００の垂直インジケータ５１０が「Ｙ」であるならば、その後特性の観察４０８のために列は生成されない。代わりに、特性の観察４０８のタイプは、一行につき一つの特性の観察４０８を有する垂直テーブルに関連付けられたモジュール４０６にロードされることとなる。最後に、経年的なタイムスタンプに対応する特性の観察４０８は、モジュール４０６の物理テーブル２１８と関連付けられることとなる。 If the vertical indicator 510 of the schema definition model table 500 is “Y”, then no column is generated for the observation of characteristics 408. Instead, the type of characteristic observation 408 will be loaded into the module 406 associated with the vertical table having one characteristic observation 408 per row. Finally, a characteristic observation 408 corresponding to an aging time stamp will be associated with the physical table 218 of the module 406.

ＯＤＳ生成ステップ７０８は、ＥＴＬ生成ステップ７１０へと続く。ＯＤＳ生成ステップ７０８の物理テーブル２１８が、スキーマ定義モデル獲得ステップ７０６において完成されたスキーマ定義モデルテーブル５００に沿って一度完成されたときは、ＥＴＬツール２１６は、スキーマ定義モデルテーブル５００とスキーマ定義言語１２６とに基づいて自動的に生成されることができる。ＥＴＬツール２１６の生成については、図８を参照して以下に詳細に説明する。 ODS generation step 708 continues to ETL generation step 710. When the physical table 218 of the ODS generation step 708 is completed once along the schema definition model table 500 completed in the schema definition model acquisition step 706, the ETL tool 216 executes the schema definition model table 500 and the schema definition language 126. And can be automatically generated based on The generation of the ETL tool 216 will be described in detail below with reference to FIG.

ＥＴＬ生成ステップ７１０は、ＯＤＳポピュレートステップ７１２へと続く。ＥＴＬツール２１６がスキーマ定義モデルテーブル５００とスキーマ定義言語１２６とに基づいて一度自動的に生成され、かつ、物理テーブル２１８が一度創作されたときは、ＥＴＬツール２１６はＯＤＳポピュレートステップ７１２において活用されて、図２の外部ソース２０４からデータ２０３が抽出され、図２のデータ２０３がデータタイプ４１０、選択タイプ５０８、またはスキーマ定義モデルテーブル５００の他の列に準拠したフォーマットに変換される。データ２０３がスキーマ定義モデルテーブル５００とスキーマ定義言語１２６の要件に一度準拠すると、データ２０３は、オペレーショナルデータストア２０６の物理テーブル２１８へロードされる。 The ETL generation step 710 continues to the ODS population step 712. When the ETL tool 216 is automatically generated once based on the schema definition model table 500 and the schema definition language 126 and the physical table 218 is created once, the ETL tool 216 is utilized in the ODS population step 712. The data 203 is extracted from the external source 204 of FIG. 2, and the data 203 of FIG. Once the data 203 conforms to the requirements of the schema definition model table 500 and the schema definition language 126, the data 203 is loaded into the physical table 218 of the operational data store 206.

図８、９、１１、１３、および１５のフローチャートにおいて、スキーマ定義言語１２６、スキーマ定義モデル４００、またはスキーマ定義モデルテーブル５００を含む要素を参照しまたは利用する際は、それらのうち１つ又はそのいくつかの組み合わせが名付けられた要素の代わりに又はそれと共に参照され利用されることができると考えられる。説明を容易にするため、スキーマ定義モデルテーブル５００は、フローチャートの参照のための例として一般的に用いられる。 When referring to or using an element including the schema definition language 126, the schema definition model 400, or the schema definition model table 500 in the flowcharts of FIGS. It is contemplated that several combinations can be referenced and utilized in place of or in conjunction with the named element. For ease of explanation, the schema definition model table 500 is generally used as an example for reference to a flowchart.

ここで図８を参照すると、そこには、本発明の実施の形態に係る、抽出、変換、ロード関数を生成するための例示的な制御フロー８００が示されている。制御フロー８００は、図１のスキーマ定義言語１２６を参照することにより、図２のＥＴＬツール２１６を自動的に生成する例示的な方法を示す。スキーマ定義言語１２６は、図３のメタデータテーブル３００、図４のスキーマ定義モデル４００、図５のスキーマ定義モデルテーブル５００、またはその組み合わせ内に含まれるメタデータ３０２に関し参照されることができる。以下に説明されるとおり、図２の物理テーブル２１８は、メタデータ３０２にしたがって、ＥＴＬツール２１６を自動的に生成することによって、図２のデータ２０３でポピュレートされることができる。 Referring now to FIG. 8, there is shown an exemplary control flow 800 for generating an extract, transform, and load function according to an embodiment of the present invention. The control flow 800 illustrates an exemplary method for automatically generating the ETL tool 216 of FIG. 2 by referring to the schema definition language 126 of FIG. The schema definition language 126 can be referenced with respect to the metadata 302 included in the metadata table 300 of FIG. 3, the schema definition model 400 of FIG. 4, the schema definition model table 500 of FIG. 5, or a combination thereof. As described below, the physical table 218 of FIG. 2 can be populated with the data 203 of FIG. 2 by automatically generating the ETL tool 216 in accordance with the metadata 302.

一般的に、図２の外部ソース２０４からのデータ２０３は、図２のオペレーショナルデータストア２０６に最終的に格納される前に、非準拠ステージングテーブル８０２と準拠ステージングテーブル８０４とを通過して流れる。図２の管理エンジン２０８は、スキーマ定義言語１２６から非準拠ステージングテーブル８０２と準拠ステージングテーブル８０４とを自動的に生成することができる。 In general, the data 203 from the external source 204 of FIG. 2 flows through the non-compliant staging table 802 and the compliant staging table 804 before being finally stored in the operational data store 206 of FIG. The management engine 208 of FIG. 2 can automatically generate a non-compliant staging table 802 and a compliant staging table 804 from the schema definition language 126.

クリーンでないソースデータであるデータ２０３は、準拠ステージングテーブル８０４へ移動する前に非準拠ステージングテーブル８０２においてまず解析されスクラブされなければならない。アダプタとＡＰＩは、データ２０３を解析し、シンタックスの検証が失敗した記録を追い出すために備えられることができる。クリーンなデータ２０３は、準拠ステージングテーブル８０４へ直接流れることができる。テーブルとデータベースレベルの検証は、準拠ステージングテーブル８０４において実施されることができる。 Data 203, which is unclean source data, must first be analyzed and scrubbed in non-compliant staging table 802 before moving to compliant staging table 804. An adapter and API can be provided to parse the data 203 and drive out records where syntax validation failed. Clean data 203 can flow directly to the compliant staging table 804. Table and database level verification may be performed in the compliant staging table 804.

ＥＴＬツール２１６のルーティンは、これがデータを非準拠ステージングテーブル８０２および準拠ステージングテーブル８０４からオペレーショナルデータストア２０６に移動させるのであるが、自動的に生成されることができる。このプロセスは、ステップ８０６〜８２４に関し、以下に詳細に説明される。 The routine of ETL tool 216, which moves data from non-compliant staging table 802 and compliant staging table 804 to operational data store 206, can be generated automatically. This process is described in detail below with respect to steps 806-824.

最初に、リトリーブステップ８０６が、非準拠ソースデータとしてのデータ２０３をリトリーブすることができる。これはフラットファイル形式でありえるが、データは利用可能な形式であればよいと考えられる。フラットファイルとして、データ２０３は、データ２０３のテキスト列タイプのものでありえる。ソースデータは、図４のエンティティ４０４とモジュール４０６とに対応するデータ２０３を含むことができる。一例として、非準拠ステージングテーブル８０２におけるソースデータは、患者情報フラットファイルを含むことができ、これにより図２のエンティティ４０４とエンティティタイプ２２０のための非経年的なデータを保持する特別なモジュールタイプに対応することができる。 Initially, a retrieve step 806 can retrieve the data 203 as non-compliant source data. This can be a flat file format, but the data may be in any usable format. As a flat file, the data 203 can be a text string type of the data 203. The source data can include data 203 corresponding to entity 404 and module 406 of FIG. As an example, the source data in the non-compliant staging table 802 can include a patient information flat file, which allows a special module type to hold non-aged data for the entity 404 and entity type 220 of FIG. Can respond.

非準拠ステージングテーブル８０２は、図２のモジュール４０６とモジュールタイプ２２４とに対応するモジュールテーブルをさらに含むことができる。一例として、非準拠ステージングテーブル８０２は、診断モジュールタイプ２２４とがん診断モジュール４０６とに対応するがん診断のフラットファイルを含むことができる。 The non-compliant staging table 802 may further include a module table corresponding to the module 406 and the module type 224 of FIG. As an example, the non-compliant staging table 802 can include flat files for cancer diagnosis corresponding to the diagnostic module type 224 and the cancer diagnosis module 406.

モジュール４０６に対応する非準拠ステージングテーブル８０２におけるデータ２０３は、関連付けられたエンティティ４０４を指し示す外来キーを含むことができる。一方、エンティティ４０４に対応する非準拠ステージングテーブル８０２におけるデータ２０３は、エンティティ４０４の非経年的なデータを保持するためだけに特別に創作されたモジュール４０６と同様に、エンティティ４０４の主要キーとしての主要キー（患者ＩＤなど）を含むことができる。 Data 203 in non-compliant staging table 802 corresponding to module 406 may include a foreign key that points to the associated entity 404. On the other hand, the data 203 in the non-compliant staging table 802 corresponding to the entity 404 is the key as the main key of the entity 404, similar to the module 406 created specifically to hold the non-aged data of the entity 404. A key (such as a patient ID) can be included.

リトリーブステップ８０６は、プロセスステップ８０８へと続く。プロセスステップ８０８は、スキーマ定義言語１２６のメタデータ３０２を収集し、かつ／または整理し、選択タイプ準拠ステージテーブル８１０および選択準拠ステージテーブル８１２とする。スキーマ定義言語１２６から利用可能な、各観察された選択は、選択タイプ準拠ステージテーブル８１０に置かれることができ、かつ、主要キー、性別のような選択タイプＩＤ、民族性、ＩＣＤ９などが付与されることができる。 Retrieve step 806 continues to process step 808. Process step 808 collects and / or organizes the metadata 302 of the schema definition language 126 into a selection type compliant stage table 810 and a selection compliant stage table 812. Each observed selection available from the schema definition language 126 can be placed in a selection type compliant stage table 810 and given a primary key, a selection type ID such as gender, ethnicity, ICD9, etc. Can.

選択準拠ステージテーブル８１２は、選択タイプ準拠ステージテーブル８１０の外来キーおよび利用可能な各選択の主要キーを含むことができる。選択は、男性、女性、コーカシアン、ヒスパニック、アジア系、１７３．０−悪性・・・、１７４．４−パジェットまたは１７４．６−悪性新生物などを含みうる。最終更新日も、選択タイプ準拠ステージテーブル８１０および選択準拠ステージテーブル８１２に含められることができる。インクリメンタルロードを実施する際、オペレーショナルデータストア２０６から利用可能な選択が可能である。 The selection compliant stage table 812 can include the foreign keys of the selection type compliant stage table 810 and a primary key for each available selection. The selection may include male, female, Caucasian, Hispanic, Asian, 173.0-malignant ... 174.4-paget or 174.6-malignant neoplasm. The last update date can also be included in the selection type compliant stage table 810 and the selection compliant stage table 812. When performing an incremental load, available selections from the operational data store 206 are possible.

プロセスステップ８０８に続くのは、準拠ロードステップ８１４である。準拠ロードステップ８１４は、準拠ステージングテーブル８０４を含むことができる。メタデータテーブル３００、スキーマ定義モデルテーブル５００、またはスキーマ定義モデル４００内に格納された図４のモジュール４０６と特性の観察４０８のためのスキーマ定義言語１２６のメタデータ３０２内に含まれる定義は、データがどのようにして準拠ステージングテーブル８０４へロードされるべきであるかを特定するために用いられることできる。例えば、メタデータ３０２は、図４のデータタイプ４１０が日付であることを要求してもよい。その場合、非準拠ステージングテーブル８０２から抽出されたデータ２０３が、それが準拠ステージングテーブル８０４へロードされる際に日付形式であるべきである。 Following the process step 808 is a compliant load step 814. The compliance load step 814 can include a compliance staging table 804. The definitions contained in the metadata 302 of the schema definition language 126 for the module 406 and the property observation 408 of FIG. 4 stored in the metadata table 300, schema definition model table 500, or schema definition model 400 are data Can be used to specify how should be loaded into the compliant staging table 804. For example, the metadata 302 may require that the data type 410 of FIG. 4 is a date. In that case, the data 203 extracted from the non-compliant staging table 802 should be in date format when it is loaded into the compliant staging table 804.

非準拠ステージングテーブル８０２内に含まれるエンティティ４０４は、準拠ステージングテーブル８０４へロードされることができる。準拠ステージングテーブル８０４の各列は、エンティティ４０４の単一の例を含むことができる。各エンティティ４０４は、準拠ステージングテーブルの主要キーを付与される。エンティティ４０４は、エンティティのサブエンティティであるが、ロードされることとなり、親エンティティを指すポインタを与えられる。例えば、患者は、そこから抽出されたサンプルを有していた場合がある。サンプルは、サブエンティティと考えられ、遡って患者を示すポインタを有する。ポインタは、患者の準拠ステージングテーブルの主要キーでありえる。非経年的なデータ２０３ではなくエンティティ４０４それ自身は、まず準拠ステージングテーブル８０４へロードされる。 Entities 404 included in the non-compliant staging table 802 can be loaded into the compliant staging table 804. Each column of the compliant staging table 804 can include a single instance of the entity 404. Each entity 404 is given a primary key of the compliant staging table. Entity 404 is a sub-entity of the entity but will be loaded and given a pointer to the parent entity. For example, the patient may have a sample extracted therefrom. The sample is considered a sub-entity and has a pointer back to the patient. The pointer can be the primary key of the patient's compliance staging table. The entity 404 itself, not the aged data 203, is first loaded into the compliant staging table 804.

エンティティ４０４が準拠ステージングテーブル８０４に一度ロードされたときは、モジュール４０６と関連付けられたデータ２０３は、準拠ステージングテーブル８０４にロードされることができる。準拠ロードステップ８１４において、非準拠ステージングテーブル８０２は開放されており解析される。 Once the entity 404 is loaded into the compliant staging table 804, the data 203 associated with the module 406 can be loaded into the compliant staging table 804. In compliant load step 814, the non-compliant staging table 802 is opened and analyzed.

非準拠ステージングテーブル８０２の各行が処理され、準拠ロードステップ８１４は、データ２０３の値を対象データタイプ４１０に変換し、非ゼロの特性の観察４０８が確実に特性値を持つようにし、この特性の観察４０８についての有効な制限を証明する。特性の観察４０８についての制限が証明されたときは、日付および数字の特性の観察４０８の値の範囲についての制限は証明され、テキストの特性の観察４０８の有効な文字や有効な長さが証明され、かつ、エンティティ４０４への有効な参照が証明される。他のモジュール４０６への参照の有効性の証明は、第２パスの間におこなわれる。 Each row of the non-compliant staging table 802 is processed, and the compliant load step 814 converts the value of the data 203 to the target data type 410 to ensure that the non-zero characteristic observation 408 has a characteristic value. Demonstrate valid limitations for observation 408. When the limitations on the property observation 408 are proven, the limits on the range of values for the date and number property observation 408 are proven, and the valid characters and the effective length of the text property observation 408 are proven. And a valid reference to entity 404 is proved. Proof of validity of references to other modules 406 is made during the second pass.

準拠ロードステップ８１４の間のエラーは、エラーテーブル８１６に置かれることとなるモジュール４０６のうちの一つと関連付けられたレコードを生じさせる。処理が成功した各行には、独自準拠ステージングテーブルの主要キーが付与されることとなる。準拠ステージングテーブル８０４は、主要キーおよび外来キーを示す第１行と、列のデータタイプ４１０を示す第２行と、準拠ステージングテーブル８０４の列名称を示す第３行とを含むことができる。準拠ステージングテーブル８０４は、他のモジュール４０６を参照している特性の観察４０８を除いて、オペレーショナルデータストア２０６を反映する。 An error during the compliance load step 814 results in a record associated with one of the modules 406 that will be placed in the error table 816. The main key of the unique compliant staging table is assigned to each row that has been successfully processed. The compliant staging table 804 may include a first row that indicates primary and foreign keys, a second row that indicates the column data type 410, and a third row that indicates the column name of the compliant staging table 804. The compliant staging table 804 reflects the operational data store 206 with the exception of characteristic observations 408 that reference other modules 406.

上述のフラットファイルの例を続けると、フラットファイルのテキストの値は適切なデータタイプ４１０に変換されることができる。そして、エンティティ４０４への参照が、準拠ステージングテーブルの主要キーに置き換えられる。 Continuing with the flat file example described above, the text value of the flat file can be converted to the appropriate data type 410. The reference to entity 404 is then replaced with the primary key of the compliant staging table.

あるレコードが、準拠ロードステップ８１４の有効性チェックに失敗したときは、このレコードは、エラーテーブル８１６に置かれることができる。例えば、エンティティ４０４の非経年的なデータ２０３を含むレコードが不適切な誕生日を含むときは、このレコードはエラーテーブル８１６に置かれることができる。さらに、データがエラーテーブル８１６におけるエンティティ４０４を参照するモジュールに対応するときは、このモジュール４０６のレコードはエラーテーブル８１６内にも置かれる。 If a record fails the validity check of the compliance load step 814, this record can be placed in the error table 816. For example, if a record that includes aged data 203 for entity 404 includes an inappropriate birthday, this record can be placed in error table 816. Further, when the data corresponds to a module that references entity 404 in error table 816, the record for this module 406 is also placed in error table 816.

モジュール４０６が準拠ロードステップ８１４における準拠ステージングテーブル８０４に一度ロードされたときは、モジュール４０６間のレファレンスはアップデートステップ８１８においてアップデートされることができる。アップデートステップ８１８は、準拠ロードステップ８１４から続くが、すべてのモジュール４０６が準拠ステージングテーブル８０４にロードされた後にのみ、レファレンスを含むことによって、モジュール４０６の適切な従属性を確実にする。 Once the module 406 has been loaded into the compliance staging table 804 in the compliance load step 814, the references between the modules 406 can be updated in the update step 818. Update step 818 continues from compliance loading step 814, but ensures proper dependency of module 406 by including references only after all modules 406 have been loaded into compliance staging table 804.

アップデートステップ８１８は、ステージングテーブル８０４内で他のモジュール４０６を参照する各特性の観察４０８毎に２つの予備列を創作することができる。この２つの予備列は、参照されたモジュールの外来キー同様に参照されたモジュールの名称を含むことができる。一例として、処置は診断を参照し、特性の観察４０８は、診断および診断外来キーを取り扱う追加的な列を含むこととなる。 The update step 818 can create two preliminary columns for each observation 408 of each property that references other modules 406 in the staging table 804. These two reserved columns can contain the name of the referenced module as well as the foreign key of the referenced module. As an example, the procedure refers to a diagnosis, and the characteristic observation 408 will include an additional column that deals with the diagnosis and diagnostic foreign keys.

参照されたモジュールの外来キーを含む列は、参照されたモジュール４０６の準拠ステージングテーブルの主要キーを含むことができる。この時点では、参照先が無効または欠落している特性の観察４０８の場合、エラーテーブル８１６におけるモジュール４０６は削除され、エラーテーブル８１６内に置かれる。アップデートステップ８１８はアダプタを含むことができ、プロセスのこの時点で、インストールによってそれら自身のカスタム検証をおこなうことができる。 The column containing the foreign key of the referenced module can contain the primary key of the compliant staging table of the referenced module 406. At this point, in the case of the observation 408 where the reference destination is invalid or missing, the module 406 in the error table 816 is deleted and placed in the error table 816. Update step 818 can include adapters, and at this point in the process, installation can perform their own custom verification.

アップデートステップ８１８に続くのは、オペレーショナルデータストアロードステップ８２０である。アップデートステップ８１８の後、準拠ステージングテーブル８０４は検証されたデータを含むことができる。オペレーショナルデータストアロードステップ８２０は、準拠ステージングテーブル８０４ごとに、ＳＱＬサーバーマージステートメントを生成する。オペレーショナルデータストアロードステップ８２０は、選択タイプ準拠ステージテーブル８１０のためのＳＱＬサーバーマージステートメントを生成することができ、それから選択準拠ステージテーブル８１２、それからエンティティ４０４を含む準拠ステージングテーブル８０４、それからエンティティ４０４の非経年的なデータ２０３を含む準拠ステージングテーブル８０４、そして最後に他のモジュール４０６の準拠ステージングテーブル８０４のためのＳＱＬサーバーマージステートメントを生成することができる。 Following the update step 818 is an operational data store load step 820. After the update step 818, the compliant staging table 804 can include verified data. The operational data store load step 820 generates a SQL server merge statement for each compliant staging table 804. The operational data store load step 820 can generate an SQL server merge statement for the selection type compliant stage table 810, then the selection compliant stage table 812, then the compliant staging table 804 including the entity 404, and then the entity 404 non- SQL server merge statements for the compliant staging table 804 containing the aged data 203 and finally the compliant staging table 804 of the other module 406 can be generated.

準拠ステージングテーブル８０４内のデータ２０３は、すべて検証されているので、環境の問題のみがマージの失敗を引き起こしうる。環境の問題は、十分でないディスク容量などの問題を含みうる。マージが失敗したときは、オペレーショナルデータストアロードステップ８２０はエラーを記録して中止される。 Since all data 203 in the compliant staging table 804 has been verified, only environmental issues can cause merge failures. Environmental issues can include issues such as insufficient disk space. If the merge fails, the operational data store load step 820 is aborted with an error recorded.

オペレーショナルデータストアロードステップ８２０に続くのは、デリートステップ８２２である。デリートステップ８２２は、オペレーショナルデータストア２０６へのインクリメンタルロードの間に適用されるインクリメンタルロードの間、デリートフラグは準拠ステージングテーブル８０４およびＯＤＳにロードされる。デリートステップ８２２は、削除される記録がオペレーショナルデータストア２０６に存在しないことを検証する。削除されるレコードがオペレーショナルデータストア２０６に存在しないときは、このレコードはエラーテーブル８１６へ送られる。ロードが一度完了すれば、デリートステップ８２２は、参照の整合性を確実にするためにカスケードデリートフラグを有するレコードを削除する。 Following the operational data store load step 820 is a delete step 822. The delete step 822 loads the delete flag into the compliant staging table 804 and ODS during incremental load applied during incremental load to the operational data store 206. The delete step 822 verifies that the record to be deleted does not exist in the operational data store 206. If the record to be deleted does not exist in the operational data store 206, this record is sent to the error table 816. Once the load is complete, delete step 822 deletes records that have a cascade delete flag to ensure referential integrity.

デリートステップ８２２に続くのは、トランケートステップ８２４である。トランケートステップ８２４は、ロードが成功したときは、後続のインクリメンタルロードに備えるために、すべての準拠ステージングテーブル８０４にトランケートをおこなうことができる。データ２０３がオペレーショナルデータストア２０６に一度ロードされたときは、ユーザは、図９〜１５に関し以下詳細に説明される図１のビジネスインテリジェンスツール１３４を用いて、データ２０３を完全に活用しそこへアクセスすることができる。 Following the delete step 822 is a truncate step 824. Truncate step 824 may truncate all compliant staging tables 804 to prepare for subsequent incremental loads when the load is successful. Once the data 203 has been loaded into the operational data store 206, the user can fully utilize and access the data 203 using the business intelligence tool 134 of FIG. 1 described in detail below with respect to FIGS. can do.

ここで図９を参照すると、そこには、本発明の実施の形態に係る、コホートをフィルタリングするための例示的な制御フロー９００が示されている。例示的な制御フロー９００は、図１のフィルタ１２８がどのようにして図１のスキーマ定義言語１２６に対して動作するのかについての例示的な説明図でありえる。スキーマ定義言語１２６のメタデータ３０２は、図３のメタデータテーブル３００、図５のスキーマ定義モデルテーブル５００、図４のスキーマ定義モデル４００、またはそれらの組み合わせから参照されることができる。 Referring now to FIG. 9, there is shown an exemplary control flow 900 for filtering a cohort according to an embodiment of the present invention. The example control flow 900 can be an illustrative diagram of how the filter 128 of FIG. 1 operates on the schema definition language 126 of FIG. The metadata 302 of the schema definition language 126 can be referenced from the metadata table 300 of FIG. 3, the schema definition model table 500 of FIG. 5, the schema definition model 400 of FIG. 4, or a combination thereof.

一例として、図２のオペレーショナルデータストア２０６がポピュレートされ次第、ＯＤＳポピュレートステップ７１２において、図１のビジネスインテリジェンスツール１３４は、それらがスキーマ定義言語１２６に対して動作することから、完全に機能的である。オペレーショナルデータストア２０６がポピュレートされたとき、トータルコホート９０２が利用できるようになる。トータルコホート９０２は、オペレーショナルデータストア２０６内に表される図４のエンティティ４０４のすべてを含むことができる。 As an example, as soon as the operational data store 206 of FIG. 2 is populated, in the ODS Population step 712, the business intelligence tools 134 of FIG. 1 are fully functional as they operate against the schema definition language 126. . When the operational data store 206 is populated, the total cohort 902 becomes available. The total cohort 902 can include all of the entities 404 of FIG. 4 represented in the operational data store 206.

トータルコホート９０２は、オペレーショナルデータストア２０６をフィルタリングするための開始点でありえる。トータルコホート９０２のエンティティ４０４は、スキーマ定義言語１２６内の図４の特性の観察４０８のいずれかを用いてフィルタリングされることができる。スキーマ定義言語１２６内の特性の観察４０８は、特性の観察４０８と関連付けられた図３のメタデータ３０２によって特定されることができる。 The total cohort 902 can be a starting point for filtering the operational data store 206. The entities 404 of the total cohort 902 can be filtered using any of the property observations 408 of FIG. The property observation 408 in the schema definition language 126 can be identified by the metadata 302 of FIG. 3 associated with the property observation 408.

特性選択ステップ９０４において、これはＯＤＳポピュレートステップ７１２から続くが、ユーザは選択された特性９０５を選択することができる。選択された特性９０５は、特性の観察４０８のうちの一つでありえる。上記活用されたヘルスケアの例の延長として、ユーザは、乳がんについてＩＣＤ９診断がされたエンティティ４０４の患者クラスの特定を望むことがある。 In a property selection step 904, this continues from the ODS population step 712, but the user can select the selected property 905. The selected characteristic 905 can be one of the characteristic observations 408. As an extension of the utilized health care example, the user may want to identify the patient class of entity 404 with an ICD9 diagnosis for breast cancer.

ビジネスインテリジェンスツール１３４は、有効ＩＣＤ９選択などのすべての特性の観察４０８を、スキーマ定義言語１２６からリターンする。ユーザは、選択された特性９０５として、ＩＣＤ９特性を選択し、かつ乳がんに対応するＩＣＤ９コードのみのために選択された特性９０５をフィルタリングするなどして、特性の観察４０８の一つを選択し、選択された特性９０５をさらにフィルタリングすることができる。 The business intelligence tool 134 returns from the schema definition language 126 an observation 408 of all properties, such as a valid ICD 9 selection. The user selects one of the characteristic observations 408, such as selecting an ICD9 characteristic as the selected characteristic 905 and filtering the selected characteristic 905 for only the ICD9 code corresponding to breast cancer, The selected characteristic 905 can be further filtered.

トータルコホート９０２をフィルタリングするための選択された特性９０５をユーザが選択した後に、ビジネスインテリジェンスツール１３４は、図４のモジュール４０６のためのスキーマ定義言語１２６と、メタデータクエリステップ９０６において選択された特性９０５と関連付けられたエンティティ４０４とについてクエリを実行することができる。スキーマ定義言語１２６のクエリは、モジュール４０６と、エンティティ４０４と、選択された特性９０５に関連する他の特性の観察４０８についてメタデータ３０２をリターンすることができる。さらに、選択された特性９０５に関連付けられたモジュール４０６とエンティティ４０４とは、図２の物理データテーブル２１８内で特定されることができる。ビジネスインテリジェンスツール１３４は、選択された特性９０５に関連付けられた図２のエンティティタイプ２２０とモジュールタイプ２２４をさらにリターンすることができる。 After the user selects the selected characteristic 905 for filtering the total cohort 902, the business intelligence tool 134 selects the schema definition language 126 for the module 406 of FIG. 4 and the characteristic selected in the metadata query step 906. A query can be performed on the entity 404 associated with 905. The query in the schema definition language 126 can return metadata 302 for the module 406, the entity 404, and other property observations 408 associated with the selected property 905. Further, the module 406 and entity 404 associated with the selected characteristic 905 can be identified in the physical data table 218 of FIG. The business intelligence tool 134 can further return the entity type 220 and module type 224 of FIG. 2 associated with the selected characteristic 905.

スキーマ定義言語１２６に対しクエリが実行された後に、メタデータクエリステップ９０６から続く特性関連付けステップ９０８は、選択された特性９０５に関連付けられたスキーマ定義言語１２６からモジュール４０６とエンティティ４０４とをリターンすることができる。選択された特性９０５がエンティティ４０４にのみ関連付けられているときは、いずれのモジュール４０６も選択された特性９０５には関連付けられず、エンティティ４０４のみがリターンされる。 After the query is performed against the schema definition language 126, a property association step 908 that follows the metadata query step 906 returns the module 406 and the entity 404 from the schema definition language 126 associated with the selected property 905. Can do. When the selected characteristic 905 is only associated with the entity 404, no module 406 is associated with the selected characteristic 905 and only the entity 404 is returned.

選択された特性９０５に関連付けられたエンティティ４０４とモジュール４０６のリターンで、特性関連付けステップ９０８から続く計算ステップ９１０は、現在のコホート９１２とすべてのワンホップリンケージ９１４とを計算することができる。計算ステップ９１０が現在のコホート９１２を計算したときは、スキーマ定義言語１２６は、選択された特性９０５を有する患者に対応するオペレーショナルデータストア２０６の物理テーブル２１８の位置を特定するために用いられることができる。 With the return of the entity 404 and module 406 associated with the selected property 905, the calculation step 910 following the property association step 908 can calculate the current cohort 912 and all one-hop linkages 914. When the calculation step 910 calculates the current cohort 912, the schema definition language 126 may be used to locate the physical table 218 of the operational data store 206 corresponding to a patient having the selected characteristic 905. it can.

選択された特性９０５に関連付けられたエンティティ４０４の図２のデータ２０３を抽出するために、スキーマ定義言語１２６のメタデータ３０２に基づいて、クエリが生成される。計算ステップ９１０は、物理テーブル２１８にクエリを実行し、リンク６０８と、特性の観察４０８、モジュール４０６、およびエンティティ４０４の図６の物理テーブル６０２の位置を特定することにより、選択された特性９０５に関連付けられた現在のコホート９１２をリターンするためのＳＱＬ命令９１５を自動的に生成することができる。スキーマ定義言語１２６は、データ２０３が物理的に位置する物理データテーブル２１８ごとに複数列を検出するために用いられることができるメタデータテーブル３００として参照されることができる。 A query is generated based on the metadata 302 of the schema definition language 126 to extract the data 203 of FIG. 2 of the entity 404 associated with the selected property 905. The calculation step 910 queries the physical table 218 and locates the selected property 905 by locating the link 608 and the property observation 408, module 406, and the physical table 602 of FIG. An SQL instruction 915 can be automatically generated to return the associated current cohort 912. The schema definition language 126 can be referred to as a metadata table 300 that can be used to detect multiple columns for each physical data table 218 where the data 203 is physically located.

ビジネスインテリジェンスツール１３４は、選択された特性９０５に関連付けられたエンティティ４０４の数として、現在のコホート９１２をリターンすることができる。計算ステップ９１０は、ワンホップリンケージ９１４を計算し、ユーザがさらにフィルタ１２８をリファインするためのオプションとしてそれらを提示することもできる。 The business intelligence tool 134 can return the current cohort 912 as the number of entities 404 associated with the selected characteristic 905. The calculation step 910 may calculate one-hop linkages 914 and present them as options for the user to further refine the filter 128.

ワンホップリンケージ９１４は、エンティティ４０４に対応する他の選択されていない特性の観察４０８と選択された特性９０５が対応するモジュール４０６とでありえる。つまり、ワンホップリンケージ９１４は、選択された特性の観察４０８が関連付けられた同じモジュール４０６とエンティティ４０４から選択されていない特性である。選択されていない特性の計算に応じて、計算ステップ９１０は、ワンホップリンケージ９１４としてモジュール４０６とエンティティ４０４をもリターンすることができる。選択された特性９０５と関連付けられたモジュール４０６とエンティティ４０４は、関連付けられていないモジュール４０６またはエンティティ４０４によって指し示され、これら関連付けられていないモジュール４０６またはエンティティ４０４は、ワンホップリンケージ９１４としてリターンされることができる。つまり、いずれかの特性の観察４０８の図５の対象エンティティタイプ５１２または図５の対象モジュールタイプ５１４が、選択された特性９０５と関連付けられたモジュール４０６またはエンティティ４０４を指すデータポインタを含むときは、そのデータポインタを有する特性の観察４０８に関連付けられたモジュール４０６およびエンティティ４０４は、ワンホップリンケージ９１４としてリターンされることとなる。 The one-hop linkage 914 can be an observation 408 of other unselected properties corresponding to the entity 404 and a module 406 to which the selected property 905 corresponds. That is, the one-hop linkage 914 is a property that has not been selected from the same module 406 and entity 404 with which the selected property observation 408 is associated. Depending on the calculation of the unselected property, the calculation step 910 can also return the module 406 and the entity 404 as a one-hop linkage 914. The module 406 and entity 404 associated with the selected property 905 are pointed to by the unassociated module 406 or entity 404, and these unassociated module 406 or entity 404 are returned as a one-hop linkage 914. be able to. That is, when the subject entity type 512 of FIG. 5 or the subject module type 514 of FIG. 5 of any property observation 408 includes a data pointer that points to the module 406 or entity 404 associated with the selected property 905, The module 406 and entity 404 associated with the trait observation 408 having that data pointer will be returned as a one-hop linkage 914.

乳がんに基づいてフィルタリングする例を活用して、ワンホップリンケージ９１４の計算を説明すると、計算ステップ９１０は、エンティティ４０４に対応する年齢、性別、体重、その他の非経年的な特性の観察４０８と関連付けられたエンティティ４０４のワンホップリンケージ９１４を計算することができる。計算ステップ９１０はさらに、モジュール４０６に対応する診断日、診断時年齢、原発巣部位、その他の非経年的な特性の観察４０８などの選択された特性９０５に関連付けられたモジュール４０６のワンホップリンケージ９１４も計算することができる。 Taking the example of filtering based on breast cancer and describing the calculation of the one-hop linkage 914, the calculation step 910 associates with the observation 408 of age, gender, weight, and other aged characteristics corresponding to the entity 404. The one-hop linkage 914 for the given entity 404 can be calculated. The calculation step 910 further includes the one-hop linkage 914 of the module 406 associated with the selected characteristic 905, such as the observation date 408 corresponding to the module 406, the date of diagnosis, age at diagnosis, primary site, and other aged characteristics. Can also be calculated.

計算ステップ９１０は、選択された特性９０５と関連付けられていないその他のモジュール４０６またはエンティティ４０４のワンホップリンケージ９１４もリターンすることができる。例えば、特性の観察４０８が乳がんについてのＩＣＤ９が含まれたユーザによって選択されたときは、ＩＣＤ９特性は診断モジュールに対応し、計算ステップ９１０は、処置でフィルタリングするオプションをユーザにリターンすることができる。なぜなら、処置モジュールは、診断モジュールに対し対象モジュールタイプ５１４におけるポインタを有するからである。 The calculation step 910 can also return the one-hop linkage 914 of other modules 406 or entities 404 that are not associated with the selected property 905. For example, if characteristic observation 408 is selected by a user that includes ICD9 for breast cancer, the ICD9 characteristic corresponds to a diagnostic module, and calculation step 910 can return the user with the option to filter by treatment. . This is because the treatment module has a pointer in the target module type 514 to the diagnostic module.

計算ステップ９１０に続くのは、他の特性選択オプション９１６である。ユーザがより多くの特性の観察４０８を選択しないと決定したときは、他の特性選択オプション９１６は、終了ステップ９１８においてフィルタ１２８の動作を終了する。終了ステップ９１８が呼び起こされたときは、現在のコホート９１２は、ビジネスインテリジェンスツール１３４と共にさらに利用されるために、引き続き活用されセーブされることができる。 Following the calculation step 910 is another property selection option 916. If the user decides not to select more property observations 408, another property selection option 916 terminates the operation of filter 128 in an end step 918. When the end step 918 is invoked, the current cohort 912 can continue to be utilized and saved for further use with the business intelligence tool 134.

ユーザが他の特性選択オプション９１６において他の特性を選択することを決定したときは、後続の選択された特性９１９は、他の特性選択オプション９１６に続く他の特性選択ステップ９２０において選択されることができる。ユーザは、選択された特性ステップ９０４における選択された特性９０５をユーザが選択した方法と同じ方法で、後続の選択された特性９１９を選択することができる。 When the user decides to select another characteristic in other characteristic selection option 916, the subsequent selected characteristic 919 is selected in other characteristic selection step 920 following other characteristic selection option 916. Can do. The user can select subsequent selected characteristics 919 in the same manner that the user selected the selected characteristics 905 in the selected characteristics step 904.

特性選択ステップ９０４とは対照的な他の特性選択ステップ９２０におけるユーザの体験の差異は、他の特性選択ステップ９２０においてユーザは、選択オプションとしてワンホップリンケージ９１４を与えられており、トータルコホート９０２ではなく現在のコホート９１２が表示されることである。 The difference in user experience in other property selection step 920 as opposed to property selection step 904 is that in other property selection step 920 the user is given a one-hop linkage 914 as a selection option, and in total cohort 902 The current cohort 912 is displayed.

ユーザが後続の選択された特性９１９を一度選択したときは、後続の選択された特性９１９は、他の特性選択ステップ９２０に続く特性リンクステップ９２２において選択された特性９０５にリンクされることができる。特性リンクステップ９２２は、フィルタ１２８をリファインし、現在のコホート９１２を制限するために、選択された特性９０５を後続の選択された特性９１９にリンクすることができる。後続の選択された特性９１９は、フィルタ１２８の現在のエンティティ４０４またはモジュール４０６を制限することになると仮定されるが、ユーザは、この論理ＡＮＤ演算を優先することがあり、後続の特性の観察４０８の論理ＯＲ関数を活用することがあると考えられる。 Once the user has selected the subsequent selected characteristic 919, the subsequent selected characteristic 919 can be linked to the characteristic 905 selected in the characteristic link step 922 following the other characteristic selection step 920. . A characteristic linking step 922 can link the selected characteristic 905 to a subsequent selected characteristic 919 to refine the filter 128 and limit the current cohort 912. Although the subsequent selected property 919 is assumed to limit the current entity 404 or module 406 of the filter 128, the user may prefer this logical AND operation and observe the subsequent property 408. It is considered that the logical OR function of

説明的な一例として、ユーザは、年齢３５〜４０歳または男性のような選択された特性９０５に関連付けられたエンティティ４０４に対応するワンホップリンケージ９１４のうちの一つを選択することがある。ＩＣＤ９乳がんの例を続けると、特性リンクステップ９２２は、第１の選択されたＩＣＤ９特性（選択された特性９０５）を続いて選択された年齢または性別の特性の観察４０８（後続の選択された特性９１９）とリンクするだろう。特性リンクステップ９２２は、選択された特性９０５と後続の選択された特性９１９の両方を組み合わせて、年齢が３５〜４０歳で男性であった患者の乳がんのみについてフィルタ１２８をさらにリファインするだろう。 As an illustrative example, the user may select one of the one-hop linkages 914 corresponding to the entity 404 associated with the selected characteristic 905, such as age 35-40 or male. Continuing with the ICD9 breast cancer example, the trait link step 922 is followed by a first selected ICD9 trait (selected trait 905) followed by an observation of the selected age or gender trait 408 (subsequent selected trait). 919). The characteristic linking step 922 will combine both the selected characteristic 905 and the subsequent selected characteristic 919 to further refine the filter 128 only for breast cancer in patients who were male at 35-40 years of age.

説明的な第２の例として、ユーザは、診断日２００７のような選択された特性９０５に関連付けられたモジュール４０６に対応するワンホップリンケージ９１４のうちの一つを選択することがある。ＩＣＤ９乳がんの例を続けると、特性リンクステップ９２２は、第１の選択されたＩＣＤ９特性（選択された特性９０５）を続いて選択された日付の特性（後続の選択された特性９１９）とリンクするだろう。特性リンクステップ９２２は、選択された特性９０５と後続の選択された特性９１９の両方を組み合わせて、２００７年の乳がん診断のみについてフィルタ１２８をリファインするだろう。 As a second illustrative example, the user may select one of the one-hop linkages 914 corresponding to the module 406 associated with the selected characteristic 905, such as the diagnosis date 2007. Continuing with the ICD9 breast cancer example, characteristic linking step 922 links the first selected ICD9 characteristic (selected characteristic 905) with the selected date characteristic (following selected characteristic 919). right. The characteristic link step 922 will refine both the selected characteristic 905 and the subsequent selected characteristic 919 to refine the filter 128 for the 2007 breast cancer diagnosis only.

説明的な第３の例として、ユーザは、モジュール４０６または処置タイプ：外科手術のような選択された特性９０５に関連付けられた特性の観察４０８へのリンケージ６０８に対応するワンホップリンケージ９１４のうちの一つを選択することがある。ＩＣＤ９乳がんの例を続けると、特性リンクステップ９２２は、選択されたＩＣＤ９特性（選択された特性９０５）を続いて選択されたポインタ特性（後続の選択された特性９１９）とリンクするだろう。特性リンクステップ９２２は、選択された特性９０５と後続の選択された特性９１９の両方を組み合わせて、外科手術による処置がなされた乳がん診断のみについてフィルタ１２８をさらにリファインするだろう。 As a third illustrative example, the user may select one of the one-hop linkages 914 corresponding to the linkage 608 to the observation 408 of the property associated with the selected property 905 such as module 406 or treatment type: surgery. One may be selected. Continuing with the ICD9 breast cancer example, characteristic linking step 922 will link the selected ICD9 characteristic (selected characteristic 905) with the subsequently selected pointer characteristic (subsequent selected characteristic 919). The characteristic link step 922 will further refine the filter 128 only for breast cancer diagnoses that have been surgically treated, combining both the selected characteristic 905 and the subsequent selected characteristic 919.

特性リンクステップ９２２は、メタデータクエリステップ９０６へと続く。ユーザが後続の選択された特性９１９を一度選択し、選択された特性９０５と後続の選択された特性９１９とが特性リンクステップ９２２においてリンクされると、メタデータクエリステップ９０６は、上述の方法と同様の方法で、スキーマ定義言語１２６についてクエリを実行することとなる。 The property link step 922 continues to the metadata query step 906. Once the user selects a subsequent selected characteristic 919 and the selected characteristic 905 and the subsequent selected characteristic 919 are linked in the characteristic linking step 922, the metadata query step 906 includes the method described above. A query is executed for the schema definition language 126 in the same manner.

メタデータクエリステップ９０６でスキーマ定義言語１２６について一度クエリを実行したときは、特性関連付けステップ９０８は、全ての選択された特性９０５が論理ＡＮＤ関数のように用いられて動作するという点を除いて、上述の方法と同様の方法で動作することができる。同様に、計算ステップ９１０で、上述の方法と同様の方法で後続の選択された特性９１９のワンホップリンケージ９１４を算出することができ、上述の方法と同様の方法でオペレーショナルデータストア２０６の物理テーブル２１８についてクエリを実行するためのスキーマ定義言語１２６を活用することにより現在のコホート９１２をリターンすることができる。 Once the query is performed for the schema definition language 126 in the metadata query step 906, the property association step 908 operates except that all selected properties 905 are used like a logical AND function. It can operate in a manner similar to that described above. Similarly, the calculation step 910 can calculate the one-hop linkage 914 of the subsequent selected characteristic 919 in a manner similar to that described above, and the physical table of the operational data store 206 in a manner similar to that described above. The current cohort 912 can be returned by leveraging the schema definition language 126 for executing queries on 218.

ここで図１０を参照すると、そこには、ビジネスインテリジェンスツール１２６において実現されるようなフィルタ１２８のスクリーンショット１０００が示されている。スクリーンショット１０００は、後続の選択された特性９１９にリンクされた選択された特性９０５の例を描いている。後続の選択された特性９１９は、トータルコホート９０２をフィルタリングするための第２の特性の観察４０８として選択された特性９０５にリンクされることができる。このリンクは、論理ＡＮＤ条件を表すことができるが、ユーザはトータルコホート９０２の論理ＯＲフィルタリングを呼び起こすためのリンクアイコンを切り替えることがある。 Referring now to FIG. 10, there is shown a screen shot 1000 of the filter 128 as implemented in the business intelligence tool 126. Screenshot 1000 depicts an example of a selected characteristic 905 that is linked to a subsequent selected characteristic 919. A subsequent selected characteristic 919 can be linked to the selected characteristic 905 as a second characteristic observation 408 for filtering the total cohort 902. This link can represent a logical AND condition, but the user may toggle the link icon to invoke the logical OR filtering of the total cohort 902.

トータルコホート９０２は、フィルタ１２８の開始コホートとして示される。選択された特性９０５は、乳がんについてフィルタリングされたＩＣＤ９値であると示される。選択された特性９０５は、後続の選択された特性９１９にコンテキスト上リンクされる。 Total cohort 902 is shown as the starting cohort of filter 128. Selected characteristic 905 is shown to be an ICD9 value filtered for breast cancer. The selected characteristic 905 is contextually linked to a subsequent selected characteristic 919.

後続の選択された特性９１９は、２０１０年１月１日と２０１３年１月１日の間の日付をリターンするためにフィルタリングされる診断日であると示される。現在のコホート９１２は、現在のコホート９１２内のエンティティ４０４の数と共にエンティティ４０４のグループとして描かれる。 A subsequent selected characteristic 919 is shown to be a diagnostic date that is filtered to return a date between January 1, 2010 and January 1, 2013. The current cohort 912 is depicted as a group of entities 404 with the number of entities 404 in the current cohort 912.

ここで図１１を参照すると、そこには、本発明の実施の形態に係るパターンを活用するコホートのフィルタリングのための例示的な制御フロー１１００が示されている。例示的な制御フロー１１００は、図１のフィルタ１２８がどのようにして図１のスキーマ定義言語１２６に対して動作するのかについての他のあるいはさらなる例示的な説明図でありえる。スキーマ定義言語１２６のメタデータ３０２は、図３のメタデータテーブル３００、図５のスキーマ定義モデルテーブル５００、図４のスキーマ定義モデル４００、またはそれらの組み合わせから参照されることができる。 Referring now to FIG. 11, there is shown an exemplary control flow 1100 for cohort filtering that utilizes patterns according to embodiments of the present invention. The example control flow 1100 may be another or further example illustration of how the filter 128 of FIG. 1 operates on the schema definition language 126 of FIG. The metadata 302 of the schema definition language 126 can be referenced from the metadata table 300 of FIG. 3, the schema definition model table 500 of FIG. 5, the schema definition model 400 of FIG. 4, or a combination thereof.

これは、図４のエンティティ４０４のクラスを特定し、図２のオペレーショナルデータストア２０６をスキャンすることによって、臨床イベントに対応する図４の特性の観察４０８のうちの特定のシリーズと一致するエンティティ４０４を探すさらに別の方法を表す。ユーザは、臨床パターンを定義して図１のビジネスインテリジェンスツール１３４を利用するため、かつその臨床パターンに一致するエンティティ４０４と、その臨床パターンに一致しないエンティティ４０４とを特定するために、選択された特性９０５として特性の観察４０８を選択することができる。 This identifies entities 404 of FIG. 4 and scans the operational data store 206 of FIG. 2 to match entities 404 that match a particular series of characteristic observations 408 of FIG. 4 corresponding to clinical events. Represents yet another way to find. The user is selected to define a clinical pattern to utilize the business intelligence tool 134 of FIG. 1 and to identify entities 404 that match the clinical pattern and entities 404 that do not match the clinical pattern. The characteristic observation 408 can be selected as the characteristic 905.

オペレーショナルデータストア２０６にポピュレートされたとき、図９のトータルコホート９０２が利用できるようになる。トータルコホート９０２は、オペレーショナルデータストア２０６内で表されるエンティティ４０４のすべてを含むことができる。制御フロー１１００は、計算ステップ９１０で計算されたトータルコホート９０２または現在のコホート９１２から動作することができる。この説明的な例においては、現在のコホート９１２を提供する計算ステップ９１０は、パターンに基づいて現在のコホート９１２をフィルタリングするための制御フロー１１００を用いることにより、フィルタ１２８をさらにリファインするために用いられることとなる。 When populated in the operational data store 206, the total cohort 902 of FIG. 9 becomes available. The total cohort 902 can include all of the entities 404 represented in the operational data store 206. The control flow 1100 can operate from the total cohort 902 calculated in the calculation step 910 or the current cohort 912. In this illustrative example, the calculation step 910 providing the current cohort 912 is used to further refine the filter 128 by using a control flow 1100 to filter the current cohort 912 based on the pattern. Will be.

計算ステップ９１０から続くのは、選択された特性９０５として特性の観察４０８が選択されることができる特性選択ステップ１１０２である。ユーザが選択できる特性の観察４０８は、ワンホップリンケージ９１４を含むが、関連のない特性の観察４０８も含むことができる。ユーザが選択された特性９０５を一度選択したときは、ビジネスインテリジェンスツール１３４は特性選択ステップ１１０２から続く特性表示モジュール１１０４において、この特性と現在のコホート９１２とをフィードバックすることとなる。ユーザは、その後特性表示モジュール１１０４から続くオプション選択１１０６において別の特性を選択するオプションを有する。ユーザがより多くの特性の観察４０８を選択したいときは、オプション選択１１０６は、ユーザを特性選択ステップ１１０２へ運ぶこととなり、ユーザは後続の特性を選択することができる。このようにして、ユーザは、臨床イベントに対応する特性の観察４０８を選択し、かつフィルタ１２８を制限するパターンを定義することができる。 Following the calculation step 910 is a property selection step 1102 in which the property observation 408 can be selected as the selected property 905. User-selectable property observations 408 include one-hop linkage 914, but may also include unrelated property observations 408. Once the user has selected the selected characteristic 905, the business intelligence tool 134 will feed back this characteristic and the current cohort 912 in the characteristic display module 1104 following the characteristic selection step 1102. The user then has the option to select another characteristic in option selection 1106 that follows from the characteristic display module 1104. If the user wants to select more property observations 408, option selection 1106 will take the user to property selection step 1102, which allows the user to select a subsequent property. In this way, the user can select a characteristic observation 408 corresponding to a clinical event and define a pattern that restricts the filter 128.

上記活用されたヘルスケア分野の例を続けると、ユーザが、退院後３０日以内に再入院した虫垂切除術を受けた患者を特定したいと望んだと仮定する。ユーザは、虫垂切除術についての特性の観察４０８を選択することができ、ビジネスインテリジェンスツール１３４は、図９の制御フローを活用する虫垂切除術を受けたエンティティ４０４全員を検出することができる。虫垂切除術についての特性の観察４０８に関連付けられたすべてのエンティティ４０４が一度現在のコホート９１２において特定されたときは、ユーザは特性選択ステップ１１０２において始まる新たな臨床パターンを創作することができる。ユーザは、入院についての特性の観察４０８に続く退院についての特性の観察４０８を選択することができる。入院についての特性の観察４０８は、退院後３０日以内に生じた入院を示す特性の観察４０８によって制限されることができる。この臨床パターンの例は、最大で３０日間離れたワンホップリンケージ９１４についての制限を選択することにつながる２つの特性の観察４０８（退院と入院）を選択することに関するだろう。上述のように、ワンホップリンケージ９１４は、選択された特性９０５と関連付けられた図４の同じモジュール４０６またはエンティティ４０４に属する選択されていない選択特性の観察４０８である。 Continuing with the example of the above utilized healthcare field, assume that the user wishes to identify a patient who has undergone an appendectomy who has been readmitted within 30 days of discharge. The user can select an observation 408 of characteristics about the appendectomy, and the business intelligence tool 134 can detect all entities 404 that have undergone an appendectomy utilizing the control flow of FIG. Once all the entities 404 associated with the characteristic observation 408 for appendectomy have been identified in the current cohort 912, the user can create a new clinical pattern that begins in the characteristic selection step 1102. The user can select a trait observation 408 for discharge following a trait observation 408 for hospitalization. Characteristic observation 408 for hospitalization can be limited by characteristic observation 408 indicating hospitalization that occurred within 30 days after discharge. An example of this clinical pattern would involve selecting two trait observations 408 (discharge and hospitalization) that would lead to selecting a limit for one-hop linkage 914 that was separated by up to 30 days. As described above, the one-hop linkage 914 is an observation 408 of unselected selection characteristics belonging to the same module 406 or entity 404 of FIG. 4 associated with the selected characteristics 905.

ユーザは、「虫垂切除術３０日以内再入院」と名づけられたパターンをセーブする。 The user saves the pattern named “re-hospitalization within 30 days of appendectomy”.

ユーザが臨床パターンに対応する特性の観察４０８を選択することによって臨床パターンを定義したときは、ユーザは臨床パターンをセーブすると共にマッチ（一致するもの）計算ステップ１１０８を選択することができる。マッチ計算ステップ１１０８は、図６のリンク６０８と、選択された特性９０５を含んでいる図２の物理テーブル２１８の位置とを特定するために、スキーマ定義言語１２６を参照するメタデータクエリステップ１１１０へと続く。 When the user has defined a clinical pattern by selecting a characteristic observation 408 corresponding to the clinical pattern, the user can save the clinical pattern and select a match calculation step 1108. The match calculation step 1108 proceeds to a metadata query step 1110 that references the schema definition language 126 to identify the link 608 of FIG. 6 and the location of the physical table 218 of FIG. 2 that includes the selected property 905. It continues with.

メタデータクエリステップ１１１０は、ＳＱＬ生成ステップ１１１２へと続く。ＳＱＬ生成ステップ１１１２は、メタデータクエリステップ１１１０におけるスキーマ定義言語１２６からリターンされた物理テーブル２１８の関係と位置とを活用することができ、オペレーショナルデータストア２０６の物理テーブル２１８に対しクエリを実行するためのＳＱＬ命令１１１４を自動的に生成する。 The metadata query step 1110 continues to the SQL generation step 1112. The SQL generation step 1112 can utilize the relationship and position of the physical table 218 returned from the schema definition language 126 in the metadata query step 1110, and execute a query against the physical table 218 of the operational data store 206. The SQL instruction 1114 is automatically generated.

ＳＱＬ実行ステップ１１１６は、ＳＱＬ生成ステップ１１１２へと続くが、オペレーショナルデータストア２０６の物理テーブル２１８に対しクエリを実行するためのＳＱＬ生成ステップ１１１２から自動的に生成される。ＳＱＬ実行ステップ１１１６から続くのは、リターンステップ１１１８である。リターンステップ１１１８は、一致コホート１１２０と不一致コホート１１２２とをリターンすることができる。 The SQL execution step 1116 continues to the SQL generation step 1112, but is automatically generated from the SQL generation step 1112 for executing a query against the physical table 218 of the operational data store 206. Following the SQL execution step 1116 is a return step 1118. A return step 1118 can return the matching cohort 1120 and the non-matching cohort 1122.

一致コホート１１２０は、臨床パターンとして選択された特性９０５に対応するエンティティ４０４でありえる。不一致コホート１１２２は、臨床パターンとして選択された特性９０５に対応しないエンティティ４０４でありえる。 The matching cohort 1120 can be the entity 404 corresponding to the characteristic 905 selected as the clinical pattern. The discrepancy cohort 1122 may be an entity 404 that does not correspond to the characteristic 905 selected as the clinical pattern.

一致コホート１１２０と不一致コホート１１２２は、後の処理のため、またはビジネスインテリジェンスツール１３４によってさらに制限されるためにセーブされる。リターンステップ１１１８が一致コホート１１２０をリターンしたときは、ユーザは、一致コホート１１２０内のエンティティ４０４の詳細について視ることができる。例えば、ユーザは、特性９０５によって定義された臨床パターンに一致するものが発生したときは、そのＩＤを視ることができる。 Matching cohort 1120 and mismatching cohort 1122 are saved for later processing or to be further restricted by business intelligence tool 134. When return step 1118 returns match cohort 1120, the user can view details of entity 404 in match cohort 1120. For example, the user can view the ID when something that matches the clinical pattern defined by characteristic 905 occurs.

ここで図１２を参照すると、そこには、図１のビジネスインテリジェンスツール１２６において実現されるような図１のフィルタ１２８のスクリーンショット１２００が示されている。スクリーンショット１２００は、特性の観察４０８の例を描いている。特性の観察４０８は、選択ボックス１２０２においてユーザが選択するために表示されることができる。 Referring now to FIG. 12, there is shown a screenshot 1200 of the filter 128 of FIG. 1 as implemented in the business intelligence tool 126 of FIG. Screen shot 1200 depicts an example of characteristic observation 408. Characteristic observation 408 can be displayed for selection by the user in selection box 1202.

ユーザが選択する特性の観察４０８は、パターンボックス１２０４において示されることができる。後続の選択された特性９１９に沿って選択された特性９０５は、パターンボックス１２０４において共にリンクされて示されることができる。選択された特性９０５は、例えば、虫垂切除術についてフィルタリングされた処置または手続でありえる。 An observation 408 of a user-selected property can be shown in the pattern box 1204. Selected characteristics 905 along with subsequent selected characteristics 919 can be shown linked together in pattern box 1204. The selected characteristic 905 can be, for example, a filtered procedure or procedure for appendectomy.

第１の後続の選択された特性１２０６は、例えば、退院でありえる。第２の後続の選択された特性１２０８は、例えば、再入院でありえる。パターンフィルタ１２１０は、第１の後続の選択された特性１２０６および第２の後続の選択された特性１２０８のリターンを、特性の観察４０８の各インスタンスから３０日以内に制限するために含まれることができる。言い換えると、パターンフィルタ１２１０は、リターンされる図４のエンティティ４０４を、虫垂切除術を受けて退院し、退院後３０日以内に再入院したエンティティ４０４に制限することができる。 The first subsequent selected characteristic 1206 can be, for example, discharge. The second subsequent selected characteristic 1208 can be, for example, readmission. A pattern filter 1210 may be included to limit the return of the first subsequent selected characteristic 1206 and the second subsequent selected characteristic 1208 within 30 days from each instance of the characteristic observation 408. it can. In other words, the pattern filter 1210 may limit the returned entities 404 of FIG. 4 to those entities 404 that have undergone appendectomy and have been readmitted within 30 days of discharge.

ここで図１３を参照すると、そこには、本発明の実施の形態に係る、リポートを生成するための例示的な制御フロー１３００が示されている。例示的な制御フロー１３００は、図２のリポートデータの定義１３０がどのようにして図１のスキーマ定義言語１２６に対して動作するのかについての例示的な説明でありえる。スキーマ定義言語１２６のメタデータ３０２は、図３のメタデータテーブル３００、図５のスキーマ定義モデルテーブル５００、図４のスキーマ定義モデル４００、またはそれらの組み合わせから参照されることができる。 Referring now to FIG. 13, there is shown an exemplary control flow 1300 for generating a report according to an embodiment of the present invention. The example control flow 1300 may be an example description of how the report data definition 130 of FIG. 2 operates on the schema definition language 126 of FIG. The metadata 302 of the schema definition language 126 can be referenced from the metadata table 300 of FIG. 3, the schema definition model table 500 of FIG. 5, the schema definition model 400 of FIG. 4, or a combination thereof.

図２のオペレーショナルデータストア２０６が一度ポピュレートされたときは、ユーザは、オペレーショナルデータストア２０６内の図４のエンティティ４０４のいずれかについてリポートすることができる。ユーザは、リターンされたリポートデータを定義することができる。図１のビジネスインテリジェンスツール１３４は、スキーマ定義言語１２６のメタデータ３０２を活用し、図４のエンティティ４０４、モジュール４０６、および特性の観察４０８の観点から定義されたリポートデータの定義１３０を構築する。ビジネスインテリジェンスツール１３４は、オペレーショナルデータストア２０６から図２のデータ２０３を抽出するためのリポートデータの定義１３０を実行し、１つ以上のスプレッドシートを通じてその結果を表す。ユーザは、図１５に関して以下詳細に説明されるさまざまなリポートビューワを通じてスプレッドシートデータを視て要約することもできる。 Once the operational data store 206 of FIG. 2 has been populated, the user can report on any of the entities 404 of FIG. 4 in the operational data store 206. The user can define the returned report data. The business intelligence tool 134 of FIG. 1 leverages the metadata 302 of the schema definition language 126 to build the report data definition 130 defined in terms of the entity 404, module 406, and property observation 408 of FIG. The business intelligence tool 134 executes the report data definition 130 for extracting the data 203 of FIG. 2 from the operational data store 206 and presents the results through one or more spreadsheets. The user can also view and summarize the spreadsheet data through various report viewers described in detail below with respect to FIG.

リポートデータの定義１３０は、リポート開始ステップ１３０２において始められる。リポート開始ステップ１３０２は、現在のコホート９１２を含み、これに対してリポートデータの定義１３０をおこなうことができる。図９のトータルコホート９０２、図１１の一致コホート１１２０、または図１１の不一致コホート１１２２さえも、リポートデータの定義１３０をおこなうために用いられることできる。 Report data definition 130 is initiated in a report start step 1302. The report start step 1302 includes the current cohort 912, against which report data definition 130 can be made. The total cohort 902 of FIG. 9, the matching cohort 1120 of FIG. 11, or even the non-matching cohort 1122 of FIG. 11 can be used to define the report data 130.

リポート開始ステップ１３０２から続くのは、データ定義ステップ１３０４である。データ定義ステップ１３０４は、リポートデータの定義１３０を含む。リポートデータの定義１３０は、どのエンティティ４０４および特性の観察４０８がリポート１３０８に含まれる予定であるかについて指定する。リポートデータの定義１３０は、デフォルトのエンティティ４０４とデフォルトの特性の観察４０８とを含む。デフォルトのエンティティ４０４は、現在のコホート９１２であり（または現在選択されているコホート）、一方でデフォルトの特性の観察４０８はエンティティ４０４のＩＤであり、特性の観察４０８は図９および１１に関し詳細に上述したようにエンティティ４０４を選択するために用いられる。 Following the report start step 1302 is a data definition step 1304. Data definition step 1304 includes report data definition 130. The report data definition 130 specifies which entities 404 and property observations 408 are to be included in the report 1308. The report data definition 130 includes a default entity 404 and a default property observation 408. The default entity 404 is the current cohort 912 (or the currently selected cohort), while the default property observation 408 is the entity 404 ID, and the property observation 408 is described in detail with respect to FIGS. Used to select entity 404 as described above.

リポートデータの定義１３０が、特性の観察４０８およびエンティティ４０４とともに一度創作され定義されたときは、データ定義ステップ１３０４から続くオプション変更１３０９におけるリポートデータの定義１３０を変更するオプションをユーザは有する。ユーザが、リポートデータの定義１３０を変更することを望むときは、オプション変更１３０９から続くエンティティオプション変更１３１０または特性オプション変更１３１２が選択されることができる。 Once the report data definition 130 has been created and defined along with the property observation 408 and the entity 404, the user has the option to change the report data definition 130 in the option change 1309 following the data definition step 1304. When the user desires to change the report data definition 130, the entity option change 1310 or the characteristic option change 1312 following the option change 1309 can be selected.

エンティティオプション変更１３１０または特性オプション変更１３１２がユーザに選択されたときは、スキーマ定義言語１２６はビジネスインテリジェンスツール１３４によって参照されることとなる。ユーザがリポートデータの定義１３０内でエンティティ４０４を変更することを選択したときは、ユーザはエンティティオプション変更１３１０を選択することとなる。 When the entity option change 1310 or characteristic option change 1312 is selected by the user, the schema definition language 126 will be referenced by the business intelligence tool 134. When the user chooses to change the entity 404 in the report data definition 130, the user will select the change entity option 1310.

ユーザが、エンティティを追加することによってエンティティ４０４を変更することを決定したときは、参照ステップ１３１４は、スキーマ定義言語１２６を参照し、他のエンティティ４０４への図６のリンクのすべてを特定する。このことは、現在のコホート９１２のサブエンティティであるエンティティ４０４を追加する際に特に重要である。リンク６０８は、参照ステップ１３１４に続くリンクステップ１３１６において特定される。 When the user decides to change entity 404 by adding an entity, reference step 1314 refers to schema definition language 126 to identify all of the links of FIG. 6 to other entities 404. This is particularly important when adding an entity 404 that is a sub-entity of the current cohort 912. The link 608 is identified in a link step 1316 that follows the reference step 1314.

リンク６０８が追加的なエンティティ４０４と前のエンティティ４０４との間で一度特定されたときは、図９のワンホップリンケージ９１４は計算ステップ１３１８において計算される。ワンホップリンケージ９１４は、新たに選択されたエンティティ４０４に対応する特性の観察４０８でありえる。つまり、ワンホップリンケージ９１４は、新たに選択されたエンティティ４０４が関連付けられた特性の観察４０８である。モジュール４０６は、ワンホップリンケージ９１４としてリターンされることもできる。新たに選択されたエンティティ４０４が、モジュール４０６またはエンティティ４０４によって指し示された場合、または代替的に、新たに選択されたエンティティ４０４が他のモジュール４０６またはエンティティ４０４を指し示したときは、これらの指し示されたモジュール４０６またはエンティティ４０４は、ワンホップリンケージ９１４としてリターンされることができる。 Once the link 608 has been identified between the additional entity 404 and the previous entity 404, the one-hop linkage 914 of FIG. The one-hop linkage 914 may be a property observation 408 corresponding to the newly selected entity 404. That is, the one-hop linkage 914 is an observation 408 of characteristics associated with the newly selected entity 404. Module 406 can also be returned as a one-hop linkage 914. If the newly selected entity 404 is pointed to by module 406 or entity 404, or alternatively, if the newly selected entity 404 points to another module 406 or entity 404, these pointers The indicated module 406 or entity 404 can be returned as a one-hop linkage 914.

計算ステップ１３１８は、オプション１３２２を表示することができる表示ステップ１３２０へと続く。エンティティ４０４が添付、追加、またはリポートデータの定義１３０へリンクされたときは、ユーザはオプション１３２２を提供されることとなる。オプション１３２２は、最初、最後、またはすべての経年的なオプションを含む。オプション１３２２は、リポートに含まれる特性のタイプに基づくフィルタリングなどのフィルタリングのオプションをさらに含むことができる。フィルタリングのオプションは、フィルタ基準に一致する結果を維持するだけとなる。オプション１３２２は、制限的、非限定的などの制約のオプションをさらに含む。制約のオプションは、ＳＱＬ内部結合対外部結合と同等の意味である。 The calculation step 1318 continues to a display step 1320 where the option 1322 can be displayed. When the entity 404 is linked to the attachment, addition, or report data definition 130, the user will be provided with an option 1322. Options 1322 include first, last, or all aging options. Options 1322 may further include filtering options such as filtering based on the type of characteristics included in the report. The filtering option will only maintain results that match the filter criteria. Options 1322 further include any constraint options, limited or non-limiting. The constraint option has the same meaning as SQL inner join versus outer join.

オプション１３２２が、ユーザに提供された後に、アップデートステップ１３２４はリポートデータの定義１３０を新たに要求されたエンティティ４０４と現在のコホート９１２へのリンク６０８とでアップデートする。特性の観察４０８をリポートデータの定義１３０に追加するときは、ビジネスインテリジェンスツール１３４は同様のフローを活用する。 After option 1322 is provided to the user, update step 1324 updates report data definition 130 with newly requested entity 404 and link 608 to current cohort 912. When adding characteristic observation 408 to report data definition 130, business intelligence tool 134 utilizes a similar flow.

リポート１３０８が特性オプション変更１３１２を表示することとなる特性の観察４０８を変更することをユーザが望む場合は、特性オプション変更１３１２が選択されることができる。ビジネスインテリジェンスツール１３４は、ユーザが追加したい特性の観察４０８が決定オプション１３２６内のリポートデータの定義１３０に現在存在するモジュール４０６に属しているか否かを判断することとなる。モジュール４０６が現在リポートデータの定義１３０の一部でないときは、参照ステップ１３１４は、スキーマ定義言語１２６を参照するために用いられる。リンク６０８がリンクステップ１３１６で計算され、最後にワンホップリンケージ９１４が計算ステップ１３１８で計算される。 If the user desires to change the property observation 408 that the report 1308 will display the property option change 1312, the property option change 1312 may be selected. The business intelligence tool 134 will determine whether the observation 408 of the characteristic that the user wants to add belongs to the module 406 that currently exists in the report data definition 130 in the decision option 1326. When module 406 is not currently part of report data definition 130, reference step 1314 is used to reference schema definition language 126. Link 608 is calculated at link step 1316 and finally one-hop linkage 914 is calculated at calculation step 1318.

ワンホップリンケージ９１４は、ユーザに追加された特性の観察４０８が対応するエンティティ４０４とモジュール４０６とに対応する他の選択されていない特性の観察４０８でありえる。つまり、ワンホップリンケージ９１４は、追加された特性の観察４０８が関連付けられた同じモジュール４０６とエンティティ４０４から追加されていない特性である。追加されていない特性の観察４０８と共に、計算ステップ１３１８はワンホップリンケージ９１４としてモジュール４０６とエンティティ４０４とをリターンすることもできる。選択された特性９０５と関連付けられたモジュール４０６とエンティティ４０４とが、他の関連付けられていないモジュール４０６とエンティティ４０４とによって指し示されたときは、これらの関連付けられていないモジュール４０６またはエンティティ４０４は、ワンホップリンケージ９１４としてリターンされることができる。つまり、いずれかの特性の観察４０８を有する図５の対象エンティティタイプ５１２または対象モジュールタイプ５１４が、ユーザに追加された特性の観察４０８に関連付けられたモジュール４０６またはエンティティ４０４へのデータポインタを含んでいるときは、このデータポインタを有する特性の観察４０８に関連付けられたモジュール４０６およびエンティティ４０４は、ワンホップリンケージ９１４としてリターンされることとなる。 The one-hop linkage 914 may be another unselected property observation 408 corresponding to the entity 404 and module 406 to which the property observation 408 added to the user corresponds. That is, the one-hop linkage 914 is a property that has not been added from the same module 406 and entity 404 with which the added property observation 408 is associated. The calculation step 1318 may return the module 406 and the entity 404 as a one-hop linkage 914, along with the observation 408 of the properties that have not been added. When the module 406 and entity 404 associated with the selected property 905 are pointed to by other unassociated modules 406 and entities 404, these unassociated modules 406 or entities 404 It can be returned as a one-hop linkage 914. That is, the target entity type 512 or target module type 514 of FIG. 5 with any characteristic observation 408 includes a data pointer to the module 406 or entity 404 associated with the characteristic observation 408 added to the user. If so, the module 406 and entity 404 associated with the observation of properties 408 having this data pointer will be returned as a one-hop linkage 914.

直前に説明されたエンティティ４０４の追加と同様に、表示ステップ１３２０はユーザに対しオプション１３２２を表示する。エンティティ４０４とモジュール４０６を追加する際に表示されるオプション１３２２は同じでありえる。しかしながら、追加のオプションまたはエンティティ４０４かモジュール４０６に独自のオプションが異なるように表示され、追加されるものに依存して表示されることができる。 Similar to adding entity 404 just described, display step 1320 displays option 1322 to the user. Options 1322 displayed when adding entity 404 and module 406 may be the same. However, additional options or entities 404 or module 406 may display different options and may be displayed depending on what is added.

オプション１３２２がユーザに提供された後に、アップデートステップ１３２４は、新たに要求されたモジュール４０６と、それらの現在のコホート９１２へのリンク６０８と、その中のモジュール４０６とで、リポートデータの定義１３０をアップデートすることとなる。決定オプション１３２６が呼び起こされ、特性の観察４０８がリポートデータの定義１３０内ですでにモジュール４０６と関連付けられていると判断したときは、参照ステップ１３１４は、スキーマ定義言語１２６を単純に参照し、リンクステップ１３１６は特性の観察４０８をリポートデータの定義１３０に取り入れるために要求されるリンク６０８を特定する。 After option 1322 is provided to the user, update step 1324 updates report data definition 130 with newly requested modules 406, links 608 to their current cohort 912, and modules 406 within them. It will be updated. When the decision option 1326 is invoked and it is determined that the characteristic observation 408 is already associated with the module 406 in the report data definition 130, the reference step 1314 simply references the schema definition language 126 and links Step 1316 identifies the link 608 required to incorporate the property observation 408 into the report data definition 130.

特性の観察４０８がリポートデータの定義１３０内ですでにモジュール４０６と関連付けられているときは、計算ステップ１３１８と表示ステップ１３２０は、ワンホップリンケージ９１４を計算するためまたはオプション１３２２を表示するために呼び起こされる必要はない。アップデートステップ１３２４は、リポートデータの定義１３０をアップデートし、ユーザにオプション変更１３０９をリターンするために呼び起こされることができる。 When the characteristic observation 408 is already associated with the module 406 in the report data definition 130, the calculation step 1318 and display step 1320 are invoked to calculate the one-hop linkage 914 or to display the option 1322. There is no need to be Update step 1324 can be invoked to update report data definition 130 and return option change 1309 to the user.

上記活用されたＩＣＤ９乳がんの例から続けると、ユーザは、分子実験に対応する品質に関する特性の観察４０８についてリポートを望むことがあり、モジュール４０６は現在のコホート９１２としてリターンされたがん患者の腫瘍サンプルエンティティ４０４について実行されるが、乳がんのタイプを示す特性の観察４０８によって階層化される。ここで、ユーザは、「Ｃｅｌファイル品質」と「トリプルネガティブ」という特性の観察４０８を、リポートデータの定義１３０に単純に追加する。スキーマ定義言語１２６は、Ｃｅｌファイル品質の特性の観察４０８を追加する際、乳がん診断の特性の観察４０８の腫瘍エンティティ４０４の診断へリンク６０８を提供するため、かつ、これらの腫瘍エンティティ４０４に対して実行される分子実験モジュールへも適切にリンクするために参照される。なぜなら、トリプルネガティブという特性の観察４０８は、リポートデータの定義１３０内にすでに含まれているがん診断モジュール４０６の一部だからである。 Continuing with the utilized ICD9 breast cancer example above, the user may wish to report on quality-related observations 408 corresponding to molecular experiments, and module 406 is a tumor patient tumor returned as the current cohort 912. Performed on the sample entity 404, but stratified by a characteristic observation 408 indicative of the type of breast cancer. Here, the user simply adds the observations 408 of the characteristics “Cel file quality” and “triple negative” to the report data definition 130. The schema definition language 126 provides a link 608 to the diagnosis of the tumor entity 404 in the breast cancer diagnosis characteristic observation 408 when adding the Cel file quality characteristic observation 408, and for these tumor entities 404. Reference is also made to properly link to the molecular experiment module to be performed. This is because the triple negative observation 408 is part of the cancer diagnostic module 406 already included in the report data definition 130.

ユーザがリポートデータの定義１３０の変更をもはや望まないときは、ユーザは、オプション変更１３０９から続く実行ステップ１３２８内でリポートを実行してもよい。スキーマ定義言語１２６は、実行ステップ１３２８から続くスキーマ参照ステップ１３３０によるリポートデータの定義１３０内で、特性の観察４０８と、モジュール４０６と、エンティティ４０４とを含むオペレーショナルデータストア２０６内の物理テーブル２１８の位置を特定するために参照されることができる。 When the user no longer wants to change the report data definition 130, the user may execute the report within an execution step 1328 that follows the option change 1309. The schema definition language 126 is the location of the physical table 218 in the operational data store 206 that includes the property observation 408, the module 406, and the entity 404 within the report data definition 130 from the execution step 1328 following the schema reference step 1330. Can be referred to to identify.

ＳＱＬ命令１３３２は、スキーマ参照ステップ１３３０から続くＳＱＬ生成ステップ１３３４によって自動的に生成されることができる。ＳＱＬ命令１３３２は、スキーマ参照ステップ１３３０から集められた情報に基づいて生成されることができる。なぜなら、物理テーブル２１８の位置と要求されたデータ２０３間のリンク６０８は、スキーマ定義言語１２６から特定されることができるからである。 The SQL instruction 1332 can be automatically generated by the SQL generation step 1334 following the schema reference step 1330. The SQL instruction 1332 can be generated based on the information gathered from the schema reference step 1330. This is because the link 608 between the location of the physical table 218 and the requested data 203 can be specified from the schema definition language 126.

ＳＱＬ命令１３３２がオペレーショナルデータストア２０６に対して一度実行されたときは、リポート１３０８は、ＳＱＬ生成ステップ１３３４に続く提示ステップ１３３６において、ユーザに対しスプレッドシートが提示されることができる。リポート１３０８は、提示ステップ１３３６に続く報告ステップ１３３８においてフィルタリングされ、表示され、分析されることができる。報告ステップ１３３８は、図１５に関し以下に詳細に説明される。 Once the SQL instruction 1332 has been executed against the operational data store 206, the report 1308 can present the spreadsheet to the user in a presentation step 1336 that follows the SQL generation step 1334. The report 1308 can be filtered, displayed and analyzed in a reporting step 1338 that follows the presentation step 1336. The reporting step 1338 is described in detail below with respect to FIG.

一例として、報告ステップ１３３８は、ピボットテーブル報告ツールを提供し、かつ、リポートデータの定義１３０で抽出されたデータ２０３を有するピボットテーブルの値と次元とをポピュレートするようユーザに促すことができる。 As an example, the reporting step 1338 may provide a pivot table reporting tool and prompt the user to populate the values and dimensions of the pivot table with the data 203 extracted in the report data definition 130.

ここで図１４を参照すると、そこには、図１のビジネスインテリジェンスツール１２６において実現されたような図１のリポートデータの定義１３０のスクリーンショットが示されている。スクリーンショット１４００は、特性の観察４０８が関連付けられた特性２２２のサンプルを描いている。 Referring now to FIG. 14, there is shown a screen shot of the report data definition 130 of FIG. 1 as implemented in the business intelligence tool 126 of FIG. Screenshot 1400 depicts a sample of characteristic 222 with associated characteristic observation 408.

デフォルトで含まれる特性の観察４０８は、患者ＩＤと図４のエンティティ４０４をフィルタリングするために用いられた特性の観察４０８とを含むことができる。説明的な一例として、スクリーンショット１４００においてデフォルトである特性の観察４０８は、患者ＩＤとＩＣＤ９である。 Characteristic observations 408 included by default can include patient ID and characteristic observations 408 used to filter entity 404 of FIG. As an illustrative example, the default characteristic observations 408 in screenshot 1400 are patient ID and ICD9.

一時的なコホート１４０２のためのリポートを生成するために、ユーザは、分析のために含まれるべきである他の特性の観察４０８を選択してもよい。一例として、ユーザはＣｅｌファイル品質やトリプルネガティブを選択することができる。描かれているように、ビジネスインテリジェンスツール１２６は、新たに追加された特性の観察４０８に対応するエンティティ４０４を含むことができる。 To generate a report for the temporary cohort 1402, the user may select an observation 408 of other characteristics that should be included for analysis. As an example, the user can select Cel file quality or triple negative. As depicted, the business intelligence tool 126 can include an entity 404 corresponding to the newly added property observation 408.

Ｃｅｌファイル品質の特性の観察４０８のために、エンティティ４０４の実験およびサンプルは、対応する特性の観察４０８の実験ＩＤおよびサンプルＩＤと共に追加されることができる。さらに描かれているように、トリプルネガティブの特性の観察４０８は、デフォルトで含まれたモジュール４０６をすでに含んでいる。特に、がん診断は、ＩＣＤ９特性の観察４０８に対応するモジュール４０６としてすでに含まれていた。 For the Cel file quality characteristic observation 408, the experiment and sample of the entity 404 can be added along with the experiment ID and sample ID of the corresponding characteristic observation 408. As further depicted, the triple negative property observation 408 already includes a module 406 included by default. In particular, cancer diagnosis was already included as module 406 corresponding to ICD9 characteristic observation 408.

ここで図１５を参照すると、そこには、本発明の実施の形態に係るリポートを解析するための制御フロー１５００が示されている。制御フロー１５００は、図１３のリポート１３０８を解析するための追加的なステップを含む図１３における前の制御フロー１３００からステップの一部を描いている。 Referring now to FIG. 15, there is shown a control flow 1500 for analyzing a report according to an embodiment of the present invention. Control flow 1500 depicts some of the steps from previous control flow 1300 in FIG. 13 that include additional steps for analyzing report 1308 of FIG.

描かれているように、リファインステップ１５０２は、データ引き出しステップ１５０４へと続くことができる。リファインステップ１５０２とデータ引き出しステップ１５０４とは、制御フロー１３００のステップからなることができる。例えば、リファインステップ１５０２は、図１のリポートデータの定義１３０をリファインまたは変更するために、図１３のデータ定義ステップ１３０４と、オプション変更１３０９と、エンティティオプション変更１３１０と、特性オプション変更１３１２と、参照ステップ１３１４と、リンクステップ１３１６と、計算ステップ１３１８と、表示ステップ１３２０と、アップデートステップ１３２４とからなる。 As depicted, the refine step 1502 can continue to a data retrieval step 1504. The refinement step 1502 and the data extraction step 1504 can be composed of the steps of the control flow 1300. For example, a refine step 1502 can be used to refine or change the report data definition 130 of FIG. 1, see data definition step 1304, option change 1309, entity option change 1310, characteristic option change 1312 of FIG. It consists of step 1314, link step 1316, calculation step 1318, display step 1320, and update step 1324.

データ引き出しステップ１５０４は、リポート１３０８内で図２のデータ２０３を引き出してリポート１３０８をユーザに表示するために、図１３のステップ実行ステップ１３２８と、スキーマ参照ステップ１３３０と、ＳＱＬ生成ステップ１３３４とからなることができる。リポート１３０８がデータ引き出しステップ１５０４において一度引き出されたときは、データ引き出しステップ１５０４から続くツール選択ステップ１５０６において、分析ツール１５０８を選択するようユーザに促すことができる。 The data extraction step 1504 includes a step execution step 1328, a schema reference step 1330, and an SQL generation step 1334 shown in FIG. 13 in order to extract the data 203 shown in FIG. be able to. Once the report 1308 has been extracted in the data extraction step 1504, the user can be prompted to select an analysis tool 1508 in the tool selection step 1506 that follows the data extraction step 1504.

ツール選択ステップ１５０６は、次にあげる分析ツールを選択することができる。すなわち、ＡＮＯＶＡ（一元）ツール、ＡＮＯＶＡ（二元）ツール、分割ツール、コックス回帰ツール、コックス回帰（多制御変数）ツール、カプラン・メイヤーのツール（二つの日付）、Ｋ平均法クラスタリングツール、ＬｉＭＭＡツール、ロジスティック回帰ツール、マン・ホイットニーのツール、１標本ｔ検定ツール、および他の分析ツールである。分析ツール１５０８がツール選択ステップ１５０６において一度選択されたときは、図１のビジネスインテリジェンスツール１３４は分析ツール１５０８を実行するために要求される入力を選択するようユーザを促すことができる。入力は、ツール選択ステップ１５０６から続くプロンプトステップ１５１０内で要求されることができる。 In the tool selection step 1506, the following analysis tools can be selected. ANOVA (one-way) tool, ANOVA (two-way) tool, partitioning tool, Cox regression tool, Cox regression (multi-control variable) tool, Kaplan-Meier tool (two dates), K-means clustering tool, LiMMA tool Logistic regression tools, Mann-Whitney tools, one-sample t-test tools, and other analytical tools. Once the analysis tool 1508 is selected in the tool selection step 1506, the business intelligence tool 134 of FIG. 1 may prompt the user to select the input required to execute the analysis tool 1508. Input can be requested in a prompt step 1510 that follows the tool selection step 1506.

ユーザが入力を一度定義すると、ユーザは、リポート１３０８からのデータ２０３をマップステップ１５１２における入力へマッピングする。データ２０３が分析ツール１５０８の入力へ一度マッピングされたときは、ビジネスインテリジェンスツール１３４は、分析を実行し、分析結果をユーザに表示する。上記ＩＣＤ９乳がんの例を続けると、仮にユーザが、患者エンティティ４０４に施された図４のさまざまな処置プロトコルの特性の観察４０８によってグループ分けされた、図４の乳がん患者エンティティ４０４に対して図４の生存期間の特性の観察４０８を比較し分析することを望んでいると仮定する。ユーザは、リポートデータの定義１３０をリファインして、分析を完全にするために要求される特性の観察４０８を含める。すなわち、診断時年齢の特性の観察４０８、生存期間の特性の観察４０８、処置プロトコルの特性の観察４０８、および患者が乳がんで亡くなったか否かを示すフラグの特性の観察４０８である。 Once the user has defined the input, the user maps the data 203 from report 1308 to the input in map step 1512. Once the data 203 has been mapped to the input of the analysis tool 1508, the business intelligence tool 134 performs the analysis and displays the analysis results to the user. Continuing with the ICD9 breast cancer example, FIG. 4 illustrates that for a breast cancer patient entity 404 of FIG. 4, the user is grouped by observation 408 of the various treatment protocol characteristics of FIG. Assume that one wants to compare and analyze the observation 408 of the lifetime characteristics of The user refines the report data definition 130 to include the required characteristic observations 408 to complete the analysis. That is, an observation age characteristic observation 408, a survival characteristic observation 408, a treatment protocol characteristic observation 408, and a flag characteristic observation 408 indicating whether the patient died of breast cancer.

リポート１３０８は、データ引き出しステップ１５０４において引き出される。ユーザは、その後、ツール選択ステップ１５０６において分析ツール１５０８であるコックス回帰を選択することができる。ユーザは、リポート１３０８からのデータ２０３をマッピングステップ１５１２における入力にマッピングし、分析を実行する。ビジネスインテリジェンスツール１３４は、分析をおこない、処置プロトコルの各組の統計的比較により戻されたビジュアルカプラン・マイヤー曲線などのさまざまな形式で、分析結果をユーザに提示する。 The report 1308 is retrieved in a data retrieval step 1504. The user can then select Cox regression which is the analysis tool 1508 in the tool selection step 1506. The user maps the data 203 from the report 1308 to the input in the mapping step 1512 and performs the analysis. The business intelligence tool 134 performs the analysis and presents the results of the analysis to the user in various forms, such as a visual Kaplan-Meier curve returned by a statistical comparison of each set of treatment protocols.

上記図面における制御フローとブロック図は、本発明のさまざまな実施の形態に係る構造および機能ならびに、システム、方法、コンピュータプログラム製品の実現可能なオペレーションを示す。この観点で、制御フローまたはブロック図における各ブロックは、特定の論理関数を実現するための１つ以上の実行可能な命令からなる、モジュール、セグメント、またはコードの部分を表してもよい。いくつかの代替的な実施の形態において、各ブロックに示された機能は図面に示された順番以外の順番で生じてもよいことにも留意すべきである。例えば、連続して示される２つのブロックは、実際、実質的に同時に実行されてもよく、または、当該ブロックは有する機能性次第で時折逆の順序で実行されてもよい。ブロック図および／またはフローチャート図の各ブロックと、ブロック図または／およびフローチャート図におけるブロックの組み合わせは、特定の機能または動作を実施する特殊目的のハードウェアに基づくシステムによってまたは特殊目的のハードウェアとコンピュータプログラムとの組み合わせによって実現されることができる。 The control flow and block diagrams in the above drawings illustrate the structure and function and the possible operations of the system, method and computer program product according to various embodiments of the present invention. In this regard, each block in the control flow or block diagram may represent a module, segment, or portion of code that consists of one or more executable instructions to implement a particular logical function. It should also be noted that in some alternative embodiments, the functions shown in each block may occur in an order other than that shown in the drawings. For example, two blocks shown in succession may in fact be executed substantially simultaneously, or may be executed in reverse order, depending on the functionality that the block has. Each block in the block diagram and / or flowchart diagram, and combinations of blocks in the block diagram or / and flowchart diagram, are based on special purpose hardware based systems or special purpose hardware and computers that perform specific functions or operations. It can be realized in combination with a program.

有利なことに、本発明の実施の形態は、高いレベルで抽象化されたスキーマ定義言語を提供するため、かつオペレーショナルデータストアを低コストかつ短いタイムフレームで活用するためのビジネスインテリジェンスツールを実現可能にするためのスキーマ定義言語とのインターフェースとなるプラットフォームを提供するための技術を提供する。スキーマ定義言語は、ビジネスインテリジェンスツールが高い抽象化レベルで動作できるようにして、基礎となるデータベースが発展したときでもこれらのツールが変更される必要がないようにする。データベースは、本発明のスキーマ定義言語を活用する際、容易かつ妨害されることなく発展する。 Advantageously, embodiments of the present invention can provide a business intelligence tool for providing a high level abstraction of schema definition language and for leveraging operational data stores at low cost and in a short time frame. To provide a platform that serves as an interface with the schema definition language. The schema definition language allows business intelligence tools to operate at a high level of abstraction so that they do not need to be changed as the underlying database evolves. The database evolves easily and unimpeded when utilizing the schema definition language of the present invention.

生じるプロセスと構成は、分かりやすく、コスト対効果が高く、複雑でなく、非常に多目的で、正確で、センシティブで、効果的であり、また、容易に、効率的かつ経済的に適用し利用するための周知の部品を採用して実施されることができる。 The resulting processes and configurations are easy to understand, cost effective, uncomplicated, very versatile, accurate, sensitive, effective, easy to apply, use efficiently and economically Can be implemented by employing well-known components.

上述の記載は本発明の実施の形態に関するが、本発明の基礎的な範囲から逸脱することなく、本発明の他のさらなる実施の形態が考案されることがあり、その範囲は後に続く請求項によって定められる。 Although the foregoing description relates to embodiments of the invention, other and further embodiments of the invention may be devised without departing from the basic scope thereof, the scope of which follows Determined by.

したがって、本発明は、含まれたクレームの範囲内であるそのような代替、変更、および変形のすべてを包含すると意図される。ここに記載されまたは添付図面に示されたすべての事項は、説明的かつ非限定的な意味において解釈されるべきである。 Accordingly, the present invention is intended to embrace all such alterations, modifications and variations that fall within the scope of the appended claims. All matters set forth herein or shown in the accompanying drawings are to be interpreted in an illustrative and non-limiting sense.

Claims

A method performed by one or more processors, comprising:
One or more schema definition languages from computer memory that define observations of properties linked to entities that are observable facts about the entities and grouped in a module with metadata. processor is the provision of,
A row for each observation of the property; an entity type defining a type of entity to which the observation of the property is linked; and a column of module type defining a type of module to which the observation of the property is linked; and The one or more processors obtain a schema definition model table having cells containing the metadata;
In an operational data store, another physical table for the module and the entity linked to each other based on at least one of the observations of the characteristic, for each module type in the schema definition model table The one or more processors generate another physical table having a column for each observation of the characteristic in the schema definition model table;
A method for the one or more processors to populate the other physical table according to the metadata.

A method according to claim 1, further comprising:
Referring to the schema definition language metadata to locate the selected property in the other physical table, and another physical table for the module or the entity includes the selected property A method in which the one or more processors determine whether or not.

The method according to claim 2, further comprising: referring to metadata of the schema definition language to identify a location of the another physical table, wherein the another physical table for the module or the entity is selected. The method in which the one or more processors determine whether or not the other physical table contains a subsequent selected property only if it contains the property.

The method according to claim 1, further comprising: when the observation of the characteristic is grouped in the same module as the selected characteristic, the one or more processors if the observation of the characteristic is a one-hop linkage. How to judge.

2. The method according to claim 1, further comprising the one or more processors classifying the entity into a matching cohort if a clinical pattern of the selected characteristic corresponds to the observation of the characteristic.
A method of classifying the entity as a mismatched cohort if the clinical pattern of the selected characteristic does not correspond to the observation of the characteristic.

The method according to claim 1, wherein the one or more processors further determine whether the module is associated with a definition of report data.
A method in which the one or more processors refer to the schema definition language when the module is not associated with a report data definition.

A method according to claim 1, comprising:
Said one or more processors to said another physical table is populated, the whether the data by referring to the metadata of the schema definition language is appropriate data type in the observation of the characteristics the one or more A method comprising determining by a processor .

A non-transitory computer-readable medium used in connection with a processor,
Providing a schema definition language for observing properties that are observable facts about the entity, linked to the entity, defining the observations of the properties grouped in a module with metadata;
A row for each observation of the property; an entity type defining a type of entity to which the observation of the property is linked; and a column of module type defining a type of module to which the observation of the property is linked; and Obtaining a schema definition model table having cells containing the metadata;
In an operational data store, another physical table for the module and the entity linked to each other based on at least one of the observations of the characteristic, for each module type in the schema definition model table Generate another physical table with a column for each observation of the characteristic in the schema definition model table;
A medium including instructions configured to populate data in the another physical table according to the metadata.

9. A non-transitory computer-readable medium according to claim 8, further comprising: referring to the metadata of the schema definition language to locate a selected characteristic in the other physical table; and A medium including instructions configured to determine whether a module or another physical table for the entity includes the selected property.

10. A non-transitory computer readable medium according to claim 9, further comprising locating the other physical table with reference to the schema definition language metadata for the module or the entity. A medium comprising instructions configured to determine whether or not another physical table includes a subsequent selected characteristic only when the other physical table includes the selected characteristic.

9. The computer readable non-transitory medium according to claim 8, further comprising the step of observing the characteristic when grouped in the same module as the selected characteristic. A medium comprising instructions configured to determine that

9. A computer-readable non-transitory medium according to claim 8, further comprising: classifying the entities into a matching cohort if a selected clinical pattern of characteristics corresponds to observation of the characteristics;
A medium comprising instructions configured to classify the entity as a mismatched cohort if a clinical pattern of the selected characteristic does not correspond to the observation of the characteristic.

9. A non-transitory computer readable medium according to claim 8, further comprising: determining whether the module is associated with a report data definition;
A medium containing instructions configured to reference the schema definition language when the module is not associated with a report data definition.

A computer-readable non-transitory medium according to claim 8,
Populating the other physical table is configured to refer to the metadata of the schema definition language to determine whether the data is a data type appropriate for observing the property. Medium containing instructions.

An observation of a property linked to an entity, which is an observable fact about the entity, and is an example of a record that includes a start date and an end date along with metadata and is one of the observations of the property A schema definition language for defining observations of the properties grouped in a module is provided by a processor from computer memory;
A row for each observation of the property; an entity type defining a type of entity to which the observation of the property is linked; and a column of module type defining a type of module to which the observation of the property is linked; and Obtaining a schema definition model table having cells containing the metadata;
In an operational data store, another physical table for the module and the entity linked to each other based on at least one of the observations of the characteristic, for each module type in the schema definition model table Configured to generate another physical table having a column for each observation of the characteristic in the schema definition model table;
A system comprising one or more processors that populates said another physical table according to said metadata.

A system according to claim 15, wherein
The one or more processors locate the selected property in the another physical table with reference to the schema definition language metadata, and another physical table for the module or the entity A system configured to determine whether a selected characteristic is included.

A system according to claim 16, comprising:
The one or more processors locate the other physical table with reference to the schema definition language metadata, and the another physical table for the module or the entity includes the selected characteristic. A system configured to determine whether the other physical table contains subsequent selected characteristics only when

A system according to claim 15, wherein
The system wherein the one or more processors are configured to determine that the property observation is a one-hop linkage when the property observation is grouped in the same module as the selected property.

A system according to claim 15, wherein
The one or more processors are:
If the clinical pattern of the selected characteristic corresponds to observation of the characteristic, classify the entity into a matching cohort;
A system configured to classify the entity as a mismatched cohort if a clinical pattern of a selected characteristic does not correspond to the observation of the characteristic.

A system according to claim 15, wherein
The one or more processors are:
Determine whether the module is associated with a report data definition;
A system configured to reference the schema definition language when the module is not associated with a report data definition.