JP7365446B2

JP7365446B2 - Method and system for performing reuse analysis for model lifecycle management

Info

Publication number: JP7365446B2
Application number: JP2022044960A
Authority: JP
Inventors: ヴェルマプラネイ; 康文鈴木; モハンサイマ
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2021-05-12
Filing date: 2022-03-22
Publication date: 2023-10-19
Anticipated expiration: 2042-03-22
Also published as: JP2022176084A

Description

本主題は、一般に、ソフトウェアの再利用に関し、限定するものではないが、より詳細にはモデルライフサイクル管理のために再利用解析を実行するための方法及びシステムに関する。 TECHNICAL FIELD The present subject matter relates generally to software reuse, and more particularly, but not exclusively, to methods and systems for performing reuse analysis for model lifecycle management.

今日の世界では、データインフラストラクチャは、分野横断的なサービスを提供するために進化している。多くの場合、データエンジニアは、ソフトウェアコードの開発に関わる時間及びリソース、ならびに新しく開発されたソフトウェアを管理する際に関わる費用を削減するために、データ解析で使用するための再利用可能な解析コードの迅速な開発及び展開を必要としている。 In today's world, data infrastructure is evolving to provide cross-cutting services. Data engineers often create reusable analysis code for use in data analysis to reduce the time and resources involved in developing software code, as well as the costs involved in managing newly developed software. requires rapid development and deployment.

現在では、コンピュータ化されたデータモデリング及び解析で使用するための再利用可能な解析コードの迅速な開発及び展開を可能にする技術がある。１つのかかる既存の技術は、ビッグデータを解析する際に使用される前処理ステップを取り込む集中型の継続的に更新される環境を必要とする。かかる取り込みは、継続的に新鮮で、ビジネスチャンスを調査する者がアクセス可能な複雑な変換及び計算を達成するのに役立つ。しかしながら、かかる既存の技術は、前処理ステップのみの再利用解析は提供するが、前処理ステップの中間ステップの再利用解析は提供しない。したがって、かかる既存の技術は、事前に格納されたコードサンプルのユニットレベル、つまり中間ステップレベルでの再利用を可能にするのではなく、高レベル、つまり事前処理ステップでの再利用だけを可能にする。前処理ステップレベルでの事前に格納されたコードサンプルの再利用は不利であり、したがって前処理ステップ全体が事前に格納されたコードサンプルの前処理ステップと一致するときにのみ、かかる再利用は可能である。一例として、前処理ステップ「ＸＹＺ」の事前に格納されたコードサンプルが、中間ステップ１、２、及び３を含むと考える。データエンジニアが、中間ステップ１、３、及び６を含む前処理ステップ「ＡＢＣ」にコードサンプルの再利用を希望すると考える。かかる場合、既存の技術で実行される再利用解析は前処理ステップレベルであって、中間ステップレベルではないため、２つの前処理ステップ「ＡＢＣ」及び「ＸＹＺ」の中間ステップ１及び３は共通であるが、データエンジニアは、前処理ステップ「ＸＹＺ」の事前に格納されたコードサンプルを再利用できない場合がある。さらに、かかる既存の技術では、再利用のために事前に格納されたコードサンプルはユーザ定義である。したがって、再利用のために提供された事前に格納されたコードサンプルは、再利用コードサンプルを定義するユーザの優先傾向に基づき、それによって再利用に利用可能なコードサンプルの数を減少させる。 Technologies now exist that allow for the rapid development and deployment of reusable analysis code for use in computerized data modeling and analysis. One such existing technique requires a centralized, continuously updated environment that captures the pre-processing steps used in analyzing big data. Such capture helps accomplish complex transformations and calculations that are continually fresh and accessible to those investigating business opportunities. However, such existing techniques provide reuse analysis of only the preprocessing step, but not of intermediate steps of the preprocessing step. Therefore, such existing techniques do not allow reuse of pre-stored code samples at the unit level, i.e., at the intermediate step level, but only at a higher level, i.e., at the preprocessing step. do. Reuse of pre-stored code samples at the pre-processing step level is disadvantageous and such re-use is therefore only possible when the entire pre-processing step matches the pre-processing step of pre-stored code samples. It is. As an example, consider that the pre-stored code sample for preprocessing step "XYZ" includes intermediate steps 1, 2, and 3. Suppose a data engineer wants to reuse code samples for preprocessing step "ABC", which includes intermediate steps 1, 3, and 6. In such a case, the reuse analysis performed with existing technology is at the preprocessing step level and not at the intermediate step level, so intermediate steps 1 and 3 of the two preprocessing steps "ABC" and "XYZ" are common. However, data engineers may not be able to reuse pre-stored code samples for pre-processing steps "XYZ". Furthermore, in such existing techniques, the code samples pre-stored for reuse are user-defined. Accordingly, pre-stored code samples provided for reuse are based on user preferences for defining reuse code samples, thereby reducing the number of code samples available for reuse.

他の既存の技術も、ほとんどマクロサービスアルゴリズムベースの再利用を開示していない。そのような既存の技術は、操作ユニットのセットを含む解析プログラムのソースコードをグラフィック表現に変換し、グラフィック表現の決定されたサブセットに基づいてマクロサービス用のコードを生成する。しかしながら、これらの既存の技術は、アルゴリズムレベルでの再利用は可能にするが、前処理ステップレベル又は中間ステップレベルでは可能にしない。さらに、これらの既存の技術は、使用されている最終的なアルゴリズムの解析ステップに基づいた再利用のためのコードサンプルの生成は可能にするが、アルゴリズムを開発中に遭遇する予備解析ステップに基づいた再利用のためのコードサンプルの生成は可能にしない。したがって、再利用に利用可能なコードサンプル、及びそのようなコードサンプルの対象は実質的にはより少なく、これは迅速なソフトウェア開発環境でのユーザのソフトウェア再利用要求を満たすことができないという危険を冒している。 Other existing techniques also rarely disclose macroservice algorithm-based reuse. Such existing techniques convert the source code of an analysis program that includes a set of operational units into a graphical representation and generate code for a macro service based on a determined subset of the graphical representation. However, these existing techniques allow reuse at the algorithm level, but not at the preprocessing step level or at the intermediate step level. Furthermore, these existing techniques allow the generation of code samples for reuse based on the final algorithmic analysis steps used, but not on the preliminary analysis steps encountered while developing the algorithm. does not enable generation of code samples for reuse. Therefore, the code samples available for reuse, and the scope of such code samples, are substantially less, which risks not being able to meet the software reuse requirements of users in rapid software development environments. It's affecting me.

したがって、中間ステップレベルでのコードサンプルの再利用を可能にし、再利用に利用可能なコードサンプルの対象を改善する必要性が存在している
本開示のこの背景技術に開示される情報は、本発明の一般的な背景の理解の向上のためだけであり、この情報が、すでに当業者に既知である先行技術を形成する旨の承認又はいかなる形式での示唆としても解釈されるべきではない。 Therefore, there is a need to enable reuse of code samples at intermediate step levels and to improve the targeting of code samples available for reuse. It is for the purpose of improving the understanding of the general background of the invention only and is not to be construed as an admission or any form of suggestion that this information constitutes prior art already known to those skilled in the art.

本明細書に開示されるのは、モデルライフサイクル管理のために再利用解析を実行する方法である。方法は、再利用解析システムにより、ユーザデバイスを介してユーザから入力要件を受け取ることを含む。入力要件は、複数の再利用パッケージから対応する再利用パッケージを決定することを容易にする。複数の再利用パッケージは、１つ以上のデータ処理アクションの各々と関連付けられた１つ以上の中間ステップの各々のために生成される。入力要件を受け取ると、方法は、入力要件に適切である再利用パッケージのタイプに対応する再利用解析構成を決定することを含む。その後、方法は、再利用解析構成に基づいて、複数の再利用パッケージと関連付けられた１つ以上の事前に格納されたメタデータを用いて入力要件の意味相似解析を実行することによって、複数の再利用パッケージから１つ以上の候補再利用パッケージを決定することを含む。最後に、方法は、ユーザへの提案として１つ以上の候補再利用パッケージを提供することを含む。 Disclosed herein is a method of performing reuse analysis for model lifecycle management. The method includes receiving input requirements from a user via a user device by a reuse analysis system. The input requirements facilitate determining a corresponding reuse package from multiple reuse packages. A plurality of reuse packages are generated for each of the one or more intermediate steps associated with each of the one or more data processing actions. Upon receiving the input requirements, the method includes determining a reuse analysis configuration corresponding to a type of reuse package that is appropriate for the input requirements. Thereafter, the method performs a semantic similarity analysis of the input requirements using one or more pre-stored metadata associated with the plurality of reuse packages based on the reuse analysis configuration. including determining one or more candidate reuse packages from the reuse packages. Finally, the method includes providing one or more candidate reuse packages as a suggestion to the user.

さらに、本開示は、モデルライフサイクル管理のために再利用解析を実行するための再利用解析システムを開示する。再利用解析システムは、プロセッサと、プロセッサに通信で結合されたメモリとを含む。メモリは、実行時に、プロセッサに、ユーザデバイスを介してユーザから入力要件を受け取らせるプロセッサ命令を格納する。入力要件は、複数の再利用パッケージから対応する再利用パッケージを決定することを容易にする。複数の再利用パッケージは、１つ以上のデータ処理アクションの各々と関連付けられた１つ以上の中間ステップの各々に生成される。入力要件を受け取ると、プロセッサは、入力要件に適切である再利用パッケージのタイプに対応する再利用解析構成を決定する。その後、プロセッサは、再利用解析構成に基づいて、複数の再利用パッケージと関連付けられた１つ以上の事前に格納されたメタデータを用いて入力要件の意味相似解析を実行することによって、複数の再利用パッケージから１つ以上の候補再利用パッケージを決定する。最後に、プロセッサは、ユーザへの提案として１つ以上の候補再利用パッケージを提供する。 Additionally, the present disclosure discloses a reuse analysis system for performing reuse analysis for model lifecycle management. The reuse analysis system includes a processor and memory communicatively coupled to the processor. The memory stores processor instructions that, when executed, cause the processor to receive input requirements from a user via a user device. The input requirements facilitate determining a corresponding reuse package from multiple reuse packages. A plurality of reuse packages are generated for each of the one or more intermediate steps associated with each of the one or more data processing actions. Upon receiving the input requirements, the processor determines a reuse analysis configuration that corresponds to a type of reuse package that is appropriate to the input requirements. The processor then performs a semantic similarity analysis of the input requirements using one or more pre-stored metadata associated with the plurality of reuse packages based on the reuse analysis configuration. One or more candidate reuse packages are determined from the reuse packages. Finally, the processor provides one or more candidate reuse packages as a suggestion to the user.

上記発明の概要は、例示目的のみであり、いかなる形でも限定的となることを意図していない。上述された例示的な態様、実施形態、及び特徴に加えて、図面及び以下の発明を実施するための形態を参照することにより、追加の態様、実施形態、及び特徴が明らかになる。 The above summary of the invention is for illustrative purposes only and is not intended to be limiting in any way. In addition to the exemplary aspects, embodiments, and features described above, additional aspects, embodiments, and features will become apparent by reference to the drawings and the following detailed description.

本開示に組み込まれ、本開示の一部を構成する添付図面は、例示的な実施形態を示し、説明とともに、開示されている原理を説明する役割を果たす。図中、参照番号の最も左の数字（複数の場合がある）は、参照番号が最初に表示される図を識別する。類似する特徴及び構成要素を参照するために、図を通して同じ番号が使用される。本主題の実施形態にかかるシステム及び／又は方法のいくつかの実施形態は、ここでほんの一例として、及び添付図を参照して説明される。 The accompanying drawings, which are incorporated in and constitute a part of this disclosure, illustrate example embodiments and, together with the description, serve to explain the disclosed principles. In the figures, the left-most digit(s) of a reference number identifies the figure in which the reference number first appears. The same numbers are used throughout the figures to refer to similar features and components. Some embodiments of systems and/or methods according to embodiments of the present subject matter will now be described, by way of example only, and with reference to the accompanying figures.

本開示のいくつかの実施形態にかかるモデルライフサイクル管理のために再利用解析を実行するための例示的なアーキテクチャを示す図である。FIG. 3 illustrates an example architecture for performing reuse analysis for model lifecycle management in accordance with some embodiments of the present disclosure. 本開示のいくつかの実施形態に従って、再利用解析を実行するために必要とされる複数の再利用パッケージを生成するためのパッケージ生成システムの詳細ブロック図を示す図である。FIG. 2 illustrates a detailed block diagram of a package generation system for generating multiple reuse packages needed to perform reuse analysis, in accordance with some embodiments of the present disclosure. 本開示のいくつかの実施形態にかかる、それぞれ、例示的な所定のナビゲーションスキーマ、及び例示的な有向データ依存状態図を示す図である。3A and 3B are diagrams illustrating an example predetermined navigation schema and an example directed data dependency state diagram, respectively, in accordance with some embodiments of the present disclosure; FIG. 本開示のいくつかの実施形態にかかる、それぞれ、例示的な所定のナビゲーションスキーマ、及び例示的な有向データ依存状態図を示す図である。3A and 3B are diagrams illustrating an example predetermined navigation schema and an example directed data dependency state diagram, respectively, in accordance with some embodiments of the present disclosure; FIG. 本開示のいくつかの実施形態にかかる、例示的な第１のメタデータを示す図である。FIG. 3 is a diagram illustrating exemplary first metadata in accordance with some embodiments of the present disclosure. 本開示のいくつかの実施形態にかかる例示的な第２のメタデータに基づいて構築された例示的なアクショ実行ンシーケンスを示す図である。FIG. 4 illustrates an example action execution sequence constructed based on example second metadata in accordance with some embodiments of the present disclosure. 本開示のいくつかの実施形態にかかる例示的な所定の関係スキーマを示す図である。FIG. 3 is a diagram illustrating an example predetermined relationship schema according to some embodiments of the present disclosure. 本開示のいくつかの実施形態にかかる例示的な第３のメタデータを示す図である。FIG. 7 is a diagram illustrating exemplary third metadata according to some embodiments of the present disclosure. 本開示のいくつかの実施形態に従って、モデルライフサイクル管理のために再利用解析を実行するための再利用解析システムの詳細ブロック図を示す図である。1 illustrates a detailed block diagram of a reuse analysis system for performing reuse analysis for model lifecycle management in accordance with some embodiments of the present disclosure. FIG. 本開示のいくつかの実施形態に従って再利用解析を実行するために必要とされる複数の再利用パッケージを生成する方法を示すフローチャートを示す図である。FIG. 3 illustrates a flowchart illustrating a method for generating multiple reuse packages needed to perform reuse analysis according to some embodiments of the present disclosure. 本開示のいくつかの実施形態に従ってモデルライフサイクル管理のために再利用解析を実行する方法を示すフローチャートを示す図ある。FIG. 3 depicts a flowchart illustrating a method of performing reuse analysis for model lifecycle management in accordance with some embodiments of the present disclosure. 本開示と一致する実施形態を実装するための例示的なコンピュータシステムのブロック図を示す図である。1 illustrates a block diagram of an example computer system for implementing embodiments consistent with this disclosure. FIG.

本明細書のいかなるブロック図も、本主題の原理を実施する例示的なシステムの概念図を表すことが当業者によって理解されるべきである。同様に、いかなるフローチャート、流れ図、状態遷移図、疑似コードなども、コンピュータ可読媒体で実質的に表され、コンピュータ又はプロセッサが明示的に示されているかどうかに関わらず、かかるコンピュータ又はプロセッサによって実行され得る多様なプロセスを表すことが理解され得る。 It should be understood by those skilled in the art that any block diagrams herein represent conceptual illustrations of example systems implementing the principles of the present subject matter. Similarly, any flowchart, flowchart, state diagram, pseudocode, etc. substantially represented on a computer-readable medium and executed by a computer or processor, whether or not such computer or processor is explicitly depicted. It can be understood to represent a variety of processes for obtaining.

本書では、単語「例示的な」は、「実施例、例、又は実例としての役割を果たすこと」を意味するために本明細書で使用される。本明細書に「例示的」として説明される本主題のいかなる実施形態又は実施態様も、他の実施形態よりも好ましい又は有利と必ずしも見なされるべきではない。 The word "exemplary" is used herein to mean "serving as an example, example, or illustration." Any embodiment or implementation of the subject matter described herein as "exemplary" is not necessarily to be considered preferred or advantageous over other embodiments.

本開示は、多様な変更形態及び代替形式の影響を受けやすいが、その具体的な実施形態は図面中に例として示され、以下に詳細に説明される。しかしながら、それは、本開示を、開示される特定の形に限定することを意図しておらず、逆に本開示は、本開示の範囲に入るすべての変更形態、均等物、及び代替策を含むことを理解されたい。 While the disclosure is susceptible to various modifications and alternative forms, specific embodiments thereof are shown by way of example in the drawings and will be described in detail below. However, it is not intended to limit this disclosure to the particular form disclosed, but on the contrary, this disclosure includes all modifications, equivalents, and alternatives falling within the scope of this disclosure. I hope you understand that.

用語「含む（ｃｏｍｐｒｉｓｅｓ）」、「含む（ｃｏｍｐｒｉｓｉｎｇ）」、又はその任意の他の変形物は、包括的な包含を対象とすることを目的とし、これにより構成要素又はステップのリストを含むセットアップ、デバイス、又は方法は、それらの構成要素又はステップだけを含むのではなく、明示的に示されていない又はかかるセットアップ若しくはデバイス若しくは方法に固有の他の構成要素又はステップを含んでよい。言い換えると、「・・・を含む（ｃｏｍｐｒｉｓｅｓ．．．ａ）」が前に置かれたシステム又は装置の１つ以上の要素は、より多くの制約を受けることなく、システム又は方法での他の要素又は追加の要素の存在を除外しない。 The term "comprises," "comprising," or any other variation thereof is intended to cover inclusiveness, thereby referring to a setup that includes a list of components or steps; A device or method may include not only those components or steps, but also other components or steps not explicitly shown or inherent to such setup or device or method. In other words, one or more elements of a system or apparatus that are preceded by "comprises...a" may be used with other elements of the system or method without further restriction. does not exclude the presence of elements or additional elements.

本明細書に開示されるのは、モデルライフサイクル管理のために再利用解析を実行するための方法及び装置である。方法は、再利用解析システムにより、ユーザと関連付けられたユーザデバイスから入力要件を受け取ることを含む。いくつかの実施形態では、入力要件は、複数の再利用パッケージから対応する再利用パッケージを決定することを容易にし得るビジネス要件（ＢＲ）、解析要件（ＡＲ）、及びデータ要件（ＤＲ）であってよい。いくつかの実施形態では、複数の再利用パッケージは、１つ以上のデータ処理アクションの各々と関連付けられた１つ以上の中間ステップの各々のために生成され得る。いくつかの実施形態では、複数の再利用パッケージは、再利用解析システムと関連付けられたパッケージ生成システムによって生成され得る。 Disclosed herein are methods and apparatus for performing reuse analysis for model lifecycle management. The method includes receiving input requirements from a user device associated with a user by a reuse analysis system. In some embodiments, the input requirements are business requirements (BR), analysis requirements (AR), and data requirements (DR) that may facilitate determining a corresponding reuse package from multiple reuse packages. It's fine. In some embodiments, multiple reuse packages may be generated for each of the one or more intermediate steps associated with each of the one or more data processing actions. In some embodiments, multiple reuse packages may be generated by a package generation system associated with a reuse analysis system.

いくつかの実施形態では、パッケージ生成システムは、使いやすい形式で、未処理の履歴データを含むデータリポジトリを最初に提供することによって複数の再利用パッケージを生成し得る。いくつかの実施形態では、使いやすい形式で未処理の履歴データを提供することは、履歴データのパラメータ、制約、及び関係性を示す第１のメタデータを生成することを含み得る。いくつかの実施形態では、第１のメタデータは、保存タイプであってよい。その後、パッケージ生成システムがユーザからデータクエリを受け取ると、パッケージ生成システムは、データクエリに基づいて１つ以上のデータ処理アクションを実行するために必要とされるプロセスデータをフェッチするために、使いやすい形式の履歴データを構文解析し得る。その間に、パケット生成システムは、データリポジトリ内の履歴データの状態、データクエリ、プロセスデータ、及びプロセスデータとプロセスデータが使用されるであろう対応するデータ処理アクションとの間の関係性を示すスナップショットを生成し得る。いくつかの実施形態では、パッケージ生成システムは、プロセスデータに対して１つ以上のデータ処理アクションを実行する要求をユーザから受け取り得、そのデータ処理アクション中、パッケージ生成システムは、１つ以上のデータ処理アクションを実行する際に関与する１つ以上の中間ステップの各々を取り込むアクション実行シーケンスを構築し得る。パッケージ生成システムは、１つ以上の中間ステップの各々のために生成された第２のメタデータ、及び所定の関係性スキーマに定義された流れに基づいてアクション実行シーケンスを生成し得る。いくつかの実施形態では、第２のメタデータは、連続した流れで、１つ以上の中間ステップの各々を実行する際に関与する、手順、アクション、参照されるモジュール、依存状態、関係性などのコード化されたレコードを示し得る。いくつかの実施形態では、第２のメタデータは、記述タイプであってよい。その後、パッケージ生成システムは、おもに第１のメタデータ、第２のメタデータ、及びアクション実行シーケンスに基づいて１つ以上のデータ処理アクションの各々のために第３のメタデータを生成し得る。いくつかの実施形態では、第３のメタデータは、記述タイプであってよい。いくつかの実施形態では、パッケージ生成システムは、第３のメタデータに基づいて複数の再利用パッケージを生成し得る。いくつかの実施形態では、第３のメタデータは、再利用パッケージを設計／構築するデータを示し得る。いくつかの実施形態では、各再利用パッケージは、各データ処理アクションを実行することに関与する１つ以上の中間ステップの各々に対応する複数の再利用コードを含み得る。第１のメタデータ、第２のメタデータ、及びパッケージ生成システムによって生成される第３のメタデータは、後続の項では１つ以上の事前に格納されたメタデータと呼ばれる、パッケージ生成システムのメモリに格納され得る。いくつかの実施形態では、１つ以上の事前に格納されたメタデータは、再利用評価を実行するために、言い換えると入力要件に適切である１つ以上の候補再利用パッケージを決定するために取り出され得る。 In some embodiments, the package generation system may generate multiple reuse packages by first providing a data repository containing raw historical data in an easy-to-use format. In some embodiments, providing raw historical data in an easy-to-use format may include generating first metadata that indicates parameters, constraints, and relationships of the historical data. In some embodiments, the first metadata may be a retention type. Then, when the package generation system receives a data query from the user, the package generation system uses an easy-to-use Can parse historical data in formats. In the meantime, the packet generation system provides a snapshot of the state of historical data in the data repository, the data queries, the process data, and the relationships between the process data and the corresponding data processing actions in which the process data will be used. Can generate shots. In some embodiments, the package generation system may receive a request from a user to perform one or more data processing actions on process data, and during the data processing action, the package generation system may perform one or more data processing actions on process data. An action execution sequence may be constructed that incorporates each of one or more intermediate steps involved in performing a processing action. The package generation system may generate an action execution sequence based on second metadata generated for each of the one or more intermediate steps and a flow defined in a predetermined relationship schema. In some embodiments, the second metadata includes, in a continuous flow, the steps, actions, referenced modules, dependencies, relationships, etc. involved in performing each of the one or more intermediate steps. may indicate an encoded record. In some embodiments, the second metadata may be of a descriptive type. Thereafter, the package generation system may generate third metadata for each of the one or more data processing actions based primarily on the first metadata, the second metadata, and the action execution sequence. In some embodiments, the third metadata may be a description type. In some embodiments, the package generation system may generate multiple reuse packages based on the third metadata. In some embodiments, the third metadata may indicate data for designing/building the reuse package. In some embodiments, each reuse package may include multiple reuse codes corresponding to each of the one or more intermediate steps involved in performing each data processing action. The first metadata, the second metadata, and the third metadata generated by the package generation system are referred to in subsequent sections as one or more pre-stored metadata in the memory of the package generation system. can be stored in In some embodiments, the one or more pre-stored metadata is used to perform a reuse evaluation, or in other words, to determine one or more candidate reuse packages that are appropriate for the input requirements. can be taken out.

したがって、入力要件が受け取られると、再利用解析システムは、入力要件に適した再利用解析構成に基づいて、複数の再利用パッケージと関連付けられた１つ以上の事前に格納されたメタデータを用いて入力要件の意味相似解析を実行することによって、複数の再利用パッケージから１つ以上の候補再利用パッケージを決定し得る。いくつかの実施形態では、このようにして決定された１つ以上の候補再利用パッケージは、ユーザへの提案として提供され得る。 Accordingly, upon receiving input requirements, the reuse analysis system uses one or more pre-stored metadata associated with the plurality of reuse packages based on a reuse analysis configuration appropriate to the input requirements. One or more candidate reuse packages may be determined from the plurality of reuse packages by performing a semantic similarity analysis of the input requirements. In some embodiments, one or more candidate reuse packages thus determined may be provided as a suggestion to a user.

本開示は、データ処理アクションを実行する際に関与する各中間ステップを記録し、各中間ステップに対応する再利用パッケージを生成する。これは、データ処理層、解析層、及びアプリケーション層にわたる再利用を可能にし、これは同様に複数の使用点のためのアプリケーションの展開及び作成のプロセスを早めるのに役立つ。再利用パッケージは各中間ステップのために生成されるので、本開示は、再利用を可能にするためにユーザの入力要件の、完全なメインステップとの相似を命令する既存の技術とは異なり、中間ステップレベル自体からの再利用を可能にする。したがって、再利用パッケージはデータ処理アクションを実行する際に関与するメインステップだけに限定されないので、本開示は、より高い再利用確率を提供する。また、本開示では、ユーザから受け取られた入力要件の（本開示では、再利用解析構成とも呼ばれる）構成タイプに基づいて、第３のメタデータに存在する情報ラベルがチェックされ、それが、複数の再利用パッケージの間で１つ以上の候補再利用パッケージを決定するのに役立つ。かかる検索プロセスは、検索を、入力要件の構成タイプに属する再利用パッケージだけに絞り込み、それによって使用するための１つ以上の候補再利用パッケージの容易で迅速且つ正確な決定を達成する。また、本開示は、どの再利用パッケージが生成されるのかに基づいて各ステップ、中間ステップ、アクションなどでデータのスナップショットを撮り、持続させる。かかるデータの持続及びスナップショットは、予備的なステップ及びアクションに関連するデータも含み得る。予備的なステップ及びアクションは、正しいステップ及びアクションに到達する前に、ユーザによって実行されるトライアルステップ及びアクションとして定義できるであろう。したがって、再利用パッケージは、予備的なステップ及びアクションにも基づいて生成され、これは同様に、再利用パッケージの数及び対象を増加させるのに役立つ。さらに、本開示では、再利用パッケージは、パッケージ生成システムにより、データ処理アクションを実行しながら、関与する各中間ステップのために自動的に生成される。したがって、再利用パッケージを生成するために使用されるデータを定義／選択するための手動介入は完全に排除され、それによって使用するための再利用パッケージの対象を拡大する。 The present disclosure records each intermediate step involved in performing a data processing action and generates a reuse package corresponding to each intermediate step. This enables reuse across data processing, analysis, and application layers, which also helps speed up the process of deploying and creating applications for multiple points of use. Because a reuse package is generated for each intermediate step, the present disclosure differs from existing techniques that mandate similarity of user input requirements to the complete main step to enable reuse. Enables reuse from intermediate step levels themselves. Therefore, the present disclosure provides a higher reuse probability since the reuse package is not limited to only the main steps involved in performing data processing actions. The present disclosure also checks the information labels present in the third metadata based on the configuration type (also referred to in this disclosure as reuse analysis configuration) of the input requirements received from the user, and the information labels present in the third metadata are is useful for determining one or more candidate reuse packages among reuse packages. Such a search process narrows the search to only reuse packages belonging to the configuration type of the input requirements, thereby achieving an easy, quick, and accurate determination of one or more candidate reuse packages for use. The present disclosure also takes and persists snapshots of data at each step, intermediate step, action, etc. based on which reuse package is generated. Such data persistence and snapshots may also include data related to preliminary steps and actions. Preliminary steps and actions could be defined as trial steps and actions performed by the user before arriving at the correct steps and actions. Therefore, reuse packages are also generated based on preliminary steps and actions, which likewise serves to increase the number and scope of reuse packages. Further, in this disclosure, reuse packages are automatically generated by the package generation system for each intermediate step involved while performing data processing actions. Therefore, manual intervention to define/select the data used to generate the reuse package is completely eliminated, thereby expanding the scope of the reuse package for use.

概して、本開示は、手動介入を排除し、再利用パッケージの対象を拡大し、中間ステップレベル自体から再利用パッケージを導入することによってより高い再利用確率を確実にし、入力要件の構成タイプを決定することによって候補再利用パッケージの迅速で正確な検索を可能にする。したがって、本開示は、ソフトウェアコードの開発に関わる時間及びリソース、並びに新しく開発されたソフトウェアを管理する際に関わる費用を大きく削減するのに役立つ。 In general, the present disclosure eliminates manual intervention, expands the scope of reuse packages, ensures higher reuse probability by introducing reuse packages from the intermediate step level itself, and determines the configuration type of input requirements. enables quick and accurate retrieval of candidate reuse packages. Accordingly, the present disclosure helps to significantly reduce the time and resources involved in developing software code, as well as the costs involved in managing newly developed software.

本開示の実施形態の以下の詳細な説明では、本明細書の一部を形成し、本開示を実践し得る特定の実施形態が実例として示される添付図面が参照される。これらの実施形態は、当業者が本開示を実践することを可能にするほど十分に詳しく説明され、他の実施形態が利用され得、本開示の範囲から逸脱することなく変更が加えられ得ることを理解されたい。したがって、以下の説明は、限定的な意味で解釈されるべきではない。 In the following detailed description of embodiments of the present disclosure, reference is made to the accompanying drawings that form a part hereof, and in which is shown by way of illustration certain embodiments in which the present disclosure may be practiced. These embodiments are described in sufficient detail to enable those skilled in the art to practice the disclosure, and it is understood that other embodiments may be utilized and changes may be made without departing from the scope of the disclosure. I want you to understand. Therefore, the following description should not be construed in a limiting sense.

図１は、本開示のいくつかの実施形態にかかるモデルライフサイクル管理のために再利用解析を実行するための例示的なアーキテクチャを示す。 FIG. 1 illustrates an example architecture for performing reuse analysis for model lifecycle management according to some embodiments of the present disclosure.

アーキテクチャ１００は、データソース１０１_１からデータソース１０１_ｎ（集合的に、１つ以上のデータソース１０１と呼ばれる）、データリポジトリ１０３、パッケージ生成システム（ＰＧＳ）１０５、パッケージリポジトリ１０７、再利用解析システム（ＲＡＳ）１０９、ユーザデバイス１１１、及びユーザ１１３を含む。いくつかの実施形態では、１つ以上のデータソース１０１は、例えば医療リポジトリ、輸送リポジトリ、金融リポジトリ、レストランリポジトリ、電気通信リポジトリ、ブロックチェーンレコードリポジトリなど、企業体と関連付けられたリポジトリであってよい。いくつかの実施形態では、ＰＧＳ１０５は、ＰＧＳ１０５と関連付けられた１つ以上のデータソース１０１の各々から未処理データを受け取り、未処理データを履歴データとして、ＰＧＳ１０５と関連付けられたデータリポジトリ１０３に格納し得る。いくつかの実施形態では、未処理データは、例えば機密扱いされていないフォーマット、クラッタフォーマット、異なるフォーマットなど、その元の形式でのデータである場合がある。かかる未処理データは、特定の使いやすい形式、つまりデータ及び関係性、データ内での依存状態、データの場所などが十分に定義され、接続され、分類されているなどの形式への変換を必要とし、これがデータを使用可能にする。いくつかの実施形態では、ＰＧＳ１０５は、（図１に示されない）通信ネットワークを介して１つ以上のデータソース１０１と通信し得る。通信ネットワークは、有線通信ネットワーク、無線通信ネットワーク、又は有線通信ネットワークと無線通信ネットワーク両方の組合せの１つであってよい。いくつかの実施形態では、ＰＧＳ１０５は、複数の再利用パッケージを生成するためにデータリポジトリ１０３からの履歴データにアクセスし得る。いくつかの実施形態では、複数の再利用パッケージは、再利用解析を実行するために使用され得る。 The architecture 100 includes data sources 101 ₁ through data sources 101 _n (collectively referred to as one or more data sources 101), a data repository 103, a package generation system (PGS) 105, a package repository 107, and a reuse analysis system ( RAS) 109, user device 111, and user 113. In some embodiments, one or more data sources 101 may be a repository associated with a business entity, such as, for example, a medical repository, a transportation repository, a financial repository, a restaurant repository, a telecommunications repository, a blockchain records repository, etc. . In some embodiments, the PGS 105 receives raw data from each of the one or more data sources 101 associated with the PGS 105 and stores the raw data as historical data in a data repository 103 associated with the PGS 105. obtain. In some embodiments, raw data may be data in its original form, such as, for example, in an unclassified format, a cluttered format, a different format, etc. Such raw data requires transformation into a specific, easy-to-use format, one in which the data and relationships, dependencies within the data, location of the data, etc. are well defined, connected, and categorized. and this makes the data usable. In some embodiments, PGS 105 may communicate with one or more data sources 101 via a communication network (not shown in FIG. 1). The communication network may be one of a wired communication network, a wireless communication network, or a combination of both wired and wireless communication networks. In some embodiments, PGS 105 may access historical data from data repository 103 to generate multiple reuse packages. In some embodiments, multiple reuse packages may be used to perform reuse analysis.

ＰＧＳ１０５は、図１に示されるように、ＰＧＳプロセッサ１１５、ＰＧＳ入出力（Ｉ／Ｏ）インタフェース１１７、及びＰＧＳメモリ１１９を含み得る。ＰＧＳＩ／Ｏインタフェース１１７は、履歴データを未処理形式から使いやすい形式に変換することによって取得されるサービスとしてのデータを提供し得る。いくつかの実施形態では、ＰＧＳプロセッサ１１５は、履歴データを変換しながら、複数の再利用パッケージを生成するために必要とされる第１のメタデータを生成し得る。いくつかの実施形態では、第１のメタデータは、保存タイプであってよい。いくつかの実施形態では、第１のメタデータは、履歴データのパラメータ、制約、及び関係性を含む場合があるが、これらに限定されない。サービスとしてのデータを提供すると、ＰＧＳプロセッサ１１５は、ユーザデバイス１１１を介してユーザ１１３からデータクエリを受け取り得る。いくつかの実施形態では、データクエリは、ＰＧＳ１０５による１つ以上のデータ処理アクションを実行するために必要とされるプロセスデータに関連する場合がある。一例として、１つ以上のデータ処理アクションは、サンプリング、フィルトレーション、アグリゲーションなどを含む場合があるが、これらに限定されない。ＰＧＳプロセッサ１１５は、データクエリに基づいて１つ以上のデータ処理アクションを実行するために必要とされるプロセスデータをフェッチするために使いやすい形式の履歴データを構文解析し得る。 PGS 105 may include a PGS processor 115, a PGS input/output (I/O) interface 117, and a PGS memory 119, as shown in FIG. PGS I/O interface 117 may provide data-as-a-service obtained by converting historical data from a raw format to an easy-to-use format. In some embodiments, PGS processor 115 may generate the first metadata needed to generate multiple reuse packages while converting historical data. In some embodiments, the first metadata may be a retention type. In some embodiments, the first metadata may include, but is not limited to, historical data parameters, constraints, and relationships. Providing data as a service, PGS processor 115 may receive data queries from user 113 via user device 111. In some embodiments, data queries may relate to process data needed to perform one or more data processing actions by PGS 105. As an example, one or more data processing actions may include, but are not limited to, sampling, filtration, aggregation, and the like. PGS processor 115 may parse historical data in a convenient format to fetch process data needed to perform one or more data processing actions based on a data query.

また、並行して、ＰＧＳプロセッサ１１５は、１つ以上のデータ処理アクションを実行し、対応するデータ処理アクションの各々と関連付けられた１つ以上の中間ステップの各々のアクション実行シーケンスを構築しながら、１つ以上の状態スナップショットを作成してもよい。いくつかの実施形態では、１つ以上の状態スナップショットは、エンティティの現在の状態を取り込み、例えば、１つ以上の状態スナップショットは、データリポジトリ１０３の履歴データの状態、データクエリ、データクエリに基づいてフェッチされたプロセスデータ、及びプロセスデータと、プロセスデータが使用されるであろう対応するデータ処理アクションとの間の関係性を示し得る。いくつかの実施形態では、ＰＧＳプロセッサ１１５は、アクション実行シーケンスを構築しながら、１つ以上の中間ステップの各々のための第２のメタデータを生成し得る。いくつかの実施形態では、第２のメタデータは記述タイプであってよい。いくつかの実施形態では、第２のメタデータは、連続した流れで、１つ以上の中間ステップの各々を実行することに関与する手順、アクション、参照されるモジュール、依存状態、関係性などのコード化されたレコードを示し得る。その後、ＰＧＳプロセッサ１１５は、おもに第１のメタデータ、第２のメタデータ、及びアクション実行シーケンスに基づいて、１つ以上のデータ処理アクションの各々のために第３のメタデータを生成し得る。いくつかの実施形態では、第３のメタデータは、記述タイプであってよい。いくつかの実施形態では、第３のメタデータは、対応するデータ処理アクションの１つ以上の中間ステップ、１つ以上の中間ステップの各々のために生成された第２のメタデータ、第２のメタデータに対応する第１のメタデータに対する参照、１つ以上の中間ステップの各々のアクション実行シーケンス、各アクション実行シーケンスに対応する識別子（ＩＤ）、１つ以上の中間ステップの各々及び対応するデータ処理アクションと関連付けられた所定のデータクエリ、再利用パッケージのタイプ、並びにパッケージ識別子（ＩＤ）を含む場合があるが、これらに限定されない。いくつかの実施形態では、第３のメタデータは、再利用パッケージを設計／構築するデータを示し得る。ＰＧＳプロセッサ１１５は、第３のメタデータに基づき、ＰＧＳ１０５と関連付けられたパッケージリポジトリ１０７に格納された複数の再利用パッケージを生成し得る。いくつかの実施形態では、各再利用パッケージは、各データ処理アクションを実行する際に関与する１つ以上の中間ステップの各々に対応する複数の再利用コードを含み得る。 Also, in parallel, the PGS processor 115 executes one or more data processing actions and constructs an action execution sequence for each of the one or more intermediate steps associated with each of the corresponding data processing actions. One or more state snapshots may be created. In some embodiments, the one or more state snapshots capture the current state of the entity, e.g., the one or more state snapshots capture the state of historical data in the data repository 103, the data queries, the data queries, etc. and the relationship between the process data and the corresponding data processing action in which the process data would be used. In some embodiments, PGS processor 115 may generate second metadata for each of the one or more intermediate steps while building the action execution sequence. In some embodiments, the second metadata may be of a descriptive type. In some embodiments, the second metadata includes, in a continuous flow, the steps, actions, referenced modules, dependencies, relationships, etc. involved in performing each of the one or more intermediate steps. May indicate encoded records. Thereafter, PGS processor 115 may generate third metadata for each of the one or more data processing actions based primarily on the first metadata, the second metadata, and the action execution sequence. In some embodiments, the third metadata may be a description type. In some embodiments, the third metadata includes the one or more intermediate steps of the corresponding data processing action, the second metadata generated for each of the one or more intermediate steps, the second a reference to first metadata corresponding to the metadata, an action execution sequence for each of the one or more intermediate steps, an identifier (ID) corresponding to each action execution sequence, each of the one or more intermediate steps and corresponding data. It may include, but is not limited to, predetermined data queries associated with the processing action, the type of reuse package, and a package identifier (ID). In some embodiments, the third metadata may indicate data for designing/building the reuse package. PGS processor 115 may generate a plurality of reuse packages stored in package repository 107 associated with PGS 105 based on the third metadata. In some embodiments, each reuse package may include multiple reuse codes corresponding to each of the one or more intermediate steps involved in performing each data processing action.

さらに、ＲＡＳ１０９は、図１に示されるように、パッケージリポジトリ１０７と関連付けられてよい。いくつかの実施形態では、ＲＡＳ１０９は、ユーザ（複数の場合がある）１１３と関連付けられたユーザデバイス１１１を介してユーザ（複数の場合がある）１１３から受け取られたデータクエリ及び要求に基づいて、複数の再利用パッケージから１つ以上の候補再利用パッケージを決定し、取り出すために、再利用解析を実行し得る。いくつかの実施形態では、ＲＡＳ１０９と通信するユーザ１１３は、ＰＧＳ１０５と通信するユーザ１１３と同じである場合がある。いくつかの他の実施形態では、異なるユーザが、ＲＡＳ１０９及びＰＧＳ１０５と通信している場合がある。いくつかの実施形態では、ＲＡＳ１０９は、以後、１つ以上の候補再利用パッケージを、ソフトウェア開発での再利用のためにユーザ１１３への提案として提供し得る。 Additionally, RAS 109 may be associated with package repository 107, as shown in FIG. In some embodiments, the RAS 109 performs the following based on data queries and requests received from the user(s) 113 via the user device 111 associated with the user(s) 113: Reuse analysis may be performed to determine and retrieve one or more candidate reuse packages from the plurality of reuse packages. In some embodiments, the user 113 communicating with RAS 109 may be the same user 113 communicating with PGS 105. In some other embodiments, different users may be communicating with RAS 109 and PGS 105. In some embodiments, RAS 109 may subsequently provide one or more candidate reuse packages as suggestions to user 113 for reuse in software development.

いくつかの実施形態では、ＲＡＳ１０９は、図１に示されるように、ＲＡＳプロセッサ１２１、ＲＡＳ入出力（Ｉ／Ｏ）インタフェース１２３、及びＲＡＳメモリ１２５を含む。ＲＡＳＩ／Ｏインタフェース１１７は、ユーザ１１３から入力要件を受け取り得、それが、ユーザ１１３に適切な複数の再利用パッケージから対応する再利用パッケージを決定するのを促進し得る。いくつかの実施形態では、入力要件は、ビジネス要件（ＢＲ）、解析要件（ＡＲ）、及びデータ要件（ＤＲ）を含む場合があるが、これらに限定されない。いくつかの実施形態では、ＢＲは、高レベルでのユーザの要件又は目的である場合があり、ＡＲはユーザ１１３に必要とされる結果を提供するためにデータ解析を実行することを必要とする特定のＢＲ以下の要件である場合があり、ＤＲは、多様なデータ処理活動を実行するためにデータをフェッチする又は準備することを必要とする特定のＢＲ及びＡＲ以下の要件である場合がある。いくつかの実施形態では、複数の再利用パッケージは、１つ以上のデータ処理アクションの各々と関連付けられた１つ以上の中間ステップの各々のために生成され得る。入力要件を受け取ると、ＲＡＳプロセッサ１２１は、入力要件に適した再利用パッケージのタイプに対応する再利用解析構成を決定し得る。いくつかの実施形態では、ＲＡＳプロセッサ１２１は、１つ以上の分類技術を使用して再利用パッケージのタイプに入力要件をマッピングすることによって、再利用解析構成を決定し得る。 In some embodiments, RAS 109 includes a RAS processor 121, a RAS input/output (I/O) interface 123, and a RAS memory 125, as shown in FIG. RAS I/O interface 117 may receive input requirements from user 113, which may facilitate determining a corresponding reuse package from a plurality of reuse packages that is appropriate for user 113. In some embodiments, input requirements may include, but are not limited to, business requirements (BR), analysis requirements (AR), and data requirements (DR). In some embodiments, BR may be a high-level user requirement or objective, and AR requires performing data analysis to provide the user 113 with the required results. The DR may be a requirement below a specific BR and AR that requires fetching or preparing data to perform various data processing activities. . In some embodiments, multiple reuse packages may be generated for each of the one or more intermediate steps associated with each of the one or more data processing actions. Upon receiving the input requirements, RAS processor 121 may determine a reuse analysis configuration that corresponds to a type of reuse package suitable for the input requirements. In some embodiments, RAS processor 121 may determine the reuse analysis configuration by mapping input requirements to types of reuse packages using one or more classification techniques.

さらに、ＲＡＳプロセッサ１２１は、再利用解析構成に基づいて、複数の再利用パッケージと関連付けられた１つ以上の事前に格納されたメタデータを用いて入力要件の意味相似解析を実行することによって、複数の再利用パッケージから１つ以上の候補再利用パッケージを決定し得る。１つ以上の事前に格納されたメタデータは、本開示の以降の段落で説明される、第１のメタデータ、第２のメタデータ、及び第３のメタデータの少なくとも１つを指す場合がある。いくつかの実施形態では、第１のメタデータ、第２のメタデータ、及び第３のメタデータは、複数の再利用パッケージを生成するときに生成され、ユーザ１１３から受け取られた入力要件に基づいて再利用解析を実行する前に、ＰＢＳメモリ１１９及びパッケージリポジトリ１０７の少なくとも１つに格納され得る。いくつかの実施形態では、１つ以上の候補再利用パッケージは、その１つ以上の事前に格納されたメタデータが入力要件と意味論的に一致する再利用パッケージである。いくつかの実施形態では、１つ以上の候補再利用パッケージの各々は、入力要件と１つ以上の事前に格納されたメタデータとの間の意味論的な一致の程度を示すスコアを割り当てられてよい。いくつかの実施形態では、スコアは、ユーザ１１３が、使用のために適切な候補再利用パッケージを選択するのに役立つ場合がある。その後、ＲＡＳプロセッサ１２１は、ソフトウェア開発動作でのソフトウェア再利用を可能にするために、ユーザ１１３への提案として１つ以上の候補再利用パッケージを提供し得る。 Additionally, RAS processor 121 performs semantic similarity analysis of the input requirements using one or more pre-stored metadata associated with the plurality of reuse packages based on the reuse analysis configuration. One or more candidate reuse packages may be determined from the plurality of reuse packages. The one or more pre-stored metadata may refer to at least one of first metadata, second metadata, and third metadata, described in subsequent paragraphs of this disclosure. be. In some embodiments, the first metadata, the second metadata, and the third metadata are generated when generating the multiple reuse packages and are based on input requirements received from the user 113. may be stored in at least one of PBS memory 119 and package repository 107 before performing reuse analysis. In some embodiments, the one or more candidate reuse packages are reuse packages whose one or more pre-stored metadata semantically match the input requirements. In some embodiments, each of the one or more candidate reuse packages is assigned a score indicating the degree of semantic match between the input requirements and the one or more pre-stored metadata. It's fine. In some embodiments, the score may assist user 113 in selecting appropriate candidate reuse packages for use. RAS processor 121 may then provide one or more candidate reuse packages as suggestions to user 113 to enable software reuse in software development operations.

図２Ａは、本開示のいくつかの実施形態に従って複数の再利用パッケージを再生するためのパッケージ生成システム（ＰＧＳ）の詳細なブロック図である。 FIG. 2A is a detailed block diagram of a package generation system (PGS) for reclaiming multiple reuse packages in accordance with some embodiments of the present disclosure.

いくつかの実施態様では、ＰＧＳ１０５はデータ２０３及びモジュール２０５を含み得る。一例として、ＰＧＳ１０５のデータ２０３は、図２Ａに示されるように、ＰＧＳ１０５内で構成されたメモリ１１９に格納される。一実施形態では、ＰＧＳ１０５のデータ２０３は、第１のメタデータ２０７、スナップショットデータ２０９、第２のメタデータ２１１、アクションシーケンスデータ２１３、第３のメタデータ２１５、再利用パッケージデータ２１７、及び他のデータ２１９を含み得る。図示されている図２Ａでは、ＰＧＳ１０５のモジュール２０５が本明細書で詳細に説明される。 In some implementations, PGS 105 may include data 203 and modules 205. As an example, data 203 of PGS 105 is stored in memory 119 configured within PGS 105, as shown in FIG. 2A. In one embodiment, the data 203 of the PGS 105 includes first metadata 207, snapshot data 209, second metadata 211, action sequence data 213, third metadata 215, reuse package data 217, and others. data 219 may be included. 2A, module 205 of PGS 105 is described in detail herein.

いくつかの実施形態では、ＰＧＳ１０５のデータ２０３は、様々なデータ構造の形式でＰＧＳ１０５のメモリ１１９に格納され得る。さらに、ＰＧＳ１０５のデータ２０３は、リレーショナルデータモデル又は階層データモデルなど、データモデルを使用して編成することができる。いくつかの実施形態では、他のデータ２１９は、ＰＧＳ１０５の様々な機能を実行するためにＰＧＳ１０５のモジュール２０５によって生成された一時データ及び一次ファイルを含むデータを格納し得る。 In some embodiments, data 203 of PGS 105 may be stored in memory 119 of PGS 105 in the form of various data structures. Additionally, data 203 in PGS 105 may be organized using a data model, such as a relational data model or a hierarchical data model. In some embodiments, other data 219 may store data including temporary data and primary files generated by modules 205 of PGS 105 to perform various functions of PGS 105.

いくつかの実施形態では、ＰＧＳメモリ１１９に格納されたＰＧＳ１０５のデータ２０３は、ＰＧＳ１０５のモジュール２０５によって処理され得る。モジュール２０５は、ＰＧＳ１０５のメモリ１１９内に格納され得る。また、一例では、ＰＧＳ１０５内で構成されたＰＧＳプロセッサ１１５に通信で結合されたモジュール２０５は、図２Ａに示されるようにＰＧＳメモリ１１９の外に存在し、ハードウェアとして実装され得る。本明細書で使用されるように、用語モジュールは、特定用途向け集積回路（ＡＳＩＣ）、電子回路、１つ以上のソフトウェアプログラム又はファームウェアプログラムを実行するプロセッサ（共用、専用、又はグループ）及びメモリ、組合せ論理回路、及び／又は説明される機能を提供する他の適切な構成要素を指す。 In some embodiments, data 203 of PGS 105 stored in PGS memory 119 may be processed by module 205 of PGS 105. Module 205 may be stored within memory 119 of PGS 105. Also, in one example, module 205 communicatively coupled to PGS processor 115 configured within PGS 105 may reside outside of PGS memory 119 and be implemented as hardware, as shown in FIG. 2A. As used herein, the term module refers to an application specific integrated circuit (ASIC), an electronic circuit, a processor (shared, dedicated, or group) that executes one or more software or firmware programs, and memory; Refers to combinational logic circuits and/or other suitable components that provide the described functionality.

一実施形態では、ＰＧＳ１０５のモジュール２０５は、例えば、変換モジュール２２１、メタデータ生成モジュール２２３、スナップショット作成モジュール２２５、シーケンス構築モジュール２２７、パッケージ生成モジュール２２９、及び他のモジュール２３１を含み得る。他のモジュール２３１は、ＰＧＳ１０５の様々な多岐にわたる機能を実行するために使用され得る。ＰＧＳ１０５のかかる上述のモジュール２０５が、単一のモジュール、又は異なるモジュールの組合せとして表され得ることを理解されたい。 In one embodiment, the modules 205 of PGS 105 may include, for example, a transformation module 221, a metadata generation module 223, a snapshot creation module 225, a sequence construction module 227, a package generation module 229, and other modules 231. Other modules 231 may be used to perform a variety of diverse functions of PGS 105. It is to be understood that such above-described modules 205 of PGS 105 may be represented as a single module or a combination of different modules.

いくつかの実施態様では、変換モジュール２２１は、サービスとしてのデータを提供するために履歴データを未処理形式から使いやすい形式に変換し得る。いくつかの実施形態では、履歴データは、ＰＧＳ１０５と関連付けられたデータリポジトリ１０３に格納され得る。一例として、履歴データは、医療リポジトリ、輸送リポジトリ、金融リポジトリ、レストランリポジトリ、電気通信リポジトリ、ブロックチェーンレコードリポジトリなど、１つ以上のデータソース１０１から受け取られたデータを含み得る。いくつかの実施形態では、サービスとしてのデータを提供するために、変換モジュール２２１は、履歴データを最初に構文解析して、履歴データのパラメータ、制約、及び関係性を学習し得る。その後、変換モジュール２２１は、履歴データの学習されたパラメータ、制約、及び関係性、並びに所定のデータナビゲーションスキーマに基づいて有向データ依存状態図を作成し得る。いくつかの実施形態では、所定のナビゲーションスキーマは、データナビゲーションコマンド、データエンドポイント、関数、モジュール名などを使用してパラメータ、制約、及び関係性を接続するために一般的なコード化された形式のレイアウトであってよい。いくつかの実施形態では、有向データ依存状態図は、所定のナビゲーションスキーマにより提供されるレイアウトに基づいた、学習されたパラメータ、制約、及び関係性を表すモジュール構造であってよい。いくつかの実施形態では、有向データ依存状態図は、データ識別子、履歴データ、データナビゲーションコマンド、及びデータエンドポイントを含む場合があるが、これらに限定されるものではなく、各データエンドポイントは、アプリケーションプログラミングインタフェース（ＡＰＩ）と関連付けられる。例示的な所与のナビゲーションスキーマ及び例示的な有向データ依存状態図は、それぞれ図２Ｂ及び図２Ｃに示される通りであってよい。図２Ｂに示されるように、所定のナビゲーションスキーマは、スキーマの題名、モジュール、パラメータ、制約などのスキーマで参照されるリソース、データ表現のタイプ、データ接続及びアクションを表すリンク、関係、ハイパーテキスト参照（ｈｒｅｆ）の形式のデータエンドポイント、データに適用された方法などを含む場合があるが、これらに限定されない。いくつかの実施形態では、関係は、「深く掘り下げる」、「終える」、「浸透する」などのデータナビゲーションコマンドを使用して示され得る。いくつかの実施形態では、「深く掘り下げる」は、データを検索しながら１レベル深く移動することを可能にするデータナビゲーションコマンドであってよく、「終える」は、データを検索しながら１レベル上に移動することを可能にするデータナビゲーションコマンドであってよく、「浸透する」は、データを検索しながら上位レベルまで浸透することを可能にするナビゲーションコマンドであってよい。いくつかの他の実施形態では、関係は、モジュール内のデータが処理されるべきである方法を示し得る。一例として、「要約する」として示される関係は、指定されたモジュール内のデータが要約される必要があることを意味する場合がある。同様に、「関連する」として示された関係は、指定されたモジュール内のデータを相互に関連付ける必要があることを意味し得る。 In some implementations, conversion module 221 may convert historical data from a raw format to an easy-to-use format to provide data-as-a-service. In some embodiments, historical data may be stored in a data repository 103 associated with PGS 105. As an example, historical data may include data received from one or more data sources 101, such as a medical repository, a transportation repository, a financial repository, a restaurant repository, a telecommunications repository, a blockchain records repository, etc. In some embodiments, to provide data as a service, transformation module 221 may first parse historical data to learn parameters, constraints, and relationships of the historical data. The transformation module 221 may then create a directed data dependency state diagram based on the learned parameters, constraints, and relationships of the historical data and the predetermined data navigation schema. In some embodiments, a predetermined navigation schema provides a common coded format for connecting parameters, constraints, and relationships using data navigation commands, data endpoints, functions, module names, etc. The layout may be as follows. In some embodiments, a directed data dependency state diagram may be a modular structure representing learned parameters, constraints, and relationships based on a layout provided by a predetermined navigation schema. In some embodiments, a directed data dependency state diagram may include, but is not limited to, data identifiers, historical data, data navigation commands, and data endpoints, where each data endpoint is , associated with an application programming interface (API). An example given navigation schema and an example directed data dependency state diagram may be as shown in FIGS. 2B and 2C, respectively. As shown in Figure 2B, a given navigation schema includes the schema title, modules, parameters, resources referenced in the schema such as constraints, types of data representation, links representing data connections and actions, relationships, hypertext references, etc. (href), methods applied to the data, etc. In some embodiments, relationships may be indicated using data navigation commands such as "dive deep," "finish," "penetrate," etc. In some embodiments, "Dig Deep" may be a data navigation command that allows you to move one level deeper while searching for data, and "Finish" allows you to move up one level while searching for data. It may be a data navigation command that allows you to navigate, and "penetrate" may be a navigation command that allows you to penetrate to a higher level while searching for data. In some other embodiments, relationships may indicate how data within a module should be processed. As an example, a relationship designated as "summarize" may mean that data within the specified module needs to be summarized. Similarly, a relationship indicated as "related" may mean that the data within the specified module must be related to each other.

さらに、図２Ｂに示される方法「取得」は「ｈｒｅｆ：／／」と並行してデータエンドポイントを使用して示される特定のモジュールからのデータを読み取り、取り出すための手順を適用するコマンドであってよい。 Additionally, the method "get" shown in Figure 2B is a command that applies a procedure for reading and retrieving data from a particular module shown using a data endpoint in parallel with "href://". It's fine.

図２Ｃに示されるように、有向データ依存状態グラフは、モジュールの識別子、モジュールと関連付けられたデータ、及び有向データ依存状態グラフを検索するためのデータナビゲーションコマンド、及び各モジュールにアクセスするために必要とされるデータエンドポイントを含み得る。図２Ｃで、矢印マークはデータナビゲーションの方向を示し、例えば、「Ｉｄ１」によって示される第１のモジュールでのデータナビゲーションコマンド「深く掘り下げる」の選択は、１レベル下への、つまり「Ｉｄ２」によって示される第２のモジュールへのナビゲーションを可能にし得、これは、第１のモジュールの「深く掘り下げる」から第２のモジュールの「Ｉｄ２」への矢印を使用して、図２Ｃに示される。同様に、「Ｉｄ２」によって示される第２のモジュールでのデータナビゲーションコマンド「終える」の選択は、１レベル上への、つまり「Ｉｄ１」によって示される第１のモジュールへのナビゲーションを可能にし得、これは、第２のモジュールの「終える」から第１のモジュールの「Ｉｄ１」への矢印を使用して図２Ｃに示される。同様に、図２Ｃの矢印の残りはデータナビゲーション方向を示す。 As shown in Figure 2C, the directed data dependency state graph includes the module's identifier, the data associated with the module, and the data navigation commands for searching the directed data dependency state graph and accessing each module. may contain the data endpoints needed for the In Figure 2C, the arrow mark indicates the direction of data navigation, for example, selecting the data navigation command "Deep Deep" in the first module indicated by "Id1" is followed by one level down, i.e. by "Id2". This may enable navigation to the second module shown, which is illustrated in FIG. 2C using an arrow from "Dig Deep" in the first module to "Id2" in the second module. Similarly, selection of the data navigation command "Exit" in the second module indicated by "Id2" may enable navigation up one level, i.e. to the first module indicated by "Id1"; This is illustrated in Figure 2C using the arrow from "Finish" in the second module to "Id1" in the first module. Similarly, the remainder of the arrows in FIG. 2C indicate data navigation directions.

さらに、メタデータ生成モジュール２２３は、サービスとしてのデータを提供するために、有向データ依存状態図及び所定のデータナビゲーションスキーマに基づいて第１のメタデータ２０７を生成し得る。言い換えると、有向データ依存状態図に示される学習されたパラメータに従って、所定のナビゲーションスキーマによって提供されるレイアウトに表される履歴データは、メタデータ生成モジュール２２３によって生成される第１のメタデータ２０７であってよい。図２Ｄは、例えば、「モジュールのための関係性スキーマ」、「アレイ」としてのデータ表現のタイプ、並びにデータ接続及びアクションを表すリンクを含む、例示的な第１のメタデータを示す。図２Ｄに示される例示的なリンクの１つは、「関連する」としての関係、「／ｍｏｄｅｌ」としてのデータエンドポイント、及び「取得」としての方法を示し、これは、データエンドポイント「／ｍｏｄｅｌ」に格納された指定されたモジュールのデータが、補正され、取り出される必要があることを意味する。同様に、図２Ｄに示される別の例示的なリンクは、「要約」としての関係、「／ａｇｇｒｅｇａｔｉｏｎ」としてのデータポイント、及び「取得」としての方法」を示し、これは、データエンドポイント「／ａｇｇｒｅｇａｔｉｏｎ」に格納された指定されたモジュールのデータが要約され、取り出される必要があることを意味する。 Additionally, metadata generation module 223 may generate first metadata 207 based on a directed data dependency state diagram and a predetermined data navigation schema to provide data as a service. In other words, according to the learned parameters shown in the directed data dependency state diagram, the historical data represented in the layout provided by the predetermined navigation schema is the first metadata 207 generated by the metadata generation module 223. It may be. FIG. 2D shows example first metadata, including, for example, a "relationship schema for modules," a type of data representation as an "array," and links representing data connections and actions. One of the example links shown in FIG. 2D shows a relationship as "related", a data endpoint as "/model", and a method as "get", which is related to the data endpoint "/model". This means that the data of the specified module stored in "model" needs to be corrected and retrieved. Similarly, another exemplary link shown in FIG. 2D shows a relationship as 'summary', a data point as '/aggregation', and a 'method' as 'get', which connects the data endpoint ' /aggregation" means that the data of the specified module stored in "/aggregation" needs to be summarized and retrieved.

一般に、未処理データは、１つ以上のデータソースからストレージ領域に蓄積されたデータである。上記に説明されるように、パラメータ、関係、依存状態などを識別することによって、メタデータ生成モジュール２２３と関連する変換モジュール２２１は、未処理データを使いやすい形式、つまり意味があり有用な形式のデータに変換し、このようにしてサービスとしてのデータを提供し得る。 Generally, raw data is data that has been accumulated in a storage area from one or more data sources. As explained above, by identifying parameters, relationships, dependencies, etc., the metadata generation module 223 and associated transformation module 221 convert the raw data into a consumable format, i.e., a meaningful and useful format. data and thus provide data as a service.

いくつかの実施形態では、スナップショット作成モジュール２２５は、１つ以上のデータ処理アクションを実行しながら、１つ以上の状態スナップショットを作成し得る。一例として、１つ以上のデータ処理アクションは、サンプリング、フィルトレーション、アグリゲーションなどであってよい。いくつかの実施形態では、サービスとしてのデータを提供すると、ＰＧＳ１０５は、１つ以上のデータ処理アクションを実行するために必要なプロセスデータを取り出すために、ユーザからデータクエリを受け取り得る。いくつかの実施形態では、データクエリは元のデータ形式であってよい。スナップショット作成モジュール２２５は、受け取られたデータクエリを元のデータ形式からデータエンドポイントを示す検索可能なインデックス形式に変換し得る。検索可能なインデックス形式に変換すると、スナップショット作成モジュール２２５は、それぞれのデータエンドポイントからのデータクエリに対応するプロセスデータを識別するために履歴データを詳しく検討し得（サービスとしてのデータとして提供し）データリポジトリ１０３内のプロセスデータの、ストレージ識別子及び場所とともにプロセスデータを提供し得る。いくつかの実施形態では、プロセスデータは、１つ以上のデータ処理アクションを実行するために必要とされる履歴データのサブセットであってよい。 In some embodiments, snapshot creation module 225 may create one or more state snapshots while performing one or more data processing actions. As an example, one or more data processing actions may be sampling, filtration, aggregation, etc. In some embodiments, providing data as a service allows PGS 105 to receive data queries from users to retrieve process data necessary to perform one or more data processing actions. In some embodiments, the data query may be in the original data format. Snapshot creation module 225 may convert the received data query from its original data format to a searchable index format that represents data endpoints. Once converted to a searchable index format, the snapshot creation module 225 may mine the historical data (provided as data-as-a-service) to identify process data that corresponds to data queries from each data endpoint. ) may provide the process data along with a storage identifier and location of the process data within the data repository 103; In some embodiments, process data may be a subset of historical data needed to perform one or more data processing actions.

例えば、以下に示す通りである元の形式のデータエントリを検討する。
．ｃｓｖ，ｄａｔａデータ１，範囲＞５０＜１００ For example, consider a data entry in the original format as shown below.
．． csv, data data 1, range>50<100

元の形式から検索可能なインデックス形式に変換されたデータクエリは、以下に示す通りであってよい。
ｈｒｅｆ：／／ｍｏｄｕｌｅ１，ｐａｒａｍｅｔｅｒ，ｃｏｎｓｔｒａｉｎｔ The data query converted from its original format to a searchable index format may be as shown below.
href://module1, parameter, constraint

上記の検索可能なインデックス形式で、パラメータは「データ１」であり、制約は、パラメータが、５０を超えているが、１００未満である範囲内になければならないことである。したがって、要件はデータ要件（ＤＲ）と見なされ、ユーザ１１３への出力として提供されるプロセスデータは、データ範囲によって示される制約内のデータエンドポイントｈｒｅｆ：／／ｍｏｄｕｌｅ１によって参照されるストレージ場所からのデータであるであろう。一例として、この例でＰＧＳ１０５によってデータリポジトリ１０３に提供される、上述のように出力プロセスデータを取り出すためのクエリは、以下のように通りであろう。
ＤＲ：範囲＞５０＜１００を戻すためのデータ１
ストレージＩｄ：＜３２４８５６＞
場所：／ｄａｔａ／ｍｏｄｕｌｅ／？ｍ１ In the above searchable index format, the parameter is "data 1" and the constraint is that the parameter must be in a range greater than 50 but less than 100. Therefore, the requirement is considered a data requirement (DR) and the process data provided as output to the user 113 is from the storage location referenced by the data endpoint href://module1 within the constraints indicated by the data range. It will be data. As an example, a query to retrieve the output process data as described above, provided by PGS 105 to data repository 103 in this example, would be as follows.
DR: Data 1 to return range > 50 < 100
Storage Id: <324856>
Location: /data/module/? m1

さらに、データクエリに対応するプロセスデータを提供すると、スナップショット作成モジュール２２５は、データリポジトリ１０３内の履歴データの状態、データクエリ、プロセスデータ、及びプロセスデータと、プロセスデータが使用される対応するデータアクションとの関係性を格納するために１つ以上の状態スナップショットを作成し得る。いくつかの実施形態では、１つ以上のスナップショットは、データクエリに基づいて１つ以上のデータ処理アクションを実行するために必要とされるプロセスデータを取り出した後、又は取り出しながら、データリポジトリ内の履歴データの状態を理解するために必要とされる場合がある。いくつかの実施形態では、１つ以上の状態スナップショットは、スナップショットデータ２０９として格納され得る。 Additionally, upon providing the process data corresponding to the data query, the snapshot creation module 225 determines the state of the historical data in the data repository 103, the data query, the process data, and the corresponding data for which the process data is used. One or more state snapshots may be created to store relationships with actions. In some embodiments, one or more snapshots are taken within a data repository after or while retrieving process data needed to perform one or more data processing actions based on a data query. may be required to understand the state of historical data. In some embodiments, one or more state snapshots may be stored as snapshot data 209.

いくつかの実施形態では、シーケンス構築モジュール２２７は、対応するデータ処理アクションの各々と関連付けられた１つ以上の中間ステップの各々のアクション実行シーケンスを構築し得る。かかるアクション実行シーケンスを構築するために、最初に、シーケンス構築モジュール２２７は、ユーザ１１３から要求を受け取って、プロセスデータに対して１つ以上のデータ処理アクションを実行し得る。一例として、１つ以上のデータ処理アクションは、サンプリング、フィルトレーション、アグリゲーション、ロードなどに関連してよい。いくつかの実施形態では、１つ以上のデータ処理アクションを実行しながら、シーケンス構築モジュール２２７は、１つ以上のデータ処理アクションの各々に関与する１つ以上の中間ステップの各々のためにステップ識別子を作成し得る。ステップ識別子は、１つ以上の処理アクションの各中間ステップに、それらが実行される順序で与えられる数字、アルファベット、又は英数字コードであってよい。その後、メタデータ生成モジュール２２３は、対応するデータ処理アクション、ステップ識別子、及びプロセスデータに基づいて、１つ以上の中間ステップの各々のために第２のメタデータ２１１を生成し得る。いくつかの実施形態では、第２のメタデータ２１１は、連続した流れで、１つ以上の中間ステップの各々を実行する際に関与する手順、アクション、参照されるモジュール、依存状態、関係性などのコード化されたレコードを示し得る。このようにして生成された第２のメタデータ２１１は、メモリ１１９に格納され得る。第２のメタデータ２１１を生成すると、シーケンス構築モジュール２２７は、第２のメタデータ２１１、及び所定の関係性スキーマで定義された流れに基づいてアクション実行シーケンスを構築し得る。ユーザ１１３から受け取られた１つ以上のデータ処理アクションを実行する各要求のためにこのようにして構築されたアクション実行シーケンスは、アクションシーケンスデータ２１３として格納され得る。図２Ｅは、図２Ｅに示されるように例示的な第２のメタデータ２１１に基づいて構築された例示的なアクション実行シーケンスを示す。さらに、図２Ｆは、データサンプリング、データアグリゲーション、データフィルトレーション、モデル構築、ローダなどの例示的なデータ処理アクションの間の流れを示す例示的な所定の関係性スキーマを示す。図２Ｆのこれらの例示的なデータ処理アクションの各々の間の矢印マークは、異なるデータ処理アクション間の関係性を示す。一例として、データサンプリングとモデルとの間の矢印「関連する」は、サンプリングされたデータが所定のモデルと相互に関連付けられることを示し、モデルとデータアグリゲーションとの間の矢印「要約」は、モデルが、要約されたアグリゲーションデータをフェッチしていることを示す場合がある。かかる所定の関係性スキーマを使用して、シーケンス構築モジュール２２７は、図２Ｅに示されるようにアクション実行シーケンスを生成し得る。 In some embodiments, sequence construction module 227 may construct an action performance sequence for each of the one or more intermediate steps associated with each corresponding data processing action. To construct such an action execution sequence, sequence construction module 227 may first receive a request from user 113 to perform one or more data processing actions on process data. As an example, one or more data processing actions may involve sampling, filtration, aggregation, loading, etc. In some embodiments, while performing one or more data processing actions, sequence construction module 227 identifies a step identifier for each of the one or more intermediate steps involved in each of the one or more data processing actions. can be created. A step identifier may be a numeric, alphabetic, or alphanumeric code given to each intermediate step of one or more processing actions in the order in which they are performed. Metadata generation module 223 may then generate second metadata 211 for each of the one or more intermediate steps based on the corresponding data processing action, step identifier, and process data. In some embodiments, the second metadata 211 includes, in a continuous flow, the steps, actions, referenced modules, dependencies, relationships, etc. involved in performing each of the one or more intermediate steps. may indicate an encoded record. The second metadata 211 generated in this manner may be stored in the memory 119. Upon generating the second metadata 211, the sequence construction module 227 may construct an action execution sequence based on the second metadata 211 and the flow defined in the predetermined relationship schema. The action execution sequence thus constructed for each request to perform one or more data processing actions received from user 113 may be stored as action sequence data 213. FIG. 2E shows an example action execution sequence constructed based on the example second metadata 211 as shown in FIG. 2E. Further, FIG. 2F depicts an example predetermined relationship schema that illustrates the flow between example data processing actions such as data sampling, data aggregation, data filtration, model building, loader, and the like. The arrow marks between each of these example data processing actions in FIG. 2F indicate the relationship between the different data processing actions. As an example, the arrow "Associate" between data sampling and model indicates that the sampled data is correlated with a given model, and the arrow "Summary" between model and data aggregation indicates that the model may indicate that it is fetching summarized aggregation data. Using such a predetermined relationship schema, sequence construction module 227 may generate an action execution sequence as shown in FIG. 2E.

いくつかの実施形態では、メタデータ生成モジュール２２３は、１つ以上のデータ処理アクションの各々のために第３のメタデータ２１５を生成し得る。第３のメタデータ２１５は、対応するデータ処理アクションの１つ以上の中間ステップ、１つ以上の中間ステップのために生成された第２のメタデータ２１１、第２のメタデータ２１１に対応する第１のメタデータ２０７に対する参照、１つ以上の中間ステップの各々のアクション実行シーケンス、各アクション実行シーケンスに対応する識別子（ＩＤ）、１つ以上の中間ステップの各々及び対応するデータ処理アクションと関連付けられた所定のデータクエリ、再利用パッケージのタイプ、並びにパッケージ識別子（ＩＤ）を含む場合があるが、これらに限定されない。いくつかの実施形態では、第３のメタデータ２１５は、第３のメタデータ２１５のコンテンツとして上述されたすべての情報を使用して生成され得る。いくつかの実施形態では、第３のメタデータ２１５は、例示的な再利用パッケージ「Ｐ－０１」を設計／構築するデータを示し得る。一例として、図２Ｇの例示的な第３のメタデータは、「Ｐ－０１」がビジネス要件（ＢＲ）「支払額解析に基づいたカスタマ分類」に関連することを示す。この再利用パッケージ「Ｐ－０１」の中には、中間ステップがあり、例示的な第３のメタデータは、各中間ステップの「Ｓ１」、「Ｓ３」、「Ｓ４」などのステップＩｄを示す。いくつかの実施形態では、ステップＩｄは、アクション実行シーケンスを生成するプセス中に各中間ステップに割り当てられた。いくつかの実施形態では、各中間ステップは、特定のタイプの要件に関連する場合がある。一例として、図２Ｇで、中間ステップ「サンプリング」の１つは、「支払額の減少傾向を有するカスタマセグメントを取得」及び「支払額の増加傾向を有するカスタマセグメントを取得」などのデータ要件（ＤＲ）に関連する。同様に、中間ステップ「モデル」は、「毎月Ｒｓ．５０，０００／－を超えて支払額を増加させると予測されたカスタマセグメントを取得」及び「毎月Ｒｓ．５，０００／－未満に支払額を減少させると予測されたカスタマセグメントを取得」などの解析要件（ＡＲ）に関連する。 In some embodiments, metadata generation module 223 may generate third metadata 215 for each of the one or more data processing actions. The third metadata 215 includes one or more intermediate steps of the corresponding data processing action, second metadata 211 generated for the one or more intermediate steps, and third metadata 215 that corresponds to the second metadata 211. 1, an action execution sequence for each of the one or more intermediate steps, an identifier (ID) corresponding to each action execution sequence, an identifier (ID) associated with each of the one or more intermediate steps and the corresponding data processing action. may include, but are not limited to, predetermined data queries, types of reuse packages, and package identifiers (IDs). In some embodiments, third metadata 215 may be generated using all the information described above as content of third metadata 215. In some embodiments, third metadata 215 may indicate data that designs/builds example reuse package "P-01." As an example, the exemplary third metadata of FIG. 2G indicates that "P-01" is associated with the business requirement (BR) "Customer Classification Based on Payment Analysis." Within this reuse package "P-01" there are intermediate steps, and the exemplary third metadata indicates step Ids such as "S1", "S3", "S4", etc. of each intermediate step. . In some embodiments, a step Id was assigned to each intermediate step during the process of generating the action execution sequence. In some embodiments, each intermediate step may be associated with a particular type of requirement. As an example, in Figure 2G, one of the intermediate steps "Sampling" includes data requirements (DR )is connected with. Similarly, the intermediate steps ``Model'' include ``Obtain customer segments predicted to increase their payments by more than Rs. related to analysis requirements (ARs) such as ``Obtain customer segments predicted to decrease

いくつかの実施形態では、メタデータ生成モジュール２２３によって生成されたかかる第３のメタデータ２１５を使用し、パッケージ生成モジュール２２９は、複数の再利用パッケージを生成し得る。いくつかの実施形態では、複数の再利用パッケージは、データ処理ステップに関与する各中間ステップのために生成され得、かかるパッケージは、ビジネス要件、奇跡要件、データ要件などの異なる要件とも関連付けられ得る。このようにして生成された複数の再利用パッケージは、再利用パッケージデータ２１７として格納され得る。いくつかの実施形態では、複数の再利用パッケージは、次いでＰＧＳ１０５と関連付けられたパッケージリポジトリ１０７に格納され得る。 In some embodiments, using such third metadata 215 generated by metadata generation module 223, package generation module 229 may generate multiple reuse packages. In some embodiments, multiple reuse packages may be generated for each intermediate step involved in the data processing step, and such packages may also be associated with different requirements, such as business requirements, miracle requirements, data requirements, etc. . A plurality of reusable packages generated in this manner can be stored as reusable package data 217. In some embodiments, multiple reuse packages may then be stored in a package repository 107 associated with PGS 105.

図２Ｈは、本開示のいくつかの実施形態に従ってモデルライフサイクル管理のために再利用解析を実行するための再利用解析システム（ＲＡＳ）の詳細なブロック図を示す。 FIG. 2H shows a detailed block diagram of a reuse analysis system (RAS) for performing reuse analysis for model lifecycle management in accordance with some embodiments of the present disclosure.

いくつかの実施態様では、ＲＡＳ１０９は、データ２４３及びモジュール２４５を含み得る。一例として、ＲＡＳ１０９のデータ２４３は、図２Ｈに示されるようにＲＡＳ１０９に構成されるＲＡＳメモリ１２５に記憶される。一実施形態では、ＲＡＳ１０９のデータ２４３は、要件データ２４７、構成データ２４９、及び候補再利用パッケージデータ２５１、及び他のデータ２５３を含み得る。図示されている図２Ａでは、ＲＡＳ１０９のモジュール２４５が本明細書に詳細に説明される。 In some implementations, RAS 109 may include data 243 and modules 245. As an example, data 243 of RAS 109 is stored in RAS memory 125 configured in RAS 109 as shown in FIG. 2H. In one embodiment, data 243 in RAS 109 may include requirements data 247, configuration data 249, and candidate reuse package data 251, and other data 253. 2A, module 245 of RAS 109 is described in detail herein.

いくつかの実施形態では、ＲＡＳ１０９のデータ２４３は、ＲＡＳ１０９のＲＡＳメモリ１２５にさまざまなデータ構造の形式で格納され得る。さらに、ＲＡＳ１０９のデータ２４３は、リレーショナルデータモデル又は階層データモデルなど、データモデルを使用して編成できる。いくつかの実施形態では、他のデータ２５３は、ＲＡＳ１０９の様々な機能を実行するためのＲＡＳ１０９のモジュール２４５によって生成された一時データ及び一時ファイルを含むデータを格納し得る。 In some embodiments, RAS 109 data 243 may be stored in RAS memory 125 of RAS 109 in the form of various data structures. Additionally, data 243 in RAS 109 can be organized using a data model, such as a relational data model or a hierarchical data model. In some embodiments, other data 253 may store data including temporary data and temporary files generated by modules 245 of RAS 109 to perform various functions of RAS 109.

いくつかの実施形態では、ＲＡＳメモリ１２５に格納されたＲＡＳ１０９のデータ２４３は、ＲＡＳ１０９のモジュール２０５によって処理され得る。モジュール２４５は、ＲＡＳ１０９のＲＡＳメモリ１２５の中に格納され得る。一例では、ＲＡＳ１０９で構成されたＰＧＳプロセッサ１１５に通信で結合されたモジュール２４５は、図２Ｈに示されるように、ＲＡＳメモリ１２５の外に存在し、ハードウェアとして実装され得る。本明細書で使用されるように、用語モジュールは、特定用途向け集積回路（ＡＳＩＣ）、電子回路、１つ以上のソフトウェアプログラム又はファームウェアプログラムを実行するプロセッサ（共用、専用、又はグループ）及びメモリ、組合せ論理回路、及び／又は説明される機能を提供する他の適切な構成要素を指す。 In some embodiments, RAS 109 data 243 stored in RAS memory 125 may be processed by RAS 109 module 205. Module 245 may be stored within RAS memory 125 of RAS 109. In one example, module 245 communicatively coupled to PGS processor 115 configured with RAS 109 resides outside of RAS memory 125 and may be implemented as hardware, as shown in FIG. 2H. As used herein, the term module refers to an application specific integrated circuit (ASIC), an electronic circuit, a processor (shared, dedicated, or group) that executes one or more software or firmware programs, and memory; Refers to combinational logic circuits and/or other suitable components that provide the described functionality.

一実施形態では、ＲＡＳ１０９のモジュール２４５は、例えば、受取りモジュール２５５、構成決定モジュール２５７、再利用パッケージ決定モジュール２５９、及び他のモジュール２６１を含み得る。他のモジュール２６１は、ＲＡＳ１０９の様々な多岐にわたる機能を実行するために使用され得る。ＲＡＳ１０９のかかる上述のモジュール２４５は、単一のモジュール又は異なるモジュールの組合せとして表され得ることを理解されたい。 In one embodiment, modules 245 of RAS 109 may include, for example, a receiving module 255, a configuration determination module 257, a reuse package determination module 259, and other modules 261. Other modules 261 may be used to perform a variety of diverse functions of RAS 109. It should be understood that such above-described modules 245 of RAS 109 may be represented as a single module or a combination of different modules.

いくつかの実施形態では、受取りモジュール２５５は、ユーザ１１３のユーザデバイス１１１から入力要件を受け取り得る。ユーザ１１３からこのようにして受け取られた入力要件は、要件データ２４７として格納され得る。入力要件は、ＲＡＳ１０９及びパッケージ生成システム（ＰＧＳ）１０５と関連付けられたパッケージリポジトリ１０７に格納された複数の再利用パッケージから対応する再利用パッケージを決定することを容易にし得る。いくつかの実施形態では、本開示の図２Ｈの説明の下で説明されるように、ＰＧＳ１０５は、１つ以上のデータ処理アクションの各々と関連付けられた１つ以上の中間ステップの各々のために複数の再利用パッケージを生成し得る。いくつかの実施形態では、入力要件は、ビジネス要件（ＢＲ）、解析要件（ＡＲ）、及びデータ要件（ＤＲ）の少なくとも１つに関連してよい。いくつかの実施形態では、ＢＲはユーザの高レベルでの要件又は目的であってよく、ＡＲは、ユーザ１１３に必要とされる結果を提供するためにデータ解析を実行することを必要とする特定のＢＲ以下の要件であってよく、ＤＲは、様々なデータ処理活動を実行するためにデータをフェッチすること又は準備することを必要とする特定のＢＲ及びＡＲ以下の要件であってよい。一例として、ＢＲは、「支払額解析に基づいたカスタマ分類」である場合があり、かかるＢＲに関連するＡＲは、「毎月Ｒｓ．５，０００／－未満に支払額を減少させると予測されたカスタマセグメントを取得」である場合があり、かかるＢＲ及びＡＲのためのＤＲは、「カスタマテーブルから最近の３年間のカスタマ支払額データを取得」である場合がある。 In some embodiments, receiving module 255 may receive input requirements from user device 111 of user 113. Input requirements received in this manner from user 113 may be stored as requirements data 247. The input requirements may facilitate determining a corresponding reuse package from a plurality of reuse packages stored in a package repository 107 associated with RAS 109 and package generation system (PGS) 105. In some embodiments, the PGS 105 performs a process for each of the one or more intermediate steps associated with each of the one or more data processing actions, as described below in the description of FIG. 2H of this disclosure. Multiple reuse packages can be generated. In some embodiments, input requirements may relate to at least one of business requirements (BR), analysis requirements (AR), and data requirements (DR). In some embodiments, the BR may be a high-level requirement or objective of the user, and the AR may be a specific requirement or objective that requires data analysis to be performed to provide the user 113 with the required results. The DR may be a requirement below a particular BR and AR that requires fetching or preparing data to perform various data processing activities. As an example, a BR may be "Customer Classification Based on Payment Analysis" and the AR associated with such BR may be "Customer Classification Based on Payment Analysis" and the AR associated with such BR may be "Customer Classification Based on Payment Amount Analysis" and the AR associated with such BR may be "Customer Classification Based on Payment Amount Analysis" and the AR associated with such BR may be "Customer Classification Based on Payment Amount Analysis" and the AR associated with such BR may be "Customer Classification Based on Payment Amount Analysis" and the AR associated with such BR may be "Customer Classification Based on Payment Amount Analysis" and the AR associated with such BR may be "Customer Classification Based on Payment Amount Analysis"; The DR for such BRs and ARs may be "Get the last three years of customer payment data from the customer table."

いくつかの実施形態では、構成決定モジュール２５７は、入力要件に適切である再利用パッケージのタイプに対応する再利用解析再構成を決定し得る。いくつかの実施形態では、再利用解析構成を決定することは、入力要件がＢＲ、ＡＲ、又はＤＲの１つに属するかどうかを判断することを含み得る。いくつかの実施形態では、構成決定モジュール２５７は、１つ以上の分類技術を使用して、入力要件を再利用パッケージのタイプにマッピングすることによって再利用解析構成を決定し得る。一例として、１つ以上の分類技術は、潜在的意味解析技術、話題分析技術などを含む場合があるが、これらに限定されない。いくつかの実施形態では、再利用パッケージのタイプは、ＡＲに関連する再利用パッケージ、ＤＲに関連する再利用パッケージ、ＢＲに関連する再利用パッケージ、ＡＲ、ＢＲ、及びＤＲの組合せに関連する再利用パッケージなどであってよい。このようにして決定された再利用解析構成は、構成データ２４９として格納され得る。 In some embodiments, configuration determination module 257 may determine a reuse analysis reconfiguration that corresponds to a type of reuse package that is appropriate to the input requirements. In some embodiments, determining the reuse analysis configuration may include determining whether the input requirement belongs to one of BR, AR, or DR. In some embodiments, configuration determination module 257 may determine reuse analysis configurations by mapping input requirements to types of reuse packages using one or more classification techniques. By way of example and not limitation, the one or more classification techniques may include latent semantic analysis techniques, topic analysis techniques, and the like. In some embodiments, the types of reuse packages include reuse packages associated with AR, reuse packages associated with DR, reuse packages associated with BR, reuse packages associated with a combination of AR, BR, and DR. It may be a usage package, etc. The reuse analysis configuration determined in this manner may be stored as configuration data 249.

さらに、再利用パッケージ決定モジュール２５９は、再利用解析構成に基づいて、複数の再利用パッケージと関連付けられた１つ以上の事前に格納されたメタデータを用いて入力要件の意味相似解析を実行することによって、複数の再利用パッケージから１つ以上の候補再利用パッケージを決定し得る。いくつかの実施形態では、１つ以上の候補再利用パッケージは、その１つ以上の事前に格納されたメタデータが入力要件と意味論的に一致する再利用パッケージであり得る。このようにして決定された１つ以上の候補再利用パッケージは、候補再利用パッケージデータ２５１として格納され得る。１つ以上の事前に格納されたメタデータは、第１のメタデータ、第２のメタデータ、及び第３のメタデータの少なくとも１つであってよい。いくつかの実施形態では、再利用パッケージ決定モジュール２５９は、１つ以上の自然言語処理（ＮＬＰ）技術を使用して、１つ以上の事前に格納されたメタデータを用いて入力要件の意味相似解析を決定し得る。一例として、１つ以上のＮＬＰ技術は、単語出現頻度－逆文書頻度（ＴＦ－ＩＤＦ）技術、潜在的意味解析（ＬＳＡ）技術、及びＮＬＰの分野での他の類似した技術を含む場合があるが、これらに限定されない。さらに、いくつかの実施形態では、再利用パッケージ決定モジュール２５９は、１つ以上の候補再利用パッケージに、入力要件と１つ以上の事前に格納されたメタデータとの間の意味論的一致の程度に基づいたスコアを割り当て得る。かかるスコアは、ユーザ１１３が、使用するための適切な候補再利用パッケージを選択するのに役立つ場合がある。一例として、ユーザ１１３は、事前に設定された閾値などを超える意味論的一致を有するスコアに基づいて、最高スコアに基づいた適切な候補再利用パッケージを選択し得る。 Further, the reuse package determination module 259 performs a semantic similarity analysis of the input requirements using one or more pre-stored metadata associated with the plurality of reuse packages based on the reuse analysis configuration. One or more candidate reuse packages may be determined from the plurality of reuse packages. In some embodiments, the one or more candidate reuse packages may be reuse packages whose one or more pre-stored metadata semantically match the input requirements. One or more candidate reuse packages determined in this manner may be stored as candidate reuse package data 251. The one or more pre-stored metadata may be at least one of first metadata, second metadata, and third metadata. In some embodiments, reuse package determination module 259 uses one or more natural language processing (NLP) techniques to determine semantic similarity of input requirements using one or more pre-stored metadata. analysis can be determined. As an example, the one or more NLP techniques may include word frequency-inverse document frequency (TF-IDF) techniques, latent semantic analysis (LSA) techniques, and other similar techniques in the field of NLP. However, it is not limited to these. Additionally, in some embodiments, the reuse package determination module 259 provides one or more candidate reuse packages with semantic matches between the input requirements and the one or more pre-stored metadata. A grade-based score may be assigned. Such scores may assist user 113 in selecting appropriate candidate reuse packages for use. As an example, the user 113 may select an appropriate candidate reuse package based on the highest score, based on the score having a semantic match that exceeds a preset threshold, etc.

一例として、以下の表１に示されるように要件を検討する。 As an example, consider the requirements as shown in Table 1 below.

１つ以上の分類技術を使用して、ＲＡＳ１０９は、以下の表２に示されるように入力要件の再利用解析構成を決定し得る。 Using one or more classification techniques, RAS 109 may determine a reuse analysis configuration of the input requirements as shown in Table 2 below.

入力要件の再利用解析構成は、ＢＲであると判断されるので、ＲＡＳ１０９は、タイプＢＲに関連する複数の再利用パッケージの１つ以上の事前に格納されたメタデータを用いて入力要件の意味相似解析を実行し得る。 Since the reuse analysis configuration of the input requirement is determined to be a BR, the RAS 109 analyzes the meaning of the input requirement using pre-stored metadata of one or more of the plurality of reuse packages associated with type BR. Similarity analysis can be performed.

一例として、タイプＢＲに関連する複数の再利用パッケージが以下の表３に示される通りであると考える。 As an example, consider that the multiple reuse packages associated with type BR are as shown in Table 3 below.

意味相似解析を実行すると、ＲＡＳ１０９は、以下の表４に示されるように１つ以上の候補再利用パッケージ及び対応するスコアを決定し得る。
Upon performing the semantic similarity analysis, RAS 109 may determine one or more candidate reuse packages and corresponding scores as shown in Table 4 below.

上記表４によれば、要件ＩＤ１、２、及び３を有する入力要件は、それぞれ９９％、９０％、及び７８％のスコアを有する再利用パッケージＰ１、Ｐ３、及びＰｎの事前に格納されたメタデータとそれぞれ意味論的に一致する。したがって、Ｐ１は、要件ＩＤ１の入力要件のための候補再利用パッケージであり、Ｐ２は、要件ＩＤ２の入力要件のための候補再利用パッケージであり、Ｐｎは、要件ＩＤ３の入力要件の候補再利用パッケージである。 According to Table 4 above, input requirements with requirement IDs 1, 2, and 3 are pre-stored metadata of reuse packages P1, P3, and Pn with scores of 99%, 90%, and 78%, respectively. Each semantically matches the data. Therefore, P1 is a candidate reuse package for the input requirement with requirement ID1, P2 is a candidate reuse package for the input requirement with requirement ID2, and Pn is a candidate reuse package for the input requirement with requirement ID3. It's a package.

同様に、一例として、以下の表５に示されるように要件を検討する。 Similarly, by way of example, consider the requirements as shown in Table 5 below.

１つ以上の分類技術を使用して、ＲＡＳ１０９は、以下の表６に示されるように入力要件の再利用解析構成を決定し得る。 Using one or more classification techniques, RAS 109 may determine a reuse analysis configuration of the input requirements as shown in Table 6 below.

入力要件の再利用解析構成がＡＲであると判断されるので、ＲＡＳ１０９は、タイプＡＲに関連する複数の再利用パッケージの１つ以上の事前に格納されたメタデータを用いて入力要件の意味相似解析を実行し得る。 Since the reuse analysis configuration of the input requirement is determined to be an AR, the RAS 109 analyzes the semantic similarity of the input requirement using pre-stored metadata of one or more of the plurality of reuse packages associated with type AR. Analysis can be performed.

一例として、タイプＡＲに関連する複数の再利用パッケージが以下の表７に示される通りであると考える。 As an example, consider that the multiple reuse packages associated with type AR are as shown in Table 7 below.

意味相似解析を実行すると、ＲＡＳ１０９は、以下の表８に示されるように１つ以上の候補再利用パッケージ及び対応するスコアを決定し得る。 Upon performing the semantic similarity analysis, RAS 109 may determine one or more candidate reuse packages and corresponding scores as shown in Table 8 below.

上記表８によれば、要件ＩＤ１及び２を有する入力要件は、それぞれ９９％及び９０％のスコアを有する再利用パッケージＰ１及びＰ３の事前に格納されたメタデータと意味論的にそれぞれ一致する。したがって、Ｐ１は、要件ＩＤ１の入力要件の候補再利用パッケージであり、Ｐ３は、要件ＩＤ２の入力要件の候補再利用パッケージである。 According to Table 8 above, the input requirements with requirement IDs 1 and 2 semantically match the pre-stored metadata of reuse packages P1 and P3 with scores of 99% and 90%, respectively. Therefore, P1 is a candidate reuse package for the input requirement of requirement ID1, and P3 is a candidate reuse package for the input requirement of requirement ID2.

同様に、一例として、以下の表９に示されるように要件を検討する。 Similarly, by way of example, consider the requirements as shown in Table 9 below.

１つ以上の分類技術を使用して、ＲＡＳ１０９は、以下の表１０に示されるように入力要件の再利用解析構成を決定し得る。 Using one or more classification techniques, RAS 109 may determine a reuse analysis configuration for the input requirements as shown in Table 10 below.

入力要件の再利用解析構成がＤＲであると判断されるので、ＲＡＳ１０９は、タイプＤＲに関連する複数の再利用パッケージの１つ以上の事前に格納されたメタデータを用いて入力要件の意味相似解析を実行し得る。 Since the reuse analysis configuration of the input requirement is determined to be a DR, the RAS 109 analyzes the semantic similarity of the input requirement using pre-stored metadata of one or more of the plurality of reuse packages associated with type DR. Analysis can be performed.

一例として、タイプＤＲに関連する複数の再利用パッケージが以下の表１１に示される通りであると考える。 As an example, consider that the multiple reuse packages associated with type DR are as shown in Table 11 below.

意味相似解析を実行すると、ＲＡＳ１０９は、以下の表１２に示されるように１つ以上の候補再利用パッケージ及び対応するスコアを決定し得る。 Upon performing the semantic similarity analysis, RAS 109 may determine one or more candidate reuse packages and corresponding scores as shown in Table 12 below.

上記表１２によれば、要件ＩＤ１及び２を有する入力要件は、それぞれ９９％及び９０％のスコアを有する再利用パッケージＰ１及びＰ３の事前に格納されたメタデータと意味論的に一致する。したがって、Ｐ１は、要件ＩＤ１の入力要件の候補再利用パッケージであり、Ｐ３は、要件ＩＤ２の入力要件の候補再利用パッケージである。 According to Table 12 above, the input requirements with requirement IDs 1 and 2 semantically match the pre-stored metadata of reuse packages P1 and P3 with scores of 99% and 90%, respectively. Therefore, P1 is a candidate reuse package for the input requirement of requirement ID1, and P3 is a candidate reuse package for the input requirement of requirement ID2.

図３Ａは、本開示のいくつかの実施形態に従って、複数の再利用パッケージを生成する方法を示すフローチャートを示す。 FIG. 3A depicts a flowchart illustrating a method of generating multiple reuse packages in accordance with some embodiments of the present disclosure.

図３Ａに図示されるように、方法３００ａは、複数の再利用パッケージを生成する方法を示す１つ以上のブロックを含む。方法３００ａは、コンピュータ実行可能命令の一般的な文脈で説明され得る。一般に、コンピュータ実行可能命令は、機能を実行する、又は抽象データタイプを実装するルーチン、プログラム、オブジェクト、構成要素、データ構造、手順、モジュール、及び機能を含む可能性がある。 As illustrated in FIG. 3A, method 300a includes one or more blocks that illustrate how to generate multiple reuse packages. Method 300a may be described in the general context of computer-executable instructions. Generally, computer-executable instructions may include routines, programs, objects, components, data structures, procedures, modules, and functions that perform functions or implement abstract data types.

方法３００ａが説明される順序は、制限として解釈されることを意図しておらず、説明される方法ブロックの任意の数は、方法３００ａを実装するために任意の順序で組み合わせることができる。さらに、個々のブロックは、本明細書に説明される主題の趣旨及び範囲から逸脱することなく方法から削除され得る。さらに、方法３００ａは、任意の適切なハードウェア、ソフトウェア、ファームウェア、又はそれらの組み合わせで実装することができる。 The order in which method 300a is described is not intended to be construed as a limitation, and any number of method blocks described may be combined in any order to implement method 300a. Additionally, individual blocks may be deleted from the method without departing from the spirit and scope of the subject matter described herein. Additionally, method 300a may be implemented in any suitable hardware, software, firmware, or combinations thereof.

ブロック３０１で、方法は、（ＰＧＳプロセッサ１１５とも呼ばれる）パッケージ生成システム（ＰＧＳ）１０３のプロセッサ１１５により、サービスとしてのデータを提供することを含む。いくつかの実施形態では、ＰＧＳプロセッサ１１５は、履歴データを未処理形式から使いやすい形式に変換することによってサービスとしてのデータを提供する。いくつかの実施形態では、ＰＧＳプロセッサ１１５は、１つ以上のデータソース１０１から、及びＰＧＳ１０５と関連付けられたデータリポジトリ１０３に格納された履歴データを受け取り得る。最初に、ＰＧＳプロセッサ１１５は、履歴データのパラメータ、制約、及び関係性を学習するために履歴データを構文解析し得る。その後、ＰＧＳ１１５は、履歴データの学習されたパラメータ、制約、及び関係性、並びに所定のデータナビゲーションスキーマに基づいて有向データ依存状態図を作成し得る。いくつかの実施形態では、有向データ依存状態図は、データ識別子、履歴データ、データナビゲーションコマンド、及びデータエンドポイントを含み、各データエンドポイントは、アプリケーションプログラミングインタフェース（ＡＰＩ）と関連付けられる。いくつかの実施形態では、履歴データを未処理形式から使いやすい形式に変換しながら、ＰＧＳプロセッサ１１５は、複数の再利用パッケージを生成するために必要とされる第１のメタデータを生成し得る。ＰＧＳプロセッサ１１５は、有向データ依存状態図、及び所定のデータナビゲーションスキーマに基づいて第１のメタデータを生成し得る。 At block 301, the method includes providing data as a service by a processor 115 of a package generation system (PGS) 103 (also referred to as a PGS processor 115). In some embodiments, PGS processor 115 provides data-as-a-service by converting historical data from a raw format to an easy-to-use format. In some embodiments, PGS processor 115 may receive historical data from one or more data sources 101 and stored in a data repository 103 associated with PGS 105. Initially, PGS processor 115 may parse historical data to learn parameters, constraints, and relationships of the historical data. PGS 115 may then create a directed data dependency state diagram based on learned parameters, constraints, and relationships of historical data and a predetermined data navigation schema. In some embodiments, the directed data dependency state diagram includes data identifiers, historical data, data navigation commands, and data endpoints, and each data endpoint is associated with an application programming interface (API). In some embodiments, the PGS processor 115 may generate the first metadata needed to generate multiple reuse packages while converting the historical data from a raw format to a usable format. . PGS processor 115 may generate first metadata based on a directed data dependency state diagram and a predetermined data navigation schema.

ブロック３０３で、方法は、ＰＧＳプロセッサ１１５により、１つ以上のデータ処理アクションを実行しながら、１つ以上の状態スナップショットを作成することを含む。１つ以上の状態スナップショットを作成するために、ＰＧＳプロセッサ１１５は、元のデータ形式でデータクエリを受け取り、データクエリを元のデータ形式からデータエンドポイントを示す検索可能なインデックス形式に変換し得る。さらに、ＰＧＳ１１５は、データリポジトリ１０３内のプロセスデータのストレージ識別子及び場所とともに、それぞれのデータエンドポイントからデータクエリに対応するプロセスデータを取得するために履歴データを詳しく検討し、ユーザ１１３にプロセスデータを提供し得る。プロセスデータを提供すると、ＰＧＳ１１５は、ＰＧＳメモリ１１９内の履歴データの状態、データクエリ、プロセスデータ、及びプロセスデータと、プロセスデータが使用される対応するデータアクションとの関係性を格納し、それによって１つ以上の状態スナップショットを作成する。その後、ＰＧＳ１１５は、対応するデータ処理アクションの各々と関連付けられた１つ以上の中間ステップの各々のアクション実行シーケンスを構築し得る。アクション実行シーケンスを構築するために、ＰＧＳプロセッサ１１５は、ユーザ１１３から受け取られた要求に基づいて、最初にプロセスデータに対して１つ以上のデータ処理アクションを実行し得る。ＰＧＳ１１５は、１つ以上のデータ処理アクションの各々に関与する１つ以上の中間ステップの各々のためにステップ識別子を作成し、対応するデータ処理アクション、ステップ識別子、及びプロセスデータに基づいてアクション実行シーケンスを構築しながら、１つ以上の中間ステップの各々のために第２のメタデータを生成し得る。ＰＧＳプロセッサ１１５は、次いで第２のメタデータ及び所定の関係性スキーマに定義された流れに基づいてアクション実行シーケンスを構築し、各要求のために構築されたアクション実行シーケンスを格納し得る。 At block 303, the method includes creating one or more state snapshots while performing one or more data processing actions by PGS processor 115. To create one or more state snapshots, PGS processor 115 may receive a data query in its original data format and convert the data query from the original data format to a searchable index format that represents data endpoints. . Additionally, the PGS 115 reviews the historical data to retrieve process data corresponding to the data query from each data endpoint, along with the storage identifier and location of the process data within the data repository 103, and provides the process data to the user 113. can be provided. Upon providing process data, PGS 115 stores the state of historical data in PGS memory 119, data queries, process data, and relationships between the process data and the corresponding data actions in which the process data is used, thereby Create one or more state snapshots. PGS 115 may then construct an action execution sequence for each of the one or more intermediate steps associated with each corresponding data processing action. To build an action execution sequence, PGS processor 115 may first perform one or more data processing actions on process data based on requests received from user 113. PGS 115 creates a step identifier for each of the one or more intermediate steps involved in each of the one or more data processing actions, and creates an action execution sequence based on the corresponding data processing action, step identifier, and process data. The second metadata may be generated for each of the one or more intermediate steps while constructing the second metadata. PGS processor 115 may then construct an action execution sequence based on the second metadata and the flow defined in the predetermined relationship schema and store the constructed action execution sequence for each request.

ブロック３０５で、方法は、ＰＧＳプロセッサ１１５により、１つ以上のデータ処理アクションの各々のために第３のメタデータを生成することを含む。いくつかの実施形態では、第３のメタデータは、対応するデータ処理アクションの１つ以上の中間ステップ、１つ以上の中間ステップの各々のために生成された第２のメタデータ、第２のメタデータに対応する第１のメタデータに対する参照、１つ以上の中間ステップの各々のアクション実行シーケンス、各アクション実行シーケンスに対応する識別子（ＩＤ）、１つ以上の中間ステップの各々及び対応するデータ処理アクションと関連付けられた所定のデータクエリ、再利用パッケージのタイプ、並びにパッケージ識別子（ＩＤ）を含む場合があるが、これらに限定されない。 At block 305, the method includes generating third metadata for each of the one or more data processing actions by the PGS processor 115. In some embodiments, the third metadata includes the one or more intermediate steps of the corresponding data processing action, the second metadata generated for each of the one or more intermediate steps, the second a reference to first metadata corresponding to the metadata, an action execution sequence for each of the one or more intermediate steps, an identifier (ID) corresponding to each action execution sequence, each of the one or more intermediate steps and corresponding data. It may include, but is not limited to, predetermined data queries associated with the processing action, the type of reuse package, and a package identifier (ID).

図３Ｂは、本開示のいくつかの実施形態にかかる、モデルライフサイクル管理のために再利用解析を実行する方法を示すフローチャートを示す。 FIG. 3B depicts a flowchart illustrating a method for performing reuse analysis for model lifecycle management, according to some embodiments of the present disclosure.

図３Ｂに図示されるように、方法３００ｂは、モデルライフサイクル管理のために再利用解析を実行する方法を示す１つ以上のブロックを含む。方法３００ｂは、コンピュータ実行可能命令の一般的な文脈で説明され得る。一般に、コンピュータ実行可能命令は、機能を実行する、又は抽象データタイプを実装するルーチン、プログラム、オブジェクト、構成要素、データ構造、手順、モジュール、及び機能を含む可能性がある。 As illustrated in FIG. 3B, method 300b includes one or more blocks that illustrate how to perform reuse analysis for model lifecycle management. Method 300b may be described in the general context of computer-executable instructions. Generally, computer-executable instructions may include routines, programs, objects, components, data structures, procedures, modules, and functions that perform functions or implement abstract data types.

方法３００ｂが説明される順序は、制限として解釈されることを意図しておらず、説明される方法ブロックの数は、方法３００ｂを実装するために任意の順序で組み合わせることができる。さらに、個々のブロックは、本明細書に説明される主題の趣旨及び範囲から逸脱することなく方法から削除され得る。さらに、方法３００ｂは、任意の適切なハードウェア、ソフトウェア、ファームウェア、又はそれらの組み合わせで実装することができる。 The order in which method 300b is described is not intended to be construed as a limitation, and the number of method blocks described may be combined in any order to implement method 300b. Additionally, individual blocks may be deleted from the method without departing from the spirit and scope of the subject matter described herein. Additionally, method 300b may be implemented in any suitable hardware, software, firmware, or combinations thereof.

ブロック３１１で、方法は、（ＲＡＳプロセッサ１２１とも呼ばれる）再利用解析システム（ＲＡＳ）のプロセッサ１２１により、ユーザ１１３から入力要件を受け取ることを含む。一例として、入力要件は、ビジネス要件（ＢＲ）、解析要件（ＡＲ）、及びデータ要件（ＤＲ）の１つである場合がある。いくつかの実施形態では、入力要件は、複数の再利用パッケージから対応する再利用パッケージを決定することを容易にする。複数の再利用パッケージは、１つ以上のデータ処理アクションの各々と関連付けられた１つ以上の中間ステップのために生成され得る。 At block 311, the method includes receiving input requirements from a user 113 by a reuse analysis system (RAS) processor 121 (also referred to as RAS processor 121). As an example, the input requirements may be one of business requirements (BR), analysis requirements (AR), and data requirements (DR). In some embodiments, the input requirements facilitate determining a corresponding reuse package from multiple reuse packages. Multiple reuse packages may be generated for one or more intermediate steps associated with each of the one or more data processing actions.

ブロック３１３で、方法は、ＲＡＳプロセッサ１２１により、入力要件に適切である再利用パッケージのタイプに対応する再利用解析構成を決定することを含む。いくつかの実施形態では、ＲＡＳプロセッサ１２１は、１つ以上の分類技術を使用して、入力要件を再利用パッケージのタイプにマッピングすることによって再利用解析構成を決定し得る。 At block 313, the method includes determining, by RAS processor 121, a reuse analysis configuration corresponding to a type of reuse package that is appropriate to the input requirements. In some embodiments, RAS processor 121 may determine the reuse analysis configuration by mapping input requirements to types of reuse packages using one or more classification techniques.

ブロック３１５で、方法は、ＲＡＳプロセッサ１２１により、再利用解析構成に基づいて複数の再利用パッケージと関連付けられた１つ以上の事前に格納されたメタデータとを用いて入力要件の意味相似解析を実行することによって、複数の再利用パッケージから１つ以上の候補再利用パッケージを決定することを含む。いくつかの実施形態では、１つ以上の候補再利用パッケージは、その１つ以上の事前に格納されたメタデータが、その１つ以上の事前に格納されたメタデータが入力要件と意味論的に一致する再利用パッケージである。いくつかの実施形態では、１つ以上の候補再利用パッケージは、入力要件と１つ以上の事前に格納されたメタデータとの間の意味論的一致の程度に基づいたスコアを割り当てられる。 At block 315, the method performs a semantic similarity analysis of the input requirements using the plurality of reuse packages and the one or more pre-stored metadata associated with the reuse analysis configuration based on the reuse analysis configuration. The steps include determining one or more candidate reuse packages from a plurality of reuse packages. In some embodiments, the one or more candidate reuse packages are configured such that the one or more pre-stored metadata are semantically related to input requirements. is a reusable package that matches. In some embodiments, one or more candidate reuse packages are assigned a score based on the degree of semantic match between the input requirements and the one or more pre-stored metadata.

ブロック３１７で、方法は、ＲＡＳプロセッサ１２１により、ユーザ１１３への提案として１つ以上の候補再利用パッケージを提供することを含む。いくつかの実施形態では、ＲＡＳプロセッサ１２１は、ユーザ１１３への提案として提供される１つ以上の候補再利用パッケージの各々とともにスコアを提供し得る。 At block 317, the method includes providing one or more candidate reuse packages as a suggestion to user 113 by RAS processor 121. In some embodiments, RAS processor 121 may provide a score with each of the one or more candidate reuse packages provided as a suggestion to user 113.

図４は、本開示と一致する実施態様を実装するための例示的なコンピュータシステムのブロック図である。 FIG. 4 is a block diagram of an exemplary computer system for implementing implementations consistent with this disclosure.

いくつかの実施形態では、図４は、本発明と一致する実施形態を実装するための例示的なコンピュータシステム４００のブロック図を示す。いくつかの実施形態では、コンピュータシステム４００は、モデルライフサイクル管理のために再利用解析を実行するために使用される再利用解析システム１０９である可能性がある。いくつかの実施形態では、コンピュータシステム４００は、複数の再利用パッケージを生成するために使用される再利用解析システム１０９と関連付けられたパッケージ生成システム１０５である可能性がある。図４は、コンピュータシステム４００を再利用解析システム１０９と見なす図４が示されている。しかしながら、コンピュータシステム４００は、（図４に図示されない）パッケージ生成システム１０５でもある可能性があるので、これは、制限と見なされるべきではない。コンピュータシステム４００は、中央演算処理装置（「ＣＰＵ」又は「プロセッサ」）４０２を含み得る。プロセッサ４０２は、ユーザ又はシステムにより生成されるビジネスプロセスを実行するためのプログラム構成要素を実行するための少なくとも１つのデータプロセッサを含み得る。ユーザは、人、本発明に含まれるものなどデバイスを使用する人、又はかかるデバイス自体を含み得る。プロセッサ４０２は、統合システム（バス）コントローラ、メモリ管理制御装置、浮動小数点演算ユニット、グラフィックスプロセッシングユニット、デジタル信号処理装置などの特殊処理装置を含み得る。 In some embodiments, FIG. 4 depicts a block diagram of an exemplary computer system 400 for implementing embodiments consistent with the present invention. In some embodiments, computer system 400 may be reuse analysis system 109 used to perform reuse analysis for model lifecycle management. In some embodiments, computer system 400 may be a package generation system 105 associated with a reuse analysis system 109 that is used to generate multiple reuse packages. FIG. 4 is shown considering computer system 400 as reuse analysis system 109. In FIG. However, this should not be considered a limitation, as computer system 400 may also be package generation system 105 (not shown in FIG. 4). Computer system 400 may include a central processing unit (“CPU” or “processor”) 402. Processor 402 may include at least one data processor for executing program components to execute user- or system-generated business processes. A user may include a person, a person using a device such as one included in the present invention, or such a device itself. Processor 402 may include specialized processing units such as an integrated system (bus) controller, memory management controller, floating point unit, graphics processing unit, digital signal processing unit, and the like.

プロセッサ４０２は、Ｉ／Ｏインタフェース４０１を介して入力装置４１１及び出力装置４１２と通信して配置され得る。Ｉ／Ｏインタフェース４０１は、制限なく、音声バス、アナログバス、デジタルバス、ステレオバス、ＩＥＥＥ－１３９４バス、シリアルバス、ユニバーサルシリアルバス（ＵＳＢ）、赤外線、ＰＳ／２、ＢＮＣ、同軸、コンポーネントＯ、複合、デジタルビジュアルインタフェース（ＤＶＩ）、高精細度マルチメディアインタフェース（ＨＤＭＩ（登録商標））、無線周波数（ＲＦ）アンテナ、Ｓ－ビデオ、ビデオグラフィックスアレイ（ＶＧＡ）、ＩＥＥＥ８０２．ｎ／ｂ／ｇ／ｎ／ｘ、Ｂｌｕｅｔｏｏｔｈ、セルラー（例えば、符号分割多元接続（ＣＤＭＡ）、高速パケットアクセス（ＨＳＰＡ＋）、グローバルサービスフォーモバイルコミュニケーションズ（ＧＳＭ）、ロングタームエボリューション（ＬＴＥ）、ＷｉＭａｘ、他）などの通信プロトコル／方法を利用し得る。 Processor 402 may be placed in communication with input devices 411 and output devices 412 via I/O interface 401 . I/O interface 401 may include, without limitation, audio bus, analog bus, digital bus, stereo bus, IEEE-1394 bus, serial bus, universal serial bus (USB), infrared, PS/2, BNC, coaxial, component O, Composite, Digital Visual Interface (DVI), High Definition Multimedia Interface (HDMI(R)), Radio Frequency (RF) Antenna, S-Video, Video Graphics Array (VGA), IEEE802. n/b/g/n/x, Bluetooth, cellular (e.g. Code Division Multiple Access (CDMA), High Speed Packet Access (HSPA+), Global Service for Mobile Communications (GSM), Long Term Evolution (LTE), WiMax, etc. ) and other communication protocols/methods may be utilized.

Ｉ／Ｏインタフェース４０１を使用して、コンピュータシステム４００は、入力装置４１１及び出力装置４１２と通信し得る。 Using I/O interface 401, computer system 400 may communicate with input devices 411 and output devices 412.

いくつかの実施形態では、プロセッサ４０２は、ネットワークインタフェース４０３を介して通信ネットワーク４０９と通信して配置され得る。ネットワークインタフェース４０３は、通信ネットワーク４０９と通信し得る。ネットワークインタフェース４０３は、制限なく、直接接続、イーサネット（例えば、ツイストペア１０／１００／１０００ベースＴ）、伝送制御プロトコル／インターネットプロトコル（ＴＣＰ／ＩＰ）、トークンリング、ＩＥＥＥ８０２．１１ａ／ｂ／ｇ／ｎ／ｘなどを含む接続プロトコルを利用し得る。ネットワークインタフェース４０３及び通信ネットワーク４０９を使用し、コンピュータシステム４００は、パッケージ生成システム１０５、パッケージリポジトリ１０７、及びユーザデバイス１１１と通信し得る。パッケージ生成システム１０５は、さらに１つ以上のデータソース１０１（１０１_１～１０１_ｎまで）及びデータリポジトリ１０３と関連付けられ得る。いくつかの実施形態では、コンピュータシステム４００は、ＩＰベースの通信、及びユニバーサルシリアルバス（ＵＳＢ）、Ｂｌｕｅｔｏｏｔｈなど、非インターネット又は非ＩＰベースの通信を介して通信し得る。通信ネットワーク４０９は、イントラネット又はローカルエリアネットワーク（ＬＡＮ）、クローズドエリアネットワーク（ＣＡＮ）、及び自立走行車の中のそのようなものなどの異なるタイプのネットワークの１つとして実装できる。通信ネットワーク４０９は、専用ネットワーク又は共用ネットワークのどちらかであってよく、これは、互いと通信するための例えばハイパーテキスト転送プロトコル（ＨＴＴＰ）、ＣＡＮプロトコル、伝送制御プロトコル／インターネットプロトコル（ＴＣＰ／ＩＰ）、無線プリケーションプロトコル（ＷＡＰ）など様々なプロトコルを使用する異なるタイプのネットワークの関連付けを表す。さらに、通信ネットワーク４０９は、ルータ、ブリッジ、サーバ、コンピューティングデバイス、ストレージデバイスなどを含む様々なネットワークデバイスを含み得る。いくつかの実施形態では、プロセッサ４０２は、ストレージインタフェース４０４を介してメモリ４０５（例えば、図４に図示されないＲＡＭ、ＲＯＭなど）と通信して配置されてよい。ストレージインタフェース４０４は、制限なく、シリアルアドバンストテクノロジーアタッチメント（ＳＡＴＡ）、インテグレーティドドライブエレクトロニクス（ＩＤＥ）、ＩＥＥＥ－１３９４、ユニバーサルシリアルバス（ＵＳＢ）、ファイバーチャネル、小型コンピュータ用周辺機器インタフェース（ＳＣＳＩ）などの接続プロトコルを利用して、メモリドライアブ、リムーバブルディスクドライブなどを含むメモリ４０５に接続し得る。メモリドライブは、さらにドラム、磁気ディスクドライブ、光磁気ドライブ、光ドライブ、レイド（ＲＡＩＤ）、ソリッドステートメモリデバイス、ソリッドステートドライブなどを含み得る。 In some embodiments, processor 402 may be placed in communication with communication network 409 via network interface 403. Network interface 403 may communicate with communication network 409. Network interface 403 can include, without limitation, direct connect, Ethernet (e.g., twisted pair 10/100/1000 Base-T), Transmission Control Protocol/Internet Protocol (TCP/IP), Token Ring, IEEE 802.11a/b/g/n/ Connection protocols may be utilized, including x and the like. Using network interface 403 and communication network 409, computer system 400 may communicate with package generation system 105, package repository 107, and user device 111. Package generation system 105 may further be associated with one or more data sources 101 (101 ₁ -101 _n ) and data repositories 103 . In some embodiments, computer system 400 may communicate via IP-based communications and non-Internet or non-IP-based communications, such as Universal Serial Bus (USB), Bluetooth, and the like. Communication network 409 can be implemented as one of different types of networks, such as an intranet or local area network (LAN), a closed area network (CAN), and the like in autonomous vehicles. The communication network 409 can be either a dedicated network or a shared network, which uses e.g. Hypertext Transfer Protocol (HTTP), CAN protocol, Transmission Control Protocol/Internet Protocol (TCP/IP) to communicate with each other. , represents an association of different types of networks using various protocols such as Wireless Application Protocol (WAP). Additionally, communication network 409 may include various network devices including routers, bridges, servers, computing devices, storage devices, and the like. In some embodiments, processor 402 may be arranged in communication with memory 405 (eg, RAM, ROM, etc. not shown in FIG. 4) via storage interface 404. Storage interface 404 may include, without limitation, Serial Advanced Technology Attachment (SATA), Integrated Drive Electronics (IDE), IEEE-1394, Universal Serial Bus (USB), Fiber Channel, Small Computer Peripheral Interface (SCSI), etc. Connection protocols may be utilized to connect to memory 405, including memory drives, removable disk drives, and the like. Memory drives may further include drums, magnetic disk drives, magneto-optical drives, optical drives, RAID, solid state memory devices, solid state drives, and the like.

メモリ（４０５）は、制限なく、ユーザインタフェース４０６、オペレーティングシステム４０７、ウェブブラウザ４０８などを含むプログラムコンポーネント又はデータベースコンポーネントの集合体を格納し得る。いくつかの実施形態では、コンピュータシステム４００は、本発明に説明されるように、データ、変数、レコードなどのユーザデータ／アプリケーションデータを格納し得る。かかるデータベースは、Ｏｒａｃｌｅ又はＳｙｂａｓｅなど、フォールトトレラントな、リレーショナルでスケーラブルで安全なデータベースとして実装され得る。 Memory (405) may store a collection of program or database components including, without limitation, a user interface 406, an operating system 407, a web browser 408, and the like. In some embodiments, computer system 400 may store user/application data such as data, variables, records, etc. as described in the present invention. Such a database may be implemented as a fault-tolerant, relational, scalable, and secure database, such as Oracle or Sybase.

オペレーティングシステム４０７は、コンピュータシステム４００のリソース管理及び操作を容易にし得る。オペレーティングシステム４０７の例は、制限なく、ＡＰＰＬＥ（登録商標）ＭＡＣＩＮＴＯＳＨ（登録商標）ＯＳＸ（登録商標）、ＵＮＩＸ（登録商標）、ＵＮＩＸのようなシステムの提供（例えば、ＢＥＲＫＥＬＥＹＳＯＦＴＷＡＲＥＤｉＳＴＲＩＢＵＴｉＯＮ（登録商標）（ＢＳＤ）、ＦＲＥＥＢＳＤ（登録商標）、ＮＥＴＢＳＤ（登録商標）、ＯＰＥＮＢＳＤなど）、ＬＩＮＵＸ（登録商標）ＤＩＳＴＲＩＢＵＴＩＯＮＳ（例えば、ＲＥＤＨＡＴ（登録商標）、ＵＢＵＮＴＵ（登録商標）、ＫＵＢＵＮＴＵ（登録商標）など）、ＩＢＭ（登録商標）ＯＳ／２（登録商標）、ＭＩＣＲＯＳＯＦＴ（登録商標）ＷＩＮＤＯＷＳ（登録商標）（ＸＰ（登録商標）、ＶＩＳＴＡ（登録商標）／７／８、１０他）、ＡＰＰＬＥ（登録商標）ＩＯＳ（登録商標）、ＧＯＯＧＬＥ（商標）ＡＮＤＲＯＩＤ（登録商標）、ＢＬＡＣＫＢＥＲＲＹ（登録商標）ＯＳなどを含む。ユーザインタフェース４０６は、テキスト機能又はグラフィック機能を通してプログラムコンポーネントの表示、実行、対話、操作、又は動作を容易にし得る。例えば、ユーザインタフェース４０６は、カーソル、アイコン、チェックボックス、メニュー、スクローラ、ウィンドウ、ウィジェットなどの、コンピュータシステム４００に動作可能なように接続されたディスプレイシステム上にコンピュータ対話インタフェース要素を提供し得る。Ａｐｐｌｅ（登録商標）Ｍａｃｉｎｔｏｓｈ（登録商標）オペレーティングシステムのＡｑｕａ（登録商標）、ＩＢＭ（登録商標）ＯＳ／２（登録商標）、Ｍｉｃｒｏｓｏｆｔ（登録商標）Ｗｉｎｄｏｗｓ（登録商標）（例えば、Ａｅｒｏ、Ｍｅｔｒｏなど）、ウェブインタフェースライブラリ（例えば、ＡｃｔｉｖｅＸ（登録商標）、Ｊａｖａ（登録商標）、Ｊａｖａｓｃｒｉｐｔ（登録商標）、ＡＪＡＸ、ＨＴＭＬ、Ａｄｏｂｅ（登録商標）Ｆｌａｓｈ（登録商標）、他）などを含むグラフィカルユーザインタフェース（ＧＵＩ）が利用され得る。 Operating system 407 may facilitate resource management and operation of computer system 400. Examples of operating systems 407 include, without limitation, APPLE® MACINTOSH® OS (BSD), FREEBSD(R), NETBSD(R), OPENBSD, etc.), LINUX(R) DISTRIBUTIONS (e.g., RED HAT(R), UBUNTU(R), KUBUNTU(R), etc.), IBM (registered trademark) OS/2 (registered trademark), MICROSOFT (registered trademark) WINDOWS (registered trademark) (XP (registered trademark), VISTA (registered trademark)/7/8, 10, etc.), APPLE (registered trademark) IOS (registered trademark), GOOGLE (trademark), ANDROID (registered trademark), BLACKBERRY (registered trademark) OS, etc. User interface 406 may facilitate displaying, executing, interacting with, manipulating, or operating program components through textual or graphical features. For example, user interface 406 may provide computer interaction interface elements on a display system operably connected to computer system 400, such as cursors, icons, check boxes, menus, scrollers, windows, widgets, and the like. Apple® Macintosh® operating systems Aqua®, IBM® OS/2®, Microsoft® Windows® (e.g., Aero, Metro, etc.) , graphical user interfaces (GUI), including web interface libraries (e.g., ActiveX®, Java®, JavaScript®, AJAX, HTML, Adobe® Flash®, etc.), etc. ) can be used.

いくつかの実施形態では、コンピュータシステム４００は、ウェブブラウザ４０８に格納されたプログラムコンポーネントを実装し得る。ウェブブラウザ４０８は、ＭＩＣＲＯＳＯＦＴ（登録商標）ＩＮＴＥＲＮＥＴＥＸＰＬＯＲＥＲ（登録商標）、ＧＯＯＧＬＥ（商標）ＣＨＲＯＭＥ（商標）、ＭＯＺＩＬＬＡ（登録商標）ＦＩＲＥＦＯＸ（登録商標）、ＡＰＰＬＥ（登録商標）ＳＡＦＡＲＩ（登録商標）などのハイパーテキスト表示アプリケーションであってよい。安全なウェブ閲覧は、セキュアハイパーテキスト転送プロトコル（ＨＴＴＰＳ）、セキュアソケットレイヤ（ＳＳＬ）、トランスポート層セキュリティ（ＴＬＳ）などを使用して提供し得る。ウェブブラウザ４０８は、ＡＪＡＸ、ＤＨＴＭＬ、ＡＤＯＢＥ（登録商標）ＦＬＡＳＨ（登録商標）、ＪＡＶＡＳＣＲＩＰＴ（登録商標）、ＪＡＶＡ（登録商標）、アプリケーションプログラミングインタフェース（ＡＰＩ）などの機能を利用し得る。いくつかの実施形態では、コンピュータシステム４００は、メールサーバに格納されたプログラムコンポーネントを実装し得る。メールサーバは、ＭｉｃｒｏｓｏｆｔＥｘｃｈａｎｇｅなどのインターネットメールサーバであってよい。メールサーバは、アクティブサーバページ（ＡＳＰ）、ＡＣＴＩＶＥＸ（登録商標）、ＡＮＳＩ（登録商標）Ｃ＋＋／Ｃ＃、ＭＩＣＲＯＳＯＦＴ（登録商標）、．ＮＥＴ、ＣＧＩＳＣＲＩＰＴＳ、ＪＡＶＡ（登録商標）、ＪＡＶＡＳＣＲＩＰＴ（登録商標）、ＰＥＲＬ（登録商標）、ＰＨＰ、ＰＹＴＨＯＮ（登録商標）、ＷＥＢＯＢＪＥＣＴＳ（登録商標）などの機能を利用し得る。メールサーバは、インターネットメッセージアクセスプロトコル（ＩＭＡＰ）、メッセージングアプリケーションプログラミングインタフェース（ＭＡＰＩ）、ＭＩＣＲＯＳＯＦＴ（登録商標）エクスチェンジ、ポストオフィスプロトコル（ＰＯＰ）、シンプルメールトランスファプロトコル（ＳＭＴＰ）などの通信プロトコルを利用し得る。いくつかの実施形態では、コンピュータシステム４００は、メールクライアントに格納されたプログラムコンポーネントを実装し得る。メールクライアントは、ＡＰＰＬＥ（登録商標）ＭＡＩＬ、ＭＩＣＲＯＳＯＦＴ（登録商標）ＥＮＴＯＵＲＡＧＥ（登録商標）、ＭＩＣＲＯＳＯＦＴ（登録商標）ＯＵＴＬＯＯＫ（登録商標）、ＭＯＺＩＬＬＡ（登録商標）ＴＨＵＮＤＥＲＢＩＲＤ（登録商標）などのメール表示アプリケーションであってよい。 In some embodiments, computer system 400 may implement program components stored in web browser 408. The web browser 408 is a hyperlink such as MICROSOFT (registered trademark) INTERNET EXPLORER (registered trademark), GOOGLE (trademark) CHROME (trademark), MOZILLA (registered trademark) FIREFOX (registered trademark), APPLE (registered trademark) SAFARI (registered trademark), etc. It may be a text display application. Secure web browsing may be provided using Secure Hypertext Transfer Protocol (HTTPS), Secure Sockets Layer (SSL), Transport Layer Security (TLS), and the like. Web browser 408 may utilize features such as AJAX, DHTML, ADOBE® FLASH®, JAVASCRIPT®, JAVA®, application programming interfaces (APIs), and the like. In some embodiments, computer system 400 may implement program components stored on a mail server. The mail server may be an Internet mail server such as Microsoft Exchange. Mail servers include Active Server Pages (ASP), ACTIVEX®, ANSI® C++/C#, MICROSOFT®, . NET, CGI SCRIPTS, JAVA (registered trademark), JAVASCRIPT (registered trademark), PERL (registered trademark), PHP, PYTHON (registered trademark), WEBOBJECTS (registered trademark), and the like. Mail servers may utilize communication protocols such as Internet Message Access Protocol (IMAP), Messaging Application Programming Interface (MAPI), MICROSOFT Exchange, Post Office Protocol (POP), Simple Mail Transfer Protocol (SMTP), and the like. In some embodiments, computer system 400 may implement program components stored in a mail client. The email client is an email display application such as APPLE (registered trademark) MAIL, MICROSOFT (registered trademark) ENTOURAGE (registered trademark), MICROSOFT (registered trademark) OUTLOOK (registered trademark), MOZILLA (registered trademark) THUNDERBIRD (registered trademark), etc. It's fine.

さらに、１つ以上のコンピュータ可読記憶媒体は、本発明と一致する実施形態を実装する際に利用され得る。コンピュータ可読記憶媒体は、プロセッサにより読み取り可能な情報又はデータが格納され得る任意のタイプの物理メモリを指す。したがって、コンピュータ可読記憶媒体は、プロセッサ（複数の場合がある）に本明細書に説明する実施形態と一致するステップ又は段階を実行させるための命令を含む、１つ以上のプロセッサによる実行のための命令を格納し得る。用語「コンピュータ可読媒体」は、有形の品目を含み、搬送波及び過渡信号を除外する、つまり非一時的と理解されるべきである。例は、ランダムアクセスメモリ（ＲＡＭ）、読取り専用メモリ（ＲＯＭ）、揮発性メモリ、不揮発性メモリ、ハードドライブ、コンパクトディスク（ＣＤ）ＲＯＭ、デジタルビデオディスク（ＤＶＤ）、フラッシュドライブ、ディスク、及び任意の他の既知の物理記憶媒体を含む。 Additionally, one or more computer-readable storage media may be utilized in implementing embodiments consistent with the present invention. Computer-readable storage media refers to any type of physical memory that can store information or data that can be read by a processor. Accordingly, a computer-readable storage medium includes instructions for execution by one or more processors to cause the processor(s) to perform steps or stages consistent with the embodiments described herein. May store instructions. The term "computer-readable medium" is to be understood to include tangible items and exclude carrier waves and transient signals, ie, non-transitory. Examples include random access memory (RAM), read-only memory (ROM), volatile memory, non-volatile memory, hard drives, compact discs (CDs) ROMs, digital video discs (DVDs), flash drives, disks, and any including other known physical storage media.

本開示の実施形態の優位点がここに示される。 Advantages of embodiments of the present disclosure are presented here.

本開示は、データ処理アクションを実行する際に関与する各中間ステップを記録し、各中間ステップに対応する再利用パッケージを生成する。これは、データ処理層、解析層、及びアプリケーション層にわたる再利用が可能にし、これは同様に、複数の使用点のためのアプリケーションの展開及び作成のプロセスを早めるのに役立つ。再利用パッケージは各中間ステップのために生成されるので、本開示は、再利用を可能にするためにユーザの入力要件の、完全なメインステップとの相似を命令する既存の技術とは異なり、中間ステップレベル自体からの再利用を可能にする。したがって、再利用パッケージは、データ処理アクションを実行する際に関与するメインステップに限定されないので、本開示は、より高い再利用確率を提供する。 The present disclosure records each intermediate step involved in performing a data processing action and generates a reuse package corresponding to each intermediate step. This enables reuse across data processing, analysis, and application layers, which in turn helps speed up the process of deploying and creating applications for multiple points of use. Because a reuse package is generated for each intermediate step, the present disclosure differs from existing techniques that mandate similarity of user input requirements to the complete main step to enable reuse. Enables reuse from intermediate step levels themselves. Therefore, the present disclosure provides a higher reuse probability since reuse packages are not limited to the main steps involved in performing data processing actions.

さらに、本開示では、ユーザから受け取られた入力要件の構成タイプに基づいて、第３のメタデータに存在する情報ラベルがチェックされ、それは複数の再利用パッケージの間で１つ以上の候補再利用パッケージを決定するのに役立つ。かかる検索プロセスは、検索を、入力要件の構成タイプに属する再利用パッケージだけに絞り込み、それにより使用するための１つ以上の候補再利用パッケージの容易で迅速且つ正確な決定を達成する。 Further, in this disclosure, based on the configuration type of the input requirement received from the user, an information label present in the third metadata is checked, which identifies one or more candidate reuses among multiple reuse packages. Help you decide on a package. Such a search process narrows the search to only reuse packages that belong to the configuration type of the input requirements, thereby achieving an easy, quick, and accurate determination of one or more candidate reuse packages for use.

さらに、本開示は、どの再利用パッケージが生成されるのかに基づいて、各ステップ、中間ステップ、アクションなどでデータのスナップショットを撮り、持続させる。かかるデータの持続及びスナップショットは、予備的なステップ及びアクションにも関連するデータを取り込み得る。予備的なステップ及びアクションは、正しいステップ及びアクションに到達する前に、ユーザによって実行されるトライアルステップ及びアクションとして定義できるであろう。したがって、再利用パッケージは、予備的なステップ及びアクションにも基づいて生成され、これは同様に、再利用パッケージの数及び対象を増加させるのに役立つ。 Additionally, the present disclosure takes and persists snapshots of data at each step, intermediate step, action, etc. based on which reuse package is generated. Such data persistence and snapshots may capture data related to preliminary steps and actions as well. Preliminary steps and actions could be defined as trial steps and actions performed by the user before arriving at the correct steps and actions. Therefore, reuse packages are also generated based on preliminary steps and actions, which likewise serves to increase the number and scope of reuse packages.

さらに、本開示では、再利用パッケージは、パッケージ生成システムにより、データ処理アクションを実行しながら、関与する各中間ステップのために自動的に生成される。したがって、再利用パッケージを生成するために使用されるデータを定義／選択するための手動介入は完全に排除され、それによって使用するための再利用パッケージの対象を拡大する。 Further, in this disclosure, reuse packages are automatically generated by the package generation system for each intermediate step involved while performing data processing actions. Therefore, manual intervention to define/select the data used to generate the reuse package is completely eliminated, thereby expanding the scope of the reuse package for use.

互いと通信するいくつかの構成要素を有する実施形態の説明は、すべてのかかる構成要素が必要とされることを暗示していない。逆に、様々な任意選択の構成要素は、本発明の多種多様な考えられる実施形態を示すために説明される。単一のデバイス又は製品が本明細書で説明されるとき、１つ以上のデバイス／製品が（それらが協調するかどうかに関わりなく）単一のデバイス／製品の代わりに使用され得ることが明らかになる。同様に、複数のデバイス又は製品が（それらが協調するかどうかに関わりなく）本明細書に説明される場合、単一のデバイス／製品が複数のデバイス若しくは製品の代わりに使用されてよい、又は示されている数のデバイス又はプログラムの代わりに異なる数のデバイス／製品が使用されてよいことが明らかになる。デバイスの機能及び／又は特徴は、かかる機能／特徴を有するとして明示的に説明されない１つ以上の他のデバイスによって代わりに実施されてよい。したがって、本発明の他の実施形態は、デバイス自体を含む必要はない。 A description of an embodiment having several components in communication with each other does not imply that all such components are required. On the contrary, various optional components are described to illustrate the wide variety of possible embodiments of the invention. When a single device or product is described herein, it is clear that one or more devices/products (whether or not they cooperate) may be used in place of the single device/product. become. Similarly, when multiple devices or products are described herein (regardless of whether they coordinate or not), a single device/product may be used in place of the multiple devices or products, or It will be apparent that a different number of devices/products may be used in place of the number of devices or programs shown. The functions and/or features of a device may be alternatively implemented by one or more other devices not explicitly described as having such functions/features. Therefore, other embodiments of the invention need not include the device itself.

本明細書は、モデルライフサイクル管理のために再利用解析を実行するための方法及びシステムを説明してきた。示されているステップは、示されている例示的な実施形態を説明するために提示されており、継続中の技術開発が特定の機能が実行される方法を変更することが予測されるべきである。これらの例は、制限のためではなく、例示の目的で本明細書に提示される。さらに、機能上の基礎的要素の限界は、説明の便宜上、本明細書では任意に定義されている。指定された機能及びその関係性が適切に実行される限り、代替の限界を定義することもできる。（本明細書に説明されるものの均等物、拡張、変形、逸脱などを含む）代替策は、本明細書に含まれる教示に基づき当業者に明らかになる。かかる代替策は、開示される実施形態の範囲内に入る。また、単語「含む（ｃｏｍｐｒｉｓｉｎｇ）」、「有する（ｈａｖｉｎｇ）」、「含む（ｃｏｎｔａｉｎｉｎｇ）」、及び「含む（ｉｎｃｌｕｄｉｎｇ）」、及び他の類似した形式は、意味において同等であることが意図され、これらの単語の任意の１つに続く１つ又は複数の品目が、かかる１つ又は複数の品目の限定列挙であることを意図していない、又は列挙されている１つ若しくは複数の品目だけに限定されることを意図していないという点でオープンエンドである。また、本明細書及び添付の特許請求の範囲で使用されるとき、単数形「ある」、「１つの」、及び「該」は、文脈が別途明確に指示しない限り、複数の参照を含むことに留意されたい。 This specification has described methods and systems for performing reuse analysis for model lifecycle management. The steps shown are presented to explain the example embodiments shown, and it should be anticipated that ongoing technological developments will change the manner in which certain functions are performed. be. These examples are presented herein for purposes of illustration and not limitation. Furthermore, the limits of functional fundamentals are arbitrarily defined herein for convenience of explanation. Alternative limits may also be defined, as long as the specified functions and their relationships are properly performed. Alternatives (including equivalents, extensions, variations, deviations, etc. to those described herein) will be apparent to those skilled in the art based on the teachings contained herein. Such alternatives fall within the scope of the disclosed embodiments. Also, the words "comprising," "having," "containing," and "including" and other similar forms are intended to be equivalent in meaning; The item or items that follow any one of these words are not intended to be a limited enumeration of such item or items, or only the item or items listed. It is open-ended in that it is not intended to be limited. Also, as used in this specification and the appended claims, the singular forms "a," "a," and "the" include plural references unless the context clearly dictates otherwise. Please note that.

最後に、本明細書で使用される言語は、おもに読みやすさ及び教育の目的で選択されており、言語は、本発明の主題を詳しく説明又は制限するために選択されなかった可能性がある。したがって、本発明の範囲は、本発明を実施するための形態によって制限されるのではなく、むしろここに基づく出願に対して発生するあらゆる請求項によって制限されることが意図される。したがって、本発明の実施形態は、続く特許請求の範囲に説明される、本発明の範囲の制限的ではなく、例示的となることを目的とする。 Finally, the language used herein has been chosen primarily for readability and educational purposes; the language may not have been chosen to elaborate or limit the subject matter of the invention. . Accordingly, it is intended that the scope of the invention be limited not by this detailed description, but rather by any claims arising on this application. Accordingly, the embodiments of the invention are intended to be illustrative rather than restrictive of the scope of the invention, which is set forth in the claims that follow.

１００アーキテクチャ
１０１１つ以上のデータソース
１０３データリポジトリ
１０５パッケージ生成システム（ＰＧＳ）
１０７パッケージリポジトリ
１０９再利用解析システム（ＲＡＳ）
１１１ユーザデバイス
１１３ユーザ
１１５ＰＧＳプロセッサ
１１７ＰＧＳＩ／Ｏインタフェース
１１９ＰＧＳメモリ
１２１ＲＡＳプロセッサ
１２３ＲＡＳＩ／Ｏインタフェース
１２５ＲＡＳメモリ
２０３ＰＧＳのデータ
２０５ＰＧＳのモジュール
２０７第１のメタデータ
２０９スナップショットデータ
２１１第２のメタデータ
２１３アクションシーケンスデータ
２１５第３のメタデータ
２１７再利用パッケージデータ
２１９ＰＧＳの他のデータ
２２１変換モジュール
２２３メタデータ生成モジュール
２２５スナップショット作成モジュール
２２７シーケンス構築モジュール
２２９パッケージ生成モジュール
２３１ＰＧＳの他のモジュール
２４３ＲＡＳのデータ
２４５ＲＡＳのモジュール
２４７要件データ
２４９構成データ
２５１候補再利用パッケージデータ
２５３ＲＡＳの他のデータ
２５５受取りモジュール
２５７構成決定モジュール
２５９再利用パッケージ決定モジュール
２６１ＲＡＳの他のモジュール
４００例示的なコンピュータシステム
４０１例示的なコンピュータシステムのＩ／Ｏインタフェース
４０２例示的なコンピュータシステムのプロセッサ
４０３ネットワークインタフェース
４０４ストレージインタフェース
４０５例示的なコンピュータシステムのメモリ
４０６ユーザインタフェース
４０７オペレーティングシステム
４０８ウェブブラウザ
４０９通信ネットワーク
４１１入力装置
４１２出力装置 100 Architecture 101 One or more data sources 103 Data repositories 105 Package Generation System (PGS)
107 Package Repository 109 Reuse Analysis System (RAS)
111 User device 113 User 115 PGS processor 117 PGS I/O interface 119 PGS memory 121 RAS processor 123 RAS I/O interface 125 RAS memory 203 PGS data 205 PGS module 207 First metadata 209 Snapshot data 211 Second Metadata 213 Action sequence data 215 Third metadata 217 Reuse package data 219 Other PGS data 221 Conversion module 223 Metadata generation module 225 Snapshot creation module 227 Sequence construction module 229 Package generation module 231 Other PGS Module 243 RAS Data 245 RAS Module 247 Requirements Data 249 Configuration Data 251 Candidate Reuse Package Data 253 RAS Other Data 255 Receiving Module 257 Configuration Determination Module 259 Reuse Package Determination Module 261 Other RAS Modules 400 Exemplary Computer System 401 I/O Interface of the Example Computer System 402 Processor of the Example Computer System 403 Network Interface 404 Storage Interface 405 Memory of the Example Computer System 406 User Interface 407 Operating System 408 Web Browser 409 Communication Network 411 Input Device 412 Output device

Claims

A method for performing reuse analysis for model lifecycle management, the method comprising:
receiving, by a reuse analysis system (109), input requirements from a user (113) via a user device (111), the input requirements determining a corresponding reuse package from a plurality of reuse packages; the plurality of reuse packages being generated for each of the one or more intermediate steps associated with each of the one or more data processing actions;
determining, by the reuse analysis system (109), a reuse analysis configuration corresponding to a type of reuse package that is appropriate for the input requirements;
the reuse analysis by performing a semantic similarity analysis of the input requirements using one or more pre-stored metadata associated with the plurality of reuse packages based on the reuse analysis configuration; determining, by the system (109), one or more candidate reuse packages from the plurality of reuse packages;
providing the one or more candidate reuse packages as a suggestion to the user by the reuse analysis system (109).

2. The method of claim 1, wherein the input requirements are at least one of business requirements (BR), analysis requirements (AR), and data requirements (DR).

Claim: wherein determining the reuse analysis configuration comprises mapping, by the reuse analysis system (109), the input requirements to the types of reuse packages using one or more classification techniques. The method described in Section 1.

the one or more candidate reuse packages whose one or more pre-stored metadata semantically match the input requirements by exceeding a preset threshold; 2. The method of claim 1, wherein the method is a package.

5. The one or more candidate reuse packages are assigned a score based on the degree of semantic match between the input requirements and the one or more pre-stored metadata. Method described.

Generating the plurality of reusable packages includes:
providing data-as-a-service by converting historical data from a raw format to an easy-to-use format by a package generation system (105) associated with a reusable analysis system (109); first metadata (207) required to generate a reuse package is generated while transforming said historical data, said historical data being received from one or more data sources (101); said providing, stored in a package repository (107) associated with said package generation system (105);
said package generation system (105), while performing said one or more data processing actions, creating one or more state snapshots and providing said data-as-a-service, each of said corresponding data processing actions; constructing an action execution sequence for each of said one or more intermediate steps associated with said action execution sequence, said second metadata (211) for each of said one or more intermediate steps associated with said action execution sequence; said constructing, generated while constructing a sequence;
generating, by the package generation system (105), third metadata (215) for each of the one or more data processing actions, the third metadata (215) comprising: the one or more intermediate steps of the corresponding data processing action, the second metadata (211) generated for each of the one or more intermediate steps, corresponding to the second metadata (211); a reference to the first metadata (207) for each of the one or more intermediate steps, the action execution sequence of each of the one or more intermediate steps, an identifier (ID) corresponding to each action execution sequence, each of the one or more intermediate steps; said generating a predetermined data query, a type of reuse package, and a package identifier (ID) associated with said corresponding data processing action;
The method of claim 1, comprising: generating, by the package generation system (105), the plurality of reuse packages based on the third metadata (215).

Providing the data as a service,
parsing, by the package generation system (105), the historical data to learn parameters, constraints, and relationships of the historical data;
creating a directed data dependency state diagram by the package generation system (105) based on the learned parameters, constraints, and relationships of the historical data and a predetermined data navigation schema; creating a directed data dependency state diagram including a data identifier, the historical data, data navigation commands, and data endpoints, each data endpoint being associated with an application programming interface (API);
generating first metadata (207) by the package generation system (105) based on the directed data dependency state diagram and the predetermined data navigation schema in order to provide the data as a service; 7. The method of claim 6, comprising:

creating one or more state snapshots and constructing the action execution sequence;
Upon providing the data as a service by the package generation system (105), receiving a data query in the original data format;
converting, by the package generation system (105), the data query from the original data format to a searchable index format representing data endpoints;
providing, by the package generation system (105), the process data corresponding to the data query from each of the data endpoints, along with a storage identifier and location of the process data in a data repository (103); said providing, wherein process data is obtained by reviewing said historical data;
The package generation system (105) determines the state of historical data in the data repository (103), the data query, the process data, and the relationship between the process data and the corresponding data action in which the process data is used. storing relationships thereby creating the one or more state snapshots;
receiving a request from a user (113) via a user device (111) to perform the one or more data processing actions on the process data by the package generation system (105);
creating, by the package generation system (105), a step identifier for each of the one or more intermediate steps associated with each of the one or more data processing actions;
The package generation system (105) generates the second metadata (211) for each of the one or more intermediate steps based on the corresponding data processing action, the step identifier, and the process data. to generate;
constructing the action execution sequence by the package generation system (105) based on the flow defined in the second metadata (211) and a predetermined relationship schema;
7. The method of claim 6, comprising storing the constructed action execution sequence for each request by the package generation system (105).

The first metadata (207), the second metadata (211), and the third metadata (215) include pre-stored metadata associated with the plurality of reuse packages. 7. The method of claim 6.

A reuse analysis system (109) for performing reuse analysis for model life cycle management, comprising:
a processor (121);
a memory (125) communicatively coupled to the processor (121), wherein the memory (125), upon execution, provides information to the processor;
receiving input requirements from a user (113) via a user device (111), wherein the input requirements facilitate determining a corresponding reuse package from a plurality of reuse packages; , generated for each of the one or more intermediate steps associated with each of the one or more data processing actions;
determining a reuse analysis configuration corresponding to a type of reuse package that is optimal for the input requirements;
the plurality of reuse packages by performing a semantic similarity analysis of the input requirements using one or more pre-stored metadata associated with the plurality of reuse packages based on the reuse analysis configuration; determining one or more candidate reuse packages from the usage packages;
a memory (125) for storing process instructions for providing the one or more candidate reuse packages as a suggestion to the user.

The reuse analysis system (109) according to claim 10, wherein the input requirements are at least one of business requirements (BR), analysis requirements (AR), and data requirements (DR).

11. To determine the reuse analysis configuration, the processor is configured to map the input requirements to types of reuse packages using one or more classification techniques. reuse analysis system (109).

the one or more candidate reuse packages whose one or more pre-stored metadata semantically match the input requirements by exceeding a preset threshold; The reuse analysis system (109) according to claim 10, which is a package.

the processor assigns a score to the one or more candidate reuse packages based on the degree of semantic match between the input requirements and the one or more pre-stored metadata; The reuse analysis system (109) according to claim 13.

In order to generate the plurality of reuse packages, a package generation system (105) associated with the reuse analysis system (109),
providing data-as-a-service by converting historical data from a raw format to an easy-to-use format; first metadata (207) required for generating said plurality of reuse packages; is generated while transforming said historical data, said historical data being received from one or more data sources (101) and stored in a data repository (103) associated with said package generation system (105); providing said;
creating one or more state snapshots while performing the one or more data processing actions and providing the data-as-a-service, the one or more intermediates associated with each of the corresponding data processing actions; constructing an action execution sequence for each of the steps, wherein second metadata (211) for each of the one or more intermediate systems is generated while constructing the action execution sequence; to build,
generating third metadata (215) for each of the one or more data processing actions, wherein the third metadata (215) includes the corresponding one of the corresponding data processing actions; The above intermediate step, the second metadata (211) generated for each of the one or more intermediate steps, and the first metadata (207) corresponding to the second metadata (211). ), the action execution sequence of each of the one or more intermediate steps, an identifier (ID) corresponding to each action execution sequence, an identifier (ID) associated with each of the one or more intermediate steps and the corresponding data processing action. the plurality of reuse packages based on the third metadata (215); and generating the plurality of reuse packages based on the third metadata (215). A reuse analysis system (109) according to claim 10, configured to perform.

In order to provide the data as a service, the package generation system (105)
parsing the historical data to learn parameters, constraints, and relationships of the historical data;
creating a directed data dependency state diagram based on the learned parameters, constraints, and relationships of the historical data and a predetermined data navigation schema, wherein the directed data dependency state diagram creating an identifier, the historical data, a data navigation command, and a data endpoint, each data endpoint being associated with an application programming interface (API);
Claim: configured to generate the first metadata (207) based on the directed data dependency state diagram and the predetermined data navigation schema for providing the data as a service. Reuse analysis system (109) according to item 15.

the package generation system (105) for creating one or more state snapshots and constructing the action execution sequence;
providing said data-as-a-service, receiving data queries in their original data format;
converting the data query from the original data format to a searchable index format representing data endpoints;
providing said process data corresponding to said data query from each said data endpoint, along with a storage identifier and location of said process data in said data repository (103), said process data detailing said historical data; obtained by considering, providing;
storing the state of historical data in the data repository (103), the data queries, the process data, and the relationships between the process data and the corresponding data actions in which the process data is used; creating and storing the one or more state snapshots;
receiving a request from a user (113) via a user device (111) to perform the one or more data processing actions on the process data;
creating a step identifier for each of the one or more intermediate steps involved in each of the one or more data processing actions;
generating said second metadata (211) for each of said one or more intermediate steps based on said corresponding data processing action, said step identifier and said process data;
constructing the action execution sequence based on the flow defined in the second metadata (211) and a predetermined relationship schema; and storing the constructed action execution sequence for each request. A reuse analysis system (109) according to claim 15, configured to perform.

The first metadata (207), the second metadata (211), and the third metadata (215) include pre-stored metadata associated with the plurality of reuse packages. A reuse analysis system (109) according to claim 15, which forms a reuse analysis system (109).