JP2022543992A

JP2022543992A - Methods, systems and programs for managing operational data in distributed processing systems

Info

Publication number: JP2022543992A
Application number: JP2022501116A
Authority: JP
Inventors: シュナイダー、ジェシカ; ハニス、トーマス; ザイフェルト、ポール
Original assignee: International Business Machines Corp
Current assignee: International Business Machines Corp
Priority date: 2019-08-05
Filing date: 2020-07-23
Publication date: 2022-10-17
Also published as: GB2602213B; GB202202866D0; CN114207590A; US11310126B2; WO2021024076A1; DE112020003744T5; GB2602213A; US20210044499A1

Abstract

分散処理システム内の運用データを管理する方法であって、システムのワークロードを監視して、データ・ソースとデータ・ターゲットの間の運用データの移動の現在の評価を確立し、１つまたは複数のサービス品質基準を損なう結果をもたらす移動の以前の事例を含む以前のデータ移動に関する履歴情報を受信し、現在の評価および履歴情報から次回の運用データのアクションが特定のサービス品質基準を満たさないということを決定し、それに応答してデータ・ソースおよびデータ・ターゲットの定義に従って特定のサービス品質基準を向上させるように適応されたデータ管理最適化インフラストラクチャ（データ・バックプレーン・サービス）を適用することによって、分散処理システム内の運用データを管理する。サービス品質基準に関連して過去の運用結果と相関関係がある過去の運用因子を含む履歴情報を使用してトレーニングされた認知システムを使用して運用結果を予測する。A method of managing operational data in a distributed processing system comprising monitoring system workload to establish a current assessment of operational data movement between data sources and data targets; receive historical information about previous data movements, including previous instances of moves that have resulted in compromised service quality standards of the company, and current assessment and historical information that the next operational data action will not meet specified service quality standards and responsively applying a data management optimization infrastructure (data backplane services) adapted to improve specific service quality standards as defined by data sources and data targets manages operational data in distributed processing systems. Predict operational outcomes using a cognitive system trained using historical information containing past operational factors that correlate with past operational outcomes in relation to service quality criteria.

Description

本発明は、一般に、コンピュータ・システムに関連しており、より詳細には、分散コンピューティング・システムにおいてサービス品質規格を維持する方法に関連している。 The present invention relates generally to computer systems, and more particularly to methods for maintaining quality of service standards in distributed computing systems.

コンピューティング・システムは、長年にわたって著しく複雑になった。初期のコンピューティングでは、プロジェクトに関連するすべてのタスクを処理する単一のコンピュータが存在した。さらに補助的システムが現れ、ネットワーク・コンピューティング（特に、インターネット）が出現するにつれて、コンピューティングの世界の大部分が、分散コンピューティングに変化している。分散コンピューティングまたは分散システムは、異なるネットワーク・コンピュータなどの異なる場所で実装されるコンポーネントを含むシステムである。例としては、ピアツーピア・ネットワーク、オンライン・ゲーム、電話技術、およびデータ管理が挙げられる。 Computing systems have become significantly more complex over the years. In early computing, there was a single computer that handled all the tasks associated with a project. With the advent of more ancillary systems and the advent of networked computing (particularly the Internet), much of the computing world is transforming into distributed computing. A distributed computing or distributed system is a system that includes components implemented in different locations, such as different network computers. Examples include peer-to-peer networks, online gaming, telephony, and data management.

データ管理は、一般に、データ（すなわち、情報）の調達、保守、および使用に関連する技術である。データ自体は、名前およびアドレスなどの顧客の詳細のように、単純なものであることがあり、または金融サービス（例えば、金融犯罪捜査ソリューション）のように、より大規模なものであることがある。そのようなシステムにおけるデータの運用管理は、非常に複雑である。この課題は、マイクロサービスのような分散機能処理アーキテクチャ（distributed functional processing architectures）を使用するシステムにおいて特に当てはまる。マイクロサービスは、疎結合されたサービスの集合としてアプリケーションの構造化を可能にするソフトウェア開発技術である。アプリケーションをより小さい異なるサービスに分解することの１つの利点は、モジュール性を改善し、アプリケーションの理解、開発、テストを容易にし、アーキテクチャの崩壊に対する回復力を高めることである。マイクロサービスは、特に、ハイパーテキスト転送プロトコル（ＨＴＴＰ：hypertext transfer protocol）などの、特定の技術に依存しないプロトコルを使用して、ネットワークを経由して通信し、任意の目標を実現させる個別のプロセスであると考えることができる。 Data management is generally the art of procuring, maintaining, and using data (ie, information). The data itself can be as simple as customer details such as names and addresses, or larger as financial services (e.g. financial crime investigation solutions) . Data management in such systems is very complex. This challenge is especially true in systems that use distributed functional processing architectures such as microservices. Microservices is a software development technique that allows structuring applications as a collection of loosely coupled services. One advantage of decomposing an application into smaller, distinct services is that it improves modularity, makes the application easier to understand, develop, test, and more resilient to architectural collapse. Microservices are discrete processes that communicate over a network to achieve arbitrary goals, especially using technology-neutral protocols such as the hypertext transfer protocol (HTTP). can be considered to exist.

使用される特定のマイクロサービスの性質は、アプリケーションに大きく依存する。金融サービスの不正検出アプリケーションでは、例えば、マイクロサービスは、トランザクションをキューに配置する受信サービス、添付ファイルが存在するかをチェックし、添付ファイルが存在する場合、その添付ファイルを光学式文字認識サービスなどの別のマイクロサービスに送信する添付ファイル・プロセッサ・サービス、現在のトランザクションを分析し、そのトランザクションに関連するいずれかの過去のトランザクションに関連付けるコンテキスト作成サービス、違反を識別するようにクライアントによって設定されたルールを実行する決定実行エンジン、トランザクションを再調査し、外れ値にフラグを立てる分析エンジン、識別された問題に基づいて人間の追跡調査のためのケースを作成するかどうかを判断するケース・マネージャ・サービス、および各トランザクションの処理時の更新内容をクライアントの経費／調達システムに返す通知マネージャを含んでよい。 The nature of the particular microservices used is highly application dependent. In a financial services fraud detection application, for example, a microservice could be a receiving service that queues a transaction, checks if an attachment exists, and if so, passes that attachment to an optical character recognition service, etc. An attachment processor service that sends to another microservice in the Create Context service that analyzes the current transaction and associates it with any past transactions related to it, set by the client to identify violations A decision execution engine that executes rules, an analysis engine that reviews transactions and flags outliers, and a case manager that determines whether to create a case for human follow-up based on identified issues. It may include a service and notification manager that returns updates to the client's expense/procurement system as each transaction is processed.

すべてのコンピューティング・システムに当てはまることであるが、分散コンピューティング・システムを監視し、それらの分散コンピューティング・システムがサービス品質（ＱｏＳ：quality-of-service）要件を満たしていることを保証できることは重要である。ＱｏＳは、電話技術もしくはコンピュータ・ネットワーク、またはクラウド・コンピューティング・サービスなどの、サービスの全体的性能、特に、ネットワークのユーザから見える性能の測定である。サービス品質を定量的に測定するために、多くの場合、ネットワーク・サービスの複数の関連する特徴が考慮される。マイクロサービスと同様に、ＱｏＳ要件の特定の性質は、含まれる特定のアプリケーションに依存する。ＱｏＳ基準は、例えば、データ・タイプに基づく応答時間の要件、ならびに時間およびデータの必要条件を含むコンテキストの必要条件を識別するサービス水準合意において、定められることがある。 As with all computing systems, the ability to monitor distributed computing systems and ensure that they meet quality-of-service (QoS) requirements. is important. QoS is a measure of the overall performance of a service, such as a telephony or computer network, or a cloud computing service, particularly the performance seen by the users of the network. In order to measure quality of service quantitatively, a number of relevant characteristics of network services are often considered. As with microservices, the specific nature of QoS requirements depends on the specific applications involved. QoS criteria may be defined, for example, in service level agreements that identify response time requirements based on data type and contextual requirements including time and data requirements.

本発明は、少なくとも１つの実施形態において、データ・ソースおよびデータ・ターゲットの定義、ならびにデータ・ソースおよびデータ・ターゲットのサービス品質基準を受信することと、分散処理システムの運用ワークロードを監視して、データ・ソースとデータ・ターゲットの間の運用データの移動の現在の評価を確立することと、サービス品質基準のうちの１つまたは複数を損なう結果をもたらす運用データの移動の以前の事例を含む、分散処理システム内の以前の運用データの移動に関する履歴情報を受信することと、現在の評価および履歴情報から、次回の運用データのアクションがサービス品質基準のうちの特定の１つを満たさないということを決定することと、前述の決定に応答して、定義に従って特定のサービス品質基準を向上させるように適応されたデータ管理最適化インフラストラクチャ（data management optimization infrastructure）を自動的に適用することとによって、分散処理システム内の運用データを管理する方法を一般に対象にしている。例示的な実装では、サービス水準合意モデルはサービス品質基準を提供し、サービス水準合意モデルは、データ・タイプ、コンテキストの必要条件、時間／日付の必要条件、および応答時間の要件を含み、データ移動実行履歴モデル（data movement execution history model）は履歴情報を提供し、データ移動実行履歴モデルは、サービスの統計値の履歴、サービスのリソース消費、およびサービスの種類の実行予測を含み、現行システム負荷モデル（current system load model）は運用ワークロードの前述の監視を提供し、現行システム負荷モデルは、リソースの利用、現在のクラスタ・サイズ、および能力評価を含み、データ・タイプおよびＱｏＳ要件モデルはデータ・タイプに関する定義および特性を提供し、データ・タイプおよびＱｏＳ要件モデルは、データ・タイプ定義およびデータ・サービス品質定義を含み、プロセス制御メカニズムは、必要に応じてサービス品質基準を満たすために、プロセス制御メカニズムから遠く離れたネットワークの位置でワーカー・スレッドを生成するようにデータ管理最適化インフラストラクチャを制御し、プロセス制御メカニズムは、データに最適化された選択、データ・サービス、サービス・フィードバック、およびデータ移動のディスパッチを含む。応答時間の要件は、データ・タイプ、コンテキストの必要条件、および時間／データの必要条件に基づく。決定することは、履歴情報を使用してトレーニングされた認知システムを使用することができ、認知システムは、現在の評価に基づいて運用結果を予測し、運用結果は特定のサービス品質基準が満たされないということの指示を提供する。現在の評価は、分散処理システムのリソースのリソース使用状況、リソースの能力、およびリソースの応答時間を含むことができる。データ管理最適化インフラストラクチャは、複数の拡張可能なデータ・バックプレーン・サービス（data backplane services）を含む。例示的なアプリケーションでは、分散処理システムは、不正検出ソリューションを提供し、データ・ソースおよびデータ・ターゲットは、名前、アドレス、電話番号、社会保障番号、または納税者登録番号などの大量の顧客情報、トランザクションの量および種類（ワイヤ、ＡＣＨ、クレジット、負債）などのトランザクション情報、ならびに前述の顧客情報または金融情報の集合を追跡するためのケース管理データを含む、データ・タイプを有し、サービス品質基準は、リソース割り当て、データ完全性仕様、およびサービスの使用可能時間を含み、データ・バックプレーン・サービスは、メッセージング・インターフェイス、ＡＰＩ、ストリーム、またはその他のデータ通信用の通信手段を含む。 The present invention, in at least one embodiment, receives definitions of data sources and data targets and quality of service metrics for the data sources and data targets, and monitors the operational workload of a distributed processing system. , establishing a current assessment of operational data movement between data sources and data targets, and previous instances of operational data movement that have resulted in compromised one or more of the quality of service criteria; , receiving historical information about previous operational data movements within the distributed processing system, and determining from the current evaluation and historical information that an upcoming operational data action will not meet a particular one of the quality of service criteria; and, in response to said determination, automatically applying a data management optimization infrastructure adapted to improve specified quality of service standards as defined; are generally directed to methods of managing operational data in distributed processing systems. In an exemplary implementation, the service level agreement model provides quality of service criteria, the service level agreement model includes data type, context requirements, time/date requirements, and response time requirements, and data movement requirements. The data movement execution history model provides historical information, the data movement execution history model includes historical service statistics, service resource consumption, and service type execution forecasts, and the current system load model (current system load model) provides the aforementioned monitoring of the operational workload, the current system load model includes resource utilization, current cluster size, and capacity assessment; The data type and QoS requirements model provides definitions and characteristics for types, includes data type definitions and data quality of service definitions, and the process control mechanism implements process control functions as needed to meet quality of service criteria. Controls the data management optimization infrastructure to spawn worker threads at locations in the network remote from the mechanism, and the process control mechanism controls data-optimized selection, data services, service feedback, and data Includes move dispatch. Response time requirements are based on data type, context requirements, and time/data requirements. Determining can use a cognitive system trained using historical information, the cognitive system predicts operational outcomes based on current evaluations, and operational outcomes where certain service quality criteria are not met. provide instructions for. Current ratings may include resource utilization of resources of the distributed processing system, resource capabilities, and resource response times. The data management optimization infrastructure includes multiple extensible data backplane services. In an exemplary application, the distributed processing system provides a fraud detection solution, where the data sources and data targets are large amounts of customer information such as names, addresses, phone numbers, social security numbers, or tax registration numbers; have data types, including transaction information such as transaction volume and type (wire, ACH, credit, debt), and case management data for tracking the aforementioned collections of customer or financial information, and quality of service criteria; contains resource allocations, data integrity specifications, and service uptime, and the data backplane service contains messaging interfaces, APIs, streams, or other communication means for data communication.

上記に加えて、本発明のさまざまな実施形態における追加の目的、特徴、および利点が、以下の詳細に記述された説明において明らかになるであろう。 In addition to the above, additional objects, features, and advantages of various embodiments of the present invention will become apparent in the following detailed written description.

本発明は、添付の図面を参照することによって、よりよく理解されることができ、そのさまざまな実施形態の非常に多くの目的、特徴、および利点が、当業者にとって明らかになる。 The present invention may be better understood, and numerous objects, features, and advantages of its various embodiments made apparent to those skilled in the art by referencing the accompanying drawings.

本発明の１つの実装に従って、サービス品質基準によって指示される自動化された運用データ管理を実行するようにプログラムされたコンピュータ・システムのブロック図である。1 is a block diagram of a computer system programmed to perform automated operational data management dictated by quality of service criteria, according to one implementation of the present invention; FIG. 本発明の１つの実装に従う、クラウド・コンピューティング環境の図的表現である。1 is a pictorial representation of a cloud computing environment, according to one implementation of the present invention; 本発明の１つの実装に従って、運用データ管理システムの機能モジュールを示すブロック図である。1 is a block diagram illustrating functional modules of an operational data management system, according to one implementation of the present invention; FIG. サービス水準合意モデル、データ移動実行履歴モデル、現行システム負荷モデル、ならびにデータ・タイプおよびＱｏＳ要件モデルを使用する本発明の１つの実装に従って、自動化された運用データ管理のための１つの解決策を示すモデル図である。1 illustrates one solution for automated operational data management according to one implementation of the present invention using a service level agreement model, a data movement execution history model, a current system load model, and a data type and QoS requirements model; It is a model diagram. 本発明の１つの実装に従って、図３の運用データ管理システムの運用結果を予測するために使用される認知システムのブロック図である。4 is a block diagram of a cognitive system used to predict operational outcomes of the operational data management system of FIG. 3, according to one implementation of the invention; FIG. 本発明の１つの実装に従って、データ管理プロセスの論理の流れを示すチャートである。FIG. 4 is a chart showing the logic flow of a data management process, according to one implementation of the invention; FIG.

異なる図面における同じ参照シンボルの使用は、類似する項目または同一の項目を示す。 The use of the same reference symbols in different drawings indicates similar or identical items.

分散コンピューティングおよびマイクロサービスの使用は複数の利点を提供するが、この方法は新しい問題もシステム設計者に提示する。歴史的に、モノリシック・アプリケーションは、単一の大きい機能ユニットとして実行されて、データにアクセスすることができ、データの移動および複製を最小限に抑えるように最適化されることができた。データは、単一の共通データ・ストアに容易に存在することができ、統合されたメカニズムによって分散され、アクセスされることができ、複数の種類のデータ構造（データベース、ファイルベースなど）内に存在することもできたが、まだデータ・アクセスＡＰＩ層を可能にした。それらのいずれの場合でも、データの完全性、不一致、および流通に伴う問題の機会を導入するデータの移動および冗長性に依存しないための一貫した取り組みが存在した。 While the use of distributed computing and microservices offers multiple advantages, the method also presents new challenges to system designers. Historically, monolithic applications could be executed as a single large functional unit to access data and optimized to minimize data movement and duplication. Data can easily reside in a single common data store, can be distributed and accessed by integrated mechanisms, and exist in multiple types of data structures (databases, file-based, etc.) could have done, but still enabled the data access API layer. In each of those cases, there has been a consistent effort to not rely on data movement and redundancy, which introduces opportunities for data integrity, inconsistencies, and problems with distribution.

これらの仮定はすべて、分散コンピューティング・システムと共に変化する。難しいのは、多くの場合、セグメント化されたサービスを利用するためにデータが複製されなければならないということである。例えば、金融サービス・アプリケーションでは、金融犯罪捜査ソリューションをサポートするために、複数の技術要素を活用するのが望ましい。複数の技術要素を活用することは、例えば、銀行トランザクション・データを同じ個人に属しているとして関連付けるための要素技術を使用すること、その個人の仲間のネットワークを理解すること、またはデータに対して機械学習分析を実行し、不正である可能性がある挙動パターンを識別することを伴うことがある。既存のサービスをこれらの機能に活用することは重要であるが、設計者は、多くの場合、それらのサービスが類似するデータ・レコード（顧客情報、トランザクション・レコードなど）へのアクセスを必要とし、特定のスキーマに存在するか、または期待されるデータ・サービスへの特定のデータ・ストア・アクセス・インターフェイスを使用して、特定の形式でデータ・レコードを取得することを期待するという問題に直面している。そのような場合、アプリケーション作成者は、データがどのように使用できると期待されるか、ならびにデータの移動および複製を最小限に抑える方法を、制御することができない。この問題は、ソリューション・プロバイダ全体が管理するべき問題になり、現在、コンポーネントが変化することがあるが問題が残るため、プロジェクトごとに管理する必要がある。 All of these assumptions change with distributed computing systems. The difficulty is that in many cases the data must be replicated to take advantage of segmented services. For example, in financial services applications, it is desirable to leverage multiple technology elements to support financial crime investigation solutions. Leveraging multiple technology elements means, for example, using elemental technologies to associate bank transaction data as belonging to the same individual, understanding that individual's peer network, or It may involve performing machine learning analysis to identify potentially fraudulent behavioral patterns. Leveraging existing services for these functions is important, but designers often find that those services require access to similar data records (customer information, transaction records, etc.) Faced with the problem of expecting to retrieve data records in a specific format, using a specific data store access interface to a data service that resides in a specific schema or is expected. ing. In such cases, the application writer has no control over how the data is expected to be used and how to minimize data movement and duplication. This problem has become an issue that should be managed by the solution provider as a whole, and now needs to be managed on a project-by-project basis as the components may change but the problem remains.

したがって、そのような分散システムにおいてデータを管理する改善された方法を考案することが望ましい。含まれる特定のシステムに特有のサービス品質（ＱｏＳ）要件を満たすように方法が自動化されることができる場合、さらに有利である。本発明の種々の実施形態では、システムがシステム全体を通じてデータ移動を管理する方法を定義する管理されたＱｏＳの方法を使用して、データの移動、複製、および流通の必要性を満たすための解決策を提供することによって、これらおよびその他の利点が実現される。このシステムは、アプリケーション開発者が、特定のデータ要素のソースおよびターゲットを定義し、データがどのくらい速く移動する必要があるか（流通）、一貫性の目標が何か（例えば、保証された一貫性、最終的な一貫性など）、データがどのように削除されると期待されるか、および複製データがどこで作成されるかを示すＱｏＳ特性を提供できるようにする。このシステムは、拡張可能なインフラストラクチャをデータ移動（更新および削除を含む）に使用することができ、このインフラストラクチャの柔軟なスケーリングによって、ＱｏＳの目標を満たすことができるようにする。 Therefore, it is desirable to devise improved methods of managing data in such distributed systems. It would be further advantageous if the method could be automated to meet the quality of service (QoS) requirements specific to the particular system involved. Various embodiments of the present invention provide solutions for meeting data movement, replication, and distribution needs using managed QoS methods that define how the system manages data movement throughout the system. These and other benefits are realized by providing solutions. This system allows the application developer to define the sources and targets of specific data elements, how fast the data needs to move (distribution), what the consistency goals are (e.g. guaranteed consistency). , eventual consistency, etc.), how data is expected to be deleted, and where duplicate data is created. The system can use a scalable infrastructure for data movement (including updates and deletions), and flexible scaling of this infrastructure allows QoS goals to be met.

ここで図を参照し、特に図１を参照すると、本発明に従って自動化された運用データ管理が実装されることができる、コンピュータ・システムの１つの実施形態１０が示されている。コンピュータ・システム１０は、システム・バス１４に接続された複数のプロセッサ１２ａ、１２ｂを含んでいる対称マルチプロセッサ（ＳＭＰ：symmetric multiprocessor）システムである。システム・バス１４は、インターフェイスをシステム・メモリ１８に提供する結合されたメモリ・コントローラ／ホスト・ブリッジ（ＭＣ／ＨＢ：memory controller/host bridge）１６に、さらに接続されて通信する。システム・メモリ１８は、ローカル・メモリ・デバイスであってよく、または代替として、複数の分散メモリ・デバイスを含んでよく、ダイナミック・ランダムアクセス・メモリ（ＤＲＡＭ：dynamic random-access memory）を含むのが好ましい。示されていないメモリ階層内に、オンボード（Ｌ１）および第２のレベル（Ｌ２）または第３のレベル（Ｌ３）のキャッシュなどの、追加の構造が存在し得る。システム・メモリ１８は、本発明に従って、分散システム、データ定義、ＱｏＳ基準、システム・モニタ、データの最適化、さまざまなバックプレーン・サービス、および運用結果を予測するために使用される認知システムの特定の機能を実行するために必要な運用プログラムを含む、１つまたは複数のアプリケーションまたはソフトウェア・モジュールを読み込んでおり、これらのすべてが、以下でさらに詳細に説明される。図１は、これらのさまざまなコンポーネントを単一のメモリ１８内に示しているが、これらのコンポーネントの一部が、コンピュータ・システム１０に類似するか、またはコンピュータ・システム１０と異なる、他の（遠く離れて位置する）ネットワーク・コンピュータ・システムに存在し得るということが理解される。特に、バックプレーン・サービスは、データの最適化から遠く離れた複数のネットワークの位置で実装されることができる。 Referring now to the figures, and more particularly to FIG. 1, there is shown one embodiment 10 of a computer system in which automated operational data management can be implemented in accordance with the present invention. Computer system 10 is a symmetric multiprocessor (SMP) system including a plurality of processors 12a, 12b connected to system bus 14; System bus 14 is further connected to and communicates with a coupled memory controller/host bridge (MC/HB) 16 that provides an interface to system memory 18 . System memory 18 may be a local memory device or, alternatively, may include multiple distributed memory devices, including dynamic random-access memory (DRAM). preferable. There may be additional structures within the memory hierarchy not shown, such as on-board (L1) and second level (L2) or third level (L3) caches. The system memory 18, in accordance with the present invention, identifies distributed systems, data definitions, QoS criteria, system monitors, data optimization, various backplane services, and cognitive systems used to predict operational outcomes. It loads one or more applications or software modules, including the operating programs necessary to perform the functions of, all of which are described in further detail below. Although FIG. 1 shows these various components in a single memory 18, some of these components may be similar to computer system 10, or may be different from computer system 10. It is understood that it may reside in a networked computer system (located remotely). In particular, backplane services can be implemented at multiple network locations far from data optimization.

ＭＣ／ＨＢ１６は、ＰＣＩ（peripheral componentinterconnect）Ｅｘｐｒｅｓｓリンク２０ａ、２０ｂ、２０ｃとのインターフェイスも含む。各ＰＣＩＥｘｐｒｅｓｓリンク２０ａ、２０ｂは、各ＰＣＩｅアダプタ２２ａ、２２ｂに接続され、各ＰＣＩｅアダプタ２２ａ、２２ｂは、各入出力（Ｉ／Ｏ：input/output）デバイス２４ａ、２４ｂに接続される。ＭＣ／ＨＢ１６は、スイッチ（Ｉ／Ｏファブリック）２８に接続されたＩ／Ｏバス２６とのインターフェイスをさらに含んでよい。スイッチ２８は、複数のＰＣＩリンク２０ｄ、２０ｅ、２０ｆへのＩ／Ｏバスのファンアウトを提供する。これらのＰＣＩリンクは、さらにＰＣＩｅアダプタ２２ｃ、２２ｄ、２２ｅに接続され、次にこれらのＰＣＩｅアダプタは、さらにＩ／Ｏデバイス２４ｃ、２４ｄ、２４ｅをサポートする。Ｉ／Ｏデバイスは、キーボード、グラフィカル・ポインティング・デバイス（マウス）、マイクロホン、ディスプレイ・デバイス、スピーカ、永続的ストレージ・デバイス（ハード・ディスク・ドライブ）またはそのようなストレージ・デバイスのアレイ、ＣＤまたはＤＶＤなどの光ディスク２５（コンピュータ可読ストレージ媒体の１つの例）を受け取る光ディスク・ドライブ、およびネットワーク・カードを含んでよいが、これらに限定されない。各ＰＣＩｅアダプタは、ＰＣＩリンクと各Ｉ／Ｏデバイスの間のインターフェイスを提供する。ＭＣ／ＨＢ１６は、待ち時間の少ない経路を提供し、この経路を介して、プロセッサ１２ａ、１２ｂは、バス・メモリ内またはＩ／Ｏアドレス空間内のどこかにマッピングされたＰＣＩデバイスにアクセスすることができる。ＭＣ／ＨＢ１６は、ＰＣＩデバイスがメモリ１８にアクセスできるようにするための高帯域幅の経路をさらに提供する。スイッチ２８は、異なるエンドポイント間のピアツーピア通信を提供してよく、このデータ・トラフィックは、キャッシュ・コヒーレント・メモリ転送（cache-coherent memory transfers）を伴わない場合、ＭＣ／ＨＢ１６に転送される必要がない。スイッチ２８は、分離した論理コンポーネントとして示されているが、ＭＣ／ＨＢ１６に統合されることができる。 The MC/HB 16 also includes an interface with PCI (peripheral component interconnect) Express links 20a, 20b, 20c. Each PCI Express link 20a, 20b is connected to a respective PCIe adapter 22a, 22b, and each PCIe adapter 22a, 22b is connected to a respective input/output (I/O) device 24a, 24b. MC/HB 16 may further include an interface with I/O bus 26 connected to switch (I/O fabric) 28 . Switch 28 provides I/O bus fanouts to multiple PCI links 20d, 20e, 20f. These PCI links are further connected to PCIe adapters 22c, 22d, 22e, which in turn support further I/O devices 24c, 24d, 24e. I/O devices include keyboards, graphical pointing devices (mouse), microphones, display devices, speakers, persistent storage devices (hard disk drives) or arrays of such storage devices, CDs or DVDs. (one example of a computer-readable storage medium), and network cards. Each PCIe adapter provides an interface between a PCI link and each I/O device. MC/HB 16 provides a low latency path through which processors 12a, 12b can access PCI devices mapped anywhere in bus memory or I/O address space. can be done. MC/HB 16 also provides a high bandwidth path for PCI devices to access memory 18 . Switch 28 may provide peer-to-peer communication between different endpoints, and this data traffic must be forwarded to MC/HB 16 if it does not involve cache-coherent memory transfers. do not have. Switch 28 is shown as a separate logical component, but can be integrated into MC/HB 16 .

この実施形態では、ＰＣＩリンク２０ｃが、ＭＣ／ＨＢ１６をサービス・プロセッサ・インターフェイス３０に接続し、Ｉ／Ｏデバイス２４ａとサービス・プロセッサ３２の間の通信を可能にする。サービス・プロセッサ３２は、ＪＴＡＧインターフェイス３４を介してプロセッサ１２ａ、１２ｂに接続され、プロセッサ１２ａ、１２ｂの動作を中断するアテンション・ライン３６を使用する。サービス・プロセッサ３２は、それ自身のローカル・メモリ３８を含んでよく、システムの起動のための種々のプログラム命令を格納する読み取り専用メモリ（ＲＯＭ：read-only memory）４０に接続される。サービス・プロセッサ３２は、システムの状態および診断の情報を提供するために、ハードウェア・オペレータ・パネル４２にアクセスすることができてもよい。 In this embodiment, PCI link 20c connects MC/HB 16 to service processor interface 30 and allows communication between I/O device 24a and service processor 32. FIG. Service processor 32 is connected to processors 12a, 12b via JTAG interface 34 and uses attention lines 36 to interrupt the operation of processors 12a, 12b. Service processor 32 may include its own local memory 38 and is connected to read-only memory (ROM) 40 that stores various program instructions for booting the system. Service processor 32 may be able to access hardware operator panel 42 to provide system status and diagnostic information.

代替の実施形態では、コンピュータ・システム１０は、これらのハードウェア・コンポーネントもしくはそれらの相互接続の変更、または追加のコンポーネントを含んでよく、そのため、示された例は、本発明に関するどのようなアーキテクチャの制限も意味すると解釈されるべきではない。本発明は、同等のクラウド・コンピューティング・ネットワーク内でさらに実装される。 In alternate embodiments, computer system 10 may include modifications of these hardware components or their interconnections, or additional components, so that the example shown may be any architecture with respect to the present invention. should not be construed to imply a limitation of The present invention is further implemented within equivalent cloud computing networks.

コンピュータ・システム１０の電源が最初に入れられるときに、サービス・プロセッサ３２は、ＪＴＡＧインターフェイス３４を使用して、システム（ホスト）プロセッサ１２ａ、１２ｂおよびＭＣ／ＨＢ１６に問い合わせる。問い合わせの完了後に、サービス・プロセッサ３２は、コンピュータ・システム１０のインベントリおよびトポロジーを取得する。次に、サービス・プロセッサ３２は、コンピュータ・システム１０のコンポーネントに対して、ビルトイン・セルフテスト（ＢＩＳＴ：built-in-self-tests）、基本検証テスト（ＢＡＴ：basicassurance tests）、およびメモリ・テストなどの、さまざまなテストを実行する。テスト中に検出された故障に関するエラー情報が、サービス・プロセッサ３２によってオペレータ・パネル４２に報告される。テスト中に故障していることが検出されたコンポーネントを取り出した後に、システム・リソースの有効な構成がまだ可能である場合、コンピュータ・システム１０は、続行することが許可される。実行コードがメモリ１８に読み込まれ、サービス・プロセッサ３２が、プログラム・コード（例えば、アプリケーションを開始するために使用されるオペレーティング・システム（ＯＳ：operating system）、および特に、本発明の自動化された運用データ管理アプリケーション）の実行のためにホスト・プロセッサ１２ａ、１２ｂを解放し、その実行結果が、システムのハード・ディスク・ドライブ（Ｉ／Ｏデバイス２４）に格納される。ホスト・プロセッサ１２ａ、１２ｂがプログラム・コードを実行している間に、サービス・プロセッサ３２は、冷却ファンの速度および動作、熱センサ、電源制御装置、ならびにプロセッサ１２ａ、１２ｂ、メモリ１８、およびＭＣ／ＨＢ１６のいずれかによって報告された回復可能なエラーおよび回復不可能なエラーなどの、任意の動作パラメータまたはエラーを監視して報告するモードを入力してよい。サービス・プロセッサ３２は、エラーの種類または定義されたしきい値に基づいて、アクションをさらに実行し得る。 Service processor 32 uses JTAG interface 34 to interrogate system (host) processors 12a, 12b and MC/HB 16 when computer system 10 is first powered on. After completing the query, service processor 32 obtains the inventory and topology of computer system 10 . Service processor 32 then subjects the components of computer system 10 to tests such as built-in-self-tests (BIST), basic verification tests (BAT), and memory tests. , run various tests. Error information regarding faults detected during testing is reported by service processor 32 to operator panel 42 . After retrieving a component that was detected to be faulty during testing, computer system 10 is allowed to continue if a valid configuration of system resources is still possible. Executable code is loaded into memory 18 and service processor 32 processes program code (e.g., an operating system (OS) used to start applications and, in particular, automated operation of the present invention). host processor 12a, 12b for execution of a data management application), the results of which are stored on the system's hard disk drive (I/O device 24). While host processors 12a, 12b are executing program code, service processor 32 monitors cooling fan speed and operation, thermal sensors, power control units, and processors 12a, 12b, memory 18, and MC/ A mode may be entered to monitor and report any operating parameter or error, including recoverable and non-recoverable errors reported by any of the HBs 16 . Service processor 32 may further take action based on the type of error or defined thresholds.

本発明は、システム、方法、またはコンピュータ・プログラム製品、あるいはその組み合わせであってよい。コンピュータ・プログラム製品は、プロセッサに本発明の態様を実行させるためのコンピュータ可読プログラム命令を含むコンピュータ可読ストレージ媒体を含んでよい。 The invention may be a system, method, or computer program product, or a combination thereof. The computer program product may include a computer readable storage medium containing computer readable program instructions for causing a processor to carry out aspects of the present invention.

コンピュータ可読ストレージ媒体は、命令実行デバイスによって使用するための命令を保持および格納できる有形のデバイスであることができる。コンピュータ可読ストレージ媒体は、例えば、電子ストレージ・デバイス、磁気ストレージ・デバイス、光ストレージ・デバイス、電磁ストレージ・デバイス、半導体ストレージ・デバイス、またはこれらの任意の適切な組み合わせであってよいが、これらに限定されない。コンピュータ可読ストレージ媒体のさらに具体的な例の非網羅的リストは、ポータブル・フロッピー（Ｒ）・ディスク、ハード・ディスク、ランダム・アクセス・メモリ（ＲＡＭ：random access memory）、読み取り専用メモリ（ＲＯＭ：read-onlymemory）、消去可能プログラマブル読み取り専用メモリ（ＥＰＲＯＭ：erasableprogrammable read-only memoryまたはフラッシュ・メモリ）、スタティック・ランダム・アクセス・メモリ（ＳＲＡＭ：static random access memory）、ポータブル・コンパクト・ディスク読み取り専用メモリ（ＣＤ－ＲＯＭ：compact disc read-only memory）、デジタル・バーサタイル・ディスク（ＤＶＤ：digital versatile disk）、メモリ・スティック、フロッピー（Ｒ）・ディスク、パンチカードまたは命令が記録されている溝の中の隆起構造などの機械的にエンコードされるデバイス、およびこれらの任意の適切な組み合わせを含む。本明細書において使用されるとき、コンピュータ可読ストレージ媒体は、それ自体が、電波またはその他の自由に伝搬する電磁波、導波管またはその他の送信媒体を伝搬する電磁波（例えば、光ファイバ・ケーブルを通過する光パルス）、あるいはワイヤを介して送信される電気信号などの一過性の信号であると解釈されるべきではない。 A computer-readable storage medium can be a tangible device capable of holding and storing instructions for use by an instruction execution device. A computer-readable storage medium may be, for example, but not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination thereof. not. A non-exhaustive list of more specific examples of computer readable storage media include portable floppy disks, hard disks, random access memory (RAM), read only memory (ROM). -onlymemory), erasable programmable read-only memory (EPROM or flash memory), static random access memory (SRAM), portable compact disc read-only memory (CD - ROM (compact disc read-only memory), digital versatile disk (DVD), memory stick, floppy (R) disk, punch card or raised structure in a groove where instructions are recorded and any suitable combination thereof. As used herein, a computer-readable storage medium is itself a radio wave or other freely propagating electromagnetic wave, or an electromagnetic wave propagating through a waveguide or other transmission medium (e.g., passing through a fiber optic cable). It should not be interpreted as a transient signal such as a light pulse) or an electrical signal transmitted over a wire.

本明細書に記載されたコンピュータ可読プログラム命令は、コンピュータ可読ストレージ媒体から各コンピューティング・デバイス／処理デバイスへ、またはネットワーク（例えば、インターネット、ローカル・エリア・ネットワーク、広域ネットワーク、または無線ネットワーク、あるいはその組み合わせ）を介して外部コンピュータまたは外部ストレージ・デバイスへダウンロードされる。このネットワークは、銅伝送ケーブル、光伝送ファイバ、無線送信、ルータ、ファイアウォール、スイッチ、ゲートウェイ・コンピュータ、またはエッジ・サーバ、あるいはその組み合わせを備えてよい。各コンピューティング・デバイス／処理デバイス内のネットワーク・アダプタ・カードまたはネットワーク・インターフェイスは、コンピュータ可読プログラム命令をネットワークから受信し、それらのコンピュータ可読プログラム命令を各コンピューティング・デバイス／処理デバイス内のコンピュータ可読ストレージ媒体に格納するために転送する。 Computer readable program instructions described herein can be transferred from a computer readable storage medium to each computing device/processing device or over a network (e.g., the Internet, local area network, wide area network, or wireless network, or the like). combination) to an external computer or external storage device. The network may comprise copper transmission cables, optical transmission fibers, wireless transmissions, routers, firewalls, switches, gateway computers, or edge servers, or a combination thereof. A network adapter card or network interface within each computing device/processing device receives computer readable program instructions from the network and translates those computer readable program instructions into a computer readable network within each computing device/processing device. Transfer for storage on a storage medium.

本発明の動作を実行するためのコンピュータ可読プログラム命令は、アセンブラ命令、命令セット・アーキテクチャ（ＩＳＡ：instruction-set-architecture）命令、マシン命令、マシン依存命令、マイクロコード、ファームウェア命令、状態設定データ、またはＪａｖａ（Ｒ）、Ｓｍａｌｌｔａｌｋ、Ｃ＋＋などのオブジェクト指向プログラミング言語、および「Ｃ」プログラミング言語もしくは同様のプログラミング言語などの従来の手続き型プログラミング言語を含む１つもしくは複数のプログラミング言語の任意の組み合わせで記述されたソース・コードもしくはオブジェクト・コードであってよい。コンピュータ可読プログラム命令は、ユーザのコンピュータ上で全体的に実行すること、ユーザのコンピュータ上でスタンドアロン・ソフトウェア・パッケージとして部分的に実行すること、ユーザのコンピュータ上およびリモート・コンピュータ上でそれぞれ部分的に実行すること、もしくはリモート・コンピュータ上もしくはサーバ上で全体的に実行することができる。後者のシナリオでは、リモート・コンピュータは、ローカル・エリア・ネットワーク（ＬＡＮ：local area network）もしくは広域ネットワーク（ＷＡＮ：wide areanetwork）を含む任意の種類のネットワークを介してユーザのコンピュータに接続されてよく、または接続は、（例えば、インターネット・サービス・プロバイダを使用してインターネットを介して）外部コンピュータに対して行われてよい。一部の実施形態では、本発明の態様を実行するために、例えばプログラマブルロジック回路、フィールドプログラマブル・ゲート・アレイ（ＦＰＧＡ：field-programmable gate arrays）、またはプログラマブル・ロジック・アレイ（ＰＬＡ：programmable logic arrays）を含む電子回路は、コンピュータ可読プログラム命令の状態情報を利用することによって、電子回路をカスタマイズするためのコンピュータ可読プログラム命令を実行し得る。 Computer readable program instructions for performing the operations of the present invention include assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine-dependent instructions, microcode, firmware instructions, state setting data, or written in any combination of one or more programming languages, including object-oriented programming languages such as Java(R), Smalltalk, C++, and conventional procedural programming languages such as the "C" programming language or similar programming languages; It may be source code or object code. The computer-readable program instructions may execute wholly on the user's computer, partially on the user's computer as a stand-alone software package, partially on the user's computer and partially on the remote computer, respectively. It can run on a remote computer or run entirely on a server. In the latter scenario, the remote computer may be connected to the user's computer via any type of network, including a local area network (LAN) or a wide area network (WAN); Or a connection may be made to an external computer (eg, over the Internet using an Internet service provider). In some embodiments, programmable logic circuits, field-programmable gate arrays (FPGAs), or programmable logic arrays (PLAs), for example, are used to implement aspects of the present invention. ) may execute computer readable program instructions to customize the electronic circuit by utilizing the state information of the computer readable program instructions.

本発明の態様は、本明細書において、本発明の実施形態に従って、方法、装置（システム）、およびコンピュータ・プログラム製品のフローチャート図またはブロック図あるいはその両方を参照して説明される。フローチャート図またはブロック図あるいはその両方の各ブロック、およびフローチャート図またはブロック図あるいはその両方に含まれるブロックの組み合わせが、コンピュータ可読プログラム命令によって実装されるということが理解されるであろう。 Aspects of the present invention are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer readable program instructions.

これらのコンピュータ可読プログラム命令は、コンピュータまたはその他のプログラム可能なデータ処理装置のプロセッサを介して実行される命令が、フローチャートまたはブロック図あるいはその両方のブロックに指定される機能／動作を実施する手段を作り出すべく、汎用コンピュータ、専用コンピュータ、または他のプログラム可能なデータ処理装置のプロセッサに提供されてマシンを作り出すものであってよい。これらのコンピュータ可読プログラム命令は、命令が格納されたコンピュータ可読ストレージ媒体がフローチャートまたはブロック図あるいはその両方のブロックに指定される機能／動作の態様を実施する命令を含んでいる製品を含むように、コンピュータ可読ストレージ媒体に格納され、コンピュータ、プログラム可能なデータ処理装置、または他のデバイス、あるいはその組み合わせに特定の方式で機能するように指示できるものであってもよい。 These computer readable program instructions are the means by which instructions executed via a processor of a computer or other programmable data processing apparatus perform the functions/acts specified in the flowchart illustrations and/or block diagrams. For production, it may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine. These computer readable program instructions are such that the computer readable storage medium on which the instructions are stored comprises instructions for implementing aspects of the functions/operations specified in the flowchart and/or block diagram blocks. It may be stored on a computer readable storage medium and capable of instructing a computer, programmable data processing apparatus, or other device, or combination thereof, to function in a particular manner.

コンピュータ可読プログラム命令は、コンピュータ上、その他のプログラム可能な装置上、またはその他のデバイス上で実行される命令が、フローチャートまたはブロック図あるいはその両方のブロックに指定される機能／動作を実施するように、コンピュータ、その他のプログラム可能なデータ処理装置、またはその他のデバイスに読み込まれてもよく、それによって、一連の動作可能なステップを、コンピュータ上、その他のプログラム可能な装置上、またはコンピュータ実装プロセスを生成するその他のデバイス上で実行させる。 Computer readable program instructions are instructions that are executed on a computer or other programmable apparatus or other device to perform the functions/acts specified in the flowcharts and/or block diagrams. , computer, or other programmable data processing apparatus, or other device, thereby executing a series of operable steps on the computer, other programmable apparatus, or computer-implemented process. Make it run on other devices you generate.

図内のフローチャートおよびブロック図は、本発明の種々の実施形態に従って、システム、方法、およびコンピュータ・プログラム製品の可能な実装のアーキテクチャ、機能、および動作を示す。これに関連して、フローチャートまたはブロック図内の各ブロックは、規定された論理機能を実装するための１つまたは複数の実行可能な命令を備える、命令のモジュール、セグメント、または部分を表してよい。一部の代替の実装では、ブロックに示された機能は、図に示された順序とは異なる順序で発生してよい。例えば、連続して示された２つのブロックは、実際には、含まれている機能に応じて、実質的に同時に実行されるか、または場合によっては逆の順序で実行される。ブロック図またはフローチャート図あるいはその両方の各ブロック、およびブロック図またはフローチャート図あるいはその両方に含まれるブロックの組み合わせは、規定された機能もしくは動作を実行するか、または専用ハードウェアとコンピュータ命令の組み合わせを実行する専用ハードウェアベースのシステムによって実装され得るということにも注意する。 The flowcharts and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. In this regard, each block in a flowchart or block diagram may represent a module, segment, or portion of instructions comprising one or more executable instructions for implementing the specified logical function. . In some alternative implementations, the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession will in fact be executed substantially concurrently, or possibly in the reverse order, depending on the functionality involved. Each block in the block diagrams and/or flowchart illustrations, and combinations of blocks included in the block diagrams and/or flowchart illustrations, perform the specified function or operation, or implement a combination of dedicated hardware and computer instructions. Note also that it may be implemented by a dedicated hardware-based system for execution.

コンピュータ・システム１０は、新しい最適化技術を使用して分散システム内のデータを管理するする自動化された運用データ管理プロセスのためのプログラム命令を実行する。したがって、本発明を具現化するプログラムは、さまざまなデータ管理ツールの従来の特徴をさらに含んでよく、本開示を参照するときに、それらの詳細が当業者にとって明らかになる。それらのツールの一部は、クラウド・コンピューティングに関連し得る。本開示にはクラウド・コンピューティングに関する詳細な説明が含まれているが、本明細書において示された内容の実装は、クラウド・コンピューティング環境に限定されないと理解されるべきである。本発明の実施形態は、現在既知であるか、または今後開発される任意のその他の種類のコンピューティング環境と組み合わせて実装できる。 Computer system 10 executes program instructions for an automated operational data management process that manages data in distributed systems using novel optimization techniques. Thus, programs embodying the invention may further include conventional features of various data management tools, the details of which will be apparent to those skilled in the art upon reviewing this disclosure. Some of those tools may be related to cloud computing. Although this disclosure includes detailed discussion regarding cloud computing, it should be understood that the implementation of the material presented herein is not limited to cloud computing environments. Embodiments of the invention may be implemented in conjunction with any other type of computing environment now known or later developed.

クラウド・コンピューティングは、構成可能な計算リソース（例えば、ネットワーク、ネットワーク帯域幅、サーバ、処理、メモリ、ストレージ、アプリケーション、仮想マシン、およびサービス）の共有プールへの便利なオンデマンドのネットワーク・アクセスを可能にするためのサービス提供モデルであり、管理上の手間またはサービス・プロバイダとのやりとりを最小限に抑えて、それらのリソースを迅速にプロビジョニングおよび解放することができる。クラウド・モデルは、さまざまな特性、サービス・モデル、およびデプロイメント・モデルを含むことができる。 Cloud computing provides convenient, on-demand network access to shared pools of configurable computing resources (e.g., networks, network bandwidth, servers, processing, memory, storage, applications, virtual machines, and services). It is a service delivery model to enable rapid provisioning and release of those resources with minimal administrative effort or interaction with service providers. A cloud model can include different characteristics, service models, and deployment models.

特性は、オンデマンドのサービス、幅広いネットワーク・アクセス、リソース・プール、迅速な順応性、および測定されるサービスを含むことができるが、これらに限定されない。オンデマンドのセルフ・サービスは、サーバの時間およびネットワーク・ストレージなどの計算能力を一方的に、サービス・プロバイダとの人間的なやりとりを必要とせず、必要に応じて自動的にプロビジョニングする、クラウドの利用者の能力のことを指す。幅広いネットワーク・アクセスは、ネットワークを経由して利用可能である、標準的なメカニズムを使用してアクセスされる能力のことを指し、異種のシン・クライアントまたはシック・クライアント・プラットフォーム（例えば、携帯電話、ラップトップ、およびパーソナル・デジタル・アシスタントなど）による利用を促進する。リソース・プールは、プロバイダの計算リソースがプールされ、マルチテナント・モデルを使用して複数の利用者に提供されるときに発生し、さまざまな物理的および仮想的リソースが、要求に従って動的に割り当ておよび再割り当てされる。場所に依存しないという感覚があり、利用者は通常、提供されるリソースの正確な場所に関して管理することも知ることもないが、さらに高い抽象レベルでは、場所（例えば、国、州、またはデータセンター）を指定できる場合がある。迅速な順応性は、クラウドの能力が、迅速かつ柔軟に、場合によっては自動的にプロビジョニングされ、素早くスケールアウトし、迅速に解放されて素早くスケールインすることができるということを意味する。プロビジョニングに使用できる能力は、利用者には、多くの場合、任意の量をいつでも無制限に購入できるように見える。測定されるサービスは、計測機能を活用することによって、サービスの種類（例えば、ストレージ、処理、帯域幅、およびアクティブなユーザのアカウント）に適した抽象レベルで、リソースの使用を自動的に制御および最適化する、クラウド・システムの能力である。リソースの使用状況は監視、制御、および報告することができ、利用されるサービスのプロバイダと利用者の両方に透明性が提供される。 Characteristics can include, but are not limited to, on-demand services, broad network access, resource pools, rapid adaptability, and metered services. On-demand self-service is a cloud-based service that automatically provisions server time and computing power, such as network storage, unilaterally, as needed, without the need for human interaction with the service provider. Refers to the ability of the user. Broad network access refers to the ability to be accessed using standard mechanisms that are available over a network and can be accessed by heterogeneous thin or thick client platforms (e.g. mobile phones, laptops, and personal digital assistants). Resource pooling occurs when a provider's computational resources are pooled and served to multiple consumers using a multi-tenant model, with different physical and virtual resources dynamically allocated according to demand. and reassigned. There is a sense of location independence, where consumers typically have no control or knowledge as to the exact location of the resources provided, but at a higher level of abstraction, location (e.g., country, state, or data center) ) may be specified. Rapid adaptability means that cloud capacity can be provisioned quickly and flexibly, sometimes automatically, scaled out quickly, released quickly and scaled in quickly. The capacity available for provisioning often appears to the consumer as unlimited purchases of any amount at any time. Metered services automatically control and control resource usage at a level of abstraction appropriate to the type of service (e.g., storage, processing, bandwidth, and active user accounts) by leveraging metering capabilities. It is the ability of the cloud system to optimize. Resource usage can be monitored, controlled and reported, providing transparency to both providers and consumers of the services utilized.

サービス・モデルは、ＳａａＳ（Software as a Service）、ＰａａＳ（Platform as a Service）、およびＩａａＳ（Infrastructureas a Service）を含むことができるが、これらに限定されない。ＳａａＳ（Software as aService）は、クラウド・インフラストラクチャ上で稼働しているプロバイダのアプリケーションを利用する、利用者に提供される能力のことを指す。それらのアプリケーションは、Ｗｅｂブラウザなどのシン・クライアント・インターフェイスを介して、さまざまなクライアント・デバイスからアクセスできる。利用者は、ネットワーク、サーバ、オペレーティング・システム、ストレージ、または個々のアプリケーション機能を含む基盤になるクラウド・インフラストラクチャを、限定的なユーザ固有のアプリケーション構成設定を行う可能性を除き、管理することも制御することもない。ＰａａＳ（Platform as a Service）は、プロバイダによってサポートされるプログラミング言語およびツールを使用して作成された、利用者が作成または取得したアプリケーションをクラウド・インフラストラクチャにデプロイする、利用者に提供される能力のことを指す。利用者は、ネットワーク、サーバ、オペレーティング・システム、またはストレージを含む基盤になるクラウド・インフラストラクチャを管理することも制御することもないが、デプロイされたアプリケーション、および場合によってはアプリケーション・ホスティング環境の構成を制御することができる。ＩａａＳ（Infrastructure as a Service）は、処理、ストレージ、ネットワーク、およびその他の基本的な計算リソースをプロビジョニングする、利用者に提供される能力のことを指し、利用者は、オペレーティング・システムおよびアプリケーションを含むことができる任意のソフトウェアをデプロイして実行できる。利用者は、基盤になるクラウド・インフラストラクチャを管理することも制御することもないが、オペレーティング・システム、ストレージ、デプロイされたアプリケーションを制御することができ、場合によっては、選択されたネットワーク・コンポーネント（例えば、ホスト・ファイアウォール）を限定的に制御できる。 Service models can include, but are not limited to, SaaS (Software as a Service), PaaS (Platform as a Service), and IaaS (Infrastructure as a Service). Software as a Service (SaaS) refers to the ability provided to consumers to consume a provider's applications running on a cloud infrastructure. These applications can be accessed from a variety of client devices through thin client interfaces such as web browsers. Customers may also manage the underlying cloud infrastructure, including networks, servers, operating systems, storage, or individual application functions, except for the possibility of limited user-specific application configuration settings. can't even control. PaaS (Platform as a Service) is the ability provided to the customer to deploy applications created or acquired by the customer, written using programming languages and tools supported by the provider, onto a cloud infrastructure. refers to Consumers do not manage or control the underlying cloud infrastructure, including networks, servers, operating systems, or storage, but configure deployed applications and, in some cases, application hosting environments can be controlled. Infrastructure as a Service (IaaS) refers to the ability provided to consumers to provision processing, storage, networking, and other basic computing resources, including operating systems and applications can deploy and run any software that can Consumers do not manage or control the underlying cloud infrastructure, but they do have control over operating systems, storage, deployed applications, and in some cases, selected network components. (e.g. host firewall) can be controlled in a limited manner.

デプロイメント・モデルは、プライベート・クラウド、コミュニティ・クラウド、パブリック・クラウド、およびハイブリッド・クラウドを含むことができるが、これらに限定されない。プライベート・クラウドは、ある組織のためにのみ運用されているクラウド・インフラストラクチャのことを指す。プライベート・クラウドは、この組織またはサード・パーティによって管理することができ、オンプレミスまたはオフプレミスに存在することができる。コミュニティ・クラウドは、複数の組織によって共有され、関心事（例えば、任務、セキュリティ要件、ポリシー、およびコンプライアンスに関する考慮事項）を共有している特定のコミュニティをサポートする、クラウド・インフラストラクチャを含む。コミュニティ・クラウドは、これらの組織またはサード・パーティによって管理することができ、オンプレミスまたはオフプレミスに存在することができる。パブリック・クラウドでは、クラウド・インフラストラクチャは、一般ユーザまたは大規模な業界団体が使用できるようになっており、クラウド・サービスを販売する組織によって所有される。ハイブリッド・クラウドのクラウド・インフラストラクチャは、データおよびアプリケーションの移植を可能にする標準化された技術または独自の技術（例えば、クラウド間の負荷バランスを調整するためのクラウド・バースト）によって固有の実体を残したまま互いに結合された２つ以上のクラウド（プライベート、コミュニティ、またはパブリック）の複合である。 Deployment models can include, but are not limited to, private cloud, community cloud, public cloud, and hybrid cloud. A private cloud is a cloud infrastructure that is operated solely for an organization. A private cloud can be managed by this organization or a third party and can exist on-premises or off-premises. A community cloud includes cloud infrastructure that is shared by multiple organizations and supports a particular community that has shared concerns (eg, missions, security requirements, policies, and compliance considerations). Community clouds can be managed by these organizations or a third party and can exist on-premises or off-premises. In public clouds, the cloud infrastructure is made available to the general public or large industry associations and is owned by an organization that sells cloud services. The cloud infrastructure of a hybrid cloud remains a unique entity with standardized or proprietary technologies that enable data and application portability (e.g., cloud bursting for load balancing between clouds). A composite of two or more clouds (private, community, or public) that remain coupled together.

クラウド・コンピューティング環境は、ステートレス、疎結合、モジュール性、および意味的相互運用性に重点を置いたサービス指向の環境であることができる。クラウド・コンピューティングの中心になるのは、相互接続されたノードのネットワークを含むインフラストラクチャである。図２に、例示的なクラウド・コンピューティング環境５０が示されている。図示されているように、クラウド・コンピューティング環境５０は、クラウドの利用者によって使用されるローカル・コンピューティング・デバイス（例えば、パーソナル・デジタル・アシスタント（ＰＤＡ：personal digital assistant）または携帯電話５４ａ、デスクトップ・コンピュータ５４ｂ、ラップトップ・コンピュータ５４ｃ、または自動車コンピュータ・システム５４ｄ、あるいはその組み合わせなど）が通信できる、ネットワーク５６内の１つまたは複数のクラウド・コンピューティング・ノード５２を含む。ノード５２は、互いに通信してもよい。ノード５２は、１つまたは複数のネットワーク内で、上で説明されたプライベート・クラウド、コミュニティ・クラウド、パブリック・クラウド、もしくはハイブリッド・クラウド、またはこれらの組み合わせなどに、物理的または仮想的にグループ化される（図示されていない）。これによって、クラウド・コンピューティング環境５０は、クラウドの利用者がローカル・コンピューティング・デバイス上でリソースを維持する必要のないインフラストラクチャ、プラットフォーム、またはＳａａＳ、あるいはその組み合わせを提供できる。図２に示されたコンピューティング・デバイス７５４ａ～ｄの種類は、例示のみが意図されており、コンピューティング・ノード５２およびクラウド・コンピューティング環境５０は、任意の種類のネットワークまたはネットワーク・アドレス可能な接続（例えば、Ｗｅｂブラウザを使用した接続）あるいはその両方を経由して任意の種類のコンピュータ制御デバイスと通信できるということが理解される。 A cloud computing environment can be a service-oriented environment with an emphasis on statelessness, loose coupling, modularity, and semantic interoperability. At the heart of cloud computing is an infrastructure that includes a network of interconnected nodes. An exemplary cloud computing environment 50 is shown in FIG. As shown, cloud computing environment 50 includes local computing devices (e.g., personal digital assistants (PDAs) or mobile phones 54a, desktops, etc.) used by cloud subscribers. • including one or more cloud computing nodes 52 in network 56 with which computers 54b, laptop computers 54c, or automotive computer systems 54d, or combinations thereof, can communicate; Nodes 52 may communicate with each other. Nodes 52 may be physically or virtually grouped within one or more networks, such as private clouds, community clouds, public clouds, or hybrid clouds as described above, or combinations thereof. (not shown). This allows cloud computing environment 50 to provide an infrastructure, platform, and/or SaaS that does not require cloud customers to maintain resources on local computing devices. The types of computing devices 754a-d shown in FIG. 2 are intended to be exemplary only, and computing nodes 52 and cloud computing environment 50 may be any type of network or network-addressable. It is understood that any type of computerized device can be communicated via a connection (eg, using a web browser) or both.

ここで図３を参照すると、本発明に従って構築された運用データ管理システム６０の１つの実施形態が示されている。運用データ管理システム６０は、通常、運用機能またはモジュール６２、システム・モニタ機能またはモジュール６４、およびデータ移動最適化機能またはモジュール６６を含む。運用モジュール６２は、データ工学（情報工学）６８、サービス水準合意７０、ならびにデータ・タイプ／オブジェクトの特性およびアドレスの集合７２を含む。データ工学６８は、運用モジュール６２の主要な機能を備えており、アプリケーションを計画、分析、設計、および実装するためのアーキテクチャの方法を含む。その具体的な機能は、特定のアプリケーションに従って変化する。サービス水準合意７０は、サービス・プロバイダと１つまたは複数のクライアントの間の契約である。品質、可用性、および責任などのサービスの特定の特徴が、サービス・プロバイダとクライアントの間で合意される。サービス水準合意は、対応するサービスレベル目標と共に、多数のサービス・パフォーマンス・メトリック（service-performance metrics）を含むことができる。金融サービスの例の場合、サービス水準合意指標は、サービスの可用性、コストのトレードオフ、およびサポートの応答時間を含むことができる。運用モジュール６２において具現化されるようなサービス水準合意７０は、契約に記載された定量値を反映する。データ・タイプ／オブジェクトの特性およびアドレスは、データ工学６８またはサービス水準合意７０のいずれかに適用されるアプリケーションの関連するデータの特徴を説明するために使用される。特性は、データ・タイプの基本的な特徴に加えて、より高度な特徴（データ構造、クラスなど）を含んでよい。金融サービスの例の場合、具体的な特性は、構造化データおよび非構造化データ、グラフ・データ、およびビッグ・データの実装を必要とする大量の情報を含むことができ、データ・ソースおよびデータ・ターゲットは、名前、アドレス、電話番号、社会保障番号、もしくは納税者登録番号、またはその他の識別番号などのデータ・タイプを含む。アドレスは、データが格納され、取り出されることになる（物理的または仮想的な）ネットワークの位置であり、使用されるプロトコル（ＨＴＴＰ、ＴＣＰ／ＩＰなど）に応じて、種々の形態であってよい。 Referring now to Figure 3, one embodiment of an operational data management system 60 constructed in accordance with the present invention is shown. Operational data management system 60 typically includes operational functions or modules 62 , system monitoring functions or modules 64 , and data movement optimization functions or modules 66 . The operations module 62 includes data engineering (information engineering) 68, service level agreements 70, and a set of data type/object properties and addresses 72. FIG. Data Engineering 68 comprises the primary functionality of the Operations Module 62 and includes architectural methods for planning, analyzing, designing and implementing applications. Its specific function will vary according to the specific application. A service level agreement 70 is a contract between a service provider and one or more clients. Certain characteristics of the service such as quality, availability and liability are agreed between the service provider and the client. A service level agreement may include a number of service-performance metrics along with corresponding service level objectives. For the financial services example, service level agreement metrics may include service availability, cost trade-offs, and support response times. The service level agreement 70 as embodied in the operations module 62 reflects the quantitative values stated in the contract. Data type/object properties and addresses are used to describe relevant data characteristics of applications that apply to either data engineering 68 or service level agreements 70 . Properties may include more advanced features (data structures, classes, etc.) in addition to the basic features of the data type. For the financial services example, specific characteristics can include large amounts of information requiring structured and unstructured data, graph data, and big data implementations, and data sources and data • Targets include data types such as names, addresses, phone numbers, social security or tax registration numbers, or other identifying numbers. An address is a network location (physical or virtual) where data is to be stored and retrieved, and can be of various forms depending on the protocol used (HTTP, TCP/IP, etc.) .

システム・モニタ・モジュール６４は、分散データ・システムのコンポーネントの現在および過去の運用性能を追跡する。システム・モニタ・モジュール６４は、現在の情報を、中央処理装置（ＣＰＵ：central processing unit）、ディスク・ドライブまたはその他の永続的（不揮発性）メモリ、揮発性メモリ（すなわち、ＲＡＭ）、これらまたはその他のリソースのクラスタなどの、さまざまなハードウェア・ツール７６から受信する。この情報は、性能の使用状況、割り当て、電力消費、リソースの可用性などの、デバイスに関連付けられた任意のパラメータを含むことができる。この情報は、現在のシステム能力７８を構築するために使用される。システム・モニタ６４は、期間（例えば、ピーク使用時間）、特定のクライアント、または特定のサービスなどの、種々のパラメータと相関関係がある過去の運用性能情報８０（すなわち、デバイスの使用状況および能力の履歴）も含む。 System monitor module 64 tracks current and historical operational performance of components of the distributed data system. System monitor module 64 stores current information in a central processing unit (CPU), disk drive or other persistent (non-volatile) memory, volatile memory (i.e., RAM), these or other resources from various hardware tools 76, such as clusters of resources. This information can include any parameters associated with the device, such as performance usage, allocation, power consumption, resource availability, and the like. This information is used to build current system capabilities 78 . System monitor 64 provides historical operational performance information 80 (i.e., device usage and capabilities) correlated to various parameters such as time periods (e.g., peak usage hours), specific clients, or specific services. history).

データ移動最適化モジュール６６は、分散データ・システム内のデータ移動、ならびに特にシステム設計者によって提供された仕様に対して、一貫性および流通などの、その他のデータ性能要因（data performance factors）を追跡する分離したモニタ８２を含む。この情報は、システム・モニタ・モジュール６４の過去の運用性能８０に提供されることもできる。次に、データ移動最適化モジュールは、下でさらに説明されているように、必要に応じて、現在のデータ性能要因に基づいてデータ同期サービス８４を呼び出すことができる。これらのサービスは、例えば、ファイル・システムのコピー、メッセージング、データベース・アクセス、転送プロトコルなどを含んでよい。したがって、必要に応じてサービス・ワーカー８６が最適化される。 The data movement optimization module 66 tracks data movement within the distributed data system and other data performance factors such as consistency and distribution, particularly against specifications provided by the system designer. It includes a separate monitor 82 that This information may also be provided to the historical operational performance 80 of the system monitor module 64 . The data movement optimization module can then call the data synchronization service 84 based on current data performance factors, as needed, as further described below. These services may include, for example, file system copying, messaging, database access, transfer protocols, and the like. Therefore, service workers 86 are optimized as needed.

例示的な実装では、本発明は、さまざまなモデルを使用して、データ管理を最適化するために使用される入力を提供する。図４に示されているように、プロセス制御メカニズム９０が、サービス水準合意モデル９２、データ移動実行履歴モデル９４、現行システム負荷モデル９６、ならびにデータ・タイプおよびＱｏＳ要件モデル９８から入力を受信する。これらの特徴のいずれかまたはすべてが、コンピュータ・システム１０において具現化されることができる。サービス水準合意モデル９２は、各々について、データ・ソース、ターゲット、および関連するＱｏＳ基準の定義を格納することを規定する。このモデルは、データがシステム全体を通じて配置され、管理される方法を定義する。一般に、アプリケーション開発者は、この定義のみを作成し、その後、実行時に、データがシステムによって自動的に移動されて管理されるため、ソリューション開発者は、システム運用のその部分に重点を置く必要がなく、代わりに領域の価値（domain value）に重点を置くことができ、その領域においてインフラストラクチャのコーディングの投資を必要とせずに、データ・アーキテクチャの変更を採用する柔軟性を有することもできる。例示的な実装では、サービス水準合意モデル９２は、データ・タイプ、コンテキストの必要条件、時間／データの必要条件、および応答時間の要件のリストを含む。 In an exemplary implementation, the present invention uses various models to provide input used to optimize data management. As shown in FIG. 4, process control mechanism 90 receives inputs from service level agreement model 92 , data movement execution history model 94 , current system load model 96 , and data type and QoS requirements model 98 . Any or all of these features may be embodied in computer system 10 . A service level agreement model 92 provides for storing definitions of data sources, targets, and associated QoS criteria for each. This model defines how data is arranged and managed throughout the system. Typically, application developers create only this definition and then at run time the data is automatically moved and managed by the system, so solution developers need to focus on that part of system operation. Instead, they can focus on domain value and have the flexibility to adopt data architecture changes without requiring infrastructure coding investments in that domain. In an exemplary implementation, the service level agreement model 92 includes a list of data types, context requirements, time/data requirements, and response time requirements.

データ移動実行履歴モデル９４はデータ移動要求の過去の運用を反映し、データ移動要求の過去の運用は、処理制御メカニズム９０によって、過去の運用から学習し、現在の運用結果を予測するために使用されることができ、基準を満たす（基準に従う）ために作業工数に適用される必要のあるリソースの量の情報に基づく評価を行うために、使用されることができる。図５と共に下でさらに説明されるように、データ移動実行履歴モデル９４は、機械学習および予測技術を使用して、システムが最適なしきい値で動作することを保証することができる。例示的な実装では、データ移動実行履歴モデル９４は、サービスの統計値の履歴、サービスのリソース消費、およびサービスの種類の実行予測を含む。現行システム負荷モデル９６は、システムの既存のワークロードを追跡する。この情報は、システムの現在の能力、および次回のデータ移動アクションのＱｏＳ基準を満たす能力を理解するために必要とされる。高負荷のシステムは、作業を完了するために、利用可能なリソースと共に、軽度に使用されるシステムよりも多くのバックプレーン操作スレッドの開始を必要とすることがある。当然ながら、これはリアルタイムに変化することがあるため、アクティブな監視および適応が必要である。例示的な実装では、現行システム負荷モデル９６は、リソースの利用、現在のクラスタ・サイズ、および能力評価を含む。データ・タイプおよびＱｏＳ要件モデル９８は、さまざまなデータ・タイプに関する特性、およびＱｏＳ定義を実現できる方法を表す。例えば、待ち時間の少ない流通要件と共にリアルタイムの一貫性を目標にすることは、リレーショナル・データベース、分散ファイル・システム、ブロック・ストレージなどの間で、実装において異なる。例示的な実装では、データ・タイプおよびＱｏＳ要件モデル９８は、データ・タイプ定義およびデータＱｏＳ定義を含む。 A data movement execution history model 94 reflects past operations of data movement requests, which are learned from past operations and used by the processing control mechanism 90 to predict current operational outcomes. and can be used to make an informed assessment of the amount of resources that need to be applied to work effort in order to meet (compliance with) criteria. As further described below in conjunction with FIG. 5, the data movement performance history model 94 can use machine learning and predictive techniques to ensure that the system operates at optimal thresholds. In an exemplary implementation, the data movement performance history model 94 includes history of service statistics, service resource consumption, and service type performance predictions. Current system load model 96 tracks the existing workload of the system. This information is needed to understand the current capabilities of the system and its ability to meet the QoS criteria for the next data movement action. A heavily loaded system may require more backplane operation threads to be started than a lightly used system with available resources to complete work. Of course, this can change in real time, requiring active monitoring and adaptation. In an exemplary implementation, current system load model 96 includes resource utilization, current cluster size, and capacity rating. The data type and QoS requirements model 98 represents the characteristics for various data types and how QoS definitions can be implemented. For example, targeting real-time consistency along with low-latency distribution requirements differ in implementation between relational databases, distributed file systems, block storage, and the like. In an exemplary implementation, data type and QoS requirements model 98 includes data type definitions and data QoS definitions.

プロセス制御メカニズム９０は、柔軟なマイクロサービス１００（データ・バックプレーン）に対して、データを移動する必要性を満たすためにワーカー・スレッドを呼び出すよう指示する。柔軟なマイクロサービス１００は、異種のシステム・コンポーネントおよび技術にわたってデータを実際に移動（更新または削除）するように、拡張可能なインフラストラクチャを構成する。これらのシステムは、従来型であり、周知の予測可能なさまざまな挙動特性を有する。データ・バックプレーン・サービスは、データ・アーキテクチャにおけるデータの通信のためのメカニズムである。データ・バックプレーン・サービスの例は、ＡｐａｃｈｅのＫａｆｋａである。Ｋａｆｋａは、リアルタイムのデータ・フィードを処理するために、統一された高スループットで待ち時間の少ない通信メカニズムを提供する、オープンソースのストリーム処理ソフトウェア・プラットフォームである。不正検出ソリューションのためのデータ・バックプレーン・サービスは、そのようなメッセージング・インターフェイス、アプリケーション・プログラム・インターフェイス（ＡＰＩ：application program interfaces）、ストリーム、またはデータ通信用のその他の通信手段を含むことがある。柔軟なスケーリングとは、要求に応じてインフラストラクチャを動的に拡大または縮小する能力、すなわち、特定の時点でのアプリケーションの必要性に応じて、物理的ディスク空き容量、メモリ、ＣＰＵなどのリソースを増やすか、または減らす能力のことを指している。 The process control mechanism 90 directs the flexible microservices 100 (data backplane) to invoke worker threads to meet the need to move data. Flexible microservices 100 constitute an extensible infrastructure to actually move (update or delete) data across disparate system components and technologies. These systems are conventional and have a variety of known and predictable behavioral characteristics. A data backplane service is a mechanism for communication of data in a data architecture. An example of a data backplane service is Apache's Kafka. Kafka is an open source stream processing software platform that provides a unified, high-throughput, low-latency communication mechanism for processing real-time data feeds. Data backplane services for fraud detection solutions may include such messaging interfaces, application program interfaces (APIs), streams, or other communication means for data communication. . Elastic scaling is the ability to dynamically grow or shrink your infrastructure on demand, i.e. resources such as physical disk space, memory, and CPU, depending on your application's needs at a given point in time. It refers to the ability to increase or decrease.

例示的な実装では、プロセス制御メカニズム９０は、データに最適化された選択、データ・サービス、サービス・フィードバック、およびデータ移動のディスパッチを含む。データに最適化された選択は、応答時間要件をサービス水準合意モデル９２から受信し、データ・サービスによって処理されるサービスの順序を選択する。データ・サービスは、データ・タイプおよびＱｏＳ定義をデータ・タイプおよびＱｏＳ要件モデル９８から受信し、どのバックプレーン・サービスが特定のデータ・タイプに適しているかを決定する。その後、データ・サービスは、必要なデータ・バックプレーン・サービスを開始するように、データ移動のディスパッチを順序付けることができる。データ・バックプレーン・サービスは、プロセス制御メカニズム９０のサービス・フィードバックにフィードバックを提供し、プロセス制御メカニズム９０は、データ移動実行履歴モデル９４においてサービスの統計値の履歴を更新することもできる。 In an exemplary implementation, the process control mechanisms 90 include data optimized selection, data services, service feedback, and data movement dispatch. The data-optimized selection receives response time requirements from the service level agreement model 92 and selects the order of services to be processed by the data service. A data service receives data type and QoS definitions from the data type and QoS requirements model 98 and determines which backplane service is suitable for a particular data type. The data service can then sequence data movement dispatches to initiate the necessary data backplane services. The data backplane service provides feedback to the process control mechanism 90 service feedback, and the process control mechanism 90 can also update the history of service statistics in the data movement execution history model 94 .

好ましい実装では、データ管理システムの予測機能が、新しい認知システムにおいて具現化される。認知システム（深層学習、ディープ・ソート、または深層質問回答（deep question answering）と呼ばれることもある）は、機械学習および問題解決を使用する人工知能の形態である。認知システムは、多くの場合、ニューラル・ネットワークを採用するが、代替の設計が存在する。ニューラル・ネットワークは、さまざまな種類であってよい。フィードフォワード・ニューラル・ネットワークは、ユニット間の接続が循環を形成しない人工ニューラル・ネットワークである。フィードフォワード・ニューラル・ネットワークは、最初に考案された最も単純な種類の人工ニューラル・ネットワークだった。このネットワークでは、情報が、入力ノードから（もしあれば）隠れノードを通って出力ノードへ、１つの方向（前方）のみに移動する。このネットワークには循環もループも存在しない。そのため、このネットワークは、回帰型ニューラル・ネットワークとは異なる。回帰型ニューラル・ネットワークは、ユニット間の接続が有向循環を形成する人工ニューラル・ネットワークの一種である。回帰型ニューラル・ネットワークは、動的な一時的挙動を示すことができるようにするネットワークの内部状態を作り出す。フィードフォワード・ニューラル・ネットワークとは異なり、回帰型ニューラル・ネットワークは、内部メモリを使用して、任意の順序の入力を処理することができる。畳み込みニューラル・ネットワークは、動物の視覚に基づく特定の種類のフィードフォワード・ニューラル・ネットワークであるため、画像データを処理すること特に役立つ。畳み込みニューラル・ネットワークは、通常のニューラル・ネットワークに類似しているが、学習可能な重みおよびバイアスを有するニューロンで構成されている。 In a preferred implementation, the predictive functionality of the data management system is embodied in the new cognitive system. Cognitive systems (sometimes called deep learning, deep sorting, or deep question answering) are forms of artificial intelligence that use machine learning and problem solving. Cognitive systems often employ neural networks, but alternative designs exist. Neural networks can be of various types. A feedforward neural network is an artificial neural network in which the connections between units do not form a cycle. Feedforward neural networks were the first and simplest kind of artificial neural network to be devised. In this network, information travels in only one direction (forward) from input nodes through hidden nodes (if any) to output nodes. There are no cycles or loops in this network. This network is therefore different from recurrent neural networks. A recurrent neural network is a type of artificial neural network in which the connections between units form a directed cycle. A recurrent neural network creates an internal state of the network that allows it to exhibit dynamic temporal behavior. Unlike feedforward neural networks, recurrent neural networks can use internal memory to process inputs in any order. Convolutional neural networks are particularly useful for processing image data because they are a particular type of feedforward neural network based on animal vision. Convolutional neural networks are similar to regular neural networks, but are composed of neurons with learnable weights and biases.

サポート・ベクター・マシン（ＳＶＭ：support vector machine）などの、機械学習のためのニューラル・ネットワークの使用の多くの代替手段が存在する。ＳＶＭは、基本的に、トレーニング例に基づいて多次元の数学的空間を構築し、その空間内に境界を提供し、入力の２項分類（例えば、「良い」回答と「悪い」回答）を可能にする。別の方法は、有向非環状グラフで変数のセットを表すベイジアン・ネットワークを含む。このネットワークは、変数間の確率的関係を計算するために使用される。認知システムは、単一の方法の使用に限られず、すなわち、任意の数のこれらの機械学習アルゴリズムを組み込むことができる。 There are many alternatives to using neural networks for machine learning, such as support vector machines (SVMs). An SVM essentially constructs a multi-dimensional mathematical space based on training examples, provides boundaries within that space, and performs a binary classification of inputs (e.g., “good” and “bad” answers). to enable. Another method involves Bayesian networks that represent a set of variables in a directed acyclic graph. This network is used to compute probabilistic relationships between variables. A cognitive system is not limited to using a single method, ie it can incorporate any number of these machine learning algorithms.

人工知能の最新の実装は、ＩＢＭＷａｔｓｏｎ（ＴＭ）認知技術であり、この技術は、高度な自然言語処理、情報検索、知識表現、自動推論、および機械学習技術を、開領域質問回答の分野に適用する。そのような認知システムは、既存の文書（コーパス）に依存し、人、位置、組織、および特定の物体などの、照会に関連する回答を抽出するか、または肯定的感情および否定的感情を識別するために、それらの文書をさまざまな方法で分析することができる。さまざまな技術を使用して、自然言語を分析すること、ソースを識別すること、仮説を見つけて生成すること、証拠を見つけてスコア付けすること、および仮説をマージして順位付けすることができる。回答のスコア付けおよび順位付けのためのモデルは、質問（入力）と回答（出力）の対の大きいセットに基づいてトレーニングされることができる。同じ回答を独立して見つけるアルゴリズムが多いほど、回答が正しい可能性が高くなり、その結果、全体的スコアまたは信頼水準が高くなる。 The latest implementation of artificial intelligence is IBM Watson™ Cognitive Technology, which applies advanced natural language processing, information retrieval, knowledge representation, automated reasoning, and machine learning techniques to the field of open-domain question answering. Apply. Such cognitive systems rely on pre-existing documents (corpora) to extract answers relevant to queries, such as people, locations, organizations, and specific objects, or to identify positive and negative emotions. To do so, these documents can be analyzed in a variety of ways. Various techniques can be used to analyze natural language, identify sources, find and generate hypotheses, find and score evidence, and merge and rank hypotheses . A model for answer scoring and ranking can be trained based on a large set of question (input) and answer (output) pairs. The more algorithms that independently find the same answer, the more likely the answer is correct, resulting in a higher overall score or confidence level.

図５は、本発明の１つの実装に従って新しい認知システム１２０がトレーニングされて適用されることができる方法を示している。認知システム１２０の予測機能は、トレーニング・データ１２２として使用される履歴情報に基づく。この例では、認知システムは、不正検出ソリューションを提供する金融サービス・アプリケーションの進行中の運用の結果を提供するために使用される。したがって、トレーニング・データ１２２は、ＱｏＳ要件に関する実際の結果を伴うさまざまな状況における前の運用因子の例を構成する。例えば、トレーニング・データ１２２は、ピーク使用時間または活動の一時的静止を反映する時間的情報（時刻、曜日、月の日付、またはその他の暦日付など）、リソースの可用性のスナップショット、サービスを提供されている特定の顧客（または単に、顧客の数）、トランザクション負荷（すなわち、最近要求されたか、または現在処理中のトランザクションの数）、および運用システムによって使用されている通信回線上のネットワーク・トラフィックを含むことができる。トレーニング・データ１２２内の各データ点は、この情報および、ＱｏＳ要件と比較した（すべてのデータ・タイプの）データ移動パラメータと相関関係があるその他の情報を含むことができる。言い換えると、データ点は、一部のＱｏＳ要件が満たされており、他のＱｏＳ要件が満たされていない、特定のデータ管理状態（過去の運用結果）をもたらす入力因子を提供する。このトレーニングは、特定の運用状況に関して、特定のＱｏＳ要件が満たされない可能性を認知システム１２０に学習させる。過去の運用因子は、データ・バックプレーン・サービスからのサービス・フィードバックで更新されることができる。 FIG. 5 illustrates how a new cognitive system 120 can be trained and applied according to one implementation of the invention. The predictive capabilities of cognitive system 120 are based on historical information used as training data 122 . In this example, the cognitive system is used to provide results of ongoing operations of a financial services application that provides fraud detection solutions. Training data 122 thus constitutes examples of previous operational factors in a variety of situations with practical consequences for QoS requirements. For example, training data 122 may include temporal information (such as the time of day, day of the week, date of the month, or other calendar date) that reflects peak usage times or temporary quiescence in activity, snapshots of resource availability, service provision the specific customer (or simply, the number of customers) being served, the transaction load (i.e., the number of transactions recently requested or currently being processed), and the network traffic on the communication lines used by the operational system. can include Each data point in training data 122 may contain this information and other information that correlates with data movement parameters (for all data types) compared to QoS requirements. In other words, the data points provide the input factors that lead to a particular data management state (past operational results) where some QoS requirements are met and others are not. This training allows the cognitive system 120 to learn the likelihood that certain QoS requirements will not be met for certain operational situations. Past operating factors can be updated with service feedback from the data backplane service.

認知システム１２０は、そのようにトレーニングされた後に、現在の因子に基づいて可能性のある挙動を予測するために、運用データ管理システムによって使用されることができる。現行システム運用因子（current system operational factors）１２４は認知システム１２０に供給され、これらの因子は、トレーニング・データと同じ種類（時間的、リソースなど）の入力を含む。特定の機械学習アルゴリズムが認知システム１２０に実装された状態で、予測された運用結果がデータ管理システムのデータ移動最適化１２６に転送されることができる。不正検出の例に加えて、認知システムは、計算、リソース、データ割り当て、またはサービスの使用可能時間の要件のいずれかまたはすべてが現在または近い将来に損なわれる可能性があることの指示を提供することができる。次に、データ移動最適化１２６が、損なわれる特定のＱｏＳ要件に基づいて、これらの欠陥をより効果的に処理するために必要なデータ・バックプレーン・サービスを優先することができる。１つの実装では、予測された運用結果は、ＱｏＳの失敗の可能性を（認知システムによって生成された、特定のＱｏＳ基準が満たされないことの信頼値に基づいて）定量的形態で示す異なる値を割り当てることができ、データ移動最適化１２６は、失敗する可能性が最も高いとして示された要件に関連付けられたサービスを優先することができ、すなわち、それらのサービスを最初に呼び出すか、またはさらに多くのそれらのサービスを、より高い失敗の可能性を有するＱｏＳ基準に提供するか、あるいはその両方を行うことができる。 After being so trained, the cognitive system 120 can be used by an operational data management system to predict likely behavior based on current factors. Current system operational factors 124 are provided to cognitive system 120, and these factors include the same types of inputs (temporal, resource, etc.) as training data. With specific machine learning algorithms implemented in the perception system 120, predicted operational results can be forwarded to the data movement optimization 126 of the data management system. In addition to fraud detection examples, cognitive systems provide indications that any or all of the computation, resource, data allocation, or service uptime requirements may be compromised now or in the near future. be able to. Data movement optimization 126 can then prioritize the necessary data backplane services to more effectively handle these deficiencies based on the specific QoS requirements that are compromised. In one implementation, the predicted operational outcome includes different values that indicate in quantitative form the probability of QoS failure (based on the confidence generated by the cognitive system that certain QoS criteria are not met). and data movement optimization 126 may prioritize services associated with requirements indicated as most likely to fail, i.e., call those services first, or even more , can be offered to QoS criteria with a higher probability of failure, or both.

本発明は、１つの実装に従ってデータ管理プロセス１５０の論理の流れを示す図６のチャートを参照して、さらに理解されることができる。コンピュータ・システム１０または分散システムを含む任意のコンピュータ・システムに対して実行されるプロセス１５０は、ソースおよびターゲットのデータ定義ならびにそれらのソースおよびターゲットのサービス品質基準を受信することから開始する（１５２）。使用される特定のコーディングおよび変数に従って、アプリケーション開発者によって定義が提供されることができる。データ管理システムは、運用ワークロードを継続的に監視する（１５４）。監視される因子は、例えば、リソースの使用状況、能力、および応答時間を含むことができる。それによって、進行中の運用データ・フローに対する現在の評価が確立される（１５６）。現在の評価が、目前の運用結果を予測するために使用される（１５８）。これらの結果は、認知システムによって、過去の運用データの移動に関する履歴情報を使用して識別されることができる（１６０）。予測された結果は、どのＱｏＳ基準が危険にさらされているかを識別するデータ移動最適化を可能にする（１６２）。一部のＱｏＳ基準は、より大きい危険にさらされており、それに応じて、リソースの割り当てにおける優先度が付与される。次に、データ移動最適化が、識別されたＱｏＳ基準を向上させるために、適切な最適化インフラストラクチャを適用することができる（１６４）。最適化インフラストラクチャ（データ・バックプレーン・サービス）は、必要に応じて達成されるべきＱｏＳの目標を満たすために、ワーカー・スレッドを生成する。運用が継続する限り（１６６）、プロセスが反復してボックス１５４に戻り、監視を継続する。 The present invention can be further understood with reference to the chart of FIG. 6, which illustrates the logic flow of data management process 150 according to one implementation. Process 150, which is performed on computer system 10 or any computer system, including distributed systems, begins by receiving source and target data definitions and quality of service criteria for those sources and targets (152). . Definitions can be provided by the application developer according to the specific coding and variables used. The data management system continuously monitors the operational workload (154). The monitored factors can include, for example, resource usage, capacity, and response time. A current valuation is thereby established for ongoing operational data flows (156). Current valuations are used to predict immediate operational outcomes (158). These results can be identified 160 by the cognitive system using historical information about past operational data movements. The predicted results allow data movement optimization to identify which QoS criteria are in jeopardy (162). Some QoS criteria are more at risk and are given priority in resource allocation accordingly. Data movement optimization can then apply the appropriate optimization infrastructure to improve the identified QoS criteria (164). The optimization infrastructure (data backplane services) spawns worker threads as needed to meet the QoS goals to be achieved. As long as operation continues (166), the process iterates back to box 154 to continue monitoring.

それによって、本発明は、データ・バックプレーンが特定の時点でのアプリケーションの必要性に動的に適応することができるように、ＱｏＳの規格を満たすためのワークロードの監視およびその後の予測スケーリングと組み合わせて、データ・バックプレーンの優れた順応性を実現する。本発明は特定の実施形態を参照して説明されたが、この説明は、制限の意味で解釈されるよう意図されていない。開示された実施形態の種々の変更および本発明の代替の実施形態は、本発明の説明を参照するときに、当業者にとって明らかになるであろう。したがって、添付の特許請求の範囲において定義された本発明の思想または範囲から逸脱することなく、そのような変更が行われるということが企図される。 The present invention thereby provides workload monitoring and subsequent predictive scaling to meet QoS specifications so that the data backplane can dynamically adapt to the needs of the application at any given time. Together, they provide superior flexibility for data backplanes. Although the invention has been described with reference to particular embodiments, this description is not meant to be construed in a limiting sense. Various modifications of the disclosed embodiments, as well as alternative embodiments of the invention, will become apparent to persons skilled in the art upon reference to the description of the invention. It is therefore contemplated that such modifications may be made without departing from the spirit or scope of the invention as defined in the appended claims.

Claims

A method of managing operational data in a distributed processing system, comprising:
receiving definitions of data sources and data targets and quality of service criteria for said data sources and said data targets;
monitoring the operational workload of the distributed processing system to establish a current assessment of operational data movement between the data sources and the data targets;
receiving historical information regarding previous operational data movements within the distributed processing system, including previous instances of operational data movements that have resulted in compromising one or more of the quality of service criteria;
determining from the current evaluation and historical information that an upcoming operational data action will not meet a particular one of the quality of service criteria;
and responsive to said determination, automatically applying a data management optimization infrastructure adapted to improve said specified quality of service criteria according to said definition.

a service level agreement model provides the quality of service criteria, the service level agreement model includes data type, context requirements, time/date requirements, and response time requirements;
a data movement execution history model provides the history information, the data movement execution history model includes history of service statistics, service resource consumption, and service type execution prediction;
a current system load model provides said monitoring of said operational workload, said current system load model including resource utilization, current cluster size, and capacity assessment;
a data type and QoS requirements model providing said definitions and characteristics for said data type, said data type and QoS requirements model comprising a data type definition and a data quality of service definition;
a process control mechanism controlling said data management optimization infrastructure to spawn worker threads at network locations remote from said process control mechanism as needed to meet said quality of service criteria; 2. The method of claim 1, wherein the process control mechanisms include data optimized selection, data services, service feedback, and data movement dispatch.

3. The method of claim 2, wherein the response time requirement is based on the data type, the context requirement, and the time/data requirement.

said determining using a cognitive system trained using said historical information, said cognitive system predicting an operational outcome based on said current evaluation, said operational outcome predicting said specified quality of service; 3. The method of claim 1, providing an indication that the criteria are not met.

2. The method of claim 1, wherein the current ratings include resource utilization of resources of the distributed processing system, capabilities of the resources, and response times of the resources.

2. The method of claim 1, wherein the data management optimization infrastructure includes multiple extensible data backplane services.

the distributed processing system provides a fraud detection solution;
for said data sources and data targets to track transaction information including customer name, customer address, customer phone number, customer identification number, transaction volume and transaction type, and said collection of operational data; has a data type that contains case management data for
wherein the quality of service criteria include resource allocations, data integrity specifications, and service uptime;
7. The method of claim 6, wherein the data backplane services include messaging interfaces, application program interfaces, and streams.

A computer system,
one or more processors that process program instructions;
a memory device connected to the one or more processors;
receiving definitions of data sources and data targets and quality of service criteria for said data sources and said data targets; monitoring an operational workload of a distributed processing system to monitor said data sources and said data targets; within said distributed processing system, including establishing a current assessment of operational data movement between targets and previous instances of operational data movement that have resulted in compromised one or more of said quality of service criteria; and determining from said current evaluation and historical information that an upcoming operational data action will not meet a particular one of said quality of service criteria. and automatically applying a data management optimization infrastructure adapted to improve said specified quality of service standards in accordance with said definition, to manage operational data within said distributed processing system. and program instructions residing in said memory device.

a service level agreement model provides the quality of service criteria, the service level agreement model includes data type, context requirements, time/date requirements, and response time requirements;
a data movement execution history model provides the history information, the data movement execution history model includes history of service statistics, service resource consumption, and service type execution prediction;
a current system load model provides said monitoring of said operational workload, said current system load model including resource utilization, current cluster size, and capacity assessment;
a data type and QoS requirements model providing said definitions and characteristics for said data type, said data type and QoS requirements model comprising a data type definition and a data quality of service definition;
a process control mechanism controlling said data management optimization infrastructure to spawn worker threads at network locations remote from said process control mechanism as needed to meet said quality of service criteria; 9. The computer system of claim 8, wherein the process control mechanisms include data-optimized selection, data services, service feedback, and data movement dispatch.

9. The computer system of claim 8, wherein the response time requirement is based on the data type, the context requirement, and the time/data requirement.

said determining using a cognitive system trained using said historical information, said cognitive system predicting an operational outcome based on said current evaluation, said operational outcome predicting said specified quality of service; 9. The computer system of claim 8, providing an indication that the criteria are not met.

9. The computer system of claim 8, wherein said current ratings include resource utilization of resources of said distributed processing system, capabilities of said resources, and response times of said resources.

9. The computer system of claim 8, wherein said data management optimization infrastructure includes multiple extensible data backplane services.

the distributed processing system provides a fraud detection solution;
for said data sources and data targets to track transaction information including customer name, customer address, customer phone number, customer identification number, transaction volume and transaction type, and said collection of operational data; has a data type that contains case management data for
wherein the quality of service criteria include resource allocations, data integrity specifications, and service uptime;
9. The computer system of claim 8, wherein said data backplane services include a messaging interface, an application program interface, and streams.

A computer program product,
a computer readable storage medium;
receiving definitions of data sources and data targets and quality of service criteria for said data sources and said data targets; monitoring an operational workload of a distributed processing system to monitor said data sources and said data targets; within said distributed processing system, including establishing a current assessment of operational data movement between targets and previous instances of operational data movement that have resulted in compromised one or more of said quality of service criteria; and determining from said current evaluation and historical information that an upcoming operational data action will not meet a particular one of said quality of service criteria. and automatically applying a data management optimization infrastructure adapted to improve said specified quality of service standards in accordance with said definition, to manage operational data within said distributed processing system. program instructions residing on said storage medium.

a service level agreement model provides the quality of service criteria, the service level agreement model includes data type, context requirements, time/date requirements, and response time requirements;
a data movement execution history model provides the history information, the data movement execution history model includes history of service statistics, service resource consumption, and service type execution prediction;
a current system load model provides said monitoring of said operational workload, said current system load model including resource utilization, current cluster size, and capacity assessment;
a data type and QoS requirements model providing said definitions and characteristics for said data type, said data type and QoS requirements model comprising a data type definition and a data quality of service definition;
a process control mechanism controlling said data management optimization infrastructure to spawn worker threads at network locations remote from said process control mechanism as needed to meet said quality of service criteria; 16. The computer program product of claim 15, wherein the process control mechanisms include data optimized selection, data services, service feedback, and data movement dispatch.

16. The computer program product of claim 15, wherein the response time requirement is based on the data type, the context requirement, and the time/data requirement.

said determining using a cognitive system trained using said historical information, said cognitive system predicting an operational outcome based on said current evaluation, said operational outcome predicting said specified quality of service; 16. The computer program product of claim 15, providing an indication that a criterion has not been met.

16. The computer program product of claim 15, wherein the current ratings include resource usage of resources of the distributed processing system, capabilities of the resources, and response times of the resources.

16. The computer program product of claim 15, wherein the data management optimization infrastructure includes multiple extensible data backplane services.