JP2021005387A5

JP2021005387A5 -

Info

Publication number: JP2021005387A5
Application number: JP2020142203A
Authority: JP
Filing date: 2020-08-26
Publication date: 2022-04-01
Anticipated expiration: 2036-12-26

Description

今日、大規模なコンピュータインフラストラクチャを使用している企業は、任意の所与の日にその企業のアプリケーションの使用を想定している最大同時ユーザ数に容易に対応するように、仮想デスクトップ／アプリケーションインフラストラクチャの要件を超大規模にする傾向がある。これは、ユーザがパフォーマンスの低下または不当なダウンタイムを被らないことを保証するための手段とみなされている。この手法は合理的に思えるだろうが、残念ながら、ほとんど未使用のまま、または長期間にわたって十分に活用されていないサーバの急増につながっている。この過剰プロビジョニングは、いくつかの例を挙げるだけでも、ライセンス、電力、熱、冷却、およびデータセンターのスペースに関して、企業が年間を通して不必要に増加した設備投資および運用経費を負担することを最終的に意味する。 Today, enterprises with large computer infrastructure can easily accommodate the maximum number of concurrent users they expect to use their applications on any given day. Infrastructure requirements tend to be very large. This is seen as a means of ensuring that users do not suffer performance degradation or unreasonable downtime. This approach may seem reasonable, but unfortunately it has led to a surge in servers that are mostly unused or underutilized for long periods of time. This overprovisioning ultimately means that companies will incur unnecessarily increased capital investment and operating costs throughout the year in terms of licensing, power, heat, cooling, and data center space, to name just a few. Means to.

企業のコンピュータインフラストラクチャのニーズを予想するために、時系列予想のいくつかの異なる手法がこれまでに提案されている。この先行研究は、線形回帰、指数平滑法、自己回帰積分移動平均（ＡＲＩＭＡ）などの古典的なモデルの使用から、サポートベクターマシン（ＳＶＭ）、人工ニューラルネットワーク（ＡＮＮ）、およびファジーロジックなど、より高度な計算知能の非線形手法にまで及んでいる。ｋ最近傍（ｋ－ＮＮ）アルゴリズムも時系列予想に最近拡張されているが、この分野における既存の研究では、いかなる追加の関連メトリクスも参照せずに、個々の過去のデータに焦点が当てられている。したがって、ほとんどのアプリケーションは、過去の類似した最近傍を識別するために１次元の荷重曲線の形状のみを使用して、１次元の一変量時系列に制約される一方で、大きな次元特徴空間間の相関はしばしば無視される。しかしながら、測定されたコンピュータハードウェアメトリクスの多くは、互いに密接に関連しているため、１次元から抽出された履歴に関する知識は、同様の動作を共有する時系列のグループから収集されたものと比較すると不適切である。 Several different methods of time series forecasting have been proposed so far to anticipate the needs of enterprise computer infrastructure. This previous study includes using classical models such as linear regression, exponential smoothing, and autoregressive integrated moving averages (ARIMA), support vector machines (SVM), artificial neural networks (ANN), and fuzzy logic. It extends to highly computational intelligence non-linear methods. The k-nearest neighbor (k-NN) algorithm has also recently been extended to time series conjecture, but existing studies in this area focus on individual historical data without reference to any additional relevant metrics. ing. Therefore, most applications use only one-dimensional load curve shapes to identify similar nearest neighbors in the past, while being constrained to one-dimensional univariate time series, between large dimensional feature spaces. Correlation is often ignored. However, many of the measured computer hardware metrics are closely related to each other, so knowledge of history extracted from one dimension is compared to that collected from time-series groups that share similar behavior. Then it is inappropriate.

一般的な一態様では、本発明は、ネットワークユーザによって要求されるリソース（アプリケーション、データなど）をホストするためにネットワークサーバを採用する企業コンピュータシステムのための、ネットワークリソースおよび／またはインフラストラクチャのニーズを予想するためのコンピュータ実装システムおよび方法に関する。予想に基づいて、ネットワークリソースは、それに応じてスケーリングまたはプロビジョニングすることができる。すなわち、例えば、ネットワークサーバの状態は、過剰な容量を削減しながら、ユーザの要求ニーズを満たすように動的に調整することができる。本発明の予想技法は、クラウドコンピューティング環境にも適用可能である。予想に基づいて、クラウドサーバプールを動的にスケーリングし、そのため、システムの規模は、変化する要求を満たし、システムの負荷が低いときにリソースを浪費することを回避する。 In one general aspect, the invention is a network resource and / or infrastructure need for a corporate computer system that employs a network server to host the resources (applications, data, etc.) required by network users. Regarding computer implementation systems and methods for anticipating. Based on expectations, network resources can be scaled or provisioned accordingly. That is, for example, the state of the network server can be dynamically adjusted to meet the needs of the user while reducing excess capacity. The predictive technique of the present invention is also applicable to cloud computing environments. Based on expectations, the cloud server pool is dynamically scaled, so the scale of the system meets changing demands and avoids wasting resources when the system is under load.

予想は、ネットワークリソースに対するユーザの要求に関する過去のデータに基づいて生成される。あるシナリオでは、相関分析に従ってそれらのメトリクスに基づいてグループ化された要求のバリアントのグループ化に基づいて多変量ｋ最近傍（ｋ－ＮＮ）予想が使用されるので、同じグループ内のメトリクス間の依存性は高くなる。次いで、多変量ｋ－ＮＮアルゴリズムが各グループに対して実行され、多段階先予測が生成される。この手法は、予測速度とともに予測性能が改善されるように、他の無関係なメトリクスの影響を低減させる。別の予想技法は、予測およびストライプサンプリング手法を使用する。単位時間内の未来の要求数を予測し、要求の特徴を分析し、それに従って要求を分類し、スライスサンプリング戦略を決定することによって、未来の詳細なワークロードデータを生成することができる。さらに別の手法は、要求パターン予測に基づく適応スケジューリングアルゴリズムである。このアルゴリズムは、様々なワークロードに対するシステムの適応能力を決定し、過去と現在のシステム負荷に基づいてシステムが使用するクラウドリソースを調整することで、システムが動的負荷を満たしながら無駄を削減することができるようにする。 Forecasts are generated based on historical data about user requests for network resources. In one scenario, multivariate k-nearest neighbor (k-NN) predictions are used based on the grouping of request variants grouped based on those metrics according to correlation analysis, so between the metrics within the same group. Dependency is high. A multivariate k-NN algorithm is then executed for each group to generate a multi-step forward prediction. This technique reduces the impact of other irrelevant metrics so that the prediction performance improves with the prediction speed. Another prediction technique uses prediction and stripe sampling techniques. By predicting the number of future requirements in a unit time, analyzing the characteristics of the requirements, classifying the requirements accordingly, and determining the slice sampling strategy, detailed future workload data can be generated. Yet another approach is an adaptive scheduling algorithm based on demand pattern prediction. This algorithm determines the adaptability of the system to different workloads and adjusts the cloud resources used by the system based on past and present system loads to reduce waste while meeting the dynamic load of the system. To be able to.

本発明の実施形態からのこれらおよび他の利点は、以下の記載から明らかになるであろう。 These and other advantages from embodiments of the invention will become apparent from the description below.

本発明の様々な実施形態は、添付の図面に関連した実施例として本明細書に記載される。 Various embodiments of the invention are described herein as examples in connection with the accompanying drawings.

本発明の様々な実施形態による企業コンピュータシステムのブロック図である。FIG. 3 is a block diagram of a corporate computer system according to various embodiments of the present invention.

本発明の様々な実施形態による図１の企業コンピュータシステムのためのサーバ数推奨を決定するためのプロセスフローのフローチャートである。FIG. 5 is a flow chart of a process flow for determining a server number recommendation for the enterprise computer system of FIG. 1 according to various embodiments of the present invention.

本発明の様々な実施形態による単変量ｋ－ＮＮ手法のための予測方法の一例を図示するグラフである。It is a graph which illustrates an example of the prediction method for the univariate k-NN method by various embodiments of this invention.

本発明の様々な実施形態による図１の予測コンピュータシステムのブロック図である。FIG. 3 is a block diagram of the predictive computer system of FIG. 1 according to various embodiments of the present invention.

本発明の様々な実施形態による図３のコンピュータシステムによって実行されるプロセスフローのフローチャートである。FIG. 3 is a flow chart of a process flow executed by the computer system of FIG. 3 according to various embodiments of the present invention.

本発明の様々な実施形態による図１のリソース予測コンピュータシステムのコントローラの一例を図示する。An example of a controller of the resource prediction computer system of FIG. 1 according to various embodiments of the present invention is illustrated.

１日にわたる企業コンピュータシステムのネットワークサーバに対する典型的な負荷（例えば、要求数）の例を示す。An example of a typical load (eg, number of requests) on a network server of a corporate computer system over a day is shown.

本発明の様々な実施形態による図６のコントローラのための伝達関数の図である。FIG. 6 is a diagram of a transfer function for the controller of FIG. 6 according to various embodiments of the present invention.

一般的な一態様では、本発明の実施形態を使用して、企業に対する未来のコンピュータリソースのニーズを予測することができる。１つの例示的な企業コンピュータシステム１０が図１に図示されている。図１に図示されている企業コンピュータシステム１０は、広域ネットワーク（ＷＡＮ）１４と相互接続された、いくつかのローカルエリアネットワーク（ＬＡＮ）１２を含む。各ＬＡＮ１２は、いくつかのクライアントコンピュータ１６およびいくつかのネットワークサーバ１８を含むことができる。ネットワークサーバ１８は、実装態様に応じて、例えば、そのＬＡＮ１２内の、または他のＬＡＮ１２からのクライアントコンピュータ１６用のコンピュータプログラム、データ、記憶装置、およびプリンタなどのコンピュータリソースをホストしてもよい。 In one general aspect, embodiments of the invention can be used to predict future computer resource needs for an enterprise. One exemplary enterprise computer system 10 is illustrated in FIG. The corporate computer system 10 illustrated in FIG. 1 includes several local area networks (LANs) 12 interconnected with a wide area network (WAN) 14. Each LAN 12 can include some client computers 16 and some network servers 18. The network server 18 may host computer resources such as computer programs, data, storage devices, and printers for the client computer 16 within or from another LAN 12, for example, depending on the implementation mode.

リソース予測コンピュータシステム２０は、様々な実施形態に従って、ネットワークサーバ１８に対する多変量時系列（ＭＴＳ）データに基づいて企業のリソース予測を実行し、ＭＴＳデータはデータベースコンピュータシステム２２に格納される。リソース予測コンピュータシステム２０およびＭＴＳデータベースシステム２２は、図示の目的でＷＡＮ１４に接続されているものとして図１に示されているが、それらの一方または両方を図示されているＬＡＮ１２のうちの１つに含めることができる。それらは、企業のネットワーク内の異なるＬＡＮ１２とＷＡＮ１４に接続することもできる。リソース予測コンピュータシステム２０は、サーバ（複数可）、メインフレーム（複数可）、ワークステーション（複数可）、および／または任意の他の好適なコンピュータデバイスなどの１つ以上の相互接続されたコンピュータデバイスとして実装されてもよい。リソース予測コンピュータシステム２０のそのような各コンピュータ装置は、１つ以上のプロセッサ２４および１つ以上のメモリユニット２６を含んでもよい。メモリユニット２６は、一次コンピュータストレージ（例えば、ＲＡＭおよびＲＯＭ）および二次コンピュータストレージ（例えば、ＨＤＤ、ＳＳＤ、フラッシュ）の両方を含んでもよい。プロセッサ（複数可）２４は、図１に示し、さらに以下に記載するように、変数グルーピングモジュール３０およびｋ最近傍（ｋ－ＮＮ）探索モジュール３２などの、メモリユニット２６に格納されたコンピュータ命令（例えば、ソフトウェア）を実行するマイクロプロセッサ（複数可）を含んでもよい。図示の目的のために、図示されたリソース予測コンピュータシステム２０は、１つのコンピュータのみを含み、ならびに１つのプロセッサ２４および１つのメモリユニット２６のみが図示されているが、本発明はそれに限定されず、リソース予測コンピュータシステム２０を必要に応じてスケーリングすることができることを理解されたい。 The resource prediction computer system 20 executes a company's resource prediction based on multivariate time series (MTS) data for the network server 18 according to various embodiments, and the MTS data is stored in the database computer system 22. The resource prediction computer system 20 and the MTS database system 22 are shown in FIG. 1 as being connected to the WAN 14 for illustration purposes, but one or both of them may be in one of the illustrated LANs 12. Can be included. They can also connect to different LANs 12 and WAN 14s within the corporate network. The resource prediction computer system 20 is one or more interconnected computer devices such as servers (s), mainframes (s), workstations (s), and / or any other suitable computer device. It may be implemented as. Each such computer device of the resource prediction computer system 20 may include one or more processors 24 and one or more memory units 26. The memory unit 26 may include both primary computer storage (eg RAM and ROM) and secondary computer storage (eg HDD, SSD, flash). The processor (s) 24 is a computer instruction (s) stored in the memory unit 26, such as the variable grouping module 30 and the k nearest neighbor (k-NN) search module 32, as shown in FIG. 1 and further described below. For example, it may include a microprocessor (s) running software). For purposes of illustration, the illustrated resource prediction computer system 20 includes only one computer, and only one processor 24 and one memory unit 26, but the invention is not limited thereto. It should be understood that the resource prediction computer system 20 can be scaled as needed.

ＭＴＳデータベース２２は、企業システム１０のネットワークサーバ１８に対する時系列のコンピュータ使用率およびハードウェア統計を格納する。統計には、次のように、各ユーザリソース要求に関連するいくつかの変数の値（以下のものなど）を含めることができる。
－リソースを要求しているユーザのユーザ名
－要求の開始時間
－要求の終了時間
－要求の合計時間
－要求の動作時間
－要求されたプロセスまたはリソース
－要求に対処したネットワークサーバのＩＤ
－要求に対処したネットワークサーバの地理的位置
－ＣＰＵ利用率－ネットワークサーバの一次メモリ（例えば、ＲＡＭ）利用率
－ネットワークサーバのディスクＩＯ（ディスクメモリまたは他の二次コンピュータメモリへの読み書き動作の合計） The MTS database 22 stores time-series computer utilization and hardware statistics for the network server 18 of the enterprise system 10. Statistics can include the values of several variables associated with each user resource request (such as:), as follows:
-User name of the user requesting the resource-Request start time-Request end time-Request total time-Request operation time-Requested process or resource-ID of the network server that responded to the request
-Geographical location of the network server that responded to the request-CPU utilization-Network server primary memory (eg RAM) utilization-Total of network server read / write operations to disk IO (disk memory or other secondary computer memory) )

ＭＴＳデータベース２２は、データベース管理システム（ＤＢＭＳ）ソフトウェアを実行し、かつ好適なＲＡＩＤディスクアレイおよび／または任意の他の好適なデータ記憶機構を含む、必要に応じて１つ以上のデータベースサーバを用いて実装されてもよい。使用量およびハードウェア統計は、ＣＳＶファイルおよび／またはネットワークサーバ１８からの任意の他の好適なデータフォーマットを使用してＭＴＳデータベース２２に送信されてもよい。各ネットワークサーバ１８は、その周期的統計をＭＴＳデータベース２２に送信してもよく、および／またはＬＡＮ１２内の１つ以上のネットワークサーバ１８は、ＬＡＮ１２内の複数のネットワークサーバ１８に関する統計を収集し、その収集をＭＴＳデータベース２２に送信することができる。ＭＴＳデータベース２２は、例えば、オラクル（Ｏｒａｃｌｅ）またはＳＱＬデータベース、または他の任意の好適なデータベースとして実装されてもよい。 The MTS database 22 runs database management system (DBMS) software and optionally uses one or more database servers, including a suitable RAID disk array and / or any other suitable data storage mechanism. It may be implemented. Usage and hardware statistics may be sent to the MTS database 22 using CSV files and / or any other suitable data format from the network server 18. Each network server 18 may send its periodic statistics to the MTS database 22 and / or one or more network servers 18 in the LAN 12 collect statistics about a plurality of network servers 18 in the LAN 12. The collection can be sent to the MTS database 22. The MTS database 22 may be implemented as, for example, an Oracle or SQL database, or any other suitable database.

様々な実施形態では、企業の未来のコンピュータリソースのニーズを予測するために、リソース予測コンピュータシステム２０は、各関連変数が１つのクラスタのメンバーとなるように、かつ同じクラスタを有する変数間の依存性が高いが他のクラスタ内の変数と比較して非常に低いように、関連変数をクラスタへとグループ化する。プロセッサ２４は、変数グルーピングモジュール３０のソフトウェアを実行することによって、この機能を実行してもよい。クラスタが決定された後、リソース予測コンピュータシステム２０は、予測を計算するために様々な実施形態においてｋ－ＮＮ探索を使用する。プロセッサ２４は、ｋ－ＮＮ探索モジュール３２のソフトウェアを実行することによってこの機能を実行してもよい。 In various embodiments, in order to predict the future computer resource needs of an enterprise, the resource prediction computer system 20 is dependent on variables such that each related variable is a member of one cluster and has the same cluster. Group related variables into a cluster so that they are more likely but very low compared to the variables in other clusters. The processor 24 may perform this function by executing the software of the variable grouping module 30. After the cluster is determined, the resource prediction computer system 20 uses k-NN search in various embodiments to calculate the prediction. The processor 24 may perform this function by executing the software of the k-NN search module 32.

図２は、様々な実施形態によるＭＴＳデータベース２２とともにリソース予測コンピュータシステム２０によって実行されてもよい予測プロセスのプロセスフローである。ステップ５０において、ＭＴＳデータベース２２は、企業のネットワークサーバ１８に対する時系列使用量およびハードウェア統計変数を受信し、格納する。データは、５分ごと、１０分ごと、または任意の他の好適な時間増分などの周期的な時間増分に対して収集される。次にステップ５２において、リソース予測コンピュータシステム２０は、データベース２２に格納されているＭＴＳデータを前処理して、ノイズを除去または低減し、かつ／または従来のデータ前処理技法を使用してデータを標準正規分布データへとスケーリングすることができる。 FIG. 2 is a process flow of a prediction process that may be performed by the resource prediction computer system 20 along with the MTS database 22 according to various embodiments. In step 50, the MTS database 22 receives and stores time series usage and hardware statistical variables for the enterprise network server 18. Data is collected for periodic time increments, such as every 5 minutes, every 10 minutes, or any other suitable time increment. Then in step 52, the resource prediction computer system 20 preprocesses the MTS data stored in the database 22 to remove or reduce noise and / or use conventional data preprocessing techniques to preprocess the data. Can be scaled to standard normal distribution data.

高次元時系列を扱うとき、すべての次元に基づく全体的なｋ－ＮＮ推定は、無関係なメトリクス間の干渉のために大きな誤差を持つ可能性がある。しかしながら、測定されたコンピュータハードウェアメトリクスのいくつかは、互いに密接に関連しており、同様の挙動を共有しているので、予測が単変量時系列から単に抽出されると、関連情報は失われることになる。すべての関連変数から集約された知識を利用するために、ｋ－ｎｎ探索を実行する前、高次元ＭＴＳは、より小さいもののグループに分解され、ここでは同じグループ内の変数間の依存性は高いが、他のグループ内の変数との依存性は非常に低い。この動作は、図２のステップ５４において、変数グルーピングモジュール３０のソフトウェアを実行することによって、可変クラスタを生成するリソース予測コンピュータシステム２０のプロセッサ２４によって実行される。そのような分解はまた、モデル空間を大幅に狭めもするので、その後のｋ－ＮＮ探索を高速化する。様々な実装態様では、変数グルーピングは２つの段階、すなわち（ｉ）相関行列構築、および（ｉｉ）変数クラスタリングを含む。 When dealing with high-dimensional time series, the overall k-NN estimation based on all dimensions can have large errors due to interference between irrelevant metrics. However, some of the measured computer hardware metrics are closely related to each other and share similar behavior, so if the prediction is simply extracted from the univariate time series, the relevant information is lost. It will be. In order to utilize the knowledge aggregated from all related variables, before performing the k-nn search, the higher dimensional MTS is decomposed into smaller groups, where the dependencies between variables within the same group are high. However, the dependency with variables in other groups is very low. This operation is performed by the processor 24 of the resource prediction computer system 20 that creates a variable cluster by running the software of the variable grouping module 30 in step 54 of FIG. Such decomposition also significantly narrows the model space, thus accelerating subsequent k-NN searches. In various implementations, variable grouping involves two steps: (i) correlation matrix construction, and (ii) variable clustering.

ｄ個の変数を有し、かつ長さＴのＭＴＳが与えられた場合、２つの変数間の関連性を測定するために相関係数を使用することができる。一実施形態では、２つの変数間の関係の単調性のノンパラメトリックな尺度であるスピアマン（Ｓｐｅａｒｍａｎ）の順位相関係数が使用される。スピアマン相関は、両方のデータセットが正規分布していると仮定しない。サイズｎのサンプルの場合、未処理の変数Ｘ_ｉ、Ｙ_ｉは、ランクｘ_ｉ、ｙ_ｉに変換され、スピアマンの順位相関係数ρは、これらから次のように計算される。

式中、ｘ_ｉ、ｙ_ｉは、未処理の変数を指し、バー付きのｘ、ｙは、変数の平均を指す。 Given d variables and an MTS of length T, the correlation coefficient can be used to measure the association between the two variables. In one embodiment, Spearman's rank correlation coefficient, which is a monotonic nonparametric measure of the relationship between two variables, is used. Spearman's correlation does not assume that both datasets are normally distributed. For a sample of size n, the unprocessed variables X _i and Y _i are converted to ranks x _i and y _i , and Spearman's rank correlation coefficient ρ is calculated from these as follows.

In the equation, x _i and y _i refer to unprocessed variables, and x and y with bars refer to the average of the variables.

ある変数がある時間差後に他の変数に影響を与える可能性があることに留意し、したがって、変数ｉ、ｊの各対について、アルゴリズムは、０から予め定義された「ＭａｘＬａｇ」期間まで探索でき、ρ_ｉｊとなる最大係数を、以下のように選択することができる。

すべての変数の中で、対での相関係数は、ｎ×ｎ相関行列を形成するために計算される。以下の表１は、特定のネットワークサーバホストを用いた実験で構築された例示的な相関行列である。これは、より濃い灰色を有する対称行列が強い相関を示すことに留意されたい。

Keep in mind that one variable can affect other variables after a time lag, so for each pair of variables i, j, the algorithm can search from 0 to a predefined "MaxLag" period. The maximum coefficient of ρ _ij can be selected as follows.

Among all variables, the paired correlation coefficient is calculated to form an n × n correlation matrix. Table 1 below is an exemplary correlation matrix constructed in an experiment with a particular network server host. Note that this is strongly correlated with symmetric matrices with a darker gray color.

上記で構築した相関行列に基づいて、ｎ個の変数をクラスタに分解するために分割方法を適用することができる。様々な実施形態において、アフィニティプロパゲーション（ＡｆｆｉｎｉｔｙＰｒｏｐａｇａｔｉｏｎ）（ＡＰ）クラスタリングアルゴリズムがこのステップで使用される。ＡＰは、データポイント間の「メッセージパッシング」の概念に基づくクラスタリングアルゴリズムである。ｋ平均法（ｋ－ｍｅａｎｓ）およびｋ－ｍｅｄｏｉｄｓ法などの他の既知の分割方法とは異なり、ＡＰでは、アルゴリズムを実行する前にクラスタの数を決定または推定する必要はない。したがって、入力ハードウェアメトリクスが変化するにつれてグループ化の結果が変化する場合があるので、本実施形態により適している。ＡＰについてのさらなる詳細は、その全体が参照により本明細書に組み込まれる、ＢｒｅｎｄａｎＪ．Ｆｒｅｙｅｔａｌ．，“Ｃｌｕｓｔｅｒｉｎｇｂｙｐａｓｓｉｎｇｍｅｓｓａｇｅｓｂｅｔｗｅｅｎｄａｔａｐｏｉｎｔｓ，”Ｓｃｉｅｎｃｅ３１：９７２－９７６（２００７）に見出すことができる。 Based on the correlation matrix constructed above, a partitioning method can be applied to decompose n variables into clusters. In various embodiments, an Affinity Propagation (AP) clustering algorithm is used in this step. AP is a clustering algorithm based on the concept of "message passing" between data points. Unlike other known splitting methods such as k-means and k-means, APs do not need to determine or estimate the number of clusters before running the algorithm. Therefore, the grouping result may change as the input hardware metrics change, making it more suitable for this embodiment. Further details about the AP are incorporated herein by reference in their entirety, Brendan J. et al. Free et al. , "Clustering by passing messages beween data points," Science 31: 972-976 (2007).

ＡＰアルゴリズムは、収束するまでサンプルの対間でメッセージを送信することによってクラスタを作成する。ポイント間で送信されるメッセージには２つのカテゴリがある。一方は、サンプルｋがサンプルｉの見本となるべきであるという責任であり、ｒ（ｉ，ｋ）によって表される。もう一方は、サンプルｉがサンプルｋを見本として選択するべきであるという可用性であり、ａ（ｉ，ｋ）で表され：

となり、式中、ｓ（ｉ，ｋ）は、サンプルｉとｋとの間の類似度である。アルゴリズムは、収束するまで２つの行列の計算を実行する。表２は、表１の行列に基づくクラスタリング結果を示す。

この例で分かるように、各変数０～６は、グループ／クラスタのうちの１つに属する。 The AP algorithm creates a cluster by sending a message between sample pairs until it converges. There are two categories of messages sent between points. One is the responsibility that sample k should be a sample of sample i, represented by r (i, k). The other is the availability that sample i should be selected as a sample of sample k, represented by a (i, k):

In the equation, s (i, k) is the degree of similarity between samples i and k. The algorithm performs the calculation of the two matrices until it converges. Table 2 shows the clustering results based on the matrix of Table 1.

As can be seen in this example, each variable 0-6 belongs to one of the groups / clusters.

可変クラスタが決定された後、ステップ５６において、ｋ－ＮＮアルゴリズムが所望の予測をするために使用される。説明の目的のために、ｋ－ＮＮ探索アルゴリズムは、一変量の状況について、および一定幅の時間間隔で測定された直前の一定数の過去のデータに基づく１ステップ先の値についてのみ最初に図示される。ｔ＝（１，２，・・・，Ｔ）の場合、予想範囲ｈ＝（１，２，・・・，Ｈ）に対して、点Ｔから予測ｘ_Ｔ＋ｈ（ここでｘは⌒付き）行われる有限かつ等距離の時系列ｘ_ｔを考える。最初に、時系列の最近の状態の特徴を表すために、特徴ベクトルの集合が作成される。等しい長さｍのセグメントは、ｍ個の連続した観測値のシーケンスｘ^ｍ _ｔ（ここでｘは→付き）＝［ｘ_t，ｘ_ｔ－１，・・・，ｘ_{ｔ－（ｍ－１）}］^Ｔのベクトルシーケンスｘ^ｍ _ｔ（ここでｘは→付き）とみなされ、ここで、ｍはサンプリング長と称される予め定められた整数である。ベクトルは、すべてのｔ＝（ｍ，ｍ＋１，ｍ＋２，・・・，Ｔ－ｍ）、いわゆるｍ履歴と重複して作成されることに留意されたい。 After the variable cluster has been determined, in step 56 the k-NN algorithm is used to make the desired predictions. For purposes of illustration, the k-NN search algorithm is initially illustrated only for univariate situations and for values one step ahead based on a constant number of previous data measured at constant width time intervals. Will be done. In the case of t = (1, 2, ..., T), the predicted range h = (1, 2, ..., H) is predicted from the point T x _{T + h} (where x is ⌒). Consider a finite and equidistant time series x _t . First, a set of feature vectors is created to represent the features of the recent state of the time series. A segment of equal length m is a sequence of m consecutive observations x ^m _t (where x is with →) = [x _t , x _t-1 , ..., x _{t- (m-1).} ] ^T is considered as a vector sequence x ^m _t (where x is with →), where m is a predetermined integer called the sampling length. Note that the vector overlaps with all t = (m, m + 1, m + 2, ..., Tm), the so-called m history.

次に、過去における行動の類似パターンを識別するために、様々な実施形態において、ユークリッド距離メトリクスなどの所与の距離メトリクスを使用して、すべてのｍ履歴と最後に観測されたベクトルｘ^ｍ _ｔ（ここでｘは→付き）＝［ｘ_Ｔ，ｘ_Ｔ－１，・・・，ｘ_{Ｔ－（ｍ－１）}］^Ｔとの間の距離がそれぞれ計算される。

Then, in various embodiments, a given distance metric, such as the Euclidean distance metric, is used to identify similar patterns of behavior in the past, using all m histories and the last observed vector _x ^mt . (Here, x is with →) = [x _T , x _T-1 , ..., x _{T- (m-1)} ] The distances to ^T are calculated respectively.

最後に、すべてのｍ個の履歴に対する対での距離がランク付けされ、ターゲット特徴ベクトルまでの最小距離を有するｋ個のベクトルのその後の観測値がそれぞれ選択される。ｋ個の最近傍の各々を局所的に重み付けすることができ、その後予測ｘ_Ｔ＋ｈ（ここでｘは⌒付き）を形成するために集約される。

式中、ｎｅｉｇｈ_ｊ，ｈは、近傍のｊの範囲ｈに続く値であり、ｗ_ｊは、重みを表す。図３は、一変量ｋ－ＮＮ手法のための予測方法を図示するグラフである。基準期間（３４１～３８１日目）に最も近い２つの近傍（１７１～２２１日目と１～５１日目）を示す。 Finally, the paired distances to all m histories are ranked and subsequent observations of the k vectors with the minimum distance to the target feature vector are selected respectively. Each of the k nearest neighbors can be locally weighted and then aggregated to form the prediction x _{T + h} (where x is ⌒).

In the equation, negh _{j and h} are values following the range h of nearby j, and w _j represents a weight. FIG. 3 is a graph illustrating a prediction method for the univariate k-NN method. The two neighborhoods (days 171 to 221 and days 1 to 51) closest to the reference period (days 341 to 381) are shown.

多変量の場合、単変量の場合の参照（例えば、最新の）特徴ベクトルおよびｍ履歴は、多次元に拡張される。Ｘ^ｄをｄ次元の多変量時系列（ＭＴＳ）とする。時間ＴにおけるＭＴＳの未来を予測するためには、目標特徴行列は、

であり、式中ｘ^ｍ _Ｔ，ｉ（ここでｘは→付き）は、次元ｉで時間Ｔにおいて最後に観測されたベクトルを表す。同様に、距離メトリクスは、距離メトリクスの合計として定義することができる（例えば、すべてのｄ次元におけるユークリッド距離

ｍ履歴にわたって探索することによって、ＭＴＳの現在の（または他の参照）状態に近いグローバルなｋ個の最近傍を見つけることができ、それらの各々は、ｍ個の連続した観測のｄ個のベクトルによって構成される。ｋ個の最近傍の範囲ｈを有する後続の値は、参照ｋ×ｄ行列として表すことができる。

各次元に対する予測値は、対応する次元での参照項目を集計することによって生成される。

ここで、各次元の近傍は、結果を形成するためにそれぞれ重み付けされる。一実施形態では、重みは、勾配降下アルゴリズムによって得られ、新しいデータと一致するように徐々に調整される。予測は、時間範囲の累積ユーザリソース要求とそれらの要求に対処するために必要なサーバリソースを予想することができる。したがって、予測は、時間範囲ステップに対して予測されたサーバワークロードをもたらす。拡張されたワークロード予想を得るために、このプロセスを追加の時間範囲ステップに対して繰り返すことができる。 For multivariates, the reference (eg, up-to-date) feature vector and m-history for univariates are extended in multiple dimensions. Let X ^d be a d-dimensional multivariate time series (MTS). To predict the future of MTS at time T, the target feature matrix is

In the equation, ^xm _{T, i} (where x is with →) represents the last observed vector in time T in dimension i. Similarly, distance metrics can be defined as the sum of distance metrics (eg, Euclidean distance in all d dimensions).

By searching over the m history, we can find the global k nearest neighbors that are close to the current (or other reference) state of the MTS, each of which is a d vector of m consecutive observations. Consists of. Subsequent values with k nearest neighbor ranges h can be represented as a reference k × d matrix.

Predicted values for each dimension are generated by aggregating the reference items in the corresponding dimension.

Here, the neighborhood of each dimension is each weighted to form a result. In one embodiment, the weights are obtained by a gradient descent algorithm and are gradually adjusted to match the new data. Prediction can predict cumulative user resource requests over a time range and the server resources needed to handle those requests. Therefore, the prediction yields the predicted server workload for the time range step. This process can be repeated for additional time range steps to obtain enhanced workload expectations.

図２のステップ５８において、リソース予測コンピュータシステム２０は、未来の時間範囲ステップにおけるサーバの数を推奨することができる。このようなサーバ数の予測は、多数の有益な目的のために使用することができる。そのような目的の１つは、企業が予想された数のサーバに基づいてそのサーバ１８をプロビジョニングすることを可能にすることである。以下のメトリクスを含むいくつかの計算されたメトリクスがそのような能力計画において使用されてもよい。
・ＣＰＵ／メモリ／ディスクの過負荷率および不足負荷率。これらのメトリクスの計算は、ＣＰＵ、メモリ、およびディスクの過負荷値および不足負荷値（それぞれの過負荷閾値および不足負荷閾値を上回るまたは下回る）を平均することによって実行することができる。閾値は、例えば、過負荷の場合は９０％、不足負荷の場合は１０％とすることができる。
・ＣＰＵ／メモリ／ディスクの過負荷時間率および不足負荷時間率。これらのメトリクスは、全体の期間と比較した、ＣＰＵ／メモリ／ディスクが過負荷になっている（過負荷閾値を上回った）または不足負荷となっている（不足負荷閾値を下回った）時間の長さの比率に基づいて計算することができる。
・全体的な過負荷時間率および不足負荷時間率。この計算では、サーバのためのＣＰＵ／メモリ／ディスクメトリクスのいずれかが過負荷または不足負荷の場合、場合によっては、サーバが過負荷または不足負荷になっているとみなされる。
これらのメトリクスは、上述のｋ－ＮＮ予測されたＣＰＵ／メモリ／ディスクの負荷に基づいて、リソース予測コンピュータシステム２０によって、未来の時間ステップのために予測することができる（ステップ５６参照）。 In step 58 of FIG. 2, the resource prediction computer system 20 can recommend the number of servers in a future time range step. Such server number predictions can be used for a number of useful purposes. One such purpose is to allow an enterprise to provision its servers 18 based on an expected number of servers. Several calculated metrics, including the following metrics, may be used in such capacity planning.
-CPU / memory / disk overload factor and underload factor. Calculations of these metrics can be performed by averaging CPU, memory, and disk overload and underload values (above or below the respective overload and underload thresholds, respectively). The threshold value can be, for example, 90% in the case of overload and 10% in the case of underload.
-CPU / memory / disk overload time rate and underload time rate. These metrics are the length of time that the CPU / memory / disk is overloaded (above the overload threshold) or underloaded (below the underload threshold) compared to the overall period. It can be calculated based on the ratio of the values.
-Overall overload time rate and underload time rate. In this calculation, if any of the CPU / memory / disk metrics for the server is overloaded or underloaded, then the server is considered to be overloaded or underloaded in some cases.
These metrics can be predicted by the resource prediction computer system 20 for future time steps based on the k-NN predicted CPU / memory / disk load described above (see step 56).

これらの予想された過負荷メトリクス／不足負荷メトリクスは、推奨されたサーバ１８の数（図１参照）など、企業コンピュータシステムのためのハードウェアリソースを推奨するための様々な推奨方法論に対する入力パラメータとして使用することができる。３つのそのような推奨方法論は、いわゆる「容量ベースのサーバ数推奨」（Ｃ－ＳＮＲ）、「使用量ベースのサーバ数推奨」（Ｕ－ＳＮＲ）、および「適応型サーバ数推奨」（Ａ－ＳＮＲ）である。Ｃ－ＳＮＲでは、サーバによって処理することができる要求の数が予め定められている。したがって、要求の総数が予測されると、サーバ数を簡単に導出することができる。Ｕ－ＳＮＲでは、ユーザがサーバ要求を実行する最も可能性の高い時間は、ユーザの使用経過および分類に基づいて既知である。要求に対するユーザの平均負荷値もユーザの分類に基づいて認識されるため、ユーザの要求によってシステムにもたらされると予想される負荷を予測することができる。サーバ容量を最大限に活用しようとする割り当てアルゴリズムを適用することによって、サーバ数を決定することができる。Ａ－ＳＮＲでは、比率（実際には閾値）を使用して、新しいユーザセッションを送信するサーバおよび新しいサーバを追加するべきかどうかを決定する。新しい要求が出されたとき、最初の試みは、現在稼働中のサーバがその要求に対処することである。要求は、分類結果から推定される潜在的な負荷レベルに基づいて現在のサーバに割り当てられる。過負荷状態のために新しい要求を現在アクティブなサーバに割り当てることができない場合、要求に対処するために新しいサーバが追加される。同様に、要求が完了すると、残りのワークロードをサーバ１８間で再均衡化することができる。 These expected overload / underload metrics are input parameters for various recommended methodologies for recommending hardware resources for enterprise computer systems, such as the recommended number of servers 18 (see Figure 1). Can be used. Three such recommended methodologies are the so-called "capacity-based server number recommendation" (C-SNR), "usage-based server number recommendation" (U-SNR), and "adaptive server number recommendation" (A-). SNR). In C-SNR, the number of requests that can be processed by the server is predetermined. Therefore, once the total number of requests is predicted, the number of servers can be easily derived. In the U-SNR, the most likely time for a user to execute a server request is known based on the user's usage history and classification. Since the average load value of the user for the request is also recognized based on the classification of the user, the load expected to be brought to the system by the user's request can be predicted. The number of servers can be determined by applying an allocation algorithm that seeks to maximize server capacity. The A-SNR uses the ratio (actually the threshold) to determine whether to add a server to send a new user session and a new server. When a new request is made, the first attempt is for the currently running server to handle the request. Requests are assigned to the current server based on the potential load level estimated from the classification results. If a new request cannot be assigned to the currently active server due to an overload condition, a new server is added to handle the request. Similarly, once the request is complete, the remaining workload can be rebalanced across the servers 18.

このメトリクスのうちのどれが使用されようと、または他のメトリクスが使用されようと、リソース予測コンピュータシステム２０は、予想されたユーザのワークロードに基づいて、特定の期間について企業にとって適切な数のサーバ１８を予想することができる。リソース予測コンピュータシステム２０は、これらのサーバ数推奨をネットワークサーバ１８に対するブローカ（図１参照）として機能するネットワークサーバ４０に通信することができる。リソース予測コンピュータシステム２０から伝達された決定に基づいて、このブローカサーバ（複数可）４０は、ＬＡＮ１２内のいくつのサーバ１８を、様々な時点で起動（フルアクティブ、動作モード）および、または低電力モード（例えば、アイドルモードまたはスリープモード）にするべきかを決定することができ、それに応じてサーバ１８に命令することができる。このようにして、ネットワークサーバ１８のうちのいくつかは、予想された使用法および／または負荷パターンに基づいて、それらが必要とされ予想されていない期間中に低電力モードにすることができる。例えば、図１を参照して、リソース予測コンピュータシステム２０は、特定の期間に必要とされる予想数のネットワークサーバ１８を決定することができる。予想は、１つのＬＡＮ１２内のネットワークサーバ１８または複数のＬＡＮ１２にわたるネットワークサーバ１８に対するものとすることができる。リソース予測コンピュータシステム２０は、この数をブローカサーバ４０に送信することができ、これは、そのＬＡＮ１２および／または他のＬＡＮ１２内の様々なサーバ１８に、適切なときに動作可能な高電力モード（ユーザのリソース要求を処理することができる）にあるか、または低電力モード（ユーザのリソース要求を処理できない）にあるように命令することができる。ネットワークサーバ１８は、それに応じて命令された電力モードを仮定することができる。このようにして、予想されたネットワークリソースのニーズが低いときには、エネルギーおよび関連するコストを節約するために、ある数のネットワークサーバ１８を低電力モードにすることができる。逆に、予想されたネットワークリソースのニーズが高いときには、十分な数のネットワークサーバ１８が予想されるユーザ要求負荷に対処するように準備されることになる。 Regardless of which of these metrics is used, or any other metric, the Resource Prediction Computer System 20 is an appropriate number for the enterprise for a particular time period based on the expected user workload. You can anticipate the server 18. The resource prediction computer system 20 can communicate these server number recommendations to the network server 40, which acts as a broker (see FIG. 1) for the network server 18. Resource Prediction Based on the decisions communicated from the computer system 20, the broker server (s) 40 may start several servers 18 in the LAN 12 at various points in time (fully active, operating mode) and / or low power. It is possible to determine whether to enter a mode (eg, idle mode or sleep mode) and instruct the server 18 accordingly. In this way, some of the network servers 18 can be put into low power mode during periods when they are needed and not expected, based on expected usage and / or load patterns. For example, with reference to FIG. 1, the resource prediction computer system 20 can determine the expected number of network servers 18 required for a particular time period. The conjecture can be for a network server 18 within one LAN 12 or a network server 18 across multiple LANs 12. The resource prediction computer system 20 can send this number to the broker server 40, which is a high power mode capable of operating at the appropriate time to the LAN 12 and / or various servers 18 in the other LAN 12. It can be instructed to be in (can handle the user's resource request) or in low power mode (cannot handle the user's resource request). The network server 18 can assume the power mode instructed accordingly. In this way, a certain number of network servers 18 can be put into low power mode in order to save energy and associated costs when the expected network resource needs are low. Conversely, when the expected network resource needs are high, a sufficient number of network servers 18 will be prepared to handle the expected user-requested load.

様々な実施形態において、プロセッサ２４は（ｋ－ＮＮ探索モジュールソフトウェア３２を実行するとき）、ｋ－ＮＮ探索の前にｋ－ＮＮ探索のためのｋおよびｍパラメータを選択する。モジュール３２は、異なるデータセットに対する予想誤差を最小にするために、ｋおよびｍパラメータを調整するために高クライミングアルゴリズムを使用してもよい。問題に対する恣意的な解（この場合は、ｋおよびｍパラメータ）から始まる高クライミングアルゴリズムの反復アルゴリズムは、次いで、解の単一要素を漸増的に変更することによって、より良好な解を見つけようとする。その変更がより良好な解を生み出す場合、それ以上の改善を見出すことができなくなるまで繰り返される新しい解への漸増変更が行われる。他の実施形態では、ｋおよびｍパラメータは、再確立される。 In various embodiments, the processor 24 (when running the k-NN search module software 32) selects the k and m parameters for the k-NN search prior to the k-NN search. Module 32 may use a high climbing algorithm to adjust the k and m parameters to minimize prediction errors for different data sets. An iterative algorithm of a high climbing algorithm starting with an arbitrary solution to the problem (in this case, the k and m parameters) then seeks to find a better solution by incrementally modifying a single element of the solution. do. If the change produces a better solution, then incremental changes to the new solution are made that are repeated until no further improvement can be found. In other embodiments, the k and m parameters are reestablished.

Ｋ最近傍を有する予測間の関係は、時間とともに変化する可能性がある。したがって、重みは、好ましくは、時間とともに調整される。Ｔｒａｉｎ_ｔ＝［ｘ_ｔ－１，ｘ_ｔ－２，・・・，ｘ_ｔ－ｎ］は、時間ｔにおけるサイズｎの学習セットを表すと仮定する。ｘ_ｔは、時間ｔにおける真の値であり、ｙ_ｔは、時間ｔに対する予測値である。以前に取得した重みに基づいて、ｙ_ｔをＴｒａｉｎ_ｔに基づいて予測することができる。ｘ_ｔの真の値が分かった後、Ｔｒａｉｎ_ｔ＋１＝［ｘ_ｔ，ｘ_ｔ－１，・・・，ｘ_{ｔ－ｎ＋１}］に基づいて重みを更新することができ、この新しい重みは、ｙ_ｔ＋１の予測に使用される。 The relationship between predictions with K-nearest neighbors can change over time. Therefore, the weights are preferably adjusted over time. It is assumed that Train _t = [x _t-1 , x _t-2 , ..., X _tn ] represents a learning set of size n at time t. x _t is the true value at time t and y _t is the predicted value for time t. Based on the previously obtained weights, y _t can be predicted based on Train _t . After knowing the true value of x _t , the weight can be updated based on Train _{t + 1} = [x _t , x _t-1 , ..., x _{tn + 1} ], and this new weight is y _{t + 1} Used for prediction of.

別の一般的な問題は、企業で必要なコンピューティングリソースが動的に変化することである。企業が最高のワークロードに対応するサーバ数を維持している場合、ワークロードが低い時間の間、そのような多数のサーバを必要としないため、これは大きな無駄になる。ユーザの要求を満たすのに十分なサーバを維持するだけでなく、可能な限りエネルギーを節約するという目標を達成するためには、ワークロードを予測する必要がある。したがって、本発明の様々な実施形態は、予測およびストライプサンプリング手法に基づくワークロード生成方法を提供する。単位時間内の未来の要求数を予測し、要求の特徴を分析し、それに応じて要求を分類し、スライスサンプリング戦略を決定することによって、未来の詳細ワークロードデータを生成することができる。未来のワークロードデータは、サーバリソースを動的に調整するためのベースを提供することができる。 Another common problem is the dynamic change in computing resources required by an enterprise. This is a big waste if the enterprise maintains the number of servers for the highest workload, as the workload does not need such a large number of servers during low hours. In addition to maintaining enough servers to meet user demands, workloads need to be predicted to achieve the goal of saving as much energy as possible. Therefore, various embodiments of the present invention provide workload generation methods based on prediction and stripe sampling techniques. Future detailed workload data can be generated by predicting the number of future requirements in a unit time, analyzing the characteristics of the requirements, classifying the requirements accordingly, and determining the slice sampling strategy. Future workload data can provide a basis for dynamically adjusting server resources.

一般的な一態様では、予測およびストライプサンプリング手法をステップへと分割することができる。第１に、時系列予測モデルを適用して単位時間内の要求数を予測する。ｋ最近傍（ＫＮＮ）アルゴリズムに基づいて、要求数時系列データ内の現在のサブシーケンスに類似するＫ個のサブシーケンスが探索される。次いで、線形回帰モデルを適用して、Ｋ個の最近傍シーケンスによって与えられる予測を最終予測結果に合成することができる。 In one general aspect, the prediction and stripe sampling techniques can be divided into steps. First, a time series prediction model is applied to predict the number of requests in a unit time. Based on the k-nearest neighbor (KNN) algorithm, K subsequences similar to the current subsequence in the requested number time series data are searched. A linear regression model can then be applied to synthesize the predictions given by the K nearest neighbor sequences into the final prediction result.

第２に、要求の特徴を分析する。各要求は、異なるリソース（ＣＰＵ負荷、メモリなど）を消費する。さらに、各要求の長さを予測する必要がある。様々な実施形態に従って、分析を３つのタイプへと分割することができる。
ａ．相関分析。要求の異なる属性に対して相関分析を実行することができる。２つの属性が相関している場合、それらは互いに依存するが、一方で相関していない２つの属性は互いに影響しない。
ｂ．分類要求。関連性の高い属性で要求を分類することによって、要求を、内部記憶コストの低い短期間の要求、内部記憶コストの低い長期間の要求、内部記憶コストの高い長期間の要求などの、異なるタイプへと分割することができる。
ｃ．周期分析。周期的な要素によってもたらされるマイナスの影響は、取り除かれるべきである。要求は、勤務時間と非勤務時間との間に著しい違いを示している。要求をサンプリングすることが望まれる場合、同様の周期的時間内の要求がサンプリングのために選択されるべきである。 Second, we analyze the characteristics of the requirements. Each request consumes different resources (CPU load, memory, etc.). In addition, it is necessary to predict the length of each request. Analysis can be divided into three types according to various embodiments.
a. Correlation analysis. Correlation analysis can be performed on attributes with different requirements. When two attributes are correlated, they depend on each other, while two uncorrelated attributes do not affect each other.
b. Classification request. By classifying requests by relevant attributes, you can classify requests into different types, such as short-term requirements with low internal storage costs, long-term requirements with low internal storage costs, and long-term requirements with high internal storage costs. Can be divided into.
c. Periodic analysis. The negative effects of periodic factors should be eliminated. The requirements show a significant difference between working and non-working hours. If it is desired to sample the request, a request within a similar periodic time should be selected for sampling.

第３に、スライスベースのサンプリングおよび期間ベースのサンプリングの組み合わせを用いて、各カテゴリの割合に従って過去のデータから要求が選択され、これらの要求データが最終的な予測にまとめられる。 Third, using a combination of slice-based sampling and period-based sampling, requirements are selected from historical data according to the proportion of each category, and these request data are combined into a final forecast.

このプロセスには、ワークロードが複数のメトリクスに基づいて測定されるという利点があり、これにより、単一のメトリクスの制限を克服する。別の潜在的な利点は、未来の要求数および様々なメトリクスに対する各要求の影響を考慮しながら、各要求によってもたらされるワークロードをモデル化することによってより正確な予測を行うことである。そのようにして、この方法は、全体的なワークロード予想手法の制限を克服する。 This process has the advantage that the workload is measured based on multiple metrics, thereby overcoming the limitations of a single metric. Another potential advantage is to make more accurate predictions by modeling the workload brought about by each requirement, taking into account the number of future requirements and the impact of each requirement on various metrics. As such, this method overcomes the limitations of the overall workload prediction method.

図４は、リソース予測コンピュータシステム２０の図であり、図５は、本発明の様々な実施形態による将来のワークロードトレースを生成するために、この予測およびストライプサンプリング戦略を実施するためにリソース予測コンピュータシステム２０によって実行されるプロセスフローの図である。図３に示すように、そのような実施形態によれば、リソース予測コンピュータシステム２０は、未来のワークロードトレースを生成するための予測およびストライプサンプリングモジュール６０を含んでもよい。予測およびストライプサンプリングモジュール６０は、メモリ（複数可）２６に格納され、かつリソース予測コンピュータシステム２０のプロセッサ（複数可）２４によって実行されるソフトウェアコードとして実装されてもよい。図５の例示的なプロセスフローは、予測およびストライプサンプリングモジュール６０のコードが実行されるときに、リソース予測コンピュータシステム２０によって実施することができる。さらに、データベース２２は、ネットワークサーバのユーザによる過去の要求に関するデータを格納することができる。 FIG. 4 is a diagram of a resource prediction computer system 20, and FIG. 5 is a resource prediction to implement this prediction and stripe sampling strategy to generate future workload traces according to various embodiments of the present invention. It is a figure of the process flow executed by the computer system 20. As shown in FIG. 3, according to such an embodiment, the resource prediction computer system 20 may include a prediction and stripe sampling module 60 for generating future workload traces. The prediction and stripe sampling module 60 may be implemented as software code stored in the memory (s) 26 and executed by the processor (s) 24 of the resource prediction computer system 20. The exemplary process flow of FIG. 5 can be performed by the resource prediction computer system 20 when the code of the prediction and stripe sampling module 60 is executed. Further, the database 22 can store data regarding past requests by users of the network server.

ステップ６２において、単位時間当たりの要求数が予測される。単位時間ごとにユーザによっていくつの要求が送信されるかを予測する問題は、時系列予測問題としてモデル化することができる。したがって、Ｋ個の最近傍分類アルゴリズムを使用して、データベース２２内に格納されている過去のデータから現在のサブシーケンスと同様のサブシーケンスを見つけ出すことができる。次いで、これらの類似のサブシーケンスに基づいて得られた予測を一緒に組み合わせて、最終予測結果を生成することができる。時間間隔が単位時間Ｔであると仮定すると、各単位時間内に発生する要求数は、Ｎ＝［ｎ_１，ｎ_２，・・・，ｎ_ｐ］として表すことができ、式中、ｎ_１は、ｉ番目の単位時間内に発生した要求数であり、ｐは、次のように表すことができる。

要求数が現時点のそれらと類似している場合を見つけるために、最新のｍ個の観測値からなるサブシーケンスを描画することができる。

もう一方の観測は、次のように表すことができる。

In step 62, the number of requests per unit time is predicted. The problem of predicting how many requests will be sent by the user per unit time can be modeled as a time series prediction problem. Therefore, using K nearest neighbor classification algorithms, it is possible to find a subsequence similar to the current subsequence from the past data stored in the database 22. The predictions obtained based on these similar subsequences can then be combined together to produce the final prediction result. Assuming that the time interval is the unit time T, the number of requests generated in each unit time can be expressed as N = [n ₁ , n ₂ , ..., n _p ], and in the equation, n ₁ Is the number of requests generated within the i-th unit time, and p can be expressed as follows.

In order to find out when the number of requests is similar to those at the present time, a subsequence consisting of the latest m observations can be drawn.

The other observation can be expressed as follows.

複数のサブシーケンスＮ_{ｉ，ｉ＋ｍ－１}＝［ｎ_ｉ，・・・，ｎ_{ｉ＋ｍ－１}］は、Ｎ_ｎｏｗに最も類似する過去のデータから見出すことができる。過去のサブシーケンスと現在のサブシーケンスとの間の類似性を測定するための評価基準を使用することができる。Ｄｉｓ（Ｘ，Ｙ）を２つの時系列間の類似性を測定するための距離関数とする。距離関数ユークリッド距離を使用することができ、これは以下の通りである。

A plurality of subsequences _{Ni, i + m-1} = [ni _i , ..., _{ni + m-1} ] can be found from the past data most similar to _Now . Evaluation criteria can be used to measure the similarity between past subsequences and current subsequences. Let Dis (X, Y) be a distance function for measuring the similarity between two time series. The distance function Euclidean distance can be used, which is as follows.

各過去のサブシーケンスと現在のサブシーケンスとの間の距離を計算することができる。過去のシーケンスは、サブシーケンスセット｛Ｎ_{１，１＋ｍ－１}，Ｎ_{２，２＋ｍ－１}，・・・，Ｎ_{ｐ－２＊ｍ＋１，ｐ－ｍ}｝へと分割することができる。次いで、範囲０＜ｉ≦ｐ－２＊ｍ＋１でｉを見つけるために距離を計算することができ、それは、Ｄｉｓ（Ｎ_{ｉ，ｉ＋ｍ－１}，Ｎ_ｎｏｗ）の値が最小になるようにする。次いで、現在のものに最も類似したサブシーケンスＮ_{ｉ，ｉ＋ｍ－１}を決定することができる。 The distance between each past subsequence and the current subsequence can be calculated. Past sequences can be divided into subsequence sets {N _{1,1 + m-1} , N _{2,2 + m-1} , ..., N _{p-2 * m + 1, pm} }. The distance can then be calculated to find i in the range 0 <i≤p-2 * m + 1, which ensures that the value of Dis (N _{i, i + m-1} , _Now ) is minimized. The subsequence _{Ni, i + m-1} most similar to the current one can then be determined.

セット内のすべての過去のサブシーケンスについて、距離ｄ_ｉ＝Ｄｉｓ（Ｎ_ｉ，_{ｉ＋ｍ－１}，Ｎ_ｎｏｗ）を計算して距離セットＤ＝｛ｄ_１，ｄ_２，・・・・・・，ｄ_{ｐ－２＊ｍ＋１}｝を得ることができる。Ｄ内のすべての要素がソートされた後、最近接距離［ｄ_ａ，ｄ_ｂ，・・・・・・，ｄ_ｋ］を決定することができ、距離［ａ，ｂ，・・・，・・・，ｋ］のインデックスは、最も類似したサブシーケンスのインデックスである。 For all past subsequences in the set, the distance di = Dis (N _i , _i _{+ m-1} , _Now ) is calculated and the distance set D = {d ₁ , d ₂ , ..., d. _{p-2 * m + 1} } can be obtained. After all the elements in D have been sorted, the closest distance [d _a , db, ..., d _k ] can be determined, and the distance [a, _b , ..., D k] can be determined. The index of [・, k] is the index of the most similar subsequence.

これらＫ個の最近傍シーケンスの直後のサブシーケンスは、最終予測を生成するために結合されるのが好ましい。２つの要因を考慮することが好ましい。第１に、類似性は、モデルで考慮されるべきであり、概して、類似性が高いものほどより大きい重みを有するべきである。第２に、サブシーケンスの出現時間、概してより最近の出現時間はより大きい重みを有するべきである。したがって、様々な実施形態では、ｋ個のサブシーケンスは、現在のサブシーケンスとの類似性および発生時間に従ってソートされる。次いで、線形回帰手法を適用して、ｋ番目の予測に対する重みを学習することができる。 The subsequences immediately following these K nearest neighbor sequences are preferably combined to generate a final prediction. It is preferable to consider two factors. First, similarity should be considered in the model, and in general, the higher the similarity, the greater the weight. Second, subsequence appearance times, generally more recent appearance times, should have greater weight. Therefore, in various embodiments, the k subsequences are sorted according to their similarity to the current subsequence and the time of occurrence. You can then apply a linear regression technique to learn the weights for the kth prediction.

次に、ステップ６４において、要求特徴または属性が分析される。属性は、上述のように、要求の持続時間、ディスクＩＯ、ＣＰＵおよび／またはメモリ使用率などの要求データを含むことができる。このステップ６４は、様々な実施形態による３つのサブステップ、ステップ６６における訂正分析、ステップ６８における要求分類、ステップ７０における周期性分析を含むことができる。訂正分析ステップ６６において、各要求は概して、異なるワークロードを様々なコンピュータリソースにもたらす。したがって、ワークロードを特徴付けるために、複数の特徴を使用するべきである。１つの問題は、これらのワークロード特徴に相関があるかどうかである。例えば、ＣＰＵの消費量が少なくなると、ディスク入力／出力の消費量も減少するという主張が、あるアプリケーションに当てはまるかどうかが分かるはずである。 Next, in step 64, the required feature or attribute is analyzed. Attributes can include request data such as request duration, disk IO, CPU and / or memory usage, as described above. This step 64 can include three sub-steps according to various embodiments, a correction analysis in step 66, a requirement classification in step 68, and a periodic analysis in step 70. In correction analysis step 66, each requirement generally results in different workloads for different computer resources. Therefore, multiple features should be used to characterize the workload. One question is whether these workload features are correlated. For example, you should know if the claim that as CPU consumption decreases, so does disk input / output consumption applies to an application.

様々な実施形態に従って、ピアソン相関係数をこのステップで使用することができる。要求ワークロードが２つの特定の特徴を有していると仮定すると、これは２つの時系列Ｘ＝［ｘ_１，ｘ_２，・・・，ｘ_Ｎ］およびＹ＝［ｙ_１，ｙ_２，・・・，ｙ_Ｎ］によってさらに表すことができ、式中、ｉ番目の要求の値は（ｘ_ｉ，ｙ_ｉ）として表すことができ、これら２つの特徴間の相関関係は、次のように表すことができる。

この相関係数の絶対値が大きいほど、相関度は高い。値が０に近い場合、２つの特徴は、ほぼ独立していることを示す。 According to various embodiments, the Pearson correlation coefficient can be used in this step. Assuming that the request workload has two specific features, this is two time series X = [x ₁ , x ₂ , ..., X _N ] and Y = [y ₁ , y ₂ , ,. ..., y _N ] can be further expressed, and in the equation, the value of the i-th request can be expressed as (x _i , y _i ), and the correlation between these two features is as follows. Can be expressed in.

The larger the absolute value of this correlation coefficient, the higher the degree of correlation. When the value is close to 0, the two features are almost independent.

要求分類ステップ６８に関しては、特徴数に対する特徴値の分布を観察することによって、いくつかの特徴が複数のピークを有することがあり、これは特徴値を分類することができることを示す。要求をクラスの比率に従って過去からサンプリングすることができるように、異なるタイプへと分類することができる。分類プロセスの間、分類は、相関のある特徴についてなされるべきである。このステップでは、古典的なｋ平均法クラスタリングアルゴリズムに基づく二等分Ｋ平均法を、局所最適解に陥ることを回避するために使用することができる。このクラスタリングアルゴリズムは、３つのステップに分割することができる。第１のステップは、クラスタリング中心を見つけることであり、第２のステップは、各点とクラスタ中心との間の距離を計算し、各点を最も近いクラスタに入れることであり、第３のステップは、すべてのクラスタ座標の平均値を新しいクラスタ中心として計算することである。二等分Ｋ平均クラスタリング法は、最初にすべての点を１つのクラスタの中へと入れ、次いで１つのクラスタを２つに分割し、最後に最大誤差を低減させることができるクラスタを２つのクラスタに分割することを選択することによってさらに強化することができる。このプロセスは、クラスタの数がユーザによって与えられたＫの数に等しくなるまで繰り返すことができる。 Regarding the requirement classification step 68, by observing the distribution of the feature values with respect to the number of features, some features may have a plurality of peaks, which indicates that the feature values can be classified. Requests can be categorized into different types so that they can be sampled from the past according to the proportion of the class. During the classification process, classification should be made for correlated features. In this step, a bisected K-means clustering algorithm based on the classical k-means clustering algorithm can be used to avoid falling into a locally optimal solution. This clustering algorithm can be divided into three steps. The first step is to find the clustering center, the second step is to calculate the distance between each point and the cluster center and put each point in the nearest cluster, the third step. Is to calculate the average of all cluster coordinates as the new cluster center. The bisected K-means clustering method first puts all the points into one cluster, then divides one cluster into two, and finally two clusters that can reduce the maximum error. It can be further enhanced by choosing to divide into. This process can be repeated until the number of clusters is equal to the number of Ks given by the user.

循環的要因は、予測における重要な要因である。サーバへの要求は、人々から行われるため、勤務時間と勤務時間外の要求数が異なることは明らかである。したがって、循環的要因の影響は、周期性分析ステップ７０において考慮されるべきである。一実施形態では、高速フーリエ変換（ＦＦＴ）を使用して、要求特徴シーケンスのサイクル長を計算することができる。時間間隔ｔにおける特性ｘ_ｉの平均値を得ることができる。次いで、サンプリングレートｆ_ｓ＝１／ｔで、時系列ｆ（ｎ）、式中ｎ＝０，１，２，・・・，Ｎ－１、を決定することができる。 Cyclic factors are important factors in forecasting. Since the requests to the server are made by people, it is clear that the number of requests during and after working hours is different. Therefore, the effects of cyclical factors should be considered in periodic analysis step 70. In one embodiment, the Fast Fourier Transform (FFT) can be used to calculate the cycle length of the required feature sequence. The average value of the characteristic x _i at the time interval t can be obtained. Next, at a sampling rate f _s = 1 / t, the time series f (n) and n = 0, 1, 2, ..., N-1 in the equation can be determined.

ｆ（ｎ）の離散フーリエ変換（ＤＦＴ）をＦ（ｋ）と仮定して、

を得ることができ、式中、

ｋ＝０，１，２，・・・，Ｎ－１である。ＤＦＴを得た後、各ｋは、次のように表すことができる、離散周波数値に対応する。

シーケンスが周期的である場合、フーリエ変換関数のスペクトルは、周期の逆数でヒットし、衝撃値によってｆ（ｎ）の周期を得ることができる。単一の特徴によって表される要求負荷については、対応するサイクル時点から要求をサンプリングすることだけが必要とされる。複数のカテゴリへと分類されている要求については、あらゆる時間単位における各カテゴリの比率を予測することができ、これはステップ６２で導入されたＫ－ＮＮ手法を採用することによって行うことができる。 Assuming that the discrete Fourier transform (DFT) of f (n) is F (k),

Can be obtained during the ceremony,

k = 0, 1, 2, ..., N-1. After obtaining the DFT, each k corresponds to a discrete frequency value, which can be expressed as:

When the sequence is periodic, the spectrum of the Fourier transform function hits with the reciprocal of the period, and the impact value can give the period of f (n). For demand loads represented by a single feature, it is only necessary to sample the demand from the corresponding cycle point in time. For requests classified into multiple categories, the ratio of each category in any time unit can be predicted, which can be done by adopting the K-NN method introduced in step 62.

分析の結果により、ステップ７２において、未来の要求をシミュレートするために、過去のデータから要求データを抽出することができる。周期性分析により、サイクルの長さがＴ_{ｐｅｒｉｄ}であることが示されたと仮定する。分類では、要求は、Ｋ個のクラスに分割され、過去の要求収集Ｎ_{ｈｉｓｔｏｒｙ}は、いくつかのサブセットに分割される。

From the results of the analysis, in step 72, the requirement data can be extracted from the past data in order to simulate future requirements. It is assumed that periodicity analysis has shown that the cycle length is _Tperid . In the classification, the requests are divided into K classes, and the past request collection N _history is divided into several subsets.

ｐ個の要求がある時間間隔ｔ_{ｐｒｅｄｉｃｔ}で未来発生すると予測され、ｋ－ｓｔクラスの予測された比率がｗｋであると仮定する。したがって、カテゴリｋの要求数は、ｐｋ＝ｗｋ＊ｐである。過去のサブセットＮ_{ｔｐｒｅｄｉｃｔ，ｋ}，ｐ_ｋのサイズの範囲内で、要求を選択するために、乱数を生成することができる。次いで、すべてのカテゴリの要求を、シミュレートされた未来の要求の集合体に統合することができる。各過去の要求の発生時刻は、各単位時間間隔内で固定されていない。その相対的な開始時間は、次のように表すことができる。
ｔ_ｕｎｉｔ＝ｔ_{ａｃｔｕａｌ}－ｋＴ、ｋ＝０，１，２，・・・
未来の要求シーケンスのシミュレーション中に、実際の要求時間は、ｔ_{ｐｒｅｄｉｃｔ}＝ｔ_ｕｎｉｔ＋ｔ_{ｐｅｒｉｏｄ}として表すことができる。
上記の方法により、未来のワークロードトレースを生成することができる。また、未来のワークロードトレースに基づいて、予想されるワークロードに基づいて企業のユーザのニーズを満たすことができるように未来の期間についてサーバ数推奨を行うことができる（図２のステップ５８参照）。 It is assumed that p requests will occur in the future at a certain time interval _tpredict , and the predicted ratio of the k-st class is wk. Therefore, the number of requests for category k is pk = wk * p. Random numbers can be generated to select a request within a size range of the past subset N _{tredict, k} _, pk. Requests of all categories can then be integrated into a collection of simulated future requirements. The time of occurrence of each past request is not fixed within each unit time interval. The relative start time can be expressed as:
t _unit = t _actual -kT, k = 0,1,2, ...
During the simulation of future request sequences, the actual request time can be expressed as _tpredict = t _unit + _tperiod .
By the above method, future workload traces can be generated. Also, based on future workload traces, server count recommendations can be made for future time periods to meet the needs of enterprise users based on expected workloads (see step 58 in FIG. 2). ).

上述の解決策の１つの利点は、ワークロードが複数のメトリクスに基づいて測定されることであり、これは単一のメトリクスの制限を克服する。別の利点は、未来の要求数と各要求が様々なメトリクスに与える影響を考慮しながら、各要求によってもたらされるワークロードをモデル化することで、より正確な予測を行うことである。そのため、この解決策は、全体的なワークロード予想手法で経験した制限を克服する。 One advantage of the above solution is that the workload is measured based on multiple metrics, which overcomes the limitation of a single metric. Another advantage is to make more accurate predictions by modeling the workload that each requirement brings, taking into account the number of future requirements and the impact of each requirement on different metrics. Therefore, this solution overcomes the limitations experienced with the overall workload prediction approach.

予想された使用パターンの別の潜在的な有益な用途は、サービスとしてのデスクトップ（ＤａａＳ）を提供するクラウドコンピューティングベンダーによってホストされている、仮想デスクトップに移行する企業向けである。企業は、予測を実行して、ＣＰＵ数、ＲＡＭ容量、および／またはストレージ容量などの、予想されたＤａａＳリソースサイズを決定することができる。そのようにして、企業は、そのニーズのために過度に多くのクラウドコンピューティングリソースを保有しておくこと（かつ、それによってそのニーズのために過払いすること）、および過度に少ないラウドコンピューティングリソースしか保有しておかないこと（かつ、それによってそのユーザに対する必要とされるリソースを有しないこと）を回避することができる。 Another potential useful use of the expected usage pattern is for enterprises migrating to virtual desktops hosted by cloud computing vendors that offer desktops as a service (DaaS). Enterprises can perform forecasts to determine expected DaaS resource sizes such as CPU count, RAM capacity, and / or storage capacity. As such, an entity has too much cloud computing resources for its needs (and thereby overpaying for those needs), and too few loud computing resources. It is possible to avoid having only possession (and thereby not having the required resources for that user).

別の有益な用途は、企業が特定のワークロードまたはリソースに対して必要とするネットワークサーバ１８の数を決定することである。（上述の推奨方法のいずれかに従って）必要なサーバ数を示す特定のリソースに対する企業の予想された使用パターンに基づいて、企業は適切な数のサーバを調達することができる。例えば、システムが高負荷状態にあるとき、コントローラは要求を共有するために、例えば、仮想サーバなどのサーバをシステムに追加する。システムの負荷が低い場合、コントローラは、余分なリソースを返してコストを削減する。図６は、そのような決定をするためのコントローラ１２０（例えば、図１に示されているリソース予測コンピュータシステム２０）を図示している。 Another useful use is to determine the number of network servers 18 that an enterprise needs for a particular workload or resource. Based on the company's expected usage pattern for a particular resource that indicates the number of servers required (according to one of the recommended methods above), the company can procure the appropriate number of servers. For example, when the system is under heavy load, the controller adds a server to the system, for example, a virtual server, to share the request. When the system load is low, the controller returns extra resources to reduce costs. FIG. 6 illustrates a controller 120 (eg, the resource prediction computer system 20 shown in FIG. 1) for making such a decision.

システムの要求は、時系列データとして説明することができ、すなわち要求の数は、時間とともに周期的に変化している。例えば、昼間の要求は、概して夜間の要求よりも多くなる。本発明の様々な実施形態では、コントローラ１２０は、未来の要求を予測し、異なる方針に従ってシステムを調整する。プロセスの一般的なステップは、図６に関連して以下に説明される。 The requirements of the system can be described as time series data, i.e., the number of requirements changes cyclically over time. For example, daytime demands are generally higher than nighttime demands. In various embodiments of the invention, the controller 120 anticipates future demands and adjusts the system according to different policies. The general steps of the process are described below in connection with FIG.

ｋ－ＮＮ予測部１２２は、未来の要求を予測するためにｋ－ＮＮアルゴリズムを使用する。様々な実施形態では、予測部１２２は、特定の期間数ではなく、次の期間中に要求パターンを予測するだけである。要求パターンは、増加、減少、および変動の３つのタイプに分割することができる。要求パターンは、要求数がＸ１期間（例えば、３期間）にわたって増加した場合、またはＺ１期間（例えば、Ｚ１＝２）中にＹ１％（例えば、４０％）を超えて増加した場合、増加として分類することができる。増加パターンの後の次のパターンもまた、それらが連続的な増加しない期間を満たすまで増加パターンとして分類されることになる。Ｚ２期間（例えば、Ｚ２＝Ｚ１＝２）中に要求数がＸ２期間にわたって減少し続ける（例えば、Ｘ２＝Ｘ１＝３）、またはＹ２％を超えて減少する（例えば、Ｙ２＝Ｙ１＝４０％）場合、要求パターンは、減少として分類される。他のパターンは、変動として分類され、これを除いて様々な実施形態では、増加または減少パターンに続く期間も、場合によっては減少期間または増加期間が生じるまで、増加または減少に分類することができる。 The k-NN prediction unit 122 uses the k-NN algorithm to predict future demands. In various embodiments, the predictor 122 only predicts the request pattern during the next period, not a specific number of periods. Request patterns can be divided into three types: increase, decrease, and variation. The request pattern is classified as an increase if the number of requests increases over the X1 period (eg, 3 periods) or if it increases by more than Y1% (eg, 40%) during the Z1 period (eg, Z1 = 2). can do. The next pattern after the increase pattern will also be classified as an increase pattern until they meet a continuous non-increase period. During the Z2 period (eg, Z2 = Z1 = 2), the number of requests continues to decrease over the X2 period (eg, X2 = X1 = 3), or decreases by more than Y2% (eg, Y2 = Y1 = 40%). If so, the request pattern is classified as a decrease. Other patterns are classified as variability, and in various embodiments, the period following the increase or decrease pattern can also be classified as increase or decrease, in some cases until a decrease or increase period occurs. ..

図７は、１日の典型的なサービスシステムの負荷を示す。この例では、サンプル期間は、１５分ごとである。この例では、午前７時から午前８時まで、要求データは、増加傾向にある。要求数は、この時間枠内でわずかな減少期間を有して急速に増加する。午前８時から午前９時まで、要求数は、減少パターンにあり、要求数は、この１時間の期間の大部分で減少する。午前１０時から午後２時（すなわち、時間／時間軸上で１４時）まで、要求データは、この４時間の期間中要求数が変動し続けるので、変動パターンにある。他の期間も同様に分類することができる。要求パターンを３つのタイプに分割した後、ｋ－ＮＮ予測部１２２は、次の制御サイクルのための要求パターンを予測する。すべての制御サイクルの要求数が記録される。予測部１２２は、いくつかの連続する点（例えば、５つの連続する点）を１つのグループとしてグループ化し、最新のデータグループを分類するためのサンプルデータとする。次いで、ｋ－ＮＮ予測部１２２は、過去のデータから類似グループを探索する。様々な実施形態において、要求は通常周期性を示すので、ｋ－ＮＮ予測部１２２は、履歴内の類似の周期的時間を探索する。２つのグループ間の分散は、それらの間の類似度を決定するために使用される。次いで、ｋ－ＮＮ予測部１２２は、Ｋ個の最も類似したグループ（例えば、Ｋ＝５）を選択する。これらの類似のグループを見つけた後、ｋ－ＮＮ予測部１２２は、グループの次の期間のパターンを得て、次の期間に対する要求パターンの予測として最も高い頻度のパターンを取る。 FIG. 7 shows a typical service system load for a day. In this example, the sample period is every 15 minutes. In this example, the request data is on the increase from 7:00 am to 8:00 am. The number of requests increases rapidly with a slight decrease period within this time frame. From 8:00 am to 9:00 am, the number of requests is in a decreasing pattern, and the number of requests decreases for most of this one-hour period. From 10 am to 2 pm (ie, 14:00 on the time / time axis), the request data is in a variation pattern as the number of requests continues to fluctuate during this 4 hour period. Other periods can be categorized in the same way. After dividing the request pattern into three types, the k-NN prediction unit 122 predicts the request pattern for the next control cycle. The number of requests for all control cycles is recorded. The prediction unit 122 groups several consecutive points (for example, five consecutive points) into one group, and uses it as sample data for classifying the latest data group. Next, the k-NN prediction unit 122 searches for a similar group from past data. In various embodiments, the request exhibits normal periodicity, so the k-NN predictor 122 searches for similar periodic times in the history. The variance between the two groups is used to determine the similarity between them. The k-NN prediction unit 122 then selects K most similar groups (eg, K = 5). After finding these similar groups, the k-NN prediction unit 122 obtains the pattern for the next period of the group and takes the most frequent pattern as a prediction of the request pattern for the next period.

３つのパターンは、異なる特徴を有している。その結果、意志決定部１２６は、異なる制御方針を適用することができる。増加するパターンでは、仮想プロバイダーからサーバリソースを取得してサービスの開始を待つのに時間がかかるため、事前にサーバを追加する必要がある。以下の式は、追加サーバの必要数を計算するために使用することができる。

ここで、ｕ（ｔ）は、時刻ｔにおける制御出力であり、これは追加するサーバ数であり、ｅ（ｔ）は、システムの負荷と理想的な負荷との偏差であり、Ｋは、比例係数であり、ｒ´（ｔ）は、時間ｔにおける予測要求数であり、Ｔ_ｃは、制御周期であり、Ａは、予測影響係数である（予測が制御プロセスにどの程度影響するかを決定する）。式の右側の第１の部分（すなわち、Ｋｅ（ｔ））は、従来の比例制御である。第２の部分（すなわち、ＡＫ（ｒ´（ｔ＋Ｔ_ｃ）－ｒ´（ｔ）））は予測の影響を制御プロセスに導入し、予測影響係数によって出力に影響を与える。 The three patterns have different characteristics. As a result, the decision-making unit 126 can apply different control policies. In the increasing pattern, it takes time to get the server resource from the virtual provider and wait for the service to start, so it is necessary to add the server in advance. The following formula can be used to calculate the required number of additional servers.

Here, u (t) is the control output at time t, this is the number of servers to be added, e (t) is the deviation between the system load and the ideal load, and K is proportional. It is a coefficient, r'(t) is the number of prediction requests at time t, T _c is the control period, and A is the prediction influence coefficient (determines how much the prediction affects the control process). do). The first part on the right side of the equation (ie, Ke (t)) is conventional proportional control. The second part (ie, AK (r'(t + T _c ) -r' (t))) introduces the influence of the prediction into the control process and influences the output by the prediction influence factor.

減少パターンでは、サーバの低減が遅延をもたらすことはめったにないので、事前にサーバを減少させる必要性はほとんど、またはまったくない。したがって、意思決定部１２６は、システムの負荷が低いときにサーバを閉じてサーバに戻すことができる。計算は、式ｕ（ｔ）＝Ｋｅ（ｔ）に従って行うことができる。増加パターンに対する上述の式と比較して、減少パターンの式は、比例制御部分のみを含む。 In the reduction pattern, server reduction rarely results in delays, so there is little or no need to reduce servers in advance. Therefore, the decision-making unit 126 can close the server and return it to the server when the load on the system is low. The calculation can be performed according to the equation u (t) = Ke (t). Compared to the above equation for the increasing pattern, the decreasing pattern equation contains only the proportional control part.

変動パターンでは、意思決定部１２６は、要求が頻繁に変動する場合があるので、システムの負荷の変化に慎重に対処しなければならない。意志決定部１２６が、負荷に応じてサーバを変動させるだけの場合、サーバの起動と停止を繰り返すことで発生するエネルギー消費コストに加えて、サーバが１時間単位で支払われるため、システムが不安定になり、多大な無駄が生じる可能性がある。そのため、変動パターンについては、意思決定部１２６は、以下の式を使用することができる。

ｅ（ｔ）≧０のとき、意志決定部１２６は、比例制御によってサーバを追加するだけである。ｅ（ｔ）＜０の場合、これはサーバ数を低減する必要があり得ることを意味し、意思決定部１２６は、最後の３つの制御期間（または他のいくつかの制御期間）にわたる負荷レベルを収集し、これらの期間すべての間システムが低負荷状態にあった場合、コントローラは、（３つの制御期間のうちの）負荷が最も大きい制御期間を選択して、減少パターンについて上述したようにクローズして戻すサーバ数を計算する。制御ポリシーの目的は、システムの安定性を確保しながらサーバ数を安定させることである。 In the variation pattern, the decision-making unit 126 must carefully deal with changes in the load of the system because the requirements may vary frequently. If the decision-making unit 126 only changes the server according to the load, the system is unstable because the server is paid on an hourly basis in addition to the energy consumption cost generated by repeatedly starting and stopping the server. This can lead to a great deal of waste. Therefore, for the fluctuation pattern, the decision-making unit 126 can use the following equation.

When e (t) ≧ 0, the decision-making unit 126 only adds a server by proportional control. If e (t) <0, this means that the number of servers may need to be reduced, and the decision-making unit 126 has a load level over the last three control periods (or some other control period). If the system was in a low load state for all of these periods, the controller would select the control period with the highest load (out of the three control periods) and as described above for the reduction pattern. Calculate the number of servers to close and return. The purpose of the control policy is to stabilize the number of servers while ensuring the stability of the system.

サーバを追加することは、いくらかの遅延を有する場合があるため、コントローラ１２０は、遅延の影響を補償するためにスミス予測部１２４を含んでもよい。スミス予測部１２４は、利用可能／購入されているがまだ使用されていないサーバ数を入力として受信することができる。この入力に基づいて、スミス予測部１２４は、これらのサーバの容量を推定し、これらの保留容量をシステムの現在の負荷に対して補償する。次いで、補償されたシステムの負荷は、意思決定プロセスで使用するために意思決定部１２６に配信される。スミス予測部１２４の原理は、様々な実施形態において、補償部分がシステムのフィードバックに導入され、それが遅延されたフィードバック量を事前に意志決定部１２６に入れることである。 Since adding a server may have some delay, the controller 120 may include a Smith predictor 124 to compensate for the effects of the delay. The Smith predictor 124 can receive as input the number of servers available / purchased but not yet used. Based on this input, Smith Predictor 124 estimates the capacity of these servers and compensates for these reserved capacities against the current load on the system. The compensated system load is then delivered to decision-making unit 126 for use in the decision-making process. The principle of Smith Prediction Unit 124 is that, in various embodiments, a compensation portion is introduced into the feedback of the system, which pre-populates the decision-making unit 126 with the delayed feedback amount.

図８は、本発明の様々な実施形態によるコントローラ１２０の伝達関数である。この図では、Ｇ_０（ｓ）ｅ^－ｔｓは、システム内の遅れを表し、スミス予測部の伝達関数は、Ｇ_０（ｓ）（１－ｅ^－ｔｓ）である。スミス予測部１２４は、ｋ－ＮＮ予測部１２２Ｇ_τ（ｓ）に接続され、純粋な時間遅延補償を有するコントローラが形成される。スミス予測部を追加すると、システムの閉ループ伝達関数は、次のようになる。

特性方程式は、次のとおりである。

特性方程式内の遅延項は、スミス予測部の補償後に除去され、これはシステムに対する遅延の影響を低減することが分かる。 FIG. 8 is a transfer function of the controller 120 according to various embodiments of the present invention. In this figure, G ₀ (s) e- ^ts represents a delay in the system, and the transfer function of the Smith predictor is G ₀ (s) (1-e- ^ts ). The Smith prediction unit 124 is connected to the k-NN prediction unit 122 G _τ (s) to form a controller with pure time delay compensation. With the addition of the Smith predictor, the system's closed-loop transfer function looks like this:

The characteristic equation is as follows.

It can be seen that the delay term in the characteristic equation is removed after the Smith predictor compensates, which reduces the effect of the delay on the system.

学習部１２８は、システム負荷および状態情報を収集し、その情報を使用して、意思決定部１２６において比例係数Ｋをオンラインで調整する。比例係数Ｋは、コントローラ１２０がシステムの負荷にどのように反応するかを決定する。Ｋ値が大きいほど、より多くの調整が行われるが、それによってシステムはより変動しやすくなる。学習部１２８は、式に従ってオンラインでＫを調整することができ

式中、Ｔは、サンプリング期間の長さであり、ｓは、この期間の平均サーバ数であり、ｒは、この期間の完了した要求数であり、ｓＴ／ｒは、単位時間あたりの単位サーバの効率を表し、Ｌは、修正係数で、調整率を決定する。Ｌの値が大きいと、Ｋの応答はより速くなるが、変動にもつながる場合がある。 The learning unit 128 collects system load and state information and uses that information to adjust the proportionality factor K online in the decision unit 126. The proportionality factor K determines how the controller 120 reacts to the load of the system. The higher the K value, the more adjustments are made, which makes the system more volatile. The learning unit 128 can adjust K online according to the formula.

In the equation, T is the length of the sampling period, s is the average number of servers in this period, r is the number of completed requests in this period, and sT / r is the unit server per unit time. Represents the efficiency of, and L is a correction coefficient, which determines the adjustment rate. The larger the value of L, the faster the response of K, but it can also lead to fluctuations.

コントローラ１２０は、部材２６に格納されている適切なソフトウェア命令を用いて上述のプロセスを実施することができ、コンピュータシステム２０のプロセッサ（複数可）２４によって実行される（図１参照）。例えば、メモリ２６は、上述のプロセスを実施するためにプロセッサ（複数可）２４によって実行される、ｋ－ＮＮ予測、スミス予測、学習、および意思決定のためのソフトウェアモジュールを含んでもよい。その結果をブローカコンピュータシステム４０に提供することができ、場合によっては、それはネットワークサーバのランプアップまたはランプダウンを実行することができる。これら４つの部分（１２２、１２４、１２６、１２８）を組み入れ、様々な要求パターンに対して異なる制御ポリシーを適用することによって、本発明の実施形態は、ほとんどの期間にわたって、スケーラブルクラウドシステムに適切な制御を行うことができ、システムの性能を保証し、かつ不要なコストを削減する。 The controller 120 can perform the above-mentioned processes using appropriate software instructions stored in the member 26 and is executed by the processor (s) 24 of the computer system 20 (see FIG. 1). For example, the memory 26 may include software modules for k-NN prediction, Smith prediction, learning, and decision making, which are executed by the processor (s) 24 to perform the process described above. The result can be provided to the broker computer system 40, which in some cases can ramp up or down the network server. By incorporating these four parts (122, 124, 126, 128) and applying different control policies for different request patterns, embodiments of the invention are suitable for scalable cloud systems for most of the time. It can be controlled, guarantees the performance of the system, and reduces unnecessary costs.

さらに別の有益な用途は、仮想ユーザのワークロードのシミュレーションである。仮想システムの模擬負荷テストのために、実際のクライアントワークロードに基づいた履歴的な時間データおよび未来の時間データを生成することができる。 Yet another useful use is the simulation of virtual user workloads. Historical and future time data based on real client workloads can be generated for simulated load testing of virtual systems.

したがって、様々な実施形態では、本発明は、企業コンピュータシステムに対するネットワークリソースのニーズを予想するコンピュータシステムおよび関連するコンピュータ実装方法を対象とする。様々な実施形態によれば、コンピュータデータベースシステムは、複数のネットワークサーバに関する多変量時系列（ＭＴＳ）性能データを受信して格納する。ＭＴＳ性能データは、一連の過去のサンプリング時間についての複数のネットワークサーバに対する複数のｄ個の性能変数に対するデータを含む。プログラムされたコンピュータシステム（例えば、リソース予測コンピュータシステム２０）は、ＭＴＳ性能データ内の性能変数の各々が変数グループに属するように、ＭＴＳ性能データ内の変数を２つ以上の変数グループにグループ化する。次いで、プログラムされたコンピュータシステムは、１つ以上の未来の時間範囲ステップで変数の予測を計算することによって、企業コンピュータシステムのネットワークサーバの未来のワークロードの予測を計算する。プログラムされたコンピュータシステムは、以下を含むステップを実行することによって予測を計算する。
（ｉ）２つ以上の変数グループに適用されるｋ最近傍探索アルゴリズムを使用して、ＭＴＳ性能データの基準状態に対するｋ個の最近傍を見つけること、および
（ｉｉ）ｋ個の最近傍の加重平均を計算すること。次いで、プログラムされたコンピュータシステムは、計算された予測に基づいて、１つ以上の未来の時間範囲ステップの各々において、企業コンピュータシステムのユーザによるリソース要求に対処するために、企業が動作モードにあるために必要な推奨ネットワークサーバ数を決定することができる。 Accordingly, in various embodiments, the present invention is directed to computer systems and related computer implementation methods that anticipate the needs of network resources for enterprise computer systems. According to various embodiments, the computer database system receives and stores multivariate time series (MTS) performance data for a plurality of network servers. The MTS performance data includes data for a plurality of d performance variables for a plurality of network servers for a series of past sampling times. A programmed computer system (eg, resource prediction computer system 20) groups variables in MTS performance data into two or more variable groups such that each of the performance variables in MTS performance data belongs to a variable group. .. The programmed computer system then computes the forecasts of future workloads on the network servers of the corporate computer system by computing the forecasts of the variables in one or more future time range steps. The programmed computer system calculates the prediction by performing a step that includes:
(I) Finding k nearest neighbors to the reference state of the MTS performance data using the k nearest neighbor search algorithm applied to two or more variable groups, and (ii) weighting k nearest neighbors. Calculate the average. The programmed computer system is then put into operation mode to address resource demands by users of the enterprise computer system at each of one or more future time range steps, based on calculated predictions. You can determine the recommended number of network servers required for this.

様々な実装態様では、複数のネットワークサーバに対する性能変数は、少なくとも、単位期間あたりのＣＰＵ負荷、一次コンピュータメモリ利用率、および二次コンピュータストレージ入出力（ＩＯ）操作を示す変数を含む。また、変数をグループ化するステップは、プログラムされたコンピュータシステムによって、性能変数の各対間の相関を示す相関行列を計算するステップと、プログラムされたコンピュータシステムによって、クラスタリングアルゴリズムを使用して相関行列に基づいて変数グループを決定するステップと、も含んでもよい。相関行列はスピアマン相関行列を含んでもよく、クラスタリングアルゴリズムは、アフィニティプロパゲーションクラスタリングアルゴリズムを含んでもよい。 In various implementations, performance variables for multiple network servers include at least variables that indicate CPU load per unit period, primary computer memory utilization, and secondary computer storage input / output (IO) operations. Also, the steps for grouping variables are the step of calculating the correlation matrix showing the correlation between each pair of performance variables by the programmed computer system and the step of using the clustering algorithm by the programmed computer system. It may also include a step of determining a variable group based on. The correlation matrix may include a Spearman correlation matrix, and the clustering algorithm may include an affinity propagation algorithm.

また、ＭＴＳ性能データの基準状態に最も近いｋ個の最近傍を見つけるステップは、（ｉ）ＭＴＳの基準状態を表すベクトルとＭＴＳデータを表す複数のベクトルの各々との間の距離をそれぞれ以前のサンプリング時間において計算することと、（ｉｉ）ＭＴＳの基準状態を表すベクトルまでの距離が最も小さい過去のサンプリング時間についてｋ個のベクトルを決定することと、を含んでもよい。計算された距離は、ユークリッド距離であってもよい。 Further, in the step of finding the k nearest neighbors closest to the reference state of the MTS performance data, (i) the distance between each of the vector representing the reference state of the MTS and each of the plurality of vectors representing the MTS data is previously set. It may include calculating in sampling time and (ii) determining k vectors for the past sampling time with the shortest distance to the vector representing the reference state of MTS. The calculated distance may be the Euclidean distance.

さらに、プログラムされたコンピュータシステムは、未来の時間範囲ステップのうちの少なくとも１つについて、企業コンピュータシステムのユーザによるリソース要求に対処するために、企業が動作モードにあるために必要な推奨ネットワークサーバ数を示すデータを送信してもよい。したがって、１つ以上のブローカコンピュータシステムは、推奨ネットワークサーバ数が少なくとも１つの未来の時間範囲ステップの間ユーザによるリソース要求に対処するための動作モードになるように、ネットワークサーバに命令してもよい。 In addition, the programmed computer system is the recommended number of network servers required for the enterprise to be in operating mode in order to address resource demands by users of the enterprise computer system for at least one of the future time range steps. Data indicating the above may be transmitted. Therefore, one or more broker computer systems may instruct network servers so that the recommended number of network servers is in an operating mode for addressing resource requests by users for at least one future time range step. ..

別の一般的な態様では、本発明は、未来の期間にわたるネットワークサーバの未来のワークロードを予測するためのコンピュータシステムおよび関連するコンピュータ実装方法を対象とする。プログラムされたコンピュータシステム（例えば、リソース予測コンピュータシステム２０）は、期間のｋ個の最も最近のサブシーケンスのソートに基づいて、未来の期間についての要求数ｐを予測し、単位時間Ｔ当たりの企業コンピュータシステムのユーザによるネットワークサーバへの要求数は、最近の期間の現在のサブシーケンスと最も類似している。次いで、プログラムされたコンピュータシステムは、（データベース２２に格納されたデータに基づいて）過去の要求を要求の属性に基づいて２つ以上の要求タイプクラスに分類する。次いで、プログラムされたコンピュータシステムは、２つ以上の要求タイプクラスの各々における過去の要求の比率に基づいて、２つ以上の要求タイプクラスの各々について、未来の期間における要求の比率を予測する。次いで、プログラムされたコンピュータシステムは、要求タイプクラスについて１つ以上の要求属性に対する周期性を決定する。次いで、プログラムされたコンピュータシステムは、ｐ個のサンプルが、２つ以上の要求タイプクラスの各々の予測比率を有するように、かつｐ個のサンプルが、要求タイプクラスの周期性に基づいて、将来の期間と同じ要求サイクルポイントからのものであるように、ｐ個の過去の要求をサンプリングする。最後に、プログラムされたコンピュータシステムは、ｐ個のサンプリングされた過去の要求を合成して未来の期間のネットワークサーバに対するワークロードトレースを得る。 In another general aspect, the invention relates to computer systems and related computer implementation methods for predicting future workloads of network servers over future time periods. A programmed computer system (eg, a resource prediction computer system 20) predicts the number of requests p for a future period based on a sort of k most recent subsequences of the period, and the enterprise per unit time T. The number of requests from a computer system user to a network server is most similar to the current subsequence for a recent period. The programmed computer system then classifies past requests (based on the data stored in database 22) into two or more request type classes based on the attributes of the request. The programmed computer system then predicts the ratio of requirements in the future period for each of the two or more request type classes, based on the ratio of past requirements in each of the two or more request type classes. The programmed computer system then determines the periodicity for one or more request attributes for the request type class. The programmed computer system will then be configured so that the p samples have a predicted ratio of each of the two or more request type classes, and the p samples are based on the periodicity of the request type class in the future. Sample p past requests so that they are from the same request cycle point as the period of. Finally, the programmed computer system synthesizes p sampled past requests to obtain a workload trace for a network server in a future period.

様々な実装態様によれば、１つ以上のブローカシステムは、予測された未来のワークロードに基づいて、未来の期間にネットワークサーバの状態を調整することができる。また、プログラムされたコンピュータシステムは、過去のリクエストの属性の相関分析を実行し、相関分析に基づいて過去の要求を２つ以上のクラスに分類することを含め、要求の属性に基づいて、過去の要求を２つ以上の要求タイプクラスに分類することもできる。プログラムされたコンピュータシステムは、相関分析を実行する際に要求の属性のペア間のピアソン相関係数を計算し、要求の要求属性に基づいて、要求を２つ以上の要求に分割するためにクラスタリングアルゴリズムを使用することができる。また、高速フーリエ変換を周期性分析に使用して、２つ以上のクラスの１つ以上の要求属性のサイクル長を計算することもできる。 According to various implementations, one or more broker systems can adjust the state of the network server for future time periods based on the predicted future workload. The programmed computer system also performs a correlation analysis of the attributes of past requests, including classifying past requests into two or more classes based on the correlation analysis, based on the attributes of the past . Requests can also be classified into two or more request type classes. The programmed computer system calculates the Pearson correlation coefficient between the pairs of request attributes when performing a correlation analysis and clusters to divide the request into two or more requests based on the request attributes of the request. Algorithms can be used. It is also possible to use the Fast Fourier Transform for periodic analysis to calculate the cycle length of one or more required attributes of two or more classes.

別の一般的な態様によれば、本発明は、企業コンピュータシステムに対するネットワークリソースのニーズを予想するコンピュータシステムおよび関連するコンピュータ実装方法を対象とする。プログラムされたコンピュータシステム（例えば、リソース予測コンピュータシステム２０）は、企業コンピュータシステムユーザのユーザ要求データに基づいて、未来の時間範囲ステップについて予測されたユーザリソース要求パターンを決定する。次いで、プログラムされたコンピュータシステムは、未来の時間範囲ステップに対する予測されたユーザリソース要求パターンの分類に基づいて、未来の時間範囲ステップについて企業が必要とする推奨サーバ数を計算する。そうすることで、プログラムされたコンピュータシステムは、未来の時間範囲ステップに対する予測されたユーザリソース要求パターンが増加として分類されるときに、第１の制御ポリシーを使用して推奨サーバ数を計算し、未来の時間範囲ステップに対する予測されたユーザリソース要求パターンが減少として分類されたときに、第２の制御ポリシーを使用して推奨サーバ数を計算する。 According to another general aspect, the invention relates to computer systems and related computer implementation methods that anticipate the need for network resources for enterprise computer systems. The programmed computer system (eg, the resource prediction computer system 20) determines the predicted user resource request pattern for future time range steps based on the user request data of the enterprise computer system user. The programmed computer system then calculates the recommended number of servers required by the enterprise for future time range steps based on the classification of predicted user resource request patterns for future time range steps. In doing so, the programmed computer system uses the first control policy to calculate the recommended number of servers when the predicted user resource request pattern for future time range steps is classified as increasing. The second control policy is used to calculate the recommended number of servers when the predicted user resource request pattern for future time range steps is classified as diminished.

様々な実装態様では、第１の制御ポリシーは、比例定数Ｋを含む第１の式を使用して推奨サーバ数を計算し、第２の制御ポリシーは、比例定数Ｋを含む第２の式を使用して推奨サーバ数を計算する。その場合、プログラムされたコンピュータシステムは、単位期間内の単位ネットワークサーバの効率に基づいて、第１および第２の制御ポリシーに対して、比例定数Ｋをオンラインで調整することもできる。また、プログラムされたコンピュータシステムは、未来の時間範囲ステップに対する予測されたユーザリソース要求パターンが増加または減少として分類されないときに、第３の制御ポリシーを使用して推奨サーバ数を計算することもできる。３番目の制御ポリシーは、比例定数Ｋを含む（第３の）式を使用して推奨サーバ数を計算することもできる。 In various implementations, the first control policy uses the first equation containing the proportionality constant K to calculate the recommended number of servers, and the second control policy uses the second equation containing the proportionality constant K. Use to calculate the recommended number of servers. In that case, the programmed computer system can also adjust the proportionality constant K online for the first and second control policies, based on the efficiency of the unit network server within the unit period. The programmed computer system can also use a third control policy to calculate the recommended number of servers when the predicted user resource request pattern for future time range steps is not classified as increasing or decreasing. .. The third control policy can also calculate the recommended number of servers using the (third) equation containing the proportionality constant K.

様々な実装態様では、未来の時間範囲ステップに対する予測されたユーザリソース要求パターンは１つ以上の条件が満たされると増加して分類され、この１つ以上の条件は、予測要求数が連続した時間範囲ステップの第１の閾値数（Ｎ１）を超えて増加したという第１の条件と、予測要求数が連続した時間範囲ステップの第２の閾値数（Ｎ２）を超えて第１の閾値パーセンテージより多く増加したという第２の条件と、を含む。また、増加の分類のための１つ以上の条件は、（ｉ）直前の時間範囲ステップが増加として分類され、かつ（ｉｉ）未来の時間範囲ステップが減少として分類されないという第３の条件を含むことができる。同様に、未来の時間範囲ステップに対する予測されたユーザリソース要求パターンは、１つ以上の条件が満たされると減少として分類することができ、この１つ以上の条件は、予測要求数が、連続した時間範囲ステップの第３の閾値数（Ｎ３）を超えて減少したという第１の条件、および予測要求数が、連続した時間範囲ステップの第４の閾値数（Ｎ４）を超えて第２の閾値パーセンテージより多く減少したという第２の条件を含む。（ｉ）直前の時間範囲ステップが減少として分類され、かつ（ｉｉ）未来の時間範囲ステップが増加として分類されないという、減少の分類のための第３の条件もある可能性がある。プログラムされたコンピュータシステムはまた、増加する未来の時間範囲ステップのために、企業によって必要とされる推奨サーバ数を計算するときに、現在使用されていない利用可能なネットワークサーバの容量を補償することもできる。 In various implementations, the predicted user resource request patterns for future time range steps are categorized as increasing when one or more conditions are met, which one or more conditions are the duration of the predicted number of requests. From the first condition that the number of prediction requests exceeds the first threshold number (N1) of the range step and the first threshold percentage exceeds the second threshold number (N2) of the continuous time range step. Includes the second condition that there is a large increase. Also, one or more conditions for the classification of increase include a third condition that (i) the immediately preceding time range step is classified as an increase and (ii) the future time range step is not classified as a decrease. be able to. Similarly, a predicted user resource request pattern for a future time range step can be classified as a decrease when one or more conditions are met, one or more of which is a contiguous number of predicted requests. The first condition that the number of prediction requests decreased by exceeding the third threshold number (N3) of the time range step, and the second threshold value exceeded the fourth threshold number (N4) of the continuous time range step. Includes a second condition that it is reduced by more than a percentage. There may also be a third condition for the classification of declines, where (i) the previous time range step is classified as decreasing and (ii) the future time range step is not classified as increasing. The programmed computer system also compensates for the capacity of available network servers that are not currently in use when calculating the recommended number of servers required by the enterprise for an increasing future time range step. You can also.

例えば、図１は、本発明の態様を理解するのに十分な企業コンピュータシステム１０の構成要素のみを図示していることに留意されたい。企業コンピュータシステム１０は、有線（例えば、ツイストペアケーブルを介したイーサネット）または無線（例えば、Ｗｉ－Ｆｉ）の通信リンクを有する、多数の相互接続ネットワークされたＬＡＮ、ＷＡＮ、ＭＡＮなどを含むことができることを認識するべきである。ネットワークサーバ１８は、企業の要求に応じて多数の異なるタイプのリソースに対処することができ、クライアントコンピュータ１６は、ラップトップ、パーソナルコンピュータ、タブレットコンピュータ、スマートフォンなどのような、任意の好適なタイプのネットワーク対応エンドユーザコンピュータデバイスとすることができる。リソース予測コンピュータシステム２０は、１つ以上のネットワークコンピュータ装置によって実装することができる。リソース予測コンピュータシステム２０が複数のコンピュータ装置を含む場合、それらは１つ以上のＬＡＮ、ＷＡＮ、ＭＡＮなどによって相互接続ネットワークすることができる。さらに、企業コンピュータシステム１０は、サーバ１８をプロビジョニングするための追加のブローカコンピュータ４０を含んでもよい。 It should be noted, for example, that FIG. 1 illustrates only the components of a corporate computer system 10 sufficient to understand aspects of the invention. The corporate computer system 10 can include a large number of interconnected LANs, WANs, MANs, etc. with wired (eg, Ethernet over twisted pair cable) or wireless (eg, Wi-Fi) communication links. Should be recognized. The network server 18 can handle a large number of different types of resources according to the demands of the enterprise, and the client computer 16 can be any suitable type such as laptops, personal computers, tablet computers, smartphones and the like. It can be a network-enabled end-user computer device. The resource prediction computer system 20 can be implemented by one or more network computer devices. When the resource prediction computer system 20 includes a plurality of computer devices, they can be interconnected by one or more LANs, WANs, MANs, and the like. In addition, the corporate computer system 10 may include an additional broker computer 40 for provisioning the server 18.

本明細書に記載のソフトウェアモジュールは、プロセッサ（複数可）２４がモジュールのソフトウェアプログラムを実行するときに、プロセッサ（複数可）２４が本明細書に記載のモジュールの機能を実行させるように、パイソン（Ｐｙｔｈｏｎ）などの任意の好適なコンピュータ言語で書かれた１つ以上のコンピュータプログラム内に実装することができる。命令のプログラムを実行するために好適なプロセッサ２４は、例として、一般および特殊用途の両方のマイクロプロセッサを含む。さらに、本明細書に記載の任意のコンピュータ構成要素は、単一のプロセッサまたは複数のプロセッサを含むことができる。プロセッサ２４は、読み出し専用メモリ、もしくはランダムアクセスメモリ、またはその両方から命令およびデータを受信する。 The software modules described herein are pythons such that when the processor (s) 24 execute the software program of the module, the processor (s) 24 may perform the functions of the module described herein. It can be implemented in one or more computer programs written in any suitable computer language such as (Python). Suitable processors 24 for executing instructional programs include, for example, both general and special purpose microprocessors. In addition, any computer component described herein can include a single processor or multiple processors. The processor 24 receives instructions and data from read-only memory and / or random access memory.

本明細書は、多くの具体的な実装態様の詳細を含むが、これらはいかなる発明の範囲または特許請求され得るものの限定としても解釈されるべきではなく、むしろ特定の発明の特定の実装態様に特有である可能性のある特徴の説明として解釈されるべきである。本明細書において別々の実装態様の文脈で記載されている特定の特徴は、単一の実装態様において組み合わせて実装することもできる。逆に、単一の実装態様の文脈で記載されている様々な特徴は、別々にまたは任意の好適な副次的組み合わせで複数の実装態様で実施することもできる。さらに、特徴は、ある組み合わせにおいて作用するものとして上で説明されてもよく、かつ当初はそのように特許請求されてもよいが、場合によっては特許請求された組み合わせからの１つ以上の特徴を切り取ることができ、また特許請求された組み合わせは副次的組み合わせまたは副次的組み合わせの変形を対象としてもよい。 The present specification contains details of many specific implementations, which should not be construed as a limitation of any scope of the invention or claims, but rather to a particular implementation of a particular invention. It should be interpreted as an explanation of features that may be unique. Certain features described herein in the context of different implementations can also be implemented in combination in a single implementation. Conversely, the various features described in the context of a single implementation can also be implemented in multiple implementations separately or in any suitable secondary combination. Further, the features may be described above as acting in a combination, and may be initially claimed as such, but in some cases one or more features from the claimed combination. It can be cut out, and the claimed combination may be subject to a secondary combination or a modification of the secondary combination.

同様に、動作は特定の順序で図面に描かれているが、これは望ましい結果を達成するためには、そのような動作が示された特定の順序または順次に実行されること、またはすべての図示された動作が実行されることが必要であるように理解するべきではない。特定の状況では、多重タスク処理および並列処理が有利な場合がある。さらに、上述の実装態様における様々なシステム構成要素の分離は、すべての実装態様においてそのような分離を必要とすると理解されるべきではなく、記載されたプログラム構成要素およびシステムは、概して単一の製品において統合することができ、または複数の製品にパッケージ化することができると理解されるべきである。 Similarly, the actions are depicted in the drawing in a particular order, but this is to be performed in a particular order or sequence in which such actions are shown, or all, in order to achieve the desired result. It should not be understood that the illustrated actions need to be performed. In certain situations, multiple task processing and parallel processing may be advantageous. Moreover, the separation of the various system components in the implementations described above should not be understood as requiring such separation in all implementations, and the program components and systems described are generally single. It should be understood that it can be integrated in a product or packaged into multiple products.

このようにして、主題の特定の実装態様を記載した。他の実装態様は、以下の特許請求の範囲内にある。場合によっては、請求項に記載されている動作は異なる順序で実行することができ、それでも望ましい結果を達成することができる。さらに、添付の図面に示されたプロセスは、望ましい結果を達成するために、示された特定の順序、または順次順序を必ずしも必要としない。特定の実装態様では、多重タスク処理および並列処理が有利である場合がある。
In this way, specific implementation embodiments of the subject have been described. Other implementations are within the scope of the following claims. In some cases, the operations described in the claims can be performed in a different order and still achieve the desired result. Moreover, the process shown in the accompanying drawings does not necessarily require the particular order or sequential order shown to achieve the desired result. In certain implementations, multiple task processing and parallel processing may be advantageous.

Claims

A computer implementation method for predicting future workloads of a network server of a corporate computer system over a future period, wherein the corporate computer system hosts multiple networks hosting computer resources for users of the corporate computer system. The above method is equipped with a server.
The programmed computer system predicts the number of requests during the future period based on the sort of k past subsequences, each of which is a unit. It contains a time series of requests per hour, the k past subsequences are the k past subsequences closest to the current subsequence, and the current subsequence is the current time. Predicting and predicting, which is a time series of requests per unit time
The programmed computer system classifies past requests in the k past subsequences into two or more request type classes based on the attributes of the past requests .
The future for each of the two or more request type classes based on the ratio of the past requests of the k past subsequences in each of the two or more request type classes by the programmed computer system. Predicting the ratio of requests within the period of
The programmed computer system determines the periodicity of the past request attributes for the two or more request type classes.
Sampling p past requests by a programmed computer system, the sampled p past requests having a predicted ratio for each of the two or more request type classes. And the sampled p past requests are from the same cycle point in time as the future period, based on the periodicity of the attributes of the past requests in the two or more request type classes. Sampling and
To obtain a workload trace for the plurality of network servers for the future period by synthesizing the p past requests sampled by the programmed computer system, in the workload trace. Includes, captures, and methods of future workloads of said plurality of network servers in said future time period .

The method of claim 1, further comprising adjusting the state of the plurality of network servers in the future period based on the predicted future workload.

It is possible to classify the past request into the two or more request type classes based on the attributes of the past request.
Performing a correlation analysis of the attributes of the past request by the programmed computer system,
The method of claim 1, comprising classifying the past requirements into the two or more classes by the programmed computer system based on the correlation analysis.

The method of claim 3, wherein performing the correlation analysis comprises calculating a Pearson correlation coefficient between the pairs of attributes of the past request.

Classification of the past request uses a clustering algorithm for the programmed computer system to divide the past request into the two or more request type classes based on the attributes of the past request. The method according to claim 4, comprising the above.

Determining the periodicity for the past request attributes of the two or more request type classes is a fast Fourier transform to calculate the cycle length of the past request attributes of the two or more request type classes. The method of claim 1, comprising using a transform.

A system for predicting future workloads of network servers in a corporate computer system over a future period, wherein the corporate computer system hosts multiple network servers that host computer resources for users of the corporate computer system. In preparation, the system
A computer database system that communicates with the plurality of network servers to store data about past requests made by the user to the plurality of network servers.
A predictive computer system that communicates with the computer database system, wherein the predictive computer system
Predicting the number of requests for future time periods based on the sort of k past subsequences, each of the k past subsequences containing a time series of requests per unit time. , The k past subsequences are the k past subsequences closest to the current subsequence, and the current subsequence is a time series of requests per unit time in the current time. Predicting and
To classify past requests in the k past subsequences into two or more request type classes based on the attributes of the past requests .
Percentage of requirements in the future period for each of the two or more request type classes, based on the ratio of the past requirements of the k past subsequences in each of the two or more request type classes. To predict and
Determining the periodicity of the attributes of the past request for the two or more request type classes.
By sampling p past requests, the sampled p past requests have the predicted proportions for each of the two or more request type classes and are sampled . The p past requests made are sampled from the same cycle point in time as the future period, based on the periodicity of the attributes of the past requests in the two or more request type classes. To do and
Combining the p past requests sampled to obtain a workload trace for the plurality of network servers for the future period, the workload trace is for the future period. A predictive computer system, which is programmed to predict the future workload of the network server by performing steps including acquiring and including the future workload of the plurality of network servers. The system.

It further comprises a broker computer system that communicates with the predictive computer system, for the broker computer system to adjust the state of the network server in the future period based on the predicted future workload. , The system according to claim 7.

The predictive computer system
Performing a correlation analysis of the attributes of the past request,
Based on the correlation analysis, the past requirements are classified into the two or more classes, and the past requirements are classified based on the attributes of the past requirements by performing a step including. The system according to claim 7, which is programmed to classify into two or more request type classes.

9. The predictive computer system is programmed to perform the correlation analysis by performing a step comprising calculating a Pearson correlation coefficient between the pairs of attributes of the past request. The system described in.

The past by performing steps that include the predictive computer system using a clustering algorithm to divide the past request into the two or more request type classes based on the attributes of the past request. 10. The system of claim 10, which is programmed to classify the requirements of.

The two or more by performing a step in which the predictive computer system includes using a Fast Fourier Transform to calculate the cycle length of the attributes of the past request of the two or more request type classes. 7. The system of claim 7, programmed to determine the periodicity of the request type class with respect to the attributes of the past request.