JP3805711B2

JP3805711B2 - Method for identifying bottlenecks in a site area

Info

Publication number: JP3805711B2
Application number: JP2002103695A
Authority: JP
Inventors: 正樹徳久; 芳之千葉; 宜伯川村; 慎也能上
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2002-04-05
Filing date: 2002-04-05
Publication date: 2006-08-09
Anticipated expiration: 2022-04-05
Also published as: JP2003298655A

Description

【０００１】
【発明の属する技術分野】
本発明は、サイト領域内ボトルネック特定方法に関し、詳しくは、インターネットのエンド・トゥ・サーバ（End-to-Server ）におけるサイト領域に設置された任意のコンピュータにより、当該サイト領域に存在する複数のネットワーク要素がボトルネックを構成するか否かを特定するためのサイト領域内ボトルネック特定方法に係わる。
【０００２】
【従来の技術】
近年、インターネットにおける性能、経済性、信頼性といったネットワーク品質に対する関心が高まり、広域ネットワークの特性を計測する手法の重要性が増してきている。
【０００３】
例えば、ＭＳＰ（Managed Service Provider）において、映像等の大衆向け配信サービスを提供するアプリケーション、ネットワーク、サーバを対象とした品質管理サービス事業を効率良く実現するには、対象ネットワークの品質劣化監視及びその検出分析を行うことが必須となる。
【０００４】
このため、上記のようなサービスを提供する際に、エンドユーザからの申告等によりネットワーク品質に劣化が生じていることが判明した場合には、サービス提供者は、その劣化の要因となっている対象ネットワーク上の箇所、即ちボトルネックを早急かつ的確に特定し、これに対処することが責務であるといえる。
【０００５】
従来、対象ネットワーク上のエンド・トゥ・サーバにおけるボトルネックを特定する方法としては、その対象ネットワーク上に存在する全てのサーバの中で、何れのサーバがボトルネックとなっているかをリモートで逐一検査していくものが知られている。
【０００６】
しかし、この方法では、対象ネットワークが大規模になるにつれて、自ずと適用性が悪くなり、当該対象ネットワーク中におけるボトルネック特定箇所を総合的に判断することも困難である。
【０００７】
これに対し、対象ネットワーク上のエンド・トゥ・エンド（End-to-End）におけるボトルネックを特定する方法として、ＭＩＮＣプロジェクト（ＭＩＮＣ：Multicast-based Inference of Network-internal Characteristics ）によるものが知られている。
【０００８】
このＭＩＮＣプロジェクトによる方法は、マルチキャストにより、対象ネットワーク上の１つの始点から多数の終点へ向けて試験パケットを送信し、このときのエンド・トゥ・エンドの観測データからパス上の特性を得て、その対象ネットワーク中のパケットロスや遅延を推定するものである。
【０００９】
【発明が解決しようとする課題】
しかしながら、上述のＭＩＮＣプロジェクトによる方法は、マルチキャスト適用時における木構造の特質から理論的には高い精度を持つものの、インターネットが、その広域性と管理主体の分散に起因して、その状態を直接的に管理し制御することが困難な対象であるため、以下に示すような問題を有している。
【００１０】
即ち、ＭＩＮＣプロジェクトによる方法に用いられるマルチキャストは、現在運用されているインターネットでは実用的でないため柔軟性が低く、また、試験パケットによる片道特性の観測は、現在運用されているインターネットでは困難な場合がある。
【００１１】
以上の問題を解消するには、所要のボトルネック特定方法が、パス全体が木構造をとらないネットワークに対しても適用可能であることが条件となるが、以上のＭＩＮＣプロジェクトによる手法を木構造以外のネットワークに適用することはできず、現時点では、これを実際のＩＳＰ（Internet Service Provider ）に適用することも事実上困難である。
【００１２】
即ち、現状では、インターネット上に分散した複数のネットワーク要素に生じ得るボトルネックを直接的に特定することが困難であるため、そのボトルネックを他の計測データから統計的に推定可能な新たな技術が必要とされている。
【００１３】
ここにおいて、本発明の解決すべき主要な目的は、次のとおりである。
【００１４】
即ち、本発明の第１の目的は、サイト領域に存在する複数のネットワーク要素におけるボトルネックを効率良く特定することの可能なサイト領域内ボトルネック特定方法を提供せんとするものである。
【００１５】
本発明の第２の目的は、サイト領域に存在する１以上のサーバにおけるボトルネックを特に効率良く特定することの可能なサイト領域内ボトルネック特定方法を提供せんとするものである。
【００１６】
本発明の他の目的は、明細書、図面、特に特許請求の範囲の各請求項の記載から、自ずと明らかとなろう。
【００１７】
【課題を解決するための手段】
本発明方法においては、インターネットのエンド・トゥ・サーバにおけるサイト領域に設置された任意のコンピュータにおいて、各ネットワーク要素の処理能力の判定に供し得るログ情報を収集するログ情報収集処理と、各ネットワーク要素のログ情報に基づいて、当該各ネットワーク要素がボトルネックを構成するか否かをそれぞれ判定するボトルネック判定処理と、各ネットワーク要素中にボトルネックを構成する１以上のネットワーク要素が存在した場合に、当該ネットワーク要素の個体識別子を記載してなるボトルネックリストを出力するボトルネックリスト出力処理とを順次実行する、という特徴的構成手法を講じる。
【００１８】
さらに具体的詳細に述べると、当該課題の解決は、本発明が、以下に列挙する上位概念から下位概念に亙る新規な特徴的構成手法を採用することにより、前記目的を達成するよう為される。
【００１９】
即ち、本発明方法の第１の特徴は、インターネットのエンド・トゥ・サーバにおけるサイト領域に設置された任意のコンピュータにより、当該サイト領域に存在する複数のネットワーク要素がボトルネックを構成するか否かを特定するため、前記コンピュータにおいて、前記複数のネットワーク要素から、各ネットワーク要素の処理能力の判定に供し得るログ情報を所定時間に亙ってそれぞれ収集するログ情報収集処理と、その収集した前記各ネットワーク要素の前記ログ情報に基づいて、当該各ネットワーク要素が前記ボトルネックを構成するか否かをそれぞれ判定するボトルネック判定処理と、この判定の結果、前記各ネットワーク要素中に前記ボトルネックを構成する１以上のネットワーク要素が存在した場合に、当該ネットワーク要素の個体識別子を記載してなるボトルネックリストを出力するボトルネックリスト出力処理と、を順次実行するサイト領域内ボトルネック特定方法であって、前記コンピュータは、前記ボトルネック判定処理において、その収集した前記１以上のログ情報中において前記ボトルネックの判定基準とすべき単一のログ情報を当該コンピュータ内に予め対応設定された判定閾値と比較した結果、当該単一のログ情報が当該判定閾値の許容範囲外であった場合に、該当する特定のサーバの個体識別子を前記ボトルネックリストに記載するボトルネックサーバ記載処理、を経た後の当該ボトルネックリスト中に、指定数を超える当該特定のサーバの前記個体識別子が存在すると、前記ログ情報収集処理において前記所定時間内に収集した当該単一のログ情報の平均値を算出するログ情報平均値算出処理と、その算出した前記単一のログ情報の前記平均値を元に、前記特定のサーバの前記個体識別子を前記判定閾値との差異が大きいものから順に前記指定数だけ抽出するサーバ指定数抽出処理と、その指定数だけ抽出した前記特定のサーバの前記個体識別子により、前記ボトルネックサーバ記載処理を経た後の前記ボトルネックリストの内容を書き換えるボトルネックリスト書換処理と、を順次実行してなる、サイト領域内ボトルネック特定方法の構成採用にある。
【００２１】
本発明方法の第２の特徴は、上記本発明方法の第１の特徴における前記複数のネットワーク要素が、その１つの要素として、１以上からなるサーバを含んで構成され、前記ログ情報収集処理が、所要の前記ログ情報として、当該１以上のサーバにおける利用可能ＲＡＭメモリ残量、ＮＩＣ使用総帯域、ＣＰＵ使用率、ＨＤＤデータ読込み速度、接続待ちクライアント数、及び秒間累積発生エラー数のうち１以上を収集してなる、サイト領域内ボトルネック特定方法の構成採用にある。
【００２２】
本発明方法の第３の特徴は、上記本発明方法の第２の特徴における前記ログ情報収集処理が、前記１以上のサーバに関する前記１以上のログ情報の収集に際し、前記コンピュータ内に設定されたパフォーマンスモニタを用いてなる、サイト領域内ボトルネック特定方法の構成採用にある。
【００２３】
本発明方法の第４の特徴は、上記本発明方法の第２又は第３の特徴における前記コンピュータが、前記ログ情報収集処理において、前記１以上のサーバ中に前記１以上のログ情報を前記所定時間内に収集できないものが存在した場合、そのログ情報収集不能な特定のサーバに関する前記ボトルネック判定処理を実行することなく、その特定のサーバの個体識別子を前記ボトルネックリストに追記する故障候補サーバ追記処理、を実行してなる、サイト領域内ボトルネック特定方法の構成採用にある。
【００２４】
本発明方法の第５の特徴は、インターネットのエンド・トゥ・サーバにおけるサイト領域に設置された任意のコンピュータにより、当該サイト領域に存在する複数のネットワーク要素がボトルネックを構成するか否かを特定するため、前記コンピュータにおいて、前記複数のネットワーク要素から、各ネットワーク要素の処理能力の判定に供し得るログ情報を所定時間に亙ってそれぞれ収集するログ情報収集処理と、その収集した前記各ネットワーク要素の前記ログ情報に基づいて、当該各ネットワーク要素が前記ボトルネックを構成するか否かをそれぞれ判定するボトルネック判定処理と、この判定の結果、前記各ネットワーク要素中に前記ボトルネックを構成する１以上のネットワーク要素が存在した場合に、当該ネットワーク要素の個体識別子を記載してなるボトルネックリストを出力するボトルネックリスト出力処理と、を順次実行するサイト領域内ボトルネック特定方法であって、前記複数のネットワーク要素が、その１つの要素として、１以上からなるファイアウォールを含んで構成され、前記ログ情報収集処理が、所要の前記ログ情報として、当該１以上のファイアウォールにおけるパケットロス率を収集し、前記コンピュータが、前記ボトルネック判定処理において、その収集した前記パケットロス率を、当該コンピュータ内に前記各ネットワーク要素毎の前記ログ情報と対応して予め設定された判定閾値と比較した結果、当該パケットロス率が当該判定閾値の許容範囲外であった場合に、該当する特定のファイアウォールの個体識別子を前記ボトルネックリストに記載するボトルネックファイアウォール記載処理、を実行してなる、サイト領域内ボトルネック特定方法の構成採用にある。
【００２５】
本発明方法の第６の特徴は、上記本発明方法の第５の特徴における前記ログ情報収集処理が、前記１以上のファイアウォールに関する前記パケットロス率の収集に際し、前記コンピュータ内に設定された管理情報ベースを用いてなる、サイト領域内ボトルネック特定方法の構成採用にある。
【００２６】
本発明方法の第７の特徴は、インターネットのエンド・トゥ・サーバにおけるサイト領域に設置された任意のコンピュータにより、当該サイト領域に存在する複数のネットワーク要素がボトルネックを構成するか否かを特定するため、前記コンピュータにおいて、前記複数のネットワーク要素から、各ネットワーク要素の処理能力の判定に供し得るログ情報を所定時間に亙ってそれぞれ収集するログ情報収集処理と、その収集した前記各ネットワーク要素の前記ログ情報に基づいて、当該各ネットワーク要素が前記ボトルネックを構成するか否かをそれぞれ判定するボトルネック判定処理と、この判定の結果、前記各ネットワーク要素中に前記ボトルネックを構成する１以上のネットワーク要素が存在した場合に、当該ネットワーク要素の個体識別子を記載してなるボトルネックリストを出力するボトルネックリスト出力処理と、を順次実行するサイト領域内ボトルネック特定方法であって、前記複数のネットワーク要素が、その１つの要素として、１以上からなる負荷分散装置を含んで構成され、前記ログ情報収集処理が、所要の前記ログ情報として、当該１以上の負荷分散装置におけるパケットロス率を収集し、前記コンピュータが、前記ボトルネック判定処理において、その収集した前記パケットロス率を、当該コンピュータ内に前記各ネットワーク要素毎の前記ログ情報と対応して予め設定された判定閾値と比較した結果、当該パケットロス率が当該判定閾値の許容範囲外であった場合に、該当する特定の負荷分散装置の個体識別子を前記ボトルネックリストに記載するボトルネック負荷分散装置記載処理、を実行してなる、サイト領域内ボトルネック特定方法の構成採用にある。
【００２７】
本発明方法の第８の特徴は、上記本発明方法の第７の特徴における前記ログ情報収集処理が、前記１以上の負荷分散装置から取得したパケット送受信のカウンタ数に基づいて、所要の前記パケットロス率を計算する処理を伴いてなる、サイト領域内ボトルネック特定方法。
【００２８】
本発明方法の第９の特徴は、インターネットのエンド・トゥ・サーバにおけるサイト領域に設置された任意のコンピュータにより、当該サイト領域に存在する複数のネットワーク要素がボトルネックを構成するか否かを特定するため、前記コンピュータにおいて、前記複数のネットワーク要素から、各ネットワーク要素の処理能力の判定に供し得るログ情報を所定時間に亙ってそれぞれ収集するログ情報収集処理と、その収集した前記各ネットワーク要素の前記ログ情報に基づいて、当該各ネットワーク要素が前記ボトルネックを構成するか否かをそれぞれ判定するボトルネック判定処理と、この判定の結果、前記各ネットワーク要素中に前記ボトルネックを構成する１以上のネットワーク要素が存在した場合に、当該ネットワーク要素の個体識別子を記載してなるボトルネックリストを出力するボトルネックリスト出力処理と、を順次実行するサイト領域内ボトルネック特定方法であって、前記複数のネットワーク要素が、その１つの要素として、１以上からなるサーバＬＡＮを含んで構成され、前記ログ情報収集処理が、所要の前記ログ情報として、当該１以上のサーバＬＡＮにおけるパケットロス率を収集し、前記コンピュータが、前記ボトルネック判定処理において、その収集した前記パケットロス率を当該コンピュータ内に前記各ネットワーク要素毎の前記ログ情報と対応して予め設定された判定閾値と比較した結果、当該パケットロス率が当該判定閾値の許容範囲外であった場合に、該当する特定のサーバＬＡＮの個体識別子を前記ボトルネックリストに記載するボトルネックサーバＬＡＮ記載処理、を実行してなる、サイト領域内ボトルネック特定方法の構成採用にある。
【００２９】
本発明方法の第１０の特徴は、上記本発明方法の第９の特徴における前記ログ情報収集処理が、前記１以上のサーバＬＡＮから取得したパケット送受信のカウンタ数に基づいて、所要の前記パケットロス率を計算する処理を伴ってなる、サイト領域内ボトルネック特定方法の構成採用にある。
【００３１】
【発明の実施の形態】
以下、本発明の実施の形態につき、添付図面を参照しつつ、サイト領域に存在する複数のネットワーク要素におけるボトルネック特定を実現するための第１方法例と、同サイト領域に存在する１以上のサーバにおけるボトルネック特定を実現するための第２方法例とを順に挙げて説明する。
【００３２】
（第１方法例）
まず初めに、図１は、本発明の第１方法例に係るサイト領域内ボトルネック特定方法に適用されるサイト領域のシステム構成を示す図であり、図２は、図１に示したサイト領域内の各ネットワーク要素から収集されるログ情報の判定閾値を規定したボトルネック閾値テーブルを示す図である。
【００３３】
まず、図１に示すように、本第１方法例に係るサイト領域内ボトルネック特定方法は、サイト領域に存在する複数のネットワーク要素におけるボトルネック特定を実現するための前提として、インターネット１との接続が図られたサイト領域２ａ内に、３つのサーバＳ１，Ｓ２及びＳ３と、２つのファイアウォールＦ１及びＦ２と、２つの負荷分担装置Ｂ１及びＢ２と、２つのサーバＬＡＮ（ＬＡＮ：Local Area Network）：Ｐ１及びＰ２とを有するシステムに適用されるものとする。
【００３４】
一方、図２に示すように、ネットワーク要素をなすサーバＳ１，Ｓ２及びＳ３には、ログ情報判定項目として、判定閾値を１００ＭＢ／ｓ（メガバイト／秒．上限値）としたＮＩＣ使用総帯域（ＮＩＣ：Network Information Card）と、判定閾値を５０％（上限値）としたＣＰＵ使用率（ＣＰＵ：Central Processing Unit ）とが設定されているものとする。
【００３５】
また、ネットワーク要素をなすファイアウォールＦ１及びＦ２、負荷分担装置Ｂ１及びＢ２、並びにサーバＬＡＮ：Ｐ１及びＰ２には、ログ情報判定項目として、それぞれ、判定閾値を２．０％としたパケットロス率が設定されているものとする。
【００３６】
なお、以上のログ情報判定項目及び対応する判定閾値が設定されたボトルネック閾値テーブル３ａは、サイト領域２ａに設置された任意のコンピュータに保持されるようになっており、同コンピュータとしては、サイト領域２ａ内に存在する何れかのサーバ（Ｓ１，Ｓ２又はＳ３）を当てたり、或いは、これらサーバ（Ｓ１，Ｓ２又はＳ３）とは独立して、同サイト領域２ａ内に個別に設置することが可能である。
【００３７】
次に、図３は、本発明の第１方法例に係るサーバ領域内ボトルネック特定方法を説明するためのフローチャートである。
【００３８】
同図に示すように、この方法例に係るサーバ領域内ボトルネック特定方法は、上述したコンピュータが、まず、サーバＳ１，Ｓ２及びＳ３におけるボトルネックの特定（ＳＴ１）と、ファイアウォールＦ１及びＦ２におけるボトルネックの特定（ＳＴ２）と、負荷分担装置Ｂ１及びＢ２におけるボトルネックの特定（ＳＴ３）と、サーバＬＡＮ：Ｐ１及びＰ２におけるボトルネックの特定（ＳＴ４）とをそれぞれ実行することにより開始される（実行順序は問わず）。
【００３９】
以上の各ボトルネックの特定に際し、コンピュータは、各ネットワーク要素の処理能力の判定に供し得るログ情報として、サーバＳ１，Ｓ２及びＳ３からは、ＮＩＣ使用総帯域及びＣＰＵ使用率を所定時間に亙って収集し、ファイアウォールＦ１及びＦ２、負荷分担装置Ｂ１及びＢ２、並びにサーバＬＡＮ：Ｐ１及びＰ２からは、パケットロス率を所定時間に亙って収集する（ログ情報収集処理）。
【００４０】
なお、コンピュータが、サーバＳ１，Ｓ２及びＳ３からＮＩＣ使用総帯域及びＣＰＵ使用率を収集する際には、同コンピュータ内に設定されたパフォーマンスモニタ（ソフトウェア手段．図示せず）を利用することができ、ファイアウォールＦ１及びＦ２からパケットロス率を収集する際には、同コンピュータ内に設定された管理情報ベース（ＭＩＢ：Management Information Base ）を利用することができる。
【００４１】
また、同コンピュータが、負荷分担装置Ｂ１及びＢ２並びにサーバＬＡＮ：Ｐ１及びＰ２からパケットロス率を収集する際には、これら各ネットワーク要素から取得したパケット送受信のカウンタ数に基づいて、所要のパケットロス率を計算するようにするとよい。
【００４２】
次に、コンピュータは、図４（ａ）〜（ｄ）に例示するように、各ネットワーク要素毎に得られるログ情報の観測値を、自身が保持するログ情報観測値テーブル４ａに書き出し、当該ログ情報観測値と、これらに対応して予め設定されたボトルネック閾値テーブル３ａにおける判定閾値とを比較して、それら各ネットワーク要素がボトルネックを構成するか否かを判定する（ボトルネック判定処理）。なお、図中に示す「○」は、該当するネットワーク要素がボトルネックを構成しないと判定されたことを意味する（後に示す「●」は、該当するネットワーク要素がボトルネックを構成すると判定されたことを意味する。以下同じ）。
【００４３】
このとき、コンピュータは、先に収集したＮＩＣ使用総帯域及びＣＰＵ使用率のうちボトルネックの判定基準とすべき単一のログ情報（詳細は第２方法例にて述べる）を、対応する上記判定閾値と比較した結果、その単一のログ情報が判定閾値の許容範囲外であった場合に、該当するサーバの個体識別子（Ｓ１，Ｓ２又はＳ３）をボトルネックリスト（図示せず）に記載するようにする（ボトルネックサーバ記載処理）。
【００４４】
また、同コンピュータは、収集したパケットロス率を、対応する上記判定閾値と比較した結果、それが判定閾値の許容範囲外であった場合に、該当する特定のファイアウォールの個体識別子（Ｆ１又はＦ２）、負荷分担装置の個体識別子（Ｂ１又はＢ２）、及びサーバＬＡＮの個体識別子（Ｐ１又はＰ２）をボトルネックリストに記載するようにする（ボトルネックファイアウォール記載処理、ボトルネック負荷分担装置記載処理、及びボトルネックサーバＬＡＮ記載処理）。
【００４５】
次に、コンピュータは、上記ボトルネックリストが空（φ）であるか否かを判別する（ＳＴ５）。以上の例示では、全てのネットワーク要素がボトルネックを構成しないと判定されており、その結果、ボトルネックリストが空であるため（ＳＴ５；ＹＥＳ）、同コンピュータは、今回の観測では、特定すべきボトルネックがサイト領域２ａ内に全く存在しなかったものと判断する（ＳＴ６）。
【００４６】
これに対し、図５（ａ）〜（ｄ）に例示するように、ログ情報観測値テーブル４ｂに書き出された各ネットワーク要素毎のログ情報観測値のうち、サーバＳ２のＣＰＵ使用率とファイアウォールＦ１のパケットロス率とが、共に判定閾値の許容範囲外であり、これら各ネットワーク要素がボトルネックを構成すると判定された場合、コンピュータは、該当する個体識別子Ｓ２及びＦ１をボトルネックリストに記載する。
【００４７】
そして、コンピュータは、上記ボトルネックリストが空であるか否かを判別するが、今回は、当該ボトルネックリストが空ではないため（ＳＴ５；ＮＯ）、ボトルネックが、サイト領域２ａ内のサーバＳ２及びファイアウォールＦ１に存在していると判断し、そのボトルネックリストを外部に出力する（ＳＴ７）。
【００４８】
（第２方法例）
続いて、図６は、本発明の第２方法例に係るサイト領域内ボトルネック特定方法に適用されるサイト領域の部分システム構成を示す図であり、図７は、本発明の第２方法例に係るサーバ領域内ボトルネック特定方法を説明するためのフローチャートである。
【００４９】
まず、図６に示すように、本第２方法例に係るサイト領域内ボトルネック特定方法は、サイト領域に存在する１以上のサーバにおけるボトルネック特定を実現するための前提として、サイト領域２ｂ内に、５つのサーバＳ１，Ｓ２，Ｓ３，Ｓ４及びＳ５を有するシステムに適用されるものとする（ファイアウォール、負荷分担装置、及びサーバＬＡＮの数及び有無は問わない）。
【００５０】
また、上記サーバＳ１，Ｓ２，Ｓ３，Ｓ４及びＳ５に関するログ情報判定項目としては、第１方法例における場合と同様、判定閾値を１００ＭＢ／ｓ（上限値）としたＮＩＣ使用総帯域と、判定閾値を５０％（上限値）としたＣＰＵ使用率とが設定されており、これらＮＩＣ使用総帯域及びＣＰＵ使用率のうち、ボトルネックの判定基準、即ち、優先的に判定すべき単一のログ情報として、システム保守者（図示せず）により「ＣＰＵ使用率」が選択されているものとする。
【００５１】
そして、図７に示すように、この方法例に係るサーバ領域内ボトルネック特定方法は、サイト領域２ｂ内のコンピュータが、まず、サーバＳ１，Ｓ２，Ｓ３，Ｓ４及びＳ５におけるログ情報を、所定時間Ｔに亙り間隔ｔでｘ回収集するためのタイムスライスτ（１≦τ≦ｘ，ｘ＝Ｔ／ｔ）に「１」をセットすると共に（ＳＴ１１）、サーバの個体識別子をＳｉ（１≦ｉ≦ｎ，ｎ＝５）としたときのカウンタに「１」をセットし（ＳＴ１２）、さらに、該当するＳ１のログ情報の収集（ＳＴ１３）を実行することにより開始される。なお、以降の説明では、簡単のため、各サーバＳ１，Ｓ２，Ｓ３，Ｓ４及びＳ５に関するログ情報の収集回数ｘを「ｘ＝３」とする。
【００５２】
次に、コンピュータは、サーバＳ１に関するログ情報が正常に収集できたか否かを判別し（ＳＴ１４）、当該ログ情報が正常に収集できている場合には（ＳＴ１４；ＹＥＳ）、タイムスライスτ＝１におけるサーバＳ１内のボトルネックを判定する（ＳＴ１５）。
【００５３】
次に、コンピュータは、カウンタｉを「１」インクリメントし（ＳＴ１６）、上述したタイムスライスτ＝１におけるＳＴ１３以降の処理を、サーバＳ２，Ｓ３，Ｓ４及びＳ５についても同様に実行し（ＳＴ１７：ＮＯ）、そのカウンタｉの値が規定値の「５（＝ｎ）」を上回った時点で（ＳＴ１７；ＹＥＳ）、タイムスライスτを「１」インクリメントする（ＳＴ１８）。
【００５４】
そして、コンピュータは、今度は、タイムスライスτ＝２におけるＳＴ１２以降の処理を、全てのサーバＳ１，Ｓ２，Ｓ３，Ｓ４及びＳ５についても同様に実行し（ＳＴ１９；ＮＯ）、以下、この繰り返し処理を、そのタイムスライスτの値が規定値の「３（＝ｘ）」を上回るまで実行する（ＳＴ１９；ＹＥＳ）。
【００５５】
以上の処理の結果、コンピュータ内に、図８に示すようなログ情報観測値テーブル４ｃが得られたとする。このとき、全てのタイムスライスτ＝１，２，３において所要のログ情報が正常に収集できており、しかも、全てのサーバＳ１，Ｓ２，Ｓ３，Ｓ４及びＳ５に関し、判定閾値を超過したログ情報が全く存在していないため、コンピュータは、今回の観測では、特定すべきボトルネックが、何れのサーバＳ１，Ｓ２，Ｓ３，Ｓ４及びＳ５にも存在しなかったものと判断する（ボトルネックリストには何も記載しない）。
【００５６】
これに対し、コンピュータ内に、図９に示すようなログ情報観測値テーブル４ｄが得られた場合、同コンピュータは、サーバＳ１における所要のログ情報（ＮＩＣ使用総帯域及びＣＰＵ使用率）が、全てのタイムスライスτ＝１，２，３の何れにおいても正常に収集できなかったとして（ＳＴ１４；ＮＯ）、該当する個体識別子Ｓ１を、自身に保持される故障候補リスト（図示せず）に記載する（ＳＴ２０）。但し、この故障候補リストへの個体識別子の記載は、図７のフローチャートからも明らかなように、実際には、前述した処理の過程において（所要のログ情報を正常に収集できなかったことが判明した時点で）随時実行される。
【００５７】
次に、コンピュータは、上記故障候補リストに、同一の個体識別子により特定されるサーバ（Ｓ１）がｘ個（３個）あるか否かを判別するが（ＳＴ２１）、図９のログ情報観測値テーブル４ｄによれば、サーバＳ１における所要のログ情報が全てのタイムスライスτ＝１，２，３において正常に収集されておらず、その度に上述のＳＴ２０の処理が実行され、当該故障候補リストには同一のサーバ（個体識別子Ｓ１）が３個存在することになるため（ＳＴ２１；ＹＥＳ）、同コンピュータは、その故障候補リストの内容（個体識別子Ｓ１）をボトルネックリストに追記する（ＳＴ２２．故障候補サーバ追記処理）。
【００５８】
なお、図９のログ情報観測値テーブル４ｄに示されるサーバＳ１以外の残りのサーバＳ２，Ｓ３，Ｓ４及びＳ５に関しては、判定閾値を超過したログ情報が存在していないため、コンピュータは、特定すべきボトルネックが、これら残りのサーバＳ２，Ｓ３，Ｓ４及びＳ５には存在しなかったものと判断する（これら残りのサーバＳ２，Ｓ３，Ｓ４及びＳ５に関しては、ボトルネックリストには何も記載しない）。
【００５９】
次に、コンピュータは、上記故障候補リストに、同一の個体識別子により特定されるサーバがｘ個存在しない場合には（ＳＴ２１；ＮＯ）、特定すべきボトルネックが、何れのサーバＳ１，Ｓ２，Ｓ３，Ｓ４及びＳ５にも存在していないとして、今度は、ボトルネックリストに、ｋ個を超えるサーバ（個体識別子）があるか否かを判別する（ＳＴ２３）。
【００６０】
なお、以上に示す「ｋ」なる値は、システム保守者により指定される数（指定数）であり、上記ボトルネックリスト中に、仮にこの指定数を超えるサーバ（個体識別子）が存在している場合に、障害が重度に及んでいるボトルネックのみを最終的なボトルネックリストへ出力させて、軽度のボトルネックに関する不要な出力を排除するためのものである。なお、以降の説明では、簡単のため、この指定数ｋを「ｋ＝３」とする。
【００６１】
ここで、システム保守者によりボトルネックの判定基準として事前に選択された「ＣＰＵ使用率」に基づき、サーバＳ１，Ｓ２，Ｓ３及びＳ４がボトルネックと判定され、それらの個体識別子Ｓ１，Ｓ２，Ｓ３及びＳ４がボトルネックリストに記載されているものとする。
【００６２】
そして、これに伴い、コンピュータ内に、図１０に示すようなログ情報観測値テーブル４ｅが得られており、サーバＳ１，Ｓ２及びＳ３に関するＣＰＵ使用率が、全てのタイムスライスτ＝１，２，３において判定閾値を超過し、かつ、サーバＳ４に関するＣＰＵ使用率が、タイムスライスτ＝３において判定閾値を超過しているものとする（超過項目を下線により示す。以下同じ）。
【００６３】
このとき、コンピュータは、ボトルネックリストにｋ個（３個）を超える４個のサーバの個体識別子Ｓ１，Ｓ２，Ｓ３及びＳ４が存在するとして（ＳＴ２３；ＹＥＳ）、そのボトルネックリストに記載のサーバＳ１，Ｓ２，Ｓ３及びＳ４に関する各ログ情報、即ち、各ＣＰＵ使用率の平均値を算出する（ＳＴ２４．ログ情報平均値算出処理）。
【００６４】
この結果、コンピュータは、図１１に示すような新たなログ情報観測値テーブル４ｆを得て、これら各ＣＰＵ使用率（各ログ情報）の平均値を元に、各サーバの個体識別子Ｓ１，Ｓ２，Ｓ３及びＳ４を判定閾値（５０％）との差異が大きい順にソートし（Ｓ２（＝９０）＞Ｓ１（＝７０）＞Ｓ３（＝６０）＞Ｓ４（＝４０））、上位３個（ｋ個）のサーバに関する個体識別子、即ち、Ｓ２，Ｓ１及びＳ３を抽出する（ＳＴ２５．サーバ指定数抽出処理）。
【００６５】
なお、図１０のログ情報観測値テーブル４ｅに示される全てのサーバＳ１，Ｓ２，Ｓ３，Ｓ４及びＳ５には、ＮＩＣ使用総帯域に関して判定閾値を超過したログ情報が存在していないため、コンピュータは、同ＮＩＣ使用総帯域につき特定すべきボトルネックが、何れのサーバＳ１，Ｓ２，Ｓ３，Ｓ４及びＳ５にも存在しなかったものと判断する（ボトルネックリストには何も記載しない）。
【００６６】
そして以下、コンピュータは、上述したＳＴ２５の処理で抽出した上位３個のサーバに関する個体識別子Ｓ２，Ｓ１及びＳ３を、最終的なボトルネックリストに出力すると共に、前述したＳＴ２０〜ＳＴ２２の処理において、ボトルネックリストに故障候補リストの内容が追記された場合には、その内容により示されるサーバの個体識別子（図９の例では「Ｓ１」）を、故障のサーバを示すものとして、最終的なボトルネックリストに出力する（ＳＴ２６）。
【００６７】
これに対し、コンピュータ内に、図１２に示すようなログ情報観測値テーブル４ｇが得られており、サーバＳ１に関するＣＰＵ使用率のみが、タイムスライスτ＝３において判定閾値を超過している場合、同コンピュータは、ボトルネックリストに、ｋ個（３個）を超えない１個のサーバの個体識別子Ｓ１のみが存在するとして（ＳＴ２３；ＮＯ）、上述したＳＴ２４及びＳＴ２５の処理を実行することなく、該当する個体識別子Ｓ１を、ボトルネックを構成するサーバとして、最終的なボトルネックリストに出力する。
【００６８】
なお、以上の第２方法例の説明では、サーバＳ１，Ｓ２，Ｓ３，Ｓ４及びＳ５に関するログ情報判定項目として、ＮＩＣ使用総帯域及びＣＰＵ使用率を挙げたが、この他にも、例えば、利用可能ＲＡＭメモリ残量（ＲＡＭ：Random Access Memory）、ＨＤＤデータ読込み速度（ＨＤＤ：Hard Disk Drive ）、接続待ちクライアント数、秒間累積発生エラー数などを併せて適用することが可能であり、これら各ログ情報判定項目は、何れも、コンピュータ内に設定された前述のパフォーマンスモニタにより収集することが可能である。
【００６９】
最後に、以上の第２方法例で説明したサーバ内ボトルネック判定処理（図７のフローチャートにおけるＳＴ１５の処理）につき、上述した多数のログ情報判定項目を適用した場合の具体例を挙げて説明する。
【００７０】
図１３は、図６に示したサイト領域２ｂ内の各サーバＳ１，Ｓ２，Ｓ３，Ｓ４及びＳ５から収集される各種ログ情報の判定閾値を規定したボトルネック閾値テーブルを示す図であり、図１４は、図７に示したサーバ内ボトルネック判定処理の詳細を説明するためのフローチャートである。
【００７１】
まず、図１３に示すように、コンピュータ内に設定されたボトルネック閾値テーブル３ｂには、サイト領域２ｂ内の各サーバＳ１，Ｓ２，Ｓ３，Ｓ４及びＳ５のログ情報判定項目として、判定閾値を１００ＭＢ（下限値）とした利用可能ＲＡＭメモリ残量と、判定閾値を１０ＭＢ／ｓ（上限値）としたＮＩＣ使用総帯域と、判定閾値を５０％（上限値）としたＣＰＵ使用率と、判定閾値を５０ＭＢ／ｓ（上限値）としたＨＤＤデータ読込み速度と、判定閾値を１（上限値）とした接続待ちクライアント数と、判定閾値を１個／ｓ（上限値）とした秒間累積発生エラー数とが設定されているものとする。
【００７２】
また、サーバＳ１に関するログ情報として、利用可能ＲＡＭメモリ残量：２００ＭＢ、ＮＩＣ使用総帯域：５ＭＢ／ｓ、ＣＰＵ使用率：３０％、ＨＤＤデータ読込み速度：３０ＭＢ／ｓ、接続待ちクライアント数：０、秒間累積発生エラー数：０個／ｓがそれぞれ収集され、サーバＳ２に関するログ情報として、利用可能ＲＡＭメモリ残量：５０ＭＢ、ＮＩＣ使用総帯域：５ＭＢ／ｓ、ＣＰＵ使用率：３０％、ＨＤＤデータ読込み速度：３０ＭＢ／ｓ、接続待ちクライアント数：０、秒間累積発生エラー数：０個／ｓがそれぞれ収集されたものとする。
【００７３】
ここで、まず、サーバＳ１のログ情報に基づくサーバ内ボトルネック判定処理に際しては、図１４に示すように、利用可能ＲＡＭメモリ残量が閾値未満（ＳＴ３１；ＮＯ）であり、ＮＩＣ使用総帯域が閾値以上（ＳＴ３２；ＮＯ）であり、ＣＰＵ使用率が閾値以上（ＳＴ３３；ＮＯ）であり、ＨＤＤデータ読込み速度が閾値以上（ＳＴ３４；ＮＯ）であり、接続待ちクライアント数が閾値以上（ＳＴ３５；ＮＯ）であり、秒間累積発生エラー数が閾値以上（ＳＴ３６；ＮＯ）であるため、コンピュータは、ボトルネックリストにサーバＳ１（個体識別子）を記載しない（ＳＴ３７）。
【００７４】
これに対し、サーバＳ２のログ情報に基づくサーバ内ボトルネック判定処理に際しては、同図に示すように、利用可能ＲＡＭメモリ残量が閾値以上（ＳＴ３１；ＹＥＳ）であるため（図１３参照）、コンピュータは、ボトルネックリストにサーバＳ２（個体識別子）を記載するようにする（ＳＴ３８）。
【００７５】
以上、本発明の実施の形態につき、第１及び第２方法例を挙げて説明したが、本発明は、必ずしも上述した手法にのみ限定されるものではなく、本発明にいう目的を達成し、後述の効果を有する範囲内において、適宜、変更実施することが可能なものである。
【００７６】
【発明の効果】
以上、詳細に説明したように、本発明によれば、サイト領域に存在する複数のネットワーク要素の性能低下や障害などの故障を、観測値を元にいち早く把握するようにしたことから、それら複数のネットワーク要素におけるボトルネック、特に、１以上のサーバにおけるボトルネックを、極めて効率良く特定することが可能になる。
【図面の簡単な説明】
【図１】本発明の第１方法例に係るサイト領域内ボトルネック特定方法に適用されるサイト領域のシステム構成を示す図である。
【図２】図１に示したサイト領域内の各ネットワーク要素から収集されるログ情報の判定閾値を規定したボトルネック閾値テーブルを示す図である。
【図３】本発明の第１方法例に係るサーバ領域内ボトルネック特定方法を説明するためのフローチャートである。
【図４】本発明の第１方法例において適用されるログ情報観測値テーブルの一例を示す図である。
【図５】本発明の第１方法例において適用されるログ情報観測値テーブルの他の例を示す図である。
【図６】本発明の第２方法例に係るサイト領域内ボトルネック特定方法に適用されるサイト領域の部分システム構成を示す図である。
【図７】本発明の第２方法例に係るサーバ領域内ボトルネック特定方法を説明するためのフローチャートである。
【図８】本発明の第２方法例において適用されるログ情報観測値テーブルの一例を示す図である。
【図９】本発明の第２方法例において適用されるログ情報観測値テーブルの他の例を示す図である。
【図１０】本発明の第２方法例において適用されるログ情報観測値テーブルのさらに他の例を示す図である。
【図１１】図１０に示したログ情報観測値テーブルからログ情報の平均値を算出して得た新たなログ情報観測値テーブルを示す図である。
【図１２】本発明の第２方法例において適用されるログ情報観測値テーブルのさらにまた他の例を示す図である。
【図１３】図６に示したサイト領域内の各サーバ収集される各種ログ情報の判定閾値を規定したボトルネック閾値テーブルを示す図である。
【図１４】図７に示したサーバ内ボトルネック判定処理の詳細を説明するためのフローチャートである。
【符号の説明】
１…インターネット
２ａ，２ｂ…サイト領域
３ａ，３ｂ…ボトルネック閾値テーブル
４ａ〜４ｇ…ログ情報観測値テーブル
Ｓ１〜Ｓ５…サーバ
Ｆ１，Ｆ２…ファイアウォール
Ｂ１，Ｂ２…負荷分散装置
Ｐ１，Ｐ２…サーバＬＡＮ
[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to a method for identifying bottlenecks in a site area. More specifically, it relates to a method by which an arbitrary computer installed in a site area in the end-to-server portion of the Internet identifies whether or not each of a plurality of network elements existing in that site area constitutes a bottleneck.
[0002]
[Prior art]
In recent years, interest in network quality such as performance, economy, and reliability on the Internet has increased, and the importance of techniques for measuring the characteristics of wide area networks has increased.
[0003]
For example, for an MSP (Managed Service Provider) to efficiently run a quality management service business covering the applications, networks, and servers that provide mass-market distribution services such as video, it is essential to monitor the target network for quality degradation and to detect and analyze that degradation.
[0004]
For this reason, when it becomes clear, for example through reports from end users, that network quality has degraded while such a service is being provided, the service provider is responsible for quickly and accurately identifying the location on the target network that is causing the degradation, that is, the bottleneck, and for dealing with it.
[0005]
Conventionally, a known method of identifying an end-to-server bottleneck on a target network is to remotely inspect, one by one, all the servers existing on the target network to find which of them is the bottleneck.
[0006]
However, this method naturally becomes less applicable as the target network grows, and it also makes it difficult to judge comprehensively where the bottlenecks in the target network lie.
[0007]
On the other hand, a method from the MINC project (MINC: Multicast-based Inference of Network-internal Characteristics) is known as a way of identifying end-to-end bottlenecks on a target network.
[0008]
The MINC project method sends test packets by multicast from one source to many destinations on the target network, derives the characteristics along the paths from the resulting end-to-end observation data, and estimates packet loss and delay inside the target network.
[0009]
[Problems to be solved by the invention]
However, although the MINC project method described above is theoretically highly accurate owing to the properties of the tree structure that arises when multicast is used, it has the following problems, because the Internet, owing to its wide-area nature and its distributed administration, is difficult to manage and control directly.
[0010]
That is, the multicast used by the MINC project method is impractical on the Internet as currently operated and therefore offers little flexibility, and observing one-way characteristics with test packets can be difficult on the Internet as currently operated.
[0011]
Solving these problems requires a bottleneck identification method that can also be applied to networks whose paths as a whole do not form a tree structure. The MINC project technique, however, cannot be applied to networks other than tree-structured ones, and at present it is also practically difficult to apply it to an actual ISP (Internet Service Provider).
[0012]
That is, because it is currently difficult to directly identify bottlenecks that can arise in network elements distributed across the Internet, a new technique is needed that can estimate such bottlenecks statistically from other measurement data.
[0013]
The main objects of the present invention are as follows.
[0014]
That is, a first object of the present invention is to provide a method for identifying bottlenecks in a site area that can efficiently identify bottlenecks in a plurality of network elements existing in the site area.
[0015]
The second object of the present invention is to provide a bottleneck identification method in a site area that can particularly efficiently identify a bottleneck in one or more servers existing in the site area.
[0016]
Other objects of the present invention will become apparent from the specification, drawings, and particularly the description of each claim.
[0017]
[Means for Solving the Problems]
In the method of the present invention, an arbitrary computer installed in a site area in the end-to-server portion of the Internet sequentially executes three processes: a log information collection process that collects log information usable for judging the processing capability of each network element; a bottleneck determination process that determines, based on the log information of each network element, whether or not that network element constitutes a bottleneck; and a bottleneck list output process that, when one or more of the network elements constitute a bottleneck, outputs a bottleneck list containing the individual identifiers of those network elements.
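The three processes named in this paragraph can be pictured as a short pipeline. The following Python sketch is only an illustration under stated assumptions: the function name `identify_bottlenecks`, the `collect` callback, the single upper limit per element, and the sample values are all hypothetical and do not come from the patent text.

```python
from typing import Callable, Dict, List


def identify_bottlenecks(
    element_ids: List[str],
    collect: Callable[[str], float],
    upper_limits: Dict[str, float],
) -> List[str]:
    """Run collection, determination, and output in order."""
    # 1. Log information collection process (one value per element here).
    observed = {eid: collect(eid) for eid in element_ids}
    # 2. Bottleneck determination process.
    #    Assumption: a value at or above its upper limit is out of range.
    bottleneck_list = [
        eid for eid in element_ids if observed[eid] >= upper_limits[eid]
    ]
    # 3. Bottleneck list output process: emit the list only if non-empty.
    if bottleneck_list:
        print("bottleneck list:", ", ".join(bottleneck_list))
    return bottleneck_list


# Illustrative values only: CPU usage (%) for servers, loss rate (%) for firewalls.
sample = {"S1": 30.0, "S2": 65.0, "F1": 2.5, "F2": 0.4}
limits = {"S1": 50.0, "S2": 50.0, "F1": 2.0, "F2": 2.0}
result = identify_bottlenecks(list(sample), lambda eid: sample[eid], limits)
```

With these made-up observations the sketch reports S2 and F1, which mirrors the case described in paragraph [0046], where server S2 and firewall F1 are entered in the bottleneck list.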
[0018]
More specifically, the present invention solves the problem and achieves the above objects by adopting the novel characteristic configurations listed below, which range from broader concepts to narrower ones.
[0019]
That is, a first feature of the method of the present invention is a method for identifying bottlenecks in a site area in which an arbitrary computer installed in a site area in the end-to-server portion of the Internet, in order to identify whether or not a plurality of network elements existing in the site area constitute bottlenecks, sequentially executes: a log information collection process of collecting from the plurality of network elements, over a predetermined time, log information usable for judging the processing capability of each network element; a bottleneck determination process of determining, based on the collected log information of each network element, whether or not that network element constitutes a bottleneck; and a bottleneck list output process of outputting, when the determination finds one or more network elements constituting a bottleneck, a bottleneck list containing the individual identifiers of those network elements. In the bottleneck determination process, the computer performs a bottleneck server entry process: it compares the single item of log information that is to serve as the bottleneck criterion, among the one or more items of collected log information, with the determination threshold set for it in advance in the computer and, if that single item is outside the allowable range of the threshold, enters the individual identifier of the corresponding server in the bottleneck list. If, after this process, the bottleneck list contains more than a specified number of such server identifiers, the computer sequentially executes: a log information average calculation process of calculating the average of the single item of log information collected within the predetermined time in the log information collection process; a server specified-number extraction process of extracting, based on the calculated averages, the specified number of server identifiers in descending order of their difference from the determination threshold; and a bottleneck list rewriting process of rewriting the contents of the bottleneck list produced by the bottleneck server entry process with the extracted server identifiers.
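The averaging, extraction, and rewriting steps of the first feature can be sketched in a few lines of Python. This is a minimal sketch, not the patented implementation: `trim_bottleneck_list` is a hypothetical name, the per-slice CPU values are invented so that their averages match the figures quoted in paragraph [0064] (S2 = 90, S1 = 70, S3 = 60, S4 = 40, threshold 50%, specified number k = 3), and "difference from the threshold" is assumed to mean the signed excess over an upper limit, which reproduces the ordering given there.

```python
from statistics import mean
from typing import Dict, List


def trim_bottleneck_list(
    bottleneck_list: List[str],
    samples: Dict[str, List[float]],
    upper_limit: float,
    k: int,
) -> List[str]:
    """Keep at most k servers, ordered by how far their average exceeds the limit."""
    if len(bottleneck_list) <= k:
        return list(bottleneck_list)  # nothing to trim
    # Log information average calculation process.
    averages = {sid: mean(samples[sid]) for sid in bottleneck_list}
    # Server specified-number extraction process (assumption: signed excess).
    ranked = sorted(
        bottleneck_list,
        key=lambda sid: averages[sid] - upper_limit,
        reverse=True,
    )
    # Bottleneck list rewriting process.
    return ranked[:k]


# Hypothetical CPU usage samples (%) for three time slices per server.
cpu_samples = {
    "S1": [60.0, 70.0, 80.0],
    "S2": [85.0, 90.0, 95.0],
    "S3": [55.0, 60.0, 65.0],
    "S4": [30.0, 30.0, 60.0],
}
final_list = trim_bottleneck_list(
    ["S1", "S2", "S3", "S4"], cpu_samples, upper_limit=50.0, k=3
)
```

Sorting on the signed excess rather than the absolute difference matters here: S3 and S4 are both 10 points away from the threshold, but only S3 is above it on average.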
[0021]
A second feature of the method of the present invention is the method of the first feature in which the plurality of network elements include, as one kind of element, one or more servers, and the log information collection process collects, as the required log information, one or more of the following for the one or more servers: remaining available RAM, total NIC bandwidth in use, CPU usage rate, HDD data read speed, number of clients waiting for connection, and cumulative number of errors per second.
[0022]
A third feature of the method of the present invention is the method of the second feature in which the log information collection process uses a performance monitor set up in the computer when collecting the one or more items of log information on the one or more servers.
[0023]
A fourth feature of the method of the present invention is the method of the second or third feature in which, when the log information collection process finds among the one or more servers a server whose one or more items of log information cannot be collected within the predetermined time, the computer executes a failure candidate server appending process of appending the individual identifier of that server to the bottleneck list without executing the bottleneck determination process for it.
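One way to picture the failure candidate handling is a collection loop over time slices that records every failed read. The sketch below is an assumption-laden illustration: `collect_over_slices` and `fake_reader` are hypothetical names, a failed collection is modelled as `None`, and a server is treated as a failure candidate only when it fails in all x time slices, following the walkthrough in paragraphs [0056] and [0057].

```python
from typing import Callable, Dict, List, Optional, Tuple

Reader = Callable[[str, int], Optional[float]]


def collect_over_slices(
    server_ids: List[str], read_metric: Reader, x: int
) -> Tuple[Dict[str, List[float]], List[str]]:
    """Collect x samples per server and report servers that never answered."""
    samples: Dict[str, List[float]] = {sid: [] for sid in server_ids}
    failure_candidates: List[str] = []  # one entry per failed collection
    for tau in range(1, x + 1):      # time slices 1..x
        for sid in server_ids:       # servers S1..Sn
            value = read_metric(sid, tau)
            if value is None:
                failure_candidates.append(sid)
            else:
                samples[sid].append(value)
    # A server that appears x times failed in every time slice.
    failed = [sid for sid in server_ids if failure_candidates.count(sid) == x]
    return samples, failed


def fake_reader(sid: str, tau: int) -> Optional[float]:
    """Hypothetical data source: S1 never responds, the others report 40%."""
    return None if sid == "S1" else 40.0


samples, failed = collect_over_slices(["S1", "S2", "S3"], fake_reader, x=3)
bottleneck_list: List[str] = []
bottleneck_list.extend(failed)  # appended without any threshold comparison
```

The servers returned in `failed` skip the threshold comparison entirely, which is the point of the fourth feature: an unreachable server is reported as a likely fault rather than silently passing the check.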
[0024]
A fifth feature of the method of the present invention is a method for identifying bottlenecks in a site area in which an arbitrary computer installed in a site area in the end-to-server portion of the Internet, in order to identify whether or not a plurality of network elements existing in the site area constitute bottlenecks, sequentially executes: a log information collection process of collecting from the plurality of network elements, over a predetermined time, log information usable for judging the processing capability of each network element; a bottleneck determination process of determining, based on the collected log information of each network element, whether or not that network element constitutes a bottleneck; and a bottleneck list output process of outputting, when the determination finds one or more network elements constituting a bottleneck, a bottleneck list containing the individual identifiers of those network elements. The plurality of network elements include, as one kind of element, one or more firewalls, and the log information collection process collects, as the required log information, the packet loss rate at the one or more firewalls. In the bottleneck determination process, the computer executes a bottleneck firewall entry process: it compares the collected packet loss rate with the determination threshold set in advance in the computer for the log information of each network element and, if the packet loss rate is outside the allowable range of the threshold, enters the individual identifier of the corresponding firewall in the bottleneck list.
[0025]
A sixth feature of the method of the present invention is the method of the fifth feature in which the log information collection process uses a management information base set up in the computer when collecting the packet loss rate of the one or more firewalls.
[0026]
A seventh feature of the method of the present invention is a method for identifying bottlenecks in a site area in which an arbitrary computer installed in a site area in the end-to-server portion of the Internet, in order to identify whether or not a plurality of network elements existing in the site area constitute bottlenecks, sequentially executes: a log information collection process of collecting from the plurality of network elements, over a predetermined time, log information usable for judging the processing capability of each network element; a bottleneck determination process of determining, based on the collected log information of each network element, whether or not that network element constitutes a bottleneck; and a bottleneck list output process of outputting, when the determination finds one or more network elements constituting a bottleneck, a bottleneck list containing the individual identifiers of those network elements. The plurality of network elements include, as one kind of element, one or more load balancers, and the log information collection process collects, as the required log information, the packet loss rate at the one or more load balancers. In the bottleneck determination process, the computer executes a bottleneck load balancer entry process: it compares the collected packet loss rate with the determination threshold set in advance in the computer for the log information of each network element and, if the packet loss rate is outside the allowable range of the threshold, enters the individual identifier of the corresponding load balancer in the bottleneck list.
[0027]
An eighth feature of the method of the present invention is the method of the seventh feature in which the log information collection process includes a process of calculating the required packet loss rate from the packet transmit and receive counter values acquired from the one or more load balancers.
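The description does not give the formula used to turn counter values into a packet loss rate. As a rough sketch only, one plausible reading is the share of packets that entered the element during the collection period but did not leave it; the function name `packet_loss_rate` and the parameter names are hypothetical, and the formula itself is an assumption.

```python
def packet_loss_rate(packets_in: int, packets_out: int) -> float:
    """Return the percentage of packets received but not forwarded.

    Assumption: both arguments are counter increases over the same
    collection period, not raw cumulative counter values.
    """
    if packets_in <= 0:
        return 0.0  # no traffic observed, so nothing can have been lost
    lost = max(packets_in - packets_out, 0)
    return 100.0 * lost / packets_in


# Illustrative check against the 2.0% threshold mentioned in paragraph [0035].
rate = packet_loss_rate(packets_in=10_000, packets_out=9_750)
is_bottleneck = rate > 2.0
```

Because device counters are normally cumulative, a real collector would subtract the reading at the start of the period from the reading at the end before calling a function like this.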
[0028]
A ninth feature of the method of the present invention is a method for identifying bottlenecks in a site area in which an arbitrary computer installed in a site area in the end-to-server portion of the Internet, in order to identify whether or not a plurality of network elements existing in the site area constitute bottlenecks, sequentially executes: a log information collection process of collecting from the plurality of network elements, over a predetermined time, log information usable for judging the processing capability of each network element; a bottleneck determination process of determining, based on the collected log information of each network element, whether or not that network element constitutes a bottleneck; and a bottleneck list output process of outputting, when the determination finds one or more network elements constituting a bottleneck, a bottleneck list containing the individual identifiers of those network elements. The plurality of network elements include, as one kind of element, one or more server LANs, and the log information collection process collects, as the required log information, the packet loss rate on the one or more server LANs. In the bottleneck determination process, the computer executes a bottleneck server LAN entry process: it compares the collected packet loss rate with the determination threshold set in advance in the computer for the log information of each network element and, if the packet loss rate is outside the allowable range of the threshold, enters the individual identifier of the corresponding server LAN in the bottleneck list.
[0029]
According to a tenth feature of the method of the present invention, the log information collection process of the ninth feature includes a process of calculating the required packet loss rate based on the packet transmission/reception counter values acquired from the one or more server LANs.
[0031]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, as embodiments of the present invention, a first method example for identifying bottlenecks in a plurality of network elements existing in a site area and a second method example for identifying bottlenecks in one or more servers existing in the site area will be described in order with reference to the accompanying drawings.
[0032]
(First method example)
First, FIG. 1 is a diagram showing the system configuration of a site area to which the site area bottleneck identification method according to the first method example of the present invention is applied, and FIG. 2 is a diagram showing a bottleneck threshold table that defines the determination thresholds for the log information collected from each network element in the site area shown in FIG. 1.
[0033]
As shown in FIG. 1, the site area bottleneck identification method according to the first method example is intended to identify bottlenecks in a plurality of network elements existing in a site area. It is applied to a system that has, in a site area 2a connected to the Internet 1, three servers S1, S2 and S3, two firewalls F1 and F2, two load sharing devices B1 and B2, and two server LANs (LAN: Local Area Network) P1 and P2.
[0034]
As shown in FIG. 2, two log information determination items are set for the servers S1, S2 and S3 among the network elements: the NIC usage total bandwidth (NIC: Network Interface Card) with a determination threshold of 100 MB/s (megabytes per second, upper limit value) and the CPU usage rate (CPU: Central Processing Unit) with a determination threshold of 50% (upper limit value).
[0035]
For the firewalls F1 and F2, the load sharing devices B1 and B2, and the server LANs P1 and P2, which are also network elements, a packet loss rate with a determination threshold of 2.0% is set as the log information determination item.
[0036]
The bottleneck threshold value table 3a, in which the log information determination items and the corresponding determination thresholds are set, is stored in an arbitrary computer installed in the site area 2a. This computer may be any one of the servers (S1, S2 or S3) existing in the site area 2a, or it may be installed in the site area 2a independently of these servers.
[0037]
Next, FIG. 3 is a flowchart for explaining a method of identifying a bottleneck in the server area according to the first method example of the present invention.
[0038]
As shown in the figure, in the bottleneck identification method according to this method example, the computer described above starts by identifying bottlenecks in the servers S1, S2 and S3 (ST1), in the firewalls F1 and F2 (ST2), in the load sharing devices B1 and B2 (ST3), and in the server LANs P1 and P2 (ST4). These steps may be executed in any order.
[0039]
To identify each of these bottlenecks, the computer collects, as log information that can be used to determine the processing capability of each network element, the NIC usage total bandwidth and the CPU usage rate from the servers S1, S2 and S3 over a predetermined time, and the packet loss rate from the firewalls F1 and F2, the load sharing devices B1 and B2, and the server LANs P1 and P2 over a predetermined time (log information collection process).
[0040]
When the computer collects the total NIC usage bandwidth and CPU usage rate from the servers S1, S2 and S3, a performance monitor (software means, not shown) set in the computer can be used. When collecting packet loss rates from the firewalls F1 and F2, a management information base (MIB) set in the computer can be used.
[0041]
When the computer collects packet loss rates from the load sharing devices B1 and B2 and the server LANs P1 and P2, the required packet loss rate can be calculated from the packet transmission/reception counter values acquired from each of these network elements.
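The text does not give the formula used to derive a loss rate from the counters. The following is a minimal sketch under the assumption that the rate is the share of transmitted packets that were not received, expressed as a percentage; the function name and the sample counter values are hypothetical.

```python
def packet_loss_rate(sent: int, received: int) -> float:
    """Return the packet loss rate in percent (assumed formula, not from the source)."""
    if sent <= 0:
        # Nothing was transmitted in the interval, so no loss can be measured.
        return 0.0
    lost = max(sent - received, 0)
    return 100.0 * lost / sent


# Hypothetical counter values read from a load sharing device or server LAN.
rate = packet_loss_rate(sent=10_000, received=9_850)
exceeds_threshold = rate > 2.0  # 2.0% is the determination threshold of FIG. 2
```

The resulting percentage can then be compared directly with the 2.0% determination threshold.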
[0042]
Next, as illustrated in FIGS. 4A to 4D, the computer writes the observed values of the log information obtained for each network element into the log information observation value table 4a that it holds, compares these observed values with the corresponding determination thresholds set in advance in the bottleneck threshold value table 3a, and determines whether or not each network element constitutes a bottleneck (bottleneck determination process). In the figure, “◯” means that the corresponding network element has been determined not to constitute a bottleneck, and “●”, which appears in later figures, means that it has been determined to constitute a bottleneck.
[0043]
At this time, the computer compares the single log information to be used as the bottleneck determination criterion, selected from the collected NIC usage total bandwidth and CPU usage rate (details are given in the second method example), with the corresponding determination threshold. If the single log information is outside the allowable range of the determination threshold, the computer describes the individual identifier (S1, S2 or S3) of the corresponding server in the bottleneck list (not shown) (bottleneck server description process).
[0044]
Further, the computer compares each collected packet loss rate with the corresponding determination threshold and, if it is outside the allowable range of the determination threshold, describes the individual identifier of the corresponding firewall (F1 or F2), load sharing device (B1 or B2) or server LAN (P1 or P2) in the bottleneck list (bottleneck firewall description process, bottleneck load sharing device description process, and bottleneck server LAN description process).
[0045]
Next, the computer determines whether or not the bottleneck list is empty (φ) (ST5). In the above example, every network element has been determined not to constitute a bottleneck, and the bottleneck list is therefore empty (ST5; YES), so the computer determines that, in this observation, no bottleneck to be identified exists in the site area 2a (ST6).
[0046]
On the other hand, as illustrated in FIGS. 5A to 5D, suppose that among the log information observation values written in the log information observation value table 4b, the CPU usage rate of the server S2 and the packet loss rate of the firewall F1 are determined to be outside the allowable range of the determination threshold, so that each of these network elements constitutes a bottleneck. In this case the computer describes the corresponding individual identifiers S2 and F1 in the bottleneck list.
[0047]
Then, the computer determines whether or not the bottleneck list is empty. Since the bottleneck list is not empty this time (ST5; NO), the computer determines that bottlenecks exist in the server S2 and the firewall F1 in the site area 2a, and outputs the bottleneck list to the outside (ST7).
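The flow of the first method example (ST1 to ST7) can be sketched as a table lookup followed by a threshold comparison. This is an illustrative sketch only: the thresholds follow FIG. 2 as described in the text, but the observed values, the dictionary layout, and the names `THRESHOLDS` and `identify_bottlenecks` are hypothetical, since the contents of FIGS. 4 and 5 are not reproduced here.

```python
# Determination thresholds as described for bottleneck threshold table 3a (all upper limits).
THRESHOLDS = {
    "server": {"nic_mb_per_s": 100.0, "cpu_percent": 50.0},
    "firewall": {"packet_loss_percent": 2.0},
    "load_sharing_device": {"packet_loss_percent": 2.0},
    "server_lan": {"packet_loss_percent": 2.0},
}


def identify_bottlenecks(observations, criterion="cpu_percent"):
    """Return the bottleneck list: identifiers whose observed value exceeds its threshold."""
    bottleneck_list = []
    for identifier, (kind, values) in observations.items():
        # Servers are judged on the single selected criterion; other elements on loss rate.
        item = criterion if kind == "server" else "packet_loss_percent"
        if values[item] > THRESHOLDS[kind][item]:
            bottleneck_list.append(identifier)
    return bottleneck_list


# Hypothetical observation values in the spirit of FIG. 5 (S2 and F1 out of range).
observations = {
    "S1": ("server", {"nic_mb_per_s": 40.0, "cpu_percent": 30.0}),
    "S2": ("server", {"nic_mb_per_s": 40.0, "cpu_percent": 80.0}),
    "S3": ("server", {"nic_mb_per_s": 20.0, "cpu_percent": 10.0}),
    "F1": ("firewall", {"packet_loss_percent": 3.5}),
    "F2": ("firewall", {"packet_loss_percent": 0.5}),
    "B1": ("load_sharing_device", {"packet_loss_percent": 0.1}),
    "B2": ("load_sharing_device", {"packet_loss_percent": 0.2}),
    "P1": ("server_lan", {"packet_loss_percent": 0.0}),
    "P2": ("server_lan", {"packet_loss_percent": 0.3}),
}

bottleneck_list = identify_bottlenecks(observations)
if bottleneck_list:  # ST5; NO: the list is not empty
    print("Bottlenecks:", bottleneck_list)  # ST7
else:                # ST5; YES
    print("No bottleneck in the site area")  # ST6
```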
[0048]
(Second method example)
Next, FIG. 6 is a diagram showing a partial system configuration of a site area to which the site area bottleneck identification method according to the second method example of the present invention is applied, and FIG. 7 is a flowchart for explaining the server area bottleneck identification method according to the second method example of the present invention.
[0049]
First, as shown in FIG. 6, the site area bottleneck identification method according to the second method example is intended to identify bottlenecks in one or more servers existing in a site area. It is applied to a system that has five servers S1, S2, S3, S4 and S5 in a site area 2b (regardless of the number or presence of firewalls, load sharing devices and server LANs).
[0050]
As in the first method example, the log information determination items for the servers S1, S2, S3, S4 and S5 are the NIC usage total bandwidth with a determination threshold of 100 MB/s (upper limit value) and the CPU usage rate with a determination threshold of 50% (upper limit value). Of these two items, it is assumed that a system maintainer (not shown) has selected “CPU usage rate” as the bottleneck determination criterion, that is, as the single log information to be judged preferentially.
[0051]
Then, as shown in FIG. 7, in the server area bottleneck identification method according to this method example, the computer in the site area 2b collects the log information of the servers S1, S2, S3, S4 and S5 x times at intervals t over a predetermined time T. It starts by setting the time slice τ (1 ≦ τ ≦ x, x = T / t) to “1” (ST11), setting the counter i to “1”, where the individual identifier of each server is Si (1 ≦ i ≦ n, n = 5) (ST12), and collecting the log information of the corresponding server S1 (ST13). In the following description, for simplicity, the number x of log information collections for the servers S1, S2, S3, S4 and S5 is assumed to be “x = 3”.
[0052]
Next, the computer determines whether or not the log information of the server S1 has been collected normally (ST14). If it has (ST14; YES), the computer performs the bottleneck determination for the server S1 in the time slice τ = 1 (ST15).
[0053]
Next, the computer increments the counter i by “1” (ST16) and executes the processing from ST13 onward in the same way for the servers S2, S3, S4 and S5 in the time slice τ = 1 (ST17; NO). When the value of the counter i exceeds the prescribed value “5 (= n)” (ST17; YES), the computer increments the time slice τ by “1” (ST18).
[0054]
The computer then executes the processing from ST12 onward in the time slice τ = 2 for all the servers S1, S2, S3, S4 and S5 in the same way (ST19; NO), and repeats this processing until the value of the time slice τ exceeds the prescribed value “3 (= x)” (ST19; YES).
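The nested iteration of ST11 to ST19 can be sketched as two counters, τ over time slices and i over servers. The sketch below is an assumption-laden illustration: `collect` is a hypothetical callback standing in for the performance monitor, and the per-sample checks of ST14, ST15 and ST20 are reduced to a comment.

```python
def collect_all(servers, x, collect):
    """Collect one sample per server per time slice, in the order of FIG. 7."""
    table = {}
    tau = 1                          # ST11: start with the first time slice
    while tau <= x:                  # ST19: stop once tau exceeds x
        i = 1                        # ST12: restart the server counter
        while i <= len(servers):     # ST17: stop once i exceeds n
            server_id = servers[i - 1]
            # ST13: collect the log information of server Si in time slice tau.
            # ST14/ST15/ST20 (normal-collection check, determination, failure
            # candidate entry) would act on this sample; omitted here.
            table[(server_id, tau)] = collect(server_id, tau)
            i += 1                   # ST16
        tau += 1                     # ST18
    return table


# Hypothetical collector returning a fixed sample for every server and slice.
table = collect_all(["S1", "S2", "S3", "S4", "S5"], 3,
                    lambda server_id, tau: {"cpu_percent": 30.0})
```

With n = 5 and x = 3 this yields 15 samples, visited slice by slice.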
[0055]
As a result of the above processing, assume that the log information observation value table 4c shown in FIG. 8 is obtained in the computer. In this case, the required log information has been collected normally in all the time slices τ = 1, 2 and 3, and no log information exceeds the determination threshold for any of the servers S1, S2, S3, S4 and S5. The computer therefore determines that, in this observation, the bottleneck to be identified does not exist in any of the servers S1, S2, S3, S4 and S5 (nothing is described in the bottleneck list).
[0056]
On the other hand, assume that the log information observation value table 4d shown in FIG. 9 is obtained in the computer. When the required log information (NIC usage total bandwidth and CPU usage rate) of the server S1 cannot be collected normally in any of the time slices τ = 1, 2 and 3 (ST14; NO), the computer describes the corresponding individual identifier S1 in a failure candidate list (not shown) that it holds (ST20). As shown in the flowchart of FIG. 7, the individual identifier is actually described in the failure candidate list in the course of the above processing, each time it is found that the required log information could not be collected normally.
[0057]
Next, the computer determines whether x (three) entries for a server identified by the same individual identifier exist in the failure candidate list (ST21). According to the log information observation value table 4d of FIG. 9, the required log information of the server S1 was not collected normally in any of the time slices τ = 1, 2 and 3, so the processing of ST20 was executed each time and the failure candidate list contains three entries for the same server (individual identifier S1) (ST21; YES). The computer therefore adds the contents of the failure candidate list (individual identifier S1) to the bottleneck list (ST22; failure candidate server addition process).
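The failure candidate handling of ST20 to ST22 amounts to counting, per server, how many time slices produced no usable log information. A minimal sketch follows, assuming a failed collection is represented as `None`; the table contents and the function name are hypothetical.

```python
from collections import Counter


def failed_servers(table, x):
    """Return servers whose log information could not be collected in all x slices."""
    # ST20: one failure candidate entry per (server, time slice) that failed.
    failure_candidates = [server_id for (server_id, tau), sample in table.items()
                          if sample is None]
    counts = Counter(failure_candidates)
    # ST21/ST22: only servers with x entries are added to the bottleneck list.
    return sorted(server_id for server_id, n in counts.items() if n == x)


# Hypothetical samples: S1 fails in every slice, S2 fails only once.
table = {
    ("S1", 1): None, ("S2", 1): {"cpu_percent": 20.0},
    ("S1", 2): None, ("S2", 2): None,
    ("S1", 3): None, ("S2", 3): {"cpu_percent": 25.0},
}
bottleneck_list = failed_servers(table, x=3)
```

Requiring a failure in every slice keeps a single missed sample, such as the one for S2 here, from being reported as a failed server.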
[0058]
Note that for the remaining servers S2, S3, S4 and S5 shown in the log information observation value table 4d of FIG. 9, no log information exceeds the determination threshold, so the computer determines that the bottleneck to be identified does not exist in these remaining servers (nothing is described in the bottleneck list for the servers S2, S3, S4 and S5).
[0059]
Next, when the failure candidate list does not contain x entries for a server identified by the same individual identifier (ST21; NO), the computer determines whether more than k servers (individual identifiers) exist in the bottleneck list, in order to identify in which of the servers S1, S2, S3, S4 and S5 the bottleneck exists (ST23).
[0060]
The value “k” above is a number (designated number) specified by the system maintainer. When more servers (individual identifiers) than the designated number are provisionally present in the bottleneck list, only the bottlenecks with the most serious faults are output to the final bottleneck list, and unnecessary output for minor bottlenecks is suppressed. In the following description, for simplicity, the designated number k is assumed to be “k = 3”.
[0061]
Here, assume that the servers S1, S2, S3 and S4 have been determined to be bottlenecks based on the “CPU usage rate” selected in advance by the system maintainer as the bottleneck determination criterion, and that their individual identifiers S1, S2, S3 and S4 have been described in the bottleneck list.
[0062]
In this case, assume that the log information observation value table 4e shown in FIG. 10 is obtained in the computer, in which the CPU usage rates of the servers S1, S2 and S3 exceed the determination threshold in all the time slices τ = 1, 2 and 3, and the CPU usage rate of the server S4 exceeds the determination threshold in the time slice τ = 3 (values exceeding the threshold are underlined; the same applies hereinafter).
[0063]
At this time, since four server individual identifiers S1, S2, S3 and S4 exist in the bottleneck list, exceeding k (three) (ST23; YES), the computer calculates the average value of each item of log information, that is, of each CPU usage rate, for the servers S1, S2, S3 and S4 described in the bottleneck list (ST24; log information average value calculation process).
[0064]
As a result, the computer obtains a new log information observation value table 4f as shown in FIG. 11. Based on the average value of each CPU usage rate (each item of log information), it sorts the individual identifiers S1, S2, S3 and S4 of the servers in descending order of difference from the determination threshold (50%) (S2 (= 90) > S1 (= 70) > S3 (= 60) > S4 (= 40)) and extracts the top three (k), that is, S2, S1 and S3 (ST25; server designated number extraction process).
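The averaging and extraction of ST23 to ST25 can be sketched as a sort on the margin between each server's mean and the threshold. The per-slice CPU values below are hypothetical (FIG. 10 is not reproduced in the text); they were chosen only so that the averages match the 90, 70, 60 and 40 stated above, and the function name is likewise an assumption.

```python
from statistics import mean


def top_k_bottlenecks(cpu_samples, threshold, k):
    """Keep at most k servers, ranked by how far their average exceeds the threshold."""
    if len(cpu_samples) <= k:
        # ST23; NO: the list does not exceed k entries, so it is output unchanged.
        return list(cpu_samples)
    # ST24: average the single log information over the time slices.
    averages = {server_id: mean(values) for server_id, values in cpu_samples.items()}
    # ST25: sort in descending order of difference from the threshold, keep the top k.
    ranked = sorted(averages, key=lambda s: averages[s] - threshold, reverse=True)
    return ranked[:k]


# Hypothetical CPU usage rates (%) for tau = 1, 2, 3 of the servers in the bottleneck list.
cpu_samples = {
    "S1": [60, 70, 80],   # average 70
    "S2": [85, 90, 95],   # average 90
    "S3": [55, 60, 65],   # average 60
    "S4": [30, 30, 60],   # average 40, exceeds 50% only at tau = 3
}
final_list = top_k_bottlenecks(cpu_samples, threshold=50, k=3)
```

Ranking on the average rather than on any single slice is what drops S4, whose one excursion above 50% is outweighed by two quiet slices.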
[0065]
Note that in all the servers S1, S2, S3, S4, and S5 shown in the log information observation value table 4e in FIG. 10, there is no log information that exceeds the determination threshold with respect to the NIC use total bandwidth. Therefore, it is determined that the bottleneck to be specified for the total bandwidth used by the NIC does not exist in any of the servers S1, S2, S3, S4, and S5 (nothing is described in the bottleneck list).
[0066]
Thereafter, the computer outputs the individual identifiers S2, S1 and S3 of the top three servers extracted in ST25 to the final bottleneck list. In addition, when the contents of the failure candidate list were added to the bottleneck list in ST20 to ST22, the computer outputs the individual identifier of the server indicated by those contents (“S1” in the example of FIG. 9) to the final bottleneck list as a failed server (ST26).
[0067]
On the other hand, assume that the log information observation value table 4g shown in FIG. 12 is obtained in the computer and that only the CPU usage rate of the server S1 exceeds the determination threshold, in the time slice τ = 3. Since only one server individual identifier S1 exists in the bottleneck list, which does not exceed k (three) (ST23; NO), the computer does not execute the processing of ST24 and ST25 and outputs the individual identifier S1 to the final bottleneck list as a server constituting the bottleneck.
[0068]
In the above description of the second method example, the NIC usage total bandwidth and the CPU usage rate were given as the log information determination items for the servers S1, S2, S3, S4 and S5. In addition to these, the available RAM memory remaining capacity (RAM: Random Access Memory), the HDD data reading speed (HDD: Hard Disk Drive), the number of clients waiting for connection, the cumulative number of errors generated per second, and the like can also be applied. Any of these log information determination items can be collected by the above-described performance monitor set in the computer.
[0069]
Finally, a specific example of the in-server bottleneck determination process described in the second method example (the process of ST15 in the flowchart of FIG. 7) is given for the case where the many log information determination items listed above are applied.
[0070]
FIG. 13 is a diagram showing a bottleneck threshold table that defines determination thresholds for the various items of log information collected from the servers S1, S2, S3, S4 and S5 in the site area 2b shown in FIG. 6, and FIG. 14 is a flowchart for explaining the details of the in-server bottleneck determination process shown in FIG. 7.
[0071]
First, as shown in FIG. 13, the bottleneck threshold table 3b set in the computer defines the following log information determination items for each of the servers S1, S2, S3, S4 and S5 in the site area 2b: the available RAM memory remaining capacity with a determination threshold of 100 MB (lower limit value), the NIC usage total bandwidth with a determination threshold of 10 MB/s (upper limit value), the CPU usage rate with a determination threshold of 50% (upper limit value), the HDD data reading speed with a determination threshold of 50 MB/s (upper limit value), the number of clients waiting for connection with a determination threshold of 1 (upper limit value), and the cumulative number of errors generated per second with a determination threshold of 1/s (upper limit value).
[0072]
Further, assume that the following log information has been collected. For the server S1: available RAM memory remaining capacity 200 MB, NIC usage total bandwidth 5 MB/s, CPU usage rate 30%, HDD data reading speed 30 MB/s, number of clients waiting for connection 0, and cumulative number of errors generated per second 0/s. For the server S2: available RAM memory remaining capacity 50 MB, NIC usage total bandwidth 5 MB/s, CPU usage rate 30%, HDD data reading speed 30 MB/s, number of clients waiting for connection 0, and cumulative number of errors generated per second 0/s.
[0073]
First, in the in-server bottleneck determination process based on the log information of the server S1, as shown in FIG. 14, the available RAM memory remaining capacity is not below its threshold (ST31; NO), the NIC usage total bandwidth does not exceed its threshold (ST32; NO), the CPU usage rate does not exceed its threshold (ST33; NO), the HDD data reading speed does not exceed its threshold (ST34; NO), the number of clients waiting for connection does not exceed its threshold (ST35; NO), and the cumulative number of errors generated per second does not exceed its threshold (ST36; NO). The computer therefore does not describe the server S1 (individual identifier) in the bottleneck list (ST37).
[0074]
On the other hand, in the in-server bottleneck determination process based on the log information of the server S2, as shown in the figure, the available RAM memory remaining capacity (50 MB) is below the threshold of 100 MB (ST31; YES) (see FIG. 13), so the computer describes the server S2 (individual identifier) in the bottleneck list (ST38).
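The six checks of FIG. 14 can be sketched as a single pass over the threshold table 3b, treating the RAM item as a lower limit and the others as upper limits. The thresholds and the S1 and S2 values are those stated in the text; the field names, the function name, and the choice of strict comparisons at the boundary are assumptions, since the text does not say how a value exactly equal to the threshold is treated.

```python
# (item, threshold, limit type) in the order ST31 to ST36.
THRESHOLD_TABLE_3B = [
    ("ram_free_mb", 100, "lower"),        # ST31
    ("nic_mb_per_s", 10, "upper"),        # ST32
    ("cpu_percent", 50, "upper"),         # ST33
    ("hdd_read_mb_per_s", 50, "upper"),   # ST34
    ("waiting_clients", 1, "upper"),      # ST35
    ("errors_per_s", 1, "upper"),         # ST36
]


def is_bottleneck(log):
    """Return True as soon as any item falls outside its allowable range."""
    for item, threshold, limit in THRESHOLD_TABLE_3B:
        value = log[item]
        if limit == "lower" and value < threshold:
            return True   # ST38: describe the server in the bottleneck list
        if limit == "upper" and value > threshold:
            return True   # ST38
    return False          # ST37: the server is not described


s1 = {"ram_free_mb": 200, "nic_mb_per_s": 5, "cpu_percent": 30,
      "hdd_read_mb_per_s": 30, "waiting_clients": 0, "errors_per_s": 0}
s2 = {"ram_free_mb": 50, "nic_mb_per_s": 5, "cpu_percent": 30,
      "hdd_read_mb_per_s": 30, "waiting_clients": 0, "errors_per_s": 0}

bottleneck_list = [name for name, log in (("S1", s1), ("S2", s2)) if is_bottleneck(log)]
```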
[0075]
As described above, the embodiments of the present invention have been described with reference to the first and second method examples. However, the present invention is not necessarily limited only to the above-described method, and the object of the present invention is achieved. Modifications can be made as appropriate within the scope of the effects described below.
[0076]
[Effect of the invention]
As described above in detail, according to the present invention, it is possible to quickly grasp failures such as performance degradation and failures of a plurality of network elements existing in the site area based on observation values. It is possible to identify a bottleneck in each network element, particularly a bottleneck in one or more servers, extremely efficiently.
[Brief description of the drawings]
FIG. 1 is a diagram showing a system configuration of a site area applied to an intra-site area bottleneck identification method according to a first method example of the present invention.
FIG. 2 is a diagram showing a bottleneck threshold value table that defines determination thresholds for log information collected from each network element in the site area shown in FIG. 1.
FIG. 3 is a flowchart for explaining a server area bottleneck identification method according to a first method example of the present invention.
FIG. 4 is a diagram showing an example of a log information observation value table applied in the first method example of the present invention.
FIG. 5 is a diagram showing another example of a log information observation value table applied in the first method example of the present invention.
FIG. 6 is a diagram showing a partial system configuration of a site area applied to a site area bottleneck identification method according to a second method example of the present invention.
FIG. 7 is a flowchart for explaining a server area bottleneck identification method according to a second method example of the present invention.
FIG. 8 is a diagram showing an example of a log information observation value table applied in the second method example of the present invention.
FIG. 9 is a diagram showing another example of the log information observation value table applied in the second method example of the present invention.
FIG. 10 is a diagram showing still another example of the log information observation value table applied in the second method example of the present invention.
FIG. 11 is a diagram showing a new log information observation value table obtained by calculating an average value of log information from the log information observation value table shown in FIG. 10.
FIG. 12 is a diagram showing still another example of the log information observation value table applied in the second method example of the present invention.
FIG. 13 is a diagram showing a bottleneck threshold value table defining determination threshold values for various log information collected by each server in the site area shown in FIG. 6.
FIG. 14 is a flowchart for explaining the details of the in-server bottleneck determination process shown in FIG. 7.
[Explanation of symbols]
1 ... Internet
2a, 2b ... Site area
3a, 3b ... Bottleneck threshold table
4a-4g ... Log information observation value table
S1-S5 ... Server
F1, F2 ... Firewall
B1, B2 ... Load balancer
P1, P2 ... Server LAN

Claims

A site area bottleneck identification method in which, in order to identify whether or not a plurality of network elements existing in a site area constitute a bottleneck, an arbitrary computer installed in the site area of the Internet end-to-server sequentially executes: a log information collection process for collecting, from the plurality of network elements over a predetermined time, log information that can be used to determine the processing capability of each network element; a bottleneck determination process for determining, based on the collected log information of each network element, whether or not each network element constitutes the bottleneck; and a bottleneck list output process for outputting, when the determination finds one or more network elements constituting the bottleneck, a bottleneck list in which the individual identifiers of those network elements are described,
The computer
In the bottleneck determination process, compares the single log information to be used as the determination criterion for the bottleneck, among the collected one or more items of log information, with a determination threshold set in advance in the computer, and executes a bottleneck server description process in which, when the single log information is outside the allowable range of the determination threshold, the individual identifier of the corresponding specific server is described in the bottleneck list; and, if more individual identifiers of specific servers than a designated number exist in the bottleneck list after the bottleneck server description process, sequentially executes:
A log information average value calculation process for calculating an average value of the single log information collected within the predetermined time in the log information collection process;
A server designated number extraction process for extracting, based on the calculated average value of the single log information, the designated number of individual identifiers of the specific servers in descending order of difference from the determination threshold; and
A bottleneck list rewriting process for rewriting the contents of the bottleneck list after the bottleneck server description process with the extracted designated number of individual identifiers of the specific servers,
A bottleneck identification method in a site region characterized by the above.

The plurality of network elements are:
As one element, it is configured to include a server consisting of one or more,
The log information collection process includes:
As the required log information, collects from the one or more servers at least one of available RAM memory remaining capacity, NIC usage total bandwidth, CPU usage rate, HDD data reading speed, number of clients waiting for connection, and cumulative number of errors generated per second,
The method for identifying bottlenecks in a site area according to claim 1.

The log information collection process includes:
In collecting the one or more log information relating to the one or more servers,
Using a performance monitor set in the computer,
The method for identifying bottlenecks in the site area according to claim 2.

The computer
In the log information collection process, when one or more servers cannot collect the one or more log information within the predetermined time,
Without executing the bottleneck determination process for a specific server from which log information cannot be collected, executes a failure candidate server addition process for adding the individual identifier of that specific server to the bottleneck list.
The method for identifying a bottleneck in a site area according to claim 2 or 3, wherein:

A site area bottleneck identification method in which, in order to identify whether or not a plurality of network elements existing in a site area constitute a bottleneck, an arbitrary computer installed in the site area of the Internet end-to-server sequentially executes: a log information collection process for collecting, from the plurality of network elements over a predetermined time, log information that can be used to determine the processing capability of each network element; a bottleneck determination process for determining, based on the collected log information of each network element, whether or not each network element constitutes the bottleneck; and a bottleneck list output process for outputting, when the determination finds one or more network elements constituting the bottleneck, a bottleneck list in which the individual identifiers of those network elements are described,
The plurality of network elements are:
As one element, it is configured to include a firewall consisting of one or more,
The log information collection process includes:
Collect the packet loss rate in the one or more firewalls as the required log information,
The computer
In the bottleneck determination process, compares the collected packet loss rate with a determination threshold set in advance in the computer for the log information of each network element and, when the packet loss rate is outside the allowable range of the determination threshold,
Executing a bottleneck firewall description process in which an individual identifier of the corresponding specific firewall is described in the bottleneck list;
A bottleneck identification method in a site region characterized by the above.

The log information collection process includes:
In collecting the packet loss rate for the one or more firewalls,
Using a management information base set in the computer,
The method for identifying bottlenecks in a site area according to claim 5.

A site area bottleneck identification method in which, in order to identify whether or not a plurality of network elements existing in a site area constitute a bottleneck, an arbitrary computer installed in the site area of the Internet end-to-server sequentially executes: a log information collection process for collecting, from the plurality of network elements over a predetermined time, log information that can be used to determine the processing capability of each network element; a bottleneck determination process for determining, based on the collected log information of each network element, whether or not each network element constitutes the bottleneck; and a bottleneck list output process for outputting, when the determination finds one or more network elements constituting the bottleneck, a bottleneck list in which the individual identifiers of those network elements are described,
The plurality of network elements are:
As one element, it is configured to include a load balancer consisting of one or more,
The log information collection process includes:
As the required log information, the packet loss rate in the one or more load balancers is collected,
The computer
In the bottleneck determination process, compares the collected packet loss rate with a determination threshold set in advance in the computer for the log information of each network element and, when the packet loss rate is outside the allowable range of the determination threshold,
Executing a bottleneck load distribution device description process for describing an individual identifier of the corresponding specific load distribution device in the bottleneck list;
A bottleneck identification method in a site region characterized by the above.

The log information collection process includes:
A process of calculating the required packet loss rate from the packet transmission and reception counter values acquired from the one or more load balancers;
The method for identifying bottlenecks in a site area according to claim 7.
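
A minimal sketch of such a calculation follows. It assumes the counters are cumulative packets-sent and packets-received values read at the start and end of the predetermined time, and that the loss rate is the fraction of sent packets that were not received. The claim does not state the formula, so the formula and the function name packet_loss_rate are assumptions.

```python
def packet_loss_rate(
    sent_start: int,
    sent_end: int,
    received_start: int,
    received_end: int,
) -> float:
    """Estimate the packet loss rate over one collection period.

    Assumption: each argument is a cumulative counter value, so the
    difference between end and start is the traffic in the period.
    Counter wrap-around is not handled in this sketch.
    """
    sent = sent_end - sent_start
    received = received_end - received_start
    if sent <= 0:
        # No traffic observed in the period: report no loss.
        return 0.0
    lost = max(sent - received, 0)  # clamp in case counters are skewed
    return lost / sent


rate = packet_loss_rate(
    sent_start=1000, sent_end=2000,
    received_start=900, received_end=1850,
)
```

In this example 1000 packets were sent and 950 received during the period, so 50 of 1000 packets are counted as lost.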

A method for identifying bottlenecks in a site area, in which an arbitrary computer installed in the site area of the Internet end-to-server identifies whether or not a plurality of network elements existing in the site area constitute a bottleneck by sequentially executing: a log information collection process of collecting, from each network element over a predetermined time, log information usable for determining the processing capability of that network element; a bottleneck determination process of determining, based on the collected log information of each network element, whether or not that network element constitutes the bottleneck; and a bottleneck list output process of outputting, when this determination finds one or more network elements constituting the bottleneck, a bottleneck list describing the individual identifier of each such network element,
Wherein the plurality of network elements include, as one element, one or more server LANs,
The log information collection process collects, as the required log information, the packet loss rate in the one or more server LANs, and
The computer, in the bottleneck determination process, compares the collected packet loss rate with a determination threshold set in advance in the computer for the log information of each network element and, if the packet loss rate is outside the allowable threshold range,
Executes a bottleneck server LAN description process of describing the individual identifier of the corresponding server LAN in the bottleneck list;
The method for identifying bottlenecks in a site area characterized by the above.

The log information collection process includes:
A process of calculating the required packet loss rate from the packet transmission and reception counter values acquired from the one or more server LANs;
The method for identifying bottlenecks in a site area according to claim 9.