JP6471110B2

JP6471110B2 - Suspected failure location estimation apparatus, failure location estimation method, and failure location estimation program

Info

Publication number: JP6471110B2
Application number: JP2016033992A
Authority: JP
Inventors: 愛子尾居; 直樹武; 浩行大西
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2016-02-25
Filing date: 2016-02-25
Publication date: 2019-02-13
Anticipated expiration: 2036-02-25
Also published as: JP2017152949A

Description

本発明は、通信ネットワーク上における故障被疑箇所を推定する技術に関する。 The present invention relates to a technique for estimating a suspected failure location on a communication network.

本願出願人は、通信ネットワーク上で提供されるＥ２Ｅ（End to End）サービスの性能を評価するために、ＭＩＢ（Management Information Base）情報を用いて劣化箇所を特定する技術を提案している（特許文献１参照）。 In order to evaluate the performance of an E2E (End to End) service provided on a communication network, the applicant of the present application has proposed a technique for identifying a degradation location using MIB (Management Information Base) information (patent) Reference 1).

特開２００６−３３７１５号公報JP 2006-33715 A

しかし、特許文献１に記載の技術は、１事業者内に閉じたＥ２Ｅサービスを想定したものであり、他事業者のＭＩＢ情報を取得することは困難であるため、複数事業者を跨いだＥ２Ｅサービスに対して適用することは困難である。
また、特許文献１に記載の技術は、１事業者内に閉じたＥ２Ｅサービスであっても、サービスが大規模な場合には、経路上の全ての装置からＭＩＢ情報を収集して解析することは困難である。 However, the technology described in Patent Document 1 assumes an E2E service closed within one operator, and it is difficult to acquire MIB information of other operators. It is difficult to apply to services.
In addition, the technique described in Patent Document 1 collects and analyzes MIB information from all devices on a route even when an E2E service is closed within one service provider when the service is large-scale. It is difficult.

一方、本願出願人は、複数事業者を跨いだＥ２Ｅサービスにおいて故障被疑箇所を推定する技術を提案している（特願２０１５−１５６６２６、本願出願時未公開）。
かかる技術では、複数のサービス監視情報に基づいて、障害が発生した通信サービスに最も多く含まれるノードが故障被疑箇所であると推定している。 On the other hand, the applicant of the present application has proposed a technique for estimating a suspected failure location in an E2E service across multiple operators (Japanese Patent Application No. 2015-156626, unpublished at the time of application).
In this technique, it is estimated that a node most frequently included in a communication service in which a failure has occurred is a suspected failure location based on a plurality of service monitoring information.

しかし、かかる技術は、Ｅ２Ｅサービス監視の具体的な手法については言及されていない。また、かかる技術は、単一の故障を対象としており、多重故障には迅速に対応することができない。また、かかる技術は、故障被疑箇所を推定するために、一般的には公開されていないＩＳＰ網の構成情報（の推定）を必要とする。 However, this technique does not mention a specific method of E2E service monitoring. In addition, such a technique targets a single failure and cannot quickly cope with multiple failures. In addition, such a technique requires (estimated) configuration information of an ISP network that is not generally disclosed in order to estimate a suspected failure location.

本発明は、前記事情に鑑みて創案されたものであり、複数事業者を跨いだＥ２Ｅサービスや大規模なＥ２Ｅサービスにおいても故障被疑箇所を好適に推定することが可能な故障被疑箇所推定装置、故障被疑箇所推定方法および故障被疑箇所推定プログラムを提供することを課題とする。 The present invention was devised in view of the above circumstances, and a suspected failure location estimating apparatus capable of suitably estimating a suspected failure location even in an E2E service or a large-scale E2E service across multiple operators, It is an object of the present invention to provide a failure suspected place estimation method and a failure suspected place estimation program.

前記課題を解決するために、本発明は、ユーザ端末装置とクラウドネットワーク内のｗｅｂサーバ装置との通信ネットワークにおける故障被疑箇所を推定する故障被疑箇所推定装置であって、前記通信ネットワークの各部位におけるｗｅｂページ読込に要する時間に関するｗｅｂページ読込時間情報と、前記クラウドネットワーク内の応答性に関する応答性情報と、前記ｗｅｂサーバ装置の性能に関する性能情報と、を収集する収集部と、収集された前記ｗｅｂページ読込時間情報、前記応答性情報及び前記性能情報に基づいて、前記通信ネットワークにおける前記故障被疑箇所を推定する推定部と、を備えることを特徴とする。 In order to solve the above-mentioned problem, the present invention is a failure suspected place estimation device that estimates a suspected place in a communication network between a user terminal device and a web server device in a cloud network, a collecting unit that collects web page reading time information relating to the time required for reading the web page, responsiveness information relating to responsiveness in the cloud network, and performance information relating to the performance of the web server device; and the collected web An estimation unit configured to estimate the suspected fault location in the communication network based on page reading time information, the responsiveness information, and the performance information.

かかる構成によると、複数事業者を跨いだＥ２Ｅサービスや大規模なＥ２Ｅサービスにおいても故障被疑箇所を好適に推定することができる。 According to such a configuration, a suspected failure location can be suitably estimated even in an E2E service or a large-scale E2E service straddling a plurality of operators.

前記通信ネットワークは、前記ユーザ端末装置と前記クラウドネットワークとを通信可能に接続するネットワークを含んでおり、前記収集部は、前記ユーザ端末装置の位置情報を収集し、前記推定部は、前記ネットワークが前記故障被疑箇所であると推定した場合に、収集された前記位置情報に基づいて、前記ネットワーク内における前記故障被疑箇所を絞り込む構成であってもよい。 The communication network includes a network that connects the user terminal device and the cloud network in a communicable manner, the collection unit collects location information of the user terminal device, and the estimation unit The configuration may be such that when it is estimated that the location is the suspected failure location, the suspected failure location in the network is narrowed down based on the collected location information.

かかる構成によると、ネットワークが故障被疑箇所である場合にユーザ端末装置の位置情報に基づいて故障被疑箇所を絞り込むので、故障被疑箇所をより好適に推定することができる。 According to such a configuration, the suspected failure location is narrowed down based on the location information of the user terminal device when the network is the suspected failure location, so that the suspected failure location can be estimated more appropriately.

また、本発明は、ユーザ端末装置とクラウドネットワーク内のｗｅｂサーバ装置との通信ネットワークにおける故障被疑箇所を推定する故障被疑箇所推定装置による故障被疑箇所推定方法であって、前記故障被疑箇所推定装置が、前記通信ネットワークの各部位におけるｗｅｂページ読込に要する時間に関するｗｅｂページ読込時間情報と、前記クラウドネットワーク内の応答性に関する応答性情報と、前記ｗｅｂサーバ装置の性能に関する性能情報と、を収集する収集ステップと、前記故障被疑箇所推定装置が、収集された前記ｗｅｂページ読込時間情報、前記応答性情報及び前記性能情報に基づいて、前記通信ネットワークにおける前記故障被疑箇所を推定する推定ステップと、を含むことを特徴とする。 Moreover, this invention is a failure suspected place estimation method by the failure suspected place estimation apparatus which estimates the suspected place of failure in the communication network of a user terminal device and the web server apparatus in a cloud network, Comprising: Collection for collecting web page reading time information relating to the time required for reading web pages in each part of the communication network, responsiveness information relating to responsiveness in the cloud network, and performance information relating to the performance of the web server device And a step of estimating the suspected fault location in the communication network based on the collected web page reading time information, the responsiveness information, and the performance information. It is characterized by that.

前記通信ネットワークは、前記ユーザ端末装置と前記クラウドネットワークとを通信可能に接続するネットワークを含んでおり、前記収集ステップにおいて、前記故障被疑箇所推定装置が、前記ユーザ端末装置の位置情報を収集し、前記推定ステップにおいて、前記故障被疑箇所推定装置が、前記ネットワークが前記故障被疑箇所であると推定した場合に、収集された前記位置情報に基づいて、前記ネットワーク内における前記故障被疑箇所を絞り込む構成であってもよい。 The communication network includes a network that connects the user terminal device and the cloud network in a communicable manner, and in the collecting step, the failure suspected place estimating device collects position information of the user terminal device, In the estimation step, when the suspected fault location estimating apparatus estimates that the network is the suspected fault location, the suspected fault location in the network is narrowed down based on the collected location information. There may be.

また、本発明は、コンピュータを前記故障被疑箇所推定装置として機能させるＡＰＩ連携プログラムとしても具現化可能である。 The present invention can also be embodied as an API cooperation program that causes a computer to function as the suspected failure point estimation apparatus.

本発明によると、複数事業者を跨いだＥ２Ｅサービスや大規模なＥ２Ｅサービスにおいても故障被疑箇所を好適に推定することができる。 According to the present invention, it is possible to suitably estimate a suspected failure location even in an E2E service or a large-scale E2E service across a plurality of business operators.

本発明の実施形態に係る故障被疑箇所推定装置が適用された通信ネットワークを示す模式図である。It is a schematic diagram which shows the communication network to which the failure suspected place estimation apparatus which concerns on embodiment of this invention was applied. NavigationTiming APIによって計測されたｗｅｂページ読込時間情報を示す模式図である。It is a schematic diagram which shows the web page reading time information measured by NavigationTiming API. NavigationTiming APIによる計測値の一覧を示す表である。It is a table | surface which shows the list of the measured values by NavigationTiming API. CloudWatch APIによる計測値の一覧を示す表である。It is a table | surface which shows the list of the measured values by CloudWatch API. Geolocation APIによる計測値の一覧を示す表である。It is a table | surface which shows the list of the measured value by Geolocation API. 計測値の閾値を決定する手法を説明するための図であり、（ａ）は、計測値の経時変化の一例を示すグラフ、（ｂ）は、計測値の比率の経時変化の一例を示すグラフである。It is a figure for demonstrating the method of determining the threshold value of a measured value, (a) is a graph which shows an example of a time-dependent change of a measured value, (b) is a graph which shows an example of a time-dependent change of the ratio of a measured value. It is. 故障被疑箇所推定装置の動作例を説明するためのフローチャートである。It is a flowchart for demonstrating the operation example of a failure suspected location estimation apparatus. 故障被疑箇所推定装置の動作例を説明するためのフローチャートである。It is a flowchart for demonstrating the operation example of a failure suspected location estimation apparatus.

続いて、本発明の実施形態について図面を参照して説明する。図１に示すように、本発明の実施形態に係る故障被疑箇所推定装置１００は、ユーザ端末装置２とｗｅｂサーバ装置７０とを接続する通信ネットワーク１における故障被疑箇所を推定する装置である。 Next, embodiments of the present invention will be described with reference to the drawings. As shown in FIG. 1, a suspected failure point estimation device 100 according to an embodiment of the present invention is a device that estimates a suspected failure point in a communication network 1 that connects a user terminal device 2 and a web server device 70.

通信ネットワーク１は、複数のユーザ端末装置２側のネットワーク３として、ユーザ端末装置２と通信可能に接続される複数のアクセス網２０と、アクセス網２０と通信可能に接続される複数のＩＳＰ（Internet Service Provider）網３０と、ＩＳＰ網３０と通信可能に接続されるインターネット網４０と、を備える。 The communication network 1 is a network 3 on the side of a plurality of user terminal devices 2, a plurality of access networks 20 that are communicably connected to the user terminal devices 2, and a plurality of ISPs (Internet) that are communicably connected to the access network 20. Service Provider) network 30 and an Internet network 40 that is communicably connected to the ISP network 30.

アクセス網２０は、ユーザ端末装置２と最初に接続されるネットワークであり、市町村又は都道府県単位の範囲に敷設されている。ＩＳＰ網３０は、各通信事業者によって管理されているネットワークであり、複数の都道府県に跨る広範囲に敷設されている。ここで、ユーザ端末装置２がｗｅｂサーバ装置７０との通信を行う際にどのＩＳＰ網３０が用いられるかは、通信事業者ごとに異なるルールによって決定されたり変更されたりする。 The access network 20 is a network that is initially connected to the user terminal device 2 and is laid in a range of municipalities or prefectures. The ISP network 30 is a network managed by each telecommunications carrier, and is laid in a wide range across a plurality of prefectures. Here, which ISP network 30 is used when the user terminal device 2 communicates with the web server device 70 is determined or changed by a different rule for each communication carrier.

また、通信ネットワーク１は、ｗｅｂサーバ装置７０側のネットワーク構造すなわちクラウドネットワーク４として、インターネット網４０と通信可能に接続されるロードバランサ（ＥＬＢ：Elastic Load Balancer）５０と、ロードバランサ５０と通信可能に接続されるＤＣ（Data Communication）網６０と、ＤＣ網６０と通信可能に接続される複数のｗｅｂサーバ装置７０と、を備える。 In addition, the communication network 1 can communicate with a load balancer (ELB: Elastic Load Balancer) 50 and a load balancer 50 that are connected to the Internet network 40 as a network structure on the web server device 70 side, that is, a cloud network 4. A DC (Data Communication) network 60 to be connected and a plurality of web server devices 70 to be communicably connected to the DC network 60 are provided.

また、通信ネットワーク１は、ＤＮＳ（Domain Name System）サーバ装置８０と、ＩＰ Geolocation ＤＢ（Internet Protocol Geolocation DataBase）装置９０と、を備える。 In addition, the communication network 1 includes a DNS (Domain Name System) server device 80 and an IP Geolocation DB (Internet Protocol Geolocation DataBase) device 90.

ＤＮＳサーバ装置８０は、アクセス網２０、ＩＳＰ網３０又はインターネット網４０内に含まれており、ドメイン名とＩＰアドレスとが関連付けられたデータベースを備え、問い合わせに応じてドメイン名又はＩＰアドレスを返信する装置である。 The DNS server device 80 is included in the access network 20, the ISP network 30 or the Internet network 40, has a database in which domain names and IP addresses are associated, and returns a domain name or IP address in response to an inquiry. Device.

ＩＰ Geolocation ＤＢ装置９０はインターネット網４０内に含まれており、、ＩＰアドレスと当該ＩＰアドレスが付与された装置の名称及び位置情報（緯度、経度等）とが関連付けられたデータベースである。当該データベースにおいて、ＩＰアドレスが付与された装置のホストネーム、当該装置が所属するＩＳＰ網３０のＩＳＰ名、組織名、座標等がＩＰアドレスと関連付けられていてもよい。ＩＰ Geolocation ＤＢ装置９０としては、ＤＢ−ＩＰ、ipinfo.io、MAXMIND等のデータベース装置が利用可能である。 The IP Geolocation DB device 90 is included in the Internet network 40, and is a database in which an IP address is associated with the name and location information (latitude, longitude, etc.) of the device to which the IP address is assigned. In the database, the host name of the device to which the IP address is assigned, the ISP name of the ISP network 30 to which the device belongs, the organization name, coordinates, and the like may be associated with the IP address. As the IP Geolocation DB device 90, database devices such as DB-IP, ipinfo.io, and MAXMIND can be used.

＜故障被疑箇所推定装置＞
故障被疑箇所推定装置１００は、通信ネットワーク１における故障被疑箇所を推定する装置である。故障被疑箇所推定装置１００は、ＣＰＵ（Central Processing Unit）、ＲＯＭ（Read-Only Memory）、ＲＡＭ（Random Access Memory）、入出力回路等によって構成されており、機能部として、収集部１０１と、推定部１０２と、を備える。 <Sustained failure location estimation device>
The suspected failure location estimation apparatus 100 is a device that estimates a suspected failure location in the communication network 1. The suspected failure point estimation apparatus 100 is configured by a CPU (Central Processing Unit), a ROM (Read-Only Memory), a RAM (Random Access Memory), an input / output circuit, and the like. Unit 102.

≪収集部≫
収集部１０１は、故障被疑箇所を推定するために以下の情報を収集（取得）し、収集された情報を推定部１０２へ出力する。
１．ユーザ端末装置２がｗｅｂサーバ装置７０にアクセスした際にNavigation Timing API（Application Programming Interface）によって計測したｗｅｂページ読込時間情報
２．ユーザ端末装置２がGeolocation APIによって計測した当該ユーザ端末装置２の位置情報
３．ロードバランサ５０がCloud Watch APIによって計測したクラウドネットワーク４内の応答性情報
４．ｗｅｂサーバ装置７０がCloud Watch APIによって計測したｗｅｂサーバ装置７０の性能情報
５．ＩＰ Geolocation ＤＢ装置９０に記憶されたｗｅｂサーバ装置７０の位置情報、及び、ＩＳＰ情報（例えば、ＩＳＰ網３０の名称すなわちＩＳＰ名） ≪Collection Department≫
The collection unit 101 collects (acquires) the following information in order to estimate the suspected failure location, and outputs the collected information to the estimation unit 102.
1. 1. Web page read time information measured by a navigation timing API (Application Programming Interface) when the user terminal device 2 accesses the web server device 2. 2. location information of the user terminal device 2 measured by the user terminal device 2 using the Geolocation API; 3. Response information in the cloud network 4 measured by the load balancer 50 using the Cloud Watch API. 4. Performance information of the web server device 70 measured by the web server device 70 using the Cloud Watch API Location information of the web server device 70 stored in the IP Geolocation DB device 90 and ISP information (for example, the name of the ISP network 30, that is, the ISP name)

ここで、ユーザ端末装置２は、例えば以下に示す３つの手法によって各種の情報を自動収集するためのプログラム（Javascript（登録商標））が挿入されている。
１．ｗｅｂサーバ装置７０（ｗｅｂサービス事業者）が予めJavascriptを埋め込んだｗｅｂページを用意する。
２．プロキシ等を用意し、プロキシ等を通過する全てのｗｅｂページにJavascriptを強制的に挿入する。
３．ユーザに依頼し、Javascriptが埋め込まれたｗｅｂページをユーザ端末装置２のブックマークレット等に登録してもらう。
ユーザ端末装置２は、かかるJavascriptの実行によって、ｗｅｂページ読込時間情報、位置情報及びアクセス先のｗｅｂサーバ装置７０のアクセス先ＵＲＬ（Uniform Resource Locater）を収集し、収集された情報をAjax（Asynchronous JavaScript + XML）等を用いて故障被疑箇所推定装置１００へ送信する。 Here, in the user terminal device 2, for example, a program (Javascript (registered trademark)) for automatically collecting various types of information is inserted by the following three methods.
1. A web server device 70 (web service provider) prepares a web page in which Javascript is embedded in advance.
2. Prepare a proxy, etc., and forcibly insert Javascript into all web pages that pass through the proxy.
3. The user is requested to register a web page in which Javascript is embedded in a bookmarklet or the like of the user terminal device 2.
By executing the Javascript, the user terminal device 2 collects web page reading time information, location information, and access destination URL (Uniform Resource Locater) of the access destination web server device 70, and uses the collected information as Ajax (Asynchronous JavaScript). + XML) or the like, and transmits it to the suspected fault location estimation apparatus 100.

また、収集部１０１は、収集されたアクセス先ＵＲＬを用いてＤＮＳサーバ装置８０を参照する（nslookup）ことによって、アクセス先ＵＲＬに対応するアクセス先ＩＰアドレスを取得する。続いて、収集部１０１は、取得されたアクセス先ＩＰアドレスを用いてＩＰ Geolocation装置を参照することによって、アクセス先ＩＰアドレスに対応するｗｅｂサーバ装置７０の位置情報、及び、ＩＳＰ情報を取得する。 The collection unit 101 obtains an access destination IP address corresponding to the access destination URL by referring to the DNS server device 80 using the collected access destination URL (nslookup). Subsequently, the collection unit 101 acquires the location information and ISP information of the web server device 70 corresponding to the access destination IP address by referring to the IP Geolocation device using the acquired access destination IP address.

ユーザ端末装置２がGeolocation APIによって当該ユーザ端末装置２の位置情報を収集する手法としては、位置情報を1回だけ取得するgetCurrentPosition、位置情報を定期的に監視して取得するwatchPosition、及び、watchPositionによる位置情報の監視をクリアするclearWatchのいずれか等が用いられる。 As a method for collecting the location information of the user terminal device 2 by the Geolocation API, the user terminal device 2 uses getCurrentPosition that acquires the location information only once, watchPosition that periodically monitors and acquires the location information, and watchPosition. Any one of clearWatch that clears the monitoring of position information is used.

なお、ユーザ端末装置２の位置情報に関しては、収集部１０１が、ユーザ端末装置２のＩＰアドレスを用いてＩＰ Geolocation装置を参照することによっても取得可能である。 Note that the location information of the user terminal device 2 can also be obtained by the collection unit 101 referring to the IP Geolocation device using the IP address of the user terminal device 2.

＜Navigation Timing APIによるｗｅｂページ読込時間情報＞
Navigation Timing APIは、ユーザ端末装置２がブラウザでｗｅｂページを表示する際に、通信ネットワーク１のどこでどれだけの時間がかかっているのかを計測する手法である。図２に示すように、Navigation Timing APIによって計測されるプロセスは、Prompt for unload、redirect、unload、App cache、DNS、TCP、Request、Response、Processing、DOM Content Loaded、loadを含む。 <Web page reading time information by Navigation Timing API>
The Navigation Timing API is a method of measuring where and how long it takes in the communication network 1 when the user terminal device 2 displays a web page with a browser. As shown in FIG. 2, the processes measured by the Navigation Timing API include Prompt for unload, redirect, unload, App cache, DNS, TCP, Request, Response, Processing, DOM Content Loaded, and load.

ユーザ端末装置２は、Navigation Timing APIによって、ｗｅｂページ読込時間情報として以下に示す時刻情報を計測し、計測結果を故障被疑箇所推定装置１００へ送信する。
・startTime（navigationStart）：Prompt for unloadの終了時刻
・redirectStart：redirectの開始時刻
・redirectEnd：redirectの終了時刻
・unloadEventStart：unloadの開始時刻
・unloadEventEnd：unloadの終了時刻
・fetchStart：App cacheの開始時刻
・domainLookupStart：ＤＮＳサーバ装置８０におけるdomainLookupの開始時刻
・domainLookupEnd：ＤＮＳサーバ装置８０におけるdomainLookupの終了時刻
・connectStart：ＴＣＰ（Transmission Control Protocol）の接続開始時刻
・secureConnectionStart：ＴＣＰのsecureConnectionの開始時刻
・connectEnd：ＴＣＰの接続終了時刻
・requestStart：Requestの開始時刻
・responseStart：Responseの開始時刻
・responseEnd：Responseの終了時刻
・domLoading：Processingの開始時刻であり、ブラウザが最初に受け取ったＨＴＭＬ（Hyper Text Markup Language）ドキュメントのバイト解析の開始時刻（ＨＴＭＬドキュメント自体の読込開始時刻）
・domInteractive：ブラウザがＨＴＭＬの解析を完了し、ＤＯＭ（Document Object Model）構築を完了した時刻（ＨＴＭＬドキュメントの読込が完了し、非同期に取得することができる画像等のリソースの読込開始時刻）
・domComplete：Processingの終了時刻であり、ｗｅｂページの全てのリソース（画像等）のダウンロードの完了時刻
・domContentLoadedEventStart：DOMContentLoadedの開始時刻
・domContentLoadedEventEnd：DOMContentLoadedの終了時刻
・loadEventStart：loadの開始時刻
・loadEventEnd：loadの終了時刻 The user terminal device 2 measures the time information shown below as the web page reading time information by the Navigation Timing API, and transmits the measurement result to the suspected failure point estimation device 100.
-StartTime (navigationStart): Prompt for unload end time-redirectStart: redirect start time-redirectEnd: redirect end time-unloadEventStart: unload start time-unloadEventEnd: unload end time-fetchStart: App cache start time-domainLookupStart : DomainLookup start time in DNS server device 80 domainLookupEnd: domainLookup end time in DNS server device 80 connectStart: TCP (Transmission Control Protocol) connection start time secureconnectionStart: TCP secureConnection start time connectEnd: TCP connection End time • requestStart: Request start time • responseStart: Response start time • responseEnd: Response end time • domLoading: Processing start time, byte analysis of HTML (Hyper Text Markup Language) document received by browser first Start time (HTML document itself Body reading start time)
・ DomInteractive: The time when the browser completes the HTML analysis and the DOM (Document Object Model) construction is completed (the reading start time of images and other resources that can be acquired asynchronously when the reading of the HTML document is completed)
-DomComplete: Processing end time, download completion time of all resources (images, etc.) on the web page-domContentLoadedEventStart: DOMContentLoaded start time-domContentLoadedEventEnd: DOMContentLoaded end time-loadEventStart: load start time-loadEventEnd: load End time of

収集部１０１は、ユーザ端末装置２によって送信されたｗｅｂページ読込時間情報（時刻情報）を受信し、受信されたｗｅｂページ読込時間情報に基づいて、計測値０１〜１３（ｗｅｂページ読込時間情報）を算出し、算出された計測値０１〜１３を推定部１０２へ出力する。 The collection unit 101 receives the web page reading time information (time information) transmitted by the user terminal device 2, and based on the received web page reading time information, the measured values 01 to 13 (web page reading time information) And the calculated measurement values 01 to 13 are output to the estimation unit 102.

計測値０１〜１３について、図３を参照して説明する。
計測値０１は、トータルのｗｅｂページ読込時間である。
（計測値０１）＝（loadEventEnd）−（navigationStart）
計測値０１が長い場合には、通信ネットワーク１のどこかに故障が発生しているおそれがある。
計測値０２は、redirectに要する時間である。
（計測値０２）＝（redirectEnd）−（redirectStart）
計測値０２が長い場合には、ユーザ端末装置２に故障が発生しているおそれがある。
計測値０３は、unloadに要する時間である。
（計測値０３）＝（unloadEventEnd）−（unloadEventStart）
計測値０３が長い場合には、ユーザ端末装置２に故障が発生しているおそれがある。
計測値０４は、App cacheに要する時間である。
（計測値０４）＝（domainLookupStart）−（fetchStart）
計測値０４が長い場合には、ユーザ端末装置２に故障が発生しているおそれがある。
計測値０５は、DNS（名前解決）に要する時間である。
（計測値０５）＝（domainLookupEnd）−（domainLookupStart）
計測値０５が長い場合には、ユーザ端末装置２、アクセス網２０、ＩＳＰ網３０及びＤＮＳサーバ装置８０の少なくとも一つに故障が発生しているおそれがある。 The measured values 01 to 13 will be described with reference to FIG.
The measured value 01 is the total web page reading time.
(Measured value 01) = (loadEventEnd) − (navigationStart)
If the measured value 01 is long, there is a possibility that a failure has occurred somewhere in the communication network 1.
The measured value 02 is the time required for redirect.
(Measured value 02) = (redirectEnd) − (redirectStart)
When the measured value 02 is long, there is a possibility that a failure has occurred in the user terminal device 2.
The measurement value 03 is the time required for unloading.
(Measured value 03) = (unloadEventEnd) − (unloadEventStart)
When the measured value 03 is long, there is a possibility that a failure has occurred in the user terminal device 2.
The measured value 04 is the time required for App cache.
(Measured value 04) = (domainLookupStart)-(fetchStart)
When the measured value 04 is long, there is a possibility that a failure has occurred in the user terminal device 2.
The measured value 05 is a time required for DNS (name resolution).
(Measured value 05) = (domainLookupEnd)-(domainLookupStart)
When the measured value 05 is long, there is a possibility that a failure has occurred in at least one of the user terminal device 2, the access network 20, the ISP network 30, and the DNS server device 80.

計測値０６は、DNS（名前解決）からTCP-SYN送信までの内部処理に要する時間である。
（計測値０６）＝（connectStart）−（domainLookupEnd）
計測値０６が長い場合には、ユーザ端末装置２に故障が発生しているおそれがある。
計測値０７は、TCP接続処理（3-way handshake）に要する時間である。
（計測値０７）＝（connectEnd）−（connectStart）
計測値０７が長い場合には、ユーザ端末装置２、アクセス網２０、ＩＳＰ網３０、インターネット網４０、ＤＣ網６０、ｗｅｂサーバ装置７０、ＶＭ（Vertial Machine）及びｗｅｂＡＰＬ（APpLication）の少なくとも一つに故障が発生しているおそれがある。ここで、ｗｅｂサーバ装置７０は、ハードウェアリソース上に仮想化層として仮想化ソフトウェア（Hypervisor等）が実装されることによって仮想化されており、ＶＭ（Vertial Machine）として、１つの管理用仮想マシンと、複数の仮想マシンと、を備える。また、１つのＶＭにおいて、１つ以上のｗｅｂＡＰＬが作動している。
計測値０８は、TCP接続完了からHTTPリクエスト送信までの内部処理に要する時間である。
（計測値０８）＝（requestStart）−（connectEnd）
計測値０８が長い場合には、ユーザ端末装置２に故障が発生しているおそれがある。
計測値０９は、HTTPリクエスト送信から第一HTTPレスポンス受信までに要する時間である。
（計測値０９）＝（responseStart）−（requestStart）
計測値０９は、ユーザ端末装置２→ｗｅｂＡＰＬ→ユーザ端末装置２の往復通過時間である。
計測値０９が長い場合には、アクセス網２０、ＩＳＰ網３０、インターネット網４０、ＤＣ網６０、ｗｅｂサーバ装置７０、ＶＭ及びｗｅｂＡＰＬの少なくとも一つに故障が発生しているおそれがある。
計測値１０は、HTTPリクエスト受信に要する時間である。
（計測値１０）＝（responseEnd）−（responseStart）
計測値１０は、ｗｅｂＡＰＬ→ユーザ端末装置２の片道通過時間である。
計測値１０が長い場合には、アクセス網２０、ＩＳＰ網３０、インターネット網４０、ＤＣ網６０、ｗｅｂサーバ装置７０、ＶＭ及びｗｅｂＡＰＬの少なくとも一つに故障が発生しているおそれがある。 The measured value 06 is the time required for internal processing from DNS (name resolution) to TCP-SYN transmission.
(Measured value 06) = (connectStart)-(domainLookupEnd)
When the measured value 06 is long, there is a possibility that a failure has occurred in the user terminal device 2.
The measured value 07 is the time required for the TCP connection process (3-way handshake).
(Measured value 07) = (connectEnd) − (connectStart)
When the measured value 07 is long, at least one of the user terminal device 2, the access network 20, the ISP network 30, the Internet network 40, the DC network 60, the web server device 70, VM (Vertial Machine), and webAPL (APpLication). A failure may have occurred. Here, the web server device 70 is virtualized by mounting virtualization software (Hypervisor or the like) as a virtualization layer on a hardware resource, and one virtual machine for management is used as a VM (Vertial Machine). And a plurality of virtual machines. Further, one or more webAPLs are operating in one VM.
The measured value 08 is the time required for internal processing from TCP connection completion to HTTP request transmission.
(Measured value 08) = (requestStart) − (connectEnd)
When the measured value 08 is long, there is a possibility that a failure has occurred in the user terminal device 2.
The measured value 09 is the time required from the HTTP request transmission to the first HTTP response reception.
(Measured value 09) = (responseStart) − (requestStart)
The measured value 09 is the round trip time of the user terminal device 2 → webAPL → user terminal device 2.
When the measured value 09 is long, there is a possibility that a failure has occurred in at least one of the access network 20, the ISP network 30, the Internet network 40, the DC network 60, the web server device 70, the VM, and the webAPL.
The measured value 10 is the time required for receiving the HTTP request.
(Measured value 10) = (responseEnd) − (responseStart)
The measured value 10 is a one-way passage time of the web APL → user terminal device 2.
When the measurement value 10 is long, there is a possibility that a failure has occurred in at least one of the access network 20, the ISP network 30, the Internet network 40, the DC network 60, the web server device 70, the VM, and the webAPL.

計測値１１は、HTTPレスポンス受信完了からHTMLドキュメント読込開始までの内部処理に要する時間である。
（計測値１１）＝（domLoading）−（responseEnd）
計測値１１が長い場合には、ユーザ端末装置２に故障が発生しているおそれがある。
計測値１２は、HTMLドキュメント読込に要する時間である。
（計測値１２）＝（domInteractive）−（domLoading）
計測値１２が長い場合には、ユーザ端末装置２に故障が発生しているおそれがある。
計測値１３は、（非同期での）画像等読込に要する時間である。
（計測値１３）＝（domComplete）−（domInteractive）
計測値１３が長い場合には、ユーザ端末装置２に故障が発生しているおそれがある。 The measurement value 11 is the time required for internal processing from the completion of HTTP response reception to the start of HTML document reading.
(Measured value 11) = (domLoading) − (responseEnd)
When the measured value 11 is long, there is a possibility that a failure has occurred in the user terminal device 2.
The measured value 12 is the time required for reading the HTML document.
(Measured value 12) = (domInteractive) − (domLoading)
When the measured value 12 is long, there is a possibility that a failure has occurred in the user terminal device 2.
The measured value 13 is the time required for reading an image (asynchronously).
(Measured value 13) = (domComplete) − (domInteractive)
When the measured value 13 is long, there is a possibility that a failure has occurred in the user terminal device 2.

＜CloudWatch APIによる応答性情報＞
CloudWatch APIは、クラウドネットワーク４内の応答性に関する応答性情報を計測する手法である。クラウドネットワーク４内のロードバランサ５０は、CloudWatch APIによって、以下に示す応答性情報を計測し、計測結果を故障被疑箇所推定装置１００へ送信する。
・Latency（単位：秒）：ロードバランサ５０がリクエストを送信し、応答を受信するまでの経過時間 <Response information by CloudWatch API>
The CloudWatch API is a method for measuring responsiveness information related to responsiveness in the cloud network 4. The load balancer 50 in the cloud network 4 measures the following responsiveness information using the CloudWatch API, and transmits the measurement result to the failure suspected place estimation apparatus 100.
Latency (unit: seconds): Elapsed time until the load balancer 50 sends a request and receives a response

＜CloudWatch APIによる性能情報＞
また、CloudWatch APIは、ｗｅｂサーバ装置７０の性能に関する性能情報を計測する手法である。ｗｅｂサーバ装置（ＶＭすなわちＥＣ２）７０は、CloudWatch APIによって、以下に示す性能情報を計測し、計測結果を故障被疑箇所推定装置１００へ送信する。
・CPU Utilization（単位：％）：割り当てられたＥＣ２コンピュートユニットのうち、現在インスタンス上で使用されているものの比率
・NetworkIn（単位：Byte）：指定インスタンスによって受信されたバイト数であり、１インスタンス上の１アプリケーションへのネットワークトラヒック量
・NetworkOut（単位：Byte）：指定インスタンスから送信されたバイト数であり、１インスタンス上の１アプリケーションからのネットワークトラヒック量
本実施形態において、ｗｅｂサーバ装置７０の性能に関する性能情報は、ｗｅｂサーバ装置７０の稼働状態に関する稼働状態情報であるともいえる。 <Performance information by CloudWatch API>
The CloudWatch API is a method for measuring performance information related to the performance of the web server device 70. The web server device (VM, that is, EC2) 70 measures performance information shown below by CloudWatch API, and transmits the measurement result to the suspected failure point estimation device 100.
-CPU Utilization (unit:%): Ratio of allocated EC2 compute units currently used on the instance-NetworkIn (unit: Byte): Number of bytes received by the specified instance, on one instance Network traffic amount to one application of the network / NetworkOut (unit: Byte): This is the number of bytes transmitted from the specified instance, and the network traffic amount from one application on one instance. In this embodiment, the performance of the web server device 70 It can be said that the performance information is operating state information related to the operating state of the web server device 70.

収集部１０１は、ロードバランサ５０によって送信された応答性情報を受信し、受信された応答性情報に基づいて、計測値１４〜１８（応答性情報）を算出し、算出された計測値１４〜１８を推定部１０２へ出力する。 The collection unit 101 receives the responsiveness information transmitted by the load balancer 50, calculates measurement values 14-18 (responsiveness information) based on the received responsiveness information, and calculates the calculated measurement values 14-14. 18 is output to the estimation unit 102.

計測値１４〜１８について、図５を参照して説明する。
計測値１４は、ロードバランサ５０とｗｅｂＡＰＬ（ＶＭ）間のＲＴＴ（Round Trip Time）である。
（計測値１４）＝（Latency）
計測値１４が長い場合には、ＤＣ網６０、ｗｅｂサーバ装置７０、ＶＭ及びｗｅｂＡＰＬの少なくとも一つに故障が発生しているおそれがある。
計測値１５は、ｗｅｂＡＰＬのＣＰＵ使用率である。
（計測値１５）＝（CPU Utilization）
計測値１５が大きい場合には、ｗｅｂサーバ装置７０、ＶＭ及びｗｅｂＡＰＬの少なくとも一つに故障が発生しているおそれがある。
計測値１６は、ｗｅｂＡＰＬへのネットワークトラヒック流入量である。
（計測値１６）＝（NetworkIn）
計測値１６が急減した場合には、ユーザ端末装置２、アクセス網２０、ＩＳＰ網３０、インターネット網４０、ＤＣ網６０、ｗｅｂサーバ装置７０及びＶＭの少なくとも一つに故障が発生している、又は、単にトラヒック量が減少しているおそれがある。
計測値１７は、ｗｅｂＡＰＬからのネットワークトラヒック流出量である。
（計測値１７）＝（NetworkOut）
計測値１７が急減した場合には、ユーザ端末装置２、アクセス網２０、ＩＳＰ網３０、インターネット網４０、ＤＣ網６０、ｗｅｂサーバ装置７０、ＶＭ及びｗｅｂＡＰＬの少なくとも一つに故障が発生している、又は、単にトラヒック量が減少しているおそれがある。
計測値１８は、ｗｅｂＡＰＬにおけるトラヒック処理率である。
（計測値１８）＝（NetworkOut）／（NetworkIn）
計測値１８が小さい場合には、ｗｅｂＡＰＬに故障が発生しているおそれがある。 The measurement values 14 to 18 will be described with reference to FIG.
The measured value 14 is an RTT (Round Trip Time) between the load balancer 50 and the webAPL (VM).
(Measured value 14) = (Latency)
When the measured value 14 is long, there is a possibility that a failure has occurred in at least one of the DC network 60, the web server device 70, the VM, and the webAPL.
The measured value 15 is the CPU usage rate of webAPL.
(Measured value 15) = (CPU Utilization)
When the measured value 15 is large, there is a possibility that a failure has occurred in at least one of the web server device 70, VM, and webAPL.
The measurement value 16 is the amount of network traffic flowing into the webAPL.
(Measured value 16) = (NetworkIn)
If the measured value 16 suddenly decreases, a failure has occurred in at least one of the user terminal device 2, the access network 20, the ISP network 30, the Internet network 40, the DC network 60, the web server device 70, and the VM, or There is a risk that the traffic volume is simply decreasing.
The measured value 17 is an outflow amount of network traffic from the webAPL.
(Measured value 17) = (NetworkOut)
When the measured value 17 decreases rapidly, a failure has occurred in at least one of the user terminal device 2, the access network 20, the ISP network 30, the Internet network 40, the DC network 60, the web server device 70, the VM, and the web APL. Or, there is a possibility that the traffic volume is simply decreasing.
The measured value 18 is a traffic processing rate in webAPL.
(Measured value 18) = (NetworkOut) / (NetworkIn)
When the measured value 18 is small, there is a possibility that a failure has occurred in the webAPL.

なお、収集部１０１が各計測値０１〜１８を算出する手法に代えて、ユーザ端末装置２及びロードバランサ５０がそれぞれ各計測値０１〜１８を算出して故障被疑箇所推定装置１００へ送信する構成であってもよい。 Instead of the method in which the collection unit 101 calculates the measured values 01 to 18, the user terminal device 2 and the load balancer 50 calculate the measured values 01 to 18, respectively, and transmit them to the suspected fault location estimating apparatus 100. It may be.

＜Geolocation APIによる位置情報＞
Geolocation APIは、ユーザ端末装置２の位置情報を計測する手法である。ユーザ端末装置２は、Geolocation APIによって、図５に示す位置情報を計測し、計測結果を故障被疑箇所推定装置１００へ送信する。
・latitude：緯度
・longitude：経度
・altitude：高度
・accuracy：緯度及び経度の誤差
・altitudeAccuracy：高度の誤差
・heading：方角
・speed：速度 <Location information by Geolocation API>
The Geolocation API is a technique for measuring position information of the user terminal device 2. The user terminal device 2 measures the position information shown in FIG. 5 by the Geolocation API, and transmits the measurement result to the suspected failure point estimation device 100.
・ Latitude: latitude ・ longitude: longitude ・ altitude: altitude ・ accuracy: error in latitude and longitude ・ altitudeAccuracy: error in altitude ・ heading: direction ・ speed: speed

≪推定部≫
推定部１０２は、収集部１０１によって収集されて算出された計測値０１〜１８を取得し、取得された計測値０１〜１８に基づいて、故障被疑箇所を推定する。また、推定部１０２は、ネットワーク３が故障被疑箇所であると推定した場合に、収集されたユーザ端末装置２の位置情報に基づいて、ネットワーク３内における故障被疑箇所を絞り込む。かかる故障被疑箇所の推定手法及び絞り込み手法については、以下の動作例で詳細に説明する。 ≪Estimation part≫
The estimation unit 102 acquires measurement values 01 to 18 that are collected and calculated by the collection unit 101, and estimates a suspected failure location based on the acquired measurement values 01 to 18. Further, when the estimation unit 102 estimates that the network 3 is a suspected failure location, the estimation unit 102 narrows down the suspected failure location in the network 3 based on the collected location information of the user terminal device 2. The method for estimating and narrowing down the suspected failure location will be described in detail in the following operation example.

＜動作例＞
続いて、NavigationTiming API及びCloudWatch APIを用いた故障被疑箇所の推定方法（詳細には、計測値の収集後の閾値判定の方法）の一例について、図７及び図８を参照して説明する。なお、推定部１０２は、各計測値０１〜１４を閾値判定することによってＥ２Ｅサービスの品質劣化を検出し、各判定結果の組み合わせによって故障被疑箇所を推定する。推定部１０２は、計測値０１〜１４ごとの閾値を、過去の計測値に基づいて現在の計測値を最尤推定した推定結果に基づいて設定しており、かかる計測値０１〜１４ごとの閾値は、推定部１０２に予め記憶されている。閾値の設定には、各種分布、平均、最頻値（mode）等が利用可能である。すなわち、推定部１０２は、複数のユーザ端末装置２の計測値の計時変化を監視し、図６（ａ）に示すように１つのユーザ端末装置２に関する計測値が急増したり、図６（ｂ）に示すように１つのユーザ端末装置２に関する計測値の他の計測値に対するバランスが崩れたりした場合に、故障被疑箇所があると推定する。なお、以下の動作例では、計測値０７，１７が使用されていないため、これらの計測値０７，１７の収集を省略することも可能である。 <Operation example>
Next, an example of a method for estimating a suspected fault location using the NavigationTiming API and the CloudWatch API (specifically, a method for determining a threshold value after collecting measurement values) will be described with reference to FIGS. In addition, the estimation part 102 detects the quality degradation of E2E service by carrying out threshold determination of each measured value 01-14, and estimates a failure suspected place with the combination of each determination result. The estimation unit 102 sets a threshold value for each of the measurement values 01 to 14 based on an estimation result obtained by performing maximum likelihood estimation of the current measurement value based on the past measurement values, and the threshold value for each of the measurement values 01 to 14 Is stored in the estimation unit 102 in advance. Various distributions, averages, mode values, etc. can be used for setting the threshold. That is, the estimation unit 102 monitors the time change of the measurement values of the plurality of user terminal devices 2, and as shown in FIG. 6A, the measurement value regarding one user terminal device 2 increases rapidly, or FIG. ), It is estimated that there is a suspected failure location when the balance of the measurement values related to one user terminal device 2 with respect to other measurement values is lost. In the following operation example, since the measurement values 07 and 17 are not used, collection of these measurement values 07 and 17 can be omitted.

図７に示すように、計測値０１が閾値未満である場合（ステップＳ１でＹｅｓ）には、Ｅ２Ｅサービスに遅延は発生していないため、推定部１０２は、サービス正常である（故障被疑箇所なし）と判定する（ステップＳ２）。 As shown in FIG. 7, when the measured value 01 is less than the threshold value (Yes in step S1), the delay is not generated in the E2E service, so the estimation unit 102 is normal in service (no suspected failure location) ) (Step S2).

一方、計測値０１が閾値以上である場合（ステップＳ１でＮｏ）には、Ｅ２Ｅ（End toEnd）サービスに遅延が発生している。続いて、計測値１４が閾値未満である場合（ステップＳ３でＹｅｓ）には、クラウドネットワーク４内に遅延は発生していないため、推定部１０２は、ユーザ端末装置２及びネットワーク３（アクセス網２０、ＩＳＰ網３０、インターネット網４０及びＤＮＳサーバ装置８０）側に故障被疑箇所があると判定する（ステップＳ４）。 On the other hand, when the measured value 01 is equal to or greater than the threshold value (No in step S1), a delay occurs in the E2E (End to End) service. Subsequently, when the measured value 14 is less than the threshold value (Yes in step S3), since no delay has occurred in the cloud network 4, the estimation unit 102 uses the user terminal device 2 and the network 3 (access network 20). The ISP network 30, the Internet network 40, and the DNS server device 80) are determined to have a suspected failure location (step S4).

一方、計測値１４が閾値以上である場合（ステップＳ３でＮｏ）には、クラウド（ロードバランサ５０、ＤＣ網６０及びｗｅｂサーバ装置７０）内に遅延が発生しているため、推定部１０２は、クラウド側に故障被疑箇所があると判定する（ステップＳ５）。 On the other hand, when the measured value 14 is equal to or greater than the threshold (No in step S3), a delay occurs in the cloud (the load balancer 50, the DC network 60, and the web server device 70). It is determined that there is a suspected failure location on the cloud side (step S5).

ステップＳ５の後、計測値１５が閾値未満である場合（ステップＳ１１でＹｅｓ）には、ｗｅｂサーバ装置７０、ＶＭ（Virtial Machine）及びｗｅｂＡＰＬ（A Programming Language）は正常であるため、推定部１０２は、ＤＣ網６０が故障被疑箇所であると判定する（ステップＳ１２）。 After step S5, when the measured value 15 is less than the threshold value (Yes in step S11), the web server device 70, VM (Virtial Machine), and webAPL (A Programming Language) are normal, and therefore the estimation unit 102 The DC network 60 is determined to be a suspected failure location (step S12).

また、計測値１５が閾値以上であり（ステップＳ１１でＮｏ）、計測値１６が閾値未満である場合（ステップＳ１３でＹｅｓ）には、ＣＰＵ使用率が高く、かつ、ｗｅｂＡＰＬへの流入トラヒック量は増加していないため、推定部１０２は、ｗｅｂサーバ装置７０、ＶＭ及びｗｅｂＡＰＬの少なくとも一つが故障被疑箇所であると判定する（ステップＳ１４）。 If the measured value 15 is equal to or greater than the threshold (No in step S11) and the measured value 16 is less than the threshold (Yes in step S13), the CPU usage rate is high and the inflow traffic to the webAPL is Since the number has not increased, the estimation unit 102 determines that at least one of the web server device 70, VM, and webAPL is a suspected failure location (step S14).

また、計測値１６が閾値以上であり（ステップＳ１３でＮｏ）、計測値１８が閾値未満である場合（ステップＳ１５でＹｅｓ）には、ｗｅｂＡＰＬへの流入トラヒック量が増加しており、かつ、ｗｅｂＡＰＬの処理率が低下しているため、推定部１０２は、ｗｅｂＡＰＬが故障被疑箇所であると判定する（ステップＳ１６）。 Further, when the measured value 16 is equal to or greater than the threshold value (No in step S13) and the measured value 18 is less than the threshold value (Yes in step S15), the inflow traffic amount to the webAPL is increased, and the webAPL Therefore, the estimation unit 102 determines that the webAPL is a suspected failure location (step S16).

また、計測値１５が閾値以上、計測値１６が閾値以上、かつ、計測値１８が閾値以上である場合（ステップＳ１１，Ｓ１３，Ｓ１５でＮｏ）には、ＣＰＵ使用率が高く、ｗｅｂＡＰＬへの流入トラヒック量が増加し、かつ、ｗｅｂＡＰＬの処理率は正常であるため、推定部１０２は、サービス正常（ユーザ利用が増加した）と判定する（ステップＳ１７）。 When the measured value 15 is equal to or greater than the threshold value, the measured value 16 is equal to or greater than the threshold value, and the measured value 18 is equal to or greater than the threshold value (No in steps S11, S13, and S15), the CPU usage rate is high and the flow into the webAPL Since the traffic volume increases and the processing rate of webAPL is normal, the estimation unit 102 determines that the service is normal (user usage has increased) (step S17).

なお、ステップＳ１１，Ｓ１３，Ｓ１５における閾値判定は、順序を入れ替えて行われてもよく、並列に行われてもよい。 In addition, the threshold determination in steps S11, S13, and S15 may be performed by changing the order, or may be performed in parallel.

また、ステップＳ４の後、図８に示すように、計測値０２〜０４，０６，０８，１１〜１３のいずれかがそれぞれの閾値未満である場合（ステップＳ２１でＹｅｓ）には、ユーザ端末装置２内での各処理は正常であるため、推定部１０２は、アクセス網２０、ＩＳＰ網３０、インターネット網４０及びＤＮＳサーバ装置８０の少なくとも一つが故障被疑箇所であると判定する（ステップＳ２２）。 Further, after step S4, as shown in FIG. 8, when any of the measured values 02 to 04, 06, 08, and 11 to 13 is less than the respective threshold values (Yes in step S21), the user terminal device 2 is normal, the estimation unit 102 determines that at least one of the access network 20, the ISP network 30, the Internet network 40, and the DNS server device 80 is a suspected failure location (step S22).

ステップＳ２２の後、計測値０５が閾値未満である場合（ステップＳ２３でＹｅｓ）には、アクセス網２０、ＩＳＰ網３０及びＤＮＳサーバ装置８０は正常であるため、推定部１０２は、インターネット網４０が故障被疑箇所であると判定する（ステップＳ２４）。 If the measured value 05 is less than the threshold value after step S22 (Yes in step S23), the access network 20, the ISP network 30, and the DNS server device 80 are normal. It determines with it being a failure suspected location (step S24).

また、計測値０５が閾値以上であり（ステップＳ２３でＮｏ）、計測値０９，１０がともにそれぞれの閾値未満である場合（ステップＳ２５でＹｅｓ）には、アクセス網２０及びＩＳＰ網３０は正常であるため、推定部１０２は、ＤＮＳサーバ装置８０が故障被疑箇所であると判定する（ステップＳ２６）。 Further, when the measured value 05 is equal to or greater than the threshold value (No in step S23) and the measured values 09 and 10 are both less than the respective threshold values (Yes in step S25), the access network 20 and the ISP network 30 are normal. Therefore, the estimation unit 102 determines that the DNS server device 80 is a suspected failure location (step S26).

一方、計測値０５が閾値以上、かつ、計測値０９，１０の閾値のいずれかがそれぞれの閾値以上である場合（ステップＳ２３，Ｓ２５でＮｏ）には、アクセス網２０及びＩＳＰ網３０が異常であるため、推定部１０２は、アクセス網２０、ＩＳＰ網３０及びインターネット網４０の少なくとも一つが故障被疑箇所であると判定する（ステップＳ２７）。 On the other hand, when the measured value 05 is equal to or greater than the threshold and any of the measured values 09 and 10 is equal to or greater than each threshold (No in steps S23 and S25), the access network 20 and the ISP network 30 are abnormal. Therefore, the estimation unit 102 determines that at least one of the access network 20, the ISP network 30, and the Internet network 40 is a suspected failure location (step S27).

また、計測値０２〜０４，０６，０８，１１〜１３の閾値がともにそれぞれの閾値以上である場合（ステップＳ２１でＮｏ）には、ユーザ端末装置２内での処理のいずれかが異常であるため、推定部１０２は、ユーザ端末装置２が故障被疑箇所であると判定する（ステップＳ２８）。 When the threshold values of the measurement values 02 to 04, 06, 08, and 11 to 13 are both equal to or greater than the respective threshold values (No in step S21), one of the processes in the user terminal device 2 is abnormal. Therefore, the estimation unit 102 determines that the user terminal device 2 is a suspected failure location (step S28).

なお、ステップＳ２３，Ｓ２５における閾値判定は、順序を入れ替えて行われてもよく、並列に行われてもよい。 In addition, the threshold determination in steps S23 and S25 may be performed by changing the order, or may be performed in parallel.

また、ステップＳ２７の後に、推定部１０２は、アクセス網２０、ＩＳＰ網３０及びインターネット網４０の少なくとも一つが故障被疑箇所であるという判定結果と関連するユーザ端末装置２の位置情報に基づいて、故障被疑箇所を絞り込む（ステップＳ２９）。例えば、推定部１０２は、当該判定結果が得られたユーザ端末装置２の位置情報が、ある地域に集中している場合には、当該地域に対応するアクセス網２０が故障被疑箇所であると推定する。また、推定部１０２は、当該判定結果が得られたユーザ端末装置２の位置情報が広範囲（例えば、複数（２つ，３つ程度）の都道府県に跨る範囲）にわたる場合には、当該広範囲に対応するＩＳＰ網３０が故障被疑箇所であると推定する。また、推定部１０２は、前記以外、すなわち、当該判定結果が得られたユーザ端末装置２の位置情報がさらに広範囲にわたる場合には、インターネット網４０が故障被疑箇所であると推定する。 In addition, after step S27, the estimation unit 102 determines the failure based on the location information of the user terminal device 2 related to the determination result that at least one of the access network 20, the ISP network 30, and the Internet network 40 is a suspected failure location. The suspected places are narrowed down (step S29). For example, when the position information of the user terminal device 2 from which the determination result is obtained is concentrated in a certain area, the estimation unit 102 estimates that the access network 20 corresponding to the area is a suspected failure location. To do. Moreover, the estimation part 102 will be in the said wide range, when the positional information on the user terminal device 2 from which the said determination result was obtained covers wide range (for example, the range which straddles several (about two or three) prefectures). It is estimated that the corresponding ISP network 30 is a suspected failure location. In addition, the estimation unit 102 estimates that the Internet network 40 is a suspected failure location other than the above, that is, when the position information of the user terminal device 2 from which the determination result is obtained covers a wider range.

本発明の実施形態に係る故障被疑箇所推定装置１００及び故障被疑箇所推定方法は、ｗｅｂページ読込時間情報（例えば、NavigationTiming APIによって計測）、応答性情報（例えば、CloudWatch APIによって計測）及び性能情報（例えば、CloudWatch APIによって計測）を用いて通信ネットワーク１における故障被疑箇所を推定するので、複数事業者を跨いだＥ２Ｅサービスや大規模なＥ２Ｅサービスにおいても故障被疑箇所を好適に推定することができる。
また、故障被疑箇所推定装置１００及び故障被疑箇所推定方法は、ｗｅｂサービス以外のサービスに関しても、故障被疑箇所において何らかの影響があると間接的に推定することができる。 The suspected failure location estimation apparatus 100 and the suspected failure location estimation method according to the embodiment of the present invention include web page reading time information (for example, measured by the NavigationTiming API), responsiveness information (for example, measured by the CloudWatch API), and performance information ( For example, the suspected failure location in the communication network 1 is estimated using (measured by CloudWatch API), and therefore the suspected failure location can be suitably estimated even in an E2E service or a large-scale E2E service across multiple operators.
Moreover, the failure suspected place estimation apparatus 100 and the failure suspected place estimation method can indirectly estimate that there is some influence on the trouble suspected place even for services other than the web service.

また、故障被疑箇所推定装置１００及び故障被疑箇所推定方法は、ネットワーク３が故障被疑箇所である場合にユーザ端末装置２の位置情報に基づいて故障被疑箇所を絞り込むので、故障被疑箇所をより好適に推定することができる。 Further, the suspected failure point estimation device 100 and the suspected failure location estimation method narrow down the suspected failure location based on the location information of the user terminal device 2 when the network 3 is the suspected failure location, so that the suspected failure location is more suitably selected. Can be estimated.

以上、本発明の実施形態について説明したが、本発明は前記実施形態に限定されず、本発明の要旨を逸脱しない範囲で適宜変更可能である。例えば、ステップＳ１２，Ｓ１４の後にそれぞれステップＳ１３，Ｓ１５へ移行するフローとしたり、ステップＳ２４の後にステップＳ２５へ移行するフローとしたりすることによって、多重の故障被疑箇所をより好適に推定することができる。
また、本発明は、コンピュータを故障被疑箇所推定装置１００として機能させる故障被疑箇所推定プログラムとしても具現化可能である。 As mentioned above, although embodiment of this invention was described, this invention is not limited to the said embodiment, In the range which does not deviate from the summary of this invention, it can change suitably. For example, multiple suspected fault locations can be estimated more suitably by using a flow that moves to steps S13 and S15 after steps S12 and S14, or a flow that moves to step S25 after step S24. .
The present invention can also be embodied as a suspected failure location estimation program that causes a computer to function as the failure suspected location estimation apparatus 100.

１通信ネットワーク
２ユーザ端末装置
３ネットワーク
４クラウドネットワーク
７０ｗｅｂサーバ装置
１００故障被疑箇所推定装置
１０１収集部
１０２推定部 DESCRIPTION OF SYMBOLS 1 Communication network 2 User terminal device 3 Network 4 Cloud network 70 Web server apparatus 100 Failure suspected part estimation apparatus 101 Collection part 102 Estimation part

Claims

A suspected failure location estimation device for estimating a suspected failure location in a communication network between a user terminal device and a web server device in a cloud network,
A collection unit that collects web page reading time information relating to the time required for web page reading in each part of the communication network, responsiveness information relating to responsiveness in the cloud network, and performance information relating to the performance of the web server device. When,
Based on the collected web page reading time information, the responsiveness information, and the performance information, an estimation unit that estimates the suspected failure location in the communication network;
An apparatus for estimating a suspected fault location, comprising:

The communication network includes a network that connects the user terminal device and the cloud network in a communicable manner,
The collection unit collects position information of the user terminal device,
The said estimation part narrows down the said suspected fault location in the said network based on the collected said positional information, when the said network presumes that it is the said suspected fault location. Failure suspected part estimation device.

A suspected fault location estimation method by a suspected fault location estimation device that estimates a suspected fault location in a communication network between a user terminal device and a web server device in a cloud network,
Web page reading time information related to the time required for Web page reading in each part of the communication network, the responsiveness information related to the responsiveness in the cloud network, and the performance related to the performance of the web server device. A collection step for collecting information; and
An estimation step in which the suspected failure location estimation device estimates the suspected failure location in the communication network based on the collected web page reading time information, the responsiveness information, and the performance information;
A suspected failure point estimation method characterized by comprising:

The communication network includes a network that connects the user terminal device and the cloud network in a communicable manner,
In the collecting step, the failure suspected place estimating device collects position information of the user terminal device,
In the estimation step, the suspected fault location estimating apparatus narrows down the suspected fault location in the network based on the collected location information when the network presumes that the network is the suspected fault location. The method for estimating a suspected fault location according to claim 3.

A computer as a suspected failure location estimating device for estimating a suspected failure location in a communication network between a user terminal device and a web server device in a cloud network,
A collection unit that collects web page reading time information relating to the time required for web page reading in each part of the communication network, responsiveness information relating to responsiveness in the cloud network, and performance information relating to the performance of the web server device. ,
Based on the collected web page reading time information, the responsiveness information, and the performance information, an estimation unit that estimates the suspected failure location in the communication network,
Failure suspected part estimation program to function as.

The communication network includes a network that connects the user terminal device and the cloud network in a communicable manner,
The collection unit collects position information of the user terminal device,
The said estimation part narrows down the said suspected fault location in the said network based on the collected said positional information, when it presumes that the said network is the said suspected fault location. Failure suspected part estimation program.