JP6618971B2

JP6618971B2 - Estimation apparatus, estimation method, and program

Info

Publication number: JP6618971B2
Application number: JP2017202490A
Authority: JP
Inventors: 川口　銀河; 銀河川口
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2017-10-19
Filing date: 2017-10-19
Publication date: 2019-12-11
Anticipated expiration: 2037-10-19
Also published as: JP2019075030A

Description

本発明は、推定装置、推定方法及びプログラムに関する。 The present invention relates to an estimation device, an estimation method, and a program.

インターネットの利用が普及している中で、ｗｅｂブラウザを快適に利用できるかどうかが、ユーザが「インターネットを快適に利用できるか」について大きな位置を占めている。 With the widespread use of the Internet, whether or not a web browser can be used comfortably occupies a large position as to whether the user can use the Internet comfortably.

そのため、ユーザがｗｅｂページを閲覧する際の表示待ち時間を対象に品質管理することが重要である。 Therefore, it is important to perform quality control for the display waiting time when the user browses the web page.

また、一般にＬＴＥ（Long Term Evolution）に代表されるようなモバイルネットワークでは、混雑状況が、エリア・時間帯で異なるため、エリアごとに利用品質が異なり、また時間帯によっても品質が異なる。 In general, in a mobile network represented by LTE (Long Term Evolution), the congestion status varies depending on the area and time zone, so the use quality differs for each area, and the quality also varies depending on the time zone.

エリア・時間帯ごとの品質を把握する方法としては、個別端末から実際にコンテンツアクセスを行い計測する方法（アクティブ計測）がある（例えば、特許文献１）。 As a method of grasping the quality for each area / time zone, there is a method (active measurement) in which content is actually accessed and measured from an individual terminal (for example, Patent Document 1).

特開２０１７−９２６６１号公報JP 2017-92661 A

しかしながら、各エリア・時間帯でアクティブ計測を行うのは経済的ではない。また、ｗｅｂページ・サーバは千差万別であり、個別のｗｅｂページごとに同じネットワークの品質劣化の影響下でも、表示待ち時間への影響に差異等があるため、ｗｅｂページの表示待ち時間の品質（劣化程度）の判断が困難である。 However, it is not economical to perform active measurement in each area and time zone. In addition, the web page server is very different, and there is a difference in the influence on the display waiting time even under the influence of the same network quality degradation for each individual web page. Judgment of quality (deterioration degree) is difficult.

本発明は、上記の点に鑑みてなされたものであって、個別のｗｅｂページの影響が除去された表示待ち時間の品質の推定を可能とすることを目的とする。 The present invention has been made in view of the above points, and an object of the present invention is to make it possible to estimate the quality of a display waiting time from which the influence of individual web pages is removed.

そこで上記課題を解決するため、推定装置は、地域及び時間帯の少なくともいずれか一方が異なる複数回の試行ごとに、ｗｅｂページの表示待ち時間の計測データに基づいて、前記表示待ち時間の品質を示す指標の代表値を算出する算出部と、前記試行ごとに、当該試行に係る前記代表値について、閾値との比較結果を示す論理値を出力する比較部と、前記試行ごとに、当該試行に係る地域及び時間帯のそれぞれにおける通信トラフィックの状況を示すパラメータを抽出する抽出部と、前記各試行の前記論理値を目的変数とし、前記各試行の前記パラメータを説明変数として推定器を学習させる学習部と、或る地域及び時間帯における前記パラメータを前記推定器に入力して、前記地域及び時間帯におけるｗｅｂページの表示待ち時間の品質を推定する推定部と、を有する。 Therefore, in order to solve the above-described problem, the estimation device determines the quality of the display waiting time based on the display waiting time measurement data on the web page for each of a plurality of trials in which at least one of the region and the time zone is different. A calculation unit that calculates a representative value of the index to be indicated, a comparison unit that outputs a logical value indicating a comparison result with a threshold value for the representative value related to the trial for each trial, and a trial for each trial. An extraction unit that extracts parameters indicating the status of communication traffic in each region and time zone, and learning that causes the estimator to learn using the logical value of each trial as an objective variable and the parameters of each trial as explanatory variables And the parameters in a certain region and time zone are input to the estimator, and the display latency quality of the web page in the region and time zone Having an estimation unit for estimating for.

個別のｗｅｂページの影響が除去された表示待ち時間の品質の推定を可能とすることができる。 It is possible to estimate the quality of the display latency without the influence of individual web pages.

本発明の実施の形態における推定装置１０のハードウェア構成例を示す図である。It is a figure which shows the hardware structural example of the estimation apparatus 10 in embodiment of this invention. 本発明の実施の形態における推定装置１０の機能構成例を示す図である。It is a figure which shows the function structural example of the estimation apparatus 10 in embodiment of this invention. ｗｅｂ劣化分析部１１が実行する処理手順の一例を説明するためのフローチャートである。It is a flowchart for demonstrating an example of the process sequence which the web deterioration analysis part 11 performs. トラフィック情報抽出部１２が実行する処理手順の一例を説明するためのフローチャートである。It is a flowchart for demonstrating an example of the process sequence which the traffic information extraction part 12 performs. 学習段階において品質推定部１３が実行する処理手順の一例を説明するためのフローチャートである。It is a flowchart for demonstrating an example of the process sequence which the quality estimation part 13 performs in a learning stage. 推定段階において推定装置１０が実行する処理手順の一例を説明するためのフローチャートである。It is a flowchart for demonstrating an example of the process sequence which the estimation apparatus 10 performs in an estimation stage.

以下、図面に基づいて本発明の実施の形態を説明する。図１は、本発明の実施の形態における推定装置１０のハードウェア構成例を示す図である。図１の推定装置１０は、それぞれバスＢで相互に接続されているドライブ装置１００、補助記憶装置１０２、メモリ装置１０３、ＣＰＵ１０４、及びインタ段階装置１０５等を有するコンピュータである。 Hereinafter, embodiments of the present invention will be described with reference to the drawings. FIG. 1 is a diagram illustrating a hardware configuration example of an estimation apparatus 10 according to an embodiment of the present invention. The estimation device 10 in FIG. 1 is a computer having a drive device 100, an auxiliary storage device 102, a memory device 103, a CPU 104, an inter-stage device 105, and the like that are mutually connected by a bus B.

推定装置１０での処理を実現するプログラムは、ＣＤ−ＲＯＭ等の記録媒体１０１によって提供される。プログラムを記憶した記録媒体１０１がドライブ装置１００にセットされると、プログラムが記録媒体１０１からドライブ装置１００を介して補助記憶装置１０２にインストールされる。但し、プログラムのインストールは必ずしも記録媒体１０１より行う必要はなく、ネットワークを介して他のコンピュータよりダウンロードするようにしてもよい。補助記憶装置１０２は、インストールされたプログラムを格納すると共に、必要なファイルやデータ等を格納する。 A program that realizes processing in the estimation apparatus 10 is provided by a recording medium 101 such as a CD-ROM. When the recording medium 101 storing the program is set in the drive device 100, the program is installed from the recording medium 101 to the auxiliary storage device 102 via the drive device 100. However, the program need not be installed from the recording medium 101 and may be downloaded from another computer via a network. The auxiliary storage device 102 stores the installed program and also stores necessary files and data.

メモリ装置１０３は、プログラムの起動指示があった場合に、補助記憶装置１０２からプログラムを読み出して格納する。ＣＰＵ１０４は、メモリ装置１０３に格納されたプログラムに従って推定装置１０に係る機能を実行する。インタ段階装置１０５は、ネットワークに接続するためのインタ段階として用いられる。 The memory device 103 reads the program from the auxiliary storage device 102 and stores it when there is an instruction to start the program. The CPU 104 executes a function related to the estimation device 10 according to a program stored in the memory device 103. The inter-stage device 105 is used as an inter-stage for connecting to a network.

図２は、本発明の実施の形態における推定装置１０の機能構成例を示す図である。図２において、推定装置１０は、ｗｅｂ劣化分析部１１、トラフィック情報抽出部１２及び品質推定部１３等を有する。これら各部は、推定装置１０にインストールされた１以上のプログラムが、ＣＰＵ１０４に実行させる処理により実現される。 FIG. 2 is a diagram illustrating a functional configuration example of the estimation device 10 according to the embodiment of the present invention. In FIG. 2, the estimation device 10 includes a web deterioration analysis unit 11, a traffic information extraction unit 12, a quality estimation unit 13, and the like. Each of these units is realized by processing that one or more programs installed in the estimation apparatus 10 cause the CPU 104 to execute.

ｗｅｂ劣化分析部１１は、エリア（地域）及び時間帯の組み合わせによって区別されるＺ回の試行のそれぞれごとに、１種類以上（Ｋ種類）のｗｅｂページ（ｗｅｂコンテンツ）についての端末における表示待ち時間の計測結果の集合である端末計測データＤ１を分析する。ｗｅｂ劣化分析部１１は、分析の結果として、表示待ち時間の品質を示す指標の一例である劣化度等を出力する。 The web degradation analysis unit 11 displays the display waiting time at the terminal for one or more types (K types) of web pages (web contents) for each of Z trials distinguished by the combination of area (region) and time zone. Terminal measurement data D1, which is a set of measurement results, is analyzed. As a result of the analysis, the web deterioration analysis unit 11 outputs a deterioration degree, which is an example of an index indicating the quality of the display waiting time.

トラフィック情報抽出部１２は、上記各試行のエリア及び時間帯におけるネットワークのトラフィック計測データＤ２から、通信トラフィックの状況を示すパラメータ等を抽出（取得）する。 The traffic information extraction unit 12 extracts (acquires) a parameter or the like indicating the state of communication traffic from the network traffic measurement data D2 in each trial area and time zone.

品質推定部１３は、学習段階において、ｗｅｂ劣化分析部１１のから出力される情報とトラフィック情報抽出部１２によって抽出される情報とを１以上推定器に学習させる。品質推定部１３は、また、推定段階において、任意のエリア及び時間帯におけるトラフィック計測データＤ２が入力されると、当該エリア及び時間帯におけるｗｅｂページの表示待ち時間の劣化の程度を、１以上の推定器を用いて推定する。 In the learning stage, the quality estimation unit 13 causes the estimator to learn one or more information output from the web deterioration analysis unit 11 and information extracted by the traffic information extraction unit 12. In addition, when the traffic measurement data D2 in an arbitrary area and time zone is input in the estimation stage, the quality estimation unit 13 sets the degree of deterioration of the display waiting time of the web page in the area and time zone to 1 or more. Estimate using an estimator.

推定器は、例えば、ＳＶＭ（Support Vector Machine）である。但し、数値ベクトルから１つの論理値を推定可能であれば他の推定器が用いられてもよい。 The estimator is, for example, an SVM (Support Vector Machine). However, other estimators may be used as long as one logical value can be estimated from a numerical vector.

端末計測データＤ１の収集方法の一例について説明する。ここでは、計測対象のｗｅｂページのＵＲＬとして、ＵＲＬ＿１，ＵＲＬ＿２...ＵＲＬ＿Ｋ：（Ｋ個の計測対象ＵＲＬ；Ｋは１以上の任意の自然数（すなわち、１つのＵＲＬでもよい。））が設定されているとする。ＵＲＬは、特定のものに限定されないが、よく利用されるようなｗｅｂページを多様な種類を含むのが望ましい。 An example of a method for collecting terminal measurement data D1 will be described. Here, URL_1, URL_2... URL_K: (K measurement target URLs; K is an arbitrary natural number of 1 or more (that is, one URL may be used)) is set as the URL of the measurement target web page. Suppose that The URL is not limited to a specific URL, but it is desirable to include various types of web pages that are frequently used.

試行ごとに以下が行われる。なお、試行とは、上記したように、計測対象のエリア及び時間帯（期間）の組み合わせによって区別される概念である。したがって、エリア及び時間帯の少なくともいずれか一方が異なると、試行は異なる。 For each trial: The trial is a concept that is distinguished by a combination of an area to be measured and a time zone (period) as described above. Therefore, if at least one of the area and the time zone is different, the trial is different.

（１）１つの試行に対応するエリア・時間帯について、Ｋ個の計測対象ＵＲＬについて、表示待ち時間を、ｗｅｂページを表示可能な端末を用いてＰ回計測する。すなわち、１回の試行についてＰ×Ｋ回の計測が行われる。但し、全てのＵＲＬについて、Ｐの値は共通でなくてもよい。また、Ｐ＝１であってもよい。すなわち、各ＵＲＬについて、必ずしも複数回の計測が行われなくてもよい。当該エリア及び時間帯の代表値を取得するため、同一エリア及び同一時間帯の範囲内で、ある程度異なる場所及び時刻で計測が行われるのが望ましいが、計測パターン（例えば、同一エリア及び同一時間帯で、場所及び時刻をどのように選択するか）は任意でよく、計測手法は、特定のものに限定されない。また、表示待ち時間を取得する方法も特定のものに限定されないが、navigation timing APIを用いてnavigationStartとloadEventEndとの時間差を用いる方法が利用されてもよい。なお、各試行におけるＰ回の計測は、集計時間幅Ｗ内に終了するとする。 (1) For the area / time zone corresponding to one trial, the display waiting time is measured P times for K measurement target URLs using a terminal capable of displaying a web page. That is, P × K measurements are performed for one trial. However, the value of P may not be common for all URLs. Moreover, P = 1 may be sufficient. That is, it is not always necessary to perform multiple measurements for each URL. In order to obtain the representative values of the area and time zone, it is desirable that measurements are performed at somewhat different places and times within the same area and time zone, but the measurement pattern (for example, the same area and the same time zone) And how to select the place and time) may be arbitrary, and the measurement method is not limited to a specific one. The method for acquiring the display wait time is not limited to a specific method, but a method using a time difference between navigationStart and loadEventEnd using the navigation timing API may be used. Note that the P measurements in each trial are completed within the total time width W.

（２）（１）をＺ通りの試行について実行する。 (2) Perform (1) for Z trials.

（１）及び（２）の実行の結果として取得される端末計測データＤ１は、以下において、「Ｌ（ｚ，ｙ，ｘ）」として表記される計測値（表示待ち時間）の集合である。ここで、Ｌ（ｚ，ｙ，ｘ）は、ｚ回目（ｚ：１〜Ｚ）試行のｙ番目ＵＲＬ（ｙ：１〜Ｋ）のｘ回目（ｘ：１〜Ｐ）計測での表示待ち時間を示す。 The terminal measurement data D1 acquired as a result of the execution of (1) and (2) is a set of measurement values (display waiting time) denoted as “L (z, y, x)” below. Here, L (z, y, x) is a display waiting time in the x-th (x: 1 to P) measurement of the y-th URL (y: 1 to K) of the z-th (z: 1 to Z) trial. Indicates.

次に、トラフィック計測データＤ２の収集方法について説明する。こここでは、上記各試行に対応するエリア及び時間帯ごとの通信トラフィックの品質情報を、個別アクティブ計測なしに評価するため、例えば、当該エリア内の全て、又は（サンプリングしている場合は）一部の通信トラフィックを処理するルータ等の装置のモニタ用ポートにＤＰＩ（Deep Packet Inspection）装置を接続し、ＤＰＩ装置を用いてトラフィックを監視することで、トラフィック計測データＤ２を収集する。したがって、トラフィック計測データＤ２は、ＤＰＩ装置によって監視されるトラフィックの履歴情報（ログ）であってもよい。監視対象は、Ｚ回の試行のそれぞれに対応するエリア及び時間帯（時間幅Ｗ）のトラフィックとする（但し、表示待ち時間の計測に関するトラフィックに限定されない）。表示待ち時間の計測がモバイル・ＬＴＥであれば特定の基地局（ｅＮＢ）又は特定のセクタ（ＥＣＩ）におけるトラフィックが監視対象とされ、固定網であれば特定の収容ルータ等におけるトラフィックが監視対象とされるが、これらの基地局、セクタ、又は収容ルータ等の特定方法は、所定のものに限定されない。網上流でモニタ出来る権限を持つものは当然に把握できているものとする。 Next, a method for collecting the traffic measurement data D2 will be described. Here, in order to evaluate the communication traffic quality information for each area and each time zone corresponding to each trial without the individual active measurement, for example, all of the area or (if sampling) one Traffic measurement data D2 is collected by connecting a DPI (Deep Packet Inspection) device to a monitoring port of a device such as a router that processes the communication traffic of each unit and monitoring the traffic using the DPI device. Therefore, the traffic measurement data D2 may be history information (log) of traffic monitored by the DPI device. The monitoring target is the traffic corresponding to each of the Z trials and the traffic in the time zone (time width W) (however, it is not limited to the traffic related to the measurement of the display waiting time). If the display waiting time measurement is mobile / LTE, traffic in a specific base station (eNB) or a specific sector (ECI) is to be monitored, and if it is a fixed network, traffic in a specific accommodating router or the like is to be monitored. However, the identification method of these base stations, sectors, accommodating routers, etc. is not limited to a predetermined one. It is assumed that those who have the authority to monitor at the upstream side of the network are naturally understood.

以下、推定装置１０が実行する処理手順について説明する。図３は、ｗｅｂ劣化分析部１１が実行する処理手順の一例を説明するためのフローチャートである。 Hereinafter, the process procedure which the estimation apparatus 10 performs is demonstrated. FIG. 3 is a flowchart for explaining an example of a processing procedure executed by the web deterioration analysis unit 11.

ステップＳ１０１において、ｗｅｂ劣化分析部１１は、Ｋ個のＵＲＬのそれぞれについて、表示待ち時間に関する基準値を算出する。例えば、或るＵＲＬに対する基準値であれば、当該ＵＲＬに係るＺ×Ｐ個の計測値の中央値、１０％値、最小値等が、基準値とされてもよい。又は、基準値は、推定装置１０のユーザによって事前に設定されてもよい。以下、ＵＲＬ＿１，ＵＲＬ＿２...ＵＲＬ＿Ｋに対する基準値を、Ｗｂ＿１，Ｗｂ＿２，．．Ｗｂ＿Ｋとする。この場合、中央値を基準値とするのであれば、Ｗｂ＿１の基準値は以下の通りである。
Ｗｂ＿１＝（Ｌ（ｚ，１，ｘ）の表示待ち時間の集合の中央値）
続いて、ｗｅｂ劣化分析部１１は、Ｚ回の試行ごとに、Ｋ個の各ＵＲＬに対する「劣化度（Ｄｗ（ｚ，ｙ，ｘ）」を以下のように算出する（Ｓ１０２）。
Ｄｗ（ｚ，ｙ，ｘ）＝Ｌ（ｚ，ｙ，ｘ）／Ｗｂ＿ｙ
すなわち、劣化度Ｄｗ（ｚ，ｙ，ｘ）は、基準値に対する計測値の割合であり、本実施の形態において、ｗｅｂページの表示待ち時間の品質（劣化の程度）を示す指標の一例である。なお、ｗｅｂページの表示待ち時間の品質を把握可能な方法であれば、他の方法によって劣化度Ｄｗ（ｚ，ｙ，ｘ）が算出されてもよい。 In step S101, the web deterioration analysis unit 11 calculates a reference value related to the display waiting time for each of the K URLs. For example, if it is a reference value for a URL, the median value, 10% value, minimum value, etc. of Z × P measurement values related to the URL may be used as the reference value. Alternatively, the reference value may be set in advance by the user of the estimation device 10. Hereinafter, reference values for URL_1, URL_2 ... URL_K are set as Wb_1, Wb_2,. . Let Wb_K. In this case, if the median is used as the reference value, the reference value of Wb_1 is as follows.
Wb_1 = (median value of the set of display waiting times of L (z, 1, x))
Subsequently, the web deterioration analysis unit 11 calculates “deterioration degree (Dw (z, y, x))” for each of the K URLs for each of Z trials as follows (S102).
Dw (z, y, x) = L (z, y, x) / Wb_y
That is, the deterioration degree Dw (z, y, x) is a ratio of the measured value to the reference value, and is an example of an index indicating the quality (deterioration degree) of the display wait time of the web page in the present embodiment. . Note that the degradation degree Dw (z, y, x) may be calculated by another method as long as the quality of the display waiting time of the web page can be grasped.

続いて、ｗｅｂ劣化分析部１１は、Ｚ回の試行ごとに、「劣化度代表値（Ｄ＿ｚ）」を算出する（Ｓ１０３）。 Subsequently, the web deterioration analysis unit 11 calculates a “deterioration degree representative value (D_z)” for every Z trials (S103).

具体的には、ｗｅｂ劣化分析部１１は、Ｄ＿ｚ＝（Ｄｗ（ｉ，ｊ，ｌ）のうち，ｉがｚに固定され、ｊ及びｌは、それぞれが取りうる全ての値である集合（すなわち、Ｐ×Ｋ個の要素を含む集合）を、全てのｚについて取り出し、各集合のそれぞれのｄ＿ｔｈ分位点（ｑｕａｎｔｉｌｅ）の値）を算出する。ｄ＿ｔｈは、分位点で与える劣化閾値であり、予め設定される。例えば、ｄ＿ｔｈが、０．８（すなわち、８０パーセンタイル）であれば、Ｄｗ（ｚ，ｊ，ｌ）の集合の８０％値（８０パーセンタイル）が算出される。なお、Ｋ＝１であり、かつ、Ｐ＝１である場合、劣化度代表値（Ｄ＿ｚ）＝劣化度（Ｄｗ（ｚ，1，１）とされてもよい。 More specifically, the web deterioration analysis unit 11 determines that a set of D_z = (Dw (i, j, l), i is fixed to z, and j and l are all possible values (ie, , A set including P × K elements) is extracted for all z, and the respective d_th quantile values of each set are calculated. d_th is a deterioration threshold given by the quantile, and is set in advance. For example, if d_th is 0.8 (that is, the 80th percentile), the 80% value (80th percentile) of the set of Dw (z, j, l) is calculated. When K = 1 and P = 1, the deterioration level representative value (D_z) = deterioration level (Dw (z, 1, 1) may be set.

続いて、ｗｅｂ劣化分析部１１は、Ｚ回の試行ごとに「劣化度論理値（Ｄｂ＿（ｚ，ｓ））」を算出する（Ｓ１０４）。ここで、ｓは、Ｄ＿ｚに対する劣化判定閾値劣化判定閾値に対する変数であり、本実施の形態では、予め、（−ｍ，−（ｍ−１），...，−１，０，１，２，...ｎ−１，ｎ）のｍ＋ｎ＋１個だけ、多段階のＴｓｈ＿ｓが設定される。すなわち、ｍ及びｎは、劣化判定閾値Ｔｓｈ＿ｓの数（段階数）を規定するパラメータであり、予め設定される。なお、ｎ＝ｍでもよいし、ｎ≠ｍでもよい。また、推定器を１つのみ用いる場合には、ｍ＝ｎ＝０でよい（すなわち、Ｔｓｈ＿ｓは１つでよい）。 Subsequently, the web deterioration analysis unit 11 calculates “deterioration degree logical value (Db_ (z, s))” for each Z trials (S104). Here, s is a variable for the deterioration determination threshold for D_z. In the present embodiment, (−m, − (m−1),. ,...,..., N−1, n) are set to m + n + 1 multi-stage Tsh_s. That is, m and n are parameters that define the number (the number of steps) of the deterioration determination threshold value Tsh_s, and are set in advance. Note that n = m or n ≠ m. Further, when only one estimator is used, m = n = 0 may be used (that is, one Tsh_s may be used).

したがって、１回の試行に関して以下のように算出されるＤｂ＿（ｚ，ｓ）の個数は、ｍ＋ｎ＋１個である。
Ｄｂ＿（ｚ，ｓ）＝ｂｏｏｌ（Ｄ＿ｚ＞Ｔｓｈ＿ｓ）
すなわち、各試行について、劣化判定閾値Ｔｓｈ＿ｓごとに、Ｄ＿ｚと当該劣化判定閾値Ｔｓｈ＿ｓとが比較され、比較結果を示す論理値が、Ｄｂ＿（ｚ，ｓ）に代入される。 Therefore, the number of Db_ (z, s) calculated as follows for one trial is m + n + 1.
Db_ (z, s) = boole (D_z> Tsh_s)
That is, for each trial, for each deterioration determination threshold value Tsh_s, D_z is compared with the deterioration determination threshold value Tsh_s, and a logical value indicating the comparison result is substituted into Db_ (z, s).

Ｄｂ＿（ｚ，ｓ）は、真であれば「劣化」、偽であれば「非劣化」を表す論理値群であり、閾値の違い（ｓごと）で、異なる強度の劣化を表している。なお、劣化度代表値Ｄ＿ｚは、ｓ（閾値の種類）に非依存であり、劣化判定閾値Ｔｓｈ＿ｓは試行（ｚ）に非依存）である。 Db_ (z, s) is a logical value group that represents “deterioration” if true, and “non-deterioration” if false, and represents different intensity degradations with different thresholds (for each s). The deterioration level representative value D_z is independent of s (threshold type), and the deterioration determination threshold value Tsh_s is independent of trial (z).

続いて、トラフィック情報抽出部１２が実行する処理手順について説明する。図４は、トラフィック情報抽出部１２が実行する処理手順の一例を説明するためのフローチャートである。 Next, a processing procedure executed by the traffic information extraction unit 12 will be described. FIG. 4 is a flowchart for explaining an example of a processing procedure executed by the traffic information extraction unit 12.

ステップＳ２０１において、トラフィック情報抽出部１２は、変数ｉに１を代入する。変数ｉは、図４の説明において、各試行を識別するための変数である。以下、ｉ番目の試行を「試行ｉ」という。 In step S201, the traffic information extraction unit 12 substitutes 1 for a variable i. The variable i is a variable for identifying each trial in the description of FIG. Hereinafter, the i-th trial is referred to as “trial i”.

続いて、トラフィック情報抽出部１２は、トラフィック計測データＤ２から、試行ｉに対応するログデータを取得する（Ｓ２０２）。具体的には、試行ｉに対応するエリア及び時間帯（集計時間幅Ｗ）に対応するログデータが取得される。以下、取得されたログデータに係る通信トラフィックを「対象トラフィック」という。試行ｉに対応するエリアは、エリアが基地局単位又はセクタ単位であり、ログデータにｅＮＢのＩＰアドレス又はＣＩＤが付与されている場合には、試行ｉに対応するエリアのｅＮＢのＩＰアドレスに基づいて、又は当該エリアのセクタのＣＩＤに基づいて、対象トラフィックのログデータを絞り込むことができる。一方、ｅＮＢのＩＰアドレス又はＣＩＤがログデータに付与されていない場合、試行ｉに対応するエリアのモニタ用ポートで観測できる範囲が、試行ｉに対応するエリアに関するログデータとなる。 Subsequently, the traffic information extraction unit 12 acquires log data corresponding to the trial i from the traffic measurement data D2 (S202). Specifically, log data corresponding to the area and time zone (total time width W) corresponding to trial i is acquired. Hereinafter, the communication traffic related to the acquired log data is referred to as “target traffic”. The area corresponding to the trial i is based on the eNB IP address of the area corresponding to the trial i when the area is a base station unit or a sector unit and the log data is assigned the IP address or CID of the eNB. Or the log data of the target traffic can be narrowed down based on the CID of the sector in the area. On the other hand, when the IP address or CID of the eNB is not given to the log data, the range that can be observed at the monitoring port of the area corresponding to the trial i is the log data regarding the area corresponding to the trial i.

続くステップＳ２０３〜Ｓ２０７において、トラフィック情報抽出部１２は、対象トラフィックに係るログデータから、試行ｉに係る通信トラフィックの状況を示すパラメータを抽出する。 In subsequent steps S203 to S207, the traffic information extraction unit 12 extracts a parameter indicating the status of the communication traffic related to the trial i from the log data related to the target traffic.

ステップＳ２０３において、トラフィック情報抽出部１２は、対象トラフィックの特定のポート番号に係る時間内スループット及び総転送量に基づいて、以下のステップＳ２０３−１〜Ｓ２０３−３を実行することにより、試行ｉに関する低スループットの代表値を算出する。 In step S203, the traffic information extraction unit 12 performs the following steps S203-1 to S203-3 on the basis of the in-time throughput and the total transfer amount related to the specific port number of the target traffic, thereby regarding the trial i. Calculate a representative value for low throughput.

ステップＳ２０３−１において、トラフィック情報抽出部１２は、対象トラフィックから、以下の（１）及び（２）の条件を満たすＴＣＰフローを抽出する。
（１）ＴＣＰポートがｔｈｐｔ＿ｐｏｒｔであること。ここで、ｔｈｐｔ＿ｐｏｒｔは、事前設定されるパラメータの一つであり、集計対象のＴＣＰスループットのポート番号を示す。本実施の形態において、ｔｈｐｔ＿ｐｏｒｔの値は「８０」である。
（２）更に、事前設定パラメータとして、ｔｈ＿ｔｈｐｔ＿Ｌ及びｔｈ＿ｔｈｐｔ＿Ｕが設定されていれば、（１）に該当するＴＣＰフローのうち、総転送量がｔｈ＿ｔｈｐｔ＿Ｌ以上かつｔｈ＿ｔｈｐｔ＿Ｕ以下のＴＣＰフローであること。ここで、ｔｈ＿ｔｈｐｔ＿Ｌは、抽出対象の下限フローサイズであり、ｔｈ＿ｔｈｐｔ＿Ｕは、抽出対象の上限フローサイズを示す。なお、ｔｈ＿ｔｈｐｔ＿Ｌ及びｔｈ＿ｔｈｐｔ＿Ｕのいずれについても、０が設定された場合には、（２）の条件は無効とされる。 In step S203-1, the traffic information extraction unit 12 extracts a TCP flow that satisfies the following conditions (1) and (2) from the target traffic.
(1) The TCP port is thpt_port. Here, thpt_port is one of preset parameters, and indicates the port number of the TCP throughput to be counted. In the present embodiment, the value of thpt_port is “80”.
(2) Further, if th_thpt_L and th_thpt_U are set as the preset parameters, the TCP flow corresponding to (1) is a TCP flow having a total transfer amount of th_thpt_L or more and th_thpt_U or less. Here, th_thpt_L is the lower limit flow size to be extracted, and th_thpt_U represents the upper limit flow size to be extracted. Note that if both th_thpt_L and th_thpt_U are set to 0, the condition (2) is invalidated.

ステップＳ２０３−２において、トラフィック情報抽出部１２は、試行ｉの時間帯（集計時間幅Ｗ）を分割（細分化）する集計単位時間τごとに、抽出されたＴＣＰフローごとのスループット値（時間内転送量／τ）を集計し、当該スループット値の集合のｔｈｐｔ＿ｐ１分位点の値を算出する。 In step S203-2, the traffic information extraction unit 12 determines the throughput value (within the time) for each extracted TCP flow for each aggregation unit time τ that divides (subdivides) the time zone (total time width W) of trial i. Transfer amount / τ) is totaled, and the value of thpt_p1 quantile of the set of throughput values is calculated.

ここで、ｔｈｐｔ＿ｐ１及び以下のｔｈｐｔ＿ｐ２は、事前設定パラメータの一つであり、分位点（パーセンタイル）によって指定される、スループット値の抽出閾値である。したがって、例えば、ｔｈｐｔ＿ｐ１が０．２の場合、集計されたスループット値の集合の２０パーセンタイルが算出される。また、τも、事前設定パラメータの一つであり、Ｗ＝τ＊κとする（κは整数）。したがって、ステップＳ２０３−２は、κ回繰り返され、κ個のｔｈｐｔ＿ｐ１分位点の値の組が得られる。 Here, thpt_p1 and the following thpt_p2 are one of the preset parameters, and are throughput value extraction threshold values designated by quantiles (percentiles). Therefore, for example, when thpt_p1 is 0.2, the 20th percentile of the aggregated throughput value set is calculated. Also, τ is one of the preset parameters, and W = τ * κ (κ is an integer). Accordingly, step S203-2 is repeated κ times to obtain a set of κ thpt_p1 quantile values.

ステップＳ２０３−３において、トラフィック情報抽出部１２は、κ個の値の組におけるｔｈｐｔ＿ｐ２（例えば、０．２等）分位点の値を算出する。ステップＳ２０３−３にて算出されたｔｈｐｔ＿ｐ２分位点の値が、スループットの代表値（以下、「Ｔｈ１＿ｚ」という（ｚは１．．Ｚ）。）である。 In step S203-3, the traffic information extraction unit 12 calculates a thpt_p2 (for example, 0.2 etc.) quantile value in a set of κ values. The value of thpt_p2 quantile calculated in step S203-3 is a representative value of throughput (hereinafter referred to as “Th1_z” (z is 1. Z)).

ステップＳ２０３に続いて、トラフィック情報抽出部１２は、対象トラフィック内のＴＣＰフローのａｃｋ応答等から計算したＲＴＴサンプルに基づいて、以下のステップＳ２０４−１〜Ｓ２０４−３を実行することにより、試行ｉに関する高ＲＴＴの代表値を算出する（Ｓ２０４）。なお、ＴＣＰのパケット列からＲＴＴ（Round Trip Time）を算出する方法は、所定のものに限定されない。既存の方法の一例として、パケット分析ツールｗｉｒｅｓｈａｒｋにおいてｔｃｐ．ａｎａｌｙｓｉｓ．ａｃｋ＿ｒｔｔで読み取る方法が挙げられる。 Subsequent to step S203, the traffic information extracting unit 12 executes the following steps S204-1 to S204-3 based on the RTT sample calculated from the ack response of the TCP flow in the target traffic, thereby performing trial i. The representative value of the high RTT is calculated (S204). Note that a method for calculating RTT (Round Trip Time) from a TCP packet sequence is not limited to a predetermined method. As an example of an existing method, in the packet analysis tool wireshark, tcp. analysis. There is a method of reading with ack_rtt.

ステップＳ２０４−１において、トラフィック情報抽出部１２は、試行ｉの集計単位時間τごとに、対象トラフィックに対するＲＴＴデータ群を抽出する。 In step S204-1, the traffic information extraction unit 12 extracts an RTT data group for the target traffic for each total unit time τ of trial i.

ステップＳ２０４−２において、トラフィック情報抽出部１２は、試行ｉの集計単位時間τごとに、抽出されたＲＴＴデータ群のｒｔｔ＿ｐ１分位点の値を算出する。なお、ステップＳ２０４−１では、κ個のＲＴＴデータ群が抽出されている。したがって、ステップＳ２０４−２では、κ個のｒｔｔ＿ｐ１分位点の値の組が算出される。なお、ｒｔｔ＿ｐ１及び以下のｔｈｐｔ＿ｐ２は、事前設定パラメータの一つであり、スループット抽出閾値を分位点（例えば、０．１，０．２等）で指定する値である。 In step S204-2, the traffic information extraction unit 12 calculates the value of the rtt_p1 quantile of the extracted RTT data group for each aggregation unit time τ of trial i. In step S204-1, κ RTT data groups are extracted. Accordingly, in step S204-2, a set of values of κ rtt_p1 quantiles is calculated. Note that rtt_p1 and the following thpt_p2 are one of the preset parameters, and are values that specify the throughput extraction threshold value with quantile points (for example, 0.1, 0.2, etc.).

ステップＳ２０４−３において、トラフィック情報抽出部１２は、κ個のｒｔｔ＿ｐ１分位点の値のｒｔｔ＿ｐ２分位点の値を算出する。ステップＳ２０４−３で算出されたｒｔｔ＿ｐ２分位点の値が、高ＲＴＴの代表値（以下、「Ｒｔｔ１＿ｚ」という（ｚは１．．Ｚ）。）である。 In step S204-3, the traffic information extraction unit 12 calculates the value of the rtt_p2 quantile of κ rtt_p1 quantile values. The value of the rtt_p2 quantile calculated in step S204-3 is a representative value of high RTT (hereinafter referred to as “Rtt1_z” (z is 1. Z)).

ステップＳ２０４に続いて、トラフィック情報抽出部１２は、対象トラフィック内のＴＣＰフローの解析から計算したτごとのＴＣＰ再送発生比率（ＴＣＰパケット中の再送パケットの比率）に基づいて、以下のステップＳ２０５−１〜Ｓ２０５−２を実行することにより、試行ｉに関する再送率の代表値を算出する（Ｓ２０５）。なお、ＴＣＰのパケット列から再送を検出する方法は、所定のものに限定されない。既存の方法の一例として、パケット分析ツールｗｉｒｅｓｈａｒｋにおいてｔｃｐ．ａｎａｌｙｓｉｓ．ｒｅｔｒａｎｓｍｉｓｓｉｏｎ等のパラメータから評価する方法が挙げられる。 Subsequent to step S204, the traffic information extraction unit 12 performs the following step S205− based on the TCP retransmission occurrence ratio (ratio of retransmission packets in TCP packets) for each τ calculated from the analysis of the TCP flow in the target traffic. By executing 1 to S205-2, the representative value of the retransmission rate for trial i is calculated (S205). The method for detecting retransmission from a TCP packet sequence is not limited to a predetermined method. As an example of an existing method, in the packet analysis tool wireshark, tcp. analysis. There is a method of evaluating from parameters such as transmission.

ステップＳ２０５−１において、トラフィック情報抽出部１２は、集計単位時間τごとに、対象トラフィックに対しての再送率を抽出する。したがって、κ個の再送率の値の組が得られる。 In step S205-1, the traffic information extraction unit 12 extracts a retransmission rate for the target traffic for each counting unit time τ. Therefore, a set of κ retransmission rate values is obtained.

ステップＳ２０５−２において、トラフィック情報抽出部１２は、κ個の再送率のｒｅｔ＿ｐ分位点の値を算出する。なお、ｒｅｔ＿ｐは、事前設定パラメータの一つであり、再送率抽出閾値を分位点（例えば、０．９，０．８等）で指定する値である。このｒｅｔ＿ｐ分位点の値が再送率の代表値（以下、「Ｒｅｔ１＿ｚ」という（ｚは１．．Ｚ）。）である。 In step S205-2, the traffic information extraction unit 12 calculates the value of the ret_p quantile of κ retransmission rates. Note that ret_p is one of the preset parameters, and is a value that designates the retransmission rate extraction threshold with a quantile (eg, 0.9, 0.8, etc.). The value of the ret_p quantile is a representative value of the retransmission rate (hereinafter referred to as “Ret1_z” (z is 1... Z)).

ステップＳ２０５に続いて、トラフィック情報抽出部１２は、以下のステップＳ２０６−１〜Ｓ２０６−３を実行することにより、ＤＮＳ応答時間の代表値を算出する（Ｓ２０６）。ＤＮＳ応答時間とは、対象トラフィック内のＤＮＳ検索の解析から計算したＤＮＳ検索からＤＮＳ応答までの時間である。 Subsequent to step S205, the traffic information extraction unit 12 calculates the representative value of the DNS response time by executing the following steps S206-1 to S206-3 (S206). The DNS response time is the time from the DNS search to the DNS response calculated from the DNS search analysis in the target traffic.

ステップＳ２０６−１において、トラフィック情報抽出部１２は、試行ｉの集計単位時間τごとに、対象トラフィックに対してのＤＮＳ−ＲＴＴデータ集合を抽出する。 In step S206-1, the traffic information extraction unit 12 extracts a DNS-RTT data set for the target traffic for each aggregation unit time τ of trial i.

ステップＳ２０６−２において、トラフィック情報抽出部１２は、試行ｉの集計単位時間τごとに、抽出されたＤＮＳ−ＲＴＴデータ集合について、ｄｎｓｔ＿ｐ１分位点の値を算出する。なお、ステップＳ２０６−１では、κ個のＤＮＳ−ＲＴＴデータ集合が抽出されている。したがって、ステップＳ２０６−２では、κ個のｄｎｓｔ＿ｐ１分位点の値の組が算出される。なお、ｄｎｓｔ＿ｐ１及び以下のｄｎｓｔ＿ｐ２は、事前設定パラメータの一つであり、ＤＮＳ応答時間抽出閾値を分位点（例えば、０．９，０．８等）で指定する値である。 In step S206-2, the traffic information extraction unit 12 calculates the value of the dnst_p1 quantile for the extracted DNS-RTT data set for each aggregation unit time τ of trial i. In step S206-1, κ DNS-RTT data sets are extracted. Therefore, in step S206-2, a set of values of κ dnst_p1 quantiles is calculated. Note that dnst_p1 and the following dnst_p2 are one of the preset parameters, and are values that specify the DNS response time extraction threshold value with a quantile (for example, 0.9, 0.8, etc.).

ステップＳ２０６−３において、トラフィック情報抽出部１２は、κ個のｄｎｓｔ＿ｐ１分位点の値のｄｎｓｔ＿ｐ２分位点の値を算出する。このｄｎｓｔ＿ｐ２分位点の値が、ＤＮＳ応答時間（ＤＮＳ−ＲＴＴ）の代表値（以下、「Ｄｒｔ１＿ｚ」という（ｚは１．．Ｚ）。）である。 In step S206-3, the traffic information extraction unit 12 calculates the value of the dnst_p2 quantile of the κ dnst_p1 quantile values. The value of the dnst_p2 quantile is a representative value of DNS response time (DNS-RTT) (hereinafter referred to as “Drt1_z” (z is 1. Z)).

ステップＳ２０６に続いて、トラフィック情報抽出部１２は、以下のステップＳ２０７−１〜Ｓ２０７−３を実行することにより、ＤＮＳ応答率の代表値を算出する（Ｓ２０７）。ＤＮＳ応答率とは、モニタ対象トラフィック内のＤＮＳ検索の解析から計算したＤＮＳ検索に対し応答が有った比率である。 Following step S206, the traffic information extraction unit 12 calculates the representative value of the DNS response rate by executing the following steps S207-1 to S207-3 (S207). The DNS response rate is a ratio of responses to the DNS search calculated from the DNS search analysis in the monitored traffic.

ステップＳ２０７−１において、トラフィック情報抽出部１２は、試行ｉの集計単位時間τごとに、対象トラフィックに対してのＤＮＳ検索数と当該検索に対するＤＮＳ応答の数とを抽出する。なお、応答は、τより後の時間になる場合もあるため、ｄｎｓ＿ｔｉｍｅｏｕｔ経過時までを集計対象とする。ｄｎｓ＿ｔｉｍｅｏｕｔは、事前設定パラメータの一つであり、ＤＮＳ応答率算出時のタイムアウト時間である。 In step S207-1, the traffic information extraction unit 12 extracts the number of DNS searches for the target traffic and the number of DNS responses to the search for each aggregation unit time τ of trial i. In addition, since the response may be a time later than τ, the time until dns_timeout elapses is counted. dns_timeout is one of the preset parameters, and is a timeout time when calculating the DNS response rate.

ステップＳ２０７−２において、トラフィック情報抽出部１２は、試行ｉの集計単位時間τごとに、応答率（ＤＮＳ応答数／ＤＮＳ検索数）を算出する。検索数が０のτについては値なしとする。なお、ステップＳ２０７−２では、κ個の応答率が算出される。したがって、κ個の応答率の組が得られる。 In step S207-2, the traffic information extraction unit 12 calculates a response rate (number of DNS responses / number of DNS searches) for each aggregation unit time τ of trial i. There is no value for τ where the number of searches is 0. In step S207-2, κ response rates are calculated. Therefore, a set of κ response rates is obtained.

ステップＳ２０７−３において、トラフィック情報抽出部１２は、κ個の応答率の組から「値なし」のものがあれば除外し、κ個（除外してκ未満のこともある）の値のｄｎｓｒ＿ｐ分位点の値を算出する。ｄｎｓｒ＿ｐは、事前設定パラメータの一つであり、ＤＮＳ応答率抽出閾値を分位点（例えば、０．２等）で指定する値である。このｄｎｓｒ＿ｐ分位点の値が、ＤＮＳ応答率の代表値（以下、「Ｄｒｐ１＿ｚ」という（ｚは１．．Ｚ）。）である。 In step S207-3, the traffic information extracting unit 12 excludes any “no value” from the set of κ response rates, and dnsr_p of κ (may be less than κ). Calculate the quantile value. dnsr_p is one of the preset parameters, and is a value that designates the DNS response rate extraction threshold using a quantile (for example, 0.2). The value of this dnsr_p quantile is a representative value of the DNS response rate (hereinafter referred to as “Drp1_z” (z is 1.... Z)).

ステップＳ２０２〜Ｓ２０７は、Ｚ回の全ての試行について実行される（Ｓ２０８、Ｓ２０９）。その結果、スループット代表値Ｔｈ１として、以下のようなベクトルが得られる。
Ｔｈ１＝（Ｔｈ＿１，Ｔｈ１＿２，Ｔｈ１＿３…Ｔｈ１＿ｚ）
なお、ｔｈｐｔ＿ｐ１，ｔｈｐｔ＿ｐ２の組が、複数設定されてもよい。その場合、当該組毎に、Ｔｈ２＿ｚ，Ｔｈ３＿ｚ等のベクトルが得られる。 Steps S202 to S207 are executed for all Z trials (S208, S209). As a result, the following vector is obtained as the throughput representative value Th1.
Th1 = (Th_1, Th1_2, Th1_3... Th1_z)
A plurality of sets of thpt_p1 and thpt_p2 may be set. In that case, vectors such as Th2_z and Th3_z are obtained for each group.

また、高ＲＴＴ代表値Ｒｔｔ１として、以下のようなベクトルが得られる。
Ｒｔｔ１＝（Ｒｔｔ１＿１，Ｒｔｔ１＿２，…Ｒｔｔ１＿ｚ）
なお、ｒｔｔ＿ｐ１，ｒｔｔ＿ｐ２の組が、複数設定されてもよい。その場合、当該組ごとに、Ｒｔｔ２＿ｚ，Ｒｔｔ３＿ｚ等のベクトルが得られる。 Further, as the high RTT representative value Rtt1, the following vector is obtained.
Rtt1 = (Rtt1_1, Rtt1_2,... Rtt1_z)
A plurality of sets of rtt_p1 and rtt_p2 may be set. In that case, vectors such as Rtt2_z and Rtt3_z are obtained for each group.

また、再送率代表値Ｒｅｔ１として、以下のようなベクトルが得られる。
Ｒｅｔ１＝（Ｒｅｔ１＿１，Ｒｅｔ１＿２，…Ｒｅｔ１＿ｚ）
なお、ｒｅｔ＿ｐについて、複数の値が設定されてもよい。その場合、当該値ごとに、Ｒｅｔ２＿ｚ，Ｒｅｔ３＿ｚ等のベクトルが得られる。 Further, the following vector is obtained as the retransmission rate representative value Ret1.
Ret1 = (Ret1_1, Ret1_2,... Ret1_z)
A plurality of values may be set for ret_p. In that case, vectors such as Ret2_z and Ret3_z are obtained for each value.

また、ＤＮＳ応答時間代表値Ｄｒｔ１として、以下のようなベクトルが得られる。
Ｄｒｔ１＝（Ｄｒｔ１＿１，Ｄｒｔ１＿２，…Ｄｒｔ１＿ｚ）
なお、ｄｎｓｔ＿ｐ１、ｄｎｓｔ＿ｐ２の組が複数設定されてもよい。その場合、当該組ごとに、Ｄｒｔ２＿ｚ，Ｄｒｔ３＿ｚ等のベクトルが得られる。 Further, as the DNS response time representative value Drt1, the following vector is obtained.
Drt1 = (Drt1_1, Drt1_2, ... Drt1_z)
A plurality of sets of dnst_p1 and dnst_p2 may be set. In that case, vectors such as Drt2_z and Drt3_z are obtained for each group.

更に、ＤＮＳ応答率Ｄｒｐ１として、以下のようなベクトルが得られる。
Ｄｒｐ１＝（Ｄｒｐ１＿１，Ｄｒｐ１＿２，…Ｄｒｐ１＿ｚ）
なお、ｄｎｓｒ＿ｐについて、複数の値が設定されてもよい。その場合、当該値ごとに、Ｄｒｐ２＿ｚ、Ｄｒｐ３＿ｚ等のベクトルが得られる。 Furthermore, the following vectors are obtained as the DNS response rate Drp1.
Drp1 = (Drp1_1, Drp1_2,... Drp1_z)
A plurality of values may be set for dnsr_p. In that case, vectors such as Drp2_z and Drp3_z are obtained for each value.

続いて、トラフィック情報抽出部１２は、それぞれが長さＺの上記各代表値のベクトルを組（行列）として、以下のＤＰＩｖｅｃを出力する（Ｓ２１０）。
ＤＰＩｖｅｃ＝（Ｔｈ１（，Ｔｈ２，Ｔｈ３．．），Ｒｔｔ１（，Ｒｔｔ２，Ｒｔｔ３．．），Ｒｅｔ１（，Ｒｅｔ２，Ｒｅｔ３．．），Ｄｒｔ１（，Ｄｒｔ２，Ｄｒｔ３），Ｄｒｐ１（，Ｄｒｐ２，Ｄｒｐ３．．））
なお、（，Ｔｈ２，Ｔｈ３．．）、（，Ｒｔｔ２，Ｒｔｔ３．．）、（，Ｒｅｔ２，Ｒｅｔ３．．）、（，Ｄｒｔ２，Ｄｒｔ３）及び（，Ｄｒｐ２，Ｄｒｐ３．．）は、事前設定パラメータとしての閾値（分位点）の組の数によって、構成するベクトルの数が変化することを示す。 Subsequently, the traffic information extraction unit 12 outputs the following DPIvec with each vector of the representative values having a length Z as a set (matrix) (S210).
DPIvec = (Th1 (, Th2, Th3 ..), Rtt1 (, Rtt2, Rtt3 ..), Ret1 (, Ret2, Ret3 ..), Drt1 (, Drt2, Drt3), Drp1 (, Drp2, Drp3 ..) )
(, Th2, Th3 ...), (, Rtt2, Rtt3 ...), (, Ret2, Ret3 ...), (, Drt2, Drt3) and (, Drp2, Drp3 ...) are preset parameters. This indicates that the number of constituent vectors changes depending on the number of threshold values (quantile points).

続いて、学習段階において品質推定部１３が実行する処理手順について説明する。図５は、学習段階において品質推定部１３が実行する処理手順の一例を説明するためのフローチャートである。 Then, the process procedure which the quality estimation part 13 performs in a learning stage is demonstrated. FIG. 5 is a flowchart for explaining an example of a processing procedure executed by the quality estimation unit 13 in the learning stage.

ステップＳ３０１において、品質推定部１３は、ｗｅｂ劣化分析部１１によって算出された、ｍ＋ｎ＋１個の長さＺの以下の論理値ベクトルである劣化度論理値Ｄｂ＿（ｚ，ｓ）を入力する。
Ｄｂ＿（ｚ，ｓ）＝（Ｄｂ＿（ｚ，−ｍ），Ｄｂ＿（ｚ，−ｍ＋１），…Ｄｂ＿（ｚ，ｎ−１），Ｄｂ＿（ｚ，ｎ））
なお、Ｄｂ＿（ｚ，ｓ）は、ｓを固定すると要素数Ｚ個のベクトルである。ｓはｍ＋ｎ＋１個の値を取るので、Ｄｂ＿（ｚ，ｓ）は、ｍ＋ｎ＋１個のＺ個のベクトルを要素とするベクトルとなる。 In step S301, the quality estimation unit 13 inputs the deterioration degree logical value Db_ (z, s), which is the following logical value vector of m + n + 1 lengths Z calculated by the web deterioration analysis unit 11.
Db_ (z, s) = (Db_ (z, −m), Db_ (z, −m + 1),... Db_ (z, n−1), Db_ (z, n))
Note that Db_ (z, s) is a vector having Z elements when s is fixed. Since s takes m + n + 1 values, Db_ (z, s) is a vector whose elements are m + n + 1 Z vectors.

続いて、品質推定部１３は、トラフィック情報抽出部１２によって出力された、以下の代表値のベクトルの組（行列）を入力する（Ｓ３０２）。
ＤＰＩｖｅｃ＝（Ｔｈ１（，Ｔｈ２，Ｔｈ３．．），Ｒｔｔ１（，Ｒｔｔ２，Ｒｔｔ３．．），Ｒｅｔ１（，Ｒｅｔ２，Ｒｅｔ３．．），Ｄｒｔ１（，Ｄｒｔ２，Ｄｒｔ３），Ｄｒｐ１（，Ｄｒｐ２，Ｄｒｐ３．．））
続いて、品質推定部１３は、ｍ＋ｎ＋１個（すなわち、１以上）の推定器のインスタンスを生成する（Ｓ３０３）。すなわち、劣化判定閾値Ｔｓｈ＿ｓの数と同数の（劣化判定閾値Ｔｓｈ＿ｓごとの）推定器が用意される。各推定器を「Ｓ＿ｓ」と表記する。ここで、ｓは、ｍ＋ｎ＋１種類のＴｓｈ＿ｓに対応する。 Subsequently, the quality estimation unit 13 inputs the following set of representative value vectors (matrix) output by the traffic information extraction unit 12 (S302).
DPIvec = (Th1 (, Th2, Th3 ..), Rtt1 (, Rtt2, Rtt3 ..), Ret1 (, Ret2, Ret3 ..), Drt1 (, Drt2, Drt3), Drp1 (, Drp2, Drp3 ..) )
Subsequently, the quality estimation unit 13 generates m + n + 1 (that is, one or more) instances of the estimator (S303). That is, as many estimators as the number of deterioration determination threshold values Tsh_s (for each deterioration determination threshold value Tsh_s) are prepared. Each estimator is denoted as “S_s”. Here, s corresponds to m + n + 1 types of Tsh_s.

続いて、品質推定部１３は、Ｄｂ＿（ｚ，ｓ）を目的変数とし、ＤＰＩｖｅｃを説明変数として、各Ｓ＿ｓを学習させる（Ｓ３０４）。なお、各Ｓ＿ｓは、ｓが共通するＤｂ＿（ｚ，ｓ）について学習を行う。これでＳ＿ｓは学習済みとなり、ｗｅｂページの表示待ち時間の品質の推定に用いることが可能な状態となる。 Subsequently, the quality estimator 13 learns each S_s using Db_ (z, s) as an objective variable and DPIvec as an explanatory variable (S304). Each S_s learns about Db_ (z, s) with common s. S_s is now learned, and can be used to estimate the quality of the display latency of the web page.

続いて、推定段階において推定装置１０が実行する処理手順について説明する。図６は、推定段階において推定装置１０が実行する処理手順の一例を説明するためのフローチャートである。 Then, the process procedure which the estimation apparatus 10 performs in an estimation stage is demonstrated. FIG. 6 is a flowchart for explaining an example of a processing procedure executed by the estimation apparatus 10 in the estimation stage.

図６の処理手順の前提として、推定対象のエリア及び時間帯に対応するトラフィック計測データＤ２（例えば、ＤＰＩのログデータ）が収集されているとする。当該トラフィック計測データＤ２を、以下「推定対象計測データ」という。なお、ｗｅｂページの表示待ち時間の計測は不要である。当該表示待ち時間の劣化の度合いが目的変数（推定対象）だからである。 As a premise of the processing procedure of FIG. 6, it is assumed that traffic measurement data D2 (for example, DPI log data) corresponding to the estimation target area and time zone is collected. The traffic measurement data D2 is hereinafter referred to as “estimation target measurement data”. It is not necessary to measure the display wait time of the web page. This is because the degree of deterioration of the display waiting time is an objective variable (estimation target).

ステップＳ４０１において、トラフィック情報抽出部１２は、推定対象計測データを入力する。続くステップＳ４０２〜Ｓ４０６において、トラフィック情報抽出部１２は、図４のステップＳ２０３〜Ｓ２０７と同様の処理を実行して、推定対象計測データから各代表値を算出（抽出）する。但し、推定対象のエリア及び時間帯は１つであるため、Ｚ＝１である。 In step S401, the traffic information extraction unit 12 inputs estimation target measurement data. In subsequent steps S402 to S406, the traffic information extraction unit 12 calculates (extracts) each representative value from the estimation target measurement data by executing the same processing as in steps S203 to S207 of FIG. However, since the estimation target area and the time zone are one, Z = 1.

その結果、トラフィック情報抽出部１２は、図５のステップＳ２１０と同様に、長さＺ＝１である、ＤＰＩｖｅｃと同じ形式の行列を得る。以下、当該行列を「ＤＰＩｖｅｃ＿ｎｅｗ」という。 As a result, the traffic information extraction unit 12 obtains a matrix of the same format as DPIvec, with a length Z = 1, as in step S210 of FIG. Hereinafter, the matrix is referred to as “DPIvec_new”.

続いて、品質推定部１３は、以下のように、各Ｓ＿ｓ（各推定器）に対してＤＰＩｖｅｃ＿ｎｅｗを入力することで、各劣化判定閾値Ｔｓｈ＿ｓに応じた、劣化か否かの論理値の組（ベクトル）であるＥｓｔ＿ｓを得る（Ｓ４０７）
（複数のｓに対し）Ｅｓｔ＿ｓ＝Ｓ＿ｓ（ＤＰＩｖｅｃ＿ｎｅｗ）
なお、Ｅｓｔ＿ｓは、ｍ＋ｎ＋１の長さの論理値の組（ベクトル）となる。 Subsequently, the quality estimation unit 13 inputs DPIvec_new to each S_s (each estimator) as follows, and thereby sets of logical values indicating whether or not the degradation is in accordance with each degradation determination threshold value Tsh_s ( Est_s which is a vector) is obtained (S407).
Est_s = S_s (DPIvec_new) (for multiple s)
Est_s is a set (vector) of logical values having a length of m + n + 1.

典型的には、Ｔｓｈ＿ｓの値が最大のｓについて真（劣化）であり、Ｔｓｈ＿ｓが順に小さくなるようｓを変えていくと、或るｓがθであるときに偽（非劣化）となり、それ以下のＴｓｈ＿ｓでは全部偽（非劣化）というパターンとなる（但し、全部偽、全部真、又は順番が壊れていることも起こりうる。）。 Typically, the value of Tsh_s is true (deteriorated) for the largest s, and when s is changed so that Tsh_s decreases in order, it becomes false (non-degraded) when a certain s is θ, In the following Tsh_s, the pattern is all false (non-degraded) (however, all false, all true, or the order may be broken).

品質推定部１３は、Ｅｓｔ＿ｓからこのようなθを探索し、探索されたθに対応したＴｓｈ＿θを「推定された劣化度」（推定対象のエリア及び時間帯に対する劣化度の推定値）として出力する（Ｓ４０９）。具体的には、例えば、全部偽（非劣化）であれば、品質推定部１３は、「非劣化」と出力する。この場合、（最少の）Ｔｓｈ＿ｓ未満の劣化となる。また、全部真（劣化）であれば、品質推定部１３は、「（最大の）Ｔｓｈ＿ｓ以上」を出力する。また、全部偽でなく、かつ、全部真でない場合、偽と真とが、或るＴｓｈ＿ｓを境に区分されていない場合でも、品質推定部１３は、Ｔｓｈ＿ｓが大きい側から順に値を確認し、最初に偽となったＴｓｈ＿θを使い「Ｔｓｈ＿θ以上」を出力する。このＴｓｈ＿θは（個別のｗｅｂページのみには依存しない）ｗｅｂページの表示待ち時間の劣化発生の度合の推定結果である。 The quality estimation unit 13 searches for such θ from Est_s, and outputs Tsh_θ corresponding to the searched θ as an “estimated degradation level” (estimated degradation level for the estimation target area and time zone). (S409). Specifically, for example, if all are false (non-degraded), the quality estimation unit 13 outputs “non-degraded”. In this case, the degradation is less than (minimum) Tsh_s. If all are true (deterioration), the quality estimation unit 13 outputs “(maximum) Tsh_s or more”. In addition, when all are not false and all are not true, even if false and true are not classified with a certain Tsh_s as a boundary, the quality estimation unit 13 checks the values in order from the side with the largest Tsh_s, Using Tsh_θ that becomes false first, “Tsh_θ or more” is output. This Tsh_θ is an estimation result of the degree of occurrence of deterioration of the display latency of the web page (which does not depend only on the individual web page).

なお、本実施の形態では、通信トラフィックの状況を示すパラメータを抽出するパラメータの一例として、低スループット代表値、高ＲＴＴ代表値、再送率代表値、ＤＮＳ応答時間代表値、及びＤＮＳ応答率代表値を用いる例を説明したが、必ずしもこれら全部が用いられなくてもよく、これらの一部（例えば、いずれか一つ）が用いられてもよい。また、通信トラフィックの状況を把握可能なパラメータであれば、これら以外のパラメータが用いられてもよい。 In this embodiment, as an example of a parameter for extracting a parameter indicating the state of communication traffic, a low throughput representative value, a high RTT representative value, a retransmission rate representative value, a DNS response time representative value, and a DNS response rate representative value However, all of these may not necessarily be used, and some (for example, any one) of them may be used. In addition, parameters other than these may be used as long as the parameters can grasp the status of communication traffic.

上述したように、本実施の形態によれば、ｗｅｂページの表示待ち時間の計測値と、当該表示待ち時間の計測のエリア及び時間帯に対応する通信トラフィックに関する情報との関係を推定器に学習させ、推定したい未知データの通信トラフィックに関する情報を当該推定器に入力することで（個別ページのみには依存しない）ｗｅｂページの待ち時間の劣化の程度を推定することができる。したがって、本実施の形態によれば、個別のｗｅｂページの影響が除去された表示待ち時間の品質の推定を可能とすることができる。 As described above, according to the present embodiment, the estimator learns the relationship between the measurement value of the display latency of the web page and the information related to the communication traffic corresponding to the display latency measurement area and the time zone. Then, by inputting information related to communication traffic of unknown data to be estimated to the estimator, it is possible to estimate the degree of deterioration of the waiting time of the web page (which does not depend only on the individual page). Therefore, according to the present embodiment, it is possible to estimate the quality of the display waiting time from which the influence of individual web pages is removed.

また、本実施の形態では、複数のｗｅｂページ・サーバを学習に用い、また「ｗｅｂページの表示待ち時間の劣化」という一般的特性を（ネットワーク起因でなくｗｅｂページ側の理由等で）一部だけ劣化するものの影響を除外して抽出するために、ｗｅｂページごとの基準で正規化した「劣化度」の考え方、及び複数の計測データの分位点によりデータを抽出する手法を用いることで、個別ｗｅｂページ毎の待ち時間の差を吸収してｗｅｂページの待ち時間の劣化を捉える。したがって、ｗｅｂページ・サーバが千差万別であり、同じ時間・場所で利用していても個々のページにより待ち時間は異なるといった問題を解消することができる。 Further, in this embodiment, a plurality of web page servers are used for learning, and a general characteristic of “deterioration of display latency of web pages” is partially (because of the reason of the web page side instead of network origin). In order to extract by excluding the influence of what deteriorates only, by using the concept of “degradation degree” normalized by the standard for each web page and the method of extracting data by the quantile of multiple measurement data, By absorbing the difference in waiting time for each individual web page, the deterioration of the waiting time of the web page is captured. Therefore, it is possible to solve the problem that the web page server is various and the waiting time varies depending on each page even if the web page server is used at the same time and place.

また、本実施の形態では、パッシブ計測データ（ユーザ・通信相手・目的もバラバラの通信データ解析ログ（ＤＰＩのログ））の内容の不安定性に対し、（表示待ち時間の劣化時に対応するログも劣化しているなど）劣化情報と対応性が良く、極端な異常値に左右されにくいトラフィックデータの分位点を抽出することで、平均値等を抽出する場合と比較して、斯かる不安定性に適切に対応することができる。 In the present embodiment, in response to instability of the contents of passive measurement data (communication data analysis log (DPI log) with different users, communication partners, and purposes), there is also a log corresponding to the deterioration of the display waiting time. This is instability compared with the case of extracting the average value etc. by extracting the traffic data quantile, which has good compatibility with deterioration information and is not easily influenced by extreme abnormal values. Can respond appropriately.

また、同じＮＷ劣化影響下でも個別ｗｅｂページごとに表示待ち時間の劣化への影響の出方が異なるが、本実施の形態では、複数の対象ｗｅｂページを一括して扱い、その統計量で評価することで「典型的な劣化影響の出方」として定量化することが出来る。また、一般に劣化判定は二値（論理値）で行う。本実施の形態でも論理値の推定器を用いるが、本実施の形態では「劣化度」として定量的な値を用いるため、学習時の論理値ベクトルに変換する際に（複数の劣化レベルに対応した）複数用意し、それぞれ異なるレベルの推定器を用意することで、劣化度合いの判定も併せて実現することができる
なお、本実施の形態において、ｗｅｂ劣化分析部１１は、算出部及び比較部の一例である。トラフィック情報抽出部１２は、取得部の一例である。品質推定部１３は、学習部及び推定部の一例である。 In addition, in the present embodiment, a plurality of target web pages are handled in a lump and evaluated by their statistics, although the influence on the display latency degradation differs for each individual web page even under the same NW degradation effect. By doing so, it can be quantified as “typical appearance of degradation effect”. In general, deterioration determination is performed with binary values (logical values). In this embodiment, a logical value estimator is also used. However, in this embodiment, since a quantitative value is used as the “degradation degree”, when converting to a logical value vector during learning (corresponding to a plurality of deterioration levels) By preparing a plurality of estimators having different levels, it is possible to realize the determination of the degree of deterioration. In this embodiment, the web deterioration analysis unit 11 includes a calculation unit and a comparison unit. It is an example. The traffic information extraction unit 12 is an example of an acquisition unit. The quality estimation unit 13 is an example of a learning unit and an estimation unit.

以上、本発明の実施の形態について詳述したが、本発明は斯かる特定の実施形態に限定されるものではなく、特許請求の範囲に記載された本発明の要旨の範囲内において、種々の変形・変更が可能である。 Although the embodiments of the present invention have been described in detail above, the present invention is not limited to such specific embodiments, and various modifications can be made within the scope of the gist of the present invention described in the claims. Deformation / change is possible.

１０推定装置
１１ｗｅｂ劣化分析部
１２トラフィック情報抽出部
１３品質推定部
１００ドライブ装置
１０１記録媒体
１０２補助記憶装置
１０３メモリ装置
１０４ＣＰＵ
１０５インタ段階装置
Ｂバス DESCRIPTION OF SYMBOLS 10 Estimation apparatus 11 Web deterioration analysis part 12 Traffic information extraction part 13 Quality estimation part 100 Drive apparatus 101 Recording medium 102 Auxiliary storage apparatus 103 Memory apparatus 104 CPU
105 Inter stage device B bus

Claims

A calculation unit that calculates a representative value of the index indicating the quality of the display waiting time based on the measurement data of the display waiting time on the web page for each of a plurality of trials in which at least one of the region and the time zone is different.
For each trial, for the representative value related to the trial, a comparison unit that outputs a logical value indicating a comparison result with a threshold value;
For each trial, an acquisition unit that obtains parameters indicating the status of communication traffic in each of the regions and time zones involved in the trial;
A learning unit that learns the estimator using the logical value of each trial as an objective variable and the parameter of each trial as an explanatory variable;
An estimation unit that inputs the parameters in a certain region and time zone to the estimator and estimates the quality of display latency of the web page in the region and time zone;
The estimation apparatus characterized by having.

The calculation unit, for each trial, calculates a representative value of an index indicating the quality of display latency of the web page based on measurement data of display latency of each of a plurality of web pages.
The estimation apparatus according to claim 1.

For each trial, the comparison unit outputs a logical value group indicating a comparison result with a plurality of threshold values for the representative value related to the trial,
The learning unit uses the logical value group of each trial as an objective variable, the parameter of each trial as an explanatory variable, and learns a plurality of estimators corresponding to each of the plurality of threshold values,
The estimation unit inputs the parameter in a certain region and time zone to the estimator corresponding to each of the plurality of thresholds, and estimates the quality of the display waiting time of the web page in the region and time zone.
The estimation apparatus according to claim 1 or 2, wherein

The acquisition unit acquires the parameter for each of a plurality of unit times that divide the time zone, calculates a representative value of the parameter acquired for each unit time,
The learning unit uses the logical value of each trial as an objective variable, and trains an estimator using a representative value of the parameter of each trial as an explanatory variable.
The estimation apparatus according to claim 1, wherein:

The calculation unit calculates a representative value of an index indicating the quality of the display latency of the web page based on a plurality of measurement data of the display latency of the web page for each trial.
The estimation apparatus according to claim 1, wherein:

A calculation procedure for calculating a representative value indicating the quality of the display waiting time based on the measurement data of the display waiting time on the web page for each of a plurality of trials in which at least one of the region and the time zone is different.
For each trial, for the representative value related to the trial, a comparison procedure for outputting a logical value indicating a comparison result with a threshold value;
For each trial, an acquisition procedure for obtaining parameters indicating the status of communication traffic in each of the regions and time zones involved in the trial;
A learning procedure for learning an estimator using the logical value of each trial as an objective variable and the parameter of each trial as an explanatory variable;
An estimation procedure for inputting the parameters in a certain region and time zone to the estimator to estimate the quality of display latency of the web page in the region and time zone;
Is performed by a computer.

6. A program that causes a computer to function as each unit according to claim 1.