JP2019075030A

JP2019075030A - Estimation device, estimation method and program

Info

Publication number: JP2019075030A
Application number: JP2017202490A
Authority: JP
Inventors: 川口　銀河; Ginga Kawaguchi; 銀河川口
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2017-10-19
Filing date: 2017-10-19
Publication date: 2019-05-16
Anticipated expiration: 2037-10-19
Also published as: JP6618971B2

Abstract

To enable estimation of the quality of a display waiting time in which the influence of individual web pages has been eliminated.SOLUTION: An estimation device includes: a calculation unit that calculates a representative value of an index indicating a quality of a display waiting time, for each of a plurality of trials in which at least one of an area and a time zone is different, on the basis of measurement data of the display waiting time of a web page; a comparison unit that outputs a logical value indicating a result of comparison of the representative value associated with the trial with a threshold value for each of the trials; an extraction unit that extracts a parameter indicating a status of a communication traffic in the area and the time zone associated with the trial for each of the trials; a learning unit that causes an estimator to perform learning using the logical value of each of the trials as an objective variable and the parameter of each of the trials as an explanatory variable; and an estimation unit that inputs the parameters in a certain area and a certain time zone to the estimator to estimate the quality of the display waiting time of a web page in the area and the time zone.SELECTED DRAWING: Figure 2

Description

本発明は、推定装置、推定方法及びプログラムに関する。 The present invention relates to an estimation apparatus, an estimation method, and a program.

インターネットの利用が普及している中で、ｗｅｂブラウザを快適に利用できるかどうかが、ユーザが「インターネットを快適に利用できるか」について大きな位置を占めている。 With the widespread use of the Internet, whether or not a web browser can be used comfortably occupies a large position as to whether the user can comfortably use the Internet.

そのため、ユーザがｗｅｂページを閲覧する際の表示待ち時間を対象に品質管理することが重要である。 Therefore, it is important to perform quality control on display waiting time when a user browses a web page.

また、一般にＬＴＥ（Long Term Evolution）に代表されるようなモバイルネットワークでは、混雑状況が、エリア・時間帯で異なるため、エリアごとに利用品質が異なり、また時間帯によっても品質が異なる。 In addition, in a mobile network generally represented by LTE (Long Term Evolution), the congestion situation differs between areas and time zones, so the quality of use differs depending on areas, and the quality also varies depending on time zones.

エリア・時間帯ごとの品質を把握する方法としては、個別端末から実際にコンテンツアクセスを行い計測する方法（アクティブ計測）がある（例えば、特許文献１）。 As a method of grasping the quality for each area and time zone, there is a method (active measurement) in which content is actually accessed from an individual terminal and measurement is made (for example, Patent Document 1).

特開２０１７−９２６６１号公報JP, 2017-92661, A

しかしながら、各エリア・時間帯でアクティブ計測を行うのは経済的ではない。また、ｗｅｂページ・サーバは千差万別であり、個別のｗｅｂページごとに同じネットワークの品質劣化の影響下でも、表示待ち時間への影響に差異等があるため、ｗｅｂページの表示待ち時間の品質（劣化程度）の判断が困難である。 However, performing active measurement in each area and time zone is not economical. In addition, web page servers are different from one another, and even under the influence of the same network quality degradation for each individual web page, there is a difference in the impact on display latency, etc. It is difficult to judge the quality (degree of deterioration).

本発明は、上記の点に鑑みてなされたものであって、個別のｗｅｂページの影響が除去された表示待ち時間の品質の推定を可能とすることを目的とする。 The present invention has been made in view of the above, and it is an object of the present invention to enable estimation of the quality of display waiting time in which the influence of individual web pages is removed.

そこで上記課題を解決するため、推定装置は、地域及び時間帯の少なくともいずれか一方が異なる複数回の試行ごとに、ｗｅｂページの表示待ち時間の計測データに基づいて、前記表示待ち時間の品質を示す指標の代表値を算出する算出部と、前記試行ごとに、当該試行に係る前記代表値について、閾値との比較結果を示す論理値を出力する比較部と、前記試行ごとに、当該試行に係る地域及び時間帯のそれぞれにおける通信トラフィックの状況を示すパラメータを抽出する抽出部と、前記各試行の前記論理値を目的変数とし、前記各試行の前記パラメータを説明変数として推定器を学習させる学習部と、或る地域及び時間帯における前記パラメータを前記推定器に入力して、前記地域及び時間帯におけるｗｅｂページの表示待ち時間の品質を推定する推定部と、を有する。 Therefore, in order to solve the above problem, the estimation apparatus determines the quality of the display waiting time based on measurement data of the display waiting time of the web page for each of a plurality of trials in which at least one of the area and the time zone is different. For each trial, a calculation unit that calculates a representative value of the indicator to be shown, a comparison unit that outputs a logical value indicating a comparison result with the threshold value for the representative value related to the trial, and for each trial The extraction unit which extracts the parameter which shows the condition of the communication traffic in each of the area and the time zone which concerns, the learning which makes the aforementioned logical value of each trial the objective variable and the parameter of the trial the explanatory variable. Section and the parameters of a certain area and time zone are input to the estimator to display the quality of the display latency of the web page in the area and time zone Having an estimation unit for estimating for.

個別のｗｅｂページの影響が除去された表示待ち時間の品質の推定を可能とすることができる。 It is possible to enable estimation of the quality of display latency in which the influence of individual web pages has been removed.

本発明の実施の形態における推定装置１０のハードウェア構成例を示す図である。It is a figure which shows the hardware structural example of the estimation apparatus 10 in embodiment of this invention. 本発明の実施の形態における推定装置１０の機能構成例を示す図である。It is a figure which shows the function structural example of the estimation apparatus 10 in embodiment of this invention. ｗｅｂ劣化分析部１１が実行する処理手順の一例を説明するためのフローチャートである。It is a flowchart for demonstrating an example of the process sequence which the web degradation analysis part 11 performs. トラフィック情報抽出部１２が実行する処理手順の一例を説明するためのフローチャートである。It is a flowchart for demonstrating an example of the process sequence which the traffic information extraction part 12 performs. 学習段階において品質推定部１３が実行する処理手順の一例を説明するためのフローチャートである。It is a flowchart for demonstrating an example of the process sequence which the quality estimation part 13 performs in a learning stage. 推定段階において推定装置１０が実行する処理手順の一例を説明するためのフローチャートである。It is a flowchart for demonstrating an example of the process sequence which the estimation apparatus 10 performs in an estimation step.

以下、図面に基づいて本発明の実施の形態を説明する。図１は、本発明の実施の形態における推定装置１０のハードウェア構成例を示す図である。図１の推定装置１０は、それぞれバスＢで相互に接続されているドライブ装置１００、補助記憶装置１０２、メモリ装置１０３、ＣＰＵ１０４、及びインタ段階装置１０５等を有するコンピュータである。 Hereinafter, embodiments of the present invention will be described based on the drawings. FIG. 1 is a diagram showing an example of a hardware configuration of an estimation device 10 according to an embodiment of the present invention. The estimation device 10 in FIG. 1 is a computer having a drive device 100, an auxiliary storage device 102, a memory device 103, a CPU 104, an interstage device 105, and the like mutually connected by a bus B.

推定装置１０での処理を実現するプログラムは、ＣＤ−ＲＯＭ等の記録媒体１０１によって提供される。プログラムを記憶した記録媒体１０１がドライブ装置１００にセットされると、プログラムが記録媒体１０１からドライブ装置１００を介して補助記憶装置１０２にインストールされる。但し、プログラムのインストールは必ずしも記録媒体１０１より行う必要はなく、ネットワークを介して他のコンピュータよりダウンロードするようにしてもよい。補助記憶装置１０２は、インストールされたプログラムを格納すると共に、必要なファイルやデータ等を格納する。 A program for realizing the processing in the estimation device 10 is provided by a recording medium 101 such as a CD-ROM. When the recording medium 101 storing the program is set in the drive apparatus 100, the program is installed from the recording medium 101 to the auxiliary storage apparatus 102 via the drive apparatus 100. However, the installation of the program does not necessarily have to be performed from the recording medium 101, and may be downloaded from another computer via a network. The auxiliary storage device 102 stores the installed program and also stores necessary files and data.

メモリ装置１０３は、プログラムの起動指示があった場合に、補助記憶装置１０２からプログラムを読み出して格納する。ＣＰＵ１０４は、メモリ装置１０３に格納されたプログラムに従って推定装置１０に係る機能を実行する。インタ段階装置１０５は、ネットワークに接続するためのインタ段階として用いられる。 The memory device 103 reads out the program from the auxiliary storage device 102 and stores it when there is an instruction to start the program. The CPU 104 executes the function related to the estimation device 10 in accordance with the program stored in the memory device 103. The inter-stage device 105 is used as an inter-stage for connecting to a network.

図２は、本発明の実施の形態における推定装置１０の機能構成例を示す図である。図２において、推定装置１０は、ｗｅｂ劣化分析部１１、トラフィック情報抽出部１２及び品質推定部１３等を有する。これら各部は、推定装置１０にインストールされた１以上のプログラムが、ＣＰＵ１０４に実行させる処理により実現される。 FIG. 2 is a diagram showing an example of a functional configuration of the estimation device 10 according to the embodiment of the present invention. In FIG. 2, the estimation device 10 includes a web degradation analysis unit 11, a traffic information extraction unit 12, a quality estimation unit 13, and the like. These units are realized by processing that one or more programs installed in the estimation device 10 cause the CPU 104 to execute.

ｗｅｂ劣化分析部１１は、エリア（地域）及び時間帯の組み合わせによって区別されるＺ回の試行のそれぞれごとに、１種類以上（Ｋ種類）のｗｅｂページ（ｗｅｂコンテンツ）についての端末における表示待ち時間の計測結果の集合である端末計測データＤ１を分析する。ｗｅｂ劣化分析部１１は、分析の結果として、表示待ち時間の品質を示す指標の一例である劣化度等を出力する。 The web degradation analysis unit 11 is a display waiting time in the terminal for one or more (K types) of web pages (web contents) for each of Z trials distinguished by a combination of area (region) and time zone. Terminal measurement data D1 which is a set of measurement results of The web degradation analysis unit 11 outputs, as a result of the analysis, a degree of degradation or the like which is an example of an index indicating the quality of the display waiting time.

トラフィック情報抽出部１２は、上記各試行のエリア及び時間帯におけるネットワークのトラフィック計測データＤ２から、通信トラフィックの状況を示すパラメータ等を抽出（取得）する。 The traffic information extraction unit 12 extracts (acquires) a parameter or the like indicating the status of communication traffic from the traffic measurement data D2 of the network in the area and time zone of each trial.

品質推定部１３は、学習段階において、ｗｅｂ劣化分析部１１のから出力される情報とトラフィック情報抽出部１２によって抽出される情報とを１以上推定器に学習させる。品質推定部１３は、また、推定段階において、任意のエリア及び時間帯におけるトラフィック計測データＤ２が入力されると、当該エリア及び時間帯におけるｗｅｂページの表示待ち時間の劣化の程度を、１以上の推定器を用いて推定する。 At the learning stage, the quality estimation unit 13 causes the one or more estimators to learn the information output from the web degradation analysis unit 11 and the information extracted by the traffic information extraction unit 12. Also, when traffic measurement data D2 in an arbitrary area and time zone is input at the estimation stage, quality estimation unit 13 sets the degree of deterioration of the display page display waiting time in the area and time zone to one or more. Estimate using an estimator.

推定器は、例えば、ＳＶＭ（Support Vector Machine）である。但し、数値ベクトルから１つの論理値を推定可能であれば他の推定器が用いられてもよい。 The estimator is, for example, a SVM (Support Vector Machine). However, other estimators may be used as long as one logical value can be estimated from the numerical vector.

端末計測データＤ１の収集方法の一例について説明する。ここでは、計測対象のｗｅｂページのＵＲＬとして、ＵＲＬ＿１，ＵＲＬ＿２...ＵＲＬ＿Ｋ：（Ｋ個の計測対象ＵＲＬ；Ｋは１以上の任意の自然数（すなわち、１つのＵＲＬでもよい。））が設定されているとする。ＵＲＬは、特定のものに限定されないが、よく利用されるようなｗｅｂページを多様な種類を含むのが望ましい。 An example of a method of collecting the terminal measurement data D1 will be described. Here, URL_1, URL_2 ... URL_K: (K measurement target URLs; K is an arbitrary natural number of 1 or more (that is, one URL may be used)) is set as the URL of the measurement target web page. It is assumed that The URL is not limited to a specific one, but it is desirable to include various types of web pages that are often used.

試行ごとに以下が行われる。なお、試行とは、上記したように、計測対象のエリア及び時間帯（期間）の組み合わせによって区別される概念である。したがって、エリア及び時間帯の少なくともいずれか一方が異なると、試行は異なる。 For each trial: As described above, the trial is a concept that is distinguished by the combination of the area to be measured and the time zone (period). Therefore, if at least one of the area and the time zone is different, the trial is different.

（１）１つの試行に対応するエリア・時間帯について、Ｋ個の計測対象ＵＲＬについて、表示待ち時間を、ｗｅｂページを表示可能な端末を用いてＰ回計測する。すなわち、１回の試行についてＰ×Ｋ回の計測が行われる。但し、全てのＵＲＬについて、Ｐの値は共通でなくてもよい。また、Ｐ＝１であってもよい。すなわち、各ＵＲＬについて、必ずしも複数回の計測が行われなくてもよい。当該エリア及び時間帯の代表値を取得するため、同一エリア及び同一時間帯の範囲内で、ある程度異なる場所及び時刻で計測が行われるのが望ましいが、計測パターン（例えば、同一エリア及び同一時間帯で、場所及び時刻をどのように選択するか）は任意でよく、計測手法は、特定のものに限定されない。また、表示待ち時間を取得する方法も特定のものに限定されないが、navigation timing APIを用いてnavigationStartとloadEventEndとの時間差を用いる方法が利用されてもよい。なお、各試行におけるＰ回の計測は、集計時間幅Ｗ内に終了するとする。 (1) For the area / time zone corresponding to one trial, the display waiting time is measured P times using the terminal capable of displaying the web page for K measurement target URLs. That is, P × K measurements are performed for one trial. However, the value of P may not be common to all the URLs. Also, P may be 1. That is, measurement may not necessarily be performed multiple times for each URL. In order to obtain the representative value of the area and time zone, it is desirable that the measurement be performed at different places and times within the same area and time zone, but the measurement pattern (for example, the same area and time zone) The choice of location and time may be arbitrary, and the measurement method is not limited to a specific one. Also, the method for acquiring the display waiting time is not limited to a specific one, but a method using the time difference between navigationStart and loadEventEnd using the navigation timing API may be used. Note that it is assumed that the P measurements in each trial end within the totalized time width W.

（２）（１）をＺ通りの試行について実行する。 (2) Perform (1) for Z trials.

（１）及び（２）の実行の結果として取得される端末計測データＤ１は、以下において、「Ｌ（ｚ，ｙ，ｘ）」として表記される計測値（表示待ち時間）の集合である。ここで、Ｌ（ｚ，ｙ，ｘ）は、ｚ回目（ｚ：１〜Ｚ）試行のｙ番目ＵＲＬ（ｙ：１〜Ｋ）のｘ回目（ｘ：１〜Ｐ）計測での表示待ち時間を示す。 The terminal measurement data D1 acquired as a result of the execution of (1) and (2) is a set of measurement values (display waiting time) represented as "L (z, y, x)" below. Here, L (z, y, x) is the display waiting time at the x-th (x: 1 to P) measurement of the y-th URL (y: 1 to K) of the z-th (z: 1 to Z) trial Indicates

次に、トラフィック計測データＤ２の収集方法について説明する。こここでは、上記各試行に対応するエリア及び時間帯ごとの通信トラフィックの品質情報を、個別アクティブ計測なしに評価するため、例えば、当該エリア内の全て、又は（サンプリングしている場合は）一部の通信トラフィックを処理するルータ等の装置のモニタ用ポートにＤＰＩ（Deep Packet Inspection）装置を接続し、ＤＰＩ装置を用いてトラフィックを監視することで、トラフィック計測データＤ２を収集する。したがって、トラフィック計測データＤ２は、ＤＰＩ装置によって監視されるトラフィックの履歴情報（ログ）であってもよい。監視対象は、Ｚ回の試行のそれぞれに対応するエリア及び時間帯（時間幅Ｗ）のトラフィックとする（但し、表示待ち時間の計測に関するトラフィックに限定されない）。表示待ち時間の計測がモバイル・ＬＴＥであれば特定の基地局（ｅＮＢ）又は特定のセクタ（ＥＣＩ）におけるトラフィックが監視対象とされ、固定網であれば特定の収容ルータ等におけるトラフィックが監視対象とされるが、これらの基地局、セクタ、又は収容ルータ等の特定方法は、所定のものに限定されない。網上流でモニタ出来る権限を持つものは当然に把握できているものとする。 Next, a method of collecting traffic measurement data D2 will be described. Here, in order to evaluate communication traffic quality information for each area and time zone corresponding to each trial without individual active measurement, for example, all in the area or (if sampling) one A DPI (Deep Packet Inspection) device is connected to a monitoring port of a device such as a router that processes communication traffic, and traffic measurement data D2 is collected by monitoring the traffic using the DPI device. Therefore, the traffic measurement data D2 may be history information (log) of traffic monitored by the DPI device. The monitoring target is traffic in the area and time zone (time width W) corresponding to each of Z trials (however, the traffic is not limited to the traffic related to the display latency measurement). If the measurement of display latency is mobile or LTE, traffic in a specific base station (eNB) or a specific sector (ECI) is to be monitored, and if it is a fixed network, traffic in a specific accommodation router or the like is to be monitored. However, identification methods such as these base stations, sectors, and accommodating routers are not limited to predetermined ones. It is assumed that those who have the authority to monitor upstream of the network can be understood naturally.

以下、推定装置１０が実行する処理手順について説明する。図３は、ｗｅｂ劣化分析部１１が実行する処理手順の一例を説明するためのフローチャートである。 Hereinafter, the process procedure which the estimation apparatus 10 performs is demonstrated. FIG. 3 is a flowchart for explaining an example of the processing procedure executed by the web degradation analysis unit 11.

ステップＳ１０１において、ｗｅｂ劣化分析部１１は、Ｋ個のＵＲＬのそれぞれについて、表示待ち時間に関する基準値を算出する。例えば、或るＵＲＬに対する基準値であれば、当該ＵＲＬに係るＺ×Ｐ個の計測値の中央値、１０％値、最小値等が、基準値とされてもよい。又は、基準値は、推定装置１０のユーザによって事前に設定されてもよい。以下、ＵＲＬ＿１，ＵＲＬ＿２...ＵＲＬ＿Ｋに対する基準値を、Ｗｂ＿１，Ｗｂ＿２，．．Ｗｂ＿Ｋとする。この場合、中央値を基準値とするのであれば、Ｗｂ＿１の基準値は以下の通りである。
Ｗｂ＿１＝（Ｌ（ｚ，１，ｘ）の表示待ち時間の集合の中央値）
続いて、ｗｅｂ劣化分析部１１は、Ｚ回の試行ごとに、Ｋ個の各ＵＲＬに対する「劣化度（Ｄｗ（ｚ，ｙ，ｘ）」を以下のように算出する（Ｓ１０２）。
Ｄｗ（ｚ，ｙ，ｘ）＝Ｌ（ｚ，ｙ，ｘ）／Ｗｂ＿ｙ
すなわち、劣化度Ｄｗ（ｚ，ｙ，ｘ）は、基準値に対する計測値の割合であり、本実施の形態において、ｗｅｂページの表示待ち時間の品質（劣化の程度）を示す指標の一例である。なお、ｗｅｂページの表示待ち時間の品質を把握可能な方法であれば、他の方法によって劣化度Ｄｗ（ｚ，ｙ，ｘ）が算出されてもよい。 In step S101, the web degradation analysis unit 11 calculates a reference value regarding display waiting time for each of the K URLs. For example, if it is a reference value for a certain URL, a median, a 10% value, a minimum value or the like of Z × P measurement values of the URL may be used as the reference value. Alternatively, the reference value may be preset by the user of the estimation device 10. Hereinafter, the reference values for URL_1, URL_2 ... URL_K are Wb_1, Wb_2,. . Assume Wb_K. In this case, if the median is used as the reference value, the reference value of Wb_1 is as follows.
Wb_1 = (median of display latency set of L (z, 1, x))
Subsequently, the web degradation analysis unit 11 calculates the “deterioration degree (Dw (z, y, x)”) for each of the K URLs as follows, every Z trials (S102).
Dw (z, y, x) = L (z, y, x) / Wb_y
That is, the deterioration degree Dw (z, y, x) is a ratio of the measured value to the reference value, and in the present embodiment, is an example of an index indicating the quality (degree of deterioration) of the display waiting time of the web page. . The degradation degree Dw (z, y, x) may be calculated by another method as long as the quality of the display waiting time of the web page can be grasped.

続いて、ｗｅｂ劣化分析部１１は、Ｚ回の試行ごとに、「劣化度代表値（Ｄ＿ｚ）」を算出する（Ｓ１０３）。 Subsequently, the web degradation analysis unit 11 calculates the “deterioration degree representative value (D_z)” every time Z trials (S103).

具体的には、ｗｅｂ劣化分析部１１は、Ｄ＿ｚ＝（Ｄｗ（ｉ，ｊ，ｌ）のうち，ｉがｚに固定され、ｊ及びｌは、それぞれが取りうる全ての値である集合（すなわち、Ｐ×Ｋ個の要素を含む集合）を、全てのｚについて取り出し、各集合のそれぞれのｄ＿ｔｈ分位点（ｑｕａｎｔｉｌｅ）の値）を算出する。ｄ＿ｔｈは、分位点で与える劣化閾値であり、予め設定される。例えば、ｄ＿ｔｈが、０．８（すなわち、８０パーセンタイル）であれば、Ｄｗ（ｚ，ｊ，ｌ）の集合の８０％値（８０パーセンタイル）が算出される。なお、Ｋ＝１であり、かつ、Ｐ＝１である場合、劣化度代表値（Ｄ＿ｚ）＝劣化度（Ｄｗ（ｚ，1，１）とされてもよい。 Specifically, in the web degradation analysis unit 11, a set in which i is fixed to z and j and l are all possible values of D_z = (Dw (i, j, l) (i.e. , A set including P × K elements) is extracted for all z, and values of d_th quantiles of the respective sets are calculated. d_th is a deterioration threshold given at a quantile and is set in advance. For example, if d_th is 0.8 (ie, 80th percentile), 80% value (80th percentile) of the set of Dw (z, j, l) is calculated. If K = 1 and P = 1, the degradation degree representative value (D_z) may be set to the degradation degree (Dw (z, 1, 1)).

続いて、ｗｅｂ劣化分析部１１は、Ｚ回の試行ごとに「劣化度論理値（Ｄｂ＿（ｚ，ｓ））」を算出する（Ｓ１０４）。ここで、ｓは、Ｄ＿ｚに対する劣化判定閾値劣化判定閾値に対する変数であり、本実施の形態では、予め、（−ｍ，−（ｍ−１），...，−１，０，１，２，...ｎ−１，ｎ）のｍ＋ｎ＋１個だけ、多段階のＴｓｈ＿ｓが設定される。すなわち、ｍ及びｎは、劣化判定閾値Ｔｓｈ＿ｓの数（段階数）を規定するパラメータであり、予め設定される。なお、ｎ＝ｍでもよいし、ｎ≠ｍでもよい。また、推定器を１つのみ用いる場合には、ｍ＝ｎ＝０でよい（すなわち、Ｔｓｈ＿ｓは１つでよい）。 Subsequently, the web deterioration analysis unit 11 calculates “the deterioration degree logical value (Db_ (z, s))” every Z trials (S104). Here, s is a variable for the degradation determination threshold for D_z and the degradation determination threshold, and in the present embodiment, (−m, − (m−1),. ,..., N−1, n), multi-stage Tsh_s is set. That is, m and n are parameters that define the number (the number of steps) of the deterioration determination threshold Tsh_s, and are set in advance. Note that n may be m or n ≠ m. Also, if only one estimator is used, then m = n = 0 (i.e., Tsh_s may be one).

したがって、１回の試行に関して以下のように算出されるＤｂ＿（ｚ，ｓ）の個数は、ｍ＋ｎ＋１個である。
Ｄｂ＿（ｚ，ｓ）＝ｂｏｏｌ（Ｄ＿ｚ＞Ｔｓｈ＿ｓ）
すなわち、各試行について、劣化判定閾値Ｔｓｈ＿ｓごとに、Ｄ＿ｚと当該劣化判定閾値Ｔｓｈ＿ｓとが比較され、比較結果を示す論理値が、Ｄｂ＿（ｚ，ｓ）に代入される。 Therefore, the number of Db_ (z, s) calculated as follows for one trial is m + n + 1.
Db_ (z, s) = bool (D_z> Tsh_s)
That is, for each trial, D_z and the deterioration determination threshold Tsh_s are compared for each deterioration determination threshold Tsh_s, and a logical value indicating the comparison result is substituted into Db_ (z, s).

Ｄｂ＿（ｚ，ｓ）は、真であれば「劣化」、偽であれば「非劣化」を表す論理値群であり、閾値の違い（ｓごと）で、異なる強度の劣化を表している。なお、劣化度代表値Ｄ＿ｚは、ｓ（閾値の種類）に非依存であり、劣化判定閾値Ｔｓｈ＿ｓは試行（ｚ）に非依存）である。 Db_ (z, s) is a logical value group that represents “deterioration” if true, and “non-deterioration” if false, and represents different deterioration in intensity by the difference in threshold (every s). Note that the degradation degree representative value D_z is independent of s (type of threshold), and the degradation determination threshold Tsh_s is independent of trial (z).

続いて、トラフィック情報抽出部１２が実行する処理手順について説明する。図４は、トラフィック情報抽出部１２が実行する処理手順の一例を説明するためのフローチャートである。 Subsequently, a processing procedure performed by the traffic information extraction unit 12 will be described. FIG. 4 is a flowchart for explaining an example of the processing procedure executed by the traffic information extraction unit 12.

ステップＳ２０１において、トラフィック情報抽出部１２は、変数ｉに１を代入する。変数ｉは、図４の説明において、各試行を識別するための変数である。以下、ｉ番目の試行を「試行ｉ」という。 In step S201, the traffic information extraction unit 12 substitutes 1 into a variable i. The variable i is a variable for identifying each trial in the description of FIG. Hereinafter, the i-th trial is referred to as “trial i”.

続いて、トラフィック情報抽出部１２は、トラフィック計測データＤ２から、試行ｉに対応するログデータを取得する（Ｓ２０２）。具体的には、試行ｉに対応するエリア及び時間帯（集計時間幅Ｗ）に対応するログデータが取得される。以下、取得されたログデータに係る通信トラフィックを「対象トラフィック」という。試行ｉに対応するエリアは、エリアが基地局単位又はセクタ単位であり、ログデータにｅＮＢのＩＰアドレス又はＣＩＤが付与されている場合には、試行ｉに対応するエリアのｅＮＢのＩＰアドレスに基づいて、又は当該エリアのセクタのＣＩＤに基づいて、対象トラフィックのログデータを絞り込むことができる。一方、ｅＮＢのＩＰアドレス又はＣＩＤがログデータに付与されていない場合、試行ｉに対応するエリアのモニタ用ポートで観測できる範囲が、試行ｉに対応するエリアに関するログデータとなる。 Subsequently, the traffic information extraction unit 12 acquires log data corresponding to the trial i from the traffic measurement data D2 (S202). Specifically, log data corresponding to the area corresponding to the trial i and the time zone (totaling time width W) is acquired. Hereinafter, communication traffic related to the acquired log data is referred to as “target traffic”. In the area corresponding to trial i, the area is in base station units or sector units, and when the IP address or CID of the eNB is added to the log data, the IP address of the eNB in the area corresponding to trial i is used. The log data of the target traffic can be narrowed down based on the CID of the sector of the area or the area. On the other hand, when the IP address or CID of the eNB is not assigned to the log data, the range that can be observed by the monitoring port of the area corresponding to the trial i is the log data regarding the area corresponding to the trial i.

続くステップＳ２０３〜Ｓ２０７において、トラフィック情報抽出部１２は、対象トラフィックに係るログデータから、試行ｉに係る通信トラフィックの状況を示すパラメータを抽出する。 In the subsequent steps S203 to S207, the traffic information extraction unit 12 extracts a parameter indicating the status of the communication traffic related to the trial i from the log data related to the target traffic.

ステップＳ２０３において、トラフィック情報抽出部１２は、対象トラフィックの特定のポート番号に係る時間内スループット及び総転送量に基づいて、以下のステップＳ２０３−１〜Ｓ２０３−３を実行することにより、試行ｉに関する低スループットの代表値を算出する。 In step S203, the traffic information extraction unit 12 relates to the trial i by executing the following steps S203-1 to S203-3 based on the in-time throughput and the total transfer amount related to the specific port number of the target traffic. Calculate low throughput representative values.

ステップＳ２０３−１において、トラフィック情報抽出部１２は、対象トラフィックから、以下の（１）及び（２）の条件を満たすＴＣＰフローを抽出する。
（１）ＴＣＰポートがｔｈｐｔ＿ｐｏｒｔであること。ここで、ｔｈｐｔ＿ｐｏｒｔは、事前設定されるパラメータの一つであり、集計対象のＴＣＰスループットのポート番号を示す。本実施の形態において、ｔｈｐｔ＿ｐｏｒｔの値は「８０」である。
（２）更に、事前設定パラメータとして、ｔｈ＿ｔｈｐｔ＿Ｌ及びｔｈ＿ｔｈｐｔ＿Ｕが設定されていれば、（１）に該当するＴＣＰフローのうち、総転送量がｔｈ＿ｔｈｐｔ＿Ｌ以上かつｔｈ＿ｔｈｐｔ＿Ｕ以下のＴＣＰフローであること。ここで、ｔｈ＿ｔｈｐｔ＿Ｌは、抽出対象の下限フローサイズであり、ｔｈ＿ｔｈｐｔ＿Ｕは、抽出対象の上限フローサイズを示す。なお、ｔｈ＿ｔｈｐｔ＿Ｌ及びｔｈ＿ｔｈｐｔ＿Ｕのいずれについても、０が設定された場合には、（２）の条件は無効とされる。 In step S203-1, the traffic information extraction unit 12 extracts, from the target traffic, a TCP flow that satisfies the following conditions (1) and (2).
(1) The TCP port is thpt_port. Here, thpt_port is one of the preset parameters, and indicates the TCP throughput port number to be aggregated. In the present embodiment, the value of thpt_port is “80”.
(2) Furthermore, if th_thpt_L and th_thpt_U are set as pre-set parameters, the TCP flow having a total transfer amount of th_thpt_L or more and th_thpt_U or less among the TCP flows corresponding to (1). Here, th_thpt_L is a lower limit flow size to be extracted, and th_thpt_U indicates an upper limit flow size to be extracted. When 0 is set for both th_thpt_L and th_thpt_U, the condition (2) is invalidated.

ステップＳ２０３−２において、トラフィック情報抽出部１２は、試行ｉの時間帯（集計時間幅Ｗ）を分割（細分化）する集計単位時間τごとに、抽出されたＴＣＰフローごとのスループット値（時間内転送量／τ）を集計し、当該スループット値の集合のｔｈｐｔ＿ｐ１分位点の値を算出する。 In step S203-2, the traffic information extraction unit 12 extracts the throughput value (within time) of each extracted TCP flow for each aggregation unit time τ for dividing (segmenting) the time zone of the trial i (aggregation time width W). Transfer amount / τ) is calculated, and the value of thpt_p1 quantile of the set of throughput values is calculated.

ここで、ｔｈｐｔ＿ｐ１及び以下のｔｈｐｔ＿ｐ２は、事前設定パラメータの一つであり、分位点（パーセンタイル）によって指定される、スループット値の抽出閾値である。したがって、例えば、ｔｈｐｔ＿ｐ１が０．２の場合、集計されたスループット値の集合の２０パーセンタイルが算出される。また、τも、事前設定パラメータの一つであり、Ｗ＝τ＊κとする（κは整数）。したがって、ステップＳ２０３−２は、κ回繰り返され、κ個のｔｈｐｔ＿ｐ１分位点の値の組が得られる。 Here, thpt_p1 and the following thpt_p2 are one of preset parameters, and are extraction threshold values of throughput values designated by quantiles (percentiles). Therefore, for example, when thpt_p1 is 0.2, the 20th percentile of the set of aggregated throughput values is calculated. Also, τ is one of preset parameters, and W = τ * κ (κ is an integer). Thus, step S203-2 is repeated κ times to obtain a set of κ thpt_p1 quantile values.

ステップＳ２０３−３において、トラフィック情報抽出部１２は、κ個の値の組におけるｔｈｐｔ＿ｐ２（例えば、０．２等）分位点の値を算出する。ステップＳ２０３−３にて算出されたｔｈｐｔ＿ｐ２分位点の値が、スループットの代表値（以下、「Ｔｈ１＿ｚ」という（ｚは１．．Ｚ）。）である。 In step S203-3, the traffic information extraction unit 12 calculates the value of the thpt_p2 (for example, 0.2 or the like) quantile in the set of 個 values. The value of thpt_p2 quantile point calculated in step S203-3 is a representative value of throughput (hereinafter, referred to as “Th1_z” (z is 1 .. Z)).

ステップＳ２０３に続いて、トラフィック情報抽出部１２は、対象トラフィック内のＴＣＰフローのａｃｋ応答等から計算したＲＴＴサンプルに基づいて、以下のステップＳ２０４−１〜Ｓ２０４−３を実行することにより、試行ｉに関する高ＲＴＴの代表値を算出する（Ｓ２０４）。なお、ＴＣＰのパケット列からＲＴＴ（Round Trip Time）を算出する方法は、所定のものに限定されない。既存の方法の一例として、パケット分析ツールｗｉｒｅｓｈａｒｋにおいてｔｃｐ．ａｎａｌｙｓｉｓ．ａｃｋ＿ｒｔｔで読み取る方法が挙げられる。 Subsequent to step S203, the traffic information extraction unit 12 executes the following steps S204-1 to S204-3 based on the RTT sample calculated from the ack response of the TCP flow in the target traffic, etc. The representative value of the high RTT with respect to is calculated (S204). The method of calculating the RTT (Round Trip Time) from the TCP packet string is not limited to a predetermined one. As an example of the existing method, tcp. analysis. There is a method to read by ack_rtt.

ステップＳ２０４−１において、トラフィック情報抽出部１２は、試行ｉの集計単位時間τごとに、対象トラフィックに対するＲＴＴデータ群を抽出する。 In step S204-1, the traffic information extraction unit 12 extracts an RTT data group for the target traffic for each aggregation unit time τ of trial i.

ステップＳ２０４−２において、トラフィック情報抽出部１２は、試行ｉの集計単位時間τごとに、抽出されたＲＴＴデータ群のｒｔｔ＿ｐ１分位点の値を算出する。なお、ステップＳ２０４−１では、κ個のＲＴＴデータ群が抽出されている。したがって、ステップＳ２０４−２では、κ個のｒｔｔ＿ｐ１分位点の値の組が算出される。なお、ｒｔｔ＿ｐ１及び以下のｔｈｐｔ＿ｐ２は、事前設定パラメータの一つであり、スループット抽出閾値を分位点（例えば、０．１，０．２等）で指定する値である。 In step S204-2, the traffic information extraction unit 12 calculates the value of the rtt_p1 quantile of the extracted RTT data group for each aggregation unit time τ of trial i. In step S204-1, κ RTT data groups are extracted. Therefore, in step S204-2, a set of values of κ rtt_p1 quantiles is calculated. Note that rtt_p1 and the following thpt_p2 are one of preset parameters, and are values for specifying a throughput extraction threshold with a quantile (for example, 0.1, 0.2, etc.).

ステップＳ２０４−３において、トラフィック情報抽出部１２は、κ個のｒｔｔ＿ｐ１分位点の値のｒｔｔ＿ｐ２分位点の値を算出する。ステップＳ２０４−３で算出されたｒｔｔ＿ｐ２分位点の値が、高ＲＴＴの代表値（以下、「Ｒｔｔ１＿ｚ」という（ｚは１．．Ｚ）。）である。 In step S204-3, the traffic information extraction unit 12 calculates the value of the rtt_p2 quantile of the values of 個 rtt_p1 quantiles. The value of the rtt_p2 quantile point calculated in step S204-3 is a representative value of the high RTT (hereinafter, referred to as "Rtt1_z" (z is 1 .. Z)).

ステップＳ２０４に続いて、トラフィック情報抽出部１２は、対象トラフィック内のＴＣＰフローの解析から計算したτごとのＴＣＰ再送発生比率（ＴＣＰパケット中の再送パケットの比率）に基づいて、以下のステップＳ２０５−１〜Ｓ２０５−２を実行することにより、試行ｉに関する再送率の代表値を算出する（Ｓ２０５）。なお、ＴＣＰのパケット列から再送を検出する方法は、所定のものに限定されない。既存の方法の一例として、パケット分析ツールｗｉｒｅｓｈａｒｋにおいてｔｃｐ．ａｎａｌｙｓｉｓ．ｒｅｔｒａｎｓｍｉｓｓｉｏｎ等のパラメータから評価する方法が挙げられる。 Subsequent to step S204, the traffic information extraction unit 12 performs the following steps S205-based on the TCP retransmission occurrence ratio (ratio of retransmission packets in the TCP packet) for each τ calculated from the analysis of the TCP flow in the target traffic. By executing 1 to S205-2, the representative value of the retransmission rate for trial i is calculated (S205). Note that the method of detecting retransmission from a TCP packet string is not limited to a predetermined one. As an example of the existing method, tcp. analysis. There is a method of evaluating from parameters such as retransmission.

ステップＳ２０５−１において、トラフィック情報抽出部１２は、集計単位時間τごとに、対象トラフィックに対しての再送率を抽出する。したがって、κ個の再送率の値の組が得られる。 In step S205-1, the traffic information extraction unit 12 extracts the retransmission rate for the target traffic for each aggregation unit time τ. Thus, a set of κ retransmission rate values is obtained.

ステップＳ２０５−２において、トラフィック情報抽出部１２は、κ個の再送率のｒｅｔ＿ｐ分位点の値を算出する。なお、ｒｅｔ＿ｐは、事前設定パラメータの一つであり、再送率抽出閾値を分位点（例えば、０．９，０．８等）で指定する値である。このｒｅｔ＿ｐ分位点の値が再送率の代表値（以下、「Ｒｅｔ１＿ｚ」という（ｚは１．．Ｚ）。）である。 In step S205-2, the traffic information extraction unit 12 calculates the value of the ret_p quantile of κ retransmission rates. Note that ret_p is one of preset parameters, and is a value for specifying a retransmission rate extraction threshold with a quantile (for example, 0.9, 0.8, etc.). The value of the ret_p quantile is a representative value of the retransmission rate (hereinafter, referred to as “Ret1_z” (z is 1 .. Z)).

ステップＳ２０５に続いて、トラフィック情報抽出部１２は、以下のステップＳ２０６−１〜Ｓ２０６−３を実行することにより、ＤＮＳ応答時間の代表値を算出する（Ｓ２０６）。ＤＮＳ応答時間とは、対象トラフィック内のＤＮＳ検索の解析から計算したＤＮＳ検索からＤＮＳ応答までの時間である。 Subsequent to step S205, the traffic information extraction unit 12 executes the following steps S206-1 to S206-3 to calculate a representative value of DNS response time (S206). DNS response time is the time from DNS search to DNS response calculated from analysis of DNS search in the target traffic.

ステップＳ２０６−１において、トラフィック情報抽出部１２は、試行ｉの集計単位時間τごとに、対象トラフィックに対してのＤＮＳ−ＲＴＴデータ集合を抽出する。 In step S206-1, the traffic information extraction unit 12 extracts the DNS-RTT data set for the target traffic for each aggregation unit time τ of the trial i.

ステップＳ２０６−２において、トラフィック情報抽出部１２は、試行ｉの集計単位時間τごとに、抽出されたＤＮＳ−ＲＴＴデータ集合について、ｄｎｓｔ＿ｐ１分位点の値を算出する。なお、ステップＳ２０６−１では、κ個のＤＮＳ−ＲＴＴデータ集合が抽出されている。したがって、ステップＳ２０６−２では、κ個のｄｎｓｔ＿ｐ１分位点の値の組が算出される。なお、ｄｎｓｔ＿ｐ１及び以下のｄｎｓｔ＿ｐ２は、事前設定パラメータの一つであり、ＤＮＳ応答時間抽出閾値を分位点（例えば、０．９，０．８等）で指定する値である。 In step S206-2, the traffic information extraction unit 12 calculates the value of the dnst_p1 quantile point for the extracted DNS-RTT data set for each aggregation unit time τ of the trial i. In step S206-1, κ DNS-RTT data sets are extracted. Therefore, in step S206-2, a set of values of κ dnst_p1 quantiles is calculated. Note that dnst_p1 and the following dnst_p2 are one of preset parameters, and are values for specifying a DNS response time extraction threshold with a quantile point (for example, 0.9, 0.8, etc.).

ステップＳ２０６−３において、トラフィック情報抽出部１２は、κ個のｄｎｓｔ＿ｐ１分位点の値のｄｎｓｔ＿ｐ２分位点の値を算出する。このｄｎｓｔ＿ｐ２分位点の値が、ＤＮＳ応答時間（ＤＮＳ−ＲＴＴ）の代表値（以下、「Ｄｒｔ１＿ｚ」という（ｚは１．．Ｚ）。）である。 In step S206-3, the traffic information extraction unit 12 calculates the value of the dnst_p2 quantile of the values of dn dnst_p1 quantiles. The value of the dnst_p2 quantile is a representative value of the DNS response time (DNS-RTT) (hereinafter, referred to as “Drt1_z” (z is 1 .. Z)).

ステップＳ２０６に続いて、トラフィック情報抽出部１２は、以下のステップＳ２０７−１〜Ｓ２０７−３を実行することにより、ＤＮＳ応答率の代表値を算出する（Ｓ２０７）。ＤＮＳ応答率とは、モニタ対象トラフィック内のＤＮＳ検索の解析から計算したＤＮＳ検索に対し応答が有った比率である。 Subsequent to step S206, the traffic information extraction unit 12 executes the following steps S207-1 to S207-3 to calculate a representative value of the DNS response rate (S207). The DNS response rate is the ratio of responses to DNS searches calculated from analysis of DNS searches in monitored traffic.

ステップＳ２０７−１において、トラフィック情報抽出部１２は、試行ｉの集計単位時間τごとに、対象トラフィックに対してのＤＮＳ検索数と当該検索に対するＤＮＳ応答の数とを抽出する。なお、応答は、τより後の時間になる場合もあるため、ｄｎｓ＿ｔｉｍｅｏｕｔ経過時までを集計対象とする。ｄｎｓ＿ｔｉｍｅｏｕｔは、事前設定パラメータの一つであり、ＤＮＳ応答率算出時のタイムアウト時間である。 In step S207-1, the traffic information extraction unit 12 extracts the number of DNS searches for the target traffic and the number of DNS responses for the search for each aggregation unit time τ of the trial i. In addition, since a response may become time after (tau), it is counted until dns_timeout progress. dns_timeout is one of preset parameters, and is a timeout time at the time of DNS response rate calculation.

ステップＳ２０７−２において、トラフィック情報抽出部１２は、試行ｉの集計単位時間τごとに、応答率（ＤＮＳ応答数／ＤＮＳ検索数）を算出する。検索数が０のτについては値なしとする。なお、ステップＳ２０７−２では、κ個の応答率が算出される。したがって、κ個の応答率の組が得られる。 In step S207-2, the traffic information extraction unit 12 calculates the response rate (the number of DNS responses / the number of DNS searches) for each aggregation unit time τ of the trial i. There is no value for τ with a search number of 0. In step S207-2, κ response rates are calculated. Therefore, a set of κ response rates is obtained.

ステップＳ２０７−３において、トラフィック情報抽出部１２は、κ個の応答率の組から「値なし」のものがあれば除外し、κ個（除外してκ未満のこともある）の値のｄｎｓｒ＿ｐ分位点の値を算出する。ｄｎｓｒ＿ｐは、事前設定パラメータの一つであり、ＤＮＳ応答率抽出閾値を分位点（例えば、０．２等）で指定する値である。このｄｎｓｒ＿ｐ分位点の値が、ＤＮＳ応答率の代表値（以下、「Ｄｒｐ１＿ｚ」という（ｚは１．．Ｚ）。）である。 In step S207-3, the traffic information extraction unit 12 excludes “no value” from the set of κ response rates, and dnsr_p of values of ((may be excluded less than κ). Calculate the quantile value. dnsr_p is one of preset parameters, and is a value for specifying a DNS response rate extraction threshold with a quantile (for example, 0.2 or the like). The value of the dnsr_p quantile is a representative value of the DNS response rate (hereinafter, referred to as “Drp1_z” (z is 1 .. Z)).

ステップＳ２０２〜Ｓ２０７は、Ｚ回の全ての試行について実行される（Ｓ２０８、Ｓ２０９）。その結果、スループット代表値Ｔｈ１として、以下のようなベクトルが得られる。
Ｔｈ１＝（Ｔｈ＿１，Ｔｈ１＿２，Ｔｈ１＿３…Ｔｈ１＿ｚ）
なお、ｔｈｐｔ＿ｐ１，ｔｈｐｔ＿ｐ２の組が、複数設定されてもよい。その場合、当該組毎に、Ｔｈ２＿ｚ，Ｔｈ３＿ｚ等のベクトルが得られる。 Steps S202 to S207 are performed for all the trials Z times (S208, S209). As a result, the following vectors are obtained as the throughput representative value Th1.
Th1 = (Th_1, Th1_2, Th1_3 ... Th1_z)
A plurality of sets of thpt_p1 and thpt_p2 may be set. In that case, vectors such as Th2_z and Th3_z are obtained for each set.

また、高ＲＴＴ代表値Ｒｔｔ１として、以下のようなベクトルが得られる。
Ｒｔｔ１＝（Ｒｔｔ１＿１，Ｒｔｔ１＿２，…Ｒｔｔ１＿ｚ）
なお、ｒｔｔ＿ｐ１，ｒｔｔ＿ｐ２の組が、複数設定されてもよい。その場合、当該組ごとに、Ｒｔｔ２＿ｚ，Ｒｔｔ３＿ｚ等のベクトルが得られる。 Also, the following vector is obtained as the high RTT representative value Rtt1.
Rtt1 = (Rtt1_1, Rtt1_2, ... Rtt1_z)
A plurality of sets of rtt_p1 and rtt_p2 may be set. In that case, vectors such as Rtt2_z and Rtt3_z are obtained for each set.

また、再送率代表値Ｒｅｔ１として、以下のようなベクトルが得られる。
Ｒｅｔ１＝（Ｒｅｔ１＿１，Ｒｅｔ１＿２，…Ｒｅｔ１＿ｚ）
なお、ｒｅｔ＿ｐについて、複数の値が設定されてもよい。その場合、当該値ごとに、Ｒｅｔ２＿ｚ，Ｒｅｔ３＿ｚ等のベクトルが得られる。 Also, the following vector is obtained as the retransmission rate representative value Ret1.
Ret1 = (Ret1_1, Ret1_2, ... Ret1_z)
A plurality of values may be set for ret_p. In that case, vectors such as Ret2_z and Ret3_z are obtained for each value.

また、ＤＮＳ応答時間代表値Ｄｒｔ１として、以下のようなベクトルが得られる。
Ｄｒｔ１＝（Ｄｒｔ１＿１，Ｄｒｔ１＿２，…Ｄｒｔ１＿ｚ）
なお、ｄｎｓｔ＿ｐ１、ｄｎｓｔ＿ｐ２の組が複数設定されてもよい。その場合、当該組ごとに、Ｄｒｔ２＿ｚ，Ｄｒｔ３＿ｚ等のベクトルが得られる。 Also, the following vector is obtained as the DNS response time representative value Drt1.
Drt1 = (Drt1_1, Drt1_2, ... Drt1_z)
A plurality of sets of dnst_p1 and dnst_p2 may be set. In that case, vectors such as Drt2_z and Drt3_z are obtained for each set.

更に、ＤＮＳ応答率Ｄｒｐ１として、以下のようなベクトルが得られる。
Ｄｒｐ１＝（Ｄｒｐ１＿１，Ｄｒｐ１＿２，…Ｄｒｐ１＿ｚ）
なお、ｄｎｓｒ＿ｐについて、複数の値が設定されてもよい。その場合、当該値ごとに、Ｄｒｐ２＿ｚ、Ｄｒｐ３＿ｚ等のベクトルが得られる。 Furthermore, the following vector is obtained as the DNS response rate Drp1.
Drp1 = (Drp1_1, Drp1_2, ... Drp1_z)
A plurality of values may be set for dnsr_p. In that case, vectors such as Drp2_z and Drp3_z are obtained for each value.

続いて、トラフィック情報抽出部１２は、それぞれが長さＺの上記各代表値のベクトルを組（行列）として、以下のＤＰＩｖｅｃを出力する（Ｓ２１０）。
ＤＰＩｖｅｃ＝（Ｔｈ１（，Ｔｈ２，Ｔｈ３．．），Ｒｔｔ１（，Ｒｔｔ２，Ｒｔｔ３．．），Ｒｅｔ１（，Ｒｅｔ２，Ｒｅｔ３．．），Ｄｒｔ１（，Ｄｒｔ２，Ｄｒｔ３），Ｄｒｐ１（，Ｄｒｐ２，Ｄｒｐ３．．））
なお、（，Ｔｈ２，Ｔｈ３．．）、（，Ｒｔｔ２，Ｒｔｔ３．．）、（，Ｒｅｔ２，Ｒｅｔ３．．）、（，Ｄｒｔ２，Ｄｒｔ３）及び（，Ｄｒｐ２，Ｄｒｐ３．．）は、事前設定パラメータとしての閾値（分位点）の組の数によって、構成するベクトルの数が変化することを示す。 Subsequently, the traffic information extraction unit 12 outputs the following DPIvec as a set (matrix) of vectors of the respective representative values each having the length Z (S210).
DPIvec = (Th1 (, Th2, Th3 ..), Rtt1 (, Rtt2, Rtt3 ..), Ret1 (, Ret2, Ret3 ..), Drt1 (, Drt2, Drt3), Drp1 (, Drp2, Drp3 ..) )
Note that (, Th2, Th3 ..), (, Rtt2, Rtt3 ..), (, Ret2, Ret3 ..), (, Drt2, Drt3) and (, Drp2, Drp3 ..) are used as preset parameters. It is shown that the number of sets of vectors is changed by the number of sets of threshold values (quantile points) of.

続いて、学習段階において品質推定部１３が実行する処理手順について説明する。図５は、学習段階において品質推定部１３が実行する処理手順の一例を説明するためのフローチャートである。 Subsequently, a processing procedure performed by the quality estimation unit 13 in the learning stage will be described. FIG. 5 is a flowchart for explaining an example of the processing procedure performed by the quality estimation unit 13 in the learning stage.

ステップＳ３０１において、品質推定部１３は、ｗｅｂ劣化分析部１１によって算出された、ｍ＋ｎ＋１個の長さＺの以下の論理値ベクトルである劣化度論理値Ｄｂ＿（ｚ，ｓ）を入力する。
Ｄｂ＿（ｚ，ｓ）＝（Ｄｂ＿（ｚ，−ｍ），Ｄｂ＿（ｚ，−ｍ＋１），…Ｄｂ＿（ｚ，ｎ−１），Ｄｂ＿（ｚ，ｎ））
なお、Ｄｂ＿（ｚ，ｓ）は、ｓを固定すると要素数Ｚ個のベクトルである。ｓはｍ＋ｎ＋１個の値を取るので、Ｄｂ＿（ｚ，ｓ）は、ｍ＋ｎ＋１個のＺ個のベクトルを要素とするベクトルとなる。 In step S301, the quality estimation unit 13 inputs the degradation degree logic value Db_ (z, s), which is the following logic value vector of m + n + 1 lengths of Z calculated by the web degradation analysis unit 11.
Db_ (z, s) = (Db_ (z, -m), Db_ (z, -m + 1), ... Db_ (z, n-1), Db_ (z, n))
Note that Db_ (z, s) is a vector of Z element number when s is fixed. Since s takes m + n + 1 values, Db_ (z, s) is a vector having m + n + 1 Z vectors as elements.

続いて、品質推定部１３は、トラフィック情報抽出部１２によって出力された、以下の代表値のベクトルの組（行列）を入力する（Ｓ３０２）。
ＤＰＩｖｅｃ＝（Ｔｈ１（，Ｔｈ２，Ｔｈ３．．），Ｒｔｔ１（，Ｒｔｔ２，Ｒｔｔ３．．），Ｒｅｔ１（，Ｒｅｔ２，Ｒｅｔ３．．），Ｄｒｔ１（，Ｄｒｔ２，Ｄｒｔ３），Ｄｒｐ１（，Ｄｒｐ２，Ｄｒｐ３．．））
続いて、品質推定部１３は、ｍ＋ｎ＋１個（すなわち、１以上）の推定器のインスタンスを生成する（Ｓ３０３）。すなわち、劣化判定閾値Ｔｓｈ＿ｓの数と同数の（劣化判定閾値Ｔｓｈ＿ｓごとの）推定器が用意される。各推定器を「Ｓ＿ｓ」と表記する。ここで、ｓは、ｍ＋ｎ＋１種類のＴｓｈ＿ｓに対応する。 Subsequently, the quality estimation unit 13 inputs the following set (vector) of vectors of representative values output by the traffic information extraction unit 12 (S302).
DPIvec = (Th1 (, Th2, Th3 ..), Rtt1 (, Rtt2, Rtt3 ..), Ret1 (, Ret2, Ret3 ..), Drt1 (, Drt2, Drt3), Drp1 (, Drp2, Drp3 ..) )
Subsequently, the quality estimation unit 13 generates m + n + 1 (ie, one or more) estimator instances (S303). That is, the same number of estimators (for each degradation determination threshold Tsh_s) as the number of degradation determination thresholds Tsh_s is prepared. Each estimator is denoted "S_s". Here, s corresponds to m + n + 1 types of Tsh_s.

続いて、品質推定部１３は、Ｄｂ＿（ｚ，ｓ）を目的変数とし、ＤＰＩｖｅｃを説明変数として、各Ｓ＿ｓを学習させる（Ｓ３０４）。なお、各Ｓ＿ｓは、ｓが共通するＤｂ＿（ｚ，ｓ）について学習を行う。これでＳ＿ｓは学習済みとなり、ｗｅｂページの表示待ち時間の品質の推定に用いることが可能な状態となる。 Subsequently, the quality estimation unit 13 learns each S_s with Db_ (z, s) as an objective variable and DPIvec as an explanatory variable (S304). Each S_s learns about Db_ (z, s) common to s. In this way, S_s has been learned and can be used to estimate the quality of the display page waiting time.

続いて、推定段階において推定装置１０が実行する処理手順について説明する。図６は、推定段階において推定装置１０が実行する処理手順の一例を説明するためのフローチャートである。 Then, the process sequence which the estimation apparatus 10 performs in an estimation step is demonstrated. FIG. 6 is a flowchart for explaining an example of the processing procedure performed by the estimation device 10 in the estimation stage.

図６の処理手順の前提として、推定対象のエリア及び時間帯に対応するトラフィック計測データＤ２（例えば、ＤＰＩのログデータ）が収集されているとする。当該トラフィック計測データＤ２を、以下「推定対象計測データ」という。なお、ｗｅｂページの表示待ち時間の計測は不要である。当該表示待ち時間の劣化の度合いが目的変数（推定対象）だからである。 As a premise of the processing procedure of FIG. 6, it is assumed that traffic measurement data D2 (for example, log data of DPI) corresponding to the area and time zone to be estimated are collected. The traffic measurement data D2 is hereinafter referred to as "estimated target measurement data". It is not necessary to measure the display waiting time of the web page. This is because the degree of deterioration of the display waiting time is the target variable (target of estimation).

ステップＳ４０１において、トラフィック情報抽出部１２は、推定対象計測データを入力する。続くステップＳ４０２〜Ｓ４０６において、トラフィック情報抽出部１２は、図４のステップＳ２０３〜Ｓ２０７と同様の処理を実行して、推定対象計測データから各代表値を算出（抽出）する。但し、推定対象のエリア及び時間帯は１つであるため、Ｚ＝１である。 In step S401, the traffic information extraction unit 12 inputs estimation target measurement data. In the subsequent steps S402 to S406, the traffic information extraction unit 12 executes the same processing as in steps S203 to S207 of FIG. 4 to calculate (extract) each representative value from the estimation target measurement data. However, since there is one area and time zone to be estimated, Z = 1.

その結果、トラフィック情報抽出部１２は、図５のステップＳ２１０と同様に、長さＺ＝１である、ＤＰＩｖｅｃと同じ形式の行列を得る。以下、当該行列を「ＤＰＩｖｅｃ＿ｎｅｗ」という。 As a result, the traffic information extraction unit 12 obtains a matrix of the same format as DPIvec, which has a length Z = 1, as in step S210 of FIG. Hereinafter, the matrix is referred to as "DPIvec_new".

続いて、品質推定部１３は、以下のように、各Ｓ＿ｓ（各推定器）に対してＤＰＩｖｅｃ＿ｎｅｗを入力することで、各劣化判定閾値Ｔｓｈ＿ｓに応じた、劣化か否かの論理値の組（ベクトル）であるＥｓｔ＿ｓを得る（Ｓ４０７）
（複数のｓに対し）Ｅｓｔ＿ｓ＝Ｓ＿ｓ（ＤＰＩｖｅｃ＿ｎｅｗ）
なお、Ｅｓｔ＿ｓは、ｍ＋ｎ＋１の長さの論理値の組（ベクトル）となる。 Subsequently, the quality estimation unit 13 inputs a set of DPIvec_new to each S_s (each estimator) as described below to set a set of logical values (whether or not to be degraded) according to each degradation determination threshold Tsh_s. Get Est_s which is a vector) (S407)
Est_s = S_s (DPIvec_new) (for multiple s)
Note that Est_s is a set (vector) of logical values of length m + n + 1.

典型的には、Ｔｓｈ＿ｓの値が最大のｓについて真（劣化）であり、Ｔｓｈ＿ｓが順に小さくなるようｓを変えていくと、或るｓがθであるときに偽（非劣化）となり、それ以下のＴｓｈ＿ｓでは全部偽（非劣化）というパターンとなる（但し、全部偽、全部真、又は順番が壊れていることも起こりうる。）。 Typically, when the value of Tsh_s is true (deterioration) for the maximum s, and s is changed so that Tsh_s becomes smaller in order, it becomes false (non-deterioration) when some s is θ, In the following Tsh_s, the pattern is all false (non-degraded) (although all false, all true, or the order may be broken).

品質推定部１３は、Ｅｓｔ＿ｓからこのようなθを探索し、探索されたθに対応したＴｓｈ＿θを「推定された劣化度」（推定対象のエリア及び時間帯に対する劣化度の推定値）として出力する（Ｓ４０９）。具体的には、例えば、全部偽（非劣化）であれば、品質推定部１３は、「非劣化」と出力する。この場合、（最少の）Ｔｓｈ＿ｓ未満の劣化となる。また、全部真（劣化）であれば、品質推定部１３は、「（最大の）Ｔｓｈ＿ｓ以上」を出力する。また、全部偽でなく、かつ、全部真でない場合、偽と真とが、或るＴｓｈ＿ｓを境に区分されていない場合でも、品質推定部１３は、Ｔｓｈ＿ｓが大きい側から順に値を確認し、最初に偽となったＴｓｈ＿θを使い「Ｔｓｈ＿θ以上」を出力する。このＴｓｈ＿θは（個別のｗｅｂページのみには依存しない）ｗｅｂページの表示待ち時間の劣化発生の度合の推定結果である。 The quality estimation unit 13 searches for such θ from Est_s, and outputs Tsh_θ corresponding to the searched θ as “the estimated degree of deterioration” (estimated value of the degree of deterioration for the area and time zone to be estimated). (S409). Specifically, for example, if all false (non-deteriorated), the quality estimation unit 13 outputs “non-deteriorated”. In this case, the degradation is less than (minimum) Tsh_s. If all are true (deteriorated), the quality estimation unit 13 outputs “(maximum) Tsh_s or more”. Also, if all is not false and all is not true, even if false and true are not separated by a certain Tsh_s, the quality estimation unit 13 checks the values in order from the side with the largest Tsh_s, Output "Tsh_θ or more" using Tsh_θ that first becomes false. This Tsh_θ is an estimation result of the degree of deterioration of the display waiting time of the web page (which does not depend on only the individual web page).

なお、本実施の形態では、通信トラフィックの状況を示すパラメータを抽出するパラメータの一例として、低スループット代表値、高ＲＴＴ代表値、再送率代表値、ＤＮＳ応答時間代表値、及びＤＮＳ応答率代表値を用いる例を説明したが、必ずしもこれら全部が用いられなくてもよく、これらの一部（例えば、いずれか一つ）が用いられてもよい。また、通信トラフィックの状況を把握可能なパラメータであれば、これら以外のパラメータが用いられてもよい。 In this embodiment, a low throughput representative value, a high RTT representative value, a retransmission rate representative value, a DNS response time representative value, and a DNS response rate representative value as an example of a parameter for extracting a parameter indicating the status of communication traffic. Although the example which uses is demonstrated, these all may not necessarily be used and some (for example, any one) of these may be used. In addition, parameters other than these may be used as long as the parameters can grasp the status of communication traffic.

上述したように、本実施の形態によれば、ｗｅｂページの表示待ち時間の計測値と、当該表示待ち時間の計測のエリア及び時間帯に対応する通信トラフィックに関する情報との関係を推定器に学習させ、推定したい未知データの通信トラフィックに関する情報を当該推定器に入力することで（個別ページのみには依存しない）ｗｅｂページの待ち時間の劣化の程度を推定することができる。したがって、本実施の形態によれば、個別のｗｅｂページの影響が除去された表示待ち時間の品質の推定を可能とすることができる。 As described above, according to the present embodiment, the estimation unit learns the relationship between the measured value of the display waiting time of the web page and the information on the communication traffic corresponding to the area and time zone of the measurement of the display waiting time. The degree of degradation of the latency of the web page can be estimated (independent of only the individual page) by inputting to the estimator the information on the communication traffic of unknown data to be estimated. Therefore, according to the present embodiment, it is possible to estimate the quality of display waiting time in which the influence of the individual web page is removed.

また、本実施の形態では、複数のｗｅｂページ・サーバを学習に用い、また「ｗｅｂページの表示待ち時間の劣化」という一般的特性を（ネットワーク起因でなくｗｅｂページ側の理由等で）一部だけ劣化するものの影響を除外して抽出するために、ｗｅｂページごとの基準で正規化した「劣化度」の考え方、及び複数の計測データの分位点によりデータを抽出する手法を用いることで、個別ｗｅｂページ毎の待ち時間の差を吸収してｗｅｂページの待ち時間の劣化を捉える。したがって、ｗｅｂページ・サーバが千差万別であり、同じ時間・場所で利用していても個々のページにより待ち時間は異なるといった問題を解消することができる。 Moreover, in the present embodiment, a plurality of web page servers are used for learning, and a general characteristic of "deterioration of the display page waiting time" (for reasons such as web page side not network origination) is partially By using the concept of “deterioration degree” normalized on the basis of each web page, and a method of extracting data by quantile points of multiple measurement data, in order to extract excluding the influence of things that only degrade Absorb the difference in waiting time for each web page to catch the deterioration of the waiting time for web pages. Therefore, it is possible to solve the problem that web pages and servers are different from one another, and even if they are used at the same time and place, the waiting time differs depending on each page.

また、本実施の形態では、パッシブ計測データ（ユーザ・通信相手・目的もバラバラの通信データ解析ログ（ＤＰＩのログ））の内容の不安定性に対し、（表示待ち時間の劣化時に対応するログも劣化しているなど）劣化情報と対応性が良く、極端な異常値に左右されにくいトラフィックデータの分位点を抽出することで、平均値等を抽出する場合と比較して、斯かる不安定性に適切に対応することができる。 Further, in the present embodiment, against the instability of the contents of the passive measurement data (the communication data analysis log (user, communication partner, purpose is also disjointed from the communication data analysis log (DPI log)), Such instability compared to the case of extracting an average value or the like by extracting a quantile point of traffic data that has good correspondence with deterioration information, etc., and is unlikely to be influenced by an extreme abnormal value. Can respond appropriately to

また、同じＮＷ劣化影響下でも個別ｗｅｂページごとに表示待ち時間の劣化への影響の出方が異なるが、本実施の形態では、複数の対象ｗｅｂページを一括して扱い、その統計量で評価することで「典型的な劣化影響の出方」として定量化することが出来る。また、一般に劣化判定は二値（論理値）で行う。本実施の形態でも論理値の推定器を用いるが、本実施の形態では「劣化度」として定量的な値を用いるため、学習時の論理値ベクトルに変換する際に（複数の劣化レベルに対応した）複数用意し、それぞれ異なるレベルの推定器を用意することで、劣化度合いの判定も併せて実現することができる
なお、本実施の形態において、ｗｅｂ劣化分析部１１は、算出部及び比較部の一例である。トラフィック情報抽出部１２は、取得部の一例である。品質推定部１３は、学習部及び推定部の一例である。 In addition, although the appearance of the influence on the deterioration of the display waiting time is different for each individual web page even under the same NW deterioration influence, in the present embodiment, a plurality of target web pages are treated collectively and evaluated by the statistics By doing this, it can be quantified as "the typical way of causing deterioration". Also, in general, deterioration determination is performed using two values (logical values). Although the present embodiment also uses a logical value estimator, in the present embodiment a quantitative value is used as the “degree of degradation”, so when converting into a logical value vector at the time of learning (corresponding to a plurality of degradation levels) It is also possible to realize the determination of the degree of deterioration by preparing a plurality of estimators and preparing different levels for each. Note that, in the present embodiment, the web degradation analyzer 11 includes a calculator and a comparator. An example of The traffic information extraction unit 12 is an example of an acquisition unit. The quality estimation unit 13 is an example of a learning unit and an estimation unit.

以上、本発明の実施の形態について詳述したが、本発明は斯かる特定の実施形態に限定されるものではなく、特許請求の範囲に記載された本発明の要旨の範囲内において、種々の変形・変更が可能である。 Although the embodiments of the present invention have been described above in detail, the present invention is not limited to such specific embodiments, and various modifications may be made within the scope of the present invention as set forth in the claims. Modifications and changes are possible.

１０推定装置
１１ｗｅｂ劣化分析部
１２トラフィック情報抽出部
１３品質推定部
１００ドライブ装置
１０１記録媒体
１０２補助記憶装置
１０３メモリ装置
１０４ＣＰＵ
１０５インタ段階装置
Ｂバス DESCRIPTION OF REFERENCE NUMERALS 10 estimation device 11 web degradation analysis unit 12 traffic information extraction unit 13 quality estimation unit 100 drive device 101 recording medium 102 auxiliary storage device 103 memory device 104 CPU
105 Inter-stage device B bus

Claims

A calculation unit that calculates a representative value of an indicator indicating the quality of the display waiting time based on measurement data of the display waiting time of the web page for each of a plurality of trials in which at least one of the area and the time zone is different;
A comparison unit that outputs, for each trial, a logical value indicating a comparison result with a threshold for the representative value related to the trial;
An acquisition unit configured to acquire, for each of the trials, a parameter indicating a state of communication traffic in each of a region and a time zone related to the trial;
A learning unit that uses the logical value of each trial as a target variable and the parameter of each trial as an explanatory variable to learn an estimator;
An estimation unit that inputs the parameters in a certain area and time zone to the estimator to estimate the quality of display latency of web pages in the area and time zone;
An estimation apparatus characterized by having:

The calculation unit calculates a representative value of an indicator indicating the quality of the display waiting time of the web page based on the measurement data of the display waiting time of each of the plurality of web pages for each trial.
The estimation device according to claim 1, characterized in that:

The comparison unit outputs, for each trial, a logical value group indicating comparison results with a plurality of threshold values for the representative value related to the trial;
The learning unit learns a plurality of estimators corresponding to each of the plurality of threshold values, with the logical value group of each trial as a target variable and the parameter of each trial as an explanatory variable.
The estimation unit inputs the parameters in a certain area and a time zone to the estimator corresponding to each of the plurality of threshold values to estimate the quality of display waiting time of the web page in the area and the time zone.
The estimation device according to claim 1 or 2, characterized in that:

The acquisition unit acquires the parameter for each of a plurality of unit times dividing the time zone, and calculates a representative value of the acquired parameter for each unit time.
The learning unit learns the estimator using the logical value of each trial as a target variable and the representative value of the parameter of each trial as an explanatory variable.
The estimation apparatus according to any one of claims 1 to 3, characterized in that:

The calculation unit calculates a representative value of an indicator indicating the quality of the display waiting time of the web page based on a plurality of measurement data of the display waiting time of the web page for each trial.
The estimation apparatus according to any one of claims 1 to 4, characterized in that:

A calculation procedure of calculating a representative value of an indicator indicating the quality of the display waiting time based on measurement data of the display waiting time of the web page for each of a plurality of trials in which at least one of the region and the time zone is different;
A comparison procedure for outputting, for each trial, a logical value indicating a comparison result with a threshold for the representative value related to the trial;
An acquisition procedure for acquiring, for each of the trials, a parameter indicating a state of communication traffic in each of the region and the time zone related to the trial;
A learning procedure in which the logical value of each trial is used as an objective variable, and the parameter of each trial is used as an explanatory variable to learn an estimator;
An estimation procedure for inputting the parameters in a certain area and time zone into the estimator to estimate the quality of display latency of web pages in the area and time zone;
Estimation method characterized in that the computer executes.

A program which causes a computer to function as each part according to any one of claims 1 to 5.