WO2016188498A1 - Wireless network throughput evaluating method and device - Google Patents

Wireless network throughput evaluating method and device Download PDF

Info

Publication number
WO2016188498A1
WO2016188498A1 PCT/CN2016/084549 CN2016084549W WO2016188498A1 WO 2016188498 A1 WO2016188498 A1 WO 2016188498A1 CN 2016084549 W CN2016084549 W CN 2016084549W WO 2016188498 A1 WO2016188498 A1 WO 2016188498A1
Authority
WO
WIPO (PCT)
Prior art keywords
throughput
base stations
base station
sequence
base
Prior art date
Application number
PCT/CN2016/084549
Other languages
French (fr)
Chinese (zh)
Inventor
顾军
张兴
易正磊
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2016188498A1 publication Critical patent/WO2016188498A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W16/00Network planning, e.g. coverage or traffic planning tools; Network deployment, e.g. resource partitioning or cells structures
    • H04W16/22Traffic simulation tools or models
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W24/00Supervisory, monitoring or testing arrangements
    • H04W24/08Testing, supervising or monitoring using real traffic

Definitions

  • This document relates to, but is not limited to, data mining technology, and in particular to a method and device for evaluating wireless network throughput.
  • Embodiments of the present invention provide a method and an apparatus for evaluating wireless network throughput, which can satisfy current network throughput behavior analysis.
  • a method for evaluating a wireless network throughput including:
  • N and M are both positive integers and N is greater than M.
  • the historical data of acquiring the throughput of the N base stations includes:
  • the value of the sequence of the preset percentage thresholds in which the sequence value is prior to each other in the original throughput sequence of each base station is replaced by the calculated average of the throughput sequence values. Value, get the new throughput sequence for each base station;
  • the normalized throughput sequence of each base station is obtained by normalizing the time series of the new throughput of each base station, and the obtained normalized throughput sequence is used as the historical data of the throughput.
  • the constructing the base station relationship network of the N base stations according to the historical data of the acquired throughput of the N base stations includes:
  • the M base stations that use the constructed base station relationship network and the acquired historical data of the throughput to find an important effect on the base station relationship network throughput evaluation performance include:
  • the best throughput evaluation effect is selected, and the m base stations corresponding to the selected best throughput evaluation effects are used as the base stations.
  • M ⁇ m, M ⁇ N, and m ⁇ N.
  • the number of undirected edges of each base station is proportional to the size of the base station.
  • the estimating the remaining N-M base station throughputs by using the determined historical data of the throughput of the M important base stations includes:
  • the estimated throughput of the remaining N-M base stations is obtained by using the constructed throughput relationship model and the throughput history data of the M important base stations.
  • an apparatus for evaluating wireless network throughput including:
  • An acquisition module configured to collect historical data of throughput of N base stations
  • a building module configured to construct a base station relationship network of the N base stations according to the acquired historical data of the throughput of the N base stations;
  • a searching module configured to use the constructed base station relationship network and the acquired historical data of the throughput to find M base stations that play an important role in evaluating the throughput of the base station relationship network, and use the M base stations as important base stations ;
  • An evaluation module configured to use the historical data of the determined throughput of the M important base stations to evaluate the remaining N-M base station throughputs
  • N and M are both positive integers and N is greater than M.
  • the collecting module includes:
  • Calculating a throughput average unit configured to collect a raw throughput sequence of each base station, and calculate an average value of the throughput sequence values in the collected original throughput sequence
  • the obtaining unit is configured to replace the calculated throughput with a sequence of a preset percentage threshold of the sequence value before each sequence of the original throughput sequence of each of the acquired base stations is replaced by the calculated The average of the sequence values, the new throughput sequence for each base station, and the normalization of the time series of the new throughput for each base station, resulting in each The normalized throughput sequence of the base station, the resulting normalized throughput sequence is taken as historical data of the throughput.
  • the building module includes:
  • Calculating the correlation coefficient unit is configured to calculate correlation coefficients between two of the N base stations according to the normalized throughput sequence of each of the obtained base stations;
  • the building unit is configured to construct the base station relationship network of the N base stations by using an undirected edge generated between two base stations of the N base stations.
  • the searching module includes:
  • the obtaining unit is configured to obtain the degree of each base station by counting the number of undirected edges of each base station in the base station relationship network, and sequentially select the m base stations with the medium most middle of the N base stations, according to the support vector machine Assessing the throughput of the Nm base stations, and obtaining the throughput evaluation effect of the m base station relationship networks;
  • the searching unit is configured to select the best throughput evaluation effect in the throughput evaluation effect of the obtained m base station relationship networks, and use the m base stations corresponding to the selected best throughput evaluation effect as the base station M base stations that play an important role in the network throughput evaluation effect;
  • M ⁇ m, M ⁇ N, and m ⁇ N.
  • the technical solution provided by the embodiment of the present invention includes: acquiring historical data of throughput of N base stations; and constructing a base station relationship network of N base stations according to the acquired historical data of throughput of the N base stations According to the constructed base station relationship network and the historical data of the acquired throughput, find M base stations that play an important role in the base station relationship network throughput evaluation effect, and use the M base stations as important base stations; use the determined M important Historical data of the throughput of the base station, and the remaining NM base station throughput is evaluated.
  • the embodiment of the present invention uses the historical data of the base station throughput of a selected number of important base stations to evaluate the characteristics of other base stations, thereby reducing the complexity of data analysis;
  • the throughput of the base part of the base station is known, the throughput of other unknown base stations in the spatial range is evaluated, thereby providing a reference for optimizing the wireless network resources.
  • FIG. 1 is a flowchart of a method for evaluating wireless network throughput according to an embodiment of the present invention
  • FIG. 2 is a schematic diagram of an apparatus for evaluating wireless network throughput according to an embodiment of the present invention
  • FIG. 3 is a flowchart of a method for evaluating a wireless network throughput according to an embodiment of the present invention
  • FIG. 4 is a flowchart of an algorithm of a support vector machine according to an embodiment of the present invention.
  • FIG. 5 is a network diagram for constructing a base station relationship according to a first embodiment of the present invention
  • FIG. 6 is a diagram showing the variation of the average value of SMAPE (Symmetric mean absolute percentage error) with m according to the first embodiment of the present invention
  • FIG. 7 is a diagram showing evaluation results of two base stations according to a first embodiment of the present invention.
  • FIG. 8 is a network diagram for constructing a base station relationship according to a second embodiment of the present invention.
  • Figure 9 is a diagram showing the variation of the average value of SMAPE with m according to the second embodiment of the present invention.
  • FIG. 10 is a diagram showing evaluation results of two base stations according to a second embodiment of the present invention.
  • FIG. 1 is a flowchart of a method for evaluating throughput of a wireless network according to an embodiment of the present invention. As shown in FIG. 1 , the method includes the following steps:
  • Step 101 Acquire historical data of throughput of N base stations
  • N may include the number of base stations in the area where the wireless network throughput is evaluated.
  • Step 102 Construct a base station relationship network of N base stations according to the acquired historical data of the throughput of the N base stations.
  • Step 103 Using the constructed base station relationship network and the historical data of the acquired throughput, find M base stations that play an important role in evaluating the throughput of the base station relationship network, and use the M base stations as important base stations;
  • Step 104 Evaluate the remaining N-M base station throughputs by using historical data of the determined throughput of the M important base stations;
  • N and M are both positive integers and N is greater than M.
  • obtaining historical data of the throughput of the N base stations includes: collecting an original throughput sequence of each base station, and calculating an average value of the throughput sequence values in the collected original throughput sequence; The value of the sequence of the preset percentage thresholds in which the sequence value is prior to each sequence in the original throughput sequence of each base station is replaced by the average of the calculated throughput sequence values, and the new throughput of each base station is obtained.
  • the sequence of normalization throughput is obtained by normalizing the sequence of the new throughput of each base station, and the normalized throughput sequence obtained is used as the historical data of the throughput.
  • the preset percentage threshold may be 3%, and 3% is only an optional value.
  • the preset percentage threshold is a value obtained by a person skilled in the art according to empirical analysis, and may be 2% to 5%.
  • the original throughput data may be collected by using the original throughput sequence of the base station for all the durations, or the original throughput sequence of the preset duration.
  • the preset duration is generally greater than or equal to 14 days when the original throughput sequence of the preset duration is used.
  • the base station relationship network for constructing the N base stations according to the historical data of the acquired throughput of the N base stations includes: calculating, according to the normalized throughput sequence of each base station obtained, two base stations of the N base stations respectively. Correlation coefficient; when the correlation coefficient is greater than the correlation coefficient threshold, an undirected edge is generated between the two base stations; the base station relationship of the N base stations is constructed by the undirected edges generated between the two base stations in the N base stations The internet.
  • the correlation coefficient threshold may be 0.6.
  • the M base stations that play an important role in evaluating the throughput of the base station relationship network include: performing statistics on the undirected side of each base station in the base station relationship network. The number of the nodes is obtained, and the degree of each base station is obtained; the m base stations with the most moderate base stations are sequentially selected, and the throughput of the Nm base stations is evaluated according to the support vector machine, and the throughput evaluation effect of the m base station relationship networks is obtained; Among the throughput evaluation effects of the m base station relationship networks, the best throughput evaluation effect is selected, and the m base stations corresponding to the selected best throughput evaluation effects are regarded as important for the base station relationship network throughput evaluation effect.
  • the number of undirected edges is proportional to the size of the base station, and such a proportional relationship is common knowledge of those skilled in the art.
  • the utilization of the throughput history data of the M important base stations to evaluate the remaining NM base station throughput includes: constructing a throughput relationship model between the remaining NM base stations and the M important base stations by using a support vector machine algorithm; The relationship model and the throughput history data of the M important base stations obtain the estimated throughput of the remaining NM base stations.
  • the construction of the throughput relationship model is a common technical means by those skilled in the art.
  • other algorithms may be used to determine the throughput relationship model. After constructing the throughput relationship model, the throughput history data of the M base stations is taken as an input, and the evaluation throughput of the N-M base stations can be output.
  • the embodiment of the invention further provides a computer storage medium, wherein the computer storage medium stores computer executable instructions, and the computer executable instructions are used to perform the above-mentioned wireless network throughput evaluation method.
  • the method includes: an acquisition module 201, a construction module 202, a lookup module 203, and an evaluation module 204.
  • the collecting module 201 is configured to acquire historical data of the throughput of the N base stations;
  • the constructing module 202 is configured to construct a base station relationship network of the N base stations according to the acquired historical data of the throughput of the N base stations;
  • the searching module 203 is configured In order to utilize the constructed base station relationship network and the historical data of the acquired throughput, find M base stations that play an important role in evaluating the throughput of the base station relationship network, and use the M base stations as important base stations;
  • the evaluation module 204 is configured to utilize The historical data of the throughput of the M important base stations is determined, and the remaining NM base station throughputs are evaluated; wherein N and M are both positive integers, and N is greater than M.
  • the collecting module 201 includes: a calculating throughput average unit, configured to collect an original throughput sequence of each base station, and calculate an average value of the throughput sequence values in the collected original throughput sequence;
  • the obtaining unit is configured to replace the calculated throughput sequence with the value of the sequence of the preset percentage threshold of the sequence value before each sequence of the original throughput sequence of each base station collected
  • the average of the values, the new throughput sequence of each base station is obtained, and the normalized throughput sequence of each base station is obtained by normalizing the time series of the new throughput of each base station, and the obtained normalized throughput sequence is obtained.
  • the throughput sequence is used as historical data for throughput.
  • the constructing module 202 includes: calculating a correlation coefficient unit configured to calculate a correlation coefficient between two base stations of the N base stations according to the normalized throughput sequence of each obtained base station;
  • the generating undirected edge unit is set to generate an undirected edge between the two base stations when the calculated correlation coefficient is greater than the correlation coefficient threshold;
  • the building unit is configured to construct a base station relationship network of N base stations by using undirected edges generated between two base stations of the N base stations.
  • the lookup module 203 includes:
  • the obtaining unit is configured to obtain the degree of each base station by counting the number of undirected edges of each base station in the base station relationship network, and sequentially select m base stations with the most moderate among the N base stations, and evaluate Nm base stations according to the support vector machine. Throughput, the throughput evaluation effect of m base station relationship networks is obtained;
  • the searching unit is configured to select the best throughput evaluation effect in the throughput evaluation effect of the obtained m base station relationship networks, and use the m base stations corresponding to the best throughput evaluation effect as the base station relationship.
  • M ⁇ m, M ⁇ N, and m ⁇ N.
  • the embodiment of the invention mainly comprises the following four modules: a data preprocessing module (equivalent to an acquisition module), a base station relationship network construction module (equivalent to a construction module), an important base station selection module (equivalent to a search module), and a space throughput evaluation module. (Equivalent to the evaluation module).
  • the data pre-processing module is configured to select the N base stations to be studied and eliminate the abnormal data points;
  • the base station relationship network construction module is configured to construct the base station relationship between the base stations according to the historical data of the throughput of the collected N base stations.
  • the network the important base station selection module is configured to select M important base stations from the N base stations according to the constructed historical data of the base station relationship network and the throughput; the space throughput evaluation module is set to be used for the time to be evaluated, according to the known The determined throughput of the M base stations evaluates the throughput of the other NM base stations.
  • the data preprocessing module can be configured to include:
  • Excluding the abnormal point in the historical data of the throughput of each base station includes the point where the sequence value is extremely large, that is, the value of the sequence of the preset percentage threshold after the sequence value is sorted, and the sequence value is before, Excluding the anomaly point includes replacing the value of the sequence of the outliers with the average of the calculated throughput sequence values.
  • the base station relationship network building module can be configured to include:
  • the important base station selection module can be set to include:
  • the space throughput assessment module includes:
  • FIG. 3 is a flowchart of a method for evaluating a wireless network throughput according to an embodiment of the present invention. As shown in FIG. 3, the method includes:
  • Step 1 data preprocessing
  • Data preprocessing mainly consists of the following steps:
  • each base station throughput sequence to obtain a normalized throughput sequence S i of the i-th base station, and use the obtained normalized throughput sequence as historical data of throughput.
  • max(p i ),min(p i ) represent the maximum and minimum values of the original throughput sequence, respectively, and L is the total length of the sequence of the throughput sequence.
  • Step 2 construct a base station relationship network
  • c For a given correlation coefficient threshold c, if ⁇ ij is greater than c, it is considered that there is a significant correlation between base station i and base station j, and an undirected edge is added between them, so that the base station relationship of N base stations can be constructed.
  • the internet if ⁇ ij is greater than c, it is considered that there is a significant correlation between base station i and base station j, and an undirected edge is added between them, so that the base station relationship of N base stations can be constructed.
  • Step 3 Select an important base station
  • a support vector machine (SVM) is used to evaluate base station throughput.
  • SVM support vector machine
  • the algorithm flow of the SVM is shown in Figure 4 and includes the following steps:
  • step 7 When it is judged that it is smaller than the given error e, the process proceeds to step 7, and when it is judged that it is not smaller than the given error e, the parameter is adjusted, and the process returns to step 3.
  • the final throughput evaluation effect is measured by the symmetric mean relative error (SMAPE), and the SMAPE reflects the relative error between the evaluation value and the real value, and solves the problem that the real value is too small.
  • SMAPE symmetric mean relative error
  • the vector machine evaluates the throughput of other Nm base stations. Calculate the SMAPE of each base station that is evaluated, and select the M base stations with the lowest average SMAPE as the important base stations.
  • Step 4 Use other important base stations to evaluate other base station throughput.
  • the SVM algorithm is used, and the throughput relationship model of other N-M base stations and M important base stations is trained using the historical data of the throughput.
  • the throughput of the M base stations in the time period to be evaluated is input into the relational model, and the throughput of the corresponding N-M base stations can be output.
  • the data in this example is derived from the statistics of all base stations in a large city.
  • the time granularity is 60 minutes and the total length of time is 21 consecutive days.
  • the wireless network space throughput evaluation method in the embodiment of the present invention includes the following steps:
  • Step 1 Data preprocessing
  • Step 2 Construct a base station relationship network for 95 base stations to be studied
  • Step 4 Select an important base station
  • the first 15 days of data is selected as the training sample set, and the last 3 days of data is used as the test sample set; the first 15 days of data of all base stations is used as the input of the support vector machine algorithm (SVM), and the output training is obtained.
  • SVM support vector machine algorithm
  • Step 5 Use the support vector machine algorithm to estimate the space throughput.
  • the support vector machine algorithm uses the historical data of the throughput to train the throughput relationship model of the other 87 base stations and the 8 important base stations.
  • the throughput of the eight base stations in the last three days of the original 21-day data can be output.
  • the data in this example is derived from the statistical data of a typical area in a large city with a time granularity of 60 minutes and a total length of time of 18 consecutive days.
  • the wireless network space throughput evaluation method in the embodiment of the present invention includes the following steps:
  • Step 1 Data preprocessing
  • Step 2 117 base stations to be studied, and construct a base station relationship network
  • Step 4 Select an important base station
  • the first 12 days of data are selected as the training set, and the last three days of data are used as the test set; the first 12 days of data from all base stations are used as input to the support vector machine algorithm (SVM), and the output training is obtained.
  • SVM support vector machine algorithm
  • Step 5 Evaluate space throughput using the SVM algorithm.
  • the support vector machine algorithm uses the historical data of the throughput to train the throughput relationship model of the other 106 base stations and 11 important base stations.
  • the throughput of 11 base stations can be output.
  • FIG. 10 it is an example of the evaluation result, where 3 is an evaluation value and 4 is a true value.
  • the embodiment of the invention obtains the relationship between the throughput changes of the base stations according to the historical data of the base station, and constructs The base station relationship network selects a few important base stations from the network to evaluate the throughput of other large base stations. It has high practical value. For example, in the data acquisition of the base station, there are many base stations whose data is missing. With the embodiment of the present invention, the missing data can be evaluated for further network analysis. At the same time, it can be flexibly selected according to the historical data of the throughput of different regions or time periods to evaluate, with universal applicability and better prediction accuracy.
  • each module/unit in the foregoing embodiment may be implemented in the form of hardware, for example, by implementing an integrated circuit to implement its corresponding function, or may be implemented in the form of a software function module, for example, being executed by a processor and stored in a memory. Programs/instructions to implement their respective functions.
  • the invention is not limited to any specific form of combination of hardware and software.

Abstract

A wireless network throughput evaluating method and device, comprising: acquiring historical throughput data of N base stations; according to the obtained historical throughput data of the N base stations, constructing a base station association network of the N base stations; according to the constructed base station association network and the obtained historical throughput data, finding M base stations important for the evaluation of a throughput of the base station association network, and considering the M base stations as important base stations; using the historical throughput data of the determined M base stations, evaluating a throughput of the remaining (N-M) base stations. In embodiments of the present invention, historical data of base station throughputs of selected important base stations is used to evaluate other base stations, thus reducing data complexity.

Description

一种无线网络吞吐量的评估方法及装置Method and device for evaluating wireless network throughput 技术领域Technical field
本文涉及但不限于数据挖掘技术,尤其涉及一种无线网络吞吐量的评估方法及装置。This document relates to, but is not limited to, data mining technology, and in particular to a method and device for evaluating wireless network throughput.
背景技术Background technique
随着无线网络的快速发展,移动互联网数据业务的种类和流量都有了很大的提高,流量爆炸性增长、业务类型极其丰富,对网络流量行为分析也就愈加复杂。With the rapid development of wireless networks, the types and traffic of mobile Internet data services have been greatly improved. The explosive growth of traffic and the extremely rich types of services have made the analysis of network traffic behavior more complicated.
为了有效实现网络规划设计、网络资源分配,精细化运营管理等,必须准确地分析网络吞吐量。由于数据业务的多样性、随机性和突发性等特点,相关技术中的数据分析方法过于复杂,已经不能够满足当前的网络吞吐量行为分析。In order to effectively implement network planning and design, network resource allocation, and refined operation management, it is necessary to accurately analyze network throughput. Due to the diversity, randomness and suddenness of data services, the data analysis methods in related technologies are too complicated to meet the current network throughput behavior analysis.
发明内容Summary of the invention
以下是对本文详细描述的主题的概述。本概述并非是为了限制权利要求的保护范围。The following is an overview of the topics detailed in this document. This Summary is not intended to limit the scope of the claims.
本发明实施例提供一种无线网络吞吐量的评估方法及装置,能够满足当前的网络吞吐量行为分析。Embodiments of the present invention provide a method and an apparatus for evaluating wireless network throughput, which can satisfy current network throughput behavior analysis.
根据本发明实施例的一个方面,提供了一种无线网络吞吐量的评估方法,包括:According to an aspect of an embodiment of the present invention, a method for evaluating a wireless network throughput is provided, including:
获取N个基站的吞吐量的历史数据;Obtaining historical data of throughput of N base stations;
根据所获取的N个基站的吞吐量的历史数据,构建所述N个基站的基站关系网络;Constructing a base station relationship network of the N base stations according to the acquired historical data of the throughput of the N base stations;
根据构建的所述基站关系网络及获取的所述吞吐量的历史数据,找到对基站关系网络吞吐量评估效果起重要作用的M个基站,并将该M个基站作为重要基站;Obtaining M base stations that play an important role in evaluating the throughput of the base station relationship network according to the constructed base station relationship network and the historical data of the obtained throughput, and using the M base stations as important base stations;
利用所述确定出的M个重要基站的吞吐量的历史数据,对剩余的N-M 个基站吞吐量进行评估;Using the historical data of the determined throughput of the M important base stations, the remaining N-M Base station throughput is evaluated;
其中,N和M均为正整数,并且N大于M。Where N and M are both positive integers and N is greater than M.
可选地,所述获取N个基站的吞吐量的历史数据包括:Optionally, the historical data of acquiring the throughput of the N base stations includes:
采集每个基站的原始吞吐量序列,并计算出采集到的所述原始吞吐量序列中吞吐量序列数值的平均值;Collecting an original throughput sequence of each base station, and calculating an average value of the throughput sequence values in the collected original throughput sequence;
通过将所采集到的每个基站的每一个原始吞吐量序列中按照序列数值大小排序后、序列数值在前的预设百分比阈值的序列的数值替换为计算出的所述吞吐量序列数值的平均值,得到每个基站的新吞吐量序列;The value of the sequence of the preset percentage thresholds in which the sequence value is prior to each other in the original throughput sequence of each base station is replaced by the calculated average of the throughput sequence values. Value, get the new throughput sequence for each base station;
通过对每个基站的新吞吐量的时间序列进行归一化处理,得到每个基站的归一化吞吐量序列,将得到的归一化吞吐量序列作为所述吞吐量的历史数据。The normalized throughput sequence of each base station is obtained by normalizing the time series of the new throughput of each base station, and the obtained normalized throughput sequence is used as the historical data of the throughput.
可选地,所述根据所获取的N个基站的吞吐量的历史数据,构建所述N个基站的基站关系网络包括:Optionally, the constructing the base station relationship network of the N base stations according to the historical data of the acquired throughput of the N base stations includes:
根据所得到每个基站的所述归一化吞吐量序列,分别计算所述N个基站中两两基站之间的相关系数;Calculating correlation coefficients between two of the N base stations according to the normalized throughput sequence of each of the obtained base stations;
当计算得到的所述相关系数大于相关系数阈值时,则在所述两两基站之间生成一条无向边;When the calculated correlation coefficient is greater than a correlation coefficient threshold, an undirected edge is generated between the two base stations;
通过所述N个基站中两两基站之间生成的无向边,构建所述N个基站的所述基站关系网络。Constructing the base station relationship network of the N base stations by using an undirected edge generated between two base stations of the N base stations.
可选地,所述利用构建的所述基站关系网络以及获取的所述吞吐量的历史数据,找到对基站关系网络吞吐量评估效果起重要作用的M个基站包括:Optionally, the M base stations that use the constructed base station relationship network and the acquired historical data of the throughput to find an important effect on the base station relationship network throughput evaluation performance include:
通过统计所述基站关系网络中每个基站的无向边条数,得到每个基站的度;Obtaining the degree of each base station by counting the number of undirected edges of each base station in the base station relationship network;
依次选取所述N个基站中度最大的m个的基站,根据支持向量机评估N-m个基站的吞吐量,得到m种基站关系网络的吞吐量评估效果;Selecting the base stations of the N most basic base stations in sequence, and evaluating the throughput of the N-m base stations according to the support vector machine, and obtaining the throughput evaluation effect of the m base station relationship networks;
在所得到的m种基站关系网络的吞吐量评估效果中,选取最好的吞吐量评估效果,并将所选取的最好吞吐量评估效果相对应的m个基站作为对基站 关系网络吞吐量评估效果起重要作用的M个基站;In the obtained throughput evaluation effect of the m base station relationship networks, the best throughput evaluation effect is selected, and the m base stations corresponding to the selected best throughput evaluation effects are used as the base stations. M base stations that play an important role in the network throughput evaluation effect;
其中,m、M、N为正整数,M<=m,M<N,m<N。Where m, M, and N are positive integers, M<=m, M<N, and m<N.
可选地,所述每个基站的无向边条数与基站度的大小成正比。Optionally, the number of undirected edges of each base station is proportional to the size of the base station.
可选地,所述利用确定出的所述M个重要基站的吞吐量的历史数据,对剩余的N-M个基站吞吐量进行评估包括:Optionally, the estimating the remaining N-M base station throughputs by using the determined historical data of the throughput of the M important base stations includes:
通过支持向量机算法构造剩余的N-M个基站与M个重要基站的吞吐量关系模型;Constructing a throughput relationship model between the remaining N-M base stations and M important base stations by using a support vector machine algorithm;
利用构造的所述吞吐量关系模型和所述M个重要基站的吞吐量历史数据,得到剩余的N-M个基站的评估吞吐量。The estimated throughput of the remaining N-M base stations is obtained by using the constructed throughput relationship model and the throughput history data of the M important base stations.
根据本发明实施例的另一方面,提供了一种无线网络吞吐量的评估装置,包括:According to another aspect of the embodiments of the present invention, an apparatus for evaluating wireless network throughput is provided, including:
采集模块,设置为采集N个基站的吞吐量的历史数据;An acquisition module, configured to collect historical data of throughput of N base stations;
构建模块,设置为根据所获取的N个基站的吞吐量的历史数据,构建所述N个基站的基站关系网络;a building module, configured to construct a base station relationship network of the N base stations according to the acquired historical data of the throughput of the N base stations;
查找模块,设置为利用构建的所述基站关系网络以及获取的所述吞吐量的历史数据,找到对基站关系网络吞吐量评估效果起重要作用的M个基站,并将该M个基站作为重要基站;a searching module, configured to use the constructed base station relationship network and the acquired historical data of the throughput to find M base stations that play an important role in evaluating the throughput of the base station relationship network, and use the M base stations as important base stations ;
评估模块,设置为利用所述确定出的M个重要基站的吞吐量的历史数据,对剩余的N-M个基站吞吐量进行评估;An evaluation module, configured to use the historical data of the determined throughput of the M important base stations to evaluate the remaining N-M base station throughputs;
其中,N和M均为正整数,并且N大于M。Where N and M are both positive integers and N is greater than M.
可选地,所述采集模块包括:Optionally, the collecting module includes:
计算吞吐量平均值单元,设置为采集每个基站的原始吞吐量序列,并计算出采集到的所述原始吞吐量序列中吞吐量序列数值的平均值;Calculating a throughput average unit, configured to collect a raw throughput sequence of each base station, and calculate an average value of the throughput sequence values in the collected original throughput sequence;
获取单元设置为,通过将所采集到的每个基站的每一个原始吞吐量序列中按照序列数值大小排序后、序列数值在前的预设百分比阈值的序列的数值替换为计算出的所述吞吐量序列数值的平均值,得到每个基站的新吞吐量序列,以及通过对每个基站的新吞吐量的时间序列进行归一化处理,得到每个 基站的归一化吞吐量序列,将得到的归一化吞吐量序列作为所述吞吐量的历史数据。The obtaining unit is configured to replace the calculated throughput with a sequence of a preset percentage threshold of the sequence value before each sequence of the original throughput sequence of each of the acquired base stations is replaced by the calculated The average of the sequence values, the new throughput sequence for each base station, and the normalization of the time series of the new throughput for each base station, resulting in each The normalized throughput sequence of the base station, the resulting normalized throughput sequence is taken as historical data of the throughput.
可选地,所述构建模块包括:Optionally, the building module includes:
计算相关系数单元设置为,根据所得到每个基站的归一化吞吐量序列,分别计算所述N个基站中两两基站之间的相关系数;Calculating the correlation coefficient unit is configured to calculate correlation coefficients between two of the N base stations according to the normalized throughput sequence of each of the obtained base stations;
生成无向边单元设置为,当计算得到的所述相关系数大于相关系数阈值时,则在所述两两基站之间生成一条无向边;Generating the undirected edge unit to set an undirected edge between the two base stations when the calculated correlation coefficient is greater than the correlation coefficient threshold;
构建单元设置为,通过所述N个基站中两两基站之间生成的无向边,构建所述N个基站的所述基站关系网络。The building unit is configured to construct the base station relationship network of the N base stations by using an undirected edge generated between two base stations of the N base stations.
可选地,所述查找模块包括:Optionally, the searching module includes:
获取单元设置为,通过统计所述基站关系网络中每个基站的无向边条数,得到每个基站的度,以及依次选取所述N个基站中度最大的m个基站,根据支持向量机评估N-m个基站的吞吐量,得到m种基站关系网络的吞吐量评估效果;The obtaining unit is configured to obtain the degree of each base station by counting the number of undirected edges of each base station in the base station relationship network, and sequentially select the m base stations with the medium most middle of the N base stations, according to the support vector machine Assessing the throughput of the Nm base stations, and obtaining the throughput evaluation effect of the m base station relationship networks;
查找单元设置为,在所得到的m种基站关系网络的吞吐量评估效果中,选取最好的吞吐量评估效果,并将所选取的最好吞吐量评估效果相对应的m个基站作为对基站关系网络吞吐量评估效果起重要作用的M个基站;The searching unit is configured to select the best throughput evaluation effect in the throughput evaluation effect of the obtained m base station relationship networks, and use the m base stations corresponding to the selected best throughput evaluation effect as the base station M base stations that play an important role in the network throughput evaluation effect;
其中,m、M、N为正整数,M<=m,M<N,m<N。Where m, M, and N are positive integers, M<=m, M<N, and m<N.
与相关技术相比,本发明实施例提供的技术方案,包括:获取N个基站的吞吐量的历史数据;根据所获取的N个基站的吞吐量的历史数据,构建N个基站的基站关系网络;根据构建的基站关系网络以及获取的吞吐量的历史数据,找到对基站关系网络吞吐量评估效果起重要作用的M个基站,并将该M个基站作为重要基站;利用确定出的M个重要基站的吞吐量的历史数据,对剩余的N-M个基站吞吐量进行评估。本发明实施例的有益效果在于:本发明实施例采用选取出的少量重要基站的基站吞吐量的历史数据,对其他基站的特性进行评估,减小了数据分析的复杂度;同时,使得能够在已知空间部分基站吞吐量的情况下,评估出空间范围内其他未知基站的吞吐量,从而对无线网络资源的优化提供参考。 Compared with the related art, the technical solution provided by the embodiment of the present invention includes: acquiring historical data of throughput of N base stations; and constructing a base station relationship network of N base stations according to the acquired historical data of throughput of the N base stations According to the constructed base station relationship network and the historical data of the acquired throughput, find M base stations that play an important role in the base station relationship network throughput evaluation effect, and use the M base stations as important base stations; use the determined M important Historical data of the throughput of the base station, and the remaining NM base station throughput is evaluated. The beneficial effects of the embodiments of the present invention are as follows: the embodiment of the present invention uses the historical data of the base station throughput of a selected number of important base stations to evaluate the characteristics of other base stations, thereby reducing the complexity of data analysis; When the throughput of the base part of the base station is known, the throughput of other unknown base stations in the spatial range is evaluated, thereby providing a reference for optimizing the wireless network resources.
在阅读并理解了附图和详细描述后,可以明白其他方面。Other aspects will be apparent upon reading and understanding the drawings and detailed description.
附图概述BRIEF abstract
图1是本发明实施例提供的一种无线网络吞吐量的评估方法流程图;FIG. 1 is a flowchart of a method for evaluating wireless network throughput according to an embodiment of the present invention;
图2是本发明实施例提供的一种无线网络吞吐量的评估装置示意图;2 is a schematic diagram of an apparatus for evaluating wireless network throughput according to an embodiment of the present invention;
图3是本发明实施例提供的无线网络吞吐量评估方法的流程图;3 is a flowchart of a method for evaluating a wireless network throughput according to an embodiment of the present invention;
图4是本发明实施例提供的支持向量机的算法流程图;4 is a flowchart of an algorithm of a support vector machine according to an embodiment of the present invention;
图5是本发明第一实施例提供的构建基站关系网络图;FIG. 5 is a network diagram for constructing a base station relationship according to a first embodiment of the present invention; FIG.
图6是本发明第一实施例提供的SMAPE(Symmetric mean absolute percentage error,对称平均相对误差)平均值随m的变化情况图;6 is a diagram showing the variation of the average value of SMAPE (Symmetric mean absolute percentage error) with m according to the first embodiment of the present invention;
图7是本发明第一实施例提供的两个基站的评估结果图;7 is a diagram showing evaluation results of two base stations according to a first embodiment of the present invention;
图8是本发明第二实施例提供的构建基站关系网络图;8 is a network diagram for constructing a base station relationship according to a second embodiment of the present invention;
图9是本发明第二实施例提供的SMAPE平均值随m的变化情况图;Figure 9 is a diagram showing the variation of the average value of SMAPE with m according to the second embodiment of the present invention;
图10是本发明第二实施例提供的两个基站的评估结果图。FIG. 10 is a diagram showing evaluation results of two base stations according to a second embodiment of the present invention.
本发明的实施方式Embodiments of the invention
下文中将结合附图对本申请的实施例进行详细说明。需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互任意组合。Embodiments of the present application will be described in detail below with reference to the accompanying drawings. It should be noted that, in the case of no conflict, the features in the embodiments and the embodiments in the present application may be arbitrarily combined with each other.
图1是本发明实施例提供的一种无线网络吞吐量的评估方法流程图,如图1所示,包括以下步骤:FIG. 1 is a flowchart of a method for evaluating throughput of a wireless network according to an embodiment of the present invention. As shown in FIG. 1 , the method includes the following steps:
步骤101:获取N个基站的吞吐量的历史数据;Step 101: Acquire historical data of throughput of N base stations;
需要说明的是,本发明实施例,N可以包括无线网络吞吐量的评估所在区域范围内的基站的数目。It should be noted that, in the embodiment of the present invention, N may include the number of base stations in the area where the wireless network throughput is evaluated.
步骤102:根据所获取的N个基站的吞吐量的历史数据,构建N个基站的基站关系网络;Step 102: Construct a base station relationship network of N base stations according to the acquired historical data of the throughput of the N base stations.
步骤103:利用构建的基站关系网络以及获取的吞吐量的历史数据,找到对基站关系网络吞吐量评估效果起重要作用的M个基站,并将该M个基站作为重要基站; Step 103: Using the constructed base station relationship network and the historical data of the acquired throughput, find M base stations that play an important role in evaluating the throughput of the base station relationship network, and use the M base stations as important base stations;
步骤104:利用确定出的M个重要基站的吞吐量的历史数据,对剩余的N-M个基站吞吐量进行评估;Step 104: Evaluate the remaining N-M base station throughputs by using historical data of the determined throughput of the M important base stations;
其中,N和M均为正整数,并且N大于M。Where N and M are both positive integers and N is greater than M.
可选的,获取N个基站的吞吐量的历史数据包括:采集每个基站的原始吞吐量序列,并计算采集到的原始吞吐量序列中吞吐量序列数值的平均值;通过将所采集的每个基站的每一个原始吞吐量序列中按照序列数值大小排序后、序列数值在前的预设百分比阈值的序列的数值替换为计算出的吞吐量序列数值的平均值,得到每个基站的新吞吐量序列;通过对每个基站的新吞吐量的序列进行归一化处理,得到每个基站的归一化吞吐量序列,将得到的归一化吞吐量序列作为吞吐量的历史数据。Optionally, obtaining historical data of the throughput of the N base stations includes: collecting an original throughput sequence of each base station, and calculating an average value of the throughput sequence values in the collected original throughput sequence; The value of the sequence of the preset percentage thresholds in which the sequence value is prior to each sequence in the original throughput sequence of each base station is replaced by the average of the calculated throughput sequence values, and the new throughput of each base station is obtained. The sequence of normalization throughput is obtained by normalizing the sequence of the new throughput of each base station, and the normalized throughput sequence obtained is used as the historical data of the throughput.
需要说明的是,本发明实施例,预设百分比阈值可以为3%,3%只是一个可选数值,预设百分比阈值为本领域技术人员根据经验分析获得的数值,可以是2%~5%中的一个值。采集原始吞吐量数据可以采用基站的所有时长的原始吞吐量序列,也可以是预设时长的原始吞吐量序列,预设时长的原始吞吐量序列时,预设时长一般大于或等于14天。It should be noted that, in the embodiment of the present invention, the preset percentage threshold may be 3%, and 3% is only an optional value. The preset percentage threshold is a value obtained by a person skilled in the art according to empirical analysis, and may be 2% to 5%. A value in . The original throughput data may be collected by using the original throughput sequence of the base station for all the durations, or the original throughput sequence of the preset duration. The preset duration is generally greater than or equal to 14 days when the original throughput sequence of the preset duration is used.
其中,根据所获取的N个基站的吞吐量的历史数据,构建N个基站的基站关系网络包括:根据所得到每个基站的归一化吞吐量序列,分别计算N个基站中两两基站之间的相关系数;当相关系数大于相关系数阈值时,则在两两基站之间生成一条无向边;通过N个基站中两两基站之间生成的无向边,构建N个基站的基站关系网络。The base station relationship network for constructing the N base stations according to the historical data of the acquired throughput of the N base stations includes: calculating, according to the normalized throughput sequence of each base station obtained, two base stations of the N base stations respectively. Correlation coefficient; when the correlation coefficient is greater than the correlation coefficient threshold, an undirected edge is generated between the two base stations; the base station relationship of the N base stations is constructed by the undirected edges generated between the two base stations in the N base stations The internet.
需要说明的是,计算两两基站之间的相关系数为本领域技术人员的惯用技术手段,相关系数阈值可以取0.6。It should be noted that calculating the correlation coefficient between the two base stations is a common technical means by those skilled in the art, and the correlation coefficient threshold may be 0.6.
可选的,利用构建的基站关系网络以及获取的吞吐量的历史数据,找到对基站关系网络吞吐量评估效果起重要作用的M个基站包括:通过统计基站关系网络中每个基站的无向边条数,得到每个基站的度;依次选取N个基站中度最大的m个基站,根据支持向量机评估N-m个基站的吞吐量,得到m种基站关系网络的吞吐量评估效果;在所得到的m种基站关系网络的吞吐量评估效果中,选取最好的吞吐量评估效果,并将所选取的最好吞吐量评估效果相对应的m个基站作为对基站关系网络吞吐量评估效果起重要作用的M个 基站;其中,m、M、N为正整数,M<=m,M<N,m<N。本发明实施例,无向边条数与基站度的大小成正比,这种正比关系为本领域技术人员的公知常识。Optionally, using the constructed base station relationship network and the historical data of the acquired throughput, the M base stations that play an important role in evaluating the throughput of the base station relationship network include: performing statistics on the undirected side of each base station in the base station relationship network. The number of the nodes is obtained, and the degree of each base station is obtained; the m base stations with the most moderate base stations are sequentially selected, and the throughput of the Nm base stations is evaluated according to the support vector machine, and the throughput evaluation effect of the m base station relationship networks is obtained; Among the throughput evaluation effects of the m base station relationship networks, the best throughput evaluation effect is selected, and the m base stations corresponding to the selected best throughput evaluation effects are regarded as important for the base station relationship network throughput evaluation effect. M a base station; wherein m, M, and N are positive integers, M<=m, M<N, and m<N. In the embodiment of the present invention, the number of undirected edges is proportional to the size of the base station, and such a proportional relationship is common knowledge of those skilled in the art.
其中,利用M个重要基站的吞吐量历史数据,对剩余的N-M个基站吞吐量进行评估包括:通过支持向量机算法构造剩余的N-M个基站与M个重要基站的吞吐量关系模型;利用吞吐量关系模型和M个重要基站的吞吐量历史数据,得到剩余的N-M个基站的评估吞吐量。The utilization of the throughput history data of the M important base stations to evaluate the remaining NM base station throughput includes: constructing a throughput relationship model between the remaining NM base stations and the M important base stations by using a support vector machine algorithm; The relationship model and the throughput history data of the M important base stations obtain the estimated throughput of the remaining NM base stations.
需要说明的是,构建吞吐量关系模型为本领域技术人员的惯用技术手段,除了本发明实施例的通过支持向量机算法进行构造外,还可以采用其他算法进行吞吐量关系模型的判断。在构建吞吐量关系模型后,将M个基站的吞吐量历史数据作为输入,可以输出N-M个基站的评估吞吐量。It should be noted that the construction of the throughput relationship model is a common technical means by those skilled in the art. In addition to the construction of the support vector machine algorithm in the embodiment of the present invention, other algorithms may be used to determine the throughput relationship model. After constructing the throughput relationship model, the throughput history data of the M base stations is taken as an input, and the evaluation throughput of the N-M base stations can be output.
本发明实施例还提供一种计算机存储介质,计算机存储介质中存储有计算机可执行指令,计算机可执行指令用于执行上述的无线网络吞吐量的评估方法。The embodiment of the invention further provides a computer storage medium, wherein the computer storage medium stores computer executable instructions, and the computer executable instructions are used to perform the above-mentioned wireless network throughput evaluation method.
图2是本发明实施例提供的一种无线网络吞吐量的评估装置示意图,如图2所示,包括:采集模块201、构建模块202、查找模块203以及评估模块204。采集模块201,设置为获取N个基站的吞吐量的历史数据;构建模块202,设置为根据所获取的N个基站的吞吐量的历史数据,构建N个基站的基站关系网络;查找模块203设置为利用构建的基站关系网络以及获取的吞吐量的历史数据,找到对基站关系网络吞吐量评估效果起重要作用的M个基站,并将该M个基站作为重要基站;评估模块204,设置为利用确定出的M个重要基站的吞吐量的历史数据,对剩余的N-M个基站吞吐量进行评估;其中,N和M均为正整数,并且N大于M。2 is a schematic diagram of an apparatus for evaluating wireless network throughput according to an embodiment of the present invention. As shown in FIG. 2, the method includes: an acquisition module 201, a construction module 202, a lookup module 203, and an evaluation module 204. The collecting module 201 is configured to acquire historical data of the throughput of the N base stations; the constructing module 202 is configured to construct a base station relationship network of the N base stations according to the acquired historical data of the throughput of the N base stations; the searching module 203 is configured In order to utilize the constructed base station relationship network and the historical data of the acquired throughput, find M base stations that play an important role in evaluating the throughput of the base station relationship network, and use the M base stations as important base stations; the evaluation module 204 is configured to utilize The historical data of the throughput of the M important base stations is determined, and the remaining NM base station throughputs are evaluated; wherein N and M are both positive integers, and N is greater than M.
可选的,采集模块201包括:计算吞吐量平均值单元,设置为采集每个基站的原始吞吐量序列,并计算出采集到的原始吞吐量序列中吞吐量序列数值的平均值; Optionally, the collecting module 201 includes: a calculating throughput average unit, configured to collect an original throughput sequence of each base station, and calculate an average value of the throughput sequence values in the collected original throughput sequence;
获取单元设置为,通过将所采集到的每个基站的每一个原始吞吐量序列中按照序列数值大小排序后、序列数值在前的预设百分比阈值的序列的数值替换为计算出的吞吐量序列数值的平均值,得到每个基站的新吞吐量序列,以及通过对每个基站的新吞吐量的时间序列进行归一化处理,得到每个基站的归一化吞吐量序列,将得到的归一化吞吐量序列作为吞吐量的历史数据。The obtaining unit is configured to replace the calculated throughput sequence with the value of the sequence of the preset percentage threshold of the sequence value before each sequence of the original throughput sequence of each base station collected The average of the values, the new throughput sequence of each base station is obtained, and the normalized throughput sequence of each base station is obtained by normalizing the time series of the new throughput of each base station, and the obtained normalized throughput sequence is obtained. The throughput sequence is used as historical data for throughput.
构建模块202包括:计算相关系数单元设置为,根据所得到每个基站的归一化吞吐量序列,分别计算N个基站中两两基站之间的相关系数;The constructing module 202 includes: calculating a correlation coefficient unit configured to calculate a correlation coefficient between two base stations of the N base stations according to the normalized throughput sequence of each obtained base station;
生成无向边单元设置为,当计算得到的相关系数大于相关系数阈值时,则在两两基站之间生成一条无向边;The generating undirected edge unit is set to generate an undirected edge between the two base stations when the calculated correlation coefficient is greater than the correlation coefficient threshold;
构建单元设置为,通过N个基站中两两基站之间生成的无向边,构建N个基站的基站关系网络。The building unit is configured to construct a base station relationship network of N base stations by using undirected edges generated between two base stations of the N base stations.
查找模块203包括:The lookup module 203 includes:
获取单元设置为,通过统计基站关系网络中每个基站的无向边条数,得到每个基站的度,以及依次选取N个基站中度最大的m个基站,根据支持向量机评估N-m个基站的吞吐量,得到m种基站关系网络的吞吐量评估效果;The obtaining unit is configured to obtain the degree of each base station by counting the number of undirected edges of each base station in the base station relationship network, and sequentially select m base stations with the most moderate among the N base stations, and evaluate Nm base stations according to the support vector machine. Throughput, the throughput evaluation effect of m base station relationship networks is obtained;
查找单元设置为,在所得到的m种基站关系网络的吞吐量评估效果中,选取最好的吞吐量评估效果,并将取的最好吞吐量评估效果相对应的m个基站作为对基站关系网络吞吐量评估效果起重要作用的M个基站;The searching unit is configured to select the best throughput evaluation effect in the throughput evaluation effect of the obtained m base station relationship networks, and use the m base stations corresponding to the best throughput evaluation effect as the base station relationship. M base stations that play an important role in network throughput evaluation;
其中,m、M、N为正整数,M<=m,M<N,m<N。Where m, M, and N are positive integers, M<=m, M<N, and m<N.
本发明实施例主要包含以下四个模块:数据预处理模块(相当于采集模块),基站关系网络构建模块(相当于构建模块),重要基站选取模块(相当于查找模块),空间吞吐量评估模块(相当于评估模块)。数据预处理模块,设置为选取待研究的N个基站,剔除其中的异常数据点;基站关系网络构建模块,设置为根据已采集的N个基站的吞吐量的历史数据构建基站之间的基站关系网络;重要基站选取模块,设置为根据构建的基站关系网络和吞吐量的历史数据,从N个基站中选取出M个重要基站;空间吞吐量评估模块,设置为对于待评估时间,根据已知的确定出的M个基站的吞吐量评估出其他N-M个基站的吞吐量。 The embodiment of the invention mainly comprises the following four modules: a data preprocessing module (equivalent to an acquisition module), a base station relationship network construction module (equivalent to a construction module), an important base station selection module (equivalent to a search module), and a space throughput evaluation module. (Equivalent to the evaluation module). The data pre-processing module is configured to select the N base stations to be studied and eliminate the abnormal data points; the base station relationship network construction module is configured to construct the base station relationship between the base stations according to the historical data of the throughput of the collected N base stations. The network; the important base station selection module is configured to select M important base stations from the N base stations according to the constructed historical data of the base station relationship network and the throughput; the space throughput evaluation module is set to be used for the time to be evaluated, according to the known The determined throughput of the M base stations evaluates the throughput of the other NM base stations.
数据预处理模块可以设置为包括:The data preprocessing module can be configured to include:
A1.选取空间位置上处于同一区域的N个基站;A1. Select N base stations in the same area in the spatial location;
A2.剔除每个基站吞吐量的历史数据中的异常点;这里,异常点包括序列数值极大的点,即按照序列数值大小排序后、序列数值在前的预设百分比阈值的序列的数值,剔除异常点包括将异常点的序列的数值替换为计算出的吞吐量序列数值的平均值。A2. Excluding the abnormal point in the historical data of the throughput of each base station; here, the abnormal point includes the point where the sequence value is extremely large, that is, the value of the sequence of the preset percentage threshold after the sequence value is sorted, and the sequence value is before, Excluding the anomaly point includes replacing the value of the sequence of the outliers with the average of the calculated throughput sequence values.
A3.对数据进行一次归一化。A3. Normalize the data once.
基站关系网络构建模块可以设置为包括:The base station relationship network building module can be configured to include:
B1.计算N个基站两两之间的相关系数;B1. Calculating a correlation coefficient between two base stations of the N base stations;
B2.根据相关系数,构建一个给定的相关系数阈值的基站关系网络。B2. Based on the correlation coefficient, construct a base station relationship network with a given correlation coefficient threshold.
重要基站选取模块可以设置为包括:The important base station selection module can be set to include:
C1.统计基站关系网络中每一个基站度的大小;C1. Counting the size of each base station in the base station relationship network;
C2.依次选取度前M(M=1,2……N)大的基站作为重要基站,根据支持向量机评估其他N-M个基站的吞吐量;C2. sequentially select the base station with M (M=1, 2...N) before the degree as the important base station, and evaluate the throughput of the other N-M base stations according to the support vector machine;
C3.选取在吞吐量的历史数据上评估效果最好时(起重要作用)的M个基站作为重要基站。C3. Select the M base stations that evaluate the best effect (acting role) on the historical data of the throughput as important base stations.
空间吞吐量评估模块包括:The space throughput assessment module includes:
D1.根据选出的M个重要基站,评估其他N-M个基站的吞吐量。D1. Evaluate the throughput of other N-M base stations according to the selected M important base stations.
图3是本发明实施例提供的无线网络吞吐量评估方法的流程图,如图3所示,包括:3 is a flowchart of a method for evaluating a wireless network throughput according to an embodiment of the present invention. As shown in FIG. 3, the method includes:
步骤1、数据预处理; Step 1, data preprocessing;
为了未来根据部分基站的吞吐量评估其他大量基站的吞吐量,需要先获取到所有基站的吞吐量的历史数据,然后对获取到的历史数据进行预处理。数据预处理主要包含以下几个步骤:In order to evaluate the throughput of other large base stations according to the throughput of some base stations in the future, historical data of the throughput of all base stations needs to be acquired first, and then the acquired historical data is preprocessed. Data preprocessing mainly consists of the following steps:
a、根据需求选取空间位置上处于同一区域的N个基站;a, selecting N base stations in the same area at the spatial location according to requirements;
b、整理N个基站原始吞吐量序列,将每一个吞吐量序列中按照序列数值大小排序后、序列数值在前的预设百分比阈值的序列的数值替换为计算出 的吞吐量序列数值的平均值;例如、将吞吐量序列中前3%大的吞吐量替换为该吞吐量序列的序列数值的平均值,得到第i个基站的吞吐量序列pi(i=1,2……N),作为新吞吐量序列;b. arranging the original throughput sequences of the N base stations, and replacing the values of the sequence of the preset percentage thresholds in the sequence of the sequence values before each sequence of the throughput sequence with the average value of the calculated throughput sequence values. For example, replacing the first 3% of the throughput in the throughput sequence with the average of the sequence values of the throughput sequence to obtain the throughput sequence p i (i=1, 2...N) of the i-th base station, As a new throughput sequence;
c、对每一个基站吞吐量序列进行归一化处理,得到第i个基站的归一化吞吐量序列Si,将得到的归一化吞吐量序列作为吞吐量的历史数据。c. normalize each base station throughput sequence to obtain a normalized throughput sequence S i of the i-th base station, and use the obtained normalized throughput sequence as historical data of throughput.
Figure PCTCN2016084549-appb-000001
Figure PCTCN2016084549-appb-000001
其中,
Figure PCTCN2016084549-appb-000002
为第i个基站t时刻的归一化吞吐量序列,max(pi),min(pi)分别表示原始吞吐量序列的最大值与最小值,L为吞吐量序列的序列总长度。
among them,
Figure PCTCN2016084549-appb-000002
For the normalized throughput sequence at time t of the i-th base station, max(p i ),min(p i ) represent the maximum and minimum values of the original throughput sequence, respectively, and L is the total length of the sequence of the throughput sequence.
步骤2、构建基站关系网络;Step 2: construct a base station relationship network;
对待研究的N个基站,L为所采集数据(这里的数据为吞吐量序列)的总时长,取L中前T(一般
Figure PCTCN2016084549-appb-000003
左右)个时间数据计算第i(i=1,2,3……N)个基站与第j(j=1,2,3……N)个基站之间的相关系数ρij,计算公式为
For the N base stations to be studied, L is the total duration of the collected data (where the data is the throughput sequence), taking L before the T (general
Figure PCTCN2016084549-appb-000003
Calculating the correlation coefficient ρ ij between the i-th (i=1, 2, 3...N) base stations and the jth (j=1, 2, 3...N) base stations by using the time data of the left and right)
Figure PCTCN2016084549-appb-000004
Figure PCTCN2016084549-appb-000004
Si为第i个基站的吞吐量序列,
Figure PCTCN2016084549-appb-000005
为第i个基站在总时长内的平均吞吐量,
Figure PCTCN2016084549-appb-000006
为第i个基站在时刻t时的吞吐量大小(t=1,2,3……T);Sj为第j个基站的吞吐量序列,
Figure PCTCN2016084549-appb-000007
为第j个基站在总时长内的平均吞吐量,
Figure PCTCN2016084549-appb-000008
为第j个基站在时刻t时的吞吐量大小(t=1,2,3……T)。对于一个给定的相关系数阈值c,若ρij大于c,则认为基站i与基站j存在明显的相关关系,在他们之间添加一条无向边,这样就可以构建出N个基站的基站关系网络。
S i is the throughput sequence of the i-th base station,
Figure PCTCN2016084549-appb-000005
The average throughput of the i-th base station over the total duration,
Figure PCTCN2016084549-appb-000006
The throughput of the i-th base station at time t (t=1, 2, 3...T); S j is the throughput sequence of the j-th base station,
Figure PCTCN2016084549-appb-000007
The average throughput of the jth base station over the total duration,
Figure PCTCN2016084549-appb-000008
The throughput of the jth base station at time t (t = 1, 2, 3 ... T). For a given correlation coefficient threshold c, if ρ ij is greater than c, it is considered that there is a significant correlation between base station i and base station j, and an undirected edge is added between them, so that the base station relationship of N base stations can be constructed. The internet.
步骤3、选取重要基站; Step 3. Select an important base station;
在本发明中,采用支持向量机(SVM,Support Vector Machine)来评估基站吞吐量。SVM的算法流程如图4所示,包括以下步骤:In the present invention, a support vector machine (SVM) is used to evaluate base station throughput. The algorithm flow of the SVM is shown in Figure 4 and includes the following steps:
1、根据评估样本建立训练样本集和测试样本集;1. Establish a training sample set and a test sample set according to the evaluation sample;
2、根据训练样本集建立目标函数; 2. Establish an objective function according to the training sample set;
3、求解目标函数,得到最优参数;3. Solve the objective function and get the optimal parameters;
4、将最优参数代入目标函数,得到决策回归方程;4. Substituting the optimal parameters into the objective function to obtain a decision regression equation;
5、使用测试数据验证决策回归方程;5. Verify the decision regression equation using test data;
6、是否小于给定误差e;6, whether it is less than the given error e;
当判断小于给定误差e时,进入步骤7,当判断不小于给定误差e时,调整参数,并返回到步骤3。When it is judged that it is smaller than the given error e, the process proceeds to step 7, and when it is judged that it is not smaller than the given error e, the parameter is adjusted, and the process returns to step 3.
7、将评估样本输入决策回归方程计算其他基站吞吐量。7. Calculate the sample input decision regression equation to calculate the throughput of other base stations.
在本发明中,最终吞吐量评估效果好坏使用对称平均相对误差(SMAPE)来衡量,SMAPE反映了评估值与真实值之间相对误差的大小,同时解决了由于真实值过小可能带来的相对误差太大的问题,其公式为:In the present invention, the final throughput evaluation effect is measured by the symmetric mean relative error (SMAPE), and the SMAPE reflects the relative error between the evaluation value and the real value, and solves the problem that the real value is too small. The problem of relative error is too large, the formula is:
Figure PCTCN2016084549-appb-000009
Figure PCTCN2016084549-appb-000009
其中,Ft为评估值,At为实际值。Where Ft is the evaluation value and At is the actual value.
为了筛选出部分基站作为重要基站,依次选取度最大的前m(m=1,2……N)基站作为重要基站,即对基站关系网络吞吐量评估效果起重要作用的M个基站,根据支持向量机评估其他N-m个基站的吞吐量。计算评估出来的每一个基站的SMAPE,选取平均SMAPE最小时的M个基站作为重要基站。In order to select some base stations as important base stations, the top m (m=1, 2...N) base stations with the largest degree are selected as the important base stations, that is, M base stations that play an important role in evaluating the throughput of the base station relationship network, according to the support. The vector machine evaluates the throughput of other Nm base stations. Calculate the SMAPE of each base station that is evaluated, and select the M base stations with the lowest average SMAPE as the important base stations.
步骤4、使用重要基站评估其他基站吞吐量。 Step 4. Use other important base stations to evaluate other base station throughput.
在本发明实施例中,根据选出的M个重要基站,采用SVM算法,使用吞吐量的历史数据训练出其他N-M个基站与M个重要基站的吞吐量关系模型。将待评估时间段内的M个基站的吞吐量输入到关系模型中,即可输出对应的N-M个基站的吞吐量。In the embodiment of the present invention, according to the selected M important base stations, the SVM algorithm is used, and the throughput relationship model of other N-M base stations and M important base stations is trained using the historical data of the throughput. The throughput of the M base stations in the time period to be evaluated is input into the relational model, and the throughput of the corresponding N-M base stations can be output.
下面结合附图5至附图10对本发明实施例进行说明。Embodiments of the present invention will be described below with reference to FIGS. 5 to 10.
实施例一 Embodiment 1
本实例中数据来源于某大型城市所有基站统计的数据,其时间颗粒度为60分钟,时间总长度为连续21天。本发明实施例中的无线网络空间吞吐量评估方法包含以下步骤: The data in this example is derived from the statistics of all base stations in a large city. The time granularity is 60 minutes and the total length of time is 21 consecutive days. The wireless network space throughput evaluation method in the embodiment of the present invention includes the following steps:
步骤一:数据预处理;Step 1: Data preprocessing;
A.根据需求选取空间位置上处于同一区域的95个基站; A. Select 95 base stations in the same area at the spatial location according to the requirements;
B.剔除95个基站中的异常数据点,得到每一基站的吞吐量序列;B. Excluding the abnormal data points in 95 base stations, and obtaining the throughput sequence of each base station;
C.对每一个基站吞吐量序列进行归一化处理,得到第i个基站的归一化吞吐量序列SiC. Normalize each base station throughput sequence to obtain a normalized throughput sequence S i for the i-th base station.
步骤二:对待研究的95个基站,构建基站关系网络;Step 2: Construct a base station relationship network for 95 base stations to be studied;
A.取这95个基站前18天的数据计算第i(i=1,2,3……95)个基站与第j(j=1,2,3……95)个基站之间的相关系数ρij,计算公式为A. Calculate the correlation between the i-th (i=1, 2, 3...95) base stations and the jth (j=1, 2, 3...95) base stations by taking the data of the first 18 days of the 95 base stations. The coefficient ρ ij is calculated as
Figure PCTCN2016084549-appb-000010
Figure PCTCN2016084549-appb-000010
其中T=432,Si为第i个基站的吞吐量序列,
Figure PCTCN2016084549-appb-000011
为第i个基站在总时长内的平均吞吐量,
Figure PCTCN2016084549-appb-000012
为第i个基站在时刻t时的吞吐量大小(t=1,2,3……432);Sj为第j个基站的吞吐量序列,
Figure PCTCN2016084549-appb-000013
为第j个基站在总时长内的平均吞吐量,
Figure PCTCN2016084549-appb-000014
为第j个基站在时刻t时的吞吐量大小(t=1,2,3……432)。
Where T=432, S i is the throughput sequence of the i-th base station,
Figure PCTCN2016084549-appb-000011
The average throughput of the i-th base station over the total duration,
Figure PCTCN2016084549-appb-000012
The throughput of the i-th base station at time t (t=1, 2, 3...432); S j is the throughput sequence of the j-th base station,
Figure PCTCN2016084549-appb-000013
The average throughput of the jth base station over the total duration,
Figure PCTCN2016084549-appb-000014
The throughput of the jth base station at time t (t = 1, 2, 3 ... 432).
B.在本发明实施例中,给定相关系数阈值c=0.6(一般认为相关系数大于0.6即为强相关),若ρij大于0.6,则在基站i与基站j之间添加一条无向边,这样就可以构建出95个基站的关系网络。如图5所示,其中点代表基站,无向边体现了基站之间的相关性,点越大代表该基站的度越大;其中,点越大,代表无向边越多,无向边越多,度越大。B. In the embodiment of the present invention, a given correlation coefficient threshold c=0.6 (it is generally considered that the correlation coefficient is greater than 0.6 is a strong correlation), and if ρ ij is greater than 0.6, an undirected edge is added between the base station i and the base station j. So, you can build a network of 95 base stations. As shown in FIG. 5, the dot represents the base station, and the undirected edge reflects the correlation between the base stations. The larger the point, the greater the degree of the base station; wherein, the larger the point, the more the undirected side, the undirected side The more, the greater the degree.
步骤四:选取重要基站;Step 4: Select an important base station;
A.从18天的历史数据中,选取前15天数据作为训练样本集,后3天数据作为测试样本集;将所有基站前15天数据作为支持向量机算法(SVM)的输入,输出训练得到的其他95-m个基站与选取的m个基站吞吐量关系模型;A. From the 18-day historical data, the first 15 days of data is selected as the training sample set, and the last 3 days of data is used as the test sample set; the first 15 days of data of all base stations is used as the input of the support vector machine algorithm (SVM), and the output training is obtained. The other 95-m base stations and the selected m base station throughput relationship models;
B.将度最大的m个基站的后3天数据作为吞吐量关系模型的输入,输出其他95-m个基站后3天的估计值;B. The last 3 days of the m base stations with the largest degree are used as the input of the throughput relationship model, and the estimated values of the other 3 days after the other 95-m base stations are output;
C.计算95-m个基站每一个基站相应的SMAPE,做出SMAPE平均值随 m的变化情况,如图6所示,黑点为95-m个基站SMAPE的平均值,从图中可以看出,当m=8时,其他基站的平均SMAPE最小,也就是预测效果最佳,因此在本实施例中选取的重要基站个数为M=8。C. Calculate the corresponding SMAPE of each base station of 95-m base stations, and make the average value of SMAPE The change of m, as shown in Figure 6, the black point is the average value of 95-m base station SMAPE. It can be seen from the figure that when m=8, the average SMAPE of other base stations is the smallest, which is the best prediction effect. Therefore, the number of important base stations selected in this embodiment is M=8.
步骤五:使用支持向量机算法评估空间吞吐量。Step 5: Use the support vector machine algorithm to estimate the space throughput.
在本发明实施例中,根据选出的8个重要基站,采用支持向量机算法(SVM)使用吞吐量的历史数据训练出其他87个基站与8个重要基站的吞吐量关系模型。将原始21天数据中的最后3天的8个基站的吞吐量输入到关系模型中,即可输出对应的87个基站的吞吐量。In the embodiment of the present invention, according to the selected eight important base stations, the support vector machine algorithm (SVM) uses the historical data of the throughput to train the throughput relationship model of the other 87 base stations and the 8 important base stations. By inputting the throughput of the eight base stations in the last three days of the original 21-day data into the relational model, the throughput of the corresponding 87 base stations can be output.
如图7所示,展示了87个基站中部分基站的评估结果,其中1为评估值,2为真实值。计算87个基站的评估误差,得到平均SMAPE=30.3%,可见本发明实施例方法具有较高的准确度As shown in FIG. 7, the evaluation results of some of the 87 base stations are shown, where 1 is the evaluation value and 2 is the real value. Calculating the evaluation error of 87 base stations, and obtaining an average SMAPE=30.3%, it can be seen that the method of the embodiment of the present invention has high accuracy.
实施例二 Embodiment 2
本实例中数据来源于某大型城市中典型区域的统计数据,其时间颗粒度为60分钟,时间总长度为连续18天。本发明实施例中的无线网络空间吞吐量评估方法包含以下步骤:The data in this example is derived from the statistical data of a typical area in a large city with a time granularity of 60 minutes and a total length of time of 18 consecutive days. The wireless network space throughput evaluation method in the embodiment of the present invention includes the following steps:
步骤一:数据预处理;Step 1: Data preprocessing;
A.根据需求选取空间位置上处于同一区域的117个基站;A. Select 117 base stations in the same area at the spatial location according to the requirements;
B.剔除117个基站中的异常数据点,得到每一基站的吞吐量序列;B. Excluding the abnormal data points in 117 base stations, and obtaining the throughput sequence of each base station;
C.对每一个基站吞吐量序列进行归一化处理,得到第i个基站的归一化吞吐量序列SiC. Normalize each base station throughput sequence to obtain a normalized throughput sequence S i for the i-th base station.
步骤二:对待研究的117个基站,构建基站关系网络;Step 2: 117 base stations to be studied, and construct a base station relationship network;
A.取这117个基站前15天的数据计算第i(i=1,2,3……117)个基站与第j(j=1,2,3……117)个基站之间的相关系数ρij,计算公式为A. Calculate the correlation between the i-th (i=1, 2, 3...117) base stations and the jth (j=1, 2, 3...117) base stations by taking the data of the first 15 days of the 117 base stations. The coefficient ρ ij is calculated as
Figure PCTCN2016084549-appb-000015
Figure PCTCN2016084549-appb-000015
其中T=360,Si为第i个基站的吞吐量序列,
Figure PCTCN2016084549-appb-000016
为第i个基站在总时长内 的平均吞吐量,
Figure PCTCN2016084549-appb-000017
为第i个基站在时刻t时的吞吐量大小(t=1,2,3……360);Sj为第j个基站的吞吐量序列,
Figure PCTCN2016084549-appb-000018
为第j个基站在总时长内的平均吞吐量,
Figure PCTCN2016084549-appb-000019
为第j个基站在时刻t时的吞吐量大小(t=1,2,3……360)。
Where T=360, S i is the throughput sequence of the i-th base station,
Figure PCTCN2016084549-appb-000016
For the average throughput of the i-th base station over the total duration,
Figure PCTCN2016084549-appb-000017
The throughput of the i-th base station at time t (t=1, 2, 3...360); S j is the throughput sequence of the j-th base station,
Figure PCTCN2016084549-appb-000018
The average throughput of the jth base station over the total duration,
Figure PCTCN2016084549-appb-000019
The throughput of the jth base station at time t (t = 1, 2, 3 ... 360).
B.在本发明实施例中,给定相关系数阈值c=0.6(一般认为相关系数大于0.6即为强相关),若ρij大于0.6,则在基站i与基站j之间添加一条无向边,这样就可以构建出117个基站的关系网络。如图8所示,其中点代表基站,边体现了基站之间的相关性,点越大代表该基站的度越大;其中,点越大,代表无向边越多,无向边越多,度越大。B. In the embodiment of the present invention, a given correlation coefficient threshold c=0.6 (it is generally considered that the correlation coefficient is greater than 0.6 is a strong correlation), and if ρ ij is greater than 0.6, an undirected edge is added between the base station i and the base station j. So, you can build a network of 117 base stations. As shown in FIG. 8 , the point represents the base station, and the correlation between the base stations is reflected. The greater the point, the greater the degree of the base station; wherein, the larger the point, the more the undirected side, the more the undirected side The greater the degree.
步骤四:选取重要基站;Step 4: Select an important base station;
A.从15天的历史数据中,选取前12天数据作为训练集,后三天数据作为测试集;将所有基站前12天数据作为支持向量机算法(SVM)的输入,输出训练得到的其他117-m个基站与选取的m个基站吞吐量关系模型;A. From the 15 days of historical data, the first 12 days of data are selected as the training set, and the last three days of data are used as the test set; the first 12 days of data from all base stations are used as input to the support vector machine algorithm (SVM), and the output training is obtained. a throughput relationship model between 117-m base stations and selected m base stations;
B.将度最大的m个基站的后3天数据作为吞吐量关系模型的输入,输出其他117-m个基站后三天的估计值;B. The last 3 days of the m base stations with the largest degree are used as the input of the throughput relationship model, and the estimated values of the other 117-m base stations are outputted for the next three days;
C.计算117-m的基站每一个基站相应的SMAPE,做出平均SMAPE随m的变化情况,如图9所示,黑点为117-m个基站SMAPE的平均值,从图中可以看出,当m=11时,其他基站的平均SMAPE最小,也就是预测效果最佳,因此在本实施例中我们选取的重要基站个数为M=11。C. Calculate the average SMAPE change with m for each SMAPE of each base station of 117-m. As shown in Figure 9, the black point is the average value of 117-m base station SMAPE, as can be seen from the figure. When m=11, the average SMAPE of other base stations is the smallest, that is, the prediction effect is the best, so in this embodiment, the number of important base stations we select is M=11.
步骤五:使用SVM算法评估空间吞吐量。Step 5: Evaluate space throughput using the SVM algorithm.
在本发明实施例中,根据选出的11个重要基站,采用支持向量机算法(SVM)使用吞吐量的历史数据训练出其他106个基站与11个重要基站的吞吐量关系模型。将待评估时间段内的11个基站的吞吐量输入到关系模型中,即可输出对应的106个基站的吞吐量。In the embodiment of the present invention, according to the selected 11 important base stations, the support vector machine algorithm (SVM) uses the historical data of the throughput to train the throughput relationship model of the other 106 base stations and 11 important base stations. By inputting the throughput of 11 base stations in the time period to be evaluated into the relational model, the throughput of the corresponding 106 base stations can be output.
如图10所示,即为评估结果示例,其中3为评估值,4为真实值。计算106个基站的评估误差,得到平均SMAPE=36.4%,本发明实施例评估结果有较高的准确度。As shown in FIG. 10, it is an example of the evaluation result, where 3 is an evaluation value and 4 is a true value. The evaluation error of 106 base stations is calculated to obtain an average SMAPE=36.4%, and the evaluation result of the embodiment of the present invention has high accuracy.
综上所述,本发明实施例具有以下技术效果:In summary, the embodiments of the present invention have the following technical effects:
本发明实施例根据基站历史数据得到基站之间吞吐量变化关系,并构建 基站关系网络,从该网络中选取出少数重要基站,从而评估出其他大量基站的吞吐量。具有很高的实用价值,例如在基站数据采集中,有很多基站的数据会有缺失,采用本发明实施例,可以评估出缺失数据,从而做进一步的网络分析。同时,可以根据需求,灵活的选取不同地区或者时间段的吞吐量的历史数据来评估,具有普遍的适用性和更好的预测准确度。The embodiment of the invention obtains the relationship between the throughput changes of the base stations according to the historical data of the base station, and constructs The base station relationship network selects a few important base stations from the network to evaluate the throughput of other large base stations. It has high practical value. For example, in the data acquisition of the base station, there are many base stations whose data is missing. With the embodiment of the present invention, the missing data can be evaluated for further network analysis. At the same time, it can be flexibly selected according to the historical data of the throughput of different regions or time periods to evaluate, with universal applicability and better prediction accuracy.
尽管上文对本发明实施例进行了详细说明,但是本发明不限于此,本技术领域技术人员可以根据本发明的原理进行多种修改。因此,凡按照本发明原理所作的修改,都应当理解为落入本发明的保护范围。Although the embodiments of the present invention have been described in detail above, the present invention is not limited thereto, and various modifications may be made by those skilled in the art in accordance with the principles of the present invention. Therefore, modifications made in accordance with the principles of the invention are to be understood as falling within the scope of the invention.
本领域普通技术人员可以理解上述方法中的全部或部分步骤可通过程序来指令相关硬件(例如处理器)完成,所述程序可以存储于计算机可读存储介质中,如只读存储器、磁盘或光盘等。可选地,上述实施例的全部或部分步骤也可以使用一个或多个集成电路来实现。相应地,上述实施例中的每个模块/单元可以采用硬件的形式实现,例如通过集成电路来实现其相应功能,也可以采用软件功能模块的形式实现,例如通过处理器执行存储于存储器中的程序/指令来实现其相应功能。本发明不限制于任何特定形式的硬件和软件的结合。One of ordinary skill in the art will appreciate that all or a portion of the above steps may be performed by a program to instruct related hardware, such as a processor, which may be stored in a computer readable storage medium, such as a read only memory, disk or optical disk. Wait. Alternatively, all or part of the steps of the above embodiments may also be implemented using one or more integrated circuits. Correspondingly, each module/unit in the foregoing embodiment may be implemented in the form of hardware, for example, by implementing an integrated circuit to implement its corresponding function, or may be implemented in the form of a software function module, for example, being executed by a processor and stored in a memory. Programs/instructions to implement their respective functions. The invention is not limited to any specific form of combination of hardware and software.
虽然本申请所揭露的实施方式如上,但所述的内容仅为便于理解本申请而采用的实施方式,并非用以限定本申请,如本发明实施方式中的具体的实现方法。任何本申请所属领域内的技术人员,在不脱离本申请所揭露的精神和范围的前提下,可以在实施的形式及细节上进行任何的修改与变化,但本申请的专利保护范围,仍须以所附的权利要求书所界定的范围为准。The embodiments disclosed in the present application are as described above, but the descriptions are only for the purpose of understanding the present application, and are not intended to limit the present application, such as the specific implementation method in the embodiments of the present invention. Any modifications and changes in the form and details of the embodiments may be made by those skilled in the art without departing from the spirit and scope of the disclosure. The scope defined by the appended claims shall prevail.
工业实用性Industrial applicability
上述技术方案降低了进行数据的复杂度。 The above technical solution reduces the complexity of performing data.

Claims (10)

  1. 一种无线网络吞吐量的评估方法,所述评估方法包括:A method for evaluating wireless network throughput, the evaluation method comprising:
    获取N个基站的吞吐量的历史数据;Obtaining historical data of throughput of N base stations;
    根据所获取的N个基站的吞吐量的历史数据,构建所述N个基站的基站关系网络;Constructing a base station relationship network of the N base stations according to the acquired historical data of the throughput of the N base stations;
    根据构建的所述基站关系网络及获取的所述吞吐量的历史数据,找到对基站关系网络吞吐量评估效果起重要作用的M个基站,并将该M个基站作为重要基站;Obtaining M base stations that play an important role in evaluating the throughput of the base station relationship network according to the constructed base station relationship network and the historical data of the obtained throughput, and using the M base stations as important base stations;
    利用所述确定出的M个重要基站的吞吐量的历史数据,对剩余的N-M个基站吞吐量进行评估;Using the historical data of the determined throughput of the M important base stations, evaluating the remaining N-M base station throughputs;
    其中,N和M均为正整数,并且N大于M。Where N and M are both positive integers and N is greater than M.
  2. 根据权利要求1所述的评估方法,其中,所述获取N个基站的吞吐量的历史数据包括:The evaluation method according to claim 1, wherein the obtaining historical data of the throughput of the N base stations comprises:
    采集每个基站的原始吞吐量序列,并计算出采集到的所述原始吞吐量序列中吞吐量序列数值的平均值;Collecting an original throughput sequence of each base station, and calculating an average value of the throughput sequence values in the collected original throughput sequence;
    通过将所采集到的每个基站的每一个原始吞吐量序列中按照序列数值大小排序后、序列数值在前的预设百分比阈值的序列的数值替换为计算出的所述吞吐量序列数值的平均值,得到每个基站的新吞吐量序列;The value of the sequence of the preset percentage thresholds in which the sequence value is prior to each other in the original throughput sequence of each base station is replaced by the calculated average of the throughput sequence values. Value, get the new throughput sequence for each base station;
    通过对每个基站的新吞吐量的时间序列进行归一化处理,得到每个基站的归一化吞吐量序列,将得到的归一化吞吐量序列作为所述吞吐量的历史数据。The normalized throughput sequence of each base station is obtained by normalizing the time series of the new throughput of each base station, and the obtained normalized throughput sequence is used as the historical data of the throughput.
  3. 根据权利要求2所述的评估方法,其中,所述根据所获取的N个基站的吞吐量的历史数据,构建所述N个基站的基站关系网络包括:The evaluation method according to claim 2, wherein the constructing the base station relationship network of the N base stations according to the historical data of the acquired throughput of the N base stations comprises:
    根据所得到每个基站的所述归一化吞吐量序列,分别计算所述N个基站中两两基站之间的相关系数;Calculating correlation coefficients between two of the N base stations according to the normalized throughput sequence of each of the obtained base stations;
    当计算得到的所述相关系数大于相关系数阈值时,则在所述两两基站之间生成一条无向边; When the calculated correlation coefficient is greater than a correlation coefficient threshold, an undirected edge is generated between the two base stations;
    通过所述N个基站中两两基站之间生成的无向边,构建所述N个基站的所述基站关系网络。Constructing the base station relationship network of the N base stations by using an undirected edge generated between two base stations of the N base stations.
  4. 根据权利要求3所述的评估方法,其中,所述利用构建的所述基站关系网络以及获取的所述吞吐量的历史数据,找到对基站关系网络吞吐量评估效果起重要作用的M个基站包括:The evaluation method according to claim 3, wherein the M base stations that use the constructed base station relationship network and the acquired historical data of the throughput to find an effect on the base station relationship network throughput evaluation effect include :
    通过统计所述基站关系网络中每个基站的无向边条数,得到每个基站的度;Obtaining the degree of each base station by counting the number of undirected edges of each base station in the base station relationship network;
    依次选取所述N个基站中度最大的m个的基站,根据支持向量机评估N-m个基站的吞吐量,得到m种基站关系网络的吞吐量评估效果;Selecting the base stations of the N most basic base stations in sequence, and evaluating the throughput of the N-m base stations according to the support vector machine, and obtaining the throughput evaluation effect of the m base station relationship networks;
    在所得到的m种基站关系网络的吞吐量评估效果中,选取最好的吞吐量评估效果,并将所选取的最好吞吐量评估效果相对应的m个基站作为对基站关系网络吞吐量评估效果起重要作用的M个基站;In the throughput evaluation effect of the obtained m base station relationship networks, the best throughput evaluation effect is selected, and the m base stations corresponding to the selected best throughput evaluation effects are used as the base station relationship network throughput evaluation. M base stations whose effects play an important role;
    其中,m、M、N为正整数,M<=m,M<N,m<N。Where m, M, and N are positive integers, M<=m, M<N, and m<N.
  5. 根据权利要求4所述的评估方法,其中,所述每个基站的无向边条数与基站度的大小成正比。The evaluation method according to claim 4, wherein the number of undirected edges of each base station is proportional to the size of the base station.
  6. 根据权利要求5所述的评估方法,其中,所述利用确定出的所述M个重要基站的吞吐量的历史数据,对剩余的N-M个基站吞吐量进行评估包括:The evaluation method according to claim 5, wherein the evaluating the remaining N-M base station throughputs using the determined historical data of the throughput of the M important base stations comprises:
    通过支持向量机算法构造剩余的N-M个基站与M个重要基站的吞吐量关系模型;Constructing a throughput relationship model between the remaining N-M base stations and M important base stations by using a support vector machine algorithm;
    利用构造的所述吞吐量关系模型和所述M个重要基站的吞吐量历史数据,得到剩余的N-M个基站的评估吞吐量。The estimated throughput of the remaining N-M base stations is obtained by using the constructed throughput relationship model and the throughput history data of the M important base stations.
  7. 一种无线网络吞吐量的评估装置,所述评估装置包括:An apparatus for evaluating wireless network throughput, the evaluation apparatus comprising:
    采集模块,设置为获取N个基站的吞吐量的历史数据;An acquisition module, configured to acquire historical data of throughput of N base stations;
    构建模块,设置为根据所获取的N个基站的吞吐量的历史数据,构建所述N个基站的基站关系网络;a building module, configured to construct a base station relationship network of the N base stations according to the acquired historical data of the throughput of the N base stations;
    查找模块,设置为利用构建的所述基站关系网络以及获取的所述吞吐量的历史数据,找到对基站关系网络吞吐量评估效果起重要作用的M个基站,并将该M个基站作为重要基站; a searching module, configured to use the constructed base station relationship network and the acquired historical data of the throughput to find M base stations that play an important role in evaluating the throughput of the base station relationship network, and use the M base stations as important base stations ;
    评估模块,设置为利用所述确定出的M个重要基站的吞吐量的历史数据,对剩余的N-M个基站吞吐量进行评估;An evaluation module, configured to use the historical data of the determined throughput of the M important base stations to evaluate the remaining N-M base station throughputs;
    其中,N和M均为正整数,并且N大于M。Where N and M are both positive integers and N is greater than M.
  8. 根据权利要求7所述的评估装置,其中,所述采集模块包括:The evaluation device of claim 7, wherein the acquisition module comprises:
    计算吞吐量平均值单元,设置为采集每个基站的原始吞吐量序列,并计算出采集到的所述原始吞吐量序列中吞吐量序列数值的平均值;Calculating a throughput average unit, configured to collect a raw throughput sequence of each base station, and calculate an average value of the throughput sequence values in the collected original throughput sequence;
    获取单元设置为,通过将所采集到的每个基站的每一个原始吞吐量序列中按照序列数值大小排序后、序列数值在前的预设百分比阈值的序列的数值替换为计算出的所述吞吐量序列数值的平均值,得到每个基站的新吞吐量序列,以及通过对每个基站的新吞吐量的时间序列进行归一化处理,得到每个基站的归一化吞吐量序列,将得到的归一化吞吐量序列作为所述吞吐量的历史数据。The obtaining unit is configured to replace the calculated throughput with a sequence of a preset percentage threshold of the sequence value before each sequence of the original throughput sequence of each of the acquired base stations is replaced by the calculated The average value of the sequence values is obtained, and a new throughput sequence of each base station is obtained, and the normalized throughput sequence of each base station is obtained by normalizing the time series of the new throughput of each base station, and the obtained The normalized throughput sequence is used as historical data for the throughput.
  9. 根据权利要求8所述的评估装置,其中,所述构建模块包括:The evaluation device of claim 8, wherein the building block comprises:
    计算相关系数单元设置为,根据所得到每个基站的归一化吞吐量序列,分别计算所述N个基站中两两基站之间的相关系数;Calculating the correlation coefficient unit is configured to calculate correlation coefficients between two of the N base stations according to the normalized throughput sequence of each of the obtained base stations;
    生成无向边单元设置为,当计算得到的所述相关系数大于相关系数阈值时,则在所述两两基站之间生成一条无向边;Generating the undirected edge unit to set an undirected edge between the two base stations when the calculated correlation coefficient is greater than the correlation coefficient threshold;
    构建单元设置为,通过所述N个基站中两两基站之间生成的无向边,构建所述N个基站的所述基站关系网络。The building unit is configured to construct the base station relationship network of the N base stations by using an undirected edge generated between two base stations of the N base stations.
  10. 根据权利要求9所述的评估装置,其中,所述查找模块包括:The evaluation device of claim 9, wherein the lookup module comprises:
    获取单元设置为,通过统计所述基站关系网络中每个基站的无向边条数,得到每个基站的度,以及依次选取所述N个基站中度最大的m个基站,根据支持向量机评估N-m个基站的吞吐量,得到m种基站关系网络的吞吐量评估效果;The obtaining unit is configured to obtain the degree of each base station by counting the number of undirected edges of each base station in the base station relationship network, and sequentially select the m base stations with the medium most middle of the N base stations, according to the support vector machine Assessing the throughput of the Nm base stations, and obtaining the throughput evaluation effect of the m base station relationship networks;
    查找单元设置为,在所得到的m种基站关系网络的吞吐量评估效果中,选取最好的吞吐量评估效果,并将所选取的最好吞吐量评估效果相对应的m个基站作为对基站关系网络吞吐量评估效果起重要作用的M个基站;The searching unit is configured to select the best throughput evaluation effect in the throughput evaluation effect of the obtained m base station relationship networks, and use the m base stations corresponding to the selected best throughput evaluation effect as the base station M base stations that play an important role in the network throughput evaluation effect;
    其中,m、M、N为正整数,M<=m,M<N,m<N。 Where m, M, and N are positive integers, M<=m, M<N, and m<N.
PCT/CN2016/084549 2015-10-21 2016-06-02 Wireless network throughput evaluating method and device WO2016188498A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510686017.1 2015-10-21
CN201510686017.1A CN106612511B (en) 2015-10-21 2015-10-21 Wireless network throughput evaluation method and device based on support vector machine

Publications (1)

Publication Number Publication Date
WO2016188498A1 true WO2016188498A1 (en) 2016-12-01

Family

ID=57392550

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/084549 WO2016188498A1 (en) 2015-10-21 2016-06-02 Wireless network throughput evaluating method and device

Country Status (2)

Country Link
CN (1) CN106612511B (en)
WO (1) WO2016188498A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112600728A (en) * 2020-12-07 2021-04-02 华东交通大学理工学院 5G mobile base station flow prediction analysis system based on big data
CN113709794A (en) * 2021-08-23 2021-11-26 Oppo广东移动通信有限公司 Wireless network communication method and related device
CN117494908A (en) * 2023-12-29 2024-02-02 宁波港信息通信有限公司 Port cargo throughput prediction method and system based on big data

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111901206A (en) * 2020-08-28 2020-11-06 杭州安恒信息技术股份有限公司 Network card testing method, device and related equipment
CN112839342B (en) * 2020-12-31 2022-02-08 国网吉林省电力有限公司长春供电公司 Disaster relief mobile emergency base station site selection method based on support vector machine
TWI780822B (en) * 2021-07-19 2022-10-11 國立陽明交通大學 Network throughput evaluation device and method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101998476A (en) * 2009-08-31 2011-03-30 中国移动通信集团设计院有限公司 Method and device for determining cell throughput
CN102685766A (en) * 2012-05-13 2012-09-19 西华大学 Wireless network flow prediction method based on local minimax probability machine
WO2012148403A1 (en) * 2011-04-28 2012-11-01 Empire Technology Development Llc Mobile traffic forecasting using public transportation information
CN104394538A (en) * 2014-11-28 2015-03-04 重庆大学 Mobile network data flow analysis and prediction method

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101541030B (en) * 2009-05-06 2011-06-01 华为技术有限公司 Method for predicting data based on support vector machine and equipment thereof

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101998476A (en) * 2009-08-31 2011-03-30 中国移动通信集团设计院有限公司 Method and device for determining cell throughput
WO2012148403A1 (en) * 2011-04-28 2012-11-01 Empire Technology Development Llc Mobile traffic forecasting using public transportation information
CN102685766A (en) * 2012-05-13 2012-09-19 西华大学 Wireless network flow prediction method based on local minimax probability machine
CN104394538A (en) * 2014-11-28 2015-03-04 重庆大学 Mobile network data flow analysis and prediction method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
RATTARO, CLAUDINA ET AL.: "Throughput Prediction in Wireelass Networks Using Ststistical Learning", LAWDN-LATIN- AMERICAN WORKSHOP ON DYNAMIC NETWORKS, 30 November 2010 (2010-11-30), XP055331996 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112600728A (en) * 2020-12-07 2021-04-02 华东交通大学理工学院 5G mobile base station flow prediction analysis system based on big data
CN113709794A (en) * 2021-08-23 2021-11-26 Oppo广东移动通信有限公司 Wireless network communication method and related device
CN113709794B (en) * 2021-08-23 2024-03-19 Oppo广东移动通信有限公司 Wireless network communication method and related device
CN117494908A (en) * 2023-12-29 2024-02-02 宁波港信息通信有限公司 Port cargo throughput prediction method and system based on big data
CN117494908B (en) * 2023-12-29 2024-03-22 宁波港信息通信有限公司 Port cargo throughput prediction method and system based on big data

Also Published As

Publication number Publication date
CN106612511A (en) 2017-05-03
CN106612511B (en) 2020-03-27

Similar Documents

Publication Publication Date Title
WO2016188498A1 (en) Wireless network throughput evaluating method and device
Nikravesh et al. Mobile network traffic prediction using MLP, MLPWD, and SVM
JP6384065B2 (en) Information processing apparatus, learning method, and program
JP2021532502A (en) Neural network model training methods, equipment, computer equipment and storage media
Brooks et al. Nonparametric convergence assessment for MCMC model selection
CN110335168B (en) Method and system for optimizing power utilization information acquisition terminal fault prediction model based on GRU
CN107886160B (en) BP neural network interval water demand prediction method
CN103678004A (en) Host load prediction method based on unsupervised feature learning
Zhu et al. CARP: Context-aware reliability prediction of black-box web services
CN109242250A (en) A kind of user&#39;s behavior confidence level detection method based on Based on Entropy method and cloud model
WO2017071369A1 (en) Method and device for predicting user unsubscription
US20210099894A1 (en) Forcasting time series data
CN113869521A (en) Method, device, computing equipment and storage medium for constructing prediction model
CN111598457A (en) Method and device for determining quality of power wireless network
Suppa et al. A clustered approach for fast computation of betweenness centrality in social networks
CN110913407B (en) Overlapping coverage analysis method and device
CN113158435B (en) Complex system simulation running time prediction method and device based on ensemble learning
CN109948242A (en) Network representation learning method based on feature Hash
Allahdadi et al. Predicting short 802.11 sessions from radius usage data
Coates et al. Compressed network monitoring
JP6398991B2 (en) Model estimation apparatus, method and program
CN113783715B (en) Opportunistic network topology prediction method adopting causal convolutional neural network
Zhang et al. An improved composite hypothesis test for Markov models with applications in network anomaly detection
CN109086207B (en) Page response fault analysis method, computer readable storage medium and terminal device
An et al. Hypothesis testing for band size detection of high-dimensional banded precision matrices

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16799385

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16799385

Country of ref document: EP

Kind code of ref document: A1