CN105517019A - Method for detecting LTE (Long Term Evolution) network performance by using integrated regression system - Google Patents

Method for detecting LTE (Long Term Evolution) network performance by using integrated regression system Download PDF

Info

Publication number
CN105517019A
CN105517019A CN201510947246.4A CN201510947246A CN105517019A CN 105517019 A CN105517019 A CN 105517019A CN 201510947246 A CN201510947246 A CN 201510947246A CN 105517019 A CN105517019 A CN 105517019A
Authority
CN
China
Prior art keywords
regression
integrated
regression algorithm
training set
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510947246.4A
Other languages
Chinese (zh)
Other versions
CN105517019B (en
Inventor
吴冬华
欧阳晔
石路路
代心灵
胡岳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Hua Su Science And Technology Co Ltd
Original Assignee
Nanjing Hua Su Science And Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing Hua Su Science And Technology Co Ltd filed Critical Nanjing Hua Su Science And Technology Co Ltd
Priority to CN201510947246.4A priority Critical patent/CN105517019B/en
Publication of CN105517019A publication Critical patent/CN105517019A/en
Application granted granted Critical
Publication of CN105517019B publication Critical patent/CN105517019B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W24/00Supervisory, monitoring or testing arrangements
    • H04W24/02Arrangements for optimising operational condition

Abstract

The invention provides a method for detecting an LTE (Long Term Evolution) network performance by using an integrated regression system. The method comprises the following steps: step (1), collecting a sample data set; step (2), through an established model, using a reception index to predict an empty delay, and adopting a regression algorithm to respectively predict samples in a training set on a specific subset; step (3), deducing an error value of each sample in the training set and an error of the regression algorithm; step (4), using an analytical method to perform weighted processing on the error in the regression algorithm, and forming an integrated regression prediction model through weighted regression; and step (5), applying the integrated regression algorithm in step (4) to a test set, and detecting the accuracy of a model prediction relationship obtained based on the training set. Through the operation and promotion of the method for detecting the LTE network performance by using the integrated regression system provided by the invention, the empty delay of a user level can be analyzed and inferred, so that a mobile operator can recognize problem cells with a higher empty delay, thus the service quality of an LTE network can be improved by optimizing the problem cells.

Description

Integrated regression system is adopted to detect the method for LTE network performance
Technical field
The present invention relates to a kind of method detecting LTE network performance, especially relate to a kind of method adopting integrated regression system to detect LTE network performance.
Background technology
In recent years, along with the fast development of LTE wireless network, generation and the acquisition of communication data are also thereupon flourish.Except the store and management to this data flow, a larger challenge how to utilize these data serving communication network better.Therefore, by converting data to corresponding network index and carry out critic network performance and user experience quality (QoE) becoming final goal.By analyzing, the time delay of eating dishes without rice or wine due to higher value will affect network reception quality and increase network interferences, and this experiences to have on user awareness comparatively directly affects, and time delay of therefore eating dishes without rice or wine is as a core index.Generally, time delay of eating dishes without rice or wine adopts acquisition by soft, but software and hardware input cost is higher, does not possess all-round popularization condition.
For mobile communication business, most important time delay is end-to-end time delay, and namely for the transmitting-receiving two-end connected, packet produces from transmitting terminal, to the time delay that receiving terminal correctly receives.Different according to business model, end-to-end time delay can be divided into one way time delay and backhaul time delay, wherein one way time delay refers to that packet produces the time delay correctly arriving another one receiving terminal through wireless network from transmitting terminal, and backhaul time delay refers to that packet produces destination server from transmitting terminal and receives packet and return corresponding packet until transmitting terminal correctly receives the time delay of reply data bag.
Existing mobile communication is interpersonal communication mainly, and along with miniaturization and the intellectuality of hardware device, the high speed between following mobile communication more " people and thing " and " thing and thing " is connected to be applied.Machine communication (MachineTypeCommunication, MTC) service application scope widely, as portable medical, car networking, Smart Home, Industry Control, environmental monitoring etc. will promote MTC system application explosive growth, large number quipments is by access network, realize real " all things on earth is interconnected ", for mobile communication brings boundless vital force.Simultaneously, MTC system range of application also can bring new technological challenge to mobile communication widely, the business such as such as real-time cloud calculating, virtual reality, game on line, tele-medicine, intelligent transportation, intelligent grid, remote real_time control are more responsive to time delay, propose higher demand to time delay.
Therefore, need to work out a kind of detection method, analyze and infer the time delay of eating dishes without rice or wine of user class, thus make mobile operator identify the problem cells of higher time delay of eating dishes without rice or wine, and then improve LTE network service quality by optimization problem community.
Summary of the invention
The object of this invention is to provide a kind of method detecting LTE network performance with integrated regression system makes mobile operator identify the problem cells of higher time delay of eating dishes without rice or wine to solve, and then improves the problem of LTE network service quality by optimization problem community.
Technical solution of the present invention is:
Adopt integrated regression system to detect the method for LTE network performance, comprise the following steps:
(1) collection of sample data collection: the reception achievement data of LTE network and network each stage time delay of eating dishes without rice or wine is carried out to collection and formed sample data collection, and described sample data collection is divided into training set and test set;
Wherein, training set is for finding the projected relationship receiving index and eat dishes without rice or wine between time delay; Test set is for detecting the accuracy of the model prediction relation drawn based on training set;
(2) in the operating process of described training set, by the model set up, utilize and receive index prediction and to eat dishes without rice or wine time delay, adopt regression algorithm to predict the sample in training set in particular subset respectively;
(3) by comparing the predicted value that in above-mentioned steps (2), model obtains and time delay value of eating dishes without rice or wine really, the error amount of each sample and the error of regression algorithm in training set is derived;
Wherein, for regression algorithm J, the row error amount in training set is then called as error J;
(4) adopt analytical method that the error in regression algorithm is weighted process, form integrated regressive prediction model by weighted regression combination;
(5) the integrated regression algorithm in step (4) is applied to test set, detect the accuracy of the model prediction relation drawn based on training set.
Further, the reception index receiving achievement data in described step (1) comprises Reference Signal Received Power RSRP, Reference Signal Received Quality RSRQ, Signal to Interference plus Noise Ratio SINR, the Physical Resource Block PUSCH-PRB on Physical Uplink Shared Channel and the Physical Resource Block PDSCH-PRB on Physical Downlink Shared Channel.
Further, the regression algorithm that employing eight kinds is different in step (2) is predicted the sample in training set respectively in particular subset, linear regression respectively, second order polynomial regression, three rank polynomial regressions, ridge regression, LASSO returns, Elastic returns, and GAM returns and MARS returns;
Wherein, described linear regression provides following formula (1):
E (y)=β 0+ β 1x 1+ ...+β dx d(1), in formula, E (y) represents predicted value, and y represents hypothesis and to eat dishes without rice or wine time delay, x 1..., x drepresentative receives index; In this model, response variable y Gaussian distributed, uses least square method directly to calculate and obtains corresponding fitting coefficient β 0..., β d;
Described second order polynomial regression algorithm first calculate each index set once, Quadratic Orthogonal multinomial, thus obtain 2D form, carry out models fitting with 2D+1 item parameter;
Described three rank polynomial regression algorithm, when the choosing of variable, the number of times of orthogonal polynomial is chosen from 1 time to 3 times, thus decreases the constraint of forecast model;
Described ridge regression algorithm provides following formula (2):
in formula, algorithm is by increasing penalty coefficient, to factor beta 0..., β dlimit, become contraction, thus find the minimum variance of least squares estimator, the theory that wherein parametric t proposes based on control forecasting value variance according to E.Cule and M.DeIorio is chosen automatically, and k represents a kth coefficient.The scope of k from 1,2,3 ..., d;
Described LASSO regression algorithm provides following formula (3):
constraint function in formula limits the absolute value sum of regression coefficient, removing constant coefficient, and t value is chosen automatically;
Described Elastic regression algorithm provides following formula (4):
constraint function in formula is the linear combination constraint function in described ridge regression algorithm and described LASSO regression algorithm being carried out to regularization, and wherein, α is that 1/2, t value is chosen automatically;
Described GAM regression algorithm provides following formula (5):
G (E (y))=β 0+ f 1(x 1)+...+f d(x d) (5), in formula, g represents generalized linear Copula, f 1..., f drepresent the non-linear relationship between input variable, β 0for constant term, x 1..., x dreceive from five the data obtained index;
Described MARS regression algorithm provides following formula (6):
X → max (0, x-C) orx → max (0, C-x); C ∈ R (6), in MARS returns, returns the linear combination being synthesized to a hinge function.
Further, in described step (2), the calculating on training set is all completed by successive Regression, by finding suitable weight.
Further, in step (4), utilize the row error obtained in step (3) to infer weight, utilize the method for weight optimization model to give following formula (7):
wherein w 1... w 2drawn by minimizing Weighted least square method and the total weight calculation of restriction.
Further, infer in described step (4) in the process of weight, give following formula (8) for constraints:
Σ j = 1 8 w j = 1 - - - ( 8 ) .
Further, in described step (5), the predicted value obtained in test set and known time delay value of truly eating dishes without rice or wine are compared, and give formula (9) and draw error rate ε test;
ϵ t e s t = 1 n t e s t Σ i = 1 n t e s t | y ^ i - y i | - - - ( 9 ) , In order to confirm stability and the accuracy of the integrated regressive prediction model obtained in described step (4), needing to carry out various different comparison, first, the identical error rate being performed on training set, obtaining an error rate ε trsinif, ε testand ε trsinbetween difference less, then show not occur over-fitting, this means the stability of integrated regressive prediction model; Secondly, test set calculates the predicted value of often kind of regression algorithm, makes it respectively obtain an error rate, by the ε these error rates and integrated recurrence obtained testcompare, then can check out the accuracy of integrated regressive prediction model.
The invention has the beneficial effects as follows: adopted the integrated regression system in the method for integrated regression system detection LTE network performance by the present invention, the prediction of effectively deriving of cell RF index is utilized to eat dishes without rice or wine time delay, integrated regressive prediction model is not only stablized but also can accurately be predicted time delay of eating dishes without rice or wine, it can carry out the selection of calculating and model automatically, and therefore model possesses stronger adaptivity; By examples prove, the theory of this assessment models is accurately, and from the angle of user, integrated regression system is easier to obtain and analyze index of correlation data, thus the predicting the outcome of the time delay that obtains eating dishes without rice or wine.Promoted by the operation adopting integrated regression system to detect the method for LTE network performance, can analyze and infer the time delay of eating dishes without rice or wine of user class, thus make mobile operator identify the problem cells of higher time delay of eating dishes without rice or wine, and then improve LTE network service quality by optimization problem community.
Accompanying drawing explanation
Fig. 1 is that the operation workflow that the embodiment of the present invention adopts integrated regression system to detect the method for LTE network performance illustrates schematic diagram;
Fig. 2 gives the simulated data sets comparative illustration schematic diagram of the integrated regression system of the present invention and classical regression algorithm.
Embodiment
The preferred embodiments of the present invention are described in detail below in conjunction with accompanying drawing.
Embodiment provides a kind of method adopting integrated regression system to detect LTE network performance, and the enforcement of this detection method comprises (1) data source: receive cell RF performance index data set; (2) associate: associating of the time delay value of eating dishes without rice or wine of cell RF index and each element of data centralization; (2) predict: be newly worth by cell RF index and estimate time delay of eating dishes without rice or wine.Other implementation contents except Section 1: sample data: sample data is distributed according to training set and test set; Modeling process: process data transporting something containerized statistical regression algorithm, calculate often kind of regression algorithm weight, in conjunction with regression algorithm and weight, to obtain Aksu River anticipation function; Modelling verification: on fc-specific test FC collection, model result is verified, the then accuracy of known predicted value.
Embodiment
Adopt integrated regression system to detect the method for LTE network performance, as shown in Figure 1, comprising:
(1) collection of sample data collection: the reception achievement data of LTE network and network each stage time delay of eating dishes without rice or wine is carried out to collection and formed sample data collection, and described sample data collection is divided into training set and test set;
Wherein, training set is for finding the projected relationship receiving index and eat dishes without rice or wine between time delay; Test set is for detecting the accuracy of the model prediction relation drawn based on training set;
(2) in the operating process of described training set, by the model set up, utilize and receive index prediction and to eat dishes without rice or wine time delay, adopt regression algorithm to predict the sample in training set in particular subset respectively;
(3) by comparing the predicted value that in above-mentioned steps (2), model obtains and time delay value of eating dishes without rice or wine really, the error amount of each sample and the error of regression algorithm in training set is derived;
Wherein, for regression algorithm J, the row error amount in training set is then called as error J;
(4) adopt analytical method that the error in regression algorithm is weighted process, form integrated regressive prediction model by weighted regression combination;
(5) the integrated regression algorithm in step (4) is applied to test set, detect the accuracy of the model prediction relation drawn based on training set.
Further, the reception index receiving achievement data in described step (1) comprises Reference Signal Received Power RSRP, Reference Signal Received Quality RSRQ, Signal to Interference plus Noise Ratio SINR, the Physical Resource Block PUSCH-PRB on Physical Uplink Shared Channel and the Physical Resource Block PDSCH-PRB on Physical Downlink Shared Channel.
Further, the regression algorithm that employing eight kinds is different in step (2) is predicted the sample in training set respectively in particular subset, linear regression respectively, second order polynomial regression, three rank polynomial regressions, ridge regression, LASSO returns, Elastic returns, and GAM returns and MARS returns;
Wherein, described linear regression provides following formula (1):
E (y)=β 0+ β 1x 1+ ...+β dx d(1) in formula, E (y) represents predicted value, and y represents hypothesis and to eat dishes without rice or wine time delay, x 1..., x drepresentative receives index; In this model, response variable y Gaussian distributed, uses least square method directly to calculate and obtains corresponding fitting coefficient β 0..., β d;
Described second order polynomial regression algorithm first calculate each index set once, Quadratic Orthogonal multinomial, thus obtain 2D form, carry out models fitting with 2D+1 item parameter;
Described three rank polynomial regression algorithm, when the choosing of variable, the number of times of orthogonal polynomial is chosen from 1 time to 3 times, thus decreases the constraint of forecast model;
Described ridge regression algorithm provides following formula (2):
in formula, algorithm is by increasing penalty coefficient, to factor beta 0..., β dlimit, become contraction, thus find the minimum variance of least squares estimator, the theory that wherein parametric t proposes based on control forecasting value variance according to E.Cule and M.DeIorio is chosen automatically, k represents a kth coefficient, the scope of k from 1,2,3 ..., d;
Described LASSO regression algorithm provides following formula (3):
constraint function in formula limits the absolute value sum of regression coefficient, removing constant coefficient, and t value is chosen automatically;
Described Elastic regression algorithm provides following formula (4):
αΣ k = 1 d β k 2 + ( 1 - α ) Σ k = 1 d | β k | ≤ t - - - ( 4 ) , Constraint function in formula is the linear combination constraint function in described ridge regression algorithm and described LASSO regression algorithm being carried out to regularization, and wherein, α is that 1/2, t value is chosen automatically;
Described GAM regression algorithm provides following formula (5):
G (E (y))=β 0+ f 1(x 1)+...+f d( xd) (5), in formula, g represents generalized linear Copula, f 1..., f drepresent the non-linear relationship between input variable, β 0for constant term, x 1..., x dreceive from five the data obtained index;
Described MARS regression algorithm provides following formula (6):
X → max (0, x-C) orx → max (0, C-x); C ∈ R (6), in MARS returns, returns the linear combination being synthesized to a hinge function.
Further, in described step (2), the calculating on training set is all completed by successive Regression, by finding suitable weight.
Further, in step (4), utilize the row error obtained in step (3) to infer weight, utilize the method for weight optimization model to give following formula (7):
wherein w 1... w sdrawn by minimizing Weighted least square method and the total weight calculation of restriction.
Further, infer in described step (4) in the process of weight, give following formula (8) for constraints:
Σ j = 1 8 w j = 1 - - - ( 8 ) .
Further, in described step (5), the predicted value obtained in test set and known time delay value of truly eating dishes without rice or wine are compared, and give formula (9) and draw error rate ε test:
ϵ t e s t = 1 n t e s t Σ i = 1 n t e s t | y ^ i - y i | - - - ( 9 ) , In order to confirm stability and the accuracy of the integrated regressive prediction model obtained in described step (4), needing to carry out various different comparison, first, the identical error rate being performed on training set, obtaining an error rate ε trsinif, ε testand ε trsinbetween difference less, then show not occur over-fitting, this means the stability of integrated regressive prediction model; Secondly, test set calculates the predicted value of often kind of regression algorithm, makes it respectively obtain an error rate, by the ε these error rates and integrated recurrence obtained testcompare, then can check out the accuracy of integrated regressive prediction model.
Fig. 1 gives embodiment and adopts integrated regression system to detect the operation workflow explanation schematic diagram of the method for LTE network performance.In more detail, the object that the integrated regression system of this employing detects the method for LTE network performance is that conventional receiver index prediction by describing in detail is below eated dishes without rice or wine the approximation of time delay, but be more critically, present slight non-linear between time delay of eating dishes without rice or wine and conventional receiver index, there is no direct correlation.Therefore, algorithm model can not process the index received simply by linear regression.Multiple regression algorithm, when predicting large data, is attached in final weight estimation, just can increases system robustness, and obtain predicted value accurately by this system (the integrated regressive prediction model in the present invention).
Legacy system can carry out summary by the flow chart shown in Fig. 1 and describe.First complete and receive achievement data and network each stage and to eat dishes without rice or wine the collection of time delay.This sample data collection is divided into training set and test set by model.Wherein the data of 70% be training set, 30% data be test set.
Secondly, training set is for finding the projected relationship receiving index and eat dishes without rice or wine between time delay.In the operating process of training set, by the model set up, utilize and receive index prediction and to eat dishes without rice or wine time delay.Specifically, be exactly the different regression algorithm of use eight kinds, in particular subset, the sample in training set predicted respectively.(by staying a cross-validation method, will describe in detail in ensuing paragraph).
The predicted value obtained by comparison model and time delay value of eating dishes without rice or wine really, thus derive the error amount of each sample and the error of often kind of regression algorithm in training set.For regression algorithm J, the row error amount in training set is then called as " error J ".
Afterwards, take a kind of analytical method that regression algorithm is weighted process (be called least square method, will introduce in detail in ensuing paragraph) according to error 1 ~ 8.Algorithm after being combined by weighted regression just constitutes our integrated regressive prediction model.
Finally, integrated regression algorithm is applied to test set.Detect the accuracy of the model prediction relation drawn based on training set with it, and then improve the forecasting accuracy of whole model.According to Mathematical Modeling proof rule, model result can carry out matching accurately at training set and checking collection, and so not think the situation that there is overfitting, then this model is accurately.
The present embodiment, except describing time delay of eating dishes without rice or wine, also relates to the reception index that other five are used for weighing the quality of reception and MPS process.They and eat dishes without rice or wine between time delay, to there is certain positive and negative dependency relation.These five indexs are specifically: Reference Signal Received Power (RSRP), Reference Signal Received Quality (RSRQ), Signal to Interference plus Noise Ratio (SINR), the Physical Resource Block (PUSCH-PRB) on Physical Uplink Shared Channel and the Physical Resource Block (PDSCH-PRB) on Physical Downlink Shared Channel.RSRP represents on the frequency band measured, the linear average of the power of the Resource Unit of carrying community own reference signal, i.e. average signal strength.Measured the power of Received signal strength by the mean value of some reference signals, therefore can know whether user easily accesses this community by it.The ratio of what RSRQ was corresponding is received signal strength and total bandwidth power, wherein, total bandwidth power comprises Serving cell power, interference power and noise.It is that the level of noise and signal disturbing is by quantifying to assess the quality of received signal.SINR is the ratio between signal power and interference power, noise summation.PUSCH-PRB (PDSCH-PRB) represents the PRB quantity distributing to up channel (down channel), and user is an important reference at the numerical value of local resource consumption.
Sample data is divided into training set and test set, when the training set based on sample data carries out regression training, first need to understand in native system involved eight kinds of different regression algorithms.As mentioned above, these eight kinds of regression algorithms have used different statistical methods and statistic algorithm, because this ensure that the accuracy of integrated regression system.These eight kinds of regression algorithms are: linear regression, second order polynomial regression, three rank polynomial regressions, ridge regression, and LASSO returns, and Elastic returns, and GAM returns and MARS returns.
In classical linear regression, suppose to eat dishes without rice or wine time delay (hereinafter referred to as y) (being called x with reception index 1..., x d) between pass be linear.Using formula 1 represents relation between the two, and wherein E (y) represents predicted value.In this model, response variable y Gaussian distributed, uses least square method directly to calculate and obtains corresponding fitting coefficient β 0..., β d.Take this regression algorithm, final predicted value will be subject to the impact of all reception indexs.
E(y)=β 01x 1+...+β dx d(1)
Second order polynomial regression algorithm is comparatively close to classical linear regression.But it is not directly select to receive index x 1..., x d, but first calculate each index set once, Quadratic Orthogonal multinomial, thus obtain 2D form.After having made this change, carry out models fitting with 2D+1 (the 2D form of index and constant term) item parameter, instead of the D+1 item parameter in classical linear model.This fit approach operation is upper the same with least square method before.
Three rank polynomial regression algorithm, model is similar to second order polynomial, but when the choosing of variable, the number of times of orthogonal polynomial is chosen from 1 time to 3 times, thus decreases the constraint of forecast model.
It is similar that ridge regression algorithm and formula 1 describe, but unlike, this algorithm passes through to increase penalty coefficient, to factor beta 0..., β dcarry out limiting (becoming contraction).Precisely, this algorithm by limiting the quadratic sum of coefficient, thus finds the minimum variance of least squares estimator (being similar to classical linear regression), as shown in equation 2.The theory that wherein parametric t proposes based on control forecasting value variance according to E.Cule and M.DeIorio is chosen automatically.
Σ k = 1 d β k 2 ≤ t - - - ( 2 )
LASSO return and ridge regression a bit similar, but wherein constraint function difference.The constraint function that LASSO returns limits the absolute value sum (removing constant coefficient) of regression coefficient, as shown in Equation 3.Theory analysis shows that this is more strong constraint for coefficient, even can produce some strict null coefficient.Therefore, this algorithm by making alternatively to carry out coefficient of reduction, thus achieves the object that index set simplifies.In ridge regression, t value is chosen automatically.
Σ k = 1 d | β k | ≤ t - - - ( 3 )
It is the another kind of algorithm by the coefficient of reduction that carries out in returning at ridge regression and LASSO compromising that Elastic returns.Here constraint function is the linear combination constraint function in ridge regression and LASSO recurrence being carried out to regularization, as shown in Equation 4.Wherein, α is that 1/2, t value is chosen automatically.
αΣ k = 1 d β k 2 + ( 1 - α ) Σ k = 1 d | β k | ≤ t - - - ( 4 )
In GAM returns, eat dishes without rice or wine time delay value y and index input variable x 1..., x dbetween relation as shown in Equation 5, wherein suppose y from ED~* class.On the left side of formula, G represents generalized linear Copula.F on the right of formula 1..., f drepresent the non-linear relationship between input variable, β 0for constant term.Function f 1a kind of nonparametric backfitting algorithm can be considered to.This algorithm uses the mode of iteration, utilizes cubic spline function (at least there is the cube of certain index) to carry out approximate description.In this article, y represents and eats dishes without rice or wine time delay and suppose Gaussian distributed (belonging to exponential family), g=id, x 1..., x dreceive from five the data obtained index.
g(E(y))=β 0+f 1(x 1)+...+f d(x d)(5)
In MARS returns, return the linear combination being synthesized to a hinge function.Hinge function, as in formula 6 define, be by the overall nonlinear model of self non-linear release.As a whole, whole space is divided into the polynomial subspace of each response.Return operation to be separated these spaces and performing, need two steps.First, by reducing the error of residual sum of squares (RSS), the basic function that makes new advances of forward calculation and hinge function in an iterative manner.Next, then by backward deletion minimum contribution item, model is revised.Final step is then reduce over matching.
x→max(0,x-C)orx→max(0,C-x);C∈R(6)
Calculating on training set is all completed by successive Regression.By finding suitable weight, thus often kind of regression combination is become integrated recurrence.Next will be described this process.
In each regression algorithm J, element i each on training set is adopted and stays a cross-validation method, thus obtain predicted value in order to obtain this predicted value, then to carry out the operation of regression algorithm j to all elements except i.So obtain one to be obtained by element i anticipation function.When eating dishes without rice or wine, Yanzhong all elements is all known, variance then by calculating.Therefore, for each regression algorithm J, column vector error is then defined as: ε j:=(ε ij) ' i(wherein ' represent transposition).
Next step utilizes these row errors to infer weight.W 1... w 8drawn by minimizing Weighted least square method and the total weight calculation of restriction.Wherein, formula 7 defines the method utilizing weight optimization model, and formula 8 defines constraints.Here || || 2represent Euclid norm.
arg min w 1 ... w s | | Σ j = 1 8 w j ϵ j | | 2 - - - ( 7 )
Σ j = 1 8 w j = 1 - - - ( 8 )
After obtaining weight, still need to understand the regression function quoted in often kind of prediction and often kind of regression algorithm.In order to realize this point, all elements on training set is all performed by regression algorithm j, thus obtain anticipation function P j.Finally, integrated recurrence is then defined as P jlinear weighted combination.
Once integrated recurrence is drawn by derivation, then its result is checked in test set.In checking process, the predicted value obtained in test set and known time delay value of truly eating dishes without rice or wine are compared.The average error rate defined in formula 9 is calculated error rate ε test.In order to confirm stability and the accuracy of integrated regression algorithm, need to carry out various different comparison.First, the identical error rate is performed on training set, obtain an error rate ε trsin.If ε testand ε trsinbetween difference less, then show not occur over-fitting, this means the stability of algorithm.Secondly, test set calculates the predicted value of often kind of regression algorithm, makes it respectively obtain an error rate.By the ε that these error rates and integrated recurrence are obtained testcompare, then can check out the accuracy of algorithm.
ϵ t e s t = 1 n t e s t Σ i = 1 n t e s t | y ^ i - y i | - - - ( 9 )
Fig. 2 gives a sample of analogue data, according to said process, calculates associated weight and derive integrated regression algorithm by analytic approach, calculating average error rate at training set and test set.In this example, the error increment of training set and test set is lower, shows that model does not occur overfitting.Therefore known, integrated regression algorithm is for other regression algorithm, and the result drawn is more accurate, and practicality is stronger.In addition, relative to other all algorithms (or even three rank polynomial regression algorithm), the stability of integrated recurrence is also stronger.Therefore, preferred regression algorithm is exactly integrated recurrence here.
In general, the prediction of effectively deriving of cell RF index is utilized to eat dishes without rice or wine time delay by integrated regression system.Integrated regression model is not only stablized but also can accurately be predicted time delay of eating dishes without rice or wine, and it can carry out the selection of calculating and model automatically, and therefore model possesses stronger adaptivity.By examples prove, the theory of this assessment models is accurately.From the angle of user, native system is easier to obtain and analyze index of correlation data, thus the predicting the outcome of the time delay that obtains eating dishes without rice or wine.

Claims (7)

1. adopt integrated regression system to detect a method for LTE network performance, comprise the following steps:
(1) collection of sample data collection: the reception achievement data of LTE network and network each stage time delay of eating dishes without rice or wine is carried out to collection and formed sample data collection, and described sample data collection is divided into training set and test set;
Wherein, training set is for finding the projected relationship receiving index and eat dishes without rice or wine between time delay; Test set is for detecting the accuracy of the model prediction relation drawn based on training set;
(2) in the operating process of described training set, by the model set up, utilize and receive index prediction and to eat dishes without rice or wine time delay, adopt regression algorithm to predict the sample in training set in particular subset respectively;
(3) by comparing the predicted value that in above-mentioned steps (2), model obtains and time delay value of eating dishes without rice or wine really, the error amount of each sample and the error of regression algorithm in training set is derived;
Wherein, for regression algorithm J, the row error amount in training set is then called as error J;
(4) adopt analytical method that the error in regression algorithm is weighted process, form integrated regressive prediction model by weighted regression combination;
(5) the integrated regression algorithm in step (4) is applied to test set, detect the accuracy of the model prediction relation drawn based on training set.
2. the method adopting integrated regression system to detect LTE network performance as claimed in claim 1, it is characterized in that, the reception index receiving achievement data in described step (1) comprises Reference Signal Received Power RSRP, Reference Signal Received Quality RSRQ, Signal to Interference plus Noise Ratio SINR, the Physical Resource Block PUSCH-PRB on Physical Uplink Shared Channel and the Physical Resource Block PDSCH-PRB on Physical Downlink Shared Channel.
3. the method adopting integrated regression system to detect LTE network performance as claimed in claim 2, it is characterized in that, the regression algorithm that employing eight kinds is different in step (2) is predicted the sample in training set respectively in particular subset, linear regression respectively, second order polynomial regression, three rank polynomial regressions, ridge regression, LASSO returns, and Elastic returns, and GAM returns and MARS returns;
Wherein, described linear regression provides following formula (1):
E (y)=β 0+ β 1x 1+ ...+β dx d(1), in formula, E (y) represents predicted value, and y represents hypothesis and to eat dishes without rice or wine time delay, x 1..., x drepresentative receives index; In this model, response variable y Gaussian distributed, uses least square method directly to calculate and obtains corresponding fitting coefficient β o..., β d;
Described second order polynomial regression algorithm first calculate each index set once, Quadratic Orthogonal multinomial, thus obtain 2D form, carry out models fitting with 2D+1 item parameter;
Described three rank polynomial regression algorithm, when the choosing of variable, the number of times of orthogonal polynomial is chosen from 1 time to 3 times, thus decreases the constraint of forecast model;
Described ridge regression algorithm provides following formula (2):
in formula, algorithm is by increasing penalty coefficient, to factor beta 0..., β dlimit, become contraction, thus find the minimum variance of least squares estimator, the theory that wherein parametric t proposes based on control forecasting value variance according to E.Cule and M.DeIorio is chosen automatically, k represents a kth coefficient, the scope of k from 1,2,3 ..., d;
Described LASSO regression algorithm provides following formula (3):
constraint function in formula limits the absolute value sum of regression coefficient, removing constant coefficient, and t value is chosen automatically;
Described Elastic regression algorithm provides following formula (4):
αΣ k - 1 a β k 2 + ( 1 - α ) Σ k - 1 d | β k | ≤ t - - - ( 4 ) , Constraint function in formula is the linear combination constraint function in described ridge regression algorithm and described LASSO regression algorithm being carried out to regularization, and wherein, α is that 1/2, t value is chosen automatically;
Described GAM regression algorithm provides following formula (5):
G (E (y))=β 0+ f 1(x 1)+...+f d(x d) (5), in formula, g represents that generalized linear contacts
Function, f 1..., f drepresent the non-linear relationship between input variable, β 0for constant term, x 1..., x dreceive from five the data obtained index;
Described MARS regression algorithm provides following formula (6):
X → max (0, x-C) orx → max (0, C-x) C ∈ R (6), in MARS returns, returns the linear combination being synthesized to a hinge function.
4. the method adopting integrated regression system to detect LTE network performance as claimed in claim 3, it is characterized in that, in described step (2), the calculating on training set is all completed by successive Regression, by finding suitable weight.
5. the method adopting integrated regression system to detect LTE network performance as claimed in claim 4, it is characterized in that, in step (4), utilize the row error obtained in step (3) to infer weight, utilize the method for weight optimization model to give following formula (7):
wherein w 1... w 2drawn by minimizing Weighted least square method and the total weight calculation of restriction.
6. the method adopting integrated regression system to detect LTE network performance as claimed in claim 5, is characterized in that, infer in the process of weight, give following formula (8) for constraints in described step (4):
Σ j = 1 8 w j = 1 - - - ( 8 ) .
7. the method adopting integrated regression system to detect LTE network performance as claimed in claim 5, it is characterized in that, in described step (5), the predicted value obtained in test set and known time delay value of truly eating dishes without rice or wine are compared, and gives formula (9) and draw error rate ε test:
ϵ t e s t = 1 n t e s t Σ i = 1 n t e s t | y ^ i - y i | - - - ( 9 ) , In order to confirm stability and the accuracy of the integrated regressive prediction model obtained in described step (4), needing to carry out various different comparison, first, the identical error rate being performed on training set, obtaining an error rate ε trainif, ε testand ε treinbetween difference less, then show not occur over-fitting, this means the stability of integrated regressive prediction model; Secondly, test set calculates the predicted value of often kind of regression algorithm, makes it respectively obtain an error rate, by the ε these error rates and integrated recurrence obtained testcompare, then can check out the accuracy of integrated regressive prediction model.
CN201510947246.4A 2015-12-17 2015-12-17 Using the method for integrated regression system detection LTE network performance Active CN105517019B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510947246.4A CN105517019B (en) 2015-12-17 2015-12-17 Using the method for integrated regression system detection LTE network performance

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510947246.4A CN105517019B (en) 2015-12-17 2015-12-17 Using the method for integrated regression system detection LTE network performance

Publications (2)

Publication Number Publication Date
CN105517019A true CN105517019A (en) 2016-04-20
CN105517019B CN105517019B (en) 2019-03-08

Family

ID=55724582

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510947246.4A Active CN105517019B (en) 2015-12-17 2015-12-17 Using the method for integrated regression system detection LTE network performance

Country Status (1)

Country Link
CN (1) CN105517019B (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106604290A (en) * 2016-12-19 2017-04-26 南京华苏科技有限公司 Method for user perception and evaluation of wireless network performance based on webpage browsing
CN108803348A (en) * 2018-08-03 2018-11-13 北京深度奇点科技有限公司 A kind of optimization method of pid parameter and the optimization device of pid parameter
CN109284320A (en) * 2018-08-15 2019-01-29 上海明析数据科技有限公司 Automatic returning diagnostic method in big data platform
CN109948262A (en) * 2019-03-22 2019-06-28 清华大学 A kind of semiconductor devices modeling method and system using rational fraction regression model
CN110969370A (en) * 2020-01-14 2020-04-07 深圳市建筑科学研究院股份有限公司 Quality risk analysis method for building structural member
CN111338304A (en) * 2020-03-02 2020-06-26 顺忠宝智能科技(深圳)有限公司 Method for real-time prediction and information communication of production line yield by applying artificial intelligence cloud computing
CN112131706A (en) * 2020-08-21 2020-12-25 上海大学 Method for rapidly predicting melting point of low-melting-point alloy through ridge regression
CN113382477A (en) * 2021-05-14 2021-09-10 北京邮电大学 Method for modeling uplink interference between wireless network users
EP4037363A1 (en) * 2017-12-22 2022-08-03 AirTies Belgium SPRL A method for identifying at least a wireless link causing interferences on other wireless links

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101267362A (en) * 2008-05-16 2008-09-17 亿阳信通股份有限公司 A dynamic identification method and its device for normal fluctuation range of performance normal value
CN102904603A (en) * 2011-07-28 2013-01-30 中兴通讯股份有限公司 Searching window processing method and device
WO2013056435A1 (en) * 2011-10-19 2013-04-25 Telefonaktiebolaget L M Ericsson (Publ) Method and apparatus for channel predicting
CN103093095A (en) * 2013-01-14 2013-05-08 湖州师范学院 Software failure time forecasting method based on kernel principle component regression algorithm
CN103514369A (en) * 2013-09-18 2014-01-15 上海交通大学 Regression analysis system and method based on active learning
CN105050125A (en) * 2015-06-23 2015-11-11 武汉虹信通信技术有限责任公司 Method and device for evaluating mobile data service quality oriented to user experience

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101267362A (en) * 2008-05-16 2008-09-17 亿阳信通股份有限公司 A dynamic identification method and its device for normal fluctuation range of performance normal value
CN102904603A (en) * 2011-07-28 2013-01-30 中兴通讯股份有限公司 Searching window processing method and device
WO2013056435A1 (en) * 2011-10-19 2013-04-25 Telefonaktiebolaget L M Ericsson (Publ) Method and apparatus for channel predicting
CN103093095A (en) * 2013-01-14 2013-05-08 湖州师范学院 Software failure time forecasting method based on kernel principle component regression algorithm
CN103514369A (en) * 2013-09-18 2014-01-15 上海交通大学 Regression analysis system and method based on active learning
CN105050125A (en) * 2015-06-23 2015-11-11 武汉虹信通信技术有限责任公司 Method and device for evaluating mobile data service quality oriented to user experience

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
闵佳: "基于数据挖掘的移动网络优化与运营技术研究", 《中国优秀硕士学位论文全文数据库》 *

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106604290B (en) * 2016-12-19 2020-02-14 南京华苏科技有限公司 User perception evaluation wireless network performance method based on web browsing
CN106604290A (en) * 2016-12-19 2017-04-26 南京华苏科技有限公司 Method for user perception and evaluation of wireless network performance based on webpage browsing
EP4037363A1 (en) * 2017-12-22 2022-08-03 AirTies Belgium SPRL A method for identifying at least a wireless link causing interferences on other wireless links
CN108803348A (en) * 2018-08-03 2018-11-13 北京深度奇点科技有限公司 A kind of optimization method of pid parameter and the optimization device of pid parameter
CN108803348B (en) * 2018-08-03 2021-07-13 北京深度奇点科技有限公司 PID parameter optimization method and PID parameter optimization device
CN109284320B (en) * 2018-08-15 2021-10-26 上海派拉软件股份有限公司 Automatic regression diagnosis method on big data platform
CN109284320A (en) * 2018-08-15 2019-01-29 上海明析数据科技有限公司 Automatic returning diagnostic method in big data platform
CN109948262A (en) * 2019-03-22 2019-06-28 清华大学 A kind of semiconductor devices modeling method and system using rational fraction regression model
CN110969370A (en) * 2020-01-14 2020-04-07 深圳市建筑科学研究院股份有限公司 Quality risk analysis method for building structural member
CN110969370B (en) * 2020-01-14 2023-05-02 深圳市建筑科学研究院股份有限公司 Quality risk analysis method for building structural member
CN111338304A (en) * 2020-03-02 2020-06-26 顺忠宝智能科技(深圳)有限公司 Method for real-time prediction and information communication of production line yield by applying artificial intelligence cloud computing
CN112131706A (en) * 2020-08-21 2020-12-25 上海大学 Method for rapidly predicting melting point of low-melting-point alloy through ridge regression
CN113382477A (en) * 2021-05-14 2021-09-10 北京邮电大学 Method for modeling uplink interference between wireless network users

Also Published As

Publication number Publication date
CN105517019B (en) 2019-03-08

Similar Documents

Publication Publication Date Title
CN105517019A (en) Method for detecting LTE (Long Term Evolution) network performance by using integrated regression system
Bisio et al. Smart probabilistic fingerprinting for WiFi-based indoor positioning with mobile devices
US10374902B2 (en) Method for clustering wireless channel MPCs based on a KPD doctrine
CN103874118B (en) Radio Map bearing calibrations in WiFi indoor positionings based on Bayesian regression
CN109490826B (en) Ranging and position positioning method based on radio wave field intensity RSSI
CN111680870B (en) Comprehensive evaluation method for quality of target motion trail
US11762396B2 (en) Positioning system and positioning method based on WI-FI fingerprints
CN107886160B (en) BP neural network interval water demand prediction method
CN113837361A (en) Air pollutant concentration prediction method and system
CN106162559A (en) For estimating equipment and the method for position in a wireless communication system
CN111817805B (en) Method, device and medium for adjusting channel propagation model parameters
CN114116828A (en) Association rule analysis method, device and storage medium for multidimensional network index
CN103945531A (en) Method for WLAN indoor positioning Radio Map updating based on information entropy
CN103916969A (en) Combined authorized user perception and link state estimation method and device
CN105187139B (en) A kind of outdoor radio signal reception strength map constructing method based on intelligent perception
CN112911502B (en) Base station longitude and latitude estimation method based on MDT technology and RSRP ranging
CN106779354B (en) Bayes data fusion evaluation method for aircraft performance evaluation
Wang et al. A novel indoor ranging method using weighted altofrequent RSSI measurements
CN104635206B (en) A kind of method and device of wireless location
US20200178161A1 (en) Small cell identification using machine learning
EP4270824A1 (en) Channel estimation method based on neural network and communication apparatus
CN115099385A (en) Spectrum map construction method based on sensor layout optimization and adaptive Kriging model
CN111628837B (en) Channel modeling method and device
Liu et al. Prediction of wireless network connectivity using a Taylor Kriging approach
WO2023061303A1 (en) Large-scale fading modeling and estimation method, system, and device, and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
CB02 Change of applicant information

Address after: 210000 No. 6 Longjing Road, Chunxi Town, Gaochun District, Nanjing City, Jiangsu Province

Applicant after: Nanjing Hua Su Science and Technology Ltd.

Address before: 210000 No. 6 Longjing Road, Chunxi Town, Gaochun District, Nanjing City, Jiangsu Province

Applicant before: Nanjing Hua Su Science and Technology Co., Ltd.

GR01 Patent grant
GR01 Patent grant