CN103093094A - Software failure time forecasting method based on kernel partial least squares regression algorithm - Google Patents
Software failure time forecasting method based on kernel partial least squares regression algorithm Download PDFInfo
- Publication number
- CN103093094A CN103093094A CN2013100130053A CN201310013005A CN103093094A CN 103093094 A CN103093094 A CN 103093094A CN 2013100130053 A CN2013100130053 A CN 2013100130053A CN 201310013005 A CN201310013005 A CN 201310013005A CN 103093094 A CN103093094 A CN 103093094A
- Authority
- CN
- China
- Prior art keywords
- kernel
- software failure
- regression algorithm
- software
- failure time
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Landscapes
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a software failure time forecasting method based on a kernel partial least squares regression algorithm. Through the application of a kernel function technology, the problem of software reliability forecast is converted to the problem of recession estimation, and the kernel partial least squares regression algorithm is used for resolving the problem of the software reliability forecast. Through fully consideration of a small sample property of the software reliability forecast, the situations that the size of observational variables is bigger than that of observational samples and multicollinearity exists among the variables can be overcome by using the kernel function technology, and so that a model 'overfitting' situation arises in modeling approaches such as a neural network does not occur. By means of the software failure time forecasting method based on the kernel partial least squares regression algorithm, model parameters are automatically and continuously adjusted to fit the dynamic change in a failure process, therefore adaptive forecasting of the software reliability is achieved, and the adaptive capability of a software failure forecasting model is improved effectively.
Description
[technical field]
The present invention relates in software reliability test and evaluation process next time or software failure time data Forecasting Methodology in the following long period.
[background technology]
Software reliability refers under rated condition, at the appointed time, and the probability that software did not lose efficacy.The stochastic process reliability model is that software reliability growth model area research is maximum, a most widely used class, but the statistics composition of achieved reliability problem can not only be described with classical statistical distribution functions, and the random process model hypothesis that need to make many priori to attribute and the software failure process of software fault, this causes, and each model shows great precision of prediction difference in different projects, and namely the applicability of model is relatively poor.
Specially for prediction and the classification problem of small sample data, obtain extraordinary result in a lot of similar reliability predictions fields based on the kernel function theory and method, be fit to this challenge of software reliability prediction.By means of computer technology, this class model has adaptive ability and learning functionality, performance is preferably all arranged on model applicability and evaluation prediction ability, the superperformance that shows in the finite sample situation based on the software reliability model of kernel function theory, can solve to a great extent the problems such as study excessively of neural network, become a breach of outbalance in present Research of reliability model.
[summary of the invention]
Technical matters to be solved by this invention is to provide a kind of software failure time forecasting methods based on the Kernel partial least squares regression algorithm, realizes the adaptive prediction of software reliability, effectively improves the adaptive faculty of software faults prediction model.For this reason, the present invention by the following technical solutions, it comprises following steps:
(1), at first observe and record order software failure data collection, and all inputoutput data normalization;
(2), by abstraction and hypothesis, software failure time prediction problem is converted into a function regression problem;
(3), select to be used for the kernel function of prediction and the initialization value of given parameters;
(4), select to be used for the fail data number of study;
(5), adopt the Kernel partial least squares regression algorithm to learn to optimize for different failure dates sets
(6), select at last the parameter after optimization that the new out-of-service time is predicted.
Further, step (2) is described is converted into a function regression problem to software failure time prediction problem, adopts following methods:
Suppose that the software failure time that has occured is t
1, t
2, L, t
n, make t
l=f (t
l-m, t
L-m+1, L, t
l-1), t
lObey fixing but unknown conditional distribution function F (t
lt
l-m, t
L-m+1, L, t
l-1), at t
1, t
2, L, t
kUnder known conditions to t
k+1Predict and become: known k-m observation (T
1, t
m+1), (T
2, t
m+2), L, (T
k-m, t
k) and k-m+1 input T
K-m+1Situation under, estimate k-m+1 output valve
Wherein, T
iExpression m dimensional vector [t
i, t
i+1, L, t
m+i].
The kernel function of using in step (3) is gaussian kernel function,
Its initial parameter value g=1.
Fail data number in step (4) is the integer between 5-8.
Step (5) adopts the Kernel partial least squares regression algorithm to learn to optimize for different failure dates sets, comprises following process:
Step 1, the input data are k dimensional vector X={x
1, x
2, L, x
l, be output as vectorial y
s, s=1,2, L, m
Step 2 builds kernel matrix: K
ij=k (x
i, x
j) i, j=1,2, L, l, wherein
K
j+1=(I-τ
jτ′
j/||τ
j||
2)K
j(I-τ
jτ′
j/||τ
j||
2)
Step 6 is calculated B=[β
1, L, β
k] T=[τ
1, L, τ
k], obtain factor alpha=B (T ' KB)
-1T ' Y
The present invention takes into full account the small sample characteristic of software failure data, the kernel function theory as a kind of Main Means and method, the dynamic law that presents in conjunction with the software failure process, the software reliability prediction problem is converted into a regression estimation problem, and uses the Kernel partial least squares regression algorithm and solve this problem.
The present invention utilizes the covariance information between the input and output variable to extract the potential feature of data, can overcome observational variable more than the multicollinearity that exists between the situation of observation sample number and variable, model " over-fitting " situation that the modeling method such as neural network produces therefore can not occur.In new Forecasting Methodology, along with software failure constantly occurs, model parameter will constantly adjust to adapt to the dynamic change of failure procedure automatically, thereby realize the adaptive prediction of software reliability, effectively improve the adaptive faculty of software faults prediction model.
[description of drawings]
Fig. 1 is the process flow diagram of software failure time forecasting methods of the present invention.
[embodiment]
1) data normalization
When using regression estimation algorithm to learn to predict, at first need all inputoutput datas are normalized to interval [0.1,0.9], specifically transform formula and be:
Wherein, y is the value after normalization, and x is actual value, x
maxThe maximal value of data centralization, x
minMinimum value, Δ=x
max-x
min, after prediction finishes, adopt following mapping that data-mapping is got back to actual value:
2) problem transforms
In the prediction model of software reliability based on the kernel function theory, software failure time data and the relation that occurs between m out-of-service time data before it are carried out modeling, the Single-step Prediction problem can be converted into: observe (T for known k-m
1, t
m+1), (T
2, t
m+2), L, (T
k-m, t
k) and k-m+1 input T
K-m+1Situation under, estimate k-m+1 output valve
T wherein
iExpression m dimensional vector [t
i, t
i+1, L, t
m+i], same,
As input, can predict
In like manner can predict and obtain
3) kernel function of selecting, the initialization value of parameter
4) value of definite kernel function parameter
Kernel functional parameter is selected problem, and its essence is exactly an optimization problem, adopts the grid search method to carry out kernel functional parameter and selects, such as when predicting with SVM, adopt gaussian kernel function, need to determine that two parameters be penalty factor and kernel functional parameter g, based on gridding method with C ∈ [C
1, C
2], change step is C
s, and g ∈ [g
1, g
2], change step is g
t, train for every pair of parameter (C, g), choose the best a pair of parameter of effect as model parameter.
5) Kernel partial least squares regression algorithm
The kernel function regression problem is found the solution and can be described as: given a group vector
With corresponding desired value
As input, want to find out x
iWith t
iBetween corresponding relation, make running into a new vector x
*The time, can dope its corresponding desired value t
*, t
iIt is any real number.The corresponding relation of supposing x and t meets following function:
Wherein, k (x, x
i) be kernel function, the purpose of kernel function regression estimation algorithm is to find suitable w
iAlgorithm is as follows:
Step 1, the input data are k dimensional vector X={x
1, x
2, L, x
l, be output as vectorial y
s, s=1,2, L, m
Step 2 builds kernel matrix: K
ij=k (x
i, x
j) i, j=1,2, L, l, wherein
K
j+1=(I-τ
jτ′
j/||τ
j||
2)K
j(I-τ
jτ′
j/||τ
j||
2)
In order to provide rational comparison and analysis to the model of setting up, adopt 10 models that propose from the true fail data set pair of dissimilar software to carry out experimental analysis, as shown in table 1.These data sets have been described the failure procedure of each software systems, and each data point comprises the set of two kinds of observation statistics: accumulative total execution time and accumulative total Failure count.In experiment, training set comprises from the rear complete thrashing process of test beginning, in order to allow kernel function learn fully, in experimentation, get all data sets first three/one as learning data, after back 2/3rds data are predicted and True Data compare.
Listed the AE value of each model on ten data sets in table, wherein model 1-6 represents respectively SRGM With Logistic TEF, SRGM With Rayleigh TEF, Delayed S-Shaped Model With Logistic TEF, Delayed S-Shaped Model With Rayleigh TEF, G-O model, Yamada Delayed S-Shaped; Model 7 represents the method that the present invention adopts, and the kernel function that a, b, c, d representative are adopted is respectively Gaussian Function, Linear Function, Polynomial Function, Symmetric Triangle Function.
Show the AE value of each model prediction on 1:10 data set
Conclusion: on the different pieces of information collection, when adopting different kernel functions and adopting different regression estimation methods, the model prediction performance is all variant, adopts estimated performance and the applicability that can effectively improve model based on the prediction model of software reliability of Kernel partial least squares regression algorithm.
Above-described embodiment is to explanation of the present invention, is not limitation of the invention, any scheme after simple transformation of the present invention is all belonged to protection scope of the present invention.
Claims (4)
1. based on the software failure time forecasting methods of Kernel partial least squares regression algorithm, it is characterized in that, it comprises following steps:
(1), at first observe and record order software failure data collection, and all inputoutput data normalization;
(2), by abstraction and hypothesis, software failure time prediction problem is converted into a function regression problem;
(3), select to be used for the kernel function of prediction and the initialization value of given parameters;
(4), select to be used for the fail data number of study;
(5), adopt the Kernel partial least squares regression algorithm to learn to optimize for different failure dates sets
(6), select at last the parameter after optimization that the new out-of-service time is predicted.
2. the software failure time forecasting methods based on the Kernel partial least squares regression algorithm as claimed in claim 1, is characterized in that, step (2) is described is converted into a function regression problem to software failure time prediction problem, adopts following methods:
Suppose that the software failure time that has occured is t
1, t
2, L, t
n, make t
l=f (t
l-m, t
L-m+1, L, t
l-1), t
lObey fixing but unknown conditional distribution function F (t
l| t
l-m, t
L-m+1, L, t
l-1), at t
1, t
2, L, t
kUnder known conditions to t
k+1Predict and become: known k-m observation (T
1, t
m+1), (T
2, t
m+2), L, (T
k-m, t
k) and k-m+1 input T
K-m+1Situation under, estimate k-m+1 output valve
Wherein, T
iExpression m dimensional vector [t
i, t
i+1, L, t
m+i].
3. the software failure time forecasting methods based on the core principle component regression algorithm as claimed in claim 1, is characterized in that, the kernel function of using in step (3) is gaussian kernel function,
Its initial parameter value g=1.Fail data number in step (4) is the integer between 5-8.
4. the software failure time forecasting methods based on the Kernel partial least squares regression algorithm as claimed in claim 1, is characterized in that, step (5) adopts the Kernel partial least squares regression algorithm to learn to optimize for different failure dates sets, comprises following process:
Step 1, the input data are k dimensional vector X={x
1, x
2, L, x
l, be output as vectorial y
s, s=1,2, L, m
Step 2 builds kernel matrix: K
ij=k (x
i, x
j) i, j=1,2, L, l, wherein
K
j+1=(I-τ
jτ′
j/||τ
j||
2)K
j(I-τ
jτ′
j/||τ
j||
2)
Step 6 is calculated B=[β
1, L, β
k] T=[τ
1, L, τ
k], obtain factor alpha=B (T ' KB)
-1T ' Y.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2013100130053A CN103093094A (en) | 2013-01-14 | 2013-01-14 | Software failure time forecasting method based on kernel partial least squares regression algorithm |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2013100130053A CN103093094A (en) | 2013-01-14 | 2013-01-14 | Software failure time forecasting method based on kernel partial least squares regression algorithm |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103093094A true CN103093094A (en) | 2013-05-08 |
Family
ID=48205653
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2013100130053A Pending CN103093094A (en) | 2013-01-14 | 2013-01-14 | Software failure time forecasting method based on kernel partial least squares regression algorithm |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103093094A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105260304A (en) * | 2015-10-19 | 2016-01-20 | 湖州师范学院 | /Software reliability prediction method based on QBGSA RVR (Quantum-inspired Binary Gravitational Search Algorithm-Relevance Vector Machine) |
CN107947984A (en) * | 2017-11-24 | 2018-04-20 | 浙江网新电气技术有限公司 | A kind of failure predication processing method and its system towards railway transport of passengers service |
CN108267951A (en) * | 2016-12-30 | 2018-07-10 | 南京理工大学 | A kind of maximum power point-tracing control method based on core offset minimum binary |
-
2013
- 2013-01-14 CN CN2013100130053A patent/CN103093094A/en active Pending
Non-Patent Citations (2)
Title |
---|
楼俊钢 等: "《软件可靠性预测的核函数方法》", 《计算机科学》 * |
蒋红卫 等: "《核偏最小二乘回归及其在医学中的应用》", 《中国卫生统计》 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105260304A (en) * | 2015-10-19 | 2016-01-20 | 湖州师范学院 | /Software reliability prediction method based on QBGSA RVR (Quantum-inspired Binary Gravitational Search Algorithm-Relevance Vector Machine) |
CN105260304B (en) * | 2015-10-19 | 2018-03-23 | 湖州师范学院 | A kind of software reliability prediction method based on QBGSA RVR |
CN108267951A (en) * | 2016-12-30 | 2018-07-10 | 南京理工大学 | A kind of maximum power point-tracing control method based on core offset minimum binary |
CN107947984A (en) * | 2017-11-24 | 2018-04-20 | 浙江网新电气技术有限公司 | A kind of failure predication processing method and its system towards railway transport of passengers service |
CN107947984B (en) * | 2017-11-24 | 2021-08-03 | 浙江网新电气技术有限公司 | Fault prediction processing method and system for railway passenger service |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10929529B2 (en) | Cyber physical attack detection | |
Wang et al. | A compound framework for wind speed forecasting based on comprehensive feature selection, quantile regression incorporated into convolutional simplified long short-term memory network and residual error correction | |
Hosseini-Moghari et al. | Monthly and seasonal drought forecasting using statistical neural networks | |
Liu et al. | A two-stage method of quantitative flood risk analysis for reservoir real-time operation using ensemble-based hydrologic forecasts | |
CN112598248B (en) | Load prediction method, load prediction device, computer equipment and storage medium | |
CN103886218A (en) | Lake and reservoir algal bloom predicating method based on multielement nonstationary time series analysis and neural network and support vector machine compensation | |
CN103093095A (en) | Software failure time forecasting method based on kernel principle component regression algorithm | |
Liu et al. | Sparse-gev: Sparse latent space model for multivariate extreme value time serie modeling | |
Nagahamulla et al. | An ensemble of artificial neural networks in rainfall forecasting | |
Krishna | An integrated approach for weather forecasting based on data mining and forecasting analysis | |
CN104699979B (en) | Urban lake storehouse algal bloom Study on prediction technology of chaotic series based on complex network | |
CN107480344A (en) | A kind of series stress-strength system reliability self-adaptive estimation method | |
CN103093094A (en) | Software failure time forecasting method based on kernel partial least squares regression algorithm | |
Robati et al. | Inflation rate modeling: adaptive neuro-fuzzy inference system approach and particle swarm optimization algorithm (ANFIS-PSO) | |
Wang et al. | An improved weighted naive bayesian classification algorithm based on multivariable linear regression model | |
Kajornrit et al. | Estimation of missing precipitation records using modular artificial neural networks | |
de Castro et al. | The generation of synthetic inflows via bootstrap to increase the energy efficiency of long-term hydrothermal dispatches | |
CN106772354A (en) | Method for tracking target and device based on parallel fuzzy gaussian sum particle filter | |
CN103106139B (en) | Based on the software failure time forecasting methods that relevance vector regression is estimated | |
Yan et al. | A stochastic tropical cyclone model for the northwestern Pacific Ocean with improved track and intensity representations | |
Yusop et al. | Disaggregation of daily rainfall data using Bartlett Lewis Rectangular Pulse model: a case study in central Peninsular Malaysia | |
Wang et al. | Comparative study among three different artificial neural networks to infectious diarrhea forecasting | |
Deb et al. | A correlation based imputation method for incomplete traffic accident data | |
Shi et al. | Remaining useful life prediction based on modified relevance vector regression algorithm | |
Mares et al. | Stochastic modeling of the connection between sea level pressure and discharge in the Danube lower basin by means of Hidden Markov Model |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20130508 |