KR20200107392A

KR20200107392A - A method of predicting software failure time based on nonlinear regression model and a computer readable medium thereof

Info

Publication number: KR20200107392A
Application number: KR1020190026470A
Authority: KR
Inventors: 양태진
Original assignee: 남서울대학교 산학협력단
Priority date: 2019-03-07
Filing date: 2019-03-07
Publication date: 2020-09-16
Also published as: KR102172120B1

Abstract

According to the present invention, disclosed are a method of predicting a software failure time based on a nonlinear regression model, and a computer-readable recording medium. The method includes the steps of: allowing a trend certification part to predict a software failure time by using a nonlinear regression model; and allowing a failure time analysis part to determine an optimal parameter statistic model for the predicted software failure time by using a determination coefficient and a mean square error. According to the present invention, in fields of software system design and software reliability, software developers and managers can accurately predict and consider an increase in necessary information for data censorship, thereby quickly and accurately analyzing and predicting a software failure time, which can lead to an improvement in software stability.

Description

A method of predicting software failure time based on nonlinear regression model and a computer readable medium thereof

본 발명은 소프트웨어 고장 시간의 예측 방법에 관한 것으로, 특히 비선형 회귀 모형을 사용하여 소프트웨어 고장 시간을 검토하고, 결정 계수가 높고 평균 제곱 오차가 낮은 모형을 결정하여 장래의 고장 시간을 정확하게 예측할 수 있는 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 방법 및 컴퓨터 판독 가능 기록 매체에 관한 것이다.The present invention relates to a method of predicting software failure time, and in particular, a nonlinear regression model that can accurately predict future failure time by examining software failure time using a nonlinear regression model, and determining a model with a high coefficient of determination and a low mean square error. A method for predicting software failure time based on a regression model and a computer-readable recording medium.

일반적으로 수리 가능 시스템은 고장이 발생하였을 때, 고장이 난 부분의 수리나 시스템 전체의 교체를 통하여 시스템을 운용 가능하게 하는 것을 말한다.In general, a repairable system means that when a failure occurs, the system can be operated by repairing the defective part or replacing the entire system.

오늘날 대부분의 많은 시스템들은 복잡한 수리 가능 시스템으로 이루어져 있어 분석하기 어렵다. Most of today's systems consist of complex repairable systems that are difficult to analyze.

하지만, 체계적인 분석 방법을 도입하여 시스템 신뢰도를 정확하게 예측할 수 있다면 효율적인 정비 정책(Maintenance policy)을 수립하는데 도움이 될 수 있을 것이다. However, if the system reliability can be accurately predicted by introducing a systematic analysis method, it will be helpful to establish an efficient maintenance policy.

수리 가능 시스템에서의 주 관심사는 고장 패턴을 감지하는 것이며, 고장 간격 시간(Inter-failures time)이 어떤 패턴에 따라 변할 때 일정한 추세(trend)를 가진다고 한다. The main concern in repairable systems is to detect failure patterns, and is said to have a constant trend when the inter-failures time changes according to a certain pattern.

만약, 단조 추세로서 고장 간격 시간이 점점 길어진다면 이는 신뢰성이 개선됨을 의미하고, 반대로 고장 간격 시간이 점점 짧아진다면 이는 신뢰성이 악화됨을 의미한다. If, as a monotonic trend, the failure interval time becomes longer, it means that reliability is improved. Conversely, if the failure interval time becomes shorter, it means that reliability is deteriorated.

또한, 고장 간격 시간이 짧아졌다 길어지는 주기성을 보이거나, 고장 간격 시간이 짧아지다 일정하다 길어지는 욕조 곡선(Bath-tub curve) 모양의 추세를 보이는 경우도 있다. In addition, there are cases in which the periodicity of the failure interval time becomes shorter and longer, or the bath-tub curve shape becomes constant and longer after the failure interval time is shorter.

고장의 패턴을 분석할 때 실질적으로 쓰이는 방법으로는 도시적(Graphical)인 방법과 통계적 검정(Statistical test) 방법이 있다.There are two methods that are practically used when analyzing failure patterns, a graphical method and a statistical test method.

도시적 방법은 관찰된 고장 데이터를 그림으로 보여줌으로써 데이터에 대한 전반적인 추세를 파악할 수 있도록 해준다. Illustrative methods allow you to identify overall trends in the data by graphically showing observed failure data.

예를 들면, 고장 발생 시간에 따른 누적 고장 개수를 도시하는 것은 간단하면서도 매우 유용한 정보를 제공해 준다. For example, showing the cumulative number of failures according to the failure occurrence time provides simple and very useful information.

이때, 관찰 시간 동안의 도시된 모양이 선형을 나타내면 고장률이 일정하다는 것을 의미한다. At this time, if the shape shown during the observation time is linear, it means that the failure rate is constant.

고장 간격 시간 데이터가 각각 독립이고 지수 분포의 형태를 나타내면 재생 과정(Renewal process) 또는 동질 포아송 과정(Homogeneous Poisson Process)이 적합함을 의미한다. If the failure interval time data are independent and show the form of an exponential distribution, it means that the Renewal process or the Homogeneous Poisson Process is suitable.

동질 포아송 과정은 수리가능 시스템에서의 고장 발생을 설명할 수 있는 가장 간단한 통계적 모형이며, 최소 수리(minimal repair)를 가정한다. The homogeneous Poisson process is the simplest statistical model that can explain the occurrence of failures in a repairable system, and assumes minimal repair.

여기에서, 최소 수리란 시스템의 고장 발생시 시스템을 고장 발생 이전의 상태로 수리하는 것을 의미한다. Here, the minimum repair means repairing the system to the state before the failure when a system failure occurs.

동질 포아송 과정인지 아닌지를 구분하는 간단한 방법으로는 고장 발생 시간 마다의 누적 고장률을 도시하는 방법과 Duane 도시(로그 변환한 고장 시간과 로그 변환한 누적 고장 비율에 대한 도시)가 있다. Simple methods of distinguishing whether or not a homogeneous Poisson process is performed include a method showing the cumulative failure rate for each failure occurrence time and a Duane plot (a diagram of log-converted failure time and log-converted accumulated failure rate).

두 방법 모두 도시한 점들이 선형성을 띠면 동질 포아송 과정이 적합한 모델이라고 판단할 수 있다. If the points shown in both methods have linearity, it can be determined that the homogeneous Poisson process is a suitable model.

그러나, 동질 포아송 과정으로는 수리 가능 시스템의 전체 수명 주기 중 초기 고장 기간과 신뢰성이 악화되는 마모 고장 기간을 설명할 수 없으므로, 시스템의 전체 수명 주기는 비동질 포아송 과정(Nonhomogeneous Poisson Process)을 이용하는 것이 일반적이다.However, since the homogeneous Poisson process cannot explain the initial failure period of the entire life cycle of the repairable system and the wear failure period that deteriorates reliability, the entire life cycle of the system is recommended to use the Nonhomogeneous Poisson Process. It is common.

여기에서, 비동질 포아송 과정이란 일정한 고장률을 가지고 있는 동질 포아송 과정과 달리, 고장률이 일정하지 않아서 일정한 추세(trend)를 가지고 있는 모수 통계 모형(parametric statistical model)을 의미한다.Here, the non-homogeneous Poisson process refers to a parametric statistical model that has a constant trend because the failure rate is not constant, unlike a homogeneous Poisson process that has a constant failure rate.

한편, 각종 소프트웨어의 고장 발생률은 일정하거나, 단조 증가 또는 단조 감소하는 패턴을 가진다. On the other hand, the incidence rate of failure of various software is constant or has a monotonically increasing or monotonically decreasing pattern.

소프트웨어 신뢰성 모델의 데이터 분석을 위해 최근에 다양한 추세 분석이 개발되고 있다. Recently, various trend analysis has been developed for data analysis of the software reliability model.

추세 분석의 방법으로는 산술 평균 테스트 및 라플라스 추세 검정이 있다.Methods of trend analysis include arithmetic mean test and Laplace trend test.

그런데, 소프트웨어 고장으로 인한 컴퓨터 시스템 오류는 우리 사회에 엄청난 손실을 초래할 수 있다. However, computer system errors due to software failure can cause enormous losses in our society.

따라서, 소프트웨어 안정성은 소프트웨어 개발 프로세스에서 중요한 문제로서, 사용자의 요구 사항 및 테스트 비용을 충족시켜야만 한다. Therefore, software stability is an important issue in the software development process, and it must meet the user's requirements and test costs.

소프트웨어의 가변성 및 신뢰성의 비용을 줄이기 위해서는 능률적으로 소프트웨어 테스트(디버깅) 비용을 미리 알아야 한다. To reduce the cost of software variability and reliability, it is necessary to know in advance the cost of software testing (debugging) efficiently.

그러므로, 소프트웨어 신뢰성 비용 및 고장 시간을 고려한 소프트웨어 개발 프로세스가 필수적이며, 소프트웨어 제품의 결함 내용을 평가하여 품질을 향상시킬 수 있는 모형을 개발할 필요가 있다. Therefore, a software development process that considers software reliability cost and failure time is essential, and it is necessary to develop a model that can improve quality by evaluating the defect contents of software products.

지금까지 많은 소프트웨어 안정성 모형이 제안되었으며, 소프트웨어 고장 현상을 설명하기 위한 이러한 모형 중 일부는 비동질 포아송 과정을 기반으로 한다. So far, many software stability models have been proposed, and some of these models to explain software failure phenomena are based on non-homogeneous Poisson processes.

사실, 이러한 모형은 시간에 따른 오류 감지 속도로 오류 감지 프로세스를 취소하는 데 상당히 효과적이지만, 오류가 발생할 때마다 오류가 즉시 제거되어 새로운 문제가 발생하지 않는다고 가정하고 있으며, 이 문제는 일반적으로 완벽한 디버깅(Debugging)이라고 한다. In fact, these models are quite effective in canceling the error detection process at the rate of error detection over time, but they assume that whenever an error occurs, the error is immediately removed so no new problem arises, and this problem is usually perfect for debugging. It is called (Debugging).

그런데, 종래에 소프트웨어 신뢰성 모델의 데이터 분석을 위해 이미 개발된 추세 분석 방식들은 개요 내용에 대한 정보만 제공할 수 있을 뿐, 중도의 절단 고장 시간과 예상 고장 시간을 분석하지 못하는 한계를 가진다. However, the trend analysis methods already developed for data analysis of the software reliability model in the related art can only provide information on the outline contents, and have a limitation in not being able to analyze the intermediate cut failure time and the predicted failure time.

또한, 수리 가능 시스템(Repairable system)의 추세 정보는 예방 보전 정책(Preventive Maintenance policy)의 예방 보전 주기와 보전 횟수를 결정하는데 사용되고 있다.In addition, trend information of a repairable system is used to determine the preventive maintenance cycle and the number of maintenance of the preventive maintenance policy.

이러한 추세 정보를 얻기 위한 검정 방법으로는 TTT 도시(Total Time on Test plot), Nelson-Aalen 도시, 밀리터리 핸드북 검정, 라플라스 검정, L-R(Lewis-Robinson) 검정, PCNT(Pair-wise Comparison Nonparametric test) 검정이 사용되고 있다. Test methods for obtaining such trend information include TTT plot (Total Time on Test plot), Nelson-Aalen plot, military handbook test, Laplace test, Lewis-Robinson (LR) test, and Pair-wise Comparison Nonparametric test (PCNT) test. Is being used.

그런데, 종래 대부분의 추세 검정 방법들은 단순히 추세의 유무를 검정하기 위한 방법으로서 시스템에 대한 단편적인 정보만을 제공해 준다는 한계가 있다.However, most of the conventional trend test methods have a limitation in that they provide only fragmentary information about the system as a method for simply testing the presence or absence of a trend.

이에 본 발명자는 소프트웨어 신뢰성 비용 및 고장 시간을 고려하여 소프트웨어 제품의 결함 내용을 평가하고, 소프트웨어 고장 시간을 신속하고 정확하게 분석하고 예측할 수 있는 소프트웨어 고장 시간 예측 방법을 발명하기에 이르렀다.Accordingly, the present inventor has come to invent a software failure time prediction method capable of evaluating the defect contents of a software product in consideration of software reliability cost and failure time, and quickly and accurately analyzing and predicting the software failure time.

JP 1996-030493 AJP 1996-030493 A

본 발명의 목적은 에러 기간이 다른 비선형 회귀 모형을 사용하여 중도 절단 시간과 예측 시간을 검토하고, 결정 계수가 높고 평균 제곱 오차가 낮은 모형을 결정하여 장래의 고장 시간을 예측할 수 있는 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 방법을 제공하는데 있다. An object of the present invention is based on a nonlinear regression model capable of predicting future failure times by examining the cutoff time and prediction time using a nonlinear regression model with different error periods, and determining a model with a high coefficient of determination and a low mean square error. It is to provide a method of predicting software failure time.

본 발명의 다른 목적은 상기 목적을 달성하기 위하여 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 방법을 컴퓨터에서 실행시키기 위한 프로그램을 기록한 컴퓨터 판독 가능 기록 매체를 제공하는 것이다.Another object of the present invention is to provide a computer-readable recording medium recording a program for executing a method of predicting software failure time based on a nonlinear regression model in a computer in order to achieve the above object.

상기 목적을 달성하기 위한 본 발명의 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 방법은 추세 검정부가 비선형 회귀 모형을 이용하여 소프트웨어 고장 시간을 예측하는 단계; 및 고장 시간 분석부가 결정 계수 및 평균 제곱 오차를 이용하여 상기 예측된 소프트웨어 고장 시간에 대한 최적의 모수 통계 모형을 결정하는 단계; 를 포함하는 것을 특징으로 한다.The method for predicting software failure time based on a nonlinear regression model of the present invention for achieving the above object includes: predicting a software failure time by using a nonlinear regression model by a trend test unit; And determining, by a failure time analysis unit, an optimal parameter statistical model for the predicted software failure time using a determination coefficient and a mean square error. It characterized in that it comprises a.

상기 목적을 달성하기 위한 본 발명의 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 방법은 (a) 추세 검정 분석부가 수집된 소프트웨어 고장 시간 데이터로 데이터의 유효성을 검증하는 단계; (b) 모수 추정치 계산부가 상기 검증된 고장 시간 데이터를 인가받아 비선형 회귀 모형에 대한 모수 추정치를 계산하는 단계; (c) 최적 모형 기준 산출부가 고장 시간 예측 모형을 결정하기 위한 결정 계수 및 평균 제곱 오차를 계산하는 단계; (d) 고장 시간 예측부가 계산된 상기 결정 계수 및 상기 평균 제곱 오차를 인가받아 실제 값에 대한 장래의 고장 시간을 분석하여 고장 시간 데이터를 예측하는 단계; 및 (e) 최적 모형 결정부가 상기 예측된 고장 시간 데이터를 분석하여 최적의 모수 통계 모형을 결정하는 단계; 를 포함하는 것을 특징으로 한다.A method for predicting software failure time based on a nonlinear regression model of the present invention for achieving the above object includes: (a) verifying the validity of the data using software failure time data collected by a trend test analysis unit; (b) receiving, by a parameter estimate calculation unit, the verified failure time data and calculating a parameter estimate for a nonlinear regression model; (c) calculating, by an optimal model reference calculation unit, a determination coefficient and a mean square error for determining a failure time prediction model; (d) predicting failure time data by analyzing a future failure time with respect to an actual value by receiving the calculated determination coefficient and the mean square error calculated by a failure time prediction unit; And (e) determining an optimal parameter statistical model by analyzing the predicted failure time data by an optimal model determination unit. It characterized in that it comprises a.

상기 다른 목적을 달성하기 위한 본 발명의 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 방법의 컴퓨터 판독 가능 기록 매체는 추세 검정부가 비선형 회귀 모형을 이용하여 소프트웨어 고장 시간을 예측하는 단계; 및 고장 시간 분석부가 결정 계수 및 평균 제곱 오차를 이용하여 상기 예측된 소프트웨어 고장 시간에 대한 최적의 모수 통계 모형을 결정하는 단계; 를 포함하는 것을 특징으로 한다.A computer-readable recording medium of a method for predicting software failure time based on a nonlinear regression model of the present invention for achieving the above other object includes: predicting a software failure time by using a nonlinear regression model by a trend test unit; And determining, by a failure time analysis unit, an optimal parameter statistical model for the predicted software failure time using a determination coefficient and a mean square error. It characterized in that it comprises a.

상기 다른 목적을 달성하기 위한 본 발명의 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 방법의 컴퓨터 판독 가능 기록 매체는 (a) 추세 검정 분석부가 수집된 소프트웨어 고장 시간 데이터로 데이터의 유효성을 검증하는 단계; (b) 모수 추정치 계산부가 상기 검증된 고장 시간 데이터를 인가받아 비선형 회귀 모형에 대한 모수 추정치를 계산하는 단계; (c) 최적 모형 기준 산출부가 고장 시간 예측 모형을 결정하기 위한 결정 계수 및 평균 제곱 오차를 계산하는 단계; (d) 고장 시간 예측부가 계산된 상기 결정 계수 및 상기 평균 제곱 오차를 인가받아 실제 값에 대한 장래의 고장 시간을 분석하여 고장 시간 데이터를 예측하는 단계; 및 (e) 최적 모형 결정부가 상기 예측된 고장 시간 데이터를 분석하여 최적의 모수 통계 모형을 결정하는 단계; 를 포함하는 것을 특징으로 한다.The computer-readable recording medium of the method for predicting software failure time based on a nonlinear regression model of the present invention for achieving the above other object includes: (a) verifying the validity of the data with software failure time data collected by a trend test analysis unit; (b) receiving, by a parameter estimate calculation unit, the verified failure time data and calculating a parameter estimate for a nonlinear regression model; (c) calculating, by an optimal model reference calculation unit, a determination coefficient and a mean square error for determining a failure time prediction model; (d) predicting failure time data by analyzing a future failure time with respect to an actual value by receiving the calculated determination coefficient and the mean square error calculated by a failure time prediction unit; And (e) determining an optimal parameter statistical model by analyzing the predicted failure time data by an optimal model determination unit. It characterized in that it comprises a.

기타 실시예의 구체적인 사항은 "발명을 실시하기 위한 구체적인 내용" 및 첨부 "도면"에 포함되어 있다.Details of other embodiments are included in "Specific contents for carrying out the invention" and the attached "Drawings".

본 발명의 이점 및/또는 특징, 그리고 그것들을 달성하는 방법은 첨부되는 도면과 함께 상세하게 후술되어 있는 각종 실시예를 참조하면 명확해질 것이다.Advantages and/or features of the present invention, and a method of achieving them will become apparent with reference to various embodiments described below in detail together with the accompanying drawings.

그러나, 본 발명은 이하에서 개시되는 각 실시예의 구성만으로 한정되는 것이 아니라 서로 다른 다양한 형태로도 구현될 수도 있으며, 단지 본 명세서에서 개시한 각각의 실시예는 본 발명의 게시가 완전하도록 하며, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에게 본 발명의 범주를 완전하게 알려주기 위해 제공되는 것이며, 본 발명은 청구범위의 각 청구항의 범주에 의해 정의될 뿐임을 알아야 한다.However, the present invention is not limited only to the configuration of each embodiment disclosed below, but may be implemented in various different forms, and only each embodiment disclosed in the present specification makes the posting of the present invention complete. It should be understood that the present invention is provided to completely inform the scope of the present invention to those of ordinary skill in the art, and the present invention is only defined by the scope of each claim in the claims.

본 발명에 의할 경우, 소프트웨어 시스템 설계 및 소프트웨어 신뢰성 분야에서, 소프트웨어 개발자 및 운용자들이 데이터를 검열하는 데 필요한 정보의 증가를 정확하게 예측하고 고려하게 되어, 소프트웨어 고장 시간을 신속하고 정확하게 분석하고 예측이 가능해져 소프트웨어 안정성을 향상시킬 수 있다.According to the present invention, in the field of software system design and software reliability, software developers and operators accurately predict and consider the increase in information required to inspect data, so that software failure time can be quickly and accurately analyzed and predicted. Can improve software stability.

도 1은 본 발명의 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 방법을 구현하기 위한 예측 장치의 블록도이다.
도 2는 본 발명의 일 실시예에 따른 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 방법의 동작을 설명하기 위한 순서도이다.
도 3은 본 발명의 다른 실시예에 따른 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 방법의 동작을 설명하기 위한 순서도이다.
도 4는 도 3에 도시된 소프트웨어 고장 시간의 예측 방법 중 단계(S210) 에서 유효성 검증 대상인 소프트웨어 고장 시간 데이터에 대한 표이다.
도 5는 도 3에 도시된 소프트웨어 고장 시간의 예측 방법 중 단계(S210)에서 이용된 라플라스 추세 검정 분석에 대한 그래프이다.
도 6은 도 3에 도시된 소프트웨어 고장 시간의 예측 방법 중 단계(S220) 에서 계산된 비선형 회귀 모형 각각의 모수 추정치에 대한 표이다.
도 7은 도 3에 도시된 소프트웨어 고장 시간의 예측 방법 중 단계(S230) 에서 계산된 비선형 회귀 모형 각각의 결정 계수 및 평균 제곱 오차에 대한 표이다.
도 8은 도 3에 도시된 소프트웨어 고장 시간의 예측 방법에 의해 고장 횟수(1~30) 대비 예측된 고장 시간의 변화를 실제 값과 함께 비선형 회귀 모형 각각을 비교하여 설명하기 위한 그래프이다.
도 9는 도 3에 도시된 소프트웨어 고장 시간의 예측 방법에 의해 예측된 고장 횟수(1~30)에 관한 기초 통계 정보에 대하여 비선형 회귀 모형 각각을 비교하여 설명하기 위한 표이다.
도 10은 도 3에 도시된 소프트웨어 고장 시간의 예측 방법에 의해 고장 횟수(31~40) 대비 예측된 고장 시간의 변화를 비선형 회귀 모형 각각을 비교하여 설명하기 위한 그래프이다.
도 11은 도 3에 도시된 소프트웨어 고장 시간의 예측 방법에 의해 예측된 고장 횟수(31~40)에 관한 기초 통계 정보에 대하여 비선형 회귀 모형 각각을 비교하여 설명하기 위한 표이다.1 is a block diagram of a prediction apparatus for implementing a method of predicting a software failure time based on a nonlinear regression model of the present invention.
FIG. 2 is a flowchart illustrating an operation of a method for predicting a software failure time based on a nonlinear regression model according to an embodiment of the present invention.
3 is a flowchart for explaining the operation of a method for predicting a software failure time based on a nonlinear regression model according to another embodiment of the present invention.
FIG. 4 is a table showing software failure time data to be validated in step S210 of the method for predicting software failure time shown in FIG. 3.
FIG. 5 is a graph of a Laplace trend test analysis used in step S210 of the method for predicting software failure time shown in FIG. 3.
6 is a table showing parameter estimates of each of the nonlinear regression models calculated in step S220 of the software failure time prediction method shown in FIG. 3.
FIG. 7 is a table showing coefficients of determination and mean square errors of each of the nonlinear regression models calculated in step S230 of the method for predicting software failure time illustrated in FIG. 3.
FIG. 8 is a graph for explaining a change in the predicted failure time compared to the number of failures (1 to 30) by the method of predicting the software failure time shown in FIG. 3 by comparing each nonlinear regression model with an actual value.
9 is a table for explaining by comparing each nonlinear regression model with respect to basic statistical information on the number of failures (1 to 30) predicted by the method for predicting a software failure time shown in FIG. 3.
FIG. 10 is a graph for explaining a change in the predicted failure time compared to the number of failures 31 to 40 by the method of predicting the software failure time shown in FIG. 3 by comparing each of the nonlinear regression models.
11 is a table for explaining by comparing each of the nonlinear regression models with respect to basic statistical information about the number of failures 31 to 40 predicted by the method for predicting a software failure time illustrated in FIG. 3.

이하, 첨부한 도면을 참고로 하여 본 발명의 바람직한 실시예에 대하여 상세히 설명하면 다음과 같다.Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings.

본 발명을 상세하게 설명하기 전에, 본 명세서에서 사용된 용어나 단어는 통상적이거나 사전적인 의미로 무조건 한정하여 해석되어서는 아니되며, 본 발명의 발명자가 자신의 발명을 가장 최선의 방법으로 설명하기 위해서 각종 용어의 개념을 적절하게 정의하여 사용할 수 있다.Before describing the present invention in detail, terms and words used in the present specification are not to be interpreted as being unconditionally limited to their usual or dictionary meanings, and in order for the inventors of the present invention to describe their invention in the best way The concept of various terms can be appropriately defined and used.

더 나아가 이들 용어나 단어는 본 발명의 기술적 사상에 부합하는 의미와 개념으로 해석되어야 함을 알아야 한다.Furthermore, it should be understood that these terms or words should be interpreted as meanings and concepts consistent with the technical idea of the present invention.

즉, 본 명세서에서 사용된 용어는 본 발명의 바람직한 실시예를 설명하기 위해서 사용되는 것일 뿐이고, 본 발명의 내용을 구체적으로 한정하려는 의도로 사용된 것이 아니다.That is, the terms used in the present specification are only used to describe a preferred embodiment of the present invention, and are not intended to specifically limit the content of the present invention.

이들 용어는 본 발명의 여러 가지 가능성을 고려하여 정의된 용어임을 알아야 한다.It should be understood that these terms are terms defined in consideration of various possibilities of the present invention.

또한, 본 명세서에 있어서, 단수의 표현은 문맥상 명확하게 다른 의미로 지시하지 않는 이상, 복수의 표현을 포함할 수 있다.In addition, in the present specification, a singular expression may include a plurality of expressions unless clearly indicated in a different meaning in the context.

또한, 유사하게 복수로 표현되어 있다고 하더라도 단수의 의미를 포함할 수 있음을 알아야 한다.In addition, it should be noted that even if it is expressed in a similar plural form, it may include a singular meaning.

본 명세서의 전체에 걸쳐서 어떤 구성 요소가 다른 구성 요소를 "포함"한다고 기재하는 경우에는, 특별히 반대되는 의미의 기재가 없는 한 임의의 다른 구성 요소를 제외하는 것이 아니라 임의의 다른 구성 요소를 더 포함할 수도 있다는 것을 의미할 수 있다.Throughout the present specification, when a component is described as "including" another component, it does not exclude any other component, but further includes any other component unless otherwise indicated. It could mean you can do it.

더 나아가서, 어떤 구성 요소가 다른 구성 요소의 "내부에 존재하거나, 연결되어 설치된다"고 기재한 경우에는, 이 구성 요소가 다른 구성 요소와 직접적으로 연결되어 있거나 접촉하여 설치되어 있을 수 있다.Furthermore, when a component is described as "existing inside or being connected and installed" of another component, the component may be directly connected to or installed in contact with the other component.

또한, 일정한 거리를 두고 이격되어 설치되어 있을 수도 있으며, 일정한 거리를 두고 이격되어 설치되어 있는 경우에 대해서는 해당 구성 요소를 다른 구성 요소에 고정 내지 연결시키기 위한 제 3의 구성 요소 또는 수단이 존재할 수 있다.In addition, it may be installed spaced apart by a certain distance, and in the case of installation spaced apart from each other by a certain distance, a third component or means for fixing or connecting the component to other components may exist. .

한편, 상기 제 3의 구성 요소 또는 수단에 대한 설명은 생략될 수도 있음을 알아야 한다.Meanwhile, it should be understood that the description of the third component or means may be omitted.

반면에, 어떤 구성 요소가 다른 구성 요소에 "직접 연결"되어 있다거나, 또는 "직접 접속"되어 있다고 기재되는 경우에는, 제 3의 구성 요소 또는 수단이 존재하지 않는 것으로 이해하여야 한다.On the other hand, when a component is described as being "directly connected" to another component or "directly connected", it should be understood that there is no third component or means.

마찬가지로, 각 구성 요소 간의 관계를 설명하는 다른 표현들, 즉 " ~ 사이에"와 "바로 ~ 사이에", 또는 " ~ 에 이웃하는"과 " ~ 에 직접 이웃하는" 등도 마찬가지의 취지를 가지고 있는 것으로 해석되어야 한다.Likewise, other expressions describing the relationship between each component, such as "between" and "directly between", or "neighbor to" and "directly neighbor to" have the same effect. Should be interpreted as.

또한, 본 명세서에 있어서 "일면", "타면", "일측", "타측", "제 1", "제 2" 등의 용어는, 하나의 구성 요소에 대해서 이 하나의 구성 요소가 다른 구성 요소로부터 명확하게 구별될 수 있도록 하기 위해서 사용된다.In addition, in the present specification, terms such as “one side”, “the other side”, “one side”, “the other side”, “first”, and “second” refer to a component in which one component is different for one component. It is used to make it clearly distinct from the element.

하지만, 이와 같은 용어에 의해서 해당 구성 요소의 의미가 제한적으로 사용되는 것은 아님을 알아야 한다.However, it should be noted that the meaning of the corresponding component is not limitedly used by such terms.

또한, 본 명세서에서 "상", "하", "좌", "우" 등의 위치와 관련된 용어는, 사용된다면, 해당 구성 요소에 대해서 해당 도면에서의 상대적인 위치를 나타내고 있는 것으로 이해하여야 한다.In addition, in the present specification, terms related to positions such as "upper", "lower", "left", and "right", if used, should be understood as indicating a relative position in the drawing with respect to the corresponding component.

또한, 이들의 위치에 대해서 절대적인 위치를 특정하지 않는 이상은, 이들 위치 관련 용어가 절대적인 위치를 언급하고 있는 것으로 이해하여서는 아니된다.In addition, these position-related terms should not be understood as referring to absolute positions unless absolute positions are specified for their positions.

더욱이, 본 발명의 명세서에서는, "…부", "…기", "모듈", "장치" 등의 용어는, 사용된다면, 하나 이상의 기능이나 동작을 처리할 수 있는 단위를 의미한다.Moreover, in the specification of the present invention, terms such as "... unit", "... group", "module", "device", if used, mean a unit capable of processing one or more functions or operations.

이는 하드웨어 또는 소프트웨어, 또는 하드웨어와 소프트웨어의 결합으로 구현될 수 있음을 알아야 한다.It should be noted that this can be implemented in hardware or software, or a combination of hardware and software.

본 명세서에 첨부된 도면에서 본 발명을 구성하는 각 구성 요소의 크기, 위치, 결합 관계 등은 본 발명의 사상을 충분히 명확하게 전달할 수 있도록 하기 위해서 또는 설명의 편의를 위해서 일부 과장 또는 축소되거나 생략되어 기술되어 있을 수 있고, 따라서 그 비례나 축척은 엄밀하지 않을 수 있다.In the drawings attached to the present specification, the size, position, and coupling relationship of each component constituting the present invention are partially exaggerated, reduced, or omitted in order to sufficiently clearly convey the spirit of the present invention or for convenience of description. It may have been described, and therefore its proportion or scale may not be exact.

또한, 이하에서, 본 발명을 설명함에 있어서, 본 발명의 요지를 불필요하게 흐릴 수 있다고 판단되는 구성, 예를 들어, 종래 기술을 포함하는 공지 기술에 대한 상세한 설명은 생략될 수도 있다.In addition, in the following description of the present invention, a detailed description of a configuration determined to unnecessarily obscure the subject matter of the present invention, for example, a known technology including the prior art may be omitted.

도 1은 본 발명의 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 방법을 구현하기 위한 예측 장치의 블록도로서, 추세 검정부(100) 및 고장 시간 분석부(200)를 구비한다. 1 is a block diagram of a prediction apparatus for implementing a method for predicting a software failure time based on a nonlinear regression model of the present invention, and includes a trend test unit 100 and a failure time analysis unit 200.

추세 검정부(100)는 추세 검정 분석부(110), 모수 추정치 계산부(120) 및 최적 모형 기준 산출부(130)를 구비하고, 고장 시간 분석부(200)는 고장 시간 예측부(210) 및 최적 모형 결정부(220)를 구비한다. The trend test unit 100 includes a trend test analysis unit 110, a parameter estimate calculation unit 120, and an optimal model reference calculation unit 130, and the failure time analysis unit 200 is a failure time prediction unit 210 And an optimum model determination unit 220.

도 2는 본 발명의 일 실시예에 따른 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 방법의 동작을 설명하기 위한 순서도이다.FIG. 2 is a flowchart illustrating an operation of a method for predicting a software failure time based on a nonlinear regression model according to an embodiment of the present invention.

도 1 및 도 2를 참조하여 본 발명의 일 실시예에 따른 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 방법의 동작을 개략적으로 설명하면 다음과 같다. An operation of a method for predicting a software failure time based on a nonlinear regression model according to an embodiment of the present invention will be schematically described below with reference to FIGS. 1 and 2.

본 발명은 최적의 성장 회귀 모형을 이용하여 기 수집된 고장 시간 데이터를 가지고 다음과 같은 알고리즘에 의하여 장래의 고장 시간을 분석 및 예측한다.The present invention analyzes and predicts a future failure time by using the following algorithm with failure time data previously collected using an optimal growth regression model.

즉, 추세 검정부(100)가 비선형 회귀 모형을 이용하여 소프트웨어 고장 시간을 예측한다(S110).That is, the trend test unit 100 predicts the software failure time using the nonlinear regression model (S110).

고장 시간 분석부(200)가 결정 계수 및 평균 제곱 오차를 이용하여 상기 예측된 소프트웨어 고장 시간에 대한 최적의 모수 통계 모형을 결정한다(S120).The failure time analysis unit 200 determines an optimal parameter statistical model for the predicted software failure time using the determination coefficient and the mean squared error (S120).

도 3은 본 발명의 다른 실시예에 따른 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 방법의 동작을 설명하기 위한 순서도이다.3 is a flowchart for explaining the operation of a method for predicting a software failure time based on a nonlinear regression model according to another embodiment of the present invention.

도 1 및 도 3을 참조하여 본 발명의 다른 실시예에 따른 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 방법의 동작을 개략적으로 설명하면 다음과 같다. An operation of a method for predicting a software failure time based on a nonlinear regression model according to another embodiment of the present invention will be schematically described below with reference to FIGS. 1 and 3.

먼저, 추세 검정부(100) 내 추세 검정 분석부(110)가 수집한 소프트웨어 고장 시간 데이터를 가지고 라플라스(Laplace) 추세 검정 분석을 통하여 데이터의 유효성을 검증한다(S210).First, the validity of the data is verified through a Laplace trend test analysis with the software failure time data collected by the trend test analysis unit 110 in the trend test unit 100 (S210).

모수 추정치 계산부(120)가 추세 검정 분석부(110)로부터 데이터의 유효성이 검증된 고장 시간 데이터를 인가받아 비선형 회귀 모형에 대한 모수 추정치(

,

)를 계산한다(S220).The parameter estimate calculation unit 120 receives the failure time data for which the validity of the data is verified from the trend test analysis unit 110, and the parameter estimate for the nonlinear regression model (

,

) Is calculated (S220).

최적 모형 기준 산출부(130)가 효율적인 고장 시간 예측 모형을 결정하기 위한 결정 계수

및 평균 제곱 오차(MSE)를 계산한다(S230). Determination coefficient for the optimal model reference calculation unit 130 to determine an efficient failure time prediction model

And a mean squared error (MSE) is calculated (S230).

그 후에, 고장 시간 분석부(200) 내 고장 시간 예측부(210)가 최적 모형 기준 산출부(130)로부터 계산 결정 계수

및 평균 제곱 오차(MSE)를 인가받아 실제 값에 대한 장래의 고장 시간을 분석하여 고장 시간 데이터를 예측한다(S240). After that, the failure time prediction unit 210 in the failure time analysis unit 200 calculates the calculation determination coefficient from the optimal model reference calculation unit 130

And a mean square error (MSE) is applied to analyze a future failure time with respect to an actual value to predict failure time data (S240).

최적 모형 결정부(220)가 추세 검정 분석부(110)로부터 예측된 고장 시간 데이터를 분석하여 최적의 모수 통계 모형을 결정한다(S250).The optimal model determination unit 220 analyzes the failure time data predicted from the trend test analysis unit 110 to determine an optimal parameter statistical model (S250).

도 4는 도 3에 도시된 소프트웨어 고장 시간의 예측 방법 중 단계(S210) 에서 유효성 검증 대상인 소프트웨어 고장 시간 데이터에 대한 표이다.FIG. 4 is a table showing software failure time data to be validated in step S210 of the method for predicting software failure time shown in FIG. 3.

도 5는 도 3에 도시된 소프트웨어 고장 시간의 예측 방법 중 단계(S210)에서 이용된 라플라스 추세 검정 분석에 대한 그래프이다.FIG. 5 is a graph of a Laplace trend test analysis used in step S210 of the method for predicting software failure time shown in FIG. 3.

도 6은 도 3에 도시된 소프트웨어 고장 시간의 예측 방법 중 단계(S220) 에서 계산된 비선형 회귀 모형 각각의 모수 추정치에 대한 표이다.6 is a table showing parameter estimates of each of the nonlinear regression models calculated in step S220 of the software failure time prediction method shown in FIG. 3.

도 7은 도 3에 도시된 소프트웨어 고장 시간의 예측 방법 중 단계(S230) 에서 계산된 비선형 회귀 모형 각각의 결정 계수 및 평균 제곱 오차에 대한 표이다.FIG. 7 is a table showing coefficients of determination and mean square errors of each of the nonlinear regression models calculated in step S230 of the method for predicting software failure time illustrated in FIG. 3.

도 8은 도 3에 도시된 소프트웨어 고장 시간의 예측 방법에 의해 고장 횟수(1~30) 대비 예측된 고장 시간의 변화를 실제 값과 함께 비선형 회귀 모형 각각을 비교하여 설명하기 위한 그래프이다.FIG. 8 is a graph for explaining a change in the predicted failure time compared to the number of failures (1 to 30) by the method of predicting the software failure time shown in FIG. 3 by comparing each nonlinear regression model with an actual value.

도 9는 도 3에 도시된 소프트웨어 고장 시간의 예측 방법에 의해 예측된 고장 횟수(1~30)에 관한 기초 통계 정보에 대하여 비선형 회귀 모형 각각을 비교하여 설명하기 위한 표이다.9 is a table for explaining by comparing each nonlinear regression model with respect to basic statistical information on the number of failures (1 to 30) predicted by the method for predicting a software failure time shown in FIG. 3.

도 10은 도 3에 도시된 소프트웨어 고장 시간의 예측 방법에 의해 고장 횟수(31~40) 대비 예측된 고장 시간의 변화를 비선형 회귀 모형 각각을 비교하여 설명하기 위한 그래프이다.FIG. 10 is a graph for explaining a change in the predicted failure time compared to the number of failures 31 to 40 by the method of predicting the software failure time shown in FIG. 3 by comparing each of the nonlinear regression models.

도 11은 도 3에 도시된 소프트웨어 고장 시간의 예측 방법에 의해 예측된 고장 횟수(31~40)에 관한 기초 통계 정보에 대하여 비선형 회귀 모형 각각을 비교하여 설명하기 위한 표이다.11 is a table for explaining by comparing each of the nonlinear regression models with respect to basic statistical information about the number of failures 31 to 40 predicted by the method for predicting a software failure time illustrated in FIG. 3.

도 1 내지 도 11을 참조하여 본 발명의 다른 실시예에 따른 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 장치의 유기적인 동작을 상세하게 설명하면 다음과 같다. The organic operation of the apparatus for predicting failure time based on a nonlinear regression model according to another embodiment of the present invention will be described in detail with reference to FIGS. 1 to 11.

추세 검정 분석부(110)가 수집한 소프트웨어 고장 시간 데이터는 도 3에서 보는 바와 같이 고장 횟수(Failure number)와 고장 시간(Failure-time)이 상호 매핑되어 있다.In the software failure time data collected by the trend test analysis unit 110, as shown in FIG. 3, a failure number and a failure time are mapped to each other.

추세 검정 분석부(110)가 도 4에 도시된 소프트웨어 고장 시간 데이터의 유효성을 검증하기 위해서는 우선 추세 분석이 선행되어야 하는데, 본 발명은 도 5에 도시된 라플라스 추세 검정 분석을 이용한다. In order for the trend test analysis unit 110 to verify the validity of the software failure time data shown in FIG. 4, a trend analysis must be preceded, and the present invention uses the Laplace trend test analysis shown in FIG. 5.

여기에서, 라플라스 추세 검정 분석이란 동질 포아송 과정을 귀무가설, 단조 고장 함수를 가지는 비동질 포아송 과정을 대립 가설로 하는 검정을 하기 위한 방법으로서, 만약 귀무가설이 참이라면 검정 통계량은 점근적으로 표준 정규분포가 되는 추세 검정 분석 방식을 의미한다.Here, the Laplace trend test analysis is a method for testing in which the homogeneous Poisson process is a null hypothesis, and the non-homogeneous Poisson process with a monotonic failure function as an alternative hypothesis.If the null hypothesis is true, the test statistic is asymptotically standard normal. It refers to the method of analyzing the trend test that becomes a distribution.

일반적으로 산술 평균 증가 패턴의 경우, 산술 평균 테스트의 결과는 신뢰성 증가를 나타내며, 라플라스 계수 값이 -2 와 2 사이에서 변하는 라플라스 추세 분석 테스트 결과는 안정적인 신뢰성을 나타낸다. In general, in the case of the arithmetic mean increase pattern, the result of the arithmetic mean test indicates an increase in reliability, and the result of the Laplace trend analysis test in which the Laplace coefficient value varies between -2 and 2 indicates stable reliability.

도 5에서 보는 바와 같이, 라플라스 추세 검정의 결과는 라플라스 요인(Factor)이 -2와 2사이에 존재함으로써, 극단값(Extreme value)이 존재하지 않으므로 이 자료를 이용하여 신뢰성 보장을 위한 신뢰 성장 회귀 모형을 제시하는 것이 효율적임을 알 수 있다.As shown in Fig. 5, the result of the Laplace trend test is that the Laplace factor exists between -2 and 2, so there is no extreme value, so using this data to ensure reliability It can be seen that it is efficient to present a model.

본 발명은 장래의 소프트웨어 고장 시간을 예측하기 위해 비선형 회귀 모형인 성장 회귀 모형, 2차 회귀 모형 및 S-곡선 회귀 모형을 사용한다.The present invention uses a growth regression model, a quadratic regression model, and an S-curve regression model, which are nonlinear regression models to predict future software failure times.

비선형 회귀 모형(Nonlinear regression model)은 로그(Log) 또는 가중치(Weight)를 취한 비선형의 경우(곡선 선형 모형), 고장 횟수에 대한 설명 변수(x_i)와 고장 시간에 대한 종속 변수(y_i) 사이가 선형 관계인 다음의 수학식과 같은 선형 모형으로 변환할 수 있다.Nonlinear regression model is a logarithmic or weighted nonlinear case (curved linear model), an explanatory variable for the number of failures (x _i ) and a dependent variable for failure time (y _i ). It can be converted into a linear model as shown in the following equation where the relationship is linear.

[수학식 1][Equation 1]

여기에서, b₀ 및 b₁은 회귀계수를 나타내며,

, 즉 평균이 0, 표준 편차가 σ인 정규 분포를 나타낸다.Here, b ₀ and b ₁ represent regression coefficients,

, That is, a normal distribution with a mean of 0 and a standard deviation of σ.

또한, 본 발명은 장래의 소프트웨어 고장 시간을 예측하기 위해 다음의 수학식과 같은 성장 회귀 모형(Growth regression model)을 사용한다. In addition, the present invention uses a growth regression model such as the following equation to predict a future software failure time.

[수학식 2][Equation 2]

,(

)

,(

)

여기서, β₀ 및 β₁은 각각 b₀ 및 b₁의 추정치, x_i는 고장 횟수에 대한 설명 변수, y_i는 고장 시간에 대한 종속 변수이다.Here, β ₀ and β ₁ are estimates of b ₀ and b ₁ , respectively, x _i is an explanatory variable for the number of failures, and y _i is a dependent variable for failure time.

또한, 본 발명은 장래의 소프트웨어 고장 시간을 예측하기 위해 다음의 수학식과 같은 2차 회귀 모형(Quadratic regression model)을 사용한다. In addition, the present invention uses a quadratic regression model such as the following equation to predict a future software failure time.

[수학식 3][Equation 3]

여기서, β₀ _, β₁ 및 β₂ 는 각각 b_0, b₁및 b₂의 추정치이고, x_i는 고장 횟수에 대한 설명 변수, y_i는 고장 시간에 대한 종속 변수이다.Here, β ₀ _, β ₁ and β ₂ are estimates of b _0, b ₁ and b ₂ _, respectively, x _i is an explanatory variable for the number of failures, and y _i is a dependent variable for failure time.

또한, 본 발명은 장래의 소프트웨어 고장 시간을 예측하기 위해 다음의 수학식과 같은 S-곡선 회귀 모형(S-curve regression model)을 사용한다. In addition, the present invention uses an S-curve regression model as shown in the following equation to predict future software failure time.

[수학식 4][Equation 4]

, (

)

, (

)

도 6에서 보는 바와 같이, 모수 추정치 계산부(120)가 2차 회귀 모형, S-곡선 회귀 모형 및 성장 회귀 모형에 대하여 계산한 회귀 계수(b₁, b₂, b₃)의 모수 추정치(

,

)는 모두 계산되었으나, S-곡선 회귀 모형 및 성장 회귀 모형에 대하여 계산한 회귀 계수(b₂)의 모수 추정치(

)는 계산되지 않았다.As shown in FIG. 6, the parameter estimate of the regression coefficients (b ₁ , b ₂ , b ₃ ) calculated by the parameter estimate calculation unit 120 for the quadratic regression model, the S-curve regression model, and the growth regression model (

,

) Were all calculated, but the parameter estimate of the regression coefficient (b ₂ ) calculated for the S-curve regression model and the growth regression model (

) Was not calculated.

한편, 최적 모형 기준 산출부(130)가 효율적인 고장 시간 예측 모형을 결정하기 위한 결정 계수

및 평균 제곱 오차(MSE)를 계산한다.On the other hand, the optimal model reference calculation unit 130 is a determination coefficient for determining an efficient failure time prediction model

And the mean squared error (MSE) is calculated.

즉, 테스트 모형의 예측된 값과 실제 값을 나타내는 오차를 측정하는데 보다 나은 예측 기법의 기준을 결정하는 방법으로서, 결정 계수

및 평균제곱 오차(MSE)를 이용한 방법을 사용하며, 이러한 최적 모형을 결정하는 기준을 수정할 수 있다.That is, as a method of determining the criterion of a better prediction technique for measuring the error representing the predicted and actual values of the test model, the coefficient of determination

And a method using a mean squared error (MSE), and a criterion for determining such an optimal model can be modified.

결정 계수

는 회귀선에 의한 제곱의 합(SSR), 제곱 오차의 합(SSE) 및 총 제곱(SST)이라고 정의되며, 데이터의 변형을 설명하는데 적합도가 얼마나 성공적인지 측정할 수 있다. Coefficient of determination

Is defined as the sum of squares (SSR), sum of squared errors (SSE), and total squares (SST) by the regression line, and can measure how successful the fit is to explain the transformation of the data.

결정 계수

는 다음의 수학식과 같이 정의된다.Coefficient of determination

Is defined as the following equation.

[수학식 5][Equation 5]

이때,

,

는 y_i의 추정치이고,

는 y_i의 평균값이다.At this time,

,

Is the estimate of y _i ,

Is the average value of y _i .

다음은 효율적인 모형을 결정하기 위한 기준으로서, 평균 제곱 오차(MSE)는 다음의 수학식과 같이 정의된다.The following is a criterion for determining an efficient model, and the mean squared error (MSE) is defined as the following equation.

[수학식 6][Equation 6]

여기에서,

은 시간

까지 나타난 에러들의 누적 함수를 의미하고,

는

시점까지 평균값 함수로부터 추정된 에러의 누적 개수를 의미한다. From here,

Silver time

It means the cumulative function of the errors shown up to,

Is

It means the cumulative number of errors estimated from the average value function up to the point in time.

또한, n은 관찰 값의 수이고, k는 모수의 수를 의미한다. Also, n is the number of observations, and k is the number of parameters.

보통 평균제곱 오차가 작을수록 좋으므로, 최소값을 갖는 평균 제곱 오차 모형일수록 더욱 효율적임을 알 수 있다. Usually, the smaller the mean squared error is, the better, so it can be seen that the mean squared error model with the minimum value is more efficient.

그러므로, 시뮬레이션 결과 값의 비교에 있어서 평균 제곱 오차(MSE) 값이 작으면 상대적으로 효율적인 모형이 된다.Therefore, a relatively efficient model is obtained when the mean square error (MSE) value is small in comparison of simulation results.

도 7에서 보는 바와 같이, 최적 모형 기준 산출부(130)가 계산한 결정 계수

및 평균 제곱 오차(MSE)를 각 비선형 회귀 모형 별로 비교하면, 성장 회귀 모형이 2차 회귀 모형 및 S-곡선 회귀 모형보다 결정 계수

가 높고, 평균 제곱 오차(MSE)가 작은 것으로 나타났다.As shown in FIG. 7, the determination coefficient calculated by the optimal model reference calculation unit 130

And mean squared error (MSE) for each nonlinear regression model, the growth regression model is more determined than the quadratic regression model and the S-curve regression model.

Was high and the mean square error (MSE) was small.

이에, 최적 모형 결정부(220)가 성장 회귀 모형이 나머지 비선형 회귀 모형보다 적합성 측면에서 효과적임을 결정하게 된다.Accordingly, the optimal model determination unit 220 determines that the growth regression model is more effective in terms of suitability than the other nonlinear regression models.

도 8에서 보는 바와 같이, 시점-1(고장 횟수)에서 시점-30(고장 횟수)까지 비선형 회귀 모형인 성장 회귀 모형, 2차 회귀 모형 및 S-곡선 회귀 모형을 사용하여, 실제 값에 대하여 고장 시간을 예측한 결과값을 나타낸다. As shown in Figure 8, using a growth regression model, a quadratic regression model, and an S-curve regression model, which are nonlinear regression models from time point-1 (number of failures) to time point -30 (number of failures), failures against actual values Shows the result of predicting time.

도 8에서 모든 모형은 실제 값과 유사하게 추정하고 있으나, 특히, 성장 회귀 모형이 상대적으로 더 나은 예측 능력을 보여 주고 있으며, S-곡선 회귀 모형은 실제 값보다 낮게(과소 평가) 고장 시간을 예측하였고, 2차 모형은 더 높게 (과대 평가) 고장 시간을 예측하는 것으로 확인되었다.In Figure 8, all models are estimated to be similar to actual values, but in particular, the growth regression model shows relatively better predictive ability, and the S-curve regression model predicts the failure time lower than the actual value (underestimated). It was found that the second-order model predicted the failure time higher (overestimated).

도 9에서 보는 바와 같이, 고장 시간 예측부(210)가 예측한 기초 통계 정보를 각 비선형 회귀 모형 별로 비교하면, S-곡선 회귀 모형의 최대 값과 최소값의 추정 값은 10.181에서 224.132까지 범위로 나타내어 안정적이므로, S-곡선 회귀 모형이 나머지 모형보다 더 좋은 특성과 범위를 나타내고 있음을 알 수 있다.As shown in FIG. 9, when comparing the basic statistical information predicted by the failure time prediction unit 210 for each nonlinear regression model, the estimated maximum and minimum values of the S-curve regression model range from 10.181 to 224.132. Since it is stable, it can be seen that the S-curve regression model shows better properties and range than the rest of the models.

또한, 표준 편차 및 분산에서 2차 모형이 가장 높고, S-곡선 회귀 모형이 상대적으로 작기 때문에, 평균값은 2차 모형이 가장 높고 S-곡선 회귀 모형이 가장 낮은 것으로 확인되었다.In addition, since the quadratic model is the highest in standard deviation and variance and the S-curve regression model is relatively small, the average value is the highest in the quadratic model and the S-curve regression model is the lowest.

따라서, 안정적인 2차 모형을 사용하기 위해서는 분산 측정에 대한 테스트가 먼저 선행되어야 함을 최적 모형 결정부(220)가 결정하게 된다.Therefore, in order to use a stable quadratic model, the optimum model determination unit 220 determines that a test for variance measurement must be preceded.

도 10에서 보는 바와 같이, 시점-31(고장 횟수)에서 시점-40(고장 횟수)까지 비선형 회귀 모형인 성장 회귀 모형, 2차 회귀 모형 및 S-곡선 회귀 모형을 사용하여, 비 감소 특성을 나타내는 고장 시간을 예측한 결과값을 나타낸다. As shown in FIG. 10, using a growth regression model, a quadratic regression model, and an S-curve regression model, which are nonlinear regression models from time point-31 (number of failures) to time point -40 (number of failures), Shows the result of predicting the failure time.

도 10에서 고장 횟수가 증가할수록 고장에 대한 예측값은 성장 회귀 모형이 상승하는 경향이 상대적으로 높은데, 특히, S-곡선 회귀 모형이 성장 회귀 모형 및 2차 회귀 모형과 비교할 때 상대적으로 낮은 것으로 확인되었다.In Figure 10, as the number of failures increases, the predicted value for failure has a relatively high tendency for the growth regression model to rise.In particular, it was confirmed that the S-curve regression model was relatively low when compared to the growth regression model and the quadratic regression model. .

이는 범위 추정(특히, 표준 편차 및 분산의 추정)에 대하여 S-곡선 회귀 모형이 상대적으로 낮게 나타나고, 성장 회귀 모형이 가장 높기 때문이다. This is because the S-curve regression model appears relatively low and the growth regression model is the highest for range estimation (especially, estimation of standard deviation and variance).

도 11에서 보는 바와 같이, 고장 시간 예측부(210)가 시점-31(고장 횟수)에서 시점-40(고장 횟수)까지 예측한 기초 통계 정보를 각 비선형 회귀 모형 별로 비교하면, S-곡선 회귀 모형의 범위 추정 값은 5.282로서, 추정 값이 1185.198인 성장 회귀 모형과 추정 값이 653.879인 2차 회귀 모형과 비교할 때 현저하게 낮게 예측된 결과를 알 수 있다.As shown in FIG. 11, if the failure time prediction unit 210 compares the basic statistical information predicted from the time point-31 (the number of failures) to the time point -40 (the number of failures) for each nonlinear regression model, the S-curve regression model The estimated range of is 5.282, which shows a significantly lower predicted result when compared with the growth regression model with an estimated value of 1185.198 and the quadratic regression model with an estimated value of 653.879.

따라서, 안정적인 성장 회귀 모형을 사용하기 위해서는 분산 측정에 대한 테스트가 먼저 선행되어야 함을 최적 모형 결정부(220)가 결정하게 된다.Therefore, in order to use a stable growth regression model, the optimum model determination unit 220 determines that a test for variance measurement must be preceded.

한편, 본 발명에 따른 기록 매체는 전술한 본 발명에 따른 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 방법을 컴퓨터에서 실행시키기 위한 프로그램을 기록한 컴퓨터 판독 가능 기록 매체이다. Meanwhile, the recording medium according to the present invention is a computer-readable recording medium in which a program for executing the method for predicting a software failure time based on a nonlinear regression model according to the present invention described above is recorded in a computer.

본 발명의 일 실시예에 따른 기록 매체에는 추세 검정부가 비선형 회귀 모형을 이용하여 소프트웨어 고장 시간을 예측하는 단계; 및 고장 시간 분석부가 결정 계수 및 평균 제곱 오차를 이용하여 상기 예측된 소프트웨어 고장 시간에 대한 최적의 모수 통계 모형을 결정하는 단계; 가 포함된다.In the recording medium according to an embodiment of the present invention, a trend detection unit predicts a software failure time using a nonlinear regression model; And determining, by a failure time analysis unit, an optimal parameter statistical model for the predicted software failure time using a determination coefficient and a mean square error. Is included.

본 발명의 다른 실시예에 따른 기록 매체에는 (a) 추세 검정 분석부가 수집된 소프트웨어 고장 시간 데이터로 데이터의 유효성을 검증하는 단계; (b) 모수 추정치 계산부가 상기 검증된 고장 시간 데이터를 인가받아 비선형 회귀 모형에 대한 모수 추정치를 계산하는 단계; (c) 최적 모형 기준 산출부가 고장 시간 예측 모형을 결정하기 위한 결정 계수 및 평균 제곱 오차를 계산하는 단계; (d) 고장 시간 예측부가 계산된 상기 결정 계수 및 상기 평균 제곱 오차를 인가받아 실제 값에 대한 장래의 고장 시간을 분석하여 고장 시간 데이터를 예측하는 단계; 및 (e) 최적 모형 결정부가 상기 예측된 고장 시간 데이터를 분석하여 최적의 모수 통계 모형을 결정하는 단계; 가 포함된다.The recording medium according to another embodiment of the present invention includes the steps of: (a) verifying the validity of the data with software failure time data collected by a trend test analysis unit; (b) receiving, by a parameter estimate calculation unit, the verified failure time data and calculating a parameter estimate for a nonlinear regression model; (c) calculating, by an optimal model reference calculation unit, a determination coefficient and a mean square error for determining a failure time prediction model; (d) predicting failure time data by analyzing a future failure time with respect to an actual value by receiving the calculated determination coefficient and the mean square error calculated by a failure time prediction unit; And (e) determining an optimal parameter statistical model by analyzing the predicted failure time data by an optimal model determination unit. Is included.

상기 컴퓨터 판독 가능 기록 매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. The computer-readable recording medium may include program instructions, data files, data structures, and the like alone or in combination.

상기 기록 매체에 기록되는 프로그램 명령은 실시예를 위하여 특별히 설계되고 구성된 것들이거나 컴퓨터 소프트웨어 당업자에게 공지되어 사용 가능한 것일 수도 있다. The program instructions recorded on the recording medium may be specially designed and configured for the embodiment, or may be known and usable to those skilled in computer software.

컴퓨터 판독 가능 기록 매체의 예에는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체(magnetic media), CD-ROM, DVD와 같은 광기록 매체(optical media), 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical media), 및 롬(ROM), 램(RAM), 플래시 메모리 등과 같은 프로그램 명령을 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다.Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks, and magnetic tapes, optical media such as CD-ROMs and DVDs, and magnetic media such as floptical disks. -A hardware device specially configured to store and execute program instructions such as magneto-optical media, and ROM, RAM, flash memory, and the like.

프로그램 명령의 예에는 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드를 포함한다. Examples of the program instructions include not only machine language codes such as those produced by a compiler, but also high-level language codes that can be executed by a computer using an interpreter or the like.

상기된 하드웨어 장치는 실시예의 동작을 수행하기 위해 하나 이상의 소프트웨어 모듈로서 작동하도록 구성될 수 있으며, 그 역도 마찬가지이다. The hardware device described above may be configured to operate as one or more software modules to perform the operation of the embodiment, and vice versa.

이와 같이, 본 발명은 에러 기간이 다른 비선형 회귀 모형을 사용하여 중도 절단 시간과 예측 시간을 검토하고, 결정 계수가 높고 평균 제곱 오차가 낮은 모형을 결정하여 장래의 고장 시간을 예측할 수 있는 비선형 회귀 모형 기반 소프트웨어 고장 시간의 예측 방법 및 컴퓨터 판독 가능 기록 매체를 제공한다.As described above, the present invention is a nonlinear regression model capable of predicting a future failure time by examining the cutoff time and the prediction time using a nonlinear regression model with different error periods, and determining a model with a high coefficient of determination and a low mean square error. It provides a method of predicting the failure time of the based software and a computer-readable recording medium.

이를 통하여, 소프트웨어 시스템 설계 및 소프트웨어 신뢰성 분야에서, 소프트웨어 개발자 및 운용자들이 데이터를 검열하는 데 필요한 정보의 증가를 정확하게 예측하고 고려하게 되어, 소프트웨어 고장 시간을 신속하고 정확하게 분석하고 예측이 가능해져 소프트웨어 안정성을 향상시킬 수 있다.Through this, in the field of software system design and software reliability, software developers and operators can accurately predict and consider the increase in information needed to inspect data, enabling rapid and accurate analysis and prediction of software failure times, thereby improving software stability. Can be improved.

이상, 일부 예를 들어서 본 발명의 바람직한 여러 가지 실시예에 대해서 설명하였지만, 본 "발명을 실시하기 위한 구체적인 내용" 항목에 기재된 여러 가지 다양한 실시예에 관한 설명은 예시적인 것에 불과한 것이며, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자라면 이상의 설명으로부터 본 발명을 다양하게 변형하여 실시하거나 본 발명과 균등한 실시를 행할 수 있다는 점을 잘 이해하고 있을 것이다.In the above, several preferred embodiments of the present invention have been described with some examples, but the description of the various various embodiments described in the "Specific Contents for Carrying out the Invention" section is merely exemplary, and the present invention is Those of ordinary skill in the art to which it belongs will be well understood from the above description that the present invention can be variously modified to be implemented or equivalent to the present invention.

또한, 본 발명은 다른 다양한 형태로 구현될 수 있기 때문에 본 발명은 상술한 설명에 의해서 한정되는 것이 아니며, 이상의 설명은 본 발명의 개시 내용이 완전해지도록 하기 위한 것으로 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에게 본 발명의 범주를 완전하게 알려주기 위해 제공되는 것일 뿐이며, 본 발명은 청구범위의 각 청구항에 의해서 정의될 뿐임을 알아야 한다.In addition, since the present invention can be implemented in a variety of other forms, the present invention is not limited by the above description, and the above description is intended to complete the disclosure of the present invention, and is generally used in the technical field to which the present invention pertains. It should be understood that it is provided only to completely inform the scope of the present invention to those skilled in the art, and that the present invention is only defined by each claim of the claims.

100: 추세 검정부
110: 추세 검정 분석부
120: 모수 추정치 계산부
130: 최적 모형 기준 산출부
200: 고장 시간 분석부
210: 고장 시간 예측부
220: 최적 모형 결정부100: trend test section
110: trend test analysis unit
120: parameter estimate calculation unit
130: optimal model reference calculation unit
200: failure time analysis unit
210: failure time prediction unit
220: optimal model decision section

Claims

Predicting a software failure time by using a nonlinear regression model by a trend test unit; And
Determining, by a failure time analysis unit, an optimal parameter statistical model for the predicted software failure time using a determination coefficient and a mean square error;
Characterized in that it comprises a,
A method of predicting software failure time based on a nonlinear regression model.

(a) verifying the validity of the data using the software failure time data collected by the trend test analysis unit;
(b) receiving, by a parameter estimate calculation unit, the verified failure time data and calculating a parameter estimate for a nonlinear regression model;
(c) calculating, by an optimal model reference calculation unit, a determination coefficient and a mean square error for determining a failure time prediction model;
(d) predicting failure time data by analyzing a future failure time with respect to an actual value by receiving the calculated determination coefficient and the mean square error calculated by a failure time prediction unit; And
(e) determining an optimal parameter statistical model by analyzing the predicted failure time data by an optimal model determination unit;
Characterized in that it comprises a,
A method of predicting software failure time based on a nonlinear regression model.

The method of claim 2,
Step (a)
Characterized in that the validity of the data is verified using a Laplace trend test analysis technique,
A method of predicting software failure time based on a nonlinear regression model.

The method of claim 2,
Step (a)
The explanatory variable (x _i ) for the number of failures and the dependent variable (y _i ) for the failure time are transformed into a linear model by Equation 1, which is a linear relationship,
Equation 1 is

as,
B ₀ and b ₁ are regression coefficients,

, That is, characterized in that it is a normal distribution with a mean of 0 and a standard deviation of σ,
A method of predicting software failure time based on a nonlinear regression model.

The method of claim 4,
The nonlinear regression model is
It characterized in that any one of a growth regression model, a quadratic regression model, and an S-curve regression model,
A method of predicting software failure time based on a nonlinear regression model.

The method of claim 5,
The growth regression model is defined by Equation 2,
Equation 2 is

,(

)as,
Wherein the β ₀ and β ₁ are the estimates of the b ₀ and the b ₁ , respectively, the x _i is an explanatory variable for the number of failures, and y _i is a dependent variable for the failure time,
A method of predicting software failure time based on a nonlinear regression model.

The method of claim 5,
The quadratic regression model is defined by Equation 3,
Equation 3 is

as,
The β _0, the β ₁ and the β ₂ are the estimated values of the b _0, the b ₁ and the regression coefficient b ₂ , respectively, the x _i is an explanatory variable for the number of failures, and y _i is a dependent variable for the failure time. Characterized in that,
A method of predicting software failure time based on a nonlinear regression model.

The method of claim 5,
The S-curve regression model is defined by Equation 4,
Equation 4 is

, (

The method of claim 2,
In step (c),
The coefficient of determination is defined by Equation 5 as the sum of squares (SSR), the sum of squared errors (SSE), and the total squares (SST) by a regression line,
Equation 5 is

Is,
remind

, remind

Is, above

Is the estimate of y _i , where

Is the average value of y _i ,
A method of predicting software failure time based on a nonlinear regression model.

The method of claim 2,
In step (c),
The mean squared error is defined by Equation 6,
Equation 6 is

Is,
remind

Silver time

The cumulative function of the errors shown to, above

Is the cumulative number of errors estimated from the average value function up to the time point x _i , n is the observed value, and k is the number of parameters,
A method of predicting software failure time based on a nonlinear regression model.

Predicting a software failure time by using a nonlinear regression model by a trend test unit; And
Determining, by a failure time analysis unit, an optimal parameter statistical model for the predicted software failure time using a determination coefficient and a mean square error;
Characterized in that it comprises a,
Computer-readable recording medium of a method of predicting software failure time based on a nonlinear regression model.

(a) verifying the validity of the data using the software failure time data collected by the trend test analysis unit;
(b) receiving, by a parameter estimate calculation unit, the verified failure time data and calculating a parameter estimate for a nonlinear regression model;
(c) calculating, by an optimal model reference calculation unit, a determination coefficient and a mean square error for determining a failure time prediction model;
(d) predicting failure time data by analyzing a future failure time with respect to an actual value by receiving the calculated determination coefficient and the mean square error calculated by a failure time prediction unit; And
(e) determining an optimal parameter statistical model by analyzing the predicted failure time data by an optimal model determination unit;
Characterized in that it comprises a,
Computer-readable recording medium of a method of predicting software failure time based on a nonlinear regression model.