CN105653450A - Software defect data feature selection method based on combination of modified genetic algorithm and Adaboost - Google Patents

Software defect data feature selection method based on combination of modified genetic algorithm and Adaboost Download PDF

Info

Publication number
CN105653450A
CN105653450A CN201511003717.2A CN201511003717A CN105653450A CN 105653450 A CN105653450 A CN 105653450A CN 201511003717 A CN201511003717 A CN 201511003717A CN 105653450 A CN105653450 A CN 105653450A
Authority
CN
China
Prior art keywords
feature
software
adaboost
genetic algorithm
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201511003717.2A
Other languages
Chinese (zh)
Inventor
李克文
邹晶杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China University of Petroleum East China
Original Assignee
China University of Petroleum East China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China University of Petroleum East China filed Critical China University of Petroleum East China
Priority to CN201511003717.2A priority Critical patent/CN105653450A/en
Publication of CN105653450A publication Critical patent/CN105653450A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/36Preventing errors by testing or debugging software
    • G06F11/3604Software analysis for verifying properties of programs
    • G06F11/3608Software analysis for verifying properties of programs using formal methods, e.g. model checking, abstract interpretation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G06F18/2148Generating training patterns; Bootstrap methods, e.g. bagging or boosting characterised by the process organisation or structure, e.g. boosting cascade
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/12Computing arrangements based on biological models using genetic models
    • G06N3/126Evolutionary algorithms, e.g. genetic algorithms or genetic programming

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Software Systems (AREA)
  • Physiology (AREA)
  • Genetics & Genomics (AREA)
  • Biomedical Technology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Computer Hardware Design (AREA)
  • Quality & Reliability (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention is mainly applied to the field of software engineering, and targeted at a randomness problem of the existing software defect data feature selection method, provides a software defect data feature selection method based on combination of a modified genetic algorithm and Adaboost. The main steps are as follows: firstly, acquiring software module data from a software data set intensively, and performing labeling processing on the data; then dividing a feature space, and constructing a feature selection classifier based on Adaboost, wherein each feature sub-space corresponds to one weak classifier; optimizing Adaboost by using a frequency idea based genetic algorithm: performing screening of an optimal feature by combining a selected frequency of a feature; and finally, performing a test on the data set according to an acquired optimal feature subset, and comparing the method with other feature selection methods, so as to verify stability and accuracy of the method, and establishing a software defect prediction model. The method provided by the present invention is capable of relatively well overcoming the randomness problem in the software defect data feature selection process and has relatively good stability and relatively high accuracy.

Description

Based on the software defect data characteristics system of selection that improved adaptive GA-IAGA is combined with Adaboost
Technical field
The invention belongs to field of software engineering, be specifically related to a kind of software defect data characteristics system of selection being combined with Adaboost based on improved adaptive GA-IAGA.
Background technology
Rapid development of information technology, data mass memory, all trades and professions are in the urgent need to being converted into knowledge by data. From data, find that valuable information becomes the focus of current theory and application research. Statistics, data base, machine learning and visualization technique have been merged in data mining, by analysis of history data, it has been found that unknown and novel teachings, provide a kind of effective way for solving information age " data explosion, poor in information " problem. Feature selection (Featureselection) and classification (Classification) are modal tasks in DM, by feature selection by high dimensional data dimensionality reduction, forecast model is set up, it is possible to provide the Accurate Prediction to unknown problem by classifying.
Along with the enhancing day by day of increase and the logic complexity thereof day by day of software system scale, the defect having in software certainly will threaten the reliability of software, affects software quality. Due to the industry-by-industry strong depend-ence to software system, software fault may result in serious consequence, especially for high risk sexual system or even fatal. Software defect Predicting Technique is as a kind of important approach instructed with assessment software test job, and Experience of Software Testing Work is had important directive significance by the distribution situation of forecasting software defect exactly. For a software system, rational prediction defect can add up the defect counts and defect distribution that not yet find but still suffer from. Do so not only can instruct in the module that limited energy and resource input are easily made mistakes by developer to those, saves substantial amounts of human cost and resource; But also can objective appraisal test result, this to software quality, development cost and construction cycle control suffer from great meaning.
That software defect is predicted it is crucial that find there is abnormal module, this actually can regard two classification problems as, is divided into " normally " and two classifications of "abnormal" by software module.The premise of classification is by feature selection, and the optimal feature subset according to selecting is classified. But in reality, the feature selection process of software defect data faces problems:
(1) there is uncorrelated in a large number and redundancy software features
In each class software features, except basic feature is directly to extract from source code, other feature is all calculated by these basic feature values. Thus can obtaining, the dependency between same class software features is relatively big, there is more redundancy feature. Substantial amounts of redundancy or uncorrelated features participate in computing, it will bring dimension disaster, admittedly feature set need to be carried out dimension-reduction treatment, select optimal feature subset.
(2) result is selected to have randomness
In existing Feature Selection, single grader is adopted to classify, and utilizing classification results that feature selection result is passed judgment on, the quality of feature only comes from single grader classification results so that run the shortcoming that the optimal feature subset obtained has randomness.
To sum up, under the situation that current internet software product develops rapidly, proposing efficient software defect forecast model, software carries out correct abnormality detection is problem in the urgent need to address, and its premise selects optimal feature subset, is the basis and key of instructing abnormality detecting process accurately to carry out.
Summary of the invention
Present invention aim to address traditional stochastic problems existing for software defect data characteristics system of selection, it is provided that a kind of software defect data characteristics system of selection being combined based on improved adaptive GA-IAGA with Adaboost, to improve the stability of feature selection.
For achieving the above object, technical solution of the present invention mainly includes three below step:
A. the feature selection grader based on Adaboost is built
(1) concentrate acquisition software module data from software data, divide training set and test set.
(2) feature set is divided into T the proper subspace being sized to N, every sub spaces correspondence body one by one, feature is carried out binary coding, be mapped as 0-1 string. 0 represents and does not select this feature, and 1 represents and selects this feature, and each string be body one by one, T individuality one population of composition.
(3) sample weights initializes:
D1(i)=1/m formula (1)
Wherein, m is sample number, D1I () takes turns the weight in iteration for sample the 1st.
(4) it is circulated t=1,2 ..., T:
A. use the t sample distribution taken turns, train Weak Classifier ht, htQuality by False Rate ��tWeigh, ��tIt is all by the sample weights sum of misclassification:
��t=�� Dt(i)I[ht(xi)��yi] formula (2)
Wherein DtI () is sample (xi,yi) take turns the weight in iteration, I [h at tt(xi)��yi] illustrate to participate in False Rate ��tCalculate by the sample of misclassification.
B. the weight �� of then each Weak ClassifiertCan be used to weigh the importance of Weak Classifier.
α t = 1 2 l n ( 1 - ϵ t ϵ t ) Formula (3)
C. sample weights is updated: D t + 1 ( i ) = D t ( i ) Z t × e - α t , h t ( x i ) = y i e α t , h t ( x i ) ≠ y i Formula (4)
Wherein, ZtFor normalization factor, namely
Z t = Σ D t ( i ) e ( - α t y i ) h t ( x i ) Formula (5)
D. finally giving strong classifier is
H ( x ) = s i g n [ Σ t = 1 T α t h t ( x ) ] Formula (6)
B. genetic algorithm is adopted to be optimized
Adopt the genetic algorithm based on frequency thought that Adaboost is optimized: to carry out the screening of optimal characteristics in conjunction with feature selected frequency.
(1) select
Adopt the roulette selection with elite retention strategy to operate, optimum individual in population is directly selected into the next generation, then carries out roulette operation. In process, evaluate individual good and bad according to fitness function. In this optimization method, define fitness function by the Adaboost integrated detection recall rate (Recall) obtained and rate of false alarm (pf).Fitness function is:
f ( x ) = Re c a l l × p f = A A + B × C C + D Formula (7)
Wherein, recall rate (Recall), it is defined as the ratio being correctly predicted to be defective number of modules with true defective number of modules, is expressed as follows
Re c a l l = A A + B Formula (8)
Rate of false alarm (falsepositiverate, pf), also referred to as false sun rate. It is defined as the ratio that error prediction is defective module number and actual zero defect number of modules, is represented by
p f = C C + D Formula (9)
It is as shown in the table in A, B, C, D definition,
Table 1 classification predicts the outcome
Predict defective Prediction zero defect
Truly defective A B
True zero defect C D
(2) intersect, make a variation
Single-point crossover operator and single-point mutation operator is adopted to carry out intersection and the mutation operation of genetic algorithm.
(3) frequency screening
In the optimal solution that foundation single genetic algorithm optimization Adaboost obtains, the frequency that feature occurs, reconfigure and obtain optimal characteristics combination. Make F=(f1,f2��fp) represent and repeatedly run the optimal solution set that genetic algorithm obtains, fiRepresenting i-th optimal solution, p is the number of times that algorithm runs.
Formula (10)
Then, available formula (11) calculates jth feature selected frequency qj��
q j = Σ i = 1 p f i ( j ) p Formula (11)
When feature j selected frequency is less than certain threshold value, then this feature can not be selected into final character subset, otherwise, then it is added into final character subset. This threshold value obtains by experiment.
C. the foundation of feature selection module and test
By said process, carry out final screening according to feature selected frequency, thus obtaining optimal characteristics combination. Optimal feature subset according to obtaining is tested on software module data set, contrasts with other feature selection approach, verifies its stability in software defect prediction and accuracy rate, thus setting up software defect forecast model.
Accompanying drawing explanation
Fig. 1 is based on the software defect data characteristics system of selection flow chart that improved adaptive GA-IAGA is combined with Adaboost.
Detailed description of the invention
Below in conjunction with Fig. 1, the present invention is described in further detail.
The first step: build the feature selection grader based on Adaboost
(1) acquisition software module data are concentrated from software data, including software features collection, software module data. And software module data are divided into training set and test set in order to training and test. Do tag processes: software module data set { X, Y}, X={x1,x2��xm, Y={y1,y2}={+1 ,-1}. If software module xiZero defect, then (xi,yi)=(xi,-1), otherwise, (xi,yi)=(xi,+1)��
(2) feature set is divided into T the proper subspace being sized to N, every sub spaces correspondence body one by one, feature is carried out binary coding, be mapped as 0-1 string. 0 represents and does not select this feature, and 1 represents and selects this feature, and each string be body one by one, T individuality one population of composition.
(3) sample weights initializes:
D1(i)=1/m
Wherein, D1I () is sample (xi,yi) take turns the weight in iteration the 1st.
(4) it is circulated t=1,2 ..., T:
A. use the t sample distribution taken turns, train Weak Classifier ht, htQuality by False Rate ��tWeigh, ��tIt is all by the sample weights sum of misclassification:
ϵ t = Σ D t ( i ) I [ h t ( x i ) ≠ y i ]
Wherein DtI () is sample (xi,yi) take turns the weight in iteration, I [h at tt(xi)��yi] illustrate to participate in False Rate ��tCalculate by the sample of misclassification.
B. the weight �� of then each Weak ClassifiertCan be used to weigh the importance of Weak Classifier.
α t = 1 2 l n ( 1 - ϵ t ϵ t )
C. sample weights is updated: D t + 1 ( i ) = D t ( i ) Z t × e - α t , h t ( x i ) = y i e α t , h t ( x i ) ≠ y i
Wherein, ZtFor normalization factor, namely
Z t = Σ D t ( i ) e ( - α t y i ) h t ( x i )
D. finally giving strong classifier is:
H ( x ) = s i g n [ Σ t = 1 T α t h t ( x ) ]
Second step: adopt genetic algorithm to be optimized
The result that second step Adaboost strong classifier is obtained by the genetic algorithm based on frequency thought is adopted to be optimized: to carry out the screening of optimal characteristics in conjunction with feature selected frequency.
(1) select
Adopt the roulette selection with elite retention strategy to operate, optimum individual in population is directly selected into the next generation, then carries out roulette operation. In process, evaluate individual good and bad according to fitness function. In this optimization method, define fitness function by the Adaboost integrated detection recall rate (Recall) obtained and rate of false alarm (pf). Fitness function is:
f ( x ) = Re c a l l × p f = A A + B × C C + D
Wherein, recall rate (Recall), it is defined as the ratio being correctly predicted to be defective number of modules with true defective number of modules, is expressed as follows
Re c a l l = A A + B
Rate of false alarm (falsepositiverate, pf), also referred to as false sun rate. It is defined as the ratio that error prediction is defective module number and actual zero defect number of modules, is represented by
p f = C C + D
It is as shown in the table in A, B, C, D definition,
Table 1 classification predicts the outcome
Predict defective Prediction zero defect
Truly defective A B
True zero defect C D
(2) intersect, make a variation
Single-point crossover operator and single-point mutation operator is adopted to carry out intersection and the mutation operation of genetic algorithm.
(3) frequency screening
In the optimal solution that foundation single genetic algorithm optimization Adaboost obtains, the frequency that feature occurs, reconfigure and obtain optimal characteristics combination. Make F=(f1,f2��fp) represent and repeatedly run the optimal solution set that genetic algorithm obtains, fiRepresenting i-th optimal solution, p is the number of times that algorithm runs, it is stipulated that function is as follows
Then, available formula (11) calculates jth feature selected frequency qj��
q j = Σ i = 1 p f i ( j ) p
When feature j selected frequency is less than certain threshold value, then this feature can not be selected into final character subset, otherwise, then add final character subset. This threshold value obtains by experiment.
3rd step: by said process, carry out final screening according to feature selected frequency, thus obtaining optimal characteristics combination. Optimal feature subset according to obtaining is tested on software module data set, contrasts with other feature selection approach, verifies its stability in software defect prediction and accuracy rate.
The present invention can carry out higher-dimension software features dimensionality reduction, thus guiding software failure prediction, and there is good stability and higher accuracy rate.
The invention provides a kind of software defect data characteristics system of selection being combined based on improved adaptive GA-IAGA with Adaboost; should be understood that; for those skilled in the art; under the premise without departing from the principles of the invention; can also making some improvement, these improvement also should be regarded as protection scope of the present invention. Each ingredient not clear and definite in the present embodiment is used equally to prior art and is realized.

Claims (1)

1. the software defect data characteristics system of selection being combined with Adaboost based on improved adaptive GA-IAGA, it is characterised in that mainly include three below step:
A. the feature selection grader based on Adaboost is built
(1) acquisition software module data are concentrated from software data, including software features collection, software module data; And software module data are divided into training set and test set in order to training and test; Do tag processes: software module data set { X, Y}, X={x1, x2...xm, Y={y1, y2}={+1 ,-1}; If software module xiZero defect, then (xi, yi)=(xi,-1), otherwise, (xi, yi)=(xi,+1);
(2) feature set is divided into T the proper subspace being sized to N, every sub spaces correspondence body one by one, feature is carried out binary coding, be mapped as 0-1 string; 0 represents and does not select this feature, and 1 represents and selects this feature, and each string be body one by one, T individuality one population of composition;
(3) sample weights initializes:
D1(i)=1/m
Wherein, m is sample number, D1I () is sample (xi, yi) take turns the weight in iteration the 1st;
(4) it is circulated t=1,2 ..., T:
A. use the t sample distribution taken turns, train Weak Classifier ht, htQuality by False Rate ��tWeigh, ��tIt is all by the sample weights sum of misclassification:
ϵ t = Σ D t ( i ) I [ h t ( x i ) ≠ y i ]
Wherein DtI () is sample (xi, yi) take turns the weight in iteration, I [h at tt(xi)��yi] illustrate to participate in False Rate ��tCalculate by the sample of misclassification;
B. the weight �� of then each Weak ClassifiertCan be used to weigh the importance of Weak Classifier;
α t = 1 2 ln ( 1 - ϵ t ϵ t )
C. sample weights is updated: D t + 1 ( i ) = D t ( i ) Z t × e - α t , h t ( x i ) = y i e α t , h t ( x i ) ≠ y i
Wherein, ZtFor normalization factor, namely
Z t = Σ D t ( i ) e ( - α t y i ) h t ( x i )
D. finally giving strong classifier is
H ( x ) = sign [ Σ t = 1 T α t h t ( x ) ]
B. genetic algorithm is adopted to be optimized
Adopt the genetic algorithm based on frequency thought that Adaboost is optimized: to carry out the screening of optimal characteristics in conjunction with feature selected frequency;
(1) select
Adopt the roulette selection with elite retention strategy to operate, optimum individual in population is directly selected into the next generation, then carries out roulette operation; In process, evaluate individual good and bad according to fitness function; In this optimization method, defining fitness function by the Adaboost integrated detection recall rate (Recall) obtained and rate of false alarm (pf), fitness function is:
f ( x ) = Recall × pf = A A + B × C C + D
Wherein, recall rate (Recall), it is defined as the ratio being correctly predicted to be defective number of modules with true defective number of modules, is expressed as follows
Recall = A A + B
Rate of false alarm (falsepositiverate, pf), also referred to as false sun rate; It is defined as the ratio that error prediction is defective module number and actual zero defect number of modules, is represented by
pf = C C + D
It is as shown in the table in A, B, C, D definition,
Table 1 classification predicts the outcome
(2) intersect, make a variation
Single-point crossover operator and single-point mutation operator is adopted to carry out intersection and the mutation operation of genetic algorithm;
(3) frequency screening
In the optimal solution that foundation single genetic algorithm optimization Adaboost obtains, the frequency that feature occurs, reconfigure and obtain optimal characteristics combination; Make F=(f1, f2...fp) represent and repeatedly run the optimal solution set that genetic algorithm obtains, fiRepresenting i-th optimal solution, p is the number of times that algorithm runs, it is stipulated that function is as follows
Then, available formulaCalculate jth feature selected frequency qj; When feature j selected frequency is less than certain threshold value, then this feature can not be selected into final character subset, otherwise, then it is added into final character subset; This threshold value obtains by experiment;
C. the foundation of feature selection module and test
By said process, carry out final screening according to feature selected frequency, thus obtaining optimal characteristics combination; Optimal feature subset according to obtaining is tested on software module data set, contrasts with other feature selection approach, verifies its stability in software defect prediction and accuracy rate, thus setting up software defect forecast model.
CN201511003717.2A 2015-12-28 2015-12-28 Software defect data feature selection method based on combination of modified genetic algorithm and Adaboost Pending CN105653450A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201511003717.2A CN105653450A (en) 2015-12-28 2015-12-28 Software defect data feature selection method based on combination of modified genetic algorithm and Adaboost

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201511003717.2A CN105653450A (en) 2015-12-28 2015-12-28 Software defect data feature selection method based on combination of modified genetic algorithm and Adaboost

Publications (1)

Publication Number Publication Date
CN105653450A true CN105653450A (en) 2016-06-08

Family

ID=56478129

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201511003717.2A Pending CN105653450A (en) 2015-12-28 2015-12-28 Software defect data feature selection method based on combination of modified genetic algorithm and Adaboost

Country Status (1)

Country Link
CN (1) CN105653450A (en)

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106126413A (en) * 2016-06-16 2016-11-16 南通大学 Software Defects Predict Methods based on two benches packaging type feature selection
CN106778861A (en) * 2016-12-12 2017-05-31 齐鲁工业大学 A kind of screening technique of key feature
CN106960002A (en) * 2017-02-13 2017-07-18 上海大学 A kind of cross-cutting information extraction method of feature based model
CN107170442A (en) * 2017-05-11 2017-09-15 北京理工大学 Multi-parameters optimization method based on self-adapted genetic algorithm
CN107247666A (en) * 2017-05-24 2017-10-13 武汉大学 A kind of feature based selection and the software defect number Forecasting Methodology of integrated study
CN107391365A (en) * 2017-07-06 2017-11-24 武汉大学 A kind of hybrid characteristic selecting method of software-oriented failure prediction
CN108563565A (en) * 2018-04-08 2018-09-21 中国人民解放军海军工程大学 Flight landing guides system reliability Quantitative Analysis Model method for building up
CN109657795A (en) * 2018-12-12 2019-04-19 华中科技大学 A kind of hard disk failure prediction technique based on Attributions selection
CN109947652A (en) * 2019-03-26 2019-06-28 中山大学 A kind of improvement sequence learning method of software defect prediction
CN110288013A (en) * 2019-06-20 2019-09-27 杭州电子科技大学 A kind of defective labels recognition methods based on block segmentation and the multiple twin convolutional neural networks of input
CN110322968A (en) * 2019-06-24 2019-10-11 北京科技大学 A kind of feature selection approach and device of disease category medical data
CN111177010A (en) * 2019-12-31 2020-05-19 杭州电子科技大学 Software defect severity identification method
CN111639024A (en) * 2020-05-18 2020-09-08 四川新网银行股份有限公司 Software quality prediction method based on characteristic frequency data mining
CN112257963A (en) * 2020-11-20 2021-01-22 北京轩宇信息技术有限公司 Defect prediction method and device based on aerospace software defect data distribution outlier
CN112269732A (en) * 2020-10-14 2021-01-26 北京轩宇信息技术有限公司 Method and device for selecting software defect prediction characteristics
CN117558452A (en) * 2024-01-11 2024-02-13 北京大学人民医院 MODS risk assessment model construction method, device, equipment and medium

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106126413A (en) * 2016-06-16 2016-11-16 南通大学 Software Defects Predict Methods based on two benches packaging type feature selection
CN106126413B (en) * 2016-06-16 2019-02-19 南通大学 The Software Defects Predict Methods of packaging type feature selecting based on the study of class imbalance and genetic algorithm
CN106778861A (en) * 2016-12-12 2017-05-31 齐鲁工业大学 A kind of screening technique of key feature
CN106960002A (en) * 2017-02-13 2017-07-18 上海大学 A kind of cross-cutting information extraction method of feature based model
CN107170442A (en) * 2017-05-11 2017-09-15 北京理工大学 Multi-parameters optimization method based on self-adapted genetic algorithm
CN107247666B (en) * 2017-05-24 2020-03-10 武汉大学 Feature selection and integrated learning-based software defect number prediction method
CN107247666A (en) * 2017-05-24 2017-10-13 武汉大学 A kind of feature based selection and the software defect number Forecasting Methodology of integrated study
CN107391365A (en) * 2017-07-06 2017-11-24 武汉大学 A kind of hybrid characteristic selecting method of software-oriented failure prediction
CN107391365B (en) * 2017-07-06 2020-10-13 武汉大学 Mixed feature selection method oriented to software defect prediction
CN108563565A (en) * 2018-04-08 2018-09-21 中国人民解放军海军工程大学 Flight landing guides system reliability Quantitative Analysis Model method for building up
CN109657795A (en) * 2018-12-12 2019-04-19 华中科技大学 A kind of hard disk failure prediction technique based on Attributions selection
CN109657795B (en) * 2018-12-12 2022-06-21 华中科技大学 Hard disk failure prediction method based on attribute selection
CN109947652A (en) * 2019-03-26 2019-06-28 中山大学 A kind of improvement sequence learning method of software defect prediction
CN110288013A (en) * 2019-06-20 2019-09-27 杭州电子科技大学 A kind of defective labels recognition methods based on block segmentation and the multiple twin convolutional neural networks of input
CN110322968A (en) * 2019-06-24 2019-10-11 北京科技大学 A kind of feature selection approach and device of disease category medical data
CN111177010B (en) * 2019-12-31 2023-12-15 杭州电子科技大学 Software defect severity identification method
CN111177010A (en) * 2019-12-31 2020-05-19 杭州电子科技大学 Software defect severity identification method
CN111639024A (en) * 2020-05-18 2020-09-08 四川新网银行股份有限公司 Software quality prediction method based on characteristic frequency data mining
CN112269732A (en) * 2020-10-14 2021-01-26 北京轩宇信息技术有限公司 Method and device for selecting software defect prediction characteristics
CN112269732B (en) * 2020-10-14 2024-01-05 北京轩宇信息技术有限公司 Software defect prediction feature selection method and device
CN112257963A (en) * 2020-11-20 2021-01-22 北京轩宇信息技术有限公司 Defect prediction method and device based on aerospace software defect data distribution outlier
CN112257963B (en) * 2020-11-20 2023-08-29 北京轩宇信息技术有限公司 Defect prediction method and device based on spaceflight software defect data distribution outlier
CN117558452A (en) * 2024-01-11 2024-02-13 北京大学人民医院 MODS risk assessment model construction method, device, equipment and medium
CN117558452B (en) * 2024-01-11 2024-03-26 北京大学人民医院 MODS risk assessment model construction method, device, equipment and medium

Similar Documents

Publication Publication Date Title
CN105653450A (en) Software defect data feature selection method based on combination of modified genetic algorithm and Adaboost
CN105589806B (en) A kind of software defect tendency Forecasting Methodology based on SMOTE+Boosting algorithms
CN102707256B (en) Fault diagnosis method based on BP-Ada Boost nerve network for electric energy meter
CN106201871A (en) Based on the Software Defects Predict Methods that cost-sensitive is semi-supervised
CN107341497A (en) The unbalanced weighting data streams Ensemble classifier Forecasting Methodology of sampling is risen with reference to selectivity
Yu et al. Identifying source (s) of out-of-control signals in multivariate manufacturing processes using selective neural network ensemble
CN105893256A (en) Software failure positioning method based on machine learning algorithm
CN105677564A (en) Adaboost software defect unbalanced data classification method based on improvement
CN109242149A (en) A kind of student performance early warning method and system excavated based on educational data
CN116108758B (en) Landslide susceptibility evaluation method
CN104502103A (en) Bearing fault diagnosis method based on fuzzy support vector machine
CN103488869A (en) Wind power generation short-term load forecast method of least squares support vector machine
CN103810101A (en) Software defect prediction method and system
CN108416364A (en) Integrated study data classification method is merged in subpackage
CN110455512B (en) Rotary mechanical multi-integration fault diagnosis method based on depth self-encoder DAE
CN104536881A (en) Public testing error report priority sorting method based on natural language analysis
Khuat et al. Ensemble learning for software fault prediction problem with imbalanced data.
CN106681305A (en) Online fault diagnosing method for Fast RVM (relevance vector machine) sewage treatment
CN106529580A (en) EDSVM-based software defect data association classification method
CN105389598A (en) Feature selecting and classifying method for software defect data
CN104809476A (en) Multi-target evolutionary fuzzy rule classification method based on decomposition
CN110263934A (en) A kind of artificial intelligence data mask method and device
CN114266289A (en) Complex equipment health state assessment method
CN104318515A (en) Hyper-spectral image wave band dimension descending method based on NNIA evolutionary algorithm
Afzal Using faults-slip-through metric as a predictor of fault-proneness

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20160608