CN105653450A - Software defect data feature selection method based on combination of modified genetic algorithm and Adaboost - Google Patents
- Publication number
- CN105653450A
- Authority
- CN
- China
- Prior art keywords
- feature
- software
- adaboost
- genetic algorithm
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/36—Preventing errors by testing or debugging software
- G06F11/3604—Software analysis for verifying properties of programs
- G06F11/3608—Software analysis for verifying properties of programs using formal methods, e.g. model checking, abstract interpretation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
- G06F18/2148—Generating training patterns; Bootstrap methods, e.g. bagging or boosting characterised by the process organisation or structure, e.g. boosting cascade
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/12—Computing arrangements based on biological models using genetic models
- G06N3/126—Evolutionary algorithms, e.g. genetic algorithms or genetic programming
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Data Mining & Analysis (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Biophysics (AREA)
- Health & Medical Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Software Systems (AREA)
- Physiology (AREA)
- Genetics & Genomics (AREA)
- Biomedical Technology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Computer Hardware Design (AREA)
- Quality & Reliability (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present invention applies mainly to the field of software engineering. Addressing the randomness of existing feature selection methods for software defect data, it provides a software defect data feature selection method that combines a modified genetic algorithm with Adaboost. The main steps are as follows: first, software module data are acquired from a software data set and labeled; next, the feature space is divided and an Adaboost-based feature selection classifier is constructed, in which each feature subspace corresponds to one weak classifier; Adaboost is then optimized with a frequency-based genetic algorithm, in which the optimal features are screened according to how often each feature is selected; finally, the resulting optimal feature subset is tested on the data set and the method is compared with other feature selection methods to verify its stability and accuracy, thereby establishing a software defect prediction model. The method overcomes the randomness of the software defect data feature selection process relatively well and offers relatively good stability and relatively high accuracy.
Description
Technical field
The invention belongs to the field of software engineering, and specifically relates to a software defect data feature selection method based on the combination of a modified genetic algorithm and Adaboost.
Background art
With the rapid development of information technology and the storage of data on a massive scale, every industry urgently needs to convert data into knowledge. Discovering valuable information in data has become a focus of current theoretical and applied research. Data mining (DM) merges statistics, databases, machine learning, and visualization techniques; by analyzing historical data it discovers unknown and novel knowledge, offering an effective way to address the information-age problem of "data explosion, information poverty". Feature selection and classification are the most common tasks in DM: feature selection reduces the dimensionality of high-dimensional data, and classification builds a prediction model that can provide accurate predictions for unknown problems.
As software systems grow in scale and logical complexity, the defects in software inevitably threaten its reliability and affect its quality. Because every industry depends heavily on software systems, software faults can have serious consequences, which may even be fatal for high-risk systems. Software defect prediction is an important means of guiding and assessing software testing, and accurately predicting the distribution of software defects is of great significance for guiding testing work. For a software system, sound defect prediction can estimate the number and distribution of defects that have not yet been found but still exist. This not only helps developers direct limited effort and resources to the modules most prone to errors, saving substantial labor cost and resources, but also allows test results to be evaluated objectively, which matters greatly for controlling software quality, development cost, and the development cycle.
The key to software defect prediction is finding the abnormal modules. This can be regarded as a binary classification problem in which software modules are divided into the two classes "normal" and "abnormal". Classification presupposes feature selection and is carried out on the selected optimal feature subset. In practice, however, feature selection for software defect data faces the following problems:
(1) A large number of irrelevant and redundant software features exist
In each class of software features, only the basic features are extracted directly from the source code; the other features are all computed from these basic feature values. Features within the same class are therefore strongly correlated, and many are redundant. When many redundant or irrelevant features take part in the computation, they bring about the curse of dimensionality, so the feature set must be reduced in dimension and an optimal feature subset selected.
(2) Selection results are random
Existing feature selection techniques classify with a single classifier and use its classification results to judge the feature selection result. Because the quality of a feature is judged only by the results of a single classifier, the optimal feature subset obtained varies randomly from run to run.
In summary, with internet software products developing rapidly, proposing an efficient software defect prediction model and correctly detecting anomalies in software are problems that urgently need solving. Their prerequisite is selecting an optimal feature subset, which is the basis of and key to guiding accurate anomaly detection.
Summary of the invention
The purpose of the present invention is to solve the randomness problem of traditional software defect data feature selection methods by providing a software defect data feature selection method based on the combination of a modified genetic algorithm and Adaboost, so as to improve the stability of feature selection.
To achieve this purpose, the technical solution of the present invention mainly comprises the following three steps:
A. Build the Adaboost-based feature selection classifier
(1) Acquire software module data from a software data set and divide them into a training set and a test set.
(2) Divide the feature set into T feature subspaces of size N, each subspace corresponding to one individual. Encode the features in binary as a 0-1 string, where 0 means the feature is not selected and 1 means it is selected. Each string is one individual, and the T individuals form a population.
(3) Initialize the sample weights:
$$D_1(i) = \frac{1}{m} \tag{1}$$
where m is the number of samples and $D_1(i)$ is the weight of sample i in the 1st round of iteration.
(4) Loop for t = 1, 2, ..., T:
a. Using the sample distribution of round t, train the weak classifier $h_t$. The quality of $h_t$ is measured by its error rate $\varepsilon_t$, which is the sum of the weights of all misclassified samples:
$$\varepsilon_t = \sum_{i=1}^{m} D_t(i)\, I[h_t(x_i) \neq y_i] \tag{2}$$
where $D_t(i)$ is the weight of sample $(x_i, y_i)$ in round t, and the indicator $I[h_t(x_i) \neq y_i]$ means that only misclassified samples contribute to $\varepsilon_t$.
b. Compute the weight $\alpha_t$ of each weak classifier, which measures the importance of that weak classifier.
c. Update the sample weights according to formula (4),
where $Z_t$ is the normalization factor.
d. Finally, obtain the strong classifier.
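As a reference for steps b to d, the textbook AdaBoost expressions that fit the symbols defined above are sketched below. That the method uses exactly these forms is an assumption; the text here does not reproduce them.

```latex
\begin{align}
\alpha_t &= \frac{1}{2}\ln\frac{1-\varepsilon_t}{\varepsilon_t} \\
D_{t+1}(i) &= \frac{D_t(i)\,\exp\bigl(-\alpha_t\, y_i\, h_t(x_i)\bigr)}{Z_t} \\
Z_t &= \sum_{i=1}^{m} D_t(i)\,\exp\bigl(-\alpha_t\, y_i\, h_t(x_i)\bigr) \\
H(x) &= \operatorname{sign}\Bigl(\sum_{t=1}^{T} \alpha_t\, h_t(x)\Bigr)
\end{align}
```

Under these forms, a weak classifier with a smaller error rate receives a larger weight, and misclassified samples gain weight in the next round.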
B. Optimize with the genetic algorithm
Adaboost is optimized with a genetic algorithm based on the frequency idea: the optimal features are screened according to how often each feature is selected.
(1) Selection
Roulette-wheel selection with an elite retention strategy is used: the best individual in the population is copied directly into the next generation, and roulette-wheel selection is then applied. During this process, individuals are ranked by a fitness function. In this optimization method, the fitness function is defined from the recall (Recall) and the false alarm rate (pf) obtained by the Adaboost ensemble.
Recall is defined as the ratio of the number of modules correctly predicted as defective to the number of truly defective modules, expressed as $Recall = A / (A + B)$.
The false alarm rate (false positive rate, pf) is defined as the ratio of the number of modules wrongly predicted as defective to the number of actually defect-free modules, expressed as $pf = C / (C + D)$.
A, B, C, and D are defined in Table 1.
Table 1. Classification prediction results
Actual class | Predicted defective | Predicted defect-free |
---|---|---|
Actually defective | A | B |
Actually defect-free | C | D |
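The two rates can be computed directly from the four counts in Table 1. The sketch below is illustrative only: the function names are hypothetical, and `illustrative_fitness` is a placeholder assumption, since the fitness formula itself is not reproduced in this text.

```python
def recall_and_pf(a: int, b: int, c: int, d: int) -> tuple[float, float]:
    """Compute recall and false alarm rate from the Table 1 counts.

    a: actually defective, predicted defective
    b: actually defective, predicted defect-free
    c: actually defect-free, predicted defective
    d: actually defect-free, predicted defect-free
    """
    recall = a / (a + b) if (a + b) else 0.0
    pf = c / (c + d) if (c + d) else 0.0
    return recall, pf


def illustrative_fitness(recall: float, pf: float) -> float:
    # ASSUMPTION: placeholder only, not the fitness function of the method.
    # It rewards high recall and a low false alarm rate.
    return recall * (1.0 - pf)
```

For example, `recall_and_pf(40, 10, 5, 45)` describes 50 defective modules of which 40 are found, and 50 defect-free modules of which 5 are flagged by mistake.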
(2) Crossover and mutation
A single-point crossover operator and a single-point mutation operator carry out the crossover and mutation operations of the genetic algorithm.
(3) Frequency screening
The optimal feature combination is obtained by recombining features according to how often each feature appears in the optimal solutions produced by individual runs of the genetic-algorithm-optimized Adaboost. Let $F = (f_1, f_2, \ldots, f_p)$ denote the set of optimal solutions obtained by running the genetic algorithm repeatedly, where $f_i$ is the i-th optimal solution and p is the number of runs. A function is defined by
formula (10).
Formula (11) then gives the frequency $q_j$ with which the j-th feature is selected.
If the selection frequency of feature j is below a certain threshold, the feature is excluded from the final feature subset; otherwise it is added to the final feature subset. The threshold is determined experimentally.
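The frequency screening step can be sketched as follows. This is a minimal illustration that assumes each optimal solution is a 0-1 list of equal length and that the frequency of a feature is the fraction of runs in which it is selected; the function names and the threshold value in the usage note are hypothetical.

```python
def selection_frequency(solutions: list[list[int]]) -> list[float]:
    """Return q_j for every feature j, given the p optimal 0-1 strings."""
    p = len(solutions)
    n_features = len(solutions[0])
    return [sum(sol[j] for sol in solutions) / p for j in range(n_features)]


def screen_features(solutions: list[list[int]], threshold: float) -> list[int]:
    """Keep the indices of features whose frequency is not below the threshold."""
    freqs = selection_frequency(solutions)
    return [j for j, q in enumerate(freqs) if q >= threshold]
```

With four runs returning `[1,0,1,0]`, `[1,1,0,0]`, `[1,0,1,0]`, and `[1,0,0,1]`, the first feature is selected in every run and the third in half of them, so a threshold of 0.5 keeps features 0 and 2.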
C. Build and test the feature selection model
Through the above process, a final screening based on feature selection frequency yields the optimal feature combination. The optimal feature subset is tested on the software module data set and compared with other feature selection methods to verify its stability and accuracy in software defect prediction, thereby establishing the software defect prediction model.
Brief description of the drawings
Fig. 1 is a flow chart of the software defect data feature selection method based on the combination of a modified genetic algorithm and Adaboost.
Detailed description of the embodiments
The present invention is described in further detail below with reference to Fig. 1.
Step 1: Build the Adaboost-based feature selection classifier
(1) Acquire software module data from a software data set, including the software feature set and the software module data, and divide the software module data into a training set and a test set for training and testing. Label the data: for the software module data set {X, Y}, $X = \{x_1, x_2, \ldots, x_m\}$ and $Y = \{y_1, y_2\} = \{+1, -1\}$. If software module $x_i$ is defect-free, then $(x_i, y_i) = (x_i, -1)$; otherwise, $(x_i, y_i) = (x_i, +1)$.
(2) Divide the feature set into T feature subspaces of size N, each subspace corresponding to one individual. Encode the features in binary as a 0-1 string, where 0 means the feature is not selected and 1 means it is selected. Each string is one individual, and the T individuals form a population.
(3) Initialize the sample weights:
$$D_1(i) = \frac{1}{m}$$
where $D_1(i)$ is the weight of sample $(x_i, y_i)$ in the 1st round of iteration.
(4) Loop for t = 1, 2, ..., T:
a. Using the sample distribution of round t, train the weak classifier $h_t$. The quality of $h_t$ is measured by its error rate $\varepsilon_t$, which is the sum of the weights of all misclassified samples:
$$\varepsilon_t = \sum_{i=1}^{m} D_t(i)\, I[h_t(x_i) \neq y_i]$$
where $D_t(i)$ is the weight of sample $(x_i, y_i)$ in round t, and the indicator $I[h_t(x_i) \neq y_i]$ means that only misclassified samples contribute to $\varepsilon_t$.
b. Compute the weight $\alpha_t$ of each weak classifier, which measures the importance of that weak classifier.
c. Update the sample weights,
where $Z_t$ is the normalization factor.
d. Finally, obtain the strong classifier.
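The loop of Step 1 can be sketched as below, with one weak classifier trained per feature subspace. Several choices here are assumptions rather than part of the method as described: the weak learner is a weighted decision stump, the expressions for $\alpha_t$ and the weight update are the textbook AdaBoost ones, NumPy is used for array handling, and all function names are hypothetical.

```python
import numpy as np


def stump_predict(X, j, thr, sign):
    """Predict +1/-1 by thresholding feature j."""
    return np.where(X[:, j] >= thr, sign, -sign)


def train_stump(X, y, w, feature_ids):
    """Find the stump with the lowest weighted error inside one subspace."""
    best = None
    for j in feature_ids:
        for thr in np.unique(X[:, j]):
            for sign in (1, -1):
                pred = stump_predict(X, j, thr, sign)
                err = float(np.sum(w[pred != y]))  # epsilon_t
                if best is None or err < best[0]:
                    best = (err, j, thr, sign)
    return best


def adaboost_over_subspaces(X, y, subspaces):
    """Run one boosting round per feature subspace (t = 1..T)."""
    m = X.shape[0]
    w = np.full(m, 1.0 / m)  # D_1(i) = 1/m
    ensemble = []
    for feature_ids in subspaces:
        err, j, thr, sign = train_stump(X, y, w, feature_ids)
        err = min(max(err, 1e-10), 1 - 1e-10)  # avoid log(0)
        alpha = 0.5 * np.log((1 - err) / err)  # textbook alpha_t (assumed)
        pred = stump_predict(X, j, thr, sign)
        w = w * np.exp(-alpha * y * pred)  # raise weights of mistakes
        w /= w.sum()  # division by Z_t
        ensemble.append((alpha, j, thr, sign))
    return ensemble


def predict(ensemble, X):
    """Strong classifier: sign of the alpha-weighted vote."""
    score = sum(a * stump_predict(X, j, thr, s) for a, j, thr, s in ensemble)
    return np.where(score >= 0, 1, -1)
```

Here `subspaces` is a list of feature index lists, for example `[[0], [1]]` for two single-feature subspaces, and `y` holds the labels +1 (defective) and -1 (defect-free) from the labeling step.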
Step 2: Optimize with the genetic algorithm
The result obtained by the Adaboost strong classifier is optimized with a genetic algorithm based on the frequency idea: the optimal features are screened according to how often each feature is selected.
(1) Selection
Roulette-wheel selection with an elite retention strategy is used: the best individual in the population is copied directly into the next generation, and roulette-wheel selection is then applied. During this process, individuals are ranked by a fitness function. In this optimization method, the fitness function is defined from the recall (Recall) and the false alarm rate (pf) obtained by the Adaboost ensemble.
Recall is defined as the ratio of the number of modules correctly predicted as defective to the number of truly defective modules, expressed as $Recall = A / (A + B)$.
The false alarm rate (false positive rate, pf) is defined as the ratio of the number of modules wrongly predicted as defective to the number of actually defect-free modules, expressed as $pf = C / (C + D)$.
A, B, C, and D are defined in Table 1.
Table 1. Classification prediction results
Actual class | Predicted defective | Predicted defect-free |
---|---|---|
Actually defective | A | B |
Actually defect-free | C | D |
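The selection operation described in (1) can be sketched as follows. This is an illustration under stated assumptions: fitness values are non-negative with a positive sum, the population size stays constant, and the function name and the explicit random generator argument are hypothetical.

```python
import random


def select_with_elitism(population, fitnesses, rng):
    """Copy the fittest individual, then fill the rest by roulette wheel."""
    elite_idx = max(range(len(population)), key=fitnesses.__getitem__)
    next_gen = [list(population[elite_idx])]  # elite retention
    # Roulette wheel: pick probability proportional to fitness.
    picks = rng.choices(population, weights=fitnesses, k=len(population) - 1)
    next_gen.extend(list(ind) for ind in picks)
    return next_gen
```

Passing a seeded `random.Random` instance as `rng` makes a run repeatable, which helps when comparing the feature subsets returned by separate runs.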
(2) Crossover and mutation
A single-point crossover operator and a single-point mutation operator carry out the crossover and mutation operations of the genetic algorithm.
(3) Frequency screening
The optimal feature combination is obtained by recombining features according to how often each feature appears in the optimal solutions produced by individual runs of the genetic-algorithm-optimized Adaboost. Let $F = (f_1, f_2, \ldots, f_p)$ denote the set of optimal solutions obtained by running the genetic algorithm repeatedly, where $f_i$ is the i-th optimal solution and p is the number of runs, and let a function be defined by formula (10).
Formula (11) then gives the frequency $q_j$ with which the j-th feature is selected.
If the selection frequency of feature j is below a certain threshold, the feature is excluded from the final feature subset; otherwise it is added to the final feature subset. The threshold is determined experimentally.
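The two operators can be sketched on 0-1 strings as below. The function names are hypothetical, individuals are assumed to be lists of 0 and 1 with at least two positions, and crossover and mutation probabilities are left out because the text does not state them.

```python
import random


def single_point_crossover(parent1, parent2, rng):
    """Swap the tails of two parents after one random cut point."""
    point = rng.randrange(1, len(parent1))  # cut strictly inside the string
    child1 = parent1[:point] + parent2[point:]
    child2 = parent2[:point] + parent1[point:]
    return child1, child2


def single_point_mutation(individual, rng):
    """Flip exactly one randomly chosen bit and return a new individual."""
    pos = rng.randrange(len(individual))
    child = list(individual)
    child[pos] = 1 - child[pos]
    return child
```

Flipping a bit toggles whether the corresponding feature is selected, so a single mutation adds or removes exactly one feature from the individual.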
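One plausible reading of formulas (10) and (11), consistent with the definitions of $F$, $f_i$, $p$, and $q_j$ above, is sketched here. The indicator name $\delta$ and the exact form of both expressions are assumptions, since the formulas themselves are not reproduced in this text.

```latex
\begin{align}
\delta(f_i, j) &=
  \begin{cases}
    1, & \text{if feature } j \text{ is selected in } f_i \\
    0, & \text{otherwise}
  \end{cases} \\
q_j &= \frac{1}{p}\sum_{i=1}^{p} \delta(f_i, j)
\end{align}
```

Under this reading $q_j$ lies between 0 and 1, so the experimentally determined threshold would be a fraction of the p runs.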
Step 3: Through the above process, a final screening based on feature selection frequency yields the optimal feature combination. The optimal feature subset is tested on the software module data set and compared with other feature selection methods to verify its stability and accuracy in software defect prediction.
The present invention can reduce the dimensionality of high-dimensional software features and thereby guide software defect prediction, with good stability and relatively high accuracy.
The present invention provides a software defect data feature selection method based on the combination of a modified genetic algorithm and Adaboost. It should be understood that those skilled in the art may make improvements without departing from the principles of the invention, and such improvements shall also be regarded as falling within the protection scope of the invention. Any component not specified in this embodiment can be implemented with existing technology.
Claims (1)
1. A software defect data feature selection method based on the combination of a modified genetic algorithm and Adaboost, characterized in that it mainly comprises the following three steps:
A. Build the Adaboost-based feature selection classifier
(1) Acquire software module data from a software data set, including the software feature set and the software module data; divide the software module data into a training set and a test set for training and testing; label the data: for the software module data set {X, Y}, $X = \{x_1, x_2, \ldots, x_m\}$ and $Y = \{y_1, y_2\} = \{+1, -1\}$; if software module $x_i$ is defect-free, then $(x_i, y_i) = (x_i, -1)$; otherwise, $(x_i, y_i) = (x_i, +1)$;
(2) Divide the feature set into T feature subspaces of size N, each subspace corresponding to one individual; encode the features in binary as a 0-1 string, where 0 means the feature is not selected and 1 means it is selected; each string is one individual, and the T individuals form a population;
(3) Initialize the sample weights:
$$D_1(i) = \frac{1}{m}$$
where m is the number of samples and $D_1(i)$ is the weight of sample $(x_i, y_i)$ in the 1st round of iteration;
(4) Loop for t = 1, 2, ..., T:
a. Using the sample distribution of round t, train the weak classifier $h_t$; the quality of $h_t$ is measured by its error rate $\varepsilon_t$, which is the sum of the weights of all misclassified samples:
$$\varepsilon_t = \sum_{i=1}^{m} D_t(i)\, I[h_t(x_i) \neq y_i]$$
where $D_t(i)$ is the weight of sample $(x_i, y_i)$ in round t, and the indicator $I[h_t(x_i) \neq y_i]$ means that only misclassified samples contribute to $\varepsilon_t$;
b. Compute the weight $\alpha_t$ of each weak classifier, which measures the importance of that weak classifier;
c. Update the sample weights,
where $Z_t$ is the normalization factor;
d. Finally, obtain the strong classifier;
B. Optimize with the genetic algorithm
Adaboost is optimized with a genetic algorithm based on the frequency idea: the optimal features are screened according to how often each feature is selected;
(1) Selection
Roulette-wheel selection with an elite retention strategy is used: the best individual in the population is copied directly into the next generation, and roulette-wheel selection is then applied; during this process, individuals are ranked by a fitness function; in this optimization method, the fitness function is defined from the recall (Recall) and the false alarm rate (pf) obtained by the Adaboost ensemble;
Recall is defined as the ratio of the number of modules correctly predicted as defective to the number of truly defective modules, expressed as $Recall = A / (A + B)$;
The false alarm rate (false positive rate, pf) is defined as the ratio of the number of modules wrongly predicted as defective to the number of actually defect-free modules, expressed as $pf = C / (C + D)$;
A, B, C, and D are defined in Table 1;
Table 1. Classification prediction results
Actual class | Predicted defective | Predicted defect-free |
---|---|---|
Actually defective | A | B |
Actually defect-free | C | D |
(2) Crossover and mutation
A single-point crossover operator and a single-point mutation operator carry out the crossover and mutation operations of the genetic algorithm;
(3) Frequency screening
The optimal feature combination is obtained by recombining features according to how often each feature appears in the optimal solutions produced by individual runs of the genetic-algorithm-optimized Adaboost; let $F = (f_1, f_2, \ldots, f_p)$ denote the set of optimal solutions obtained by running the genetic algorithm repeatedly, where $f_i$ is the i-th optimal solution and p is the number of runs, and let a function be defined accordingly;
A formula is then used to calculate the frequency $q_j$ with which the j-th feature is selected; if the selection frequency of feature j is below a certain threshold, the feature is excluded from the final feature subset; otherwise it is added to the final feature subset; the threshold is determined experimentally;
C. Build and test the feature selection model
Through the above process, a final screening based on feature selection frequency yields the optimal feature combination; the optimal feature subset is tested on the software module data set and compared with other feature selection methods to verify its stability and accuracy in software defect prediction, thereby establishing the software defect prediction model.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201511003717.2A CN105653450A (en) | 2015-12-28 | 2015-12-28 | Software defect data feature selection method based on combination of modified genetic algorithm and Adaboost |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201511003717.2A CN105653450A (en) | 2015-12-28 | 2015-12-28 | Software defect data feature selection method based on combination of modified genetic algorithm and Adaboost |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105653450A true CN105653450A (en) | 2016-06-08 |
Family
ID=56478129
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201511003717.2A Pending CN105653450A (en) | 2015-12-28 | 2015-12-28 | Software defect data feature selection method based on combination of modified genetic algorithm and Adaboost |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105653450A (en) |
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106126413A (en) * | 2016-06-16 | 2016-11-16 | 南通大学 | Software Defects Predict Methods based on two benches packaging type feature selection |
CN106126413B (en) * | 2016-06-16 | 2019-02-19 | 南通大学 | The Software Defects Predict Methods of packaging type feature selecting based on the study of class imbalance and genetic algorithm |
CN106778861A (en) * | 2016-12-12 | 2017-05-31 | 齐鲁工业大学 | A kind of screening technique of key feature |
CN106960002A (en) * | 2017-02-13 | 2017-07-18 | 上海大学 | A kind of cross-cutting information extraction method of feature based model |
CN107170442A (en) * | 2017-05-11 | 2017-09-15 | 北京理工大学 | Multi-parameters optimization method based on self-adapted genetic algorithm |
CN107247666B (en) * | 2017-05-24 | 2020-03-10 | 武汉大学 | Feature selection and integrated learning-based software defect number prediction method |
CN107247666A (en) * | 2017-05-24 | 2017-10-13 | 武汉大学 | A kind of feature based selection and the software defect number Forecasting Methodology of integrated study |
CN107391365A (en) * | 2017-07-06 | 2017-11-24 | 武汉大学 | A kind of hybrid characteristic selecting method of software-oriented failure prediction |
CN107391365B (en) * | 2017-07-06 | 2020-10-13 | 武汉大学 | Mixed feature selection method oriented to software defect prediction |
CN108563565A (en) * | 2018-04-08 | 2018-09-21 | 中国人民解放军海军工程大学 | Flight landing guides system reliability Quantitative Analysis Model method for building up |
CN109657795A (en) * | 2018-12-12 | 2019-04-19 | 华中科技大学 | A kind of hard disk failure prediction technique based on Attributions selection |
CN109657795B (en) * | 2018-12-12 | 2022-06-21 | 华中科技大学 | Hard disk failure prediction method based on attribute selection |
CN109947652A (en) * | 2019-03-26 | 2019-06-28 | 中山大学 | A kind of improvement sequence learning method of software defect prediction |
CN110288013A (en) * | 2019-06-20 | 2019-09-27 | 杭州电子科技大学 | A kind of defective labels recognition methods based on block segmentation and the multiple twin convolutional neural networks of input |
CN110322968A (en) * | 2019-06-24 | 2019-10-11 | 北京科技大学 | A kind of feature selection approach and device of disease category medical data |
CN111177010B (en) * | 2019-12-31 | 2023-12-15 | 杭州电子科技大学 | Software defect severity identification method |
CN111177010A (en) * | 2019-12-31 | 2020-05-19 | 杭州电子科技大学 | Software defect severity identification method |
CN111639024A (en) * | 2020-05-18 | 2020-09-08 | 四川新网银行股份有限公司 | Software quality prediction method based on characteristic frequency data mining |
CN112269732A (en) * | 2020-10-14 | 2021-01-26 | 北京轩宇信息技术有限公司 | Method and device for selecting software defect prediction characteristics |
CN112269732B (en) * | 2020-10-14 | 2024-01-05 | 北京轩宇信息技术有限公司 | Software defect prediction feature selection method and device |
CN112257963A (en) * | 2020-11-20 | 2021-01-22 | 北京轩宇信息技术有限公司 | Defect prediction method and device based on aerospace software defect data distribution outlier |
CN112257963B (en) * | 2020-11-20 | 2023-08-29 | 北京轩宇信息技术有限公司 | Defect prediction method and device based on spaceflight software defect data distribution outlier |
CN117558452A (en) * | 2024-01-11 | 2024-02-13 | 北京大学人民医院 | MODS risk assessment model construction method, device, equipment and medium |
CN117558452B (en) * | 2024-01-11 | 2024-03-26 | 北京大学人民医院 | MODS risk assessment model construction method, device, equipment and medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105653450A (en) | Software defect data feature selection method based on combination of modified genetic algorithm and Adaboost | |
CN105589806B (en) | A kind of software defect tendency Forecasting Methodology based on SMOTE+Boosting algorithms | |
CN102707256B (en) | Fault diagnosis method based on BP-Ada Boost nerve network for electric energy meter | |
CN106201871A (en) | Based on the Software Defects Predict Methods that cost-sensitive is semi-supervised | |
CN107341497A (en) | The unbalanced weighting data streams Ensemble classifier Forecasting Methodology of sampling is risen with reference to selectivity | |
Yu et al. | Identifying source (s) of out-of-control signals in multivariate manufacturing processes using selective neural network ensemble | |
CN105893256A (en) | Software failure positioning method based on machine learning algorithm | |
CN105677564A (en) | Adaboost software defect unbalanced data classification method based on improvement | |
CN109242149A (en) | A kind of student performance early warning method and system excavated based on educational data | |
CN116108758B (en) | Landslide susceptibility evaluation method | |
CN104502103A (en) | Bearing fault diagnosis method based on fuzzy support vector machine | |
CN103488869A (en) | Wind power generation short-term load forecast method of least squares support vector machine | |
CN103810101A (en) | Software defect prediction method and system | |
CN108416364A (en) | Integrated study data classification method is merged in subpackage | |
CN110455512B (en) | Rotary mechanical multi-integration fault diagnosis method based on depth self-encoder DAE | |
CN104536881A (en) | Public testing error report priority sorting method based on natural language analysis | |
Khuat et al. | Ensemble learning for software fault prediction problem with imbalanced data. | |
CN106681305A (en) | Online fault diagnosing method for Fast RVM (relevance vector machine) sewage treatment | |
CN106529580A (en) | EDSVM-based software defect data association classification method | |
CN105389598A (en) | Feature selecting and classifying method for software defect data | |
CN104809476A (en) | Multi-target evolutionary fuzzy rule classification method based on decomposition | |
CN110263934A (en) | A kind of artificial intelligence data mask method and device | |
CN114266289A (en) | Complex equipment health state assessment method | |
CN104318515A (en) | Hyper-spectral image wave band dimension descending method based on NNIA evolutionary algorithm | |
Afzal | Using faults-slip-through metric as a predictor of fault-proneness |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
WD01 | Invention patent application deemed withdrawn after publication | Application publication date: 20160608 |