CN110232023B

CN110232023B - Software defect positioning method, device and medium based on convolutional neural network

Info

Publication number: CN110232023B
Application number: CN201910429282.XA
Authority: CN
Inventors: 宋元章; 李洪雨; 陈媛; 王俊杰; 王安邦
Original assignee: Changchun Institute of Optics Fine Mechanics and Physics of CAS
Current assignee: Changchun Institute of Optics Fine Mechanics and Physics of CAS
Priority date: 2019-05-22
Filing date: 2019-05-22
Publication date: 2021-07-06
Anticipated expiration: 2039-05-22
Also published as: CN110232023A

Abstract

The embodiment of the invention discloses a method, a device and a computer readable storage medium for positioning software defects based on a convolutional neural network, which are used for calling a preset test case to test a program to be tested and generating one-dimensional statement coverage information of the test case and an execution result of the test case; converting the one-dimensional statement coverage information of the test case into two-dimensional statement coverage information according to a set conversion rule; training the initial convolutional neural network by taking the two-dimensional statement coverage information and the corresponding execution result as sample data to obtain a convolutional neural network meeting the preset requirement; the incidence relation between the one-dimensional statement coverage information and the execution result of the test case is fully excavated through the convolutional neural network, the accuracy of the convolutional neural network in positioning software defects is effectively improved, the program to be tested is processed according to the convolutional neural network, the suspicious value of each statement in the program to be tested is obtained, and the predicted suspicious value is more accurate.

Description

Software defect positioning method, device and medium based on convolutional neural network

Technical Field

The invention relates to the technical field of software testing, in particular to a method and a device for positioning software defects based on a convolutional neural network and a computer readable storage medium.

Background

Software bug fixes are intended to detect and locate error codes that cause software failures. The traditional debugging method for manually setting the breakpoint has the disadvantages of difficult breakpoint position selection and huge time overhead. Therefore, it is a common pursuit of the software academia and the industry to realize the automation of defect localization. In recent years, researchers have tried to propose a series of methods for assisting in automated defect localization from different perspectives, including a slice-based method, a program-invariant-based method, a model inspection method, a program-spectrum-based method, and the like.

Compared with a slicing-based method, a program invariant-based method and a model checking method, a spectrum-based defect localization (SFL) method is an important method which is effective because the characteristics of no need of considering the internal structure of a program and low execution overhead are provided. The SFL method mainly estimates the error possibility of program entities (such as sentences, predicates and the like) by comparing and analyzing program spectrum information of a tested program in successful execution and failure execution and constructing a corresponding suspicion degree calculation formula. The accuracy of software defect positioning cannot reach high due to the limitation of a suspicious degree calculation formula.

Therefore, how to improve the accuracy of software defect positioning is a problem to be solved urgently by those skilled in the art.

Disclosure of Invention

The embodiment of the invention aims to provide a method and a device for positioning software defects based on a convolutional neural network and a computer readable storage medium, which can improve the accuracy of positioning the software defects.

In order to solve the above technical problem, an embodiment of the present invention provides a method for locating a software defect based on a convolutional neural network, including:

the embodiment of the invention also provides a software defect positioning method based on the convolutional neural network, which comprises the following steps:

calling a preset test case to test a program to be tested, and generating one-dimensional statement coverage information of the test case and an execution result of the test case;

converting the one-dimensional statement coverage information of the test case into two-dimensional statement coverage information according to a set conversion rule;

training an initial convolutional neural network by taking the two-dimensional statement coverage information and the corresponding execution result as sample data to obtain a convolutional neural network meeting a preset requirement;

and processing the program to be tested according to the convolutional neural network to obtain the suspicious value of each statement in the program to be tested.

Optionally, the converting the one-dimensional statement coverage information of the test case into the two-dimensional statement coverage information according to the set conversion rule includes:

when the one-dimensional statement coverage information of the test case is dense data, converting the one-dimensional statement coverage information of the test case into two-dimensional statement coverage information according to the corresponding relation between a function and the one-dimensional statement coverage information of the test case;

and when the one-dimensional statement coverage information of the test case is sparse data, randomly setting the one-dimensional statement coverage information of the test case in a two-dimensional list to obtain two-dimensional statement coverage information.

Optionally, the training the initial convolutional neural network with the two-dimensional statement coverage information and the corresponding execution result as sample data to obtain a convolutional neural network meeting a preset requirement includes:

dividing the sample data into a training sample set and a testing sample set;

training the initial convolutional neural network by using the training sample set to obtain a trained convolutional neural network;

inputting a target test sample into the convolutional neural network to obtain an output result; the target test sample is any one of the test samples in the test sample set which are not tested;

judging whether the deviation value of the output result and the execution result of the test sample is within a preset range or not;

if not, returning to the step of training the initial convolutional neural network by using the training sample set to obtain a trained convolutional neural network;

if so, selecting an untested test sample from the test sample set as a target test sample, returning to the step of inputting the target test sample into the convolutional neural network to obtain an output result, and taking the convolutional neural network as the convolutional neural network meeting the preset requirement until all the test samples in the test sample set are tested.

Optionally, the processing the program to be tested according to the convolutional neural network to obtain a suspicious value of each statement of the program to be tested includes:

constructing a virtual test case corresponding to each statement in the program to be tested;

converting the one-dimensional statement coverage information of the virtual test case into two-dimensional statement coverage information according to the conversion rule;

and inputting the two-dimensional statement coverage information into the convolutional neural network to obtain the suspicious value of each statement of the program to be tested.

The embodiment of the invention also provides a software defect positioning device based on the convolutional neural network, which comprises a generating unit, a converting unit, a training unit and an evaluating unit;

the generating unit is used for calling a preset test case to test the program to be tested and generating one-dimensional statement coverage information of the test case and an execution result of the test case;

the conversion unit is used for converting the one-dimensional statement coverage information of the test case into two-dimensional statement coverage information according to a set conversion rule;

the training unit is used for training an initial convolutional neural network by taking the two-dimensional statement coverage information and the corresponding execution result as sample data to obtain a convolutional neural network meeting a preset requirement;

and the evaluation unit is used for processing the program to be tested according to the convolutional neural network so as to obtain the suspicious value of each statement in the program to be tested.

Optionally, the transformation unit comprises a dense transformation unit and a sparse transformation unit;

the dense conversion module is used for converting the one-dimensional statement coverage information of the test case into two-dimensional statement coverage information according to the corresponding relation between the function and the one-dimensional statement coverage information of the test case when the one-dimensional statement coverage information of the test case is dense data;

the sparse conversion module is used for randomly setting the one-dimensional statement coverage information of the test case in a two-dimensional list to obtain two-dimensional statement coverage information when the one-dimensional statement coverage information of the test case is sparse data.

Optionally, the training unit includes a dividing subunit, a training subunit, a testing subunit, a judging subunit, and a selecting subunit;

the dividing subunit is configured to divide the sample data into a training sample set and a test sample set;

the training subunit is configured to train the initial convolutional neural network by using the training sample set to obtain a trained convolutional neural network;

the test subunit is used for inputting a target test sample into the convolutional neural network to obtain an output result; the target test sample is any one of the test samples in the test sample set which are not tested;

the judging subunit is configured to judge whether a deviation value between the output result and the execution result of the test sample is within a preset range; if not, returning to the training subunit; if yes, triggering the selected subunit;

the selecting subunit is configured to select an untested test sample from the test sample set as a target test sample, and return the test sample to the testing subunit until all test samples in the test sample set are tested, and then use the convolutional neural network as a convolutional neural network meeting a preset requirement.

Optionally, the evaluation unit comprises a construction subunit, a transformation subunit and an output subunit;

the construction subunit is used for constructing a virtual test case corresponding to each statement in the program to be tested;

the conversion unit is used for converting the one-dimensional statement coverage information of the virtual test case into two-dimensional statement coverage information according to the conversion rule;

and the output subunit is used for inputting the two-dimensional statement coverage information into the convolutional neural network so as to obtain a suspicious value of each statement of the program to be tested.

The embodiment of the invention also provides a software defect positioning device based on the convolutional neural network, which comprises the following components:

a memory for storing a computer program;

a processor for executing the computer program to implement the steps of the above convolutional neural network-based software defect localization method.

The embodiment of the present invention further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the steps of the software defect localization method based on the convolutional neural network are implemented as described above.

According to the technical scheme, the preset test case is called to test the program to be tested, and one-dimensional statement coverage information of the test case and an execution result of the test case are generated; in order to adapt to the data format of the convolutional neural network, one-dimensional statement coverage information of the test case needs to be converted into two-dimensional statement coverage information according to a set conversion rule; training the initial convolutional neural network by taking the two-dimensional statement coverage information and the corresponding execution result as sample data to obtain a convolutional neural network meeting the preset requirement; the method has the advantages that the incidence relation between the one-dimensional statement coverage information and the execution result of the test case is fully mined through the convolutional neural network, the accuracy of the convolutional neural network in positioning software defects is effectively improved, and when the software defects of the program to be tested need to be analyzed, the program to be tested can be processed according to the convolutional neural network, so that the suspicious value of each statement in the program to be tested can be obtained. The incidence relation between the one-dimensional statement coverage information of the test case and the execution result is effectively mined based on the convolutional neural network, so that the predicted suspicious value is more accurate.

Drawings

In order to illustrate the embodiments of the present invention more clearly, the drawings that are needed in the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and that other drawings can be obtained by those skilled in the art without inventive effort.

FIG. 1 is a flowchart of a method for locating a software defect based on a convolutional neural network according to an embodiment of the present invention;

fig. 2 is a schematic structural diagram of a software defect locating apparatus based on a convolutional neural network according to an embodiment of the present invention;

fig. 3 is a schematic structural diagram of a software defect locating apparatus based on a convolutional neural network according to an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without any creative work belong to the protection scope of the present invention.

In order that those skilled in the art will better understand the disclosure, the invention will be described in further detail with reference to the accompanying drawings and specific embodiments.

Software bug fixes are primarily aimed at detecting and locating error codes that cause software failures. In the traditional method, calculation formulas of other related fields are often introduced, and a large amount of experimental research is carried out to design a calculation formula of the suspiciousness of defect positioning; or starting from the tested program, and constructing a new spectrum method by mining some regular information contained in the program, such as the information amount contained in the event, the execution track mode and the like. Based on the premise, researchers use statement coverage information generated by test case execution and an execution result of the test case to construct a suspicion degree calculation formula Tarrantula, so that software defect positioning is assisted. The Tarrantula formula defines 4 classes of factorsThe method is used for describing the relationship between statement coverage information generated by test case execution and an execution result: a is₁₁Representing the number of times a certain statement is covered in the failure test; a is₁₀Representing the number of times a certain statement is covered in the success test; a is₀₀Representing the number of times that a statement is not covered in a success test; a is₀₁Indicating the number of times a statement has not been covered in the failure test.

Part of the reasons for the effect difference of the conventional defect positioning method can be summarized as follows: how to utilize a₁₁、a₁₀、a₀₀、a₀₁The 4 weighting factors contain information to calculate the suspicious degree of the statement. According to the hypothesis of potential association between statement coverage information generated by test case execution and an execution result, the more accurate the weights of the 4 factors are considered, the better the defect positioning effect of the corresponding method is.

The potential correlation existing between statement coverage information generated by test case execution and execution results is the basis for determining the degree of suspicion. Obviously, the defect revealing information contained in the potential correlation is fully mined and utilized, and the effectiveness of defect positioning of the doubtful degree calculation formula is improved. Therefore, according to the software defect positioning method, device and computer readable storage medium based on the convolutional neural network provided by the embodiment of the invention, the convolutional neural network is utilized to fully mine the potential relationship between statement coverage information generated by the execution of the test case and the execution result, and the statement coverage information generated by the execution of the test case and the execution result of the test case are respectively used as the characteristic data and the marking data of the sample to train the convolutional neural network, so that the accuracy of the test result of the convolutional neural network is effectively improved.

Next, a software defect localization method based on a convolutional neural network according to an embodiment of the present invention is described in detail. Fig. 1 is a flowchart of a method for locating a software defect based on a convolutional neural network according to an embodiment of the present invention, where the method includes:

s101: and calling a preset test case to test the program to be tested, and generating one-dimensional statement coverage information of the test case and an execution result of the test case.

The test case is utilized to run in the program to be tested, when the test case traverses to a certain statement in the program to be tested, the test case is represented to cover the statement.

For convenience of description, P ═ s may be used₁,s₂,…,s_zDenotes the program to be tested. Wherein s is_iAnd (1 is more than or equal to i and less than or equal to z) represents the ith statement, and z represents the number of statements forming the program to be tested.

The program to be tested is generally composed of a plurality of functions, each of which contains a plurality of statements. Expressing test case P ═ { f) in functional form₁,f₂,…,f_m}. Wherein f is_i(1. ltoreq. i.ltoreq.m) represents the ith function, and m represents the number of functions constituting the program.

f_i＝{E_i1,E_i2,…,E_inDenotes the function f_iA set of statements in. Wherein E is_ij(1. ltoreq. i.ltoreq.m, 1. ltoreq. j.ltoreq.n) represents the function f_iThe j-th statement of (1), n represents the function f_iThe total number of sentences of (2).

T＝{t₁,t₂,…,t_lDenotes the set of test cases. Wherein, t_i(1. ltoreq. i. ltoreq.l) represents the ith test case.

Ps(t_i)＝{e₁,e₂,…,e_j,…,e_zDenotes the test case t_i(1 ≦ i ≦ l) executing the one-dimensional statement coverage information for the generated test case. Wherein e is_j(1. ltoreq. j. ltoreq.z) represents t_iWhether to cover statement s_jIf t is_iCovering statement s_jThen e_jValue 1, otherwise e_jThe value is 0.

Rs(t_i) Representing test cases t_i(1. ltoreq. i.ltoreq.l) if t_iSuccessful execution, then Rs (t)_i) Is 0, if t_iFailure to execute Rs (t)_i) Is 1.

Taking a program to be tested as an example, the corresponding statement coverage information and test case execution result are shown in table 1:

TABLE 1

The correspondence between functions and statements is shown in table 2:

TABLE 2

S102: and converting the one-dimensional statement coverage information of the test case into two-dimensional statement coverage information according to a set conversion rule.

In the embodiment of the invention, a convolutional neural network is utilized to mine the potential relation between statement coverage information generated by test case execution and an execution result. Considering that the convolutional neural network is used for processing high-dimensional data, it is necessary to convert the one-dimensional statement coverage information of the obtained test case into two-dimensional statement coverage information.

In the embodiment of the invention, two different conversion modes can be adopted to convert the one-dimensional statement coverage information of the test case into the two-dimensional statement coverage information.

Specifically, when the one-dimensional statement coverage information of the test case is dense data, the one-dimensional statement coverage information of the test case can be converted into two-dimensional statement coverage information according to the corresponding relationship between the function and the one-dimensional statement coverage information of the test case; and when the one-dimensional statement coverage information of the test case is sparse data, randomly setting the one-dimensional statement coverage information of the test case in a two-dimensional list to obtain two-dimensional statement coverage information.

It should be noted that, in practical application, for sparse data, the one-dimensional statement coverage information of the test case may also be converted into two-dimensional statement coverage information according to the corresponding relationship between the function and the one-dimensional statement coverage information of the test case; for dense data, one-dimensional statement coverage information of the test case can also be randomly arranged in the two-dimensional list to obtain two-dimensional statement coverage information. In the embodiment of the present invention, there is no limitation on which conversion method is specifically used.

Assume that the number of columns and the number of rows of the converted two-dimensional data are Dh and Dl. When the one-dimensional statement coverage information of the test case is dense, according to the corresponding relationship between the functions and the statements in table 2, Dh may be equal to the number m of the functions, Dl may be equal to the number of the statements of the "function with the largest number of statements", and data in each column represents coverage information of the statements of a certain function. The initial format for converting the one-dimensional statement coverage information of the test case in table 1 into the two-dimensional statement coverage information is shown in table 3a below:

E₁₁	E₂₁	E₃₁
			E₁₂	E₂₂	filling in
E₁₃	Filling in	Filling in

TABLE 3a

When Dh Dl ≠ z, data filling is required for Dh Dl-z spaces. Wherein z represents the number of statements in the sample data.

The filled data can conform to the random normal distribution, uniform distribution, and the minimum value of the effective range is directly filled. Without loss of generality, the data to be filled may be set to 0, and accordingly, the form after filling table 3a is as shown in the following table:

using the format shown in Table 3a as an example, test case t₁The corresponding two-dimensional statement coverage information is shown in the following table 3b, and the corresponding execution result is 1:

1	1	0
			1	0	0
0	0	0

TABLE 3b

Test case t₂The corresponding two-dimensional statement coverage information is shown in table 3c below, and the corresponding execution result is 0:

0	1	1
			1	0	0
1	0	0

TABLE 3c

Test case t₃The corresponding two-dimensional statement coverage information is shown in table 3d below, and the corresponding execution result is 1:

0	1	0
			0	1	0
0	0	0

TABLE 3d

When the one-dimensional statement coverage information of the test case is sparse, the one-dimensional statement coverage data can be randomly placed in z grids of Dh multiplied by Dl grids,

wherein

One initial format for converting the one-dimensional statement coverage information of the test case in table 1 into two-dimensional statement coverage information is shown in table 4a below:

E₂₂	E₂₁	E₁₃
			E₁₂	E₁₁	E₃₁

TABLE 4a

It should be noted that the form of table 4a is only one possible table form because it is randomly set.

Using the format shown in Table 4a as an example, test case t₁The corresponding two-dimensional statement coverage information is shown in the following table 4b, and the corresponding execution result is 1:

0	1	0
			1	1	0

TABLE 4b

Test case t₂The corresponding two-dimensional statement coverage information is shown in table 4c below, and the corresponding execution result is 0:

0	1	1
			1	0	1

TABLE 4c

Test case t₃The corresponding two-dimensional statement coverage information is shown in table 4d below, and the corresponding execution result is 1:

1	1	0
			0	0	0

TABLE 4d

S103: and training the initial convolutional neural network by taking the two-dimensional statement coverage information and the corresponding execution result as sample data to obtain the convolutional neural network meeting the preset requirement.

The two-dimensional statement coverage information is the statement coverage information form of the test case which can be identified by the convolutional neural network.

In the embodiment of the invention, sample data is divided into a training sample set and a test sample set; and training the initial convolutional neural network by utilizing the training sample set to obtain the trained convolutional neural network.

The convolutional neural network is mainly composed of input layers, convolutional layers, Pooling (Pooling) layers, full-link layers and output layers. By adding these layers together, a complete convolutional neural network can be constructed.

In the embodiment of the invention, the topological structure of the constructed convolutional neural network is as follows:

(1) input layer Input

The number of input layers is 1, and two-dimensional data of Dh × Dl is input.

(2) Convolutional layer

Determining the number of layers of the convolutional layer when constructing the convolutional layer; determining the number of convolution kernels, the size of the convolution kernels and the step length of convolution kernel sliding of each convolution layer; determining whether to add a bias (Nobias); determining whether to adopt a wide convolution operation (to complement 0 to the edge before the convolution operation); before activation of the activation function, it is determined whether to add L2 regularization penalizes the convolution kernel parameters themselves (L2 regularization parameters).

(3) Pooling layer

The construction of the pooling layer comprises the steps of determining the number of layers of the pooling layer; the pooling window size, the step size of the pooling window sliding, the form of the pooling operation (mean pooling, maximum pooling, etc.) is determined for each pooling layer.

(4) Full connection layer

The method comprises the steps of determining the number of layers of the full-connection layer when the full-connection layer is constructed; dropout parameters for each connection layer are determined.

(5) Output layer

The number of output layers is one, and the number of nodes is 1. The output layer outputs the classification label using a logistic function or a normalized exponential function (softmax function).

In an embodiment of the present invention, the activation function of each layer may use a modified Linear Unit (ReLU) function. The learning Algorithm may employ a Back Propagation Algorithm (BPA).

Taking any one of the untested test samples in the test sample set, i.e. the target test sample as an example, the target test sample can be input into the convolutional neural network to obtain an output result. Judging whether the deviation value of the output result and the execution result of the test sample is within a preset range or not; if not, returning to the step of training the initial convolutional neural network by using the training sample set to obtain the trained convolutional neural network; if so, selecting an untested test sample from the test sample set as a target test sample, returning to the step of inputting the target test sample into the convolutional neural network to obtain an output result, and taking the convolutional neural network as the convolutional neural network meeting the preset requirement until all the test samples in the test sample set are tested.

For example, the two-dimensional statement coverage information corresponding to the test sample is input to the convolutional neural network to obtain an output result Y ', the test case execution result corresponding to the test sample is Y, and when | Y' -Y | ≦ τ is satisfied, the deviation value between the output result and the execution result of the test sample is considered to be within the preset range. Otherwise, the deviation value of the output result and the execution result of the test sample is not in the preset range.

S104: and processing the program to be tested according to the convolutional neural network to obtain the suspicious value of each statement in the program to be tested.

After the convolutional neural network is trained, the defect of the program to be tested can be located according to the convolutional neural network. Considering the form of two-dimensional statement coverage information of a test case input during convolutional neural network training, correspondingly, when defect positioning is carried out on a program to be tested, a virtual test case corresponding to each statement in the program to be tested needs to be constructed; converting the one-dimensional statement coverage information of the virtual test case into two-dimensional statement coverage information according to a conversion rule; and inputting the two-dimensional statement coverage information into a convolutional neural network to obtain the suspicious value of each statement of the program to be tested.

When the virtual test case is constructed, a corresponding virtual test case can be constructed for each statement, and when one virtual test case runs in the program to be tested, only one corresponding statement in the program to be tested is called.

Take 6 statements contained in the program to be tested as an example, which are s in sequence₁To s₆。

Sentence s₁Corresponding virtual test case ts₁One-dimensional sentence coverage information of Ps' (ts)₁)＝{1,0,0,0,0,0}。

Sentence s₂Corresponding virtual test case ts₂One-dimensional sentence coverage information of Ps' (ts)₂)＝{0,1,0,0,0,0}。

Sentence s₃Corresponding virtual test case ts₃One-dimensional sentence coverage information of Ps' (ts)₃)＝{0,0,1,0,0,0}。

Sentence s₄Corresponding virtual test case ts₄One-dimensional sentence coverage information of Ps' (ts)₄)＝{0,0,0,1,0,0}。

Sentence s₅Corresponding virtual test case ts₅One-dimensional sentence coverage information of Ps' (ts)₅)＝{0,0,0,0,1,0}。

Sentence s₆Corresponding virtual test case ts₆One-dimensional sentence coverage information of Ps' (ts)₆)＝{0,0,0,0,0,1}。

(1) The format for converting the one-dimensional statement coverage information of the test case into the two-dimensional statement coverage information by adopting a dense processing mode, namely according to the corresponding relation between the function and the one-dimensional statement coverage information of the test case, is as follows:

sentence s₁Virtual test case ts of₁Corresponding toInput data to the convolutional neural network:

1	0	0
			0	0	0
0	0	0

sentence s₂Virtual test case ts of₂Input data of the corresponding convolutional neural network:

0	0	0
			1	0	0
0	0	0

sentence s₃Virtual test case ts of₃Input data of the corresponding convolutional neural network:

0	0	0
			0	0	0
1	0	0

sentence s₄Virtual test case ts of₄Input data of the corresponding convolutional neural network:

0	1	0
			0	0	0
0	0	0

sentence s₅Virtual test case ts of₅Input data of the corresponding convolutional neural network:

0	0	0
			0	1	0
0	0	0

sentence s₆Virtual test case ts of₆Input data of the corresponding convolutional neural network:

0	0	1
			0	0	0
0	0	0

(2) by adopting a sparse processing mode, namely randomly setting the one-dimensional statement coverage information of the test case in a two-dimensional list, the format of the obtained two-dimensional statement coverage information is as follows:

sentence s₁Virtual test case ts of₁Input data of the corresponding convolutional neural network:

0	0	0
			0	1	0

0	0	0
			1	0	0

0	0	1
			0	0	0

0	1	0
			0	0	0

1	0	0
			0	0	0

0	0	0
			0	0	1

when the suspicious value of each sentence in the program to be tested is determined, which sentences are problem sentences can be determined according to the suspicious value of each sentence, so that the defect positioning is realized.

According to the technical scheme, the preset test case is called to test the program to be tested, and one-dimensional statement coverage information of the test case and an execution result of the test case are generated; converting the one-dimensional statement coverage information of the test case into two-dimensional statement coverage information according to a set conversion rule; training the initial convolutional neural network by taking the two-dimensional statement coverage information and the corresponding execution result as sample data to obtain a convolutional neural network meeting the preset requirement; the method has the advantages that the incidence relation between the one-dimensional statement coverage information and the execution result of the test case is fully mined through the convolutional neural network, the accuracy of the convolutional neural network in positioning software defects is effectively improved, and when the software defects of the program to be tested need to be analyzed, the program to be tested can be processed according to the convolutional neural network, so that the suspicious value of each statement in the program to be tested can be obtained. The incidence relation between the one-dimensional statement coverage information of the test case and the execution result is effectively mined based on the convolutional neural network, so that the predicted suspicious value is more accurate.

The suspicious value of each statement output by the convolutional neural network ranges from 0 to 1. In the embodiment of the invention, after the suspicious values of the sentences in the program to be tested are obtained, the sentences in the program to be tested can be arranged according to the descending order of the suspicious values to generate the software defect positioning report table. Taking 6 statements included in the program to be tested introduced in S104 as an example, the corresponding software defect location report table is as follows:

degree of suspicionValue of	Sentence s_i
		0.93	s₁
0.91	s₂
		0.91	s₃
0.76	s₄
		0.53	s₅
0.18	s₆

Software defect positioning report table

The software defect positioning report table is obtained by performing descending order arrangement on the suspicious values of the sentences in the program to be tested, so that related personnel can conveniently and visually know the defect condition of the sentences in the program to be tested, and the defect sentences can be quickly positioned and processed.

Fig. 2 is a schematic structural diagram of a software defect locating apparatus based on a convolutional neural network according to an embodiment of the present invention, including a generating unit 21, a transforming unit 22, a training unit 23, and an evaluating unit 24;

the generating unit 21 is configured to invoke a preset test case to test a program to be tested, and generate one-dimensional statement coverage information of the test case and an execution result of the test case;

the conversion unit 22 is configured to convert the one-dimensional statement coverage information of the test case into two-dimensional statement coverage information according to a set conversion rule;

the training unit 23 is configured to train the initial convolutional neural network by using the two-dimensional statement coverage information and the corresponding execution result as sample data, so as to obtain a convolutional neural network meeting a preset requirement;

the evaluation unit 24 is configured to process the program to be tested according to the convolutional neural network to obtain a suspicious value of each statement in the program to be tested.

Optionally, the transformation unit comprises a dense transformant unit and a sparse transformant unit;

and the sparse conversion unit is used for randomly setting the one-dimensional statement coverage information of the test case in a two-dimensional list to obtain the two-dimensional statement coverage information when the one-dimensional statement coverage information of the test case is sparse data.

Optionally, the training unit includes a dividing subunit, a training subunit, a testing subunit, a judging subunit and a selecting subunit;

the dividing subunit is used for dividing the sample data into a training sample set and a test sample set;

the training subunit is used for training the initial convolutional neural network by utilizing the training sample set to obtain a trained convolutional neural network;

the test subunit is used for inputting the target test sample into the convolutional neural network to obtain an output result; the target test sample is any one of the test samples in the test sample set which are not tested;

the judgment subunit is used for judging whether the deviation value of the output result and the execution result of the test sample is within a preset range; if not, returning to the training subunit; if yes, triggering and selecting the subunit;

and the selecting subunit is used for selecting an untested test sample from the test sample set as a target test sample, returning to the testing subunit, and taking the convolutional neural network as the convolutional neural network meeting the preset requirement until all the test samples in the test sample set are tested.

the construction subunit is used for constructing virtual test cases corresponding to all the sentences in the program to be tested;

the transformation unit is used for transforming the one-dimensional statement coverage information of the virtual test case into two-dimensional statement coverage information according to a transformation rule;

and the output subunit is used for inputting the two-dimensional statement coverage information into the convolutional neural network so as to obtain the suspicious value of each statement of the program to be tested.

The description of the features in the embodiment corresponding to fig. 2 may refer to the related description of the embodiment corresponding to fig. 1, and is not repeated here.

Fig. 3 is a schematic structural diagram of a software defect locating apparatus 30 based on a convolutional neural network according to an embodiment of the present invention, including:

a memory 31 for storing a computer program;

a processor 32 for executing a computer program to implement the steps of the software defect localization method based on convolutional neural network as described above.

The embodiment of the invention also provides a computer readable storage medium, wherein a computer program is stored on the computer readable storage medium, and when being executed by a processor, the computer program realizes the steps of the software defect positioning method based on the convolutional neural network.

The software defect positioning method, device and computer readable storage medium based on the convolutional neural network provided by the embodiment of the invention are described in detail above. The embodiments are described in a progressive manner in the specification, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other. The device disclosed by the embodiment corresponds to the method disclosed by the embodiment, so that the description is simple, and the relevant points can be referred to the method part for description. It should be noted that, for those skilled in the art, it is possible to make various improvements and modifications to the present invention without departing from the principle of the present invention, and those improvements and modifications also fall within the scope of the claims of the present invention.

Those of skill would further appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both, and that the various illustrative components and steps have been described above generally in terms of their functionality in order to clearly illustrate this interchangeability of hardware and software. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.

The steps of a method or algorithm described in connection with the embodiments disclosed herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may reside in Random Access Memory (RAM), memory, Read Only Memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art.

Claims

1. A software defect positioning method based on a convolutional neural network is characterized by comprising the following steps:

processing the program to be tested according to the convolutional neural network to obtain the suspicious value of each statement in the program to be tested;

converting the one-dimensional statement coverage information of the test case into the two-dimensional statement coverage information according to the set conversion rule comprises:

2. The method of claim 1, wherein the training of the initial convolutional neural network with the two-dimensional statement coverage information and the corresponding execution result as sample data to obtain a convolutional neural network meeting preset requirements comprises:

dividing the sample data into a training sample set and a testing sample set;

3. The method according to any one of claims 1-2, wherein the processing the program under test according to the convolutional neural network to obtain the suspicious value of each statement of the program under test comprises:

4. A software defect positioning device based on a convolutional neural network is characterized by comprising a generating unit, a converting unit, a training unit and an evaluating unit;

the evaluation unit is used for processing the program to be tested according to the convolutional neural network so as to obtain the suspicious value of each statement in the program to be tested;

the transformation unit comprises a dense transformation unit and a sparse transformation unit;

5. The apparatus of claim 4, wherein the training unit comprises a dividing subunit, a training subunit, a testing subunit, a judging subunit, and a selecting subunit;

6. The device according to any one of claims 4 to 5, wherein the evaluation unit comprises a construction subunit, a transformation subunit and an output subunit;

7. A software defect locating device based on a convolutional neural network is characterized by comprising:

a memory for storing a computer program;

a processor for executing the computer program to implement the steps of the convolutional neural network-based software defect localization method of any of claims 1 to 3.

8. A computer-readable storage medium, having stored thereon a computer program which, when being executed by a processor, carries out the steps of the convolutional neural network-based software defect localization method as claimed in any one of claims 1 to 3.