CN112101557A

CN112101557A - Demand change prediction method based on CRF-LSTM algorithm

Info

Publication number: CN112101557A
Application number: CN202010828693.9A
Authority: CN
Inventors: 殷茗; 王一博; 许卓; 田嘉毅; 王生泽; 任卓力; 邓子凡; 黄仲臻
Original assignee: Northwestern Polytechnical University
Current assignee: Northwestern Polytechnical University
Priority date: 2020-08-18
Filing date: 2020-08-18
Publication date: 2020-12-18

Abstract

The invention relates to a demand change prediction method based on a CRF-LSTM algorithm, which is used for extracting data context information of user demands, observing behavior and background values of users, simulating and capturing states set during system operation, and learning and deducing current and continuously developed targets of system participants.

Description

Demand change prediction method based on CRF-LSTM algorithm

Technical Field

The invention belongs to the technical field of software development, and relates to a demand change prediction method based on a CRF-LSTM algorithm.

Background

The document "A statistical analysis approach to prediction user's changing requirements for Software service evolution, Journal of Systems and Software 132(2017) 147-. The method is used to encode known relationships between observations and construct consistent interpretations, is a mathematical model for constructing target inferences that infer a user's target based on runtime observations of actions and environmental context values. Firstly, in order to accurately infer a predefined user target in a domain knowledge base and detect emerging or special intentions of a user, constructing a linear chain CRF model for coding a known behavior pattern of the user to realize the target as a measurement for abnormal value detection; secondly, observing the operation and related context values of a real user when operating on the system, and generating < o11, > o1n >, < o21, > o2m >,. and deducing the target of the user at each observation time point in a CRF model obtained by a prepared observation sequence; and finally, detecting emerging intentions, wherein the emerging intentions are a situation sequence mode and are not predefined in a domain knowledge base, so that the newly-appeared requirements are formulated based on the detection results of the emerging intentions. However, the method cannot consider long-term context information, more consideration is given to linear weighted combination of local features of the whole sentence, and the neural network fitting nonlinearity cannot be carried out, so that agile development requirement change cannot be predicted more accurately to meet user requirements.

The traditional LSTM model is powerful in sequence modeling, can capture long-distance context information, and has the capability of fitting nonlinearity of a neural network, but for the time t, an output layer is influenced by a hidden layer containing the context information and a current input layer, the output layer and the output layers at other times are independent, if strong dependency exists between the output layers, for example, a general noun is followed by an adjective, which is a constraint, the LSTM cannot model the constraints, and the performance of the LSTM model is limited. The LSTM network processing is equivalent to obtaining a better representation method of input data, the vector finally output by the LSTM unit can be regarded as a representation form of the input data, the CRF-LSTM model is a method for combining the existing linear statistical model with a neural network structure, namely combining the CRF with the output end, solving the problem of extracting sequence characteristics by using the LSTM, and effectively utilizing sentence-level marking information by using the CRF. Under the CRF-LSTM model, the output will not be the independent tags, but the optimal tag sequence.

Disclosure of Invention

Technical problem to be solved

The method aims to solve the problem that only conditional random fields are used for sequentially marking and analyzing data in the agile development demand change prediction process, and long-term context information of collected data is ignored. The method takes the characteristics of high efficiency and accuracy of the agile development demand change into consideration, and provides more accurate prediction for the agile development demand change by adopting a CRF-LSTM (cross-domain similarity-least squares) combined algorithm.

Technical scheme

A demand change prediction method based on a CRF-LSTM algorithm is characterized by comprising the following steps:

step 1: user behavior data collection

Adding a JavaScript event monitor in an area where user behaviors need to be collected, and recording the user behaviors to a local log in a parameter mode on a corresponding label; inserting the record into a database after obtaining a piece of user behavior data; then, taking out the data in the database at regular time and uploading the data to a server at one time in a JSON array form; the server side responds to the user behavior through the @ RequestMapping annotation, outputs the user behavior to a local Log file user operation. xlsx through the Log4j configuration, and newly establishes a Java class LogRecord which is used for adding a user id, the action generation time and a target to the Log file;

step 2: xlsx, classifying data according to user id, embedding a buried point code in a file management system for collecting user operation behavior information, determining a user behavior tag and a user target tag, and making a corresponding tag dictionary;

and step 3: selecting a Keras-based neural network framework to construct a CRF-LSTM demand prediction model;

inputting the data obtained in the step 2 as a CRF-LSTM demand prediction model, and obtaining the prediction model through N rounds of iterative learning; predicting a target corresponding to the user behavior by using the prediction model to obtain a prediction result; the prediction probability and the prediction precision are included; the prediction probability is the probability of the user to perform the operation, and the prediction precision is the prediction accuracy;

and 4, step 4: statistical prediction results for inferring user intent

Statistically calculating the prediction probability and the prediction precision Acc, wherein the calculation formula is shown as formula (1), wherein TP is the number of samples which are actually the target x and are predicted as x, FP is the number of samples which are not actually the target x but are predicted as x, FN is the number of samples which are actually the target x but are not predicted as x, and TN is the number of samples which are not actually the target x but are not predicted as x;

the higher the prediction probability is, the more possible the user is to perform the operation, and the higher the prediction precision is, the more accurate the prediction is proved;

the following reference criteria were used for reasoning:

1) if the prediction accuracy of the label is lower than 0.9, the reason is that the key setting of the system interface is unreasonable; since there are problems with key locations, names, or function settings, which do not match the user's habits, the user cannot quickly understand and use the system;

2) if the prediction of a certain label is accurate, but the prediction probability is lower than 0.99, the system function is divided poorly or the key setting is unreasonable; because one button in the system bears the multifunctional requirement or the button is unreasonable in arrangement, the function of the button is misunderstood by a user, and therefore the target requirement of the user cannot be stably predicted by the prediction model;

3) continuous errors occur in a certain operation process, which are caused by unreasonable function-related key settings or incapability of displaying prompt information by a system; the setting of the function keys and the understanding of the user have great problems, or the system cannot display error prompt information, so that the user is difficult to find errors through continuous operation.

And 5: verifying the result of intent reasoning, summarizing the change of user requirements: inviting part of the users to participate in the survey and feedback, confirming the real intentions of the users, from which the new or potential needs of the users can be derived.

The user behavior and the corresponding label in the step 1:

user behavior	Corresponding label
		Creating a folder	createfolder
Downloading files	downloadfile
		Editing files	editfile
Mobile file	movefile
		Mobile folder	movefolder
Opening file folder	openfolder
		Deleting files	removefile
Deleting file folder	removefolder
		Renaming files	renamefile
Renaming folder	renamefolder
		Changing language	switchlanguage
Switching list views	switchlistview
		Updating files	uploadfile
Viewing documents	viewfile

。

In the step 3, N is 30.

Advantageous effects

The demand change prediction method based on the CRF-LSTM algorithm, provided by the invention, is used for extracting data context information of user demands, observing behaviors and background values of users, simulating and capturing states set during system operation, and learning and deducing current and continuously developed targets of system participants.

Drawings

FIG. 1 CRF-LSTM model structure for entity recognition

Detailed Description

The invention will now be further described with reference to the following examples and drawings:

as shown in FIG. 1, under the CRF-LSTM model, the output will not be mutually independent tags, but the optimal tag sequence.

For the input: x ═ X₁,x₂,…x_n) An output probability matrix Pn k for LSTM may be defined where k is the number of output labels. Pi, j refers to the probability that the ith word is marked as the jth tag.

For the tag sequence to be predicted Y ═ Y₁,y₂,…y_n) The following definitions may be given:

wherein A is a state transition matrix, A_i,jRepresenting the probability of transitioning from the ith tag to the jth tag.

By finding the maximum s (X, y), the optimal output tag sequence can be obtained. The CRF introduced here is only to model the output label binary group, then to calculate by using dynamic programming, and finally to label according to the obtained optimal path.

A demand change prediction method based on a CRF-LSTM algorithm specifically comprises the following steps:

step 1: user behavior data collection

user behavior and its corresponding tags:

TABLE 1 user behavior object tags

inputting the data obtained in the step 2 as a CRF-LSTM demand prediction model, and obtaining the prediction model through 30 rounds of iterative learning; predicting a target corresponding to the user behavior by using the prediction model to obtain a prediction result; the prediction probability and the prediction precision are included; the prediction probability is the probability of the user to perform the operation, and the prediction precision is the prediction accuracy;

movement of	Self-reporting target	Predicting an objective	Prediction probability
				clickswitchlistview	movefile	movefile	0.99206614
clickupload	movefile	movefile	0.99971896
				clickviewfile	movefile	movefile	0.9998857
clickviewimage	movefile	movefile	0.9996197

Table 2 partial prediction results

The method comprises the following specific steps:

(1) and (4) introducing. Introducing keras, pandas and numpy libraries and modules and models contained therein, including Sequential, Dense, LSTM, InputLayer, Bidirectional, TimeDistributed, Embedding, Activation, GRU, Adam, CRF, as required.

(2) And reading the data. And reading the txt file which is obtained and processed before.

(3) And setting parameters. The embedded layer SIZE EMBED _ SIZE 128, the HIDDEN layer unit number HIDDEN _ SIZE 128, and the sample number batt _ SIZE 2 selected in one training.

(4) A data set is constructed. The data is sliced and a dataset is created.

(5) And constructing a model and training. And (3) constructing a CRF-LSTM neural network model, training the model and storing. The model is divided into an input layer, an embedding layer, a bidirectional RNN packaging layer, a GRU layer, a TimeDistributed packaging layer and a CRF layer. The collected user behavior data is used as a training set to be input, and the model can predict the behavior target on the basis of learning of a large amount of actual data to obtain a behavior target prediction result.

(6) And (5) predicting, setting a training set for predicting, drawing and displaying.

And 4, step 4: statistical prediction results for inferring user intent

the following reference criteria were used for reasoning:

And 5: and verifying the result of the intention reasoning and summarizing the change of the user requirement. Inviting part of the users to participate in the survey and feedback, confirming the real intentions of the users, from which the new or potential needs of the users can be derived.

And taking the change of the user requirement as a new iteration variable to participate in the next iteration development work, and feeding the change back to the developer. According to the requirements of agile software development, reasonable development tasks are formulated to be helped, so that the requirements of users are met better. Inviting project-related principals or industry experts in the field to further analyze the reasonableness of user intent inferences based on their understanding and expertise of the project to determine whether emerging intents, intent changes, or system deficiencies exist. If validation is indeed an emerging intent or system deficiency, it can be used to summarize and analyze changes in user demand, thereby enabling prediction of changes in user demand. In the process of agile software development, the method is applied between two iteration cycles, changes of user requirements are predicted, and the changes are used as new iteration variables to participate in a new round of iteration development work.

Claims

1. A demand change prediction method based on a CRF-LSTM algorithm is characterized by comprising the following steps:

step 1: user behavior data collection

and 4, step 4: statistical prediction results for inferring user intent

the following reference criteria were used for reasoning:

3) continuous errors occur in a certain operation process, which are caused by unreasonable function-related key settings or incapability of displaying prompt information by a system; the setting of the function keys and the understanding of the user have great problems, or the system cannot display error prompt information, so that the user is difficult to find errors through continuous operation;

2. The method of claim 1, wherein the user behavior and corresponding label in step 1:

。

3. The method of claim 1, wherein N in step 3 is 30.