CN110377713B

CN110377713B - Method for improving context of question-answering system based on probability transition

Info

Publication number: CN110377713B
Application number: CN201910641706.9A
Authority: CN
Inventors: 谢铁
Original assignee: Guangzhou Tanyu Technology Co ltd
Current assignee: Guangzhou Tanyu Technology Co ltd
Priority date: 2019-07-16
Filing date: 2019-07-16
Publication date: 2023-09-15
Anticipated expiration: 2039-07-16
Also published as: CN110377713A

Abstract

A method for improving the context of a question-answering system based on probability transition belongs to the technical field of data processing methods, and adopts a classification algorithm and a probability transition matrix A to process user problem data, (1) preset labeling data of the system; (2) Receiving a user problem, and preprocessing to obtain processing data; (3) Training the labeling data through a classification algorithm to obtain an intention classification model; then, the labeling data are transmitted to a probability transition matrix A for training, and an initialized probability transition matrix A is obtained; (4) Predicting the processing data to obtain the distribution P of the prediction labels; the invention provides a context-combined intention recognition method, by which labeling data with context scenes can be unnecessary to prepare, and the labor cost is saved; through the self-learning capability of the probability transition matrix A, the accuracy of the whole system is higher and higher along with the change of the service time.

Description

Method for improving context of question-answering system based on probability transition

Technical Field

The invention belongs to the technical field of data processing methods, and particularly relates to a method for improving the context of a question-answering system based on probability transfer.

Background

In the e-commerce field, when a user (i.e., a buyer) makes online shopping, a consultation behavior is generated to customer service. In many automated question-answering systems, what is needed is to identify and classify the intent of the buyer. It is common practice to classify intent recognition as short text, but this practice breaks the impact of the user's context (historical dialog) on intent classification. For example, the buyer speaks "160". The buyer may either express a confirmation price or be providing his height or weight. This requires confirmation of specific intent from the above. There are also some schemes to combine context intent recognition, such as inputting 5 questions of buyers in succession as input, sorting with hierarchical intent. Or the tags above are taken as a feature into the current sentence for calculation. However, since this approach requires continuous chat recording by the user and requires the labeling personnel to pay attention to the context of each piece of data when labeling, additional effort is incurred in labeling and labeling errors are also easily caused. Furthermore, another class of drawbacks of this approach is that the samples are highly unbalanced, requiring manual replenishment of the data. Another type of solution is to use a rule, i.e. to manually specify the rules that the context appears in, and it is obvious that this solution is time-consuming and labor-consuming, and it is difficult to guarantee that all the possibilities are enumerated.

Disclosure of Invention

The present invention aims to overcome the above-mentioned drawbacks and disadvantages and to provide a method for improving the context of a question-answering system based on probability transitions.

In order to solve the technical problems, the following technical scheme is adopted:

the method for improving the context of the question-answering system based on the probability transition realizes the improvement of the question-answering system by combining a classification algorithm and a probability transition matrix A, and comprises the following specific steps:

(1) Predicting a series of data for the system, and manually calibrating the data to obtain calibration data;

(2) Receiving user problems, preprocessing the user problems to obtain processing data, and facilitating subsequent link processing;

(3) The labeling data is processed, and the processing content is as follows:

training the obtained labeling data through a TEXT CNN model to obtain an intention classification model;

(3-2) inputting the labeling data into a probability transition matrix for training to obtain an initialized probability transition matrix A;

(4) Predicting the processing data in the step (2) through a classification algorithm to obtain the distribution P of the predicted tags;

(5) A series of calculations are performed on the distribution P of the predictive labels to screen out missing information Q _i ，i＝1,2,3...n；

(6) Processing the processing data which is a complete session process in the step (2) by combining the probability transition matrix A obtained in the step (3-2) to obtain accurate missing sentences and corresponding context contents;

furthermore, the classification algorithms are TEXT CNN, LSTM, BERT and SVM, and particularly the classification algorithm with probability distribution of the prediction result is applicable to the system.

Further, the series of algorithms in the step (5) specifically includes:

(5-1) calculating an average number M of the distribution P by an average number formula,

wherein P is ₁ 、P ₂ ...P _i Representing a specific numerical value, i representing the number of the set of data;

(5-2) calculating by a variance formula, screening out the characteristics,

wherein i represents the number of data, M is an average number, s ² Representing the variance, when the variance s ² The smaller the value, the more difficult it is to judge the intention expressed for some of the processed data.

Further, the specific contents calculated in the step (6) in combination with the probability transition matrix a are as follows:

(6-1) combining the processed data, we have n dialogues for the sentence Q determined to be missing in step (4) _i With a distribution P _i Combining a random probability transition matrix A initialized by the system, and constructing an objective function: f= ||q _i-1 -AQ _i And the n pieces of processing data are data sets with complete dialogue scenes, so that the probability transition matrix A learns the prediction result of the current processing data, and the missing sentences and the corresponding context content are determined.

By adopting the scheme, the method has the following beneficial effects:

(1) The invention provides a context-combined intention recognition method, by which labeling data with context scenes can be unnecessary to prepare, and the labor cost is saved.

(2) Through the self-learning capability of the probability transition matrix A, the accuracy of the whole system is higher and higher along with the change of the service time.

Drawings

Fig. 1 is a flow chart of the entire system.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention.

(4) Predicting a series of data for the system, and manually calibrating the data to obtain calibration data;

(5) Receiving user problems, preprocessing the user problems to obtain processing data, and facilitating subsequent link processing;

(6) The labeling data is processed, and the processing content is as follows:

(5) A series of calculations are performed on the distribution P of the predictive labels to screen out missing information Q _i I=1, 2, 3..n; the series of algorithms are specifically as follows:

(5-2) calculating by a variance formula, screening out the characteristics,

(6) Processing the processing data which is a complete session process in the step (2) by combining the probability transition matrix A obtained in the step (3-2) to obtain accurate missing sentences and corresponding context contents, wherein the specific contents calculated by combining the probability transition matrix A are as follows:

(6-1) combining the processed data, we have n dialogues for the sentence Q determined to be missing in step (4) _i With a distribution P _i Combining a random probability transition matrix A initialized by the system, and constructing an objective function: f= ||q _i-1 -AQ _i And (3) enabling the probability transition matrix A to learn the prediction result of the current processing data, so as to determine the missing statement and the corresponding context content.

Preferably, the classification algorithms are TEXT CNN, LSTM, BERT and SVM, and particularly, the classification algorithm with probability distribution of the prediction result is applicable to the system.

Preferably, the n pieces of processing data in the step (6-1) are data sets with complete dialogue scenes.

The working principle of the system is as follows: as shown in fig. 1, first, the TEXT CNN trains labeling data to obtain an intention classification model, then, the TEXT CNN predicts an intended label for a trained TEXT CNN for a received user problem, if the probability distribution of the predicted label is smooth, the predicted label is obtained, if the probability distribution of the predicted label is not smooth, unlabeled data (i.e., user problem data in a complete session) is trained and learned by an initialized probability transition matrix a, and a label probability distribution P obtained by the TEXT CNN for each sentence of user problem is combined, the obtained label probability distribution P and the probability transition matrix a are multiplied to obtain a new probability distribution, and an intended label corresponding to a new probability distribution top1 is output, thereby completing the whole system process.

The invention has been described in terms of embodiments, and the device can be modified and improved without departing from the principles of the invention. It should be noted that all technical solutions obtained by equivalent substitution or equivalent transformation fall within the protection scope of the present invention.

Claims

1. A method for improving the context of a question-answering system based on probability transition adopts a classification algorithm and a probability transition matrix A to process user question data, and is characterized in that:

the specific processing steps are as follows:

(1) Presetting annotation data for a system;

(2) Receiving a user problem, and preprocessing to obtain processing data;

(3) Training the labeling data through a classification algorithm to obtain an intention classification model; then, the labeling data are transmitted to a probability transition matrix A for training, and an initialized probability transition matrix A is obtained;

(4) Predicting the processing data to obtain the distribution P of the prediction labels;

(5) A series of calculations are performed by predicting the distribution P of the tags, screening the missing information Q _i ，i＝1，2，3...n；

(6) For a data set in a complete session process in the processed data, calculating by combining the initialized probability transfer matrix A to obtain accurate missing sentences and corresponding context contents; wherein, a series of calculations in the step (5) are specifically:

(5-1) calculating an average number M of the distribution P by an average number formula,wherein, P1, P2..pi represents a specific numerical value, i represents the number of the set of data;

(5-2) calculating by a variance formula,

wherein i represents the number of data, M is an average number, S ² Representing the variance, when the variance S ² The smaller the value, the harder it is to judge the intention expressed by some of the processed data;

the content calculated by combining the probability transition matrix A in the step (6) is as follows:

(6-1) combining the processed data, having n dialogues, with respect to the information Q selected as missing in step (5) _i With a distribution P _i Combining a random probability transition matrix A initialized by the system, and constructing an objective function: f= ||q _i-1 -AQ _i And (3) enabling the probability transition matrix A to learn the prediction result of the current processing data so as to determine the missing statement and the corresponding context.

2. A method for improving the context of a question-answering system based on probability transitions as set forth in claim 1, wherein: the labeling data is a series of problems of manual calibration.

3. A method for improving the context of a question-answering system based on probability transitions as set forth in claim 1, wherein: the classification algorithm is a classification algorithm with TEXT CNN, LSTM, BERT and SVM, and particularly has probability distribution of a prediction result, and the classification algorithm is applicable to the system.

4. A method for improving the context of a question-answering system based on probability transitions as set forth in claim 1, wherein: the n pieces of processing data combined in the step (6-1) are data sets with complete dialogue scenes.