CN112883179B - Man-machine conversation method and system - Google Patents

Man-machine conversation method and system Download PDF

Info

Publication number
CN112883179B
CN112883179B (application CN202110200723.6A)
Authority
CN
China
Prior art keywords
sentence
word
dialogue
function
answer
Prior art date
Legal status
Active
Application number
CN202110200723.6A
Other languages
Chinese (zh)
Other versions
CN112883179A (en)
Inventor
陈欢欢
范祖宁
Current Assignee
University of Science and Technology of China USTC
Original Assignee
University of Science and Technology of China USTC
Priority date
Filing date
Publication date
Application filed by University of Science and Technology of China (USTC)
Priority to CN202110200723.6A
Publication of CN112883179A
Application granted
Publication of CN112883179B


Classifications

    • G06F16/3329 Natural language query formulation or dialogue systems
    • G06F16/3344 Query execution using natural language analysis
    • G06F16/35 Clustering; Classification (of unstructured textual data)
    • G06F40/126 Character encoding
    • G06F40/216 Parsing using statistical methods
    • G06F40/289 Phrasal analysis, e.g. finite state techniques or chunking
    • G06N3/084 Backpropagation, e.g. using gradient descent
    • Y02D10/00 Energy efficient computing, e.g. low power processors, power management or thermal management


Abstract

The invention discloses a man-machine conversation method and system: an acquired current dialogue sentence is preprocessed to obtain a word vector sequence; the word vector sequence is input into a sentence classification model to obtain the sentence function label of the current dialogue sentence; and the word vector sequence and the sentence function label are input into a generated dialogue model, which outputs an answer sentence for the current dialogue sentence by a generative dialogue method. Because a certain proportion of question sentences is included in the sentence function labels used to train the sentence classification model, the answer sentences output by the generated dialogue model also contain a certain proportion of question sentences; and to avoid the generated dialogue model wrongly outputting other sentence patterns when a question sentence should be output, a loss function is introduced.

Description

Man-machine conversation method and system
Technical Field
The invention relates to the technical field of natural language processing, in particular to a man-machine conversation method and a man-machine conversation system.
Background
Dialogue systems, also known as interactive dialogue agents or chat robots, have a wide range of applications, such as technical support and entertainment. In a dialogue system, a user's question can generally be resolved in a single round of dialogue, but in practical applications the more common scenario is multi-round dialogue, in which the user raises new questions about the system's answers or extends the related interaction.
However, most research on dialogue systems assumes a form in which the user asks and the system answers. This question-and-answer mode resembles information retrieval: because the dialogue system obtains little effective information from the user, the effectiveness of the interaction suffers, the man-machine dialogue cannot be well sustained, and it differs greatly from everyday dialogue, so the user experience is poor.
Disclosure of Invention
In view of the above, the invention discloses a man-machine conversation method and system, so as to realize active dialogue during man-machine conversation and improve the user experience.
A human-machine conversation method, comprising:
acquiring a current dialogue sentence;
preprocessing the current dialogue sentence to obtain a word vector sequence;
inputting the word vector sequence into a pre-trained sentence classification model to obtain a sentence function label of the current dialogue sentence, wherein the sentence classification model is trained with word vector sequences of dialogue sentences as training samples and sentence function labels as labels, and the proportion of question sentences in the sentence function labels used to train the sentence classification model lies within a preset proportion interval;
And inputting the word vector sequence and the sentence function label into a pre-trained generated dialogue model, and outputting the answer sentence of the current dialogue sentence by a generative dialogue method, wherein the generated dialogue model learns by a back-propagation algorithm during training until a loss function converges to a minimum, the loss function representing the loss of the classification probability of the answer sentence output by the generated dialogue model relative to the classification probability of a reference answer sentence.
Optionally, the preprocessing the current dialogue sentence to obtain a word vector sequence specifically includes:
word segmentation is carried out on the current dialogue sentence to obtain a word sequence;
And carrying out word embedding processing on each word in the word sequence, and converting each word into a word vector to obtain the word vector sequence.
Optionally, the inputting the word vector sequence and the sentence function tag into a pre-trained generating dialogue model, and outputting the answer sentence of the current dialogue sentence by adopting a generating dialogue method specifically includes:
encoding the word vector sequence by using an encoder to obtain a hidden state;
Decoding the hidden state by using a decoder to obtain an output sequence, and stopping decoding when the last character of the output sequence is a stop symbol;
and inputting the decoded output sequence and the sentence function tag into the generated dialogue model, and outputting the answer sentence of the current dialogue sentence by adopting a generated dialogue method.
Optionally, the expression of the hidden state is as follows:
h_t = f(x_t, h_{t-1}); c = h_t
wherein h_t is the hidden state at time t, f is an activation function (a nonlinear function), x_t is the t-th word vector in the word vector sequence, h_{t-1} is the hidden state at time t-1, and c is the context vector.
Optionally, the loss function is a weighted average of an output sentence loss factor and a classification loss factor, and the expression of the loss function is as follows:
loss = α·lossA + β·lossB
wherein loss is the loss function, α and β represent the weights of the weighted average, both take values between 0 and 1, and α + β = 1;
wherein the expression of the output sentence loss factor is as follows:
lossA = -Σ_{i=1}^{n} log P(y_i | y_1, …, y_{i-1})
wherein lossA represents the output sentence loss factor, n represents the sentence length in the answer-sentence generation process, i represents the summation variable, P(y_i | y_1, …, y_{i-1}) represents the probability of generating the i-th word from the 1st through (i-1)-th words, and y_i represents the i-th word in the answer sentence;
the expression of the classification loss factor is as follows:
lossB = -Σ_{i=1}^{k} P(U|i) · log P(Y|i)
wherein k represents the number of sentence function classes, i represents the summation variable taking values 1 to k, Y represents the classification label corresponding to the answer sentence, U represents the classification label corresponding to the reference answer V, P(Y|i) represents the probability that the answer sentence obtains classification label Y, and P(U|i) represents the probability that the reference answer V output by the generated dialogue model obtains classification label U.
A human-machine conversation system, comprising:
A dialogue sentence acquisition unit for acquiring a current dialogue sentence;
the preprocessing unit is used for preprocessing the current dialogue sentence to obtain a word vector sequence;
The function label obtaining unit is used for inputting the word vector sequence into a pre-trained sentence classification model to obtain the sentence function label of the current dialogue sentence, wherein the sentence classification model is trained with word vector sequences of dialogue sentences as training samples and sentence function labels as labels, and the proportion of question sentences in the sentence function labels used to train the sentence classification model lies within a preset proportion interval;
The answer sentence acquisition unit is used for inputting the word vector sequence and the sentence function label into a pre-trained generated dialogue model and outputting the answer sentence of the current dialogue sentence by a generative dialogue method, wherein the generated dialogue model learns by a back-propagation algorithm during training until a loss function converges to a minimum, the loss function representing the loss of the classification probability of the answer sentence output by the generated dialogue model relative to the classification probability of a reference answer sentence.
Optionally, the preprocessing unit is specifically configured to:
word segmentation is carried out on the current dialogue sentence to obtain a word sequence;
And carrying out word embedding processing on each word in the word sequence, and converting each word into a word vector to obtain the word vector sequence.
Optionally, the answer sentence acquisition unit is specifically configured to:
encoding the word vector sequence by using an encoder to obtain a hidden state;
Decoding the hidden state by using a decoder to obtain an output sequence, and stopping decoding when the last character of the output sequence is a stop symbol;
and inputting the decoded output sequence and the sentence function tag into the generated dialogue model, and outputting the answer sentence of the current dialogue sentence by adopting a generated dialogue method.
Optionally, the expression of the hidden state is as follows:
h_t = f(x_t, h_{t-1}); c = h_t
wherein h_t is the hidden state at time t, f is an activation function (a nonlinear function), x_t is the t-th word vector in the word vector sequence, h_{t-1} is the hidden state at time t-1, and c is the context vector.
Optionally, the loss function is a weighted average of an output sentence loss factor and a classification loss factor, and the expression of the loss function is as follows:
loss = α·lossA + β·lossB
wherein loss is the loss function, α and β represent the weights of the weighted average, both take values between 0 and 1, and α + β = 1;
wherein the expression of the output sentence loss factor is as follows:
lossA = -Σ_{i=1}^{n} log P(y_i | y_1, …, y_{i-1})
wherein lossA represents the output sentence loss factor, n represents the sentence length in the answer-sentence generation process, i represents the summation variable, P(y_i | y_1, …, y_{i-1}) represents the probability of generating the i-th word from the 1st through (i-1)-th words, and y_i represents the i-th word in the answer sentence;
the expression of the classification loss factor is as follows:
lossB = -Σ_{i=1}^{k} P(U|i) · log P(Y|i)
wherein k represents the number of sentence function classes, i represents the summation variable taking values 1 to k, Y represents the classification label corresponding to the answer sentence, U represents the classification label corresponding to the reference answer V, P(Y|i) represents the probability that the answer sentence obtains classification label Y, and P(U|i) represents the probability that the reference answer V output by the generated dialogue model obtains classification label U.
As can be seen from the above technical solution, the invention discloses a man-machine conversation method and system: an acquired current dialogue sentence is preprocessed to obtain a word vector sequence; the word vector sequence is input into a sentence classification model to obtain the sentence function label of the current dialogue sentence; and the word vector sequence and the sentence function label are input into a generated dialogue model, which outputs an answer sentence for the current dialogue sentence by a generative dialogue method. Because a certain proportion of question sentences is included in the sentence function labels used to train the sentence classification model, the answer sentences output by the generated dialogue model also contain a certain proportion of question sentences; and to avoid the generated dialogue model wrongly outputting sentence patterns other than question sentences when a question sentence should be output, a loss function is introduced that represents the loss of the classification probability of the answer sentence output by the generated dialogue model relative to the classification probability of the reference answer sentence.
Drawings
In order to illustrate the embodiments of the present invention or the technical solutions in the prior art more clearly, the drawings required for the description of the embodiments or the prior art are briefly introduced below. It is apparent that the drawings described below show only embodiments of the present invention; a person skilled in the art can obtain other drawings from the disclosed drawings without inventive effort.
FIG. 1 is a flow chart of a man-machine conversation method disclosed in an embodiment of the invention;
FIG. 2 is a flowchart of a method for inputting a word vector sequence and sentence function tags into a pre-trained generated dialogue model and outputting answer sentences of a current dialogue sentence by using the generated dialogue method according to the embodiment of the invention;
fig. 3 is a schematic structural diagram of a man-machine conversation system according to an embodiment of the present invention.
Detailed Description
The embodiments of the present invention are described below clearly and completely with reference to the accompanying drawings. It is apparent that the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person skilled in the art based on the embodiments of the invention without inventive effort fall within the scope of the invention.
The embodiment of the invention discloses a man-machine conversation method and system: an acquired current dialogue sentence is preprocessed to obtain a word vector sequence; the word vector sequence is input into a sentence classification model to obtain the sentence function label of the current dialogue sentence; and the word vector sequence and the sentence function label are input into a generated dialogue model, which outputs an answer sentence for the current dialogue sentence by a generative dialogue method. Because a certain proportion of question sentences is included in the sentence function labels used to train the sentence classification model, the answer sentences output by the generated dialogue model also contain a certain proportion of question sentences; and to avoid the generated dialogue model wrongly outputting sentence patterns other than question sentences when a question sentence should be output, a loss function is introduced that represents the loss of the classification probability of the answer sentence output by the generated dialogue model relative to the classification probability of the reference answer sentence.
Referring to fig. 1, a flow chart of a man-machine conversation method disclosed in an embodiment of the present invention includes:
step S101, acquiring a current dialogue statement;
step S102, preprocessing the current dialogue sentence to obtain a word vector sequence;
the preprocessing process for the current dialogue statement specifically comprises the following steps:
Word segmentation is carried out on the current dialogue sentence to obtain a word sequence;
And carrying out word embedding processing on each word in the word sequence, and converting each word into a word vector to obtain a word vector sequence.
It should be noted that word segmentation of the current dialogue sentence means cutting the dialogue sentence into a sequence whose units are words; this sequence is the word sequence.
In this embodiment, a word vector sequence may be obtained by performing word embedding processing on each word in the word sequence, where word embedding is a process of converting a word into a word vector of a preset length.
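The preprocessing just described (segmentation followed by word embedding) can be sketched as follows. The character-level segmenter and hash-based embedding are illustrative stand-ins, since the embodiment does not name a specific segmenter or embedding model:

```python
import hashlib

def segment(sentence):
    # Character-level segmentation stands in for a real Chinese
    # word segmenter (the embodiment does not name one).
    return list(sentence)

def embed(word, dim=8):
    # Deterministic toy embedding: hash the word into `dim` floats in [0, 1].
    # A trained embedding table would be used in practice.
    digest = hashlib.md5(word.encode("utf-8")).digest()
    return [b / 255.0 for b in digest[:dim]]

def preprocess(sentence, dim=8):
    # Dialogue sentence -> word sequence -> word vector sequence.
    return [embed(w, dim) for w in segment(sentence)]

vectors = preprocess("今天天气怎么样")
print(len(vectors), len(vectors[0]))  # one fixed-length vector per token
```

In a deployed system the segmenter and the embedding table would both be learned or configured components; only the pipeline shape is fixed by the method.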
Step S103, inputting the word vector sequence into a pre-trained sentence classification model to obtain a sentence function tag of the current dialogue sentence;
wherein, sentence function tags include four types: question sentences, statement sentences, imperative sentences and exclamation sentences.
The sentence classification model is obtained by training with word vector sequences of dialogue sentences as training samples and sentence function labels as labels, wherein the proportion of question sentences in the sentence function labels used to train the sentence classification model lies within a preset proportion interval, whose value is determined according to actual needs, for example 50%-60%.
A sentence's function is the user's purpose in expressing it; it is one of the important attributes in linguistics and, because it relates to the purpose of speaking, an important factor characterizing the interaction between the user and the dialogue system. Sentence functions fall into four types: question sentences, statement sentences, imperative sentences, and exclamation sentences. Before dialogue sentences are classified by function, they must be preprocessed: segment the dialogue sentence to obtain a word sequence, perform word embedding on each word in the word sequence, and convert each word into a word vector to obtain a word vector sequence.
The dialogue system in this embodiment may be formalized as follows: given a dialogue sentence context set D = W_1, W_2, …, W_{n-1}, W_n, generate a reply Y_n, where W_n represents the n-th dialogue sentence in the context set. In the invention, using only dialogue contexts and replies as training data is insufficient, because the generated dialogue model cannot directly acquire the current dialogue state; sentence function information is therefore required as a supplement. It prevents the generated dialogue model from outputting a monotonous statement sentence every time it answers, allowing it instead to vary among multiple sentence types, including actively asking the user questions.
Since sentence classification is supervised learning over sentence functions, all acquired historical dialogue sentences must be divided. In this embodiment they are divided into a training set and a test set; the training set may account for 10%-20% of all historical dialogue sentences, and each historical dialogue sentence in the training set is annotated with a sentence function label. The labeling method is not restricted: it may be manual, or labeling by another model followed by manual correction. The historical dialogue sentences in the test set are then labeled by the sentence classification model obtained through training.
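A minimal sketch of the dataset split described above; the 10%-20% figure follows the text, while the random shuffle and placeholder sentences are assumptions for illustration:

```python
import random

def split_history(dialogue_sentences, train_fraction=0.15, seed=0):
    # Split historical dialogue sentences into a training set
    # (10%-20% per the embodiment; 15% here) and a test set.
    rng = random.Random(seed)
    shuffled = dialogue_sentences[:]
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]

# Hypothetical corpus of 100 historical dialogue sentences.
history = [f"sentence_{i}" for i in range(100)]
train, test = split_history(history)
print(len(train), len(test))  # 15 85
```

The training-set sentences would then receive sentence function labels (manually or model-assisted), and the trained classifier would label the test set.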
In addition to the historical dialogue sentences in the training set, a certain proportion of statement sentences and question sentences must be added, so as to improve the dialogue system's ability to ask questions actively. The training data take the form of input-output sentence pairs (X, Y); during preprocessing of the dialogue sentences, sentence function classification is performed according to the method in step S103 to obtain the sentence function label D(Y), and the proportion of pairs whose D(Y) is a question sentence is 50%-60%.
When dialogue sentences are functionally classified, the input to the sentence classification model is a dialogue sentence U, consisting of the word vector sequence x_1, x_2, …, x_n, and the output is a sentence function label L.
The method of classifying dialogue sentences is not particularly limited in this embodiment; for example, a Recurrent Convolutional Neural Network (RCNN) can be used. Recurrent neural networks are the structure commonly used for sequential information such as text processing; meanwhile, sentences of different function types contain different keyword features whose relative positions do not affect classification, so the feature extraction capability of convolutional neural networks can also be exploited. The algorithmic model constructed by this sentence classification method is referred to as the sentence classification model.
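For illustration only, the classifier's input/output contract (a word sequence in, one of the four sentence function labels out) can be sketched with keyword cues standing in for the trained RCNN; the cue lists below are hypothetical, not part of the patent:

```python
# Hypothetical keyword cues standing in for learned RCNN features.
QUESTION_CUES = {"吗", "呢", "什么", "为什么", "?", "？"}
IMPERATIVE_CUES = {"请", "把", "不要"}
EXCLAMATION_CUES = {"!", "！", "啊"}

def classify_sentence_function(words):
    # Toy stand-in for the trained sentence classification model:
    # maps a word sequence to one of the four sentence function labels.
    tokens = set(words)
    if tokens & QUESTION_CUES:
        return "question"
    if tokens & IMPERATIVE_CUES:
        return "imperative"
    if tokens & EXCLAMATION_CUES:
        return "exclamation"
    return "statement"

print(classify_sentence_function(list("今天天气怎么样？")))  # question
```

The real model replaces the cue lookup with an RCNN forward pass over the word vector sequence, but the interface it exposes to step S103 is the same.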
In this embodiment, the sentence classification model is trained by supervised learning; each sample is a pair consisting of a word vector sequence as input data and a sentence function label as the desired output. The supervised learning algorithm is not limited: it analyzes the training data and produces an inference function that can predict labels for the validation set or the test set.
Step S104, inputting the word vector sequence and the sentence function label into a pre-trained generated dialogue model, and outputting the answer sentence of the current dialogue sentence by a generative dialogue method.
The generated dialogue model is learned by adopting a back propagation algorithm in the training process, and a loss function is converged to a minimum value, wherein the loss function is used for representing the loss condition of the classification probability of the answer sentence output by the generated dialogue model relative to the classification probability of the reference answer sentence.
In this embodiment, the generated dialogue model outputs answer sentences using the generative dialogue method. Generative dialogue trains an end-to-end deep neural network to obtain a generated dialogue model that produces answer sentences for dialogue sentences, so the model can generate more flexible answers.
During training, the real word vector sequences of the dialogue sentences are provided to the generated dialogue model, which learns via the back-propagation algorithm. The back-propagation algorithm trains a given feedforward neural network on given input patterns with known classifications; given the training set, it can converge the loss function to a minimum.
The dialogue system trains the model on a chat corpus and generates context-adaptive answer sentences from the dialogue sentences entered by the user. The generative dialogue employs an end-to-end deep neural network, also referred to as an encoder-decoder network.
The generated dialogue model learns a mapping from input to output: given an input sequence U = x_1, x_2, …, x_n (the word vectors together with the sentence function label L), where U may be a continuous sequence of n sentences and n is generally 3-5, the dialogue output is the answer sentence y_n, and the model learns by maximizing the conditional probability of y_n. Training requires reference answers V, where V represents the real answer to U in the training corpus, used as a reference during training. The end-to-end deep neural network can be implemented in various ways, but the implementation should satisfy the following:
Referring to fig. 2, a method flowchart for inputting a word vector sequence and a sentence function tag into a pre-trained generated dialogue model and outputting an answer sentence of a current dialogue sentence by using the generated dialogue method is disclosed in the embodiment of the invention, and the method comprises the following steps:
Step S201, encoding the word vector sequence by using an encoder to obtain a hidden state;
the word vector sequence is read word by word by the encoder and converted into a context vector through a recurrent neural network.
The hidden state of the word vector sequence is the context vector; its expression is shown in formula (1):
h_t = f(x_t, h_{t-1}); c = h_t (1);
wherein h_t is the hidden state at time t, f is an activation function, a nonlinear function that may be tanh, the gating unit of a long short-term memory network, or another nonlinearity, x_t is the t-th word vector in the word vector sequence, h_{t-1} is the hidden state at time t-1, and c is the context vector.
Step S202, decoding the hidden state by using a decoder to obtain an output sequence, and stopping decoding when the last character of the output sequence is a stop symbol;
The generation probability p(y_t | y_{t-1}, …, y_1, x) of the t-th word is shown in formula (2):
p(y_t | y_{t-1}, …, y_1, x) = g(y_{t-1}, s_t, c_t) (2);
wherein y_t is the word vector of the t-th word in the output sequence, y_{t-1} is the word vector of the (t-1)-th word, y_1 is the word vector of the 1st word, x is the encoding of the input sequence (the hidden state), g is the output function of the decoder's recurrent neural network unit, s_t is the hidden state of the decoder at time t, and c_t is the background variable of time step t.
The expression of the generation probability of the t-th word includes, but is not limited to, the expression shown in formula (2).
The expression of s_t is shown in formula (3):
s_t = f(y_{t-1}, s_{t-1}, c_t) (3);
wherein s_{t-1} is the hidden state of the decoder at time t-1, and f is an activation function, a nonlinear function that may be tanh, the gating unit of a long short-term memory network, or another nonlinearity.
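The decoding loop described above (emit words one at a time, carry the decoder state forward per s_t = f(y_{t-1}, s_{t-1}, c_t), and stop when the stop symbol appears) can be sketched as follows; `toy_step` is a hypothetical step function standing in for the trained decoder:

```python
def greedy_decode(step_fn, start_token, stop_token, max_len=20):
    # Decode token by token: each step consumes the previous token and
    # the decoder state, mirroring s_t = f(y_{t-1}, s_{t-1}, c_t).
    # Decoding stops when the stop symbol is emitted.
    state = 0
    y_prev = start_token
    output = []
    for _ in range(max_len):
        y_prev, state = step_fn(y_prev, state)
        output.append(y_prev)
        if y_prev == stop_token:
            break
    return output

def toy_step(y_prev, state):
    # Hypothetical step function: emits three words, then the stop symbol.
    words = ["你", "好", "呀", "<eos>"]
    return words[min(state, len(words) - 1)], state + 1

print(greedy_decode(toy_step, "<sos>", "<eos>"))
```

A real step function would compute g(y_{t-1}, s_t, c_t) over the vocabulary and pick the highest-probability word (or sample from the distribution).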
Step S203, inputting the decoded output sequence and the sentence function tag into the generated dialogue model, and outputting the answer sentence of the current dialogue sentence by adopting the generated dialogue method.
In the training process of the generated dialogue model, the real output sequence in the dialogue sentence is provided to the model, and learning is performed through the back-propagation algorithm. The back-propagation algorithm trains a given feedforward neural network on given input patterns with known classifications, and minimizes the loss function over the training set. The form of the loss function adopted in the present invention is defined below.
During training, the generated dialogue model obtains an output sequence y_n (i.e., an answer sentence) from the input sequence U (i.e., the word vector sequence and the sentence function tag) mentioned in step S104. The output sentence loss factor of the loss function is defined as lossA, and the classification loss factor as lossB; the latter is the key to implementing active dialogue in the present invention.
The loss function is a weighted average of the output sentence loss factor and the classification loss factor. The expression of the loss function loss is shown in formula (4), which is as follows:
loss = α · lossA + β · lossB (4)
where α and β represent the weights of the weighted average; α and β each take a value between 0 and 1, and α + β = 1.
The expression of the output sentence loss factor of the loss function in formula (4) is shown as formula (5), which is as follows:
lossA = -Σ_{i=1}^{n} log P(y_i | y_1, ..., y_{i-1}) (5)
where lossA denotes the output sentence loss factor, n denotes the total length of the current dialogue sentence, i denotes the summation variable ranging over the sentence length during answer generation, P denotes the probability of generating the i-th word given the 1st through (i-1)-th words, y_n denotes the n-th word in the answer sentence, y_1 denotes the 1st word in the answer sentence, and y_{n-1} denotes the (n-1)-th word in the answer sentence.
Assuming there are k sentence types, since the last layer of the classifier serves as the output layer, the classification probability distribution is a discrete probability distribution whose random variable takes values from 1 to k.
The expression of the classification loss factor lossB is shown in formula (6), which is as follows:
lossB = Σ_{i=1}^{k} P(U|i) log( P(U|i) / P(Y|i) ) (6)
where k represents the number of sentence function classes, i represents the summation variable taking values from 1 to k, Y represents the classification label corresponding to the answer sentence output by the generated dialogue model, U represents the classification label corresponding to the reference answer V, P(Y|i) represents the probability that the answer sentence output by the generated dialogue model receives classification label i, and P(U|i) represents the probability that the reference answer V receives classification label i.
If the classification of the output answer sentence is consistent with the classification in the dialogue sentence, the classification loss factor takes a smaller value, and if the classification is not consistent, the classification loss factor takes a larger value.
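Under the assumption that lossB is a KL-style divergence between the reference and output class distributions (the text describes only its behavior, i.e. lossB tends to 0 when the two distributions coincide, not its exact printed form), the weighted loss can be sketched as:

```python
import math

def loss_A(token_probs):
    """Output-sentence loss factor: negative log-likelihood of each generated
    word given its predecessors (one probability per word)."""
    return -sum(math.log(p) for p in token_probs)

def loss_B(p_ref, p_out, eps=1e-12):
    """Classification loss factor, sketched as the KL divergence between the
    reference-answer class distribution P(U|.) and the output-answer class
    distribution P(Y|.). KL is an assumption that matches the stated behavior:
    ~0 when the distributions coincide, larger when they differ."""
    return sum(u * math.log((u + eps) / (y + eps)) for u, y in zip(p_ref, p_out))

def total_loss(token_probs, p_ref, p_out, alpha=0.5, beta=0.5):
    assert abs(alpha + beta - 1.0) < 1e-9   # alpha + beta = 1
    return alpha * loss_A(token_probs) + beta * loss_B(p_ref, p_out)

# Identical class distributions -> lossB contributes ~0.
same = total_loss([0.9, 0.8], [0.5, 0.3, 0.1, 0.1], [0.5, 0.3, 0.1, 0.1])
# Mismatched distributions (statements where questions are expected).
diff = total_loss([0.9, 0.8], [0.5, 0.3, 0.1, 0.1], [0.1, 0.7, 0.1, 0.1])
print(diff > same)  # True: the classification penalty raises the loss
```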
The key to realizing active dialogue in the present invention is that the proportion of answers (i.e., the reference answers V in S203) in the corpus is adjusted in step S103, so that question sentences, praise sentences, and the like occupy a certain proportion. Through the classification loss factor lossB, the generated dialogue model adjusts its own parameters during training, outputs a new answer sentence after each adjustment, and recalculates the loss function until it converges to a small value. When the value of lossB tends to 0, the classification probability distribution of the answer sentences output by the generated dialogue model is basically consistent with that of the reference answer sentences in the training dialogue, and the model can be considered to have realized diversity of the output answer sentences, that is, active dialogue.
During the test, a trained model (the loss function has converged) is used as the model for the test.
Specifically, if the generated dialogue model should output a question sentence but incorrectly outputs a statement sentence or another sentence pattern, this is reflected in the loss factor: the wrong prediction type is suppressed, the model parameters are adjusted through the back-propagation algorithm, and the model eventually fits the output sentence-type distribution of the corpus, thereby realizing active dialogue.
In summary, the invention discloses a man-machine conversation method: the acquired current dialogue sentence is preprocessed to obtain a word vector sequence; the word vector sequence is input into a sentence classification model to obtain the sentence function label of the current dialogue sentence; and the word vector sequence and the sentence function label are input into a generated dialogue model, which outputs the answer sentence of the current dialogue sentence by the generated dialogue method. In the invention, a certain proportion of question sentences is included in the sentence function labels used to train the sentence classification model, so that the answer sentences output by the generated dialogue model also include a certain proportion of question sentences. To avoid the situation where the model should output a question sentence but wrongly outputs another sentence pattern, a loss function is introduced that represents the loss of the classification probability of the answer sentences output by the generated dialogue model relative to the classification probability of the reference answer sentences.
In practical application, the test of the generated dialogue model comprises the following three steps:
Step one, in an active dialogue of a generated dialogue model, testing the functional classification diversity of dialogue sentences:
Word segmentation and word embedding are performed on the dialogue data D = W_1, W_2, ..., W_{n-1} serving as test data to obtain a word vector sequence; the word vector sequence is input into the pre-trained sentence classification model to obtain the sentence function tag of the current dialogue sentence; then the word vector sequence and the sentence function tag are input into the pre-trained generated dialogue model, and the answer sentence of the current dialogue sentence is output by the generated dialogue method.
Step two, automatic evaluation:
A perplexity index may be employed to evaluate the generated dialogue model. Perplexity is an index for measuring the quality of a language model: it estimates the probability of a sentence occurring word by word and normalizes by the sentence length. Its expression is shown in formula (7), which is as follows:
PPL(S) = exp( -(1/n) Σ_{k=1}^{n} log p(S|S') ) (7)
where PPL(S) represents the perplexity of the dialogue sentence S, exp represents the exponential function, n represents the sentence length, k is the summation variable, S represents a dialogue sentence composed of a word sequence of length n, S' represents the word sequence composed of words 1 to k-1, and p(S|S') represents the probability that the k-th word of S is generated under the condition S'.
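A sketch of the per-word log-probability form of perplexity, assuming the model supplies one conditional probability per word of the sentence:

```python
import math

def perplexity(word_probs):
    """PPL(S) = exp(-(1/n) * sum_k log p(word_k | prefix)).
    word_probs holds the model's probability for each of the n words given
    its prefix; lower perplexity means a better language model."""
    n = len(word_probs)
    return math.exp(-sum(math.log(p) for p in word_probs) / n)

print(round(perplexity([0.25, 0.25, 0.25, 0.25]), 6))  # uniform 1/4 -> 4.0
```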
Another criterion for automatic evaluation is classification entropy: the adopted method is to count the proportion of each sentence function type, list the proportion into a table, and verify the proportion by using the concept of the classification entropy: since the entropy of the sentence function probability is related to the diversity of the output of the generated dialogue model, and the diversity of the sentence function affects the initiative of the dialogue system, the classification entropy of the generated sentence is an important index for comparing the distinction between the basic model and the improved model.
The classification entropy can be expressed as:
H = -Σ_{i=1}^{n} P(i) log P(i)
where i represents a sentence function label, n represents the number of sentence classification types (preferably four in the present invention: question sentences, statement sentences, imperative sentences, and exclamatory sentences), and P(i) represents the probability that a sentence classification belongs to the i-th classification label.
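The classification entropy H = -Σ P(i) log P(i) can be computed directly from the tabulated sentence-function proportions; this sketch uses the natural logarithm (the base is not specified in the text):

```python
import math

def classification_entropy(p):
    """H = -sum_i P(i) * log P(i) over the sentence-function classes
    (question, statement, imperative, exclamatory). Higher entropy means
    more diverse sentence functions in the generated replies; a model that
    only ever outputs one function has entropy 0."""
    return -sum(pi * math.log(pi) for pi in p if pi > 0)

print(round(classification_entropy([0.25] * 4), 6))  # ln(4) -> 1.386294
```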
Step three, manual evaluation:
The advantage of manual evaluation is higher accuracy. The answers generated in the model test are evaluated by a manual evaluation method, divided into several indexes:
1. Fluency of the answer sentence;
2. Contextual relevance of the answer sentence;
3. Initiative of the answer sentence.
Each index is given a score of 0-5 by the user, and the user scores are collected and recorded after the evaluation is finished.
Assuming that sentence classification models and generated dialogue models have been obtained through training, the following illustrates the process of user testing:
inputting a current dialogue sentence X:
I work in the sports center.
Preprocessing according to the method in the step S102, and segmenting the current dialogue sentence into four words:
I/at/sports center/work
Then converting into a word vector sequence x_1, ..., x_4;
Corresponding to step S201, the word vector sequence is encoded by the encoder to obtain the hidden state; the encoding is repeated until the hidden state h_4 is obtained.
Corresponding to step S202, the hidden state is decoded by the decoder to obtain the output sequence y_1, ..., y_n, and decoding stops when y_n is the stop symbol <EOS>. In this embodiment, it is assumed that y_6 = <EOS>.
The label D(Y) = 2 of the output sentence, obtained by the classification of S103, represents a question sentence; this result is the same as the output classification result of the training corpus.
For the output sequence y_1, ..., y_n, it is converted into an output sequence of natural language by looking up the word vector table.
The length of the output sequence obtained in this embodiment is assumed to be 5, specifically:
You/yes/what/department/
The sentence is a question sentence, and it can be seen that in this embodiment, the present invention outputs the question sentence, that is, an active dialogue is implemented. Results with statistical significance can be obtained by examining more examples.
Corresponding to the embodiment of the method, the invention also discloses a man-machine conversation system.
Referring to fig. 3, a schematic structural diagram of a man-machine dialogue system according to an embodiment of the present invention is disclosed, where the system includes:
a dialogue sentence acquisition unit 301 configured to acquire a current dialogue sentence;
a preprocessing unit 302, configured to preprocess the current dialogue sentence to obtain a word vector sequence;
the preprocessing unit 302 may specifically be configured to:
word segmentation is carried out on the current dialogue sentence to obtain a word sequence;
And carrying out word embedding processing on each word in the word sequence, and converting each word into a word vector to obtain the word vector sequence.
It should be noted that, word segmentation is performed on the current dialogue sentence, that is, the dialogue sentence is segmented into a sequence taking a word as a unit, and the sequence is a word sequence.
In this embodiment, a word vector sequence may be obtained by performing word embedding processing on each word in the word sequence, where word embedding is a process of converting a word into a word vector of a preset length.
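The preprocessing unit's two steps (word segmentation, then word embedding) can be sketched as a lookup into an embedding table. The toy vocabulary, random embeddings, and English tokens below are illustrative assumptions, not part of the invention; a real system would use a Chinese word segmenter and trained embeddings.

```python
import numpy as np

EMBED_DIM = 4  # preset word-vector length
rng = np.random.default_rng(42)
vocab = {"I": 0, "at": 1, "sports center": 2, "work": 3, "<UNK>": 4}
embedding_table = rng.standard_normal((len(vocab), EMBED_DIM))

def preprocess(words):
    """Turn a pre-segmented word sequence into a word vector sequence by
    looking each word up in the embedding table (unknown words -> <UNK>)."""
    ids = [vocab.get(w, vocab["<UNK>"]) for w in words]
    return embedding_table[ids]          # shape: (len(words), EMBED_DIM)

x = preprocess(["I", "at", "sports center", "work"])
print(x.shape)  # (4, 4)
```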
A function tag obtaining unit 303, configured to input the word vector sequence to a pre-trained sentence classification model, so as to obtain a sentence function tag of the current dialogue sentence;
wherein, sentence function tags include four types: question sentences, statement sentences, imperative sentences and exclamation sentences.
The sentence classification model is obtained by training a word vector sequence of dialogue sentences and sentence function labels as training samples, wherein the ratio of question sentences in the sentence function labels of the training sentence classification model is in a preset ratio interval, and the value of the preset ratio interval is determined according to actual needs, such as 50% -60%.
Sentence function is the purpose of a user when expressing a sentence; it is one of the important attributes in linguistics and an important factor characterizing the interaction between the user and the dialogue system. Sentence functions are divided into four types: question sentences, statement sentences, imperative sentences, and exclamation sentences. Before classifying dialogue sentences according to function, the dialogue sentences need to be preprocessed. The preprocessing process comprises: segmenting the dialogue sentence to obtain a word sequence, performing word embedding on each word in the word sequence, and converting each word into a word vector to obtain a word vector sequence.
The dialogue system in this embodiment may be formalized as follows: given a dialogue sentence context set D = W_1, W_2, ..., W_{n-1}, W_n, generate a reply Y_n, where W_n represents the n-th dialogue sentence in the context set. In the present invention, merely using dialogue sentence context sets and replies as training data is insufficient, because the generated dialogue model cannot directly acquire the current dialogue state information; sentence function information is therefore required as a supplement. Sentence function information prevents the generated dialogue model from outputting a monotonous statement sentence each time a question is answered, enabling it instead to produce multiple sentence types, including actively asking the user questions.
Since the sentence classification process is supervised learning on sentence function, all acquired historical dialogue sentences need to be divided. In this embodiment, all the historical dialogue sentences are divided into a training set and a test set; the training set may account for 10%-20% of all historical dialogue sentences, and each historical dialogue sentence in the training set is labeled with a sentence function label. No specific labeling method is required: labels may be assigned manually, or produced by another model and then manually corrected. The sentence function labels of the historical dialogue sentences in the test set are marked by the sentence classification model obtained through training.
In addition, a certain proportion of statement sentences and question sentences needs to be added to the historical dialogue sentences in the training set, so as to improve the active questioning capability of the dialogue system. The training data takes the form (X, Y): in the preprocessing of the dialogue sentences, the input sentence X and the output sentence Y are subjected to sentence function classification according to the method of step S103 to obtain the sentence function label D(Y); among these labels, the proportion of D(Y) that are question sentences is 50%-60%.
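One way to realize the 50%-60% question proportion is to resample the corpus; the down-sampling strategy below is an illustrative assumption (the text only states that the proportion is adjusted in step S103), with toy labels and data.

```python
import random

QUESTION = "question"  # hypothetical label name for D(Y) = question sentence

def adjust_proportion(pairs, labels, target=0.55, seed=0):
    """Down-sample non-question (X, Y) pairs until question-sentence answers
    reach roughly the target proportion of the training data."""
    rnd = random.Random(seed)
    questions = [p for p, l in zip(pairs, labels) if l == QUESTION]
    others = [p for p, l in zip(pairs, labels) if l != QUESTION]
    # keep just enough non-question pairs that questions make up `target`
    keep = int(len(questions) * (1 - target) / target)
    rnd.shuffle(others)
    return questions + others[:keep]

pairs = [("x%d" % i, "y%d" % i) for i in range(100)]
labels = [QUESTION if i < 30 else "statement" for i in range(100)]
n_q = labels.count(QUESTION)
adjusted = adjust_proportion(pairs, labels)
ratio = n_q / len(adjusted)
print(0.5 <= ratio <= 0.6)  # True
```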
When the dialogue sentences are functionally classified, the input of the sentence classification model is a dialogue sentence U, composed of the word vector sequence x_1, x_2, ..., x_n, and the output is the sentence function label L.
And an answer sentence acquisition unit 304, configured to input the word vector sequence and the sentence function tag into a pre-trained generated dialogue model, and output an answer sentence of the current dialogue sentence by using a generated dialogue method.
The generated dialogue model is learned by adopting a back propagation algorithm in the training process, and a loss function is converged to a minimum value, wherein the loss function is used for representing the loss condition of the classification probability of the answer sentence output by the generated dialogue model relative to the classification probability of the reference answer sentence.
In this embodiment, the generated dialogue model outputs answer sentences using the generated dialogue method. Generated dialogue trains an end-to-end deep neural network to obtain a generated dialogue model that produces the answer sentences of dialogue sentences; the model can therefore generate more flexible answer sentences.
In the training process, the real word vector sequence in the dialogue sentence is provided to the generated dialogue model, which learns through the back-propagation algorithm. The back-propagation algorithm trains a given feedforward neural network on given input patterns with known classifications, and converges the loss function to a minimum over the training set.
In summary, the invention discloses a man-machine dialogue system: the acquired current dialogue sentence is preprocessed to obtain a word vector sequence; the word vector sequence is input into a sentence classification model to obtain the sentence function label of the current dialogue sentence; and the word vector sequence and the sentence function label are input into a generated dialogue model, which outputs the answer sentence of the current dialogue sentence by the generated dialogue method. In the invention, a certain proportion of question sentences is included in the sentence function labels used to train the sentence classification model, so that the answer sentences output by the generated dialogue model also include a certain proportion of question sentences. To avoid the situation where the model should output a question sentence but wrongly outputs another sentence pattern, a loss function is introduced that represents the loss of the classification probability of the answer sentences output by the generated dialogue model relative to the classification probability of the reference answer sentences.
To further optimize the above embodiment, the answer sentence acquisition unit 304 may specifically be configured to:
encoding the word vector sequence by using an encoder to obtain a hidden state;
Decoding the hidden state by using a decoder to obtain an output sequence, and stopping decoding when the last character of the output sequence is a stop symbol;
and inputting the decoded output sequence and the sentence function tag into the generated dialogue model, and outputting the answer sentence of the current dialogue sentence by adopting a generated dialogue method.
Wherein the expression of the hidden state is as follows:
h_t = f(x_t, h_{t-1}); c = h_t (1)
where h_t is the hidden state, f is an activation function (a nonlinear function), x_t is the t-th word vector in the word vector sequence, h_{t-1} is the hidden state of the word vector sequence at time t-1, and c is the context vector.
In this embodiment, the loss function is a weighted average of the output sentence loss factor and the classification loss factor, and the expression of the loss function is as follows:
loss = α · lossA + β · lossB
where loss is the loss function, α and β represent the weights of the weighted average, α and β each take a value between 0 and 1, and α + β = 1;
the expression of the output sentence loss factor is as follows:
lossA = -Σ_{i=1}^{n} log P(y_i | y_1, ..., y_{i-1})
where lossA represents the output sentence loss factor, n represents the total length of the current dialogue sentence, i represents the summation variable ranging over the sentence length during answer generation, P represents the probability of generating the i-th word from the 1st through (i-1)-th words, y_n represents the n-th word in the answer sentence, y_1 represents the 1st word in the answer sentence, and y_{n-1} represents the (n-1)-th word in the answer sentence;
the expression of the classification loss factor is as follows:
lossB = Σ_{i=1}^{k} P(U|i) log( P(U|i) / P(Y|i) )
where k represents the number of sentence function classes, i represents the summation variable taking values from 1 to k, Y represents the classification label corresponding to the answer sentence, U represents the classification label corresponding to the reference answer V, P(Y|i) represents the probability that the answer sentence receives classification label i, and P(U|i) represents the probability that the reference answer V output by the generated dialogue model receives classification label i.
It should be specifically noted that, in the system embodiment, the specific working principle of each component is referred to the corresponding portion of the method embodiment, and will not be described herein.
Finally, it is further noted that relational terms such as first and second, and the like are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Moreover, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
In the present specification, each embodiment is described in a progressive manner, and each embodiment is mainly described in a different point from other embodiments, and identical and similar parts between the embodiments are all enough to refer to each other.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (6)

1. A human-machine conversation method, comprising:
acquiring a current dialogue sentence;
preprocessing the current dialogue sentence to obtain a word vector sequence;
inputting the word vector sequence into a pre-trained sentence classification model to obtain a sentence function label of the current dialogue sentence, wherein the sentence classification model is obtained by training with the word vector sequence of dialogue sentences as training samples and the sentence function labels as labels, and the proportion of question sentences in the sentence function labels used to train the sentence classification model is within a preset proportion interval;
Inputting the word vector sequence and the sentence function label into a pre-trained generation type dialogue model, and outputting an answer sentence of the current dialogue sentence by adopting a generation type dialogue method, wherein the generation type dialogue model adopts a back propagation algorithm to learn in the training process, and a loss function converges to a minimum value, and the loss function is used for representing the loss condition of the classification probability of the answer sentence output by the generation type dialogue model relative to the classification probability of a reference answer sentence;
The step of inputting the word vector sequence and the sentence function label into a pre-trained generation type dialogue model, and outputting the answer sentence of the current dialogue sentence by adopting a generation type dialogue method, which comprises the following steps: encoding the word vector sequence by using an encoder to obtain a hidden state; decoding the hidden state by using a decoder to obtain an output sequence, and stopping decoding when the last character of the output sequence is a stop symbol; inputting the output sequence and the sentence function label obtained by decoding into the generated dialogue model, and outputting the answer sentence of the current dialogue sentence by adopting a generated dialogue method;
wherein the expression of the hidden state is as follows:
h_t = f(x_t, h_{t-1}); c = h_t
where h_t is the hidden state, f is an activation function, the activation function being a nonlinear function, x_t is the t-th word vector in the word vector sequence, h_{t-1} is the hidden state of the word vector sequence corresponding to time t-1, and c is the context vector.
2. The human-machine conversation method of claim 1 wherein the preprocessing the current conversation sentence to obtain a word vector sequence specifically includes:
word segmentation is carried out on the current dialogue sentence to obtain a word sequence;
And carrying out word embedding processing on each word in the word sequence, and converting each word into a word vector to obtain the word vector sequence.
3. The human-machine conversation method of claim 1, wherein the loss function is a weighted average of an output sentence loss factor and a classification loss factor, the loss function being expressed as:
loss = α · lossA + β · lossB
wherein loss is the loss function, α and β represent weights of the weighted average, α and β each take a value between 0 and 1, and α + β = 1;
wherein the expression of the output sentence loss factor is as follows:
lossA = -Σ_{i=1}^{n} log P(y_i | y_1, ..., y_{i-1})
wherein lossA represents the output sentence loss factor, n represents the total length of the current dialogue sentence, i represents the summation variable ranging over the sentence length during answer generation, P represents the probability of generating the i-th word from the 1st through (i-1)-th words, y_n represents the n-th word in the answer sentence, y_1 represents the 1st word in the answer sentence, and y_{n-1} represents the (n-1)-th word in the answer sentence;
the expression of the classification loss factor is as follows:
lossB = Σ_{i=1}^{k} P(U|i) log( P(U|i) / P(Y|i) )
wherein k represents the number of sentence function classes, i represents the summation variable taking values from 1 to k, Y represents the classification label corresponding to the answer sentence, U represents the classification label corresponding to the reference answer V, P(Y|i) represents the probability that the answer sentence receives classification label i, and P(U|i) represents the probability that the reference answer V output by the generated dialogue model receives classification label i.
4. A human-machine conversation system, comprising:
A dialogue sentence acquisition unit for acquiring a current dialogue sentence;
the preprocessing unit is used for preprocessing the current dialogue sentence to obtain a word vector sequence;
The function tag obtaining unit is used for inputting the word vector sequence into a pre-trained sentence classification model to obtain a sentence function tag of the current dialogue sentence, wherein the sentence classification model is obtained by training with the word vector sequence of dialogue sentences as training samples and the sentence function tags as labels, and the proportion of question sentences in the sentence function tags used to train the sentence classification model is within a preset proportion interval;
The answer sentence acquisition unit is used for inputting the word vector sequence and the sentence function label into a pre-trained generation type dialogue model, and outputting an answer sentence of the current dialogue sentence by adopting a generation type dialogue method, wherein the generation type dialogue model adopts a back propagation algorithm to learn in the training process, and a loss function is converged to a minimum value, and the loss function is used for representing the loss condition of the classification probability of the answer sentence output by the generation type dialogue model relative to the classification probability of a reference answer sentence;
The answer sentence acquisition unit is specifically configured to: encoding the word vector sequence by using an encoder to obtain a hidden state; decoding the hidden state by using a decoder to obtain an output sequence, and stopping decoding when the last character of the output sequence is a stop symbol; inputting the output sequence and the sentence function label obtained by decoding into the generated dialogue model, and outputting the answer sentence of the current dialogue sentence by adopting a generated dialogue method;
wherein the expression of the hidden state is as follows:
h_t = f(x_t, h_{t-1}); c = h_t
where h_t is the hidden state, f is an activation function, the activation function being a nonlinear function, x_t is the t-th word vector in the word vector sequence, h_{t-1} is the hidden state of the word vector sequence corresponding to time t-1, and c is the context vector.
5. The human-machine conversation system of claim 4 wherein the preprocessing unit is specifically configured to:
word segmentation is carried out on the current dialogue sentence to obtain a word sequence;
And carrying out word embedding processing on each word in the word sequence, and converting each word into a word vector to obtain the word vector sequence.
6. The human-machine conversation system of claim 4, wherein the loss function is a weighted average of an output sentence loss factor and a classification loss factor, the loss function being expressed as:
loss = α · lossA + β · lossB
wherein loss is the loss function, α and β represent weights of the weighted average, α and β each take a value between 0 and 1, and α + β = 1;
wherein the expression of the output sentence loss factor is as follows:
lossA = -Σ_{i=1}^{n} log P(y_i | y_1, ..., y_{i-1})
wherein lossA represents the output sentence loss factor, n represents the total length of the current dialogue sentence, i represents the summation variable ranging over the sentence length during answer generation, P represents the probability of generating the i-th word from the 1st through (i-1)-th words, y_n represents the n-th word in the answer sentence, y_1 represents the 1st word in the answer sentence, and y_{n-1} represents the (n-1)-th word in the answer sentence;
the expression of the classification loss factor is as follows:
lossB = Σ_{i=1}^{k} P(U|i) log( P(U|i) / P(Y|i) )
wherein k represents the number of sentence function classes, i represents the summation variable taking values from 1 to k, Y represents the classification label corresponding to the answer sentence, U represents the classification label corresponding to the reference answer V, P(Y|i) represents the probability that the answer sentence receives classification label i, and P(U|i) represents the probability that the reference answer V output by the generated dialogue model receives classification label i.
CN202110200723.6A 2021-02-23 2021-02-23 Man-machine conversation method and system Active CN112883179B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110200723.6A CN112883179B (en) 2021-02-23 2021-02-23 Man-machine conversation method and system

Publications (2)

Publication Number Publication Date
CN112883179A CN112883179A (en) 2021-06-01
CN112883179B true CN112883179B (en) 2024-05-24

Family

ID=76053827

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110200723.6A Active CN112883179B (en) 2021-02-23 2021-02-23 Man-machine conversation method and system

Country Status (1)

Country Link
CN (1) CN112883179B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019198386A1 (en) * 2018-04-13 2019-10-17 国立研究開発法人情報通信研究機構 Request rephrasing system, method for training of request rephrasing model and of request determination model, and conversation system
CN110990543A (en) * 2019-10-18 2020-04-10 平安科技(深圳)有限公司 Intelligent conversation generation method and device, computer equipment and computer storage medium

Non-Patent Citations (1)

Title
Zhao Yuqing; Xiang Yang. Deep reinforcement learning dialogue generation based on hierarchical encoding. Journal of Computer Applications. 2017, (10), full text. *

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant