CN111061850A

CN111061850A - Dialog state tracking method, system and device based on information enhancement

Info

Publication number: CN111061850A
Application number: CN201911276031.9A
Authority: CN
Inventors: 赵军; 何世柱; 刘康; 刘庆斌
Original assignee: Institute of Automation of Chinese Academy of Science
Current assignee: Institute of Automation of Chinese Academy of Science
Priority date: 2019-12-12
Filing date: 2019-12-12
Publication date: 2020-04-24
Anticipated expiration: 2039-12-12
Also published as: CN111061850B

Abstract

The invention belongs to the technical field of natural language processing, and particularly relates to a dialog state tracking method, a dialog state tracking system and a dialog state tracking device based on information enhancement, aiming at solving the problem that the accuracy of an unknown slot value generated by only utilizing context information of a dialog text in the conventional dialog state tracking method is poor. The system method comprises the steps that based on a dialog text of a user at the t moment, a dialog state at the t moment is obtained through a dialog state tracking model; the dialogue state at the time t comprises one or more slot value pairs and corresponding probabilities; the slot value pair comprises a slot and a slot value; the dialog state tracking model is constructed based on a neural network of an encoder-decoder architecture. The invention improves the accuracy of the generation of the unknown slot value.

Description

Dialog state tracking method, system and device based on information enhancement

Technical Field

The invention belongs to the technical field of natural language processing, and particularly relates to a dialogue state tracking method, system and device based on information enhancement.

Background

With the rapid development of the internet and the popularization of intelligent devices, more and more conversation systems appear in the daily life of people. These dialog systems tend to assist people in performing a number of tasks. For example, the device can help people to order food, play music and other tasks. In the intelligent dialogue system, how to understand the user intention and clarify the user request becomes the most important part of the dialogue system. Dialog state tracking is emerging to address this issue.

Dialog state tracking is an important part of task-oriented dialog systems and one of the difficult problems in dialog systems. The task is to automatically recognize structured user states from a plurality of rounds of dialogs with various expressions. These structured states are typically composed of some predefined slots and their slot values. For example, in a meal ordering task, there will be slots of 'food type', 'restaurant location' and 'price', as well as some known values (slot values in a predefined state library).

Conventional dialog state tracking methods are mostly based on one such assumption. All the slots and the slot values are predefined in advance, so that the dialogue state tracking task can be simplified into a classification task, model design can be facilitated through the simplification, and the performance of the model on a data set with a limit value is improved. However, these models have difficulty dealing with unknown bin values that are not in the predefined state library bin value pairs. To deal with unknown slot values, there have been methods that generate unknown slot values from similar context information by selecting words from the dialog as slot values, but this method can only utilize the context information in the dialog. Because of the variety of the utterance expression modes in the dialog, the context information often has no obvious regularity, so that a model which only utilizes the context information to generate an unknown slot value is often unreliable in a complex context, and the application of a dialog system in practice is influenced.

Disclosure of Invention

In order to solve the above-mentioned problem in the prior art, that is, to solve the problem that the accuracy of an unknown slot value generated by using only context information of a dialog text in the conventional dialog state tracking method is poor, in a first aspect of the present invention, a method for dialog state tracking based on information enhancement is provided, the method comprising:

based on the dialog text of the user at the time t, obtaining the dialog state at the time t through a dialog state tracking model; the dialogue state at the time t comprises one or more slot value pairs and corresponding probabilities; the slot value pair comprises a slot and a slot value;

the dialog state tracking model is constructed based on a neural network of an encoder-decoder architecture; the training method of the model comprises the following steps:

a100, acquiring a dialog text of a user at a time t based on a labeled dialog set, and acquiring the semantic association degree of each word in the dialog text with a slot and a slot value of each slot value pair in a predefined state library by a preset semantic association degree evaluation method;

step A200, obtaining the dialog state at the time t through a dialog state tracking model based on the dialog text at the time t of the user, the dialog state at the time t-1 and the semantic association degree; calculating a loss value of the dialogue state tracking model, and updating parameters of the dialogue state tracking model;

step A300, making t equal to t +1, and circularly executing the method of the steps A100-A200 until the session is ended;

step A400, based on the corresponding conversation states at all times, obtaining the clustered conversation states by a preset semantic clustering method, and adding the clustered conversation states into the predefined state library;

and step A500, updating the dialogue states at all times by adopting the method of the steps A100-A200, calculating the loss value of the dialogue state tracking model, and updating the parameters of the dialogue state tracking model to obtain the trained dialogue state tracking model.

In some preferred embodiments, the preset semantic relevance evaluation method includes a central relevance evaluation method and a peripheral relevance evaluation method;

the central association degree evaluation method obtains semantic association degree by solving the cosine similarity of each word in the dialog text and the word vector of each slot value to the middle slot;

the peripheral association degree evaluation method obtains semantic association degree by solving the cosine similarity between each word in the dialog text and the slot value in each slot value pair.

In some preferred embodiments, the dialog state tracking model includes a dialog utterance encoder, a previous dialog state encoder, a known value encoder, a current dialog state decoder;

the dialogue utterance encoder is used for replying the dialogue text of the user based on the dialogue text at the time t of the user and the dialogue text of the user at the time t-1, performing word vector splicing by a Glove word embedding method and an n-gram word embedding method, and obtaining dialogue utterance encoding expression by a spliced word vector sequence through a bidirectional long-short memory network;

the last step dialogue state encoder is used for splicing the slot value pairs in the dialogue state at the t-1 moment, and the spliced sequence is expressed by the last step dialogue state encoding through a bidirectional long-short memory network;

the known value encoder is used for splicing the slot values in each slot value pair of the predefined state library, and the spliced sequence is subjected to two-way long-short memory network to obtain the coded representation of the known slot value;

and the current conversation state decoder is used for decoding the conversation state at the time t through a bidirectional long-short memory network based on the conversation history representation formed by splicing the conversation utterance coding representation and the last conversation state coding representation and the slots in the value pairs of the slots of the current conversation state.

In some preferred embodiments, the word vector concatenation is performed by a Glove word embedding method or an n-gram word embedding method, and the method comprises the following steps:

based on a dialog text of a user at a time t and a dialog text of a system reply user at a time t-1, acquiring corresponding word vectors by a Glove word embedding method and an n-gram word embedding method respectively and splicing to obtain a user dialog word vector sequence and a system reply word vector sequence;

and splicing the user dialogue word vector sequence and the system reply word vector sequence.

In some preferred embodiments, the method of "obtaining the dialog state at time t by the dialog state tracking model" in step a200 is as follows:

obtaining the probability of each slot value pair in the conversation state at the time t, and if the slot values correspond to the middle slot value and the slot value in a one-to-one correspondence manner, selecting a known value with the highest probability and the probability greater than a preset probability threshold value as the slot value; the known value is a slot value in a predefined state library;

if the slot value is in a one-to-many relationship with the middle slot and the slot value, selecting a plurality of known values corresponding to the probability greater than a preset probability threshold value as the slot value corresponding to the slot;

otherwise, acquiring the reward value of each word in the conversation history representation according to a preset reward rule; and multiplying the probability corresponding to the slot value pair with the slot value in the conversation state, wherein the slot value is one-to-many, and taking the word with the highest probability value after multiplication as the conversation state.

In some preferred embodiments, the method of obtaining the reward value of each word in the dialog history representation according to a preset reward rule includes:

randomly acquiring a slot of a slot value to be acquired in a dialogue state of words in the dialogue history representation and at the time t and semantic association degrees of known values corresponding to the slot, and averaging all the semantic association degrees to obtain an average association degree;

if the selected word is the target word, the reward value is a preset reward value; if the average relevance of the selected words exceeds a preset threshold value, the reward value is the product of the preset reward value and the average relevance; otherwise, the reward value corresponding to the selected word is set to be 0.

In some preferred embodiments, in step a400, "obtaining the clustered dialog states by using a preset semantic clustering method" includes:

step A410, sorting each slot value pair in a first dialogue state according to the semantic association degree of each slot value pair in the first dialogue state and each slot value pair in a predefined state library, and selecting N slot value pairs as a second dialogue state; the first conversation state is a conversation state corresponding to all moments of the current conversation;

step A420, sorting the semantic association degrees of the slot values in the slot value pairs of the predefined state library according to the slot values of the second dialogue state, and selecting M slot value pairs as a third dialogue state;

and step A430, deleting the slot value pairs containing the set word part of speech and the stop word in the third dialogue state, and taking the rest slot value pairs as the dialogue state after clustering.

In a second aspect of the present invention, a system for session state tracking based on information enhancement is provided, the system includes an obtaining module and a session state tracking module;

the acquisition module is configured to acquire a dialog text of a user at the moment t as input information;

the dialogue state tracking module is configured to obtain a dialogue state at the time t through a dialogue state tracking model based on the input information; the dialogue state at the time t comprises one or more slot value pairs and corresponding probabilities; the slot value pair includes a slot and a slot value.

In a third aspect of the present invention, a storage device is provided, in which a plurality of programs are stored, and the program applications are loaded and executed by a processor to implement the above-mentioned dialog state tracking method based on information enhancement.

In a fourth aspect of the present invention, a processing apparatus is provided, which includes a processor, a storage device; a processor adapted to execute various programs; a storage device adapted to store a plurality of programs; the program is adapted to be loaded and executed by a processor to implement the above-described information-enhancement-based dialog state tracking method.

The invention has the beneficial effects that:

the invention improves the accuracy of the generation of the unknown slot value. The invention combines the semantic relation between each word and the slot-slot value in the user dialogue text as the enhanced information through two semantic association degree evaluation methods, and trains the dialogue state tracking model by adopting a reinforcement learning method. The method solves the technical problem of only utilizing context, can overcome the defects that the traditional method generates irrelevant values and can not generate rarely mentioned values, and improves the accuracy of state tracking.

Drawings

Other features, objects and advantages of the present application will become more apparent upon reading of the following detailed description of non-limiting embodiments thereof, made with reference to the accompanying drawings.

FIG. 1 is a schematic diagram of a training process of a dialog state tracking model based on an information-enhanced dialog state tracking method according to an embodiment of the present invention;

FIG. 2 is a block diagram of a dialog state tracking system based on information enhancement according to an embodiment of the present invention;

FIG. 3 is a diagram of training a conversation state tracking model based on a user's conversation in booking a restaurant, in accordance with one embodiment of the present invention;

fig. 4 is a schematic diagram of the effect of the invention compared to other methods according to an embodiment of the invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings, and it is apparent that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

The present application will be described in further detail with reference to the following drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the relevant invention and not restrictive of the invention. It should be noted that, for convenience of description, only the portions related to the related invention are shown in the drawings.

It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict.

The invention discloses a dialogue state tracking method based on information enhancement, which comprises the following steps:

In order to more clearly describe the dialog state tracking method based on information enhancement, the following describes in detail the steps of an embodiment of the method in conjunction with the accompanying drawings.

In the following preferred embodiment, the training method of the dialog state tracking model is first described in detail, and the dialog state obtained by the dialog state tracking method based on information enhancement is described in detail.

1. Training method of dialogue state tracking model

Step A100, obtaining a dialog text of a user at a time t based on a labeled dialog set, and obtaining the semantic association degree of each word in the dialog text with a slot and a slot value of each slot value pair in a predefined state library through a preset semantic association degree evaluation method.

In this embodiment, a dialog text at the time t of the user is obtained from a labeled dialog set, and the association between each word in the dialog and an ontology (a predefined state library including a plurality of slot value pairs), that is, the semantic association between each word in the dialog text and the slot and slot value of the predefined state library slot value pair, is evaluated. Regarding the evaluation of semantic relevance, the invention designs two methods: a center relevance evaluation method and a periphery relevance evaluation method.

The central relevance evaluation method is used for evaluating the semantic relation between the words in the dialog text and the slots of each slot value pair in the predefined state library. This degree of association is based on the assumption that values within the same slot in a conversation should have a significant semantic relationship with the slot.

The peripheral relevance evaluation method is used for evaluating the semantic relation between the words in the dialog text and the slot values of the slot value pairs of the predefined state library.

The semantic relationship is evaluated by cosine similarity of the word vectors. For each slot, we use the average of the semantic relationships of words to slots and slot values as their semantic relatedness.

Step A200, obtaining the dialog state at the time t through a dialog state tracking model based on the dialog text at the time t of the user, the dialog state at the time t-1 and the semantic association degree; and calculating the loss value of the dialogue state tracking model and updating the parameters of the dialogue state tracking model.

In the present embodiment, the dialog state tracking model is constructed based on a neural network of an encoder-decoder architecture, and the model includes a dialog utterance encoder, a previous dialog state encoder, a known value encoder, and a current dialog state decoder.

The conversational speech encoder is primarily used to encode the current user conversational text and the system response of the previous step (the system replies to the user's conversational text at the previous time) into a vector representation with context information for subsequent calculations. The method comprises the following specific steps:

firstly, based on a dialog text of a user at a time t and a dialog text of a system reply user at a time t-1, corresponding word vectors are obtained through a Glove word embedding method and an n-gram word embedding method respectively and are spliced to obtain a user dialog word vector sequence and a system reply word vector sequence. And splicing the user dialogue word vector sequence and the system reply word vector sequence to be used as the word vector input of the initial dialogue utterance.

The word vectors of the initial conversational utterances are then encoded cyclically using a bi-directional long-short memory network

Wherein, BilSTM is the long and short memory network coder used by us,

is the word vector of the initial conversational utterances we input, b_i-1The word vector for the initial conversational utterance at the previous time instant, i is the subscript. Therefore, each coded word vector can be provided with context information, and a more accurate conversational speech coding representation can be obtained.

The previous dialog state encoder is mainly used for encoding the user state (the dialog state at the previous moment) obtained in the previous dialog. This is mainly due to the inheritance of user states. If the user expressed some intent in a previous conversation turn. These intentions will be known by the system by default in later sessions. We therefore need to encode the last dialog state in the current round. The slot value pairs in the previous dialog state are spliced into a sequence, and then the sequence is encoded by using another two-way long-short memory network to obtain the previous dialog state encoding representation containing the context information. This representation is concatenated with the conversational speech encoder representation to yield a complete conversational history representation. The user may select a word from the dialog history representation as the dialog state. I.e. the word is used as information for the slot.

The known value encoder is primarily used to encode each known value (the bin value in each bin value pair of the predefined state library) into a vector representation. Some of the known values are composed of multiple words and we need a representation of their entirety for generation. Similarly, we encode the known values using a two-way long-short memory network, and we use the last vector representation output by the encoder as the known bin encoded representation.

The dialog state decoder takes the dialog history representation with the slot and attention weight as the decoder input, and obtains the decoding vector q^s。q^s＝LSTM(c^s,[z^s,e^s]) Wherein e is^sIs the slot name, z^sIs a dialog history representation with attention weight, c^sIs a hidden layer we get with the self-attention mechanism. c. C^sThe obtaining process of (2) is shown in the formulas (1), (3):

P^s＝softmax(a^s) (2)

wherein, a^sFor non-normalized weight distribution, W^s、b^sFor trainable parameters, M is a vector representation of the dialog history, P^sFor normalized weight distribution, d_mIs the length of M and i is subscript.

The process of obtaining a dialog history representation with attention weight is shown in equations (4) (5) (6):

P^z＝softmax(a^z) (5)

wherein v is^z、W^e、W^mFor trainable parameters, a^zFor non-normalized weight distribution, P^zIs a normalized weight distribution.

Then we can get the decoded state q by using a long and short memory network as the decoder^s，q^sProbability P that can be used to obtain a known value^vsThe calculation is shown in equation (7):

P^vs(v|s)＝sigmoid(q^s*V^s) (7)

wherein, is the dot product operation, V^sIs a vector representation of known values.

So that we can select a known value for each bin by this probability. For the selection of words in dialog text, we use the calculation method (8) (9) to obtain the probability P^ws：

P^ws(w|s)＝softmax(a^w) (9)

Wherein v is^o、W^oFor trainable parameters, a^wRepresented as non-normalized weights.

This probability may be a word selected as the probability for the current slot. However, in the dialog state tracking task, there is often a case that one slot is composed of a plurality of slot values, and the probability is obtained by using the method (10):

P^ws(w|s)＝sigmoid(a^w) (10)

for a slot containing one slot value, we select a known value with the highest probability and exceeding a threshold (preferably 0.5 in the present invention), if there is no known value that satisfies the condition, we select as the value a word of such condition, for a slot containing multiple slot values, we select all values with probabilities exceeding the threshold, and if there is no word that satisfies the condition in the dialog history representation.

For the probability P^vsWe train using the cross-entropy loss of label labels and probabilities in the dataset, for P^wsWe use the pointer of the target word in the dialogue as the training target, and then train by the reinforced learning of the cross entropy and semantic information of the training target and the prediction target, namely, the training is carried outAnd selecting a slot value by acquiring the probability of the words in the dialog text through reinforcement learning. The reinforcement learning is described in detail below:

taking words in the conversation history as an action space, and randomly sampling a word from the conversation space;

and evaluating the semantic relevance of the sampled words and the current slot. Calculating the semantic relations between the selected words and the current decoding slot and the known values of the slot, wherein the semantic relations are evaluated through cosine similarity of word vectors, and the average value of the semantic relations is taken as the degree of association;

if the selected word is just the target word, the prize value is determined to be the preset prize value, the invention is preferably set to 5, if the association degree exceeds the threshold value (0.68 in the first training and 0.75 in the second training), the prize value is set to 5 times the association degree, otherwise, the prize value is set to 0 if the word is stopped.

Based on the obtained reward value and the selection probability of the word in the dialogue text, the loss of reinforcement learning of the probability of selecting the word satisfying the condition as the slot value in the dialogue history is obtained, as shown in equation (11):

wherein the content of the first and second substances,

for the prize value corresponding to the selected word, L_rlTo reinforce the loss of learning.

And step a300, making t equal to t +1, and executing the method of steps a100-a200 in a loop until the session is ended.

In this embodiment, based on the current dialog between the user and the system, the dialog state tracking model is trained in a loop until the end.

And step A400, based on the corresponding conversation states at all times, obtaining the clustered conversation states by a preset semantic clustering method, and adding the clustered conversation states into the predefined state library.

In this embodiment, based on the dialog states at all times in the current dialog process, we perform semantic clustering to obtain a complete ontology. The method comprises the following specific steps:

step A410, sorting the semantic relations (semantic association degrees) of the slot values of the conversation states at all times and the slots of each slot value pair in a predefined state library, and removing noise according to the length set manually, namely selecting the first N slot value pairs as a second conversation state;

step A420, sorting the semantic relations of the slot values in the predefined state library based on the slot values in the second dialogue state, and removing more noises according to the length set manually, namely selecting the first M slot values as a third dialogue state;

step A430, judging the part of speech of the word in the multiple utterances according to the part of speech rules, removing words with incorrect part of speech, selecting correct values according to the context words of the word, and removing stop words to obtain a clustered conversation state, namely deleting the slot value pairs containing the part of speech of the set word and the stop words in the third conversation state. For example, the slot value pairs of the dialog state are only allowed to be nouns, and the slot value pairs of other parts of speech are deleted. And adding the clustered conversation states into a predefined state library.

As shown in FIG. 3, the present invention takes the example of a user booking a restaurant to interact with the system, and a complete training process of the model can be seen. Wherein, the speech coder is the abbreviation of the dialogue speech coder.

In this embodiment, based on the updated predefined state library, the semantic association and the dialog state corresponding to the dialog text at all times of the user are obtained again. Namely, the generalization capability of the model is improved, the values which do not appear in the predefined state library are added into the library, and the model is trained again based on the dialog text of the user until the trained dialog state tracking model is obtained.

2. Dialog state tracking method based on information enhancement

Based on the dialog text of the user at the time t, obtaining the dialog state at the time t through a dialog state tracking model; the dialogue state at the time t comprises one or more slot value pairs and corresponding probabilities; the slot value pair includes a slot and a slot value.

In this embodiment, based on the dialog text at the time t of the user, the trained dialog state tracking model obtains the dialog state at the time t. Other modules fed into the dialog system obtain the dialog's responses and policies.

In order to prove the effectiveness of the Dialog state tracking model of the present invention on the Dialog system, the Dialog state tracking Challenge 2 and Wizard of Oz 2 (simulation scenario Dialog 2 data set) corpora are used to perform testing, and the testing effect of the method and other existing methods is shown in table 1:

TABLE 1

In table 1, Models is the name of the model method, Joint is the Joint target accuracy, and Request is the accuracy of the Request slot. The comparison model comprises: a de-lexical network (Delexi.), a de-lexical + semantic dictionary (Delexi. + semdit), a neural state tracker-deep neural network (NBT-DNN), a neural state tracker-convolutional neural network (NBT-CNN), a global local attention network (GLAD), an extensible tracker (Scalable-DST), a pointer network (PtrNet), a pointer network + ontology (PtrNet + ont.), a copy enhanced tracker (CEDST), a copy enhanced tracker + ontology (CEDST + ont.).

The last one in Table 1 is the accuracy of the model method of the present invention, and it can be seen that on Wizard of Oz 2, the method exceeds all methods, and on Dialog State Tracking Change 2, exceeds all similar methods, but is slightly lower than GLAD, but GLAD utilizes the results of multiple speech recognitions in Dialog State Tracking Change 2, and it cannot handle unknown values, and the method of the present invention is not a similar method. As also shown in fig. 4, it can be seen that the present invention (TS-DST) has a good processing effect on less-mentioned (Rare occurrences) and Irrelevant values (Irrelevant values) compared to the prine method and the CEDST method.

A dialog state tracking system based on information enhancement according to a second embodiment of the present invention, as shown in fig. 2, includes: the system comprises an acquisition module and a conversation state tracking module;

It can be clearly understood by those skilled in the art that, for convenience and brevity of description, the specific working process and related description of the system described above may refer to the corresponding process in the foregoing method embodiment, and details are not described herein again.

It should be noted that, the dialog state tracking system based on information enhancement provided in the foregoing embodiment is only illustrated by the division of the foregoing functional modules, and in practical applications, the above functions may be allocated to different functional modules according to needs, that is, the modules or steps in the embodiment of the present invention are further decomposed or combined, for example, the modules in the foregoing embodiment may be combined into one module, or may be further split into multiple sub-modules, so as to complete all or part of the functions described above. The names of the modules and steps involved in the embodiments of the present invention are only for distinguishing the modules or steps, and are not to be construed as unduly limiting the present invention.

A storage device according to a third embodiment of the present invention stores therein a plurality of programs, which are adapted to be loaded by a processor and to implement the above-described information enhancement-based dialog state tracking method.

A processing apparatus according to a fourth embodiment of the present invention includes a processor, a storage device; a processor adapted to execute various programs; a storage device adapted to store a plurality of programs; the program is adapted to be loaded and executed by a processor to implement the above-described information-enhancement-based dialog state tracking method.

It can be clearly understood by those skilled in the art that, for convenience and brevity, the specific working processes and related descriptions of the storage device and the processing device described above may refer to the corresponding processes in the foregoing method examples, and are not described herein again.

Those of skill in the art would appreciate that the various illustrative modules, method steps, and modules described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both, and that programs corresponding to the software modules, method steps may be located in Random Access Memory (RAM), memory, Read Only Memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. To clearly illustrate this interchangeability of electronic hardware and software, various illustrative components and steps have been described above generally in terms of their functionality. Whether these functions are performed in electronic hardware or software depends on the intended application of the solution and design constraints. Those skilled in the art may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.

The terms "first," "second," and the like are used for distinguishing between similar elements and not necessarily for describing or implying a particular order or sequence.

The terms "comprises," "comprising," or any other similar term are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.

So far, the technical solutions of the present invention have been described in connection with the preferred embodiments shown in the drawings, but it is easily understood by those skilled in the art that the scope of the present invention is obviously not limited to these specific embodiments. Equivalent changes or substitutions of related technical features can be made by those skilled in the art without departing from the principle of the invention, and the technical scheme after the changes or substitutions can fall into the protection scope of the invention.

Claims

1. A dialogue state tracking method based on information enhancement is characterized by comprising the following steps:

2. The information enhancement-based dialog state tracking method according to claim 1, wherein the preset semantic association degree evaluation method comprises a central association degree evaluation method and a peripheral association degree evaluation method;

3. The information enhancement based dialog state tracking method of claim 1 wherein the dialog state tracking model comprises a dialog utterance encoder, a previous dialog state encoder, a known value encoder, a current dialog state decoder;

4. The information enhancement-based dialog state tracking method according to claim 3, wherein word vector concatenation is performed by a Glove word embedding method and an n-gram word embedding method, and the method comprises:

5. The information-enhancement-based dialog state tracking method according to claim 3, wherein the method for obtaining the dialog state at time t through the dialog state tracking model in step A200 is as follows:

6. The information enhancement-based dialog state tracking method according to claim 5, wherein the method for obtaining the reward value of each word in the dialog history representation according to a preset reward rule comprises the following steps:

7. The method for tracking dialog state based on information enhancement according to claim 1, wherein in step a400, "obtaining the clustered dialog state by a preset semantic clustering method" comprises:

8. A dialogue state tracking system based on information enhancement is characterized by comprising an acquisition module and a dialogue state tracking module;

the dialogue state module is configured to obtain a dialogue state at the time t through a dialogue state tracking model based on the input information; the dialogue state at the time t comprises one or more slot value pairs and corresponding probabilities; the slot value pair includes a slot and a slot value.

9. A storage device having a plurality of programs stored therein, wherein the program applications are loaded and executed by a processor to implement the information-based augmented session state tracking method of any one of claims 1-7.

10. A processing device comprising a processor, a storage device; a processor adapted to execute various programs; a storage device adapted to store a plurality of programs; characterized in that said program is adapted to be loaded and executed by a processor to implement the information enhancement based dialog state tracking method of any of claims 1-7.