CN112818097A - Off-task training system based on dialog state tracking model - Google Patents


Info

Publication number
CN112818097A
Authority
CN
China
Prior art keywords
module
task
auxiliary
dst
training
Prior art date
Legal status
Pending
Application number
CN202110104849.3A
Other languages
Chinese (zh)
Inventor
潘晓光
焦璐璐
令狐彬
宋晓晨
韩丹
Current Assignee
Shanxi Sanyouhe Smart Information Technology Co Ltd
Original Assignee
Shanxi Sanyouhe Smart Information Technology Co Ltd
Priority date
Filing date
Publication date
Application filed by Shanxi Sanyouhe Smart Information Technology Co Ltd filed Critical Shanxi Sanyouhe Smart Information Technology Co Ltd
Priority to CN202110104849.3A
Publication of CN112818097A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/332 Query formulation
    • G06F16/3329 Natural language query formulation or dialogue systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214 Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/24 Classification techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Devices For Executing Special Programs (AREA)

Abstract

The invention belongs to the field of natural language data processing, and particularly relates to an off-task training system based on a dialog state tracking model. The invention supports model training with auxiliary task data, in particular through the use of MTL, and greatly improves performance on difficult tasks. At the same time, it opens the door to a large body of unrelated natural language processing corpora: auxiliary tasks defined on broad non-dialogue corpora alleviate the data sparsity problem in DST. The method is used for off-task training of the tracking model.

Description

Off-task training system based on dialog state tracking model
Technical Field
The invention belongs to the field of natural language data processing, and particularly relates to an off-task training system based on a dialog state tracking model.
Background
In today's task-oriented dialogue systems, the role of the dialogue state tracker is to summarize the dialogue history so far and to extract the user's goals. Dialogue State Tracking (DST) is severely affected by data sparsity. While many Natural Language Processing (NLP) tasks benefit from transfer learning and multi-task learning, in dialogue these methods are limited by the amount of data available and by the specificity of the dialogue application; dialogue state tracking thus suffers from serious data sparsity, and existing natural language processing approaches either do not solve this problem or do not work well on the dialogue in question.
Disclosure of Invention
Aiming at the technical problem that dialogue state tracking is severely affected by data sparsity, the invention provides an off-task training system based on a dialogue state tracking model that is efficient, low in error and highly stable.
In order to solve the technical problems, the invention adopts the technical scheme that:
a task external training system based on a dialog box state tracking model comprises a DST module, an auxiliary task module, an ITFT module and an MTL module, wherein the ITFT module is connected with the MTL module, the MTL module is connected with the DST module, and the MTL module is connected with the auxiliary task module;
the DST module is used for extracting meaning and intention from user input and keeping and updating the information in the continuous process of the conversation;
the auxiliary task module is used for supporting model training;
the ITFT module is used for guiding the parameters of the encoder to a favorable direction so that subsequent fine adjustment can find better local optimization;
the MTL module is used to train the same model between the auxiliary task and the target task simultaneously.
In the DST module, DST (dialog state tracking) processes the data set using the DST model TripPy, and a RoBERTa encoder is used because BERT's segment distinction does not adapt well to dialogue.
The auxiliary task module comprises sentence-level and sentence-pair-level classification tasks, and adopts the following training constraints: the auxiliary task is either a classification problem or a span prediction problem; only one auxiliary task is used at a time.
The ITFT module is an intermediate-task fine-tuning module that trains the same model successively on two unrelated tasks, namely an auxiliary task and the DST task.
The MTL module is a multi-task learning module: DST training is performed at every step, with additional training on the auxiliary task; training alternates between the auxiliary task and the target task at the step level, the two tasks share one optimizer, and two consecutive updates are performed.
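The module layout described above can be sketched as follows. This is an illustrative reconstruction only: the `Module` class, `connect` method and `build_system` helper are invented for the sketch, and the patent specifies no implementation.

```python
# Illustrative sketch of the stated module wiring: the ITFT module
# connects to the MTL module, which in turn connects to both the DST
# module and the auxiliary task module. All names are invented here.


class Module:
    def __init__(self, name):
        self.name = name
        self.connections = set()

    def connect(self, other):
        # Connections between cooperating modules are bidirectional.
        self.connections.add(other.name)
        other.connections.add(self.name)


def build_system():
    dst = Module("DST")
    aux = Module("auxiliary")
    itft = Module("ITFT")
    mtl = Module("MTL")
    itft.connect(mtl)   # ITFT module is connected with the MTL module
    mtl.connect(dst)    # MTL module is connected with the DST module
    mtl.connect(aux)    # MTL module is connected with the auxiliary module
    return {m.name: m for m in (dst, aux, itft, mtl)}
```

The MTL module is the hub of this topology, consistent with its role of scheduling training between the DST target task and the auxiliary task.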
Compared with the prior art, the invention has the following beneficial effects:
the invention is beneficial to supporting model training through auxiliary task data, especially the use of MTL, and greatly improves the performance of processing high-difficulty tasks. Meanwhile, the method opens a door for a large number of irrelevant natural language processing corpora, and the corpora are defined in a wide non-dialogue task to relieve the data sparsity problem in the DST.
Drawings
FIG. 1 is a flow chart of the main steps of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
An off-task training system based on a dialog state tracking model is disclosed, as shown in FIG. 1, comprising a DST module, an auxiliary task module, an ITFT module and an MTL module, wherein the ITFT module is connected with the MTL module, the MTL module is connected with the DST module, and the MTL module is connected with the auxiliary task module;
Further, the DST module is used for extracting meaning and intention from user input and retaining and updating this information as the conversation continues;
Further, the auxiliary task module is used for effectively supporting model training;
Further, the ITFT module is used for steering the encoder parameters in a favorable direction so that subsequent fine-tuning can find a better local optimum;
Further, the MTL module is used to train the same model on the auxiliary task and the target task simultaneously.
Further, in the DST module, DST (dialog state tracking) extracts meaning and intention from user input and retains and updates this information as the dialogue continues. The published DST model TripPy is used; its performance depends on the individual contributions of the context encoder, the slot gate and the span prediction head, i.e. any of these parts may benefit from off-task training, which makes state-of-the-art performance on the data set possible. A RoBERTa encoder was chosen because BERT's segment distinction does not adapt well to dialogue.
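The three TripPy components named above (context encoder, slot gate, span prediction) can be illustrated with a toy sketch. Everything here is an assumption for illustration: `DummyEncoder` merely stands in for a pretrained RoBERTa encoder, and the gate labels and threshold are invented, not taken from TripPy or the patent.

```python
# Toy sketch of a TripPy-style DST head structure: a shared context
# encoder feeds a slot gate (classification) and a span prediction
# head. All values and thresholds below are invented stand-ins.

from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class DummyEncoder:
    """Stand-in for a pretrained context encoder such as RoBERTa."""
    hidden_size: int = 8

    def encode(self, tokens: List[str]) -> List[List[float]]:
        # One hidden vector per token (trivial length-based features).
        return [[float(len(t))] * self.hidden_size for t in tokens]


def slot_gate(cls_vec: List[float]) -> str:
    """Classify how a slot is filled this turn (TripPy uses gate labels
    such as none / copy_value; the threshold here is made up)."""
    return "copy_value" if sum(cls_vec) > 10.0 else "none"


def span_head(hidden: List[List[float]]) -> Dict[str, int]:
    """Pick start/end token indices for a value span (argmax stand-in)."""
    scores = [sum(h) for h in hidden]
    start = scores.index(max(scores))
    return {"start": start, "end": start}


def dst_turn(encoder: DummyEncoder, tokens: List[str]) -> Dict[str, object]:
    hidden = encoder.encode(tokens)
    gate = slot_gate(hidden[0])  # [CLS]-style summary vector
    span: Optional[Dict[str, int]] = (
        span_head(hidden) if gate == "copy_value" else None
    )
    return {"gate": gate, "span": span}
```

Because the encoder is shared by both heads, off-task training that improves the encoder can, as the description argues, benefit every downstream component at once.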
Further, in the auxiliary task module, auxiliary tasks unrelated to DST are considered, comprising sentence-level and sentence-pair-level classification tasks aimed at capturing linguistic phenomena. The following training constraints were found applicable: first, the auxiliary task may be either a classification problem or a span prediction problem; second, only one auxiliary task is used at a time. The latter makes it possible to clearly identify the effect of a specific auxiliary task.
Further, in the ITFT module, ITFT (Intermediate-Task Fine-Tuning) trains the same model successively on two unrelated tasks, namely the auxiliary task and the DST task. The purpose of ITFT is to steer the encoder parameters in a favorable direction so that subsequent fine-tuning can find a better local optimum.
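The two-phase training described for ITFT — first the auxiliary task, then DST on the same parameters — can be sketched with a toy one-parameter model. The loss gradients and their targets (1.0 and 2.0) are arbitrary stand-ins invented for this sketch, not anything specified by the patent.

```python
# Toy sketch of intermediate-task fine-tuning (ITFT): the same weights
# are trained first on an auxiliary objective, then fine-tuned on the
# DST objective, so the auxiliary phase steers the parameters.


def sgd_step(weights, grads, lr=0.1):
    """One plain gradient-descent update."""
    return [w - lr * g for w, g in zip(weights, grads)]


def aux_grad(weights):
    # Invented auxiliary-task gradient: pulls weights toward 1.0.
    return [w - 1.0 for w in weights]


def dst_grad(weights):
    # Invented DST gradient: pulls weights toward 2.0.
    return [w - 2.0 for w in weights]


def itft(weights, aux_steps=50, dst_steps=50):
    # Phase 1: fine-tune on the auxiliary task to steer the parameters.
    for _ in range(aux_steps):
        weights = sgd_step(weights, aux_grad(weights))
    # Phase 2: continue fine-tuning the *same* parameters on DST.
    for _ in range(dst_steps):
        weights = sgd_step(weights, dst_grad(weights))
    return weights
```

After phase 1 the weights sit near the auxiliary optimum, so phase 2 starts from a different (and, per the patent's argument, more favorable) point than training on DST alone would.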
Further, in the MTL module, MTL (Multi-Task Learning) trains the same model on two unrelated tasks simultaneously. DST training is performed at every step, with additional training on the auxiliary task; that is, at the step level, training alternates between the auxiliary task and the target task, the two tasks share one optimizer, and two consecutive updates are performed.
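The step-level alternation described for MTL can likewise be sketched with the same kind of toy model: one DST update immediately followed by one auxiliary-task update at every step, on shared parameters. Again, the gradients and their targets are invented for illustration.

```python
# Toy sketch of the MTL schedule described above: at each training step
# the model gets one update on the target (DST) task and one on the
# auxiliary task, back to back, on the same parameters. The objectives
# are invented stand-ins (DST pulls toward 2.0, auxiliary toward 1.0).


def sgd_step(weights, grads, lr=0.1):
    return [w - lr * g for w, g in zip(weights, grads)]


def dst_grad(weights):
    return [w - 2.0 for w in weights]


def aux_grad(weights):
    return [w - 1.0 for w in weights]


def mtl_train(weights, steps=100):
    for _ in range(steps):
        # Two consecutive updates per step; the shared optimizer is
        # modeled simply as the same learning rate and parameter vector.
        weights = sgd_step(weights, dst_grad(weights))  # target task
        weights = sgd_step(weights, aux_grad(weights))  # auxiliary task
    return weights
```

Unlike the sequential ITFT schedule, this alternation drives the parameters toward a compromise between the two objectives rather than the optimum of either alone.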
All the modules can be packaged into an application program and, by calling one another through their interfaces, cooperatively implement the off-task training function based on the dialog state tracking model.
Although only the preferred embodiments of the present invention have been described in detail, the present invention is not limited to the above embodiments, and various changes can be made without departing from the spirit of the present invention within the knowledge of those skilled in the art, and all changes are encompassed in the scope of the present invention.

Claims (5)

1. An off-task training system based on a dialog state tracking model, characterized in that: the system comprises a DST module, an auxiliary task module, an ITFT module and an MTL module, wherein the ITFT module is connected with the MTL module, the MTL module is connected with the DST module, and the MTL module is connected with the auxiliary task module;
the DST module is used for extracting meaning and intention from user input and retaining and updating this information as the conversation continues;
the auxiliary task module is used for supporting model training;
the ITFT module is used for steering the encoder parameters in a favorable direction so that subsequent fine-tuning can find a better local optimum;
the MTL module is used to train the same model on the auxiliary task and the target task simultaneously.
2. The off-task training system based on the dialog state tracking model of claim 1, wherein: in the DST module, DST (dialog state tracking) processes the data set using the DST model TripPy, and a RoBERTa encoder is used because BERT's segment distinction does not adapt well to dialogue.
3. The off-task training system based on the dialog state tracking model of claim 1, wherein: the auxiliary task module comprises sentence-level and sentence-pair-level classification tasks, and adopts the following training constraints: the auxiliary task is either a classification problem or a span prediction problem; only one auxiliary task is used at a time.
4. The off-task training system based on the dialog state tracking model of claim 1, wherein: the ITFT module is an intermediate-task fine-tuning module that trains the same model successively on two unrelated tasks, namely an auxiliary task and the DST task.
5. The off-task training system based on the dialog state tracking model of claim 1, wherein: the MTL module is a multi-task learning module: DST training is performed at every step, with additional training on the auxiliary task; training alternates between the auxiliary task and the target task at the step level, the two tasks share one optimizer, and two consecutive updates are performed.
CN202110104849.3A 2021-01-26 2021-01-26 Off-task training system based on dialog state tracking model Pending CN112818097A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110104849.3A CN112818097A (en) 2021-01-26 2021-01-26 Off-task training system based on dialog state tracking model

Publications (1)

Publication Number Publication Date
CN112818097A (en) 2021-05-18

Family

ID=75859417

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110104849.3A Pending CN112818097A (en) 2021-01-26 2021-01-26 Off-task training system based on dialog state tracking model

Country Status (1)

Country Link
CN (1) CN112818097A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107357838A (en) * 2017-06-23 2017-11-17 上海交通大学 Dialog strategy canbe used on line method based on multi-task learning
CN109885668A (en) * 2019-01-25 2019-06-14 中译语通科技股份有限公司 A kind of expansible field interactive system status tracking method and apparatus
CN110209791A (en) * 2019-06-12 2019-09-06 百融云创科技股份有限公司 It is a kind of to take turns dialogue intelligent speech interactive system and device more
US20200152184A1 (en) * 2018-11-08 2020-05-14 PolyAI Limited Dialogue system, a dialogue method, a method of generating data for training a dialogue system, a system for generating data for training a dialogue system and a method of training a dialogue system
CN111241279A (en) * 2020-01-07 2020-06-05 华东师范大学 Natural language relation extraction method based on multi-task learning mechanism
CN112164476A (en) * 2020-09-28 2021-01-01 华南理工大学 Medical consultation conversation generation method based on multitask and knowledge guidance



Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20210518