CN111581978A

CN111581978A - Intention identification method through domain migration and antagonistic learning

Info

Publication number: CN111581978A
Application number: CN202010348401.1A
Authority: CN
Inventors: 周柳阳; 董世超; 赵勇
Original assignee: Shenzhen Yihao Hulian Technology Co ltd
Current assignee: Shenzhen Yihao Hulian Technology Co ltd
Priority date: 2020-04-28
Filing date: 2020-04-28
Publication date: 2020-08-25

Abstract

An intention identification method through domain migration and antagonistic learning relates to the technical field of data processing analysis and comprises the following steps: step 1): performing feature extraction on a large amount of labeled data and unlabeled data by adopting a pre-training model extractor; step 2): extracting input sentences input into each dialogue, and expressing the input sentences into a semantic vector through an extractor; step 3): inputting the semantic vector into a mapping layer, wherein source data corresponds to the source mapping layer, and target data is mapped to the target mapping layer; step 4): the target mapping layer distinguishes whether the output target data is real or not through a classifier. The invention inhibits the efficiency of the classifier in the field of target data, constructs real labels and false labels aiming at the target data so as to achieve the effect of resisting learning, and can train the source data, the mapping layer and the classifier thereof in advance, thereby improving the accuracy and reducing the cost consumption of manpower and material resources.

Description

Intention identification method through domain migration and antagonistic learning

Technical Field

The invention relates to the technical field of data processing and analysis, in particular to an intention identification method for migration and counterstudy through fields.

Background

The intention recognition is used as a component of a modularized dialogue system, plays roles in recognizing user intention and judging dialogue fields, the current intention recognition task is usually used as a classification model in machine learning for modeling, the used learning mode is mostly supervised learning, and the supervised learning is to give a section of data representation and a corresponding label to train the model discrimination capability. However, supervised classification learning is successfully built on an assumption that the data distribution of training data is consistent or similar to test data, but in the real world, this assumption often does not exist; secondly, a large amount of labeled data is needed for model training in supervised learning, which brings cost consumption of manpower and material resources.

Disclosure of Invention

Technical problem to be solved

Aiming at the defects of the prior art, the invention provides an intention identification method through field migration and antagonistic learning, which solves the problem that supervised classification learning is successfully established on the assumption that the data distribution of training data is consistent or similar to test data, but in the real world, the assumption does not exist; secondly, a large amount of labeled data is needed for model training in supervised learning, which brings the problem of cost consumption of manpower and material resources.

(II) technical scheme

In order to achieve the purpose, the invention is realized by the following technical scheme: an intention recognition method through domain migration and antagonistic learning, comprising the steps of:

step 1): performing feature extraction on a large amount of labeled data and unlabeled data by adopting a pre-training model extractor;

step 2): extracting input sentences input into each dialogue, and expressing the input sentences into a semantic vector through an extractor;

step 3): inputting the semantic vector into a mapping layer, wherein source data corresponds to the source mapping layer, and target data is mapped to the target mapping layer;

step 4): the target mapping layer distinguishes whether the output target data is real or not through a classifier.

Preferably, the pre-training model extractor adopts a Chinese ERNIE model.

Preferably, the labeled data is source data, and the unlabeled data is target data.

Preferably, the mapping layer is implemented by a recurrent neural network.

Preferably, the source mapping layer outputs to a source classifier for classifying the marked source data.

Preferably, for the multi-label classification output, a plurality of sigmoid functions are adopted to perform a plurality of binary classification outputs.

(III) advantageous effects

The invention provides an intention identification method for migration and counterstudy through a field. Has the following beneficial effects:

(1) the method for identifying the intention of the user to move and resist learning through the field comprises the steps of extracting features of a large amount of labeled data and unlabeled data by adopting a pre-training model extractor; extracting input sentences input into each dialogue, and expressing the input sentences into a semantic vector through an extractor; inputting the semantic vector into a mapping layer, wherein source data corresponds to the source mapping layer, and target data is mapped to the target mapping layer; the target mapping layer distinguishes whether the output real target data exists through a classifier, the effect of resisting learning is achieved, the source data, the mapping layer and the classifier are trained in advance, the accuracy is improved, and the cost consumption of manpower and material resources is reduced.

Drawings

FIG. 1 is a flow chart of an intent recognition method of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Referring to fig. 1, the present invention provides a technical solution: an intention recognition method through domain migration and antagonistic learning, comprising the steps of:

step 1): performing feature extraction on a large amount of labeled data and unlabeled data by adopting a pre-training model extractor, wherein the pre-training model extractor adopts a Chinese ERNIE model, labeled data are source data, and unlabeled data are target data;

step 3): inputting the semantic vector into a mapping layer, wherein the mapping layer is realized by a recurrent neural network, source data corresponds to a source mapping layer, the source mapping layer outputs the source data to a source classifier, the labeled source data is classified, and target data is mapped to a target mapping layer;

step 4): the target mapping layer distinguishes whether the output target data is real or not through a classifier, and multiple S-shaped functions are adopted to carry out multiple two-classification output aiming at the multi-label classification output.

It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation. The use of the phrase "comprising one of the elements does not exclude the presence of other like elements in the process, method, article, or apparatus that comprises the element.

Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims

1. An intention recognition method through domain migration and antagonistic learning, characterized in that: the method comprises the following steps:

2. The method of claim 1, wherein the method comprises the following steps: the pre-training model extractor adopts a Chinese ERNIE model.

3. The method of claim 1, wherein the method comprises the following steps: the marked data are source data, and the unmarked data are target data.

4. The method of claim 1, wherein the method comprises the following steps: the mapping layer is implemented by a recurrent neural network.

5. The method of claim 1, wherein the method comprises the following steps: the source mapping layer is output to a source classifier for classifying the marked source data.

6. The method of claim 1, wherein the method comprises the following steps: and aiming at the multi-label classified output, a plurality of S-shaped functions are adopted to carry out a plurality of two-classification outputs.