CN112860870A - Noise data identification method and equipment - Google Patents


Info

Publication number
CN112860870A
CN112860870A (application CN202110283194.0A)
Authority
CN
China
Prior art keywords
data
result
loss
training data
sample
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110283194.0A
Other languages
Chinese (zh)
Other versions
CN112860870B (en)
Inventor
张勇
刘升平
梁家恩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Unisound Intelligent Technology Co Ltd
Xiamen Yunzhixin Intelligent Technology Co Ltd
Original Assignee
Unisound Intelligent Technology Co Ltd
Xiamen Yunzhixin Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Unisound Intelligent Technology Co Ltd, Xiamen Yunzhixin Intelligent Technology Co Ltd filed Critical Unisound Intelligent Technology Co Ltd
Priority to CN202110283194.0A priority Critical patent/CN112860870B/en
Publication of CN112860870A publication Critical patent/CN112860870A/en
Application granted granted Critical
Publication of CN112860870B publication Critical patent/CN112860870B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/332 Query formulation
    • G06F16/3329 Natural language query formulation or dialogue systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis
    • G06F40/35 Discourse or dialogue representation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00 Computing arrangements using knowledge-based models
    • G06N5/04 Inference or reasoning models

Abstract

The invention provides a noise data identification method and equipment, comprising the following steps: acquiring original training data; performing forward inference on the original training data to obtain a prediction result; computing a loss result from the original training data and the prediction result; differentiating with respect to the original training data based on the loss result to obtain gradient data; transforming the sample feature data based on the gradient data to obtain new sample feature data; forming new training data from the new sample feature data and the sample result data; taking the union of the new training data and the original training data to obtain a first data set; processing the first data set to obtain a second data set; training on the first data set and the second data set to obtain a final model; and identifying the noise data in input intention data through the final model. The scheme applies special processing to the training data during the training stage and enhances the robustness of the model through adversarial training and sample fusion.

Description

Noise data identification method and equipment
Technical Field
The invention relates to the technical field of noise data identification, in particular to a noise data identification method and equipment.
Background
In the prior art, noise data generally receives no special processing in dialog systems customized for particular user customers. Instead, the noise data is treated as a noise intent and trained together with the user intention data in the normal way.
In such scenarios the amount of user intention data is relatively small, while the training data for an intent recognition task must maintain a certain ratio between positive intention data and negative noise data, e.g. 1:3 or 1:5. Consequently, when the training data is assembled, the noise data cannot be too plentiful. Yet the utterance space of noise data is comparatively large, so a small amount of training data cannot cover it. The prior art provides no additional special processing for negative noise data, and current intent recognition techniques therefore perform poorly on such unintelligible or noisy inputs; a large amount of noise data may be recognized as positive data.
Thus, there is a need for a solution to the problems of the prior art.
Disclosure of Invention
The invention provides a noise data identification method and noise data identification equipment, which solve the technical problem of poor identification performance in the prior art.
The technical scheme for solving the technical problems is as follows:
the embodiment of the invention provides a method for identifying noise data, which comprises the following steps:
acquiring original training data including intention data and noise data of a user;
carrying out forward reasoning on the original training data to obtain a prediction result;
calculating based on the original training data and the prediction result to obtain a loss result;
deriving the original training data based on the loss result to obtain gradient data;
converting the sample characteristic data based on the gradient data to obtain new sample characteristic data; the original training data consists of the sample characteristic data and sample result data corresponding to the sample characteristic data;
forming new training data based on the new sample feature data and the sample result data;
performing union processing on the new training data and the training data to obtain a first data set;
processing any two pieces of data in the first data set in a preset mode to obtain a second data set;
training a selected intention classification algorithm through the first data set and the second data set to obtain a final model;
and identifying the noise data in the input intention data through a final model.
In a specific embodiment, the forward processing is performed by the following formula:

$\hat{y}_i = f(\theta, x_i)$

where $(x_i, y_i)$ is the input original training data, $\theta$ is a model parameter, $f(\theta, x_i)$ denotes the model performing forward processing on the input original training data, and $\hat{y}_i$ is the prediction result.
In a specific embodiment, the loss result is obtained by the following formula:

$loss_i = L(\hat{y}_i, y_i)$

where $\hat{y}_i$ is the prediction result, $(x_i, y_i)$ is the input original training data, $L$ represents the loss function, and $loss_i$ is the loss result.
In a specific embodiment, the gradient data is obtained by the following formula:

$grad_i = \dfrac{\partial\, loss_i}{\partial x_i}$

where $grad_i$ is the gradient data, $loss_i$ is the loss result, and $\partial / \partial x_i$ denotes differentiation with respect to the input data.
In a specific embodiment, the new sample feature data is obtained by the following formula:

$\hat{x}_i = x_i + \epsilon \cdot \mathrm{sign}(grad_i)$

where $\epsilon$ is a parameter between 0 and 1 and $\mathrm{sign}(\cdot)$ is the sign function: $\mathrm{sign}(grad_i) = 1$ when $grad_i > 0$, and $\mathrm{sign}(grad_i) = -1$ when $grad_i < 0$. $\hat{x}_i$ is the new sample feature data, $x_i$ is the sample feature data, and $y_i$ is the sample result data.
In a specific embodiment, the preset-mode processing is performed by the following formulas:

$x_{mix} = \lambda x_i + (1 - \lambda) x_j$

$y_{mix} = \lambda y_i + (1 - \lambda) y_j$

$X_{MIX} = \{(x_{mix}, y_{mix})\}$

where $(x_i, y_i)$ and $(x_j, y_j)$ are any two pieces of data in the first data set, $\lambda$ is a weight parameter, and $X_{MIX}$ is the second data set.
In a specific embodiment, the selected intent classification algorithm comprises: a convolutional neural network or a recurrent neural network.
In a specific embodiment, the loss function of the final model includes:
a cross entropy loss function for the first data set and a KL divergence loss function for the second data set.
In a specific embodiment, the method further comprises the following steps:
and if the final model is tested, inputting the original training data to carry out forward reasoning to obtain a prediction result of the final model, and comparing the prediction result of the final model with sample result data to determine a test result.
The embodiment of the invention also provides a device for identifying noise data, which comprises:
an acquisition module for acquiring original training data including intention data and noise data of a user;
the forward reasoning module is used for carrying out forward reasoning on the original training data to obtain a prediction result;
the loss module is used for calculating based on the original training data and the prediction result to obtain a loss result;
a derivation module, configured to derive the original training data based on the loss result to obtain gradient data;
the conversion module is used for converting the sample characteristic data based on the gradient data to obtain new sample characteristic data; the original training data consists of the sample characteristic data and sample result data corresponding to the sample characteristic data;
a forming module for forming new training data based on the new sample feature data and the sample result data;
the union set module is used for carrying out union set processing on the new training data and the training data to obtain a first data set;
the processing module is used for processing any two data in the first data set in a preset mode to obtain a second data set;
the training module is used for training the selected intention classification algorithm through the first data set and the second data set to obtain a final model;
and the identification module is used for identifying the noise data in the input intention data through the final model.
The invention has the beneficial effects that:
the embodiment of the invention provides a method and equipment for identifying noise data, wherein the method comprises the following steps: acquiring original training data including intention data and noise data of a user; carrying out forward reasoning on the original training data to obtain a prediction result; calculating based on the original training data and the prediction result to obtain a loss result; deriving the original training data based on the loss result to obtain gradient data; converting the sample characteristic data based on the gradient data to obtain new sample characteristic data; the original training data consists of the sample characteristic data and sample result data corresponding to the sample characteristic data; forming new training data based on the new sample feature data and the sample result data; performing union processing on the new training data and the training data to obtain a first data set; processing any two pieces of data in the first data set in a preset mode to obtain a second data set; training a selected intention classification algorithm through the first data set and the second data set to obtain a final model; and identifying the noise data in the input intention data through a final model. The scheme performs special processing on training data in a training stage, enhances the robustness of the model by means of countertraining and sample fusion, avoids the defect that a large amount of noise data is recognized as positive data, and has no influence on the recognition capability of the intention of a user. The algorithm improves the intention recognition capability in a scene and improves the actual experience of a user.
Drawings
Fig. 1 is a schematic flow chart illustrating a method for identifying noise data according to an embodiment of the present invention;
fig. 2 is a schematic diagram of a frame structure of a noise data identification device according to an embodiment of the present invention;
fig. 3 is a schematic diagram of a frame structure of a noise data identification device according to an embodiment of the present invention;
fig. 4 is a flowchart of a framework structure of a terminal according to an embodiment of the present invention.
Detailed Description
The principles and features of this invention are described below in conjunction with the following drawings, which are set forth by way of illustration only and are not intended to limit the scope of the invention.
Example 1
The embodiment 1 of the invention discloses a method for identifying noise data, which comprises the following steps as shown in figure 1:
step 101, obtaining original training data including intention data and noise data of a user;
specifically, training data is prepared
$X = \{(x_1, y_1), (x_2, y_2), \ldots, (x_n, y_n)\}$
The training data includes the intention data and noise data of the user.
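As an illustration of this data layout, each training sample can be viewed as a pair of sample feature data and sample result data, with one label class reserved for noise. The sketch below is illustrative only: the intent names and toy feature vectors are invented stand-ins for real sentence embeddings.

```python
# Illustrative shape of the training set X = {(x_i, y_i)}: each sample pairs
# an utterance's feature vector x_i with a one-hot intent label y_i, and the
# "noise" class holds the negative noise data.
import numpy as np

INTENTS = ["check_balance", "transfer_money", "noise"]  # hypothetical classes

def one_hot(index, num_classes):
    """Return a one-hot label vector y_i for class `index`."""
    y = np.zeros(num_classes)
    y[index] = 1.0
    return y

# Toy feature vectors standing in for sentence embeddings.
X = [
    (np.array([0.9, 0.1, 0.0]), one_hot(0, 3)),  # intention sample
    (np.array([0.1, 0.8, 0.1]), one_hot(1, 3)),  # intention sample
    (np.array([0.3, 0.3, 0.4]), one_hot(2, 3)),  # noise sample
]
print(len(X), X[0][1])
```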
Step 102, carrying out forward reasoning on the original training data to obtain a prediction result;
the forward processing is performed by the following formula:
Figure BDA0002979385300000052
wherein (x)i,yi) Inputting the original training data; theta is a model parameter; f (theta, x)i,yi) Representing a function of a model for performing forward processing on the input original training data;
Figure BDA0002979385300000053
and the prediction result is obtained.
Step 103, calculating based on the original training data and the prediction result to obtain a loss result;
the loss results are obtained by the following formula:
Figure BDA0002979385300000054
wherein the content of the first and second substances,
Figure BDA0002979385300000055
is the prediction result; x is the number ofi,yiBoth are the input original training data;
Figure BDA0002979385300000056
representing a loss function; lossiAs a result of loss.
Step 104, deriving the original training data based on the loss result to obtain gradient data;
the gradient data is obtained by the following formula:
Figure BDA0002979385300000057
wherein, gradiIs gradient data; lossiAs a result of the loss;
Figure BDA0002979385300000058
is a derivative function.
Step 105, converting the sample characteristic data based on the gradient data to obtain new sample characteristic data; the original training data consists of the sample characteristic data and sample result data corresponding to the sample characteristic data;
the new sample feature data is obtained by the following formula:
Figure BDA0002979385300000059
wherein epsilon is a parameter between 0 and 1; sign (grad) is a sign-solving function; when grad is greater than 0, sign (grad)i) 1 is ═ 1; when grad is less than 0, sign (grad)i)=-1;
Figure BDA00029793853000000510
New sample characteristic data; x is the number ofiSample characteristic data is obtained; y isiIs sample result data.
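This perturbation step can be sketched for a model whose input gradient has a closed form. The sketch below assumes a linear softmax classifier with cross-entropy loss, for which the gradient with respect to the input is $W^{\top}(p - y)$; the weights, input, and $\epsilon$ values are illustrative stand-ins, not taken from the patent.

```python
# Sketch of building an adversarial sample x_hat = x + eps * sign(grad_x loss),
# assuming a softmax linear classifier so grad_x is available analytically.
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def adversarial_sample(W, x, y_onehot, epsilon=0.1):
    """Return x + epsilon * sign(d loss / d x) for cross-entropy loss."""
    p = softmax(W @ x)                    # forward inference: y_hat = f(theta, x)
    grad_x = W.T @ (p - y_onehot)         # d(loss)/d(x) for softmax + CE
    return x + epsilon * np.sign(grad_x)  # sign-based perturbation

W = np.array([[1.0, -0.5], [-0.3, 0.8]])  # illustrative model parameters
x = np.array([0.4, 0.6])                  # sample feature data x_i
y = np.array([1.0, 0.0])                  # one-hot sample result data y_i
x_adv = adversarial_sample(W, x, y, epsilon=0.1)
print(x_adv)
```

The label $y_i$ is kept unchanged, so the new training pair is `(x_adv, y)`.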
Step 106, forming new training data based on the new sample characteristic data and the sample result data;
step 107, performing union processing on the new training data and the training data to obtain a first data set;
Step 108, processing any two pieces of data in the first data set in a preset mode to obtain a second data set;
the preset mode processing is performed by the following formula:
Figure BDA0002979385300000061
Figure BDA0002979385300000062
Figure BDA0002979385300000063
wherein the content of the first and second substances,
Figure BDA0002979385300000064
any two pieces of data in the first data set are obtained; λ is a weight parameter; xMIXIs the second data set. Specifically, the value range of λ is0-1 for adjusting xiAnd xjThe weight of (c) is generally chosen based on experience and the final effect, for example 0.8 may be chosen.
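A small sketch of this mixing operation, assuming mixup-style linear interpolation of both feature vectors and one-hot labels. The value $\lambda = 0.8$ follows the example given in the text; the two samples are invented for illustration.

```python
# Mix two samples: x_mix = lam*x_i + (1-lam)*x_j, and likewise for the labels.
import numpy as np

def mix_pair(xi, yi, xj, yj, lam=0.8):
    """Return the mixed feature vector and mixed soft label for one pair."""
    x_mix = lam * xi + (1 - lam) * xj
    y_mix = lam * yi + (1 - lam) * yj
    return x_mix, y_mix

xi, yi = np.array([1.0, 0.0]), np.array([1.0, 0.0])  # sample i (class 0)
xj, yj = np.array([0.0, 1.0]), np.array([0.0, 1.0])  # sample j (class 1)
x_mix, y_mix = mix_pair(xi, yi, xj, yj, lam=0.8)
print(x_mix, y_mix)  # -> [0.8 0.2] [0.8 0.2]
```

The mixed label is no longer one-hot, which is why a divergence-style loss is used for this data set rather than hard-label cross entropy.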
Step 109, training the selected intention classification algorithm through the first data set and the second data set to obtain a final model;
specifically, the selected intent classification algorithm includes: convolutional Neural Networks (CNN) or Recurrent Neural Networks (RNN).
Step 110, identifying noise data in the input intention data through a final model.
In a specific embodiment, the loss function of the final model includes:
a cross entropy loss function for the first data set and a KL divergence loss function for the second data set.
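A minimal numeric sketch of this combined objective. The patent does not spell out the direction of the divergence, so the sketch assumes the KL term compares the mixed soft label against the model's predicted distribution; the probability vectors are illustrative values.

```python
# Combined loss: cross entropy on a hard-labeled sample from the first data
# set plus KL divergence on a soft-labeled mixed sample from the second set.
import numpy as np

def cross_entropy(p_pred, y_onehot, eps=1e-12):
    """Hard-label cross entropy: -sum(y * log p)."""
    return -np.sum(y_onehot * np.log(p_pred + eps))

def kl_divergence(y_soft, p_pred, eps=1e-12):
    """KL(y_soft || p_pred), an assumed direction for the mixed samples."""
    return np.sum(y_soft * np.log((y_soft + eps) / (p_pred + eps)))

p_hard = np.array([0.7, 0.3]); y_hard = np.array([1.0, 0.0])  # first data set
p_mix  = np.array([0.6, 0.4]); y_mix  = np.array([0.8, 0.2])  # second data set

total_loss = cross_entropy(p_hard, y_hard) + kl_divergence(y_mix, p_mix)
print(total_loss)
```

Both terms are non-negative and vanish only when the predictions match the (hard or soft) targets, so minimizing their sum fits both data sets at once.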
In a specific embodiment, the method further comprises the following steps:
and if the final model is tested, inputting the original training data to carry out forward reasoning to obtain a prediction result of the final model, and comparing the prediction result of the final model with sample result data to determine a test result.
Specifically, when performing subsequent model tests or running forward inference with the model online, data X is input, and the prediction result of the model is then obtained through the model's forward inference.
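Such a test can be sketched as running forward inference on held-out samples and comparing the argmax prediction against the sample result data. The tiny identity "model" and two-sample test set below are illustrative stand-ins only.

```python
# Score a model by comparing argmax predictions with one-hot labels.
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def evaluate(W, samples):
    """Return accuracy of argmax predictions vs. one-hot sample result data."""
    correct = 0
    for x, y in samples:
        pred = softmax(W @ x)  # forward inference
        correct += int(np.argmax(pred) == np.argmax(y))
    return correct / len(samples)

W = np.eye(2)  # placeholder "trained" parameters for illustration
test_set = [
    (np.array([2.0, 0.0]), np.array([1.0, 0.0])),
    (np.array([0.0, 2.0]), np.array([0.0, 1.0])),
]
print(evaluate(W, test_set))  # -> 1.0
```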
Here, a specific application scenario is described, which specifically includes the following steps:
step 1: preparing training data
$X = \{(x_1, y_1), (x_2, y_2), \ldots, (x_n, y_n)\}$
The training data includes the intention data and noise data of the user.
Step 2: an intent classification algorithm is selected, such as a convolutional neural network (CNN) or a recurrent neural network (RNN).
Step 3: input $(x_i, y_i)$ to the model and perform the forward computation:

$\hat{y}_i = f(\theta, x_i)$

where $\theta$ represents the parameters of the model and $f(\theta, x_i)$ denotes the model's forward processing of the input $x_i$, which yields the result $\hat{y}_i$.

$loss_i = L(\hat{y}_i, y_i)$

This formula represents the loss obtained for the input data $(x_i, y_i)$ and the corresponding prediction result $\hat{y}_i$.

$grad_i = \dfrac{\partial\, loss_i}{\partial x_i}$

This formula represents the gradient obtained by differentiating the loss with respect to the input data $(x_i, y_i)$.

$\hat{x}_i = x_i + \epsilon \cdot \mathrm{sign}(grad_i)$

where $\epsilon$ is a parameter between 0 and 1 and $\mathrm{sign}(\cdot)$ is the sign function: $\mathrm{sign}(grad_i) = 1$ when $grad_i > 0$, and $\mathrm{sign}(grad_i) = -1$ when $grad_i < 0$. Each $(x_i, y_i)$ is transformed into $(\hat{x}_i, y_i)$; after these operations the data set $X_{adv} = \{(\hat{x}_i, y_i)\}$ is obtained.
Step 4: obtain a new data set $X_{ADA} = X \cup X_{adv}$.
Step 5: for any two pieces of data $(x_i, y_i)$ and $(x_j, y_j)$ in the data set $X_{ADA}$, the following operations are carried out:

$x_{mix} = \lambda x_i + (1 - \lambda) x_j$

$y_{mix} = \lambda y_i + (1 - \lambda) y_j$

thereby obtaining a new data set $X_{MIX} = \{(x_{mix}, y_{mix})\}$.
Step 6: using the data sets $X_{ADA}$ and $X_{MIX}$ as training data, the model is trained. The loss of the model is:

$loss = loss_{CE}(X_{ADA}) + loss_{KL}(X_{MIX})$

where a cross-entropy loss function is used for the samples of $X_{ADA}$ and a KL divergence loss function is used for the data in $X_{MIX}$, obtaining the final model.
Step 7: when performing subsequent model tests or online forward inference with the model, input data X; the prediction result of the model is then obtained through the model's forward inference.
The method and the device apply special processing to the training data during the training stage and enhance the robustness of the model through adversarial training and sample fusion, without impairing the recognition of user intention. The algorithm improves intention recognition in the scenario and improves the actual user experience. It also improves the ability of a small-data-volume dialogue system to recognize noise data, avoids the defect of large amounts of noise data being recognized as positive data, and can be embedded in deep learning classification algorithms of any type, so its application range is wide.
Example 2
Embodiment 2 of the present invention also discloses a device for identifying noise data, as shown in fig. 2, including:
an obtaining module 201, configured to obtain original training data including intention data and noise data of a user;
a forward reasoning module 202, configured to perform forward reasoning on the original training data to obtain a prediction result;
a loss module 203, configured to perform calculation based on the original training data and the prediction result to obtain a loss result;
a derivation module 204, configured to derive the original training data based on the loss result to obtain gradient data;
a conversion module 205, configured to convert the sample feature data based on the gradient data to obtain new sample feature data; the original training data consists of the sample characteristic data and sample result data corresponding to the sample characteristic data;
a forming module 206 for forming new training data based on the new sample feature data and the sample result data;
a union module 207, configured to perform union processing on the new training data and the training data to obtain a first data set;
the processing module 208 is configured to perform processing in a preset manner on any two pieces of data in the first data set to obtain a second data set;
a training module 209, configured to train the selected intention classification algorithm through the first data set and the second data set to obtain a final model;
and the identification module 210 is used for identifying the noise data in the input intention data through the final model.
In a specific embodiment, the forward processing is performed by the following formula:

$\hat{y}_i = f(\theta, x_i)$

where $(x_i, y_i)$ is the input original training data, $\theta$ is a model parameter, $f(\theta, x_i)$ denotes the model performing forward processing on the input original training data, and $\hat{y}_i$ is the prediction result.
In a specific embodiment, the loss result is obtained by the following formula:

$loss_i = L(\hat{y}_i, y_i)$

where $\hat{y}_i$ is the prediction result, $(x_i, y_i)$ is the input original training data, $L$ represents the loss function, and $loss_i$ is the loss result.
In a specific embodiment, the gradient data is obtained by the following formula:

$grad_i = \dfrac{\partial\, loss_i}{\partial x_i}$

where $grad_i$ is the gradient data, $loss_i$ is the loss result, and $\partial / \partial x_i$ denotes differentiation with respect to the input data.
In a specific embodiment, the new sample feature data is obtained by the following formula:

$\hat{x}_i = x_i + \epsilon \cdot \mathrm{sign}(grad_i)$

where $\epsilon$ is a parameter between 0 and 1 and $\mathrm{sign}(\cdot)$ is the sign function: $\mathrm{sign}(grad_i) = 1$ when $grad_i > 0$, and $\mathrm{sign}(grad_i) = -1$ when $grad_i < 0$. $\hat{x}_i$ is the new sample feature data, $x_i$ is the sample feature data, and $y_i$ is the sample result data.
In a specific embodiment, the preset-mode processing is performed by the following formulas:

$x_{mix} = \lambda x_i + (1 - \lambda) x_j$

$y_{mix} = \lambda y_i + (1 - \lambda) y_j$

$X_{MIX} = \{(x_{mix}, y_{mix})\}$

where $(x_i, y_i)$ and $(x_j, y_j)$ are any two pieces of data in the first data set, $\lambda$ is a weight parameter, and $X_{MIX}$ is the second data set.
In a specific embodiment, the selected intent classification algorithm comprises: a convolutional neural network or a recurrent neural network.
In a specific embodiment, the loss function of the final model includes:
a cross entropy loss function for the first data set and a KL divergence loss function for the second data set.
In a specific embodiment, as shown in fig. 3, the method further includes:
the testing module 211 is configured to input the original training data to perform forward reasoning to obtain a prediction result of the final model if the final model is tested, and compare the prediction result of the final model with sample result data to determine a testing result.
Example 3
Embodiment 3 of the present invention further discloses a terminal, as shown in fig. 4, the terminal includes a memory and a processor, and the processor executes the method in embodiment 1 when running an application program in the memory.
The embodiment of the invention provides a method and equipment for identifying noise data, the method comprising: acquiring original training data including intention data and noise data of a user; performing forward inference on the original training data to obtain a prediction result; computing a loss result from the original training data and the prediction result; differentiating with respect to the original training data based on the loss result to obtain gradient data; transforming the sample feature data based on the gradient data to obtain new sample feature data, where the original training data consists of the sample feature data and the sample result data corresponding to the sample feature data; forming new training data from the new sample feature data and the sample result data; taking the union of the new training data and the original training data to obtain a first data set; processing any two pieces of data in the first data set in a preset manner to obtain a second data set; training a selected intention classification algorithm on the first data set and the second data set to obtain a final model; and identifying the noise data in input intention data through the final model. According to the scheme, the training data receives special processing during the training stage, the robustness of the model is enhanced through adversarial training and sample fusion, the defect of large amounts of noise data being recognized as positive data is avoided, and the recognition of user intention is unaffected. The algorithm improves intention recognition in the scenario and improves the actual user experience.
While the invention has been described with reference to specific embodiments, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (10)

1. A method for identifying noisy data, comprising:
acquiring original training data including intention data and noise data of a user;
carrying out forward reasoning on the original training data to obtain a prediction result;
calculating based on the original training data and the prediction result to obtain a loss result;
deriving the original training data based on the loss result to obtain gradient data;
converting the sample characteristic data based on the gradient data to obtain new sample characteristic data; the original training data consists of the sample characteristic data and sample result data corresponding to the sample characteristic data;
forming new training data based on the new sample feature data and the sample result data;
performing union processing on the new training data and the training data to obtain a first data set;
processing any two pieces of data in the first data set in a preset mode to obtain a second data set;
training a selected intention classification algorithm through the first data set and the second data set to obtain a final model;
and identifying the noise data in the input intention data through a final model.
2. The method of claim 1, wherein the forward processing is performed by the following formula:

$\hat{y}_i = f(\theta, x_i)$

where $(x_i, y_i)$ is the input original training data, $\theta$ is a model parameter, $f(\theta, x_i)$ denotes the model performing forward processing on the input original training data, and $\hat{y}_i$ is the prediction result.
3. The method of claim 1 or 2, wherein the loss result is obtained by the following formula:

$loss_i = L(\hat{y}_i, y_i)$

where $\hat{y}_i$ is the prediction result, $(x_i, y_i)$ is the input original training data, $L$ represents the loss function, and $loss_i$ is the loss result.
4. A method according to claim 1 or 3, wherein the gradient data is obtained by the following formula:

$grad_i = \dfrac{\partial\, loss_i}{\partial x_i}$

where $grad_i$ is the gradient data, $loss_i$ is the loss result, and $\partial / \partial x_i$ denotes differentiation with respect to the input data.
5. The method of claim 1 or 4, wherein the new sample characteristic data is obtained by the following formula:

$\hat{x}_i = x_i + \epsilon \cdot \mathrm{sign}(grad_i)$

where $\epsilon$ is a parameter between 0 and 1 and $\mathrm{sign}(\cdot)$ is the sign function: $\mathrm{sign}(grad_i) = 1$ when $grad_i > 0$, and $\mathrm{sign}(grad_i) = -1$ when $grad_i < 0$. $\hat{x}_i$ is the new sample characteristic data, $x_i$ is the sample characteristic data, and $y_i$ is the sample result data.
6. The method of claim 1 or 5, wherein the predetermined manner of processing is performed by the following formulas:

$x_{mix} = \lambda x_i + (1 - \lambda) x_j$

$y_{mix} = \lambda y_i + (1 - \lambda) y_j$

$X_{MIX} = \{(x_{mix}, y_{mix})\}$

where $(x_i, y_i)$ and $(x_j, y_j)$ are any two pieces of data in the first data set, $\lambda$ is a weight parameter, and $X_{MIX}$ is the second data set.
7. The method of claim 1, wherein the selected intent classification algorithm comprises: a convolutional neural network or a recurrent neural network.
8. The method of claim 1, wherein the loss function of the final model comprises:
a cross entropy loss function for the first data set and a KL divergence loss function for the second data set.
9. The method of claim 1, further comprising:
and if the final model is tested, inputting the original training data to carry out forward reasoning to obtain a prediction result of the final model, and comparing the prediction result of the final model with sample result data to determine a test result.
10. An apparatus for identifying noise data, comprising:
an acquisition module, configured to acquire original training data including intention data and noise data of a user;
a forward inference module, configured to perform forward inference on the original training data to obtain a prediction result;
a loss module, configured to perform a calculation based on the original training data and the prediction result to obtain a loss result;
a derivation module, configured to take a derivative with respect to the original training data based on the loss result to obtain gradient data;
a conversion module, configured to transform the sample feature data based on the gradient data to obtain new sample feature data, wherein the original training data consists of the sample feature data and sample result data corresponding to the sample feature data;
a forming module, configured to form new training data based on the new sample feature data and the sample result data;
a union module, configured to take the union of the new training data and the original training data to obtain a first data set;
a processing module, configured to process any two pieces of data in the first data set in a preset manner to obtain a second data set;
a training module, configured to train a selected intention classification algorithm with the first data set and the second data set to obtain a final model; and
an identification module, configured to identify noise data in input intention data through the final model.
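Hypothetical glue code showing the order in which the modules above cooperate; the perturbation step, the mixing weight, and the data layout are illustrative assumptions, not the patented implementation:

```python
import numpy as np

def build_datasets(x, y, grad, epsilon=0.5, lam=0.7):
    # conversion module: gradient-based transform of the sample features
    x_adv = x + epsilon * grad / (np.linalg.norm(grad) + 1e-12)
    # forming + union modules: the first data set pairs the original and
    # new features with the original sample result data
    first = [(x, y), (x_adv, y)]
    # processing module: mix two entries of the first data set to build
    # the second data set
    (xa, ya), (xb, yb) = first
    second = [(lam * xa + (1.0 - lam) * xb, lam * ya + (1.0 - lam) * yb)]
    return first, second

first, second = build_datasets(np.array([1.0, 0.0]), 1.0,
                               np.array([0.0, 1.0]))
```

The training module would then minimize a cross-entropy term over `first` and a divergence term over `second`, as in claim 8.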
CN202110283194.0A 2021-03-16 2021-03-16 Noise data identification method and equipment Active CN112860870B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110283194.0A CN112860870B (en) 2021-03-16 2021-03-16 Noise data identification method and equipment

Publications (2)

Publication Number Publication Date
CN112860870A true CN112860870A (en) 2021-05-28
CN112860870B CN112860870B (en) 2024-03-12

Family

ID=75994903

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110283194.0A Active CN112860870B (en) 2021-03-16 2021-03-16 Noise data identification method and equipment

Country Status (1)

Country Link
CN (1) CN112860870B (en)


Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106548210A (en) * 2016-10-31 2017-03-29 腾讯科技(深圳)有限公司 Machine learning model training method and device
CN111931637A (en) * 2020-08-07 2020-11-13 华南理工大学 Cross-modal pedestrian re-identification method and system based on double-current convolutional neural network
US20200364616A1 (en) * 2019-05-17 2020-11-19 Robert Bosch Gmbh Classification robust against multiple perturbation types
CN112183631A (en) * 2020-09-28 2021-01-05 云知声智能科技股份有限公司 Method and terminal for establishing intention classification model


Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ZHAO Pengfei; LI Yanling; LIN Min: "Research Progress of Intent Recognition Oriented to Transfer Learning", Journal of Frontiers of Computer Science and Technology, no. 08 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113345426A (en) * 2021-06-02 2021-09-03 云知声智能科技股份有限公司 Voice intention recognition method and device and readable storage medium
CN113345426B (en) * 2021-06-02 2023-02-28 云知声智能科技股份有限公司 Voice intention recognition method and device and readable storage medium

Also Published As

Publication number Publication date
CN112860870B (en) 2024-03-12

Similar Documents

Publication Publication Date Title
US11138967B2 (en) Voice recognition processing method, device and computer storage medium
EP3848887A1 (en) Gan network-based vehicle damage image enhancement method and apparatus
WO2021128741A1 (en) Voice emotion fluctuation analysis method and apparatus, and computer device and storage medium
CN109065027B (en) Voice distinguishing model training method and device, computer equipment and storage medium
CN109298993B (en) Method and device for detecting fault and computer readable storage medium
CN107818797B (en) Voice quality evaluation method, device and system
CN109378002B (en) Voiceprint verification method, voiceprint verification device, computer equipment and storage medium
CN111563422B (en) Service evaluation acquisition method and device based on bimodal emotion recognition network
CN110726898A (en) Power distribution network fault type identification method
Yousefi et al. Assessing speaker engagement in 2-person debates: Overlap detection in United States Presidential debates.
Yang et al. Quality classified image analysis with application to face detection and recognition
CN112860870A (en) Noise data identification method and equipment
CN117197057A (en) Automatic detection method for corrosion degree of steel material based on deep learning
CN112966429A (en) Non-linear industrial process modeling method based on WGANs data enhancement
CN111833842A (en) Synthetic sound template discovery method, device and equipment
CN114821174B (en) Content perception-based transmission line aerial image data cleaning method
CN116884427A (en) Embedded vector processing method based on end-to-end deep learning voice re-etching model
CN116257816A (en) Accompanying robot emotion recognition method, device, storage medium and equipment
CN113327617B (en) Voiceprint discrimination method, voiceprint discrimination device, computer device and storage medium
CN106709598B (en) Voltage stability prediction and judgment method based on single-class samples
Jayanth et al. Speaker Identification based on GFCC using GMM-UBM
Kang et al. SVLDL: Improved speaker age estimation using selective variance label distribution learning
Faridee et al. Predicting score distribution to improve non-intrusive speech quality estimation
US20240013369A1 (en) Image defect detecting system, generation method of image defect detecting system and non-transitory computer readable medium
CN117152746B (en) Method for acquiring cervical cell classification parameters based on YOLOV5 network

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant