CN112328750A - Method and system for training text discrimination model - Google Patents

Method and system for training text discrimination model

Info

Publication number
CN112328750A
Authority
CN
China
Prior art keywords
model
sample
discrimination
modifiers
language sample
Prior art date
Legal status
Pending
Application number
CN202011347328.2A
Other languages
Chinese (zh)
Inventor
蔡晓华
Current Assignee
Shanghai Netis Technologies Co ltd
Original Assignee
Shanghai Netis Technologies Co ltd
Priority date
Filing date
Publication date
Application filed by Shanghai Netis Technologies Co ltd filed Critical Shanghai Netis Technologies Co ltd
Priority to CN202011347328.2A
Publication of CN112328750A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/3331 Query processing
    • G06F16/334 Query execution
    • G06F16/3344 Query execution using natural language analysis
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35 Clustering; Classification
    • G06F16/355 Class or cluster creation or modification
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/084 Backpropagation, e.g. using gradient descent

Abstract

The invention provides a method and a system for training a text discrimination model. The method comprises the following steps: extracting a real language sample from a real language library and inputting it into a generation model; the generation model inserts, deletes or replaces modifiers in the extracted real language sample to obtain a first new language sample, and introduces confusing words into the stem words of the extracted real language sample to obtain a second new language sample; the first or second new language sample is input into a discrimination model, which compares the input sample with the real language sample and judges whether it is a positive sample or a negative sample; the judgment result of the discrimination model is compared with the expectation of the generation model, and the model parameters of the discrimination model are updated according to the comparison result; the generation model then updates its own model parameters according to the updated model parameters of the discrimination model. Compared with conventional adversarial learning, the quality of the positive samples generated by editing modifiers and of the negative samples generated by introducing confusing words is controllable.

Description

Method and system for training text discrimination model
Technical Field
The invention relates to the field of data processing, in particular to a method and a system for training a text discrimination model.
Background
Adversarial learning is currently applied mainly in the field of image recognition: a discrimination model judges the images produced by a generation model, so that the image generation capability of the generation model is continuously improved.
For example, patent document CN109949317A discloses a semi-supervised image instance segmentation method based on progressive adversarial learning, which retrains an instance segmentation model to obtain a segmentation model with higher accuracy. Existing adversarial learning work focuses mainly on improving the performance of the generation model after training and neglects improving the discrimination model during adversarial learning; few published documents describe ways of improving the robustness of a discrimination model through adversarial learning.
Disclosure of Invention
In view of the defects in the prior art, the invention aims to provide a method and a system for training a text discrimination model.
The method for training the text discrimination model provided by the invention comprises the following steps:
a sample extraction step: extracting a real language sample from a real language library and inputting the real language sample into a generation model;
a sample generation step: the generation model inserts, deletes or replaces modifiers in the extracted real language sample to obtain a first new language sample, and introduces confusing words into the stem words of the extracted real language sample to obtain a second new language sample;
a discrimination step: the first new language sample or the second new language sample is input into a discrimination model, which compares the input sample with the real language sample and judges whether it is a positive sample or a negative sample;
a discrimination model updating step: the judgment result of the discrimination model is compared with the expectation of the generation model, and the model parameters of the discrimination model are updated according to the comparison result;
a generation model updating step: the generation model updates its own model parameters according to the updated model parameters of the discrimination model.
Preferably, the sample generation step comprises:
for insertion of modifiers, the generation model determines insertable positions in the extracted real language sample and inserts modifiers at those positions;
for deletion of modifiers, the generation model determines the positions of existing modifiers in the extracted real language sample and deletes the modifiers at those positions;
for replacement of modifiers, the generation model determines the positions of existing modifiers in the extracted real language sample and replaces the modifiers at those positions with new modifiers;
wherein the insertion, deletion or replacement of modifiers does not change whether the extracted real language sample is classified as a positive sample or a negative sample.
Preferably, the sample generation step comprises:
for introduction of confusing words, the generation model determines the positions and categories of the stem words in the extracted real language sample, and inserts confusing words or replaces the stem words with confusing words, thereby changing whether the extracted real language sample is classified as a positive sample or a negative sample.
Preferably, the comparison in the discrimination step comprises: vectorizing the first new language sample or the second new language sample and calculating the KL divergence against the real language sample; if the resulting distribution difference is smaller than a preset value, the discrimination model judges the sample to be a positive sample, otherwise it judges the sample to be a negative sample.
Preferably, the discrimination model updating step comprises:
when the judgment result of the discrimination model is consistent with the expectation of the generation model, the discrimination model updates its model parameters through back-propagation so that the probability of the discrimination model giving a correct judgment becomes higher;
when the judgment result of the discrimination model is inconsistent with the expectation of the generation model, the discrimination model updates its model parameters through back-propagation so that the probability of the discrimination model giving a wrong judgment becomes lower;
and the generation model updating step comprises: based on the updated model parameters of the discrimination model, the generation model calculates gradients in the directions in which the previous discrimination model was most prone to error and in which the future discrimination model is most likely to err, and updates the model parameters of the generation model for positive-sample generation and for negative-sample generation respectively.
The system for training the text discrimination model provided by the invention comprises the following components:
a sample extraction module: extracting a real language sample from a real language library and inputting the real language sample into a generation model;
a sample generation module: the generation model inserts, deletes or replaces modifiers in the extracted real language sample to obtain a first new language sample, and introduces confusing words into the stem words of the extracted real language sample to obtain a second new language sample;
a discrimination module: the first new language sample or the second new language sample is input into a discrimination model, which compares the input sample with the real language sample and judges whether it is a positive sample or a negative sample;
a discrimination model updating module: the judgment result of the discrimination model is compared with the expectation of the generation model, and the model parameters of the discrimination model are updated according to the comparison result;
a generation model updating module: the generation model updates its own model parameters according to the updated model parameters of the discrimination model.
Preferably, the sample generation module comprises:
for insertion of modifiers, the generation model determines insertable positions in the extracted real language sample and inserts modifiers at those positions;
for deletion of modifiers, the generation model determines the positions of existing modifiers in the extracted real language sample and deletes the modifiers at those positions;
for replacement of modifiers, the generation model determines the positions of existing modifiers in the extracted real language sample and replaces the modifiers at those positions with new modifiers;
wherein the insertion, deletion or replacement of modifiers does not change whether the extracted real language sample is classified as a positive sample or a negative sample.
Preferably, the sample generation module comprises:
for introduction of confusing words, the generation model determines the positions and categories of the stem words in the extracted real language sample, and inserts confusing words or replaces the stem words with confusing words, thereby changing whether the extracted real language sample is classified as a positive sample or a negative sample.
Preferably, the comparison in the discrimination module comprises: vectorizing the first new language sample or the second new language sample and calculating the KL divergence against the real language sample; if the resulting distribution difference is smaller than a preset value, the discrimination model judges the sample to be a positive sample, otherwise it judges the sample to be a negative sample.
Preferably, the discrimination model updating module comprises:
when the judgment result of the discrimination model is consistent with the expectation of the generation model, the discrimination model updates its model parameters through back-propagation so that the probability of the discrimination model giving a correct judgment becomes higher;
when the judgment result of the discrimination model is inconsistent with the expectation of the generation model, the discrimination model updates its model parameters through back-propagation so that the probability of the discrimination model giving a wrong judgment becomes lower;
and the generation model updating module comprises: based on the updated model parameters of the discrimination model, the generation model calculates gradients in the directions in which the previous discrimination model was most prone to error and in which the future discrimination model is most likely to err, and updates the model parameters of the generation model for positive-sample generation and for negative-sample generation respectively.
Compared with the prior art, the invention has the following beneficial effects:
1) According to the characteristics of text, the invention provides a new generative learning scheme that uses generative adversarial learning.
2) The generation model in this scheme randomly extracts real language samples from the real data distribution and generates new samples on that basis, so the initial quality of the samples is higher than in previous adversarial learning approaches.
3) Compared with conventional adversarial learning, the quality of the positive samples generated by inserting, deleting and replacing modifiers and of the negative samples generated by introducing confusing words is controllable.
Drawings
Other features, objects and advantages of the invention will become more apparent upon reading of the detailed description of non-limiting embodiments with reference to the following drawings:
FIG. 1 is a flow chart of the operation of the present invention.
Detailed Description
The present invention will be described in detail with reference to specific examples. The following examples will assist those skilled in the art in further understanding the invention, but are not intended to limit the invention in any way. It should be noted that various changes and modifications can be made by those skilled in the art without departing from the spirit of the invention, all of which fall within the scope of the present invention.
As shown in FIG. 1, the method for training a text discrimination model provided by the present invention trains two models:
Model 1: a generation model, denoted G, for generating text that is close to real language.
Model 2: a discrimination model, denoted D, for judging whether the text generated by G is real.
On a server, the generation model G and the discrimination model D are trained so that D cannot distinguish whether the new language data generated by G is real language data or generated language data.
Referring to the flow chart of FIG. 1, the method comprises the following steps:
Step 1: randomly extract samples. G randomly extracts real samples from a real language library;
Step 2: generate a positive sample. A positive sample is text that is expected to be real language. Specifically, new language data is generated as a positive sample by inserting, deleting or replacing some modifiers.
Step 2 specifically comprises: for insertion of a modifier, G determines a position where a modifier can be inserted and then inserts one; for deletion of a modifier, G determines the position of an existing modifier and then deletes it; for replacement of a modifier, G determines the position of an existing modifier and then replaces it with a new modifier. These operations do not change the classification of the original text;
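To make the three modifier operations concrete, the following sketch perturbs a tokenized sentence using a small hand-written modifier lexicon. The lexicon, the random position choice and the function names are illustrative assumptions for this example only; the patent leaves the choice of modifiers and of insertable positions to the generation model G.

```python
import random

# Hypothetical modifier lexicon; the patent does not prescribe a specific word list.
MODIFIERS = {"quickly", "carefully", "very", "slowly", "silently"}

def insert_modifier(tokens):
    """Insert a modifier at a randomly chosen position (step 2, insertion)."""
    pos = random.randint(0, len(tokens))
    return tokens[:pos] + [random.choice(sorted(MODIFIERS))] + tokens[pos:]

def delete_modifier(tokens):
    """Delete one existing modifier, if any (step 2, deletion)."""
    positions = [i for i, t in enumerate(tokens) if t in MODIFIERS]
    if not positions:
        return list(tokens)                       # nothing to delete
    pos = random.choice(positions)
    return tokens[:pos] + tokens[pos + 1:]

def replace_modifier(tokens):
    """Replace one existing modifier with a different one (step 2, replacement)."""
    positions = [i for i, t in enumerate(tokens) if t in MODIFIERS]
    if not positions:
        return list(tokens)
    pos = random.choice(positions)
    new_tokens = list(tokens)
    new_tokens[pos] = random.choice(sorted(MODIFIERS - {tokens[pos]}))
    return new_tokens

def generate_positive_sample(tokens):
    """Apply one random modifier edit; the class label of the text is unchanged."""
    op = random.choice([insert_modifier, delete_modifier, replace_modifier])
    return op(tokens)

if __name__ == "__main__":
    sample = "the transaction completed quickly on the server".split()
    print(generate_positive_sample(sample))
```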
Step 3: generate a negative sample. A negative sample is text that is expected not to be real language. Specifically, some confusing words are introduced into the stem words to change the classification of the original text;
Step 3 specifically comprises: G determines the position and the category of a stem word, and then inserts a confusing word or replaces the original stem word with one. The confusing word changes the category of the original stem word.
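The stem-word confusion operation can be illustrated in the same style. The confusion dictionary below, which maps each stem word to words of a different category, is a hypothetical example; the patent does not specify how confusing words are selected.

```python
import random

# Hypothetical confusion dictionary: each stem word maps to words of a different
# category, so substituting one changes the classification of the sentence.
CONFUSION_WORDS = {
    "payment":   ["weather", "holiday"],
    "timeout":   ["success", "banquet"],
    "completed": ["evaporated", "blossomed"],
}

def generate_negative_sample(tokens):
    """Replace one stem word with a confusing word (step 3); the label flips."""
    positions = [i for i, t in enumerate(tokens) if t in CONFUSION_WORDS]
    if not positions:
        return list(tokens)                        # no known stem word to confuse
    pos = random.choice(positions)
    new_tokens = list(tokens)
    new_tokens[pos] = random.choice(CONFUSION_WORDS[tokens[pos]])
    return new_tokens

if __name__ == "__main__":
    sample = "the payment request completed before the timeout".split()
    print(generate_negative_sample(sample))
```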
Step 4: the positive and negative samples are input into D, which compares them with the real sample distribution. The goal of steps 2 and 3 is to generate data that makes D judge wrongly as often as possible, i.e. to make D mistake a generated positive sample for a negative sample, or mistake a generated negative sample for a positive sample;
The comparison in step 4 specifically comprises vectorization and KL divergence calculation: if the calculated distribution difference is small, D judges the sample to be a positive sample, otherwise D judges it to be a negative sample.
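As an illustration of this comparison, the sketch below vectorizes both samples as smoothed bag-of-words distributions and thresholds the KL divergence. The vectorization scheme, the KL direction, the smoothing constant and the threshold are assumed values chosen for the example; the patent only requires that a distribution difference below a preset value be judged positive.

```python
import math
from collections import Counter

def word_distribution(tokens, vocab, eps=0.01):
    """Vectorize a token list as a smoothed word-probability distribution over vocab."""
    counts = Counter(tokens)
    total = sum(counts.values()) + eps * len(vocab)
    return [(counts[w] + eps) / total for w in vocab]

def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions over the same vocabulary."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))

def judge(sample_tokens, real_tokens, threshold=0.5):
    """D's comparison in step 4: positive when the distribution difference is small."""
    vocab = sorted(set(sample_tokens) | set(real_tokens))
    p_sample = word_distribution(sample_tokens, vocab)
    p_real = word_distribution(real_tokens, vocab)
    return "positive" if kl_divergence(p_real, p_sample) < threshold else "negative"

if __name__ == "__main__":
    real = "the payment request completed before the timeout".split()
    modifier_edit = "the payment request completed quickly before the timeout".split()
    stem_swap = "the weather request completed before the timeout".split()
    print(judge(modifier_edit, real))   # small difference -> positive
    print(judge(stem_swap, real))       # stem word changed -> negative
```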
Step 5: check whether the judgment is correct. Specifically, the judgment of D is compared with the expectation of G; D calculates its gradient according to the result and updates its model parameters, the aim being to minimize the judgment error;
The comparison in step 5 specifically comprises: if the judgment of D is consistent with the expectation of G, D updates its parameters by back-propagating the gradient so that the probability of a correct judgment becomes higher; if the judgment of D is inconsistent with the expectation of G, D updates its parameters by back-propagating the gradient so that the probability of a wrong judgment becomes lower.
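The update of D can be illustrated with a deliberately simple discriminator: a logistic layer over bag-of-words features trained with a binary cross-entropy loss. This stand-in, its features and its learning rate are assumptions made for the example; the patent does not prescribe D's architecture.

```python
import numpy as np

def bow_features(tokens, vocab):
    """Bag-of-words feature vector for a tokenized sample."""
    vec = np.zeros(len(vocab))
    for t in tokens:
        if t in vocab:
            vec[vocab[t]] += 1.0
    return vec

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def update_discriminator(w, b, x, label, lr=0.1):
    """One back-propagation step on the binary cross-entropy loss.

    label = 1 for a sample that should be judged positive, 0 otherwise; the
    gradient step lowers the probability of a wrong judgment."""
    p = sigmoid(w @ x + b)            # D's current belief that x is positive
    grad_logit = p - label            # d(loss)/d(logit) for cross-entropy
    return w - lr * grad_logit * x, b - lr * grad_logit

if __name__ == "__main__":
    vocab = {w: i for i, w in enumerate(
        "the payment request completed quickly before timeout weather".split())}
    w, b = np.zeros(len(vocab)), 0.0
    x = bow_features("the payment request completed before the timeout".split(), vocab)
    w, b = update_discriminator(w, b, x, label=1)   # D should call this sample positive
    print(sigmoid(w @ x + b))                       # probability moves above 0.5
```

In the patented scheme D would typically be a neural network trained by back-propagation (cf. classification G06N3/084); the logistic layer above only illustrates the direction of the parameter update.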
Step 6: G calculates its gradient according to the latest model parameters of D and updates its own model parameters, the aim being to maximize the probability that D judges wrongly;
The calculation in step 6 specifically comprises: based on the latest model parameters of D, G calculates gradients in the directions in which the previous D was most prone to error and in which the future D is most likely to err, and updates the model parameters for generating positive samples and for generating negative samples respectively.
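The patent does not state how G is parameterized, so the sketch below makes an explicit assumption: G keeps one categorical distribution over edit operations for positive-sample generation and another for negative-sample generation, and nudges each distribution with a score-function (REINFORCE-style) step toward the operations that most recently made D err. The operation names, the reward definition and the learning rate are all illustrative, not taken from the patent.

```python
import numpy as np

POSITIVE_OPS = ["insert_modifier", "delete_modifier", "replace_modifier"]
NEGATIVE_OPS = ["insert_confusing_word", "replace_stem_word"]

def softmax(logits):
    z = np.exp(logits - logits.max())
    return z / z.sum()

def update_generator(logits, op_index, d_was_fooled, lr=0.5):
    """Score-function update of G's operation distribution.

    d_was_fooled is True when D's latest judgment of the generated sample was
    wrong; the chosen operation is then made more likely, otherwise less likely."""
    probs = softmax(logits)
    reward = 1.0 if d_was_fooled else -1.0
    grad = -probs
    grad[op_index] += 1.0                 # gradient of log-prob of the chosen op
    return logits + lr * reward * grad

if __name__ == "__main__":
    pos_logits = np.zeros(len(POSITIVE_OPS))   # parameters for positive-sample generation
    neg_logits = np.zeros(len(NEGATIVE_OPS))   # parameters for negative-sample generation
    # Suppose the last positive sample (made by replacing a modifier) fooled D:
    pos_logits = update_generator(pos_logits, POSITIVE_OPS.index("replace_modifier"), True)
    # ...and the last negative sample (a stem-word swap) did not fool D:
    neg_logits = update_generator(neg_logits, NEGATIVE_OPS.index("replace_stem_word"), False)
    print(softmax(pos_logits), softmax(neg_logits))
```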
The invention also provides a system for training the text discrimination model, which comprises:
A sample extraction module: extracts a real language sample from the real language library and inputs it into the generation model.
A sample generation module: the generation model inserts, deletes or replaces modifiers in the extracted real language sample to obtain a first new language sample, and introduces confusing words into the stem words of the extracted real language sample to obtain a second new language sample.
A discrimination module: the first new language sample or the second new language sample is input into the discrimination model, which compares the input sample with the real language sample and judges whether it is a positive sample or a negative sample.
A discrimination model updating module: the judgment result of the discrimination model is compared with the expectation of the generation model, and the model parameters of the discrimination model are updated according to the comparison result.
A generation model updating module: the generation model updates its own model parameters according to the updated model parameters of the discrimination model.
Those skilled in the art will appreciate that, in addition to implementing the system and its various devices, modules, units provided by the present invention as pure computer readable program code, the system and its various devices, modules, units provided by the present invention can be fully implemented by logically programming method steps in the form of logic gates, switches, application specific integrated circuits, programmable logic controllers, embedded microcontrollers and the like. Therefore, the system and various devices, modules and units thereof provided by the invention can be regarded as a hardware component, and the devices, modules and units included in the system for realizing various functions can also be regarded as structures in the hardware component; means, modules, units for performing the various functions may also be regarded as structures within both software modules and hardware components for performing the method.
The foregoing description of specific embodiments of the present invention has been presented. It is to be understood that the present invention is not limited to the specific embodiments described above, and that various changes or modifications may be made by one skilled in the art within the scope of the appended claims without departing from the spirit of the invention. The embodiments and features of the embodiments of the present application may be combined with each other arbitrarily without conflict.

Claims (10)

1. A method for training a text discrimination model, comprising:
a sample extraction step: extracting a real language sample from a real language library and inputting the real language sample into a generation model;
a sample generation step: the generation model inserts, deletes or replaces modifiers in the extracted real language sample to obtain a first new language sample, and introduces confusing words into the stem words of the extracted real language sample to obtain a second new language sample;
a discrimination step: the first new language sample or the second new language sample is input into a discrimination model, which compares the input sample with the real language sample and judges whether it is a positive sample or a negative sample;
a discrimination model updating step: the judgment result of the discrimination model is compared with the expectation of the generation model, and the model parameters of the discrimination model are updated according to the comparison result;
a generation model updating step: the generation model updates its own model parameters according to the updated model parameters of the discrimination model.
2. The method for training a text discrimination model according to claim 1, wherein the sample generation step comprises:
for insertion of modifiers, the generation model determines insertable positions in the extracted real language sample and inserts modifiers at those positions;
for deletion of modifiers, the generation model determines the positions of existing modifiers in the extracted real language sample and deletes the modifiers at those positions;
for replacement of modifiers, the generation model determines the positions of existing modifiers in the extracted real language sample and replaces the modifiers at those positions with new modifiers;
wherein the insertion, deletion or replacement of modifiers does not change whether the extracted real language sample is classified as a positive sample or a negative sample.
3. The method for training a text discrimination model according to claim 1, wherein the sample generation step comprises:
for introduction of confusing words, the generation model determines the positions and categories of the stem words in the extracted real language sample, and inserts confusing words or replaces the stem words with confusing words, thereby changing whether the extracted real language sample is classified as a positive sample or a negative sample.
4. The method for training a text discrimination model according to claim 1, wherein the comparison in the discrimination step comprises: vectorizing the first new language sample or the second new language sample and calculating the KL divergence against the real language sample; if the resulting distribution difference is smaller than a preset value, the discrimination model judges the sample to be a positive sample, otherwise it judges the sample to be a negative sample.
5. The method for training a text discrimination model according to claim 1, wherein the discrimination model updating step comprises:
when the judgment result of the discrimination model is consistent with the expectation of the generation model, the discrimination model updates its model parameters through back-propagation so that the probability of the discrimination model giving a correct judgment becomes higher;
when the judgment result of the discrimination model is inconsistent with the expectation of the generation model, the discrimination model updates its model parameters through back-propagation so that the probability of the discrimination model giving a wrong judgment becomes lower;
and the generation model updating step comprises: based on the updated model parameters of the discrimination model, the generation model calculates gradients in the directions in which the previous discrimination model was most prone to error and in which the future discrimination model is most likely to err, and updates the model parameters of the generation model for positive-sample generation and for negative-sample generation respectively.
6. A system for training a text discrimination model, comprising:
a sample extraction module: extracting a real language sample from a real language library and inputting the real language sample into a generation model;
a sample generation module: the generation model inserts, deletes or replaces modifiers in the extracted real language sample to obtain a first new language sample, and introduces confusing words into the stem words of the extracted real language sample to obtain a second new language sample;
a discrimination module: the first new language sample or the second new language sample is input into a discrimination model, which compares the input sample with the real language sample and judges whether it is a positive sample or a negative sample;
a discrimination model updating module: the judgment result of the discrimination model is compared with the expectation of the generation model, and the model parameters of the discrimination model are updated according to the comparison result;
a generation model updating module: the generation model updates its own model parameters according to the updated model parameters of the discrimination model.
7. The system for training a text discrimination model according to claim 6, wherein the sample generation module comprises:
for insertion of modifiers, the generation model determines insertable positions in the extracted real language sample and inserts modifiers at those positions;
for deletion of modifiers, the generation model determines the positions of existing modifiers in the extracted real language sample and deletes the modifiers at those positions;
for replacement of modifiers, the generation model determines the positions of existing modifiers in the extracted real language sample and replaces the modifiers at those positions with new modifiers;
wherein the insertion, deletion or replacement of modifiers does not change whether the extracted real language sample is classified as a positive sample or a negative sample.
8. The system for training a text discrimination model according to claim 6, wherein the sample generation module comprises:
for introduction of confusing words, the generation model determines the positions and categories of the stem words in the extracted real language sample, and inserts confusing words or replaces the stem words with confusing words, thereby changing whether the extracted real language sample is classified as a positive sample or a negative sample.
9. The system for training a text discrimination model according to claim 6, wherein the comparison in the discrimination module comprises: vectorizing the first new language sample or the second new language sample and calculating the KL divergence against the real language sample; if the resulting distribution difference is smaller than a preset value, the discrimination model judges the sample to be a positive sample, otherwise it judges the sample to be a negative sample.
10. The system for training a text discrimination model according to claim 6, wherein the discrimination model updating module comprises:
when the judgment result of the discrimination model is consistent with the expectation of the generation model, the discrimination model updates its model parameters through back-propagation so that the probability of the discrimination model giving a correct judgment becomes higher;
when the judgment result of the discrimination model is inconsistent with the expectation of the generation model, the discrimination model updates its model parameters through back-propagation so that the probability of the discrimination model giving a wrong judgment becomes lower;
and the generation model updating module comprises: based on the updated model parameters of the discrimination model, the generation model calculates gradients in the directions in which the previous discrimination model was most prone to error and in which the future discrimination model is most likely to err, and updates the model parameters of the generation model for positive-sample generation and for negative-sample generation respectively.
CN202011347328.2A 2020-11-26 2020-11-26 Method and system for training text discrimination model Pending CN112328750A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011347328.2A CN112328750A (en) 2020-11-26 2020-11-26 Method and system for training text discrimination model

Publications (1)

Publication Number Publication Date
CN112328750A true CN112328750A (en) 2021-02-05

Family

ID=74309567

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011347328.2A Pending CN112328750A (en) 2020-11-26 2020-11-26 Method and system for training text discrimination model

Country Status (1)

Country Link
CN (1) CN112328750A (en)

Patent Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109003678A (en) * 2018-06-12 2018-12-14 清华大学 A kind of generation method and system emulating text case history
CN109766432A (en) * 2018-07-12 2019-05-17 中国科学院信息工程研究所 A kind of Chinese abstraction generating method and device based on generation confrontation network
CN109117482A (en) * 2018-09-17 2019-01-01 武汉大学 A kind of confrontation sample generating method towards the detection of Chinese text emotion tendency
US20200134415A1 (en) * 2018-10-30 2020-04-30 Huawei Technologies Co., Ltd. Autoencoder-Based Generative Adversarial Networks for Text Generation
CN109885667A (en) * 2019-01-24 2019-06-14 平安科技(深圳)有限公司 Document creation method, device, computer equipment and medium
CN111488422A (en) * 2019-01-25 2020-08-04 深信服科技股份有限公司 Incremental method and device for structured data sample, electronic equipment and medium
CN110347819A (en) * 2019-06-21 2019-10-18 同济大学 A kind of text snippet generation method based on positive negative sample dual training
CN110442859A (en) * 2019-06-28 2019-11-12 中国人民解放军国防科技大学 Method, device and equipment for generating labeled corpus and storage medium
CN110414003A (en) * 2019-07-29 2019-11-05 清华大学 Establish method, apparatus, medium and the calculating equipment of text generation model
CN111027292A (en) * 2019-11-29 2020-04-17 北京邮电大学 Method and system for generating limited sampling text sequence
CN111160043A (en) * 2019-12-31 2020-05-15 科大讯飞股份有限公司 Feature encoding method and device, electronic equipment and readable storage medium
CN111881935A (en) * 2020-06-19 2020-11-03 北京邮电大学 Countermeasure sample generation method based on content-aware GAN
CN111738351A (en) * 2020-06-30 2020-10-02 创新奇智(重庆)科技有限公司 Model training method and device, storage medium and electronic equipment
CN111914552A (en) * 2020-07-31 2020-11-10 平安科技(深圳)有限公司 Training method and device of data enhancement model

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
LIANG BIN: "Deep Text Classification Can Be Fooled", IJCAI-18 *
言有三: "Deep Learning for Face Image Processing: Core Algorithms and Practical Cases" (《深度学习之人脸图像处理：核心算法与案例实战》), 31 July 2020 *
陈云霁: "Intelligent Computing Systems" (《智能计算系统》), 29 February 2020 *

Similar Documents

Publication Publication Date Title
Wei et al. Supervised deep features for software functional clone detection by exploiting lexical and syntactical information in source code.
CN110110327B (en) Text labeling method and equipment based on counterstudy
US10963685B2 (en) Generating variations of a known shred
KR101813683B1 (en) Method for automatic correction of errors in annotated corpus using kernel Ripple-Down Rules
CN107688803B (en) Method and device for verifying recognition result in character recognition
CN108959418A (en) Character relation extraction method and device, computer device and computer readable storage medium
CN112016553B (en) Optical Character Recognition (OCR) system, automatic OCR correction system, method
US20170076152A1 (en) Determining a text string based on visual features of a shred
CN112651238A (en) Training corpus expansion method and device and intention recognition model training method and device
JP2005158010A (en) Apparatus, method and program for classification evaluation
CN110309073B (en) Method, system and terminal for automatically detecting user interface errors of mobile application program
CN113408535B (en) OCR error correction method based on Chinese character level features and language model
CN112966088B (en) Unknown intention recognition method, device, equipment and storage medium
CN111930939A (en) Text detection method and device
CN105512195A (en) Auxiliary method for analyzing and making decisions of product FMECA report
CN107357895A (en) A kind of processing method of the text representation based on bag of words
CN114818643A (en) Log template extraction method for reserving specific service information
CN116911289A (en) Method, device and storage medium for generating large-model trusted text in government affair field
CN111680684A (en) Method, device and storage medium for recognizing spine text based on deep learning
CN110633456A (en) Language identification method, language identification device, server and storage medium
CN112328750A (en) Method and system for training text discrimination model
KR102019208B1 (en) Deep learning-based error identification method and apparatus
CN109284392B (en) Text classification method, device, terminal and storage medium
CN115953123A (en) Method, device and equipment for generating robot automation flow and storage medium
CN112651590B (en) Instruction processing flow recommending method

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication (application publication date: 2021-02-05)