CN117648921A - Cross-theme composition automatic evaluation method and system based on paired double-layer countermeasure alignment - Google Patents
Cross-theme composition automatic evaluation method and system based on paired double-layer countermeasure alignment
- Publication number
- CN117648921A (Application CN202410114378.8A)
- Authority
- CN
- China
- Prior art keywords
- composition
- topic
- source
- subject
- cross
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/213—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
- G06N3/0442—Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0464—Convolutional networks [CNN, ConvNet]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0475—Generative networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/094—Adversarial learning
Abstract
The invention relates to the technical field of natural language processing and discloses a cross-topic automatic composition evaluation method and system based on paired dual-level adversarial alignment. The method comprises the following steps: inputting the text data of a composition to be evaluated into a trained cross-topic automatic composition evaluation model and outputting an evaluation result; the trained model is obtained by training on compositions from different topics with known evaluation results. During the training stage, the model extracts a composition representation for each pair of source-topic and target-topic compositions, maps the two representations into a feature space, and performs an alignment operation on their topic-level feature distributions in that space; at the same time, it performs an alignment operation on their category-level feature distributions in the same space; under a consistency constraint, the differences between the outputs of all classifiers are minimized. The invention improves the consistency and stability of scoring results.
Description
Technical Field
The invention relates to the technical field of natural language processing, and in particular to a cross-topic automatic composition evaluation method and system based on paired dual-level adversarial alignment.
Background
The statements in this section merely relate to the background of the present disclosure and may not necessarily constitute prior art.
Automated composition scoring automatically assesses the quality of student compositions by identifying technical errors and evaluating topic relevance, coherence, and other aspects. As an important application of natural language processing in education, automated composition scoring can not only greatly relieve teachers of the burden of grading but also provide rapid feedback to students.
Studies on automatic composition scoring have appeared in the academic literature since the 1960s. The task is generally defined as a regression task (predicting a continuous score for a composition), a classification task (classifying compositions into predefined classes, e.g., poor, medium, good, and excellent), or a ranking task (ordering two arbitrary compositions by quality). Early approaches were mostly based on manually designed features such as grammatical errors, coherence, or punctuation. In recent years, deep learning architectures such as convolutional neural networks (CNNs), recurrent neural networks (RNNs), and Transformers have become the basic frameworks of automatic composition scoring methods, owing to their ability to model complex patterns efficiently and identify key features in compositions.
Generally, automatic composition scoring is the task of automatically scoring articles written by students in response to a given topic. A topic is a composition requirement given as a piece of text, also called a prompt; each topic differs in genre, central theme, and other respects, so the content of a scored article is closely tied to its topic. Consequently, compositions from different topics show considerable differences in vocabulary, phrasing, and writing style. The invention trains an automatic composition scoring model: the topics used for training are called source topics, the topics used for testing are called target topics, and the goal is for a model trained on source-topic compositions to score target-topic compositions accurately. In real-world settings it is often challenging to collect scored compositions for a particular (target) topic, whereas large numbers of scored compositions belonging to other source topics are readily available. Cross-topic automatic composition scoring has therefore recently become an active research topic. One straightforward approach is to apply a model trained on source topics directly to the target topic for scoring; however, this leads to significant performance degradation caused by the difference between the source- and target-topic composition distributions, i.e., domain shift.
To alleviate the domain-shift problem, existing cross-topic automatic composition scoring methods attempt to learn a transferable scoring model from multiple scored source-topic composition sets by minimizing the difference between the source- and target-topic composition distributions. Some of these approaches assume that models trained on generic features transfer easily to new topics, and learn a transferable model from hand-crafted generic features extracted from multiple source topics to score target-topic compositions. However, such hand-crafted features, e.g., part-of-speech tags and misspellings, require considerable engineering skill and domain expertise. Other approaches build cross-topic scoring models in a two-stage paradigm: in the first stage they train a topic-independent model on multiple source-topic compositions; in the second stage they refine this topic-independent model with topic-specific information.
Despite significant advances, the performance of cross-topic automatic composition scoring methods remains unsatisfactory. Existing work has two main problems. First, it attempts to map multiple source topics and the target topic into a unified feature space to learn features shared by all of them. However, in addition to the domain shift between each source topic and the target topic, there is also domain shift between different source topics, so learning shared features applicable to all source- and target-topic compositions is challenging. Second, existing cross-topic scoring methods focus on aligning the global distributions of source- and target-topic compositions and ignore the inherent class structure of the data; the category distributions of the eight topics in the ASAP dataset, for example, differ significantly. Thus, even if the global source and target distributions are well aligned, misalignment between compositions belonging to the same category may persist.
Disclosure of Invention
To remedy the defects of the prior art, the invention provides a cross-topic automatic composition evaluation method and system based on paired dual-level adversarial alignment. Through paired dual-level adversarial alignment, feature distributions are aligned at both the topic level and the category level in the cross-topic automatic scoring task, shared features are learned for each pair of source-topic and target-topic compositions, and optimal alignment of the composition distributions is achieved.
In one aspect, a cross-topic automatic composition evaluation method based on paired dual-level adversarial alignment is provided, comprising: acquiring the text data of a composition to be evaluated; inputting the text data into a trained cross-topic automatic composition evaluation model and outputting an evaluation result; the trained model is obtained by training on compositions from different topics with known evaluation results.
During the training stage, the cross-topic automatic composition evaluation model extracts a composition representation for each pair of source-topic and target-topic compositions, maps the two representations into a feature space, and performs an alignment operation on their topic-level feature distributions in that space; at the same time, it performs an alignment operation on their category-level feature distributions in the same space; under a consistency constraint, the differences between the outputs of all classifiers are minimized.
In another aspect, a cross-topic automatic composition evaluation system based on paired dual-level adversarial alignment is provided, comprising: an acquisition module configured to acquire the text data of a composition to be evaluated; and an evaluation module configured to input the text data into a trained cross-topic automatic composition evaluation model and output an evaluation result; the trained model is obtained by training on compositions from different topics with known evaluation results.
During the training stage, the cross-topic automatic composition evaluation model extracts a composition representation for each pair of source-topic and target-topic compositions, maps the two representations into a feature space, and performs an alignment operation on their topic-level feature distributions in that space; at the same time, it performs an alignment operation on their category-level feature distributions in the same space; under a consistency constraint, the differences between the outputs of all classifiers are minimized.
The technical scheme has the following advantages and beneficial effects. First, the invention improves the alignment of feature distributions between source- and target-topic compositions: topic-level alignment aligns the overall topic-level distributions and alleviates domain shift, while category-level alignment aligns the category distributions at a fine granularity and further reduces distributional misalignment. Second, the invention improves scoring performance and accuracy; dual-level alignment minimizes the differences between compositions from different topics, making the scoring of target-topic compositions more accurate and reliable. Finally, the consistency constraint encourages agreement between the outputs of classifier pairs, improving the consistency and stability of scoring results.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the invention.
Fig. 1 is an internal structure diagram of an automatic cross-topic composition evaluation model according to the first embodiment.
Detailed Description
It should be noted that the following detailed description is exemplary and is intended to provide further explanation of the invention. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
The dual-level alignment method learns the shared features of each pair of source- and target-topic compositions separately and jointly aligns their feature distributions at the topic level and the category level. For each pair, topic-level alignment aligns their topic-level distributions, while category-level alignment aligns their category distributions in a fine-grained manner. By jointly aligning both levels, the proposed model achieves optimal alignment between the source- and target-topic composition distributions.
The invention then uses topic and category adversarial networks to align the topic-level and category-level distributions within each feature space. To further encourage consistency across all classifier pairs, a consistency constraint is introduced that minimizes the differences between the outputs of all classifier pairs. Finally, the topic and category adversarial networks are optimized jointly while the shared feature representations and classifiers are learned.
An embodiment provides a cross-topic automatic composition evaluation method based on paired dual-level adversarial alignment, comprising the following steps. S101: acquiring the text data of a composition to be evaluated. S102: inputting the text data into the trained cross-topic automatic composition evaluation model and outputting an evaluation result; the trained model is obtained by training on compositions from different topics with known evaluation results.
During the training stage, the cross-topic automatic composition evaluation model extracts a composition representation for each pair of source-topic and target-topic compositions, maps the two representations into a feature space, and performs an alignment operation on their topic-level feature distributions in that space; at the same time, it performs an alignment operation on their category-level feature distributions in the same space; under a consistency constraint, the differences between the outputs of all classifiers are minimized.
It should be understood that a topic is the written prompt or proposition given in a composition test, i.e., the subject that the writer must develop and discuss in the article. A source topic is a topic used for training that differs from the target topic; the target topic is the topic of the compositions to be scored.
Further, extracting the composition representation of each pair of source-topic and target-topic compositions specifically comprises: for each sentence in the composition, encoding it with a convolutional neural network to obtain a representation of each word; enhancing the word representations with a first attention-mechanism layer to obtain a sentence representation; aggregating the contextual information of the sentence representations with a long short-term memory (LSTM) network to obtain the hidden-state sequence of the composition; and enhancing the hidden-state sequence with a second attention-mechanism layer to obtain the representation of the composition.
Further, for each sentence in the composition, the convolutional neural network encodes its representation to obtain a representation of each word, as follows. The representation $z_i$ of the $i$-th word is:

$z_i = f\big(W_z \cdot [\,x_i : x_{i+h-1}\,] + b_z\big)$

where $x_i$ denotes the embedding of the $i$-th word, $[\,x_i : x_{i+h-1}\,]$ is the window formed by the $h$ consecutive word embeddings from the $i$-th to the $(i+h-1)$-th word, $h$ is the window size used in the convolution to extract local features, $f$ is an activation function, and $W_z$ and $b_z$ are the weight matrix and bias parameters.
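As a minimal NumPy sketch of this windowed word convolution (the choice of tanh as the activation $f$, the dimensions, and the variable names are illustrative assumptions, not taken from the patent):

```python
import numpy as np

def conv_word_repr(X, W, b, h):
    """Windowed 1-D convolution: X is (num_words, emb_dim); W is
    (out_dim, h * emb_dim); returns one representation per valid window."""
    n, d = X.shape
    out = []
    for i in range(n - h + 1):
        window = X[i:i + h].reshape(-1)          # concatenate h embeddings
        out.append(np.tanh(W @ window + b))      # tanh stands in for f
    return np.stack(out)

rng = np.random.default_rng(0)
X = rng.normal(size=(6, 4))      # 6 words, embedding dimension 4
W = rng.normal(size=(8, 3 * 4))  # window size h=3, output dimension 8
b = np.zeros(8)
Z = conv_word_repr(X, W, b, h=3)
print(Z.shape)  # (4, 8): one vector per window, 6 - 3 + 1 windows
```

A framework convolution layer would normally replace this loop; the sketch only shows the windowing arithmetic.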
Further, enhancing the representation of each word with the first attention-mechanism layer (word attention pooling) to obtain the sentence representation $s$ proceeds as follows:

$m_p = \tanh(W_m z_p + b_m)$; $u_p = \dfrac{\exp(w_u \cdot m_p)}{\sum_{q=1}^{M} \exp(w_u \cdot m_q)}$; $s = \sum_{p=1}^{M} u_p z_p$

where $W_m$ and $w_u$ are weight matrices, $b_m$ is a bias, $m_p$ and $u_p$ are respectively the attention vector and attention weight of the $p$-th word, $z_p$ is the representation of the $p$-th word, $\tanh$ is the hyperbolic tangent used for the nonlinear transformation, $\exp$ is the exponential function that plays a normalizing role in the attention calculation, the denominator sums the attention scores of all $M$ words (from $p = 1$ to $p = M$) to yield the attention weight $u_p$ of each word, and $s$ is the sentence representation obtained by multiplying each word representation by its attention weight and summing.
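The word attention pooling step can be sketched in NumPy as follows (dimensions and names are illustrative; the max-shift inside the softmax is a standard numerical-stability trick, not part of the formula):

```python
import numpy as np

def attention_pool(Z, Wm, bm, wu):
    """Word-level attention pooling: Z is (num_words, dim).
    Returns the sentence vector s and the attention weights u."""
    M = np.tanh(Z @ Wm.T + bm)            # attention vectors m_p
    scores = M @ wu                        # unnormalised scores w_u . m_p
    u = np.exp(scores - scores.max())      # softmax, shifted for stability
    u = u / u.sum()
    s = (u[:, None] * Z).sum(axis=0)       # weighted sum of word vectors
    return s, u

rng = np.random.default_rng(1)
Z = rng.normal(size=(5, 8))                          # 5 words, dim 8
Wm, bm, wu = rng.normal(size=(8, 8)), np.zeros(8), rng.normal(size=8)
s, u = attention_pool(Z, Wm, bm, wu)
print(s.shape, round(float(u.sum()), 6))  # weights sum to 1
```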
Further, aggregating the contextual information of the sentence representations with the long short-term memory network to obtain the hidden-state sequence of the composition proceeds as follows. After the sentence representations are obtained, context is aggregated with a long short-term memory network (LSTM): for a composition containing $T$ sentences, all sentence representations are fed into the LSTM units, and the resulting hidden-state sequence is denoted $\{h_1, \ldots, h_T\}$, computed as:

$h_t = \mathrm{LSTM}(s_t, h_{t-1})$

where $s_t$ and $h_t$ are respectively the input sentence representation and the hidden state at step $t$, $h_{t-1}$ is the hidden state of the previous time step, and LSTM denotes the long short-term memory network, which computes the hidden state $h_t$ of the current time step from the current input $s_t$ and the previous hidden state $h_{t-1}$.

Further, enhancing the hidden-state sequence with the second attention-mechanism layer to obtain the composition representation $e$ proceeds as follows:

$a_j = \tanh(W_a h_j + b_a)$; $\alpha_j = \dfrac{\exp(w_\alpha \cdot a_j)}{\sum_{k=1}^{T} \exp(w_\alpha \cdot a_k)}$; $e = \sum_{j=1}^{T} \alpha_j h_j$

where $W_a$ and $w_\alpha$ are weight matrices, $b_a$ is a bias term, $a_j$ and $\alpha_j$ are respectively the attention vector and attention weight of the $j$-th sentence, $h_j$ is the hidden state of the $j$-th sentence, $\tanh$ is the hyperbolic tangent used for the nonlinear transformation, $\exp$ is the exponential function, the denominator sums the attention scores of all $T$ sentences (from $k = 1$ to $k = T$) to yield the attention weight of each sentence, and $e$ is the composition representation obtained by multiplying each hidden state by its attention weight and summing.
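A single LSTM step over sentence representations can be written out by hand, which makes the recurrence $h_t = \mathrm{LSTM}(s_t, h_{t-1})$ concrete (gate stacking, dimensions, and names here are illustrative assumptions, not the patent's parameters):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(s_t, h_prev, c_prev, W, U, b):
    """One LSTM step; W, U, b stack the four gates
    (input, forget, output, candidate) for brevity."""
    d = h_prev.size
    z = W @ s_t + U @ h_prev + b
    i, f, o = sigmoid(z[:d]), sigmoid(z[d:2*d]), sigmoid(z[2*d:3*d])
    g = np.tanh(z[3*d:])
    c = f * c_prev + i * g        # updated cell state
    h = o * np.tanh(c)            # hidden state h_t
    return h, c

rng = np.random.default_rng(2)
din, dh = 8, 6                    # sentence-vector dim, hidden dim
W = rng.normal(size=(4 * dh, din))
U = rng.normal(size=(4 * dh, dh))
b = np.zeros(4 * dh)
h = c = np.zeros(dh)
for t in range(3):                # three sentence vectors in sequence
    h, c = lstm_step(rng.normal(size=din), h, c, W, U, b)
print(h.shape)
```

In practice a framework LSTM (with learned parameters) replaces this loop.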
It should be appreciated that the LSTM is a variant of the recurrent neural network (RNN) that effectively alleviates the vanishing-gradient problem of recurrent neural networks.
Further, mapping the two composition representations into the feature space specifically comprises: given the representation $e_{i,j}^{s}$ of the $j$-th composition of the $i$-th source topic and the representation $e_{j}^{t}$ of the $j$-th target-topic composition, the feature extractor $F_i$ maps them into the $i$-th feature space:

$f_{i,j}^{s} = F_i(e_{i,j}^{s})$; $f_{j}^{t} = F_i(e_{j}^{t})$

where $f_{i,j}^{s}$ and $f_{j}^{t}$ are respectively the representations of the $i$-th source-topic composition and the target-topic composition after passing through the $i$-th feature extractor $F_i$, and $F_i$ is a fully connected layer.
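A minimal sketch of the per-topic fully connected extractors, showing how one target-topic representation is mapped into each of the N feature spaces (the ReLU activation, dimensions, and names are assumptions for illustration):

```python
import numpy as np

def make_extractor(rng, d_in, d_out):
    """Each F_i is a fully connected layer; one is created per source topic."""
    W, b = rng.normal(size=(d_out, d_in)), np.zeros(d_out)
    return lambda e: np.maximum(W @ e + b, 0.0)   # ReLU assumed for the sketch

rng = np.random.default_rng(3)
N = 3                                   # three source topics
extractors = [make_extractor(rng, 10, 5) for _ in range(N)]
e_target = rng.normal(size=10)          # one target-topic composition vector
# The same target composition is mapped into each of the N feature spaces:
feats = [F(e_target) for F in extractors]
print(len(feats), feats[0].shape)
```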
By executing the above steps, the invention obtains, for a given composition, its representations in the N feature spaces.
Further, step S101, acquiring the text data of the composition to be evaluated, further comprises: removing special characters from the text, tokenizing it with a word-segmentation library, and converting the text into numerical feature vectors.
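The preprocessing described above can be sketched with the standard library; the regular expression, vocabulary, and the choice of 0 as the out-of-vocabulary index are simplifying assumptions standing in for the segmentation library mentioned in the text:

```python
import re

def preprocess(text, vocab):
    """Strip special characters, tokenise on whitespace, and map tokens
    to vocabulary indices (0 reserved for out-of-vocabulary words)."""
    cleaned = re.sub(r"[^A-Za-z0-9\s]", " ", text).lower()
    tokens = cleaned.split()
    return [vocab.get(tok, 0) for tok in tokens]

vocab = {"the": 1, "essay": 2, "topic": 3}
ids = preprocess("The essay, on-topic!", vocab)
print(ids)  # [1, 2, 0, 3] — "on" is out of vocabulary
```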
Further, as shown in fig. 1, the trained cross-topic automatic composition evaluation model comprises: an embedding layer, a convolutional neural network, a first attention-mechanism layer, a long short-term memory network, and a second attention-mechanism layer, connected in sequence; the output of the second attention-mechanism layer is connected to the first, second, third, ..., and N-th dual-level alignment units; N is a positive integer greater than or equal to 1.
The first through N-th dual-level alignment units have identical internal structures. The first dual-level alignment unit comprises a first fully connected layer whose input is connected to the output of the second attention-mechanism layer; the output of the first fully connected layer is connected to the inputs of the first classifier and the first gradient reversal layer; the output of the first classifier produces the evaluation result of the composition. The outputs of the first gradient reversal layer (GRL) are connected to the input of the first topic-level discriminator and the inputs of the first group of category-level discriminators.
The first group of category-level discriminators comprises four parallel category-level discriminators, whose inputs are all connected to the output of the first gradient reversal layer.
Further, the sequentially connected embedding layer, convolutional neural network, first attention-mechanism layer, long short-term memory network, and second attention-mechanism layer constitute the feature extractor, whose goal is to map the source topics and the target topic into separate feature spaces.
The first topic-level discriminator and the four parallel category-level discriminators are implemented as softmax classifiers; the first classifier is likewise implemented as a softmax classifier.
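The gradient reversal layer is the identity in the forward pass and multiplies the gradient by a negative factor in the backward pass, which is what turns the discriminator loss into an adversarial signal for the feature extractor. A hand-written sketch (no autograd framework; the scale factor lambda is an assumption):

```python
import numpy as np

class GradReverse:
    """Gradient reversal layer: identity forward, gradient scaled by
    -lambda backward (shown by hand, without an autograd framework)."""
    def __init__(self, lam=1.0):
        self.lam = lam
    def forward(self, x):
        return x                        # features pass through unchanged
    def backward(self, grad_out):
        return -self.lam * grad_out     # discriminator gradient is reversed

grl = GradReverse(lam=0.5)
x = np.array([1.0, -2.0, 3.0])
assert np.array_equal(grl.forward(x), x)   # forward is identity
g = grl.backward(np.array([0.2, 0.2, 0.2]))
print(g)  # [-0.1 -0.1 -0.1]
```

In a framework this is typically implemented as a custom autograd function; the effect is identical.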
Further, as shown in fig. 1, the training process comprises: constructing a training set containing compositions of N source topics with known score labels and compositions of a target topic with known pseudo score labels; randomly pairing the target-topic compositions with the compositions of the $i$-th source topic to obtain the $i$-th target-source composition pair; $i$ ranges from 1 to N.
The $i$-th target-source composition pair is input into the cross-topic automatic composition evaluation model, which extracts features from the pair to obtain the representation of the target-topic composition and the representation of the $i$-th source-topic composition; both representations are then mapped into the feature space by the $i$-th fully connected layer.
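One plausible reading of the pairing step is sketched below: each of the N source-topic sets is paired with randomly sampled target-topic compositions. The sampling scheme, names, and toy data are assumptions for illustration, not the patent's exact procedure:

```python
import random

def make_pairs(target_essays, source_topics, seed=0):
    """Pair each of the N source-topic sets with randomly sampled
    target-topic compositions, yielding N target-source pairs."""
    rng = random.Random(seed)
    pairs = []
    for i, source_essays in enumerate(source_topics):
        sampled = [rng.choice(target_essays) for _ in source_essays]
        pairs.append((i, list(zip(sampled, source_essays))))
    return pairs

targets = ["t1", "t2", "t3"]
sources = [["s1a", "s1b"], ["s2a"], ["s3a", "s3b", "s3c"]]
pairs = make_pairs(targets, sources)
print(len(pairs), len(pairs[2][1]))  # 3 pairs; third has 3 couples
```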
The output of the $i$-th fully connected layer is fed into the $i$-th classifier, and the cross-entropy loss of the $i$-th classifier on the source-topic compositions is computed; the output of the $i$-th fully connected layer is also fed into the $i$-th gradient reversal layer, whose output is fed into the $i$-th topic-level discriminator and the $i$-th group of category-level discriminators.
The topic-level adversarial loss of the $i$-th topic-level discriminator is computed; the category-level adversarial loss of the $i$-th group of category-level discriminators is computed; the loss over all target-topic compositions during pseudo-label generation is computed; and the classifier-consistency constraint loss is computed.
The total loss value is then computed, and training stops when it no longer decreases, yielding the trained cross-topic automatic composition evaluation model. The total loss is the sum of the cross-entropy losses of the N classifiers on the source-topic compositions, the topic-level adversarial losses of the N topic-level discriminators, the category-level adversarial losses of the N groups of category-level discriminators, the loss over all target-topic compositions, and the classifier-consistency constraint loss. The average of the N classifier outputs of the trained model is taken as the predicted score of a target-topic composition.
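The loss aggregation and the averaged prediction can be sketched as follows; the numeric loss values are placeholders and the patent may additionally weight the terms, which this sketch omits:

```python
import numpy as np

def total_loss(cls_losses, topic_adv_losses, class_adv_losses,
               target_loss, consistency_loss):
    """Sum of the N classifier losses, the N topic-level and N
    category-level adversarial losses, the target-composition
    (pseudo-label) loss, and the consistency constraint."""
    return (sum(cls_losses) + sum(topic_adv_losses)
            + sum(class_adv_losses) + target_loss + consistency_loss)

def predict(classifier_probs):
    """Prediction for a target composition: average of the N classifiers'
    softmax outputs; the argmax is the predicted score category."""
    avg = np.mean(classifier_probs, axis=0)
    return avg, int(np.argmax(avg))

L = total_loss([0.5, 0.4], [0.3, 0.2], [0.1, 0.1], 0.2, 0.05)
probs = np.array([[0.1, 0.7, 0.2],      # classifier 1 (N = 2)
                  [0.3, 0.5, 0.2]])     # classifier 2
avg, label = predict(probs)
print(round(L, 2), label)  # 1.85 1
```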
Further, the cross-entropy loss functions of the source topic compositions of the N classifiers are obtained as follows. For the compositions of the $j$-th source topic, the cross-entropy loss $\mathcal{L}_{ce}^{j}$ between their predicted scores and true scores is calculated:

$\mathcal{L}_{ce}^{j} = -\frac{1}{n_j}\sum_{i=1}^{n_j}\sum_{c=0}^{C-1}\mathbb{1}\left[y_i^{s_j}=c\right]\log p_c\!\left(C_j\!\left(f_i^{s_j}\right)\right)$

where $f_i^{s_j}=F_j\!\left(x_i^{s_j}\right)$ is the representation of the $i$-th composition of the $j$-th source topic obtained through the $j$-th topic feature extractor, $p_c\!\left(C_j\!\left(f_i^{s_j}\right)\right)$ denotes the predicted probability of the $c$-th composition-label category produced by the $j$-th topic classifier from that representation, $y_i^{s_j}$ denotes the true label of the $i$-th composition of the $j$-th source topic, $c$ denotes a category ranging from $0$ to $C-1$, $n_j$ denotes the number of compositions within the $j$-th source topic, and $\mathcal{L}_{ce}$ denotes the cross-entropy loss.
Thus, for the $N$ source-topic-specific classifiers, the overall source topic cross-entropy loss $\mathcal{L}_{cls}$ is:

$\mathcal{L}_{cls} = \sum_{j=1}^{N}\mathcal{L}_{ce}^{j}$

where $\mathcal{L}_{ce}^{j}$ denotes the cross-entropy loss of the $j$-th source topic; with $N$ source topics in total, the $N$ per-topic losses are summed.
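The per-topic cross-entropy loss and its sum over the N source topics can be sketched as follows (a minimal pure-Python illustration, not the patent's actual implementation; `softmax`, `topic_cross_entropy`, and `overall_source_loss` are hypothetical helper names):

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def topic_cross_entropy(logits_per_essay, labels):
    """Per-topic loss: mean of -log p_y over one source topic's
    compositions, as in the per-topic cross-entropy described above."""
    loss = 0.0
    for logits, y in zip(logits_per_essay, labels):
        probs = softmax(logits)
        loss += -math.log(probs[y])
    return loss / len(labels)

def overall_source_loss(per_topic_logits, per_topic_labels):
    """Overall source loss: sum of the N per-topic cross-entropy losses."""
    return sum(topic_cross_entropy(lo, la)
               for lo, la in zip(per_topic_logits, per_topic_labels))
```

For a composition with uniform logits over the four score categories (poor, medium, good, excellent), the per-composition loss equals log 4, and the overall loss adds one such term per source topic.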
It should be appreciated that, in order to learn the shared characteristics of each source and target topic composition pair, the invention aligns their distributions at the topic level and the category level through adversarial networks. Specifically, the representation of a given source composition is input into its corresponding classifier and discriminator. The classifier is a softmax classifier that predicts the label of each composition (i.e., poor, medium, good, or excellent). The discriminator is also a softmax classifier, used to determine whether the input composition comes from the target topic.
The topic-level adversarial loss functions of the N topic-level discriminators are obtained as follows. For each pair of source and target topic compositions, the cross-entropy loss between their predicted and true topic labels is calculated:

$\mathcal{L}_{d}^{j} = -\frac{1}{n_j}\sum_{i=1}^{n_j}\log D_j\!\left(f_i^{s_j}\right)-\frac{1}{n_t}\sum_{i=1}^{n_t}\log\!\left(1-D_j\!\left(f_i^{t}\right)\right)$

where $n_j$ denotes the number of compositions in the $j$-th source topic, $f_i^{s_j}$ and $f_i^{t}$ denote the feature representations of the source and target topic compositions obtained through the $j$-th topic feature extractor, $D_j$ denotes the $j$-th topic discriminator, which is a softmax classifier used to determine whether the input composition comes from the current source topic or from the target topic, and $n_t$ denotes the number of compositions of the target topic.
Thus, the total loss of all $N$ topic-specific discriminators, called the topic-level adversarial loss function $\mathcal{L}_{topic}$, is obtained by:

$\mathcal{L}_{topic} = \sum_{j=1}^{N}\mathcal{L}_{d}^{j}$

There are $N$ source topics in total; $\mathcal{L}_{d}^{j}$ is the topic-level adversarial loss of only the $j$-th source topic, and $\mathcal{L}_{topic}$ is the resulting sum of the $N$ losses.
In learning the shared feature representation, the feature extractor of a particular topic aims to minimize the classification loss on the source topic compositions to achieve accurate scoring, while maximizing the discriminator loss in order to confuse the discriminator.
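The gradient reversal mechanism that makes this min-max game trainable with ordinary backpropagation can be sketched as follows (a toy pure-Python illustration under the assumption of scalar features and gradients; in practice this is implemented as a custom autograd operation in a deep-learning framework, and `GradReverse` is a hypothetical name):

```python
class GradReverse:
    """Toy gradient-reversal layer: identity in the forward pass, and the
    incoming gradient multiplied by -lambda in the backward pass."""
    def __init__(self, lam=1.0):
        self.lam = lam

    def forward(self, x):
        # The discriminator sees the features unchanged.
        return x

    def backward(self, grad_output):
        # Sign flip: the feature extractor ascends the discriminator loss
        # (confusing it) while the discriminator itself descends it.
        return -self.lam * grad_output

grl = GradReverse(lam=0.5)
print(grl.forward(2.0))   # identity in the forward pass
print(grl.backward(1.0))  # reversed, scaled gradient
```

Because the reversal happens only in the backward pass, a single optimizer step simultaneously trains the discriminator to separate topics and the feature extractor to mix them.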
Further, the loss function of all target topic compositions is obtained as follows. Pseudo-labels are generated for the unlabeled target topic compositions. For each target topic composition $x_i^{t}$, its representation $f_i^{t}$ and soft prediction $q_i$ are calculated. At temperature $T=1/2$, the soft prediction is sharpened:

$\tilde{q}_{i,c} = \frac{q_{i,c}^{1/T}}{\sum_{c'=0}^{C-1} q_{i,c'}^{1/T}}$

where $q_{i,c}$ is the prediction probability of the $c$-th category in the original soft prediction probability vector $q_i$, $T$ is the temperature parameter, $n_t$ denotes the total number of target topic compositions, and $\tilde{q}_{i,c}$ denotes the probability of the $c$-th category in the sharpened prediction probability vector. To store the representations and predictions of all target topic compositions, a memory bank $M_j$ is assigned to each topic and is iteratively updated in a moving-average manner, defined as follows:

$\bar{f}_i \leftarrow \mu\,\bar{f}_i + (1-\mu)\,f_i^{t}$; $\bar{q}_i \leftarrow \mu\,\bar{q}_i + (1-\mu)\,\tilde{q}_i$

where $\mu$ denotes the update smoothing parameter, controlling the extent to which new data affects the moving average; $f_i^{t}$ is the feature representation of the target topic composition and $\bar{f}_i$ is the moving-averaged feature representation; similarly, $\tilde{q}_i$ is the target topic soft prediction and $\bar{q}_i$ is the moving-averaged soft prediction.
To determine the $k$ nearest neighbors of $x_i^{t}$, the cosine similarity between $f_i^{t}$ and the feature representations of all compositions stored in the topic-specific memory bank $M_j$ is calculated.
To integrate the information of its $k$ neighbors, the soft labels of the neighbors are averaged to produce the soft label of $x_i^{t}$. The soft label $\hat{q}_i$ is then determined as follows:

$\hat{q}_i = \frac{1}{k}\sum_{m\in\mathcal{N}_k(i)} \bar{q}_m$; $\hat{y}_i = \arg\max_{c}\,\hat{q}_{i,c}$

and the corresponding highest probability is set as the confidence $w_i = \max_{c}\hat{q}_{i,c}$ of the pseudo-label $\hat{y}_i$. The soft label $\hat{q}_i$ contains $C$ class probabilities, and the class with the maximum probability is selected as the pseudo-label $\hat{y}_i$; $\hat{q}_{i,c}$ is the probability of the $c$-th category of the soft label, $k$ denotes the number of neighbors, $\mathcal{N}_k(i)$ denotes the neighbor set, $t$ denotes the target topic, $x_i^{t}$ denotes a composition in the target topic, and $\bar{q}_m$ denotes the soft label of the $m$-th neighbor of $x_i^{t}$, taken from the memory bank.
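The pseudo-label pipeline just described (temperature sharpening, moving-average memory bank, cosine-similarity nearest neighbors, and neighbor-averaged soft labels) can be sketched in plain Python as follows (a simplified sketch, not the patent's actual implementation; all function names are hypothetical):

```python
import math

def sharpen(q, T=0.5):
    """Temperature sharpening: raise each probability to 1/T and
    renormalize (T = 1/2 squares the probabilities, boosting the top class)."""
    powered = [p ** (1.0 / T) for p in q]
    s = sum(powered)
    return [p / s for p in powered]

def ema(old, new, mu=0.9):
    """Moving-average memory-bank update of a stored vector."""
    return [mu * o + (1.0 - mu) * n for o, n in zip(old, new)]

def cosine(a, b):
    """Cosine similarity between two feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def pseudo_label(feat, bank_feats, bank_soft, k=2):
    """Average the soft labels of the k bank entries most cosine-similar
    to `feat`; return (pseudo_label, confidence)."""
    order = sorted(range(len(bank_feats)),
                   key=lambda m: cosine(feat, bank_feats[m]),
                   reverse=True)
    neighbors = order[:k]
    C = len(bank_soft[0])
    avg = [sum(bank_soft[m][c] for m in neighbors) / k for c in range(C)]
    y_hat = max(range(C), key=lambda c: avg[c])
    return y_hat, avg[y_hat]
```

The returned confidence is later used to weight the cross-entropy term of each target composition, so unreliable pseudo-labels contribute less to training.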
Based on the $N$ topic-specific classifiers, the loss of all target topic compositions is calculated as a confidence-weighted cross-entropy loss:

$\mathcal{L}_{t} = -\frac{1}{n_t}\sum_{j=1}^{N}\sum_{i=1}^{n_t} w_i \log p_{\hat{y}_i}\!\left(C_j\!\left(f_i^{t}\right)\right)$

where $\hat{y}_i$ denotes the category pseudo-label; the corresponding highest probability within the soft label used in calculating the pseudo-label is set as the confidence $w_i$ of the pseudo-label $\hat{y}_i$; $p_{\hat{y}_i}\!\left(C_j\!\left(f_i^{t}\right)\right)$ denotes the prediction result obtained by the $j$-th classifier for the target topic composition; the value of $\hat{y}_i$ ranges from $0$ to $C-1$; and $n_t$ is the number of target topic compositions.
Overall, there are significant differences in category distribution between compositions on different topics. In order to take the inherent category structure of the different topics into account, the invention performs category-level alignment for each pair of source and target topic compositions.
Further, the class-level adversarial loss functions of the N groups of class-level discriminators are obtained as follows. The cross-entropy loss of the $k$-th class discriminator of the $j$-th source-target topic pair is:

$\mathcal{L}_{d,k}^{j} = -\frac{1}{\left|S_j^{k}\right|}\sum_{x\in S_j^{k}}\log D_j^{k}\!\left(F_j(x)\right)-\frac{1}{\left|T^{k}\right|}\sum_{x\in T^{k}}\log\!\left(1-D_j^{k}\!\left(F_j(x)\right)\right)$

where $S_j^{k}$ and $T^{k}$ respectively denote the $k$-th category of the composition corpus of the $j$-th source topic and of the target topic; $D_j^{k}\!\left(F_j(x^{s_j})\right)$ denotes the result of passing a source topic composition through the $k$-th class discriminator, and $D_j^{k}\!\left(F_j(x^{t})\right)$ denotes the result of passing a target topic composition through the $k$-th class discriminator.
The overall loss of the class-level discriminators of the $N$ source and target topic pairs is referred to as the class-level adversarial loss:

$\mathcal{L}_{class} = \sum_{j=1}^{N}\sum_{k=1}^{K}\mathcal{L}_{d,k}^{j}$

where $\mathcal{L}_{d,k}^{j}$ is the loss of one category of one source topic; since there are $N$ source topics and $K$ categories, the sum is calculated over $N\times K$ terms.
It will be appreciated that, in order to align the category-level feature distribution of each pair of source and target topic compositions, compositions belonging to the same category are input into their respective class discriminators. Each class discriminator aims to judge whether the input composition comes from the target topic. To align the category-level feature distribution of each pair, the feature extractor of a particular topic attempts to confuse its corresponding class-level discriminators; this is likewise adversarial training between a topic-specific classifier and the corresponding class-level discriminators. Further, the classifier consistency constraint loss function is:

$\mathcal{L}_{con} = \frac{1}{|T|}\sum_{i=1}^{|T|}\sum_{j=1}^{N}\sum_{j'=j+1}^{N}\left|p_j\!\left(x_i^{t}\right)-p_{j'}\!\left(x_i^{t}\right)\right|$

where $\left|p_j\!\left(x_i^{t}\right)-p_{j'}\!\left(x_i^{t}\right)\right|$ represents the absolute value of the difference between the prediction probabilities generated by each pair of topic-specific classifiers (the $j$-th and the $j'$-th), $N$ represents the number of source topics, and $|T|$ represents the number of target topic compositions.
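The consistency constraint between each pair of topic-specific classifiers can be sketched as follows (a simplified sketch of the pairwise absolute-difference penalty, assuming one probability vector per classifier per target composition; the exact normalization in the patent may differ, and `consistency_loss` is a hypothetical name):

```python
def consistency_loss(probs_per_classifier):
    """Sum of absolute differences between the probability vectors of
    every pair of topic-specific classifiers, averaged over the target
    compositions. Indexing: probs_per_classifier[j][i] is classifier j's
    probability vector for target composition i."""
    N = len(probs_per_classifier)       # number of source topics / classifiers
    T = len(probs_per_classifier[0])    # number of target compositions
    total = 0.0
    for i in range(T):
        for j in range(N):
            for jp in range(j + 1, N):  # each unordered classifier pair once
                pj = probs_per_classifier[j][i]
                pk = probs_per_classifier[jp][i]
                total += sum(abs(a - b) for a, b in zip(pj, pk))
    return total / T
```

The loss is zero exactly when all classifiers agree on every target composition, which is the behavior the constraint encourages.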
After the topic-level and category-level alignment of each pair of source and target topics is completed, $N$ prediction results are obtained for each target topic composition. The invention introduces a consistency constraint to encourage consistency among these $N$ topic-specific classifiers. Further, the total loss function is:

$\mathcal{L} = \mathcal{L}_{cls} + \mathcal{L}_{t} + \lambda_1\left(\mathcal{L}_{topic} + \mathcal{L}_{class}\right) + \lambda_2\,\mathcal{L}_{con}$

where $\lambda_1$ and $\lambda_2$ are weight parameters adjusting the relative importance of the different losses, $\mathcal{L}_{cls}$ is the overall source topic cross-entropy loss, $\mathcal{L}_{t}$ is the loss of all target topic compositions, $\mathcal{L}_{topic}$ is the topic-level adversarial loss, $\mathcal{L}_{class}$ is the category-level adversarial loss, and $\mathcal{L}_{con}$ is the consistency loss.
The final loss thus consists of three main parts: the classification losses (i.e., the source topic cross-entropy loss and the target topic composition loss), the double adversarial losses (i.e., the topic-level and class-level adversarial losses), and the classifier consistency loss. As the double-layer alignment adversarial network learns shared feature representations through adversarial training, the feature extractor of a particular topic strives to minimize the classification losses of the source topic and the target topic to achieve accurate scoring, while maximizing the double adversarial losses to confuse the corresponding topic-level and class-level discriminators. This means that the discriminators and the classifiers follow opposite gradient directions when performing parameter updates. The invention realizes this in the back-propagation process by automatically reversing the gradient of the discriminator losses before it propagates to the parameters of the topic-specific feature extractor, thereby achieving the effect of adversarial training.
After these loss functions are jointly optimized to obtain the optimal double adversarial network, the final score of each target topic composition is obtained by averaging the prediction results of all source-topic-specific classifiers.
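The final scoring step, averaging the N classifiers' predictions for one target composition, can be sketched as follows (a minimal illustration; `final_score` is a hypothetical name):

```python
def final_score(probs_per_classifier):
    """Average the N topic-specific classifiers' probability vectors for
    one target composition and return (predicted_category, averaged_vector)."""
    N = len(probs_per_classifier)
    C = len(probs_per_classifier[0])
    avg = [sum(p[c] for p in probs_per_classifier) / N for c in range(C)]
    return max(range(C), key=lambda c: avg[c]), avg
```

Averaging the probability vectors before taking the argmax lets a confident majority of classifiers outvote a single outlier, which complements the consistency constraint applied during training.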
Further, step S102, inputting the text data of the composition to be evaluated into the trained cross-topic composition automatic evaluation model and outputting the evaluation result, specifically comprises: the trained cross-topic composition automatic evaluation model extracts the representation of the composition to be evaluated; the extracted representation is mapped to the feature space; the features in the feature space are classified to obtain the classification results of the N classifiers; and the classification results of the N classifiers are averaged to obtain the evaluation result.
The source topic compositions and target topic compositions are mapped into different feature spaces. All source-target topic composition pairs first pass through a shared feature extractor to obtain a common feature representation, learning the features common to all source topics and the target topic. After the shared feature extractor, each source-target topic composition pair passes through its own feature extractor to maximally extract the features shared by that source topic and the target topic.
The topic and category adversarial networks align the topic-level and category-level distributions in each feature space. The input to the discriminator in the topic adversarial network is a sample from each source-target topic pair, and the discriminator determines whether the sample comes from the source topic or the target topic. The input to a discriminator in the category adversarial network is a sample of the same category from each source-target pair, and it likewise determines whether the sample comes from the source topic or the target topic.
Each source-target topic pair has its own adversarial network, which means there are multiple classifiers that can be used to score the target topic compositions. By introducing a consistency constraint, each classifier is constrained to produce scoring results for the same target topic composition that are close to, or even identical with, one another.
The invention improves the alignment of the feature distributions between the source topic compositions and the target topic compositions. Through topic-level alignment, the topic-level distributions of the source and target topic compositions are aligned as a whole, alleviating the problem of domain drift. Meanwhile, through category-level alignment, the distributions of composition categories are aligned at a fine granularity, further reducing distribution misalignment. The method improves the scoring performance and accuracy of composition evaluation: double-layer alignment minimizes the differences between compositions on different topics, making the scoring of target topic compositions more accurate and reliable. Finally, the consistency constraint encourages agreement between the outputs of each pair of classifiers, improving the consistency and stability of the scoring results.
Embodiment II provides a cross-topic composition automatic evaluation system based on paired double-layer adversarial alignment, comprising: an acquisition module configured to acquire text data of a composition to be evaluated; and an evaluation module configured to input the text data of the composition to be evaluated into a trained cross-topic composition automatic evaluation model and output an evaluation result; the trained cross-topic composition automatic evaluation model is obtained by training on compositions of different topics with known evaluation results.
During the training stage of the cross-topic composition automatic evaluation model, the composition representations of each pair of source topic composition and target topic composition are extracted, the two representations are mapped into a feature space, and an alignment operation is performed on the topic-level feature distributions of the two representations in the feature space; meanwhile, an alignment operation is performed on the category-level feature distributions of the two representations in the feature space; and, under a consistency constraint, the differences between the outputs of all classifiers are minimized.
The above description covers only preferred embodiments of the present invention and is not intended to limit it; those skilled in the art may make various modifications and variations. Any modification, equivalent replacement, improvement, etc. made within the spirit and principles of the present invention shall fall within the protection scope of the present invention.
Claims (10)
1. A cross-topic composition automatic evaluation method based on paired double-layer adversarial alignment, characterized by comprising the following steps:
acquiring text data of a composition to be evaluated;
inputting the text data of the composition to be evaluated into a trained cross-topic composition automatic evaluation model, and outputting an evaluation result; the trained cross-topic composition automatic evaluation model being obtained by training on compositions of different topics with known evaluation results;
wherein, during the training stage of the cross-topic composition automatic evaluation model, the composition representations of each pair of source topic composition and target topic composition are extracted, the two representations are mapped into a feature space, and an alignment operation is performed on the topic-level feature distributions of the two representations in the feature space; meanwhile, an alignment operation is performed on the category-level feature distributions of the two representations in the feature space; and, under a consistency constraint, the differences between the outputs of all classifiers are minimized.
2. The cross-topic composition automatic evaluation method based on paired double-layer adversarial alignment according to claim 1, wherein extracting the composition representations of each pair of source topic composition and target topic composition specifically comprises:

for each sentence in the composition, using a convolutional neural network to encode it, obtaining a representation of each word;

using a first attention mechanism layer to enhance the representation of each word, obtaining a sentence representation;

using a long short-term memory network to aggregate the context information of the sentence representations, obtaining a hidden state sequence representation of the composition;

and using a second attention mechanism layer to enhance the hidden state sequence representation, obtaining the representation of the composition.
3. The cross-topic composition automatic evaluation method based on paired double-layer adversarial alignment according to claim 1, wherein mapping the two composition representations into a feature space specifically comprises:

the obtained composition representations of the target topic composition and the source topic composition are mapped to the feature space by the feature extractor $F_j$:

$f_i^{s_j} = F_j\!\left(x_i^{s_j}\right)$;

$f_i^{t} = F_j\!\left(x_i^{t}\right)$;

where $x_i^{s_j}$ denotes the $i$-th composition of the $j$-th source topic, $x_i^{t}$ denotes the $i$-th target topic composition, $F_j$ denotes the feature extractor of the $j$-th source topic, and $f_i^{s_j}$ and $f_i^{t}$ respectively denote the composition representations obtained by passing the $i$-th source topic composition and target topic composition through the $j$-th feature extractor $F_j$; the feature extractor $F_j$ is a fully connected layer.
4. The cross-topic composition automatic evaluation method based on paired double-layer adversarial alignment according to claim 1, wherein the trained cross-topic composition automatic evaluation model comprises:

an embedding layer, a convolutional neural network, a first attention mechanism layer, a long short-term memory network, and a second attention mechanism layer connected in sequence; the output end of the second attention mechanism layer is respectively connected to a first double-layer alignment unit, a second double-layer alignment unit, a third double-layer alignment unit, ..., and an N-th double-layer alignment unit; N is a positive integer greater than or equal to 1;

the internal structures of the first double-layer alignment unit, the second double-layer alignment unit, the third double-layer alignment unit, ..., and the N-th double-layer alignment unit are identical; the first double-layer alignment unit comprises: a first fully connected layer, the input end of which is connected to the output end of the second attention mechanism layer; the output end of the first fully connected layer is respectively connected to the input end of the first classifier and the input end of the first gradient reversal layer; the output end of the first classifier is used for outputting the evaluation result of the composition;

the output end of the first gradient reversal layer is respectively connected to the input end of the first topic-level discriminator and the input ends of the first group of class-level discriminators; wherein the first group of class-level discriminators comprises four parallel class-level discriminators, and the input ends of the four parallel class-level discriminators are all connected to the output end of the first gradient reversal layer.
5. The cross-topic composition automatic evaluation method based on paired double-layer adversarial alignment according to claim 4, wherein the training process comprises:

constructing a training set, the training set comprising compositions of N source topics with known evaluation-level labels and target topic compositions with known evaluation-level pseudo-labels; the target topic compositions are randomly paired with the compositions of the $j$-th source topic to obtain the $j$-th group of target topic-source topic composition pairs; $j$ ranges from 1 to N;

inputting the $j$-th group of target topic-source topic composition pairs into the cross-topic composition automatic evaluation model, which performs feature extraction on the $j$-th group of target topic-source topic composition pairs to obtain representations of the target topic compositions and representations of the compositions of the $j$-th source topic;

mapping the representations of the target topic compositions and of the compositions of the $j$-th source topic to the feature space by the $j$-th fully connected layer; feeding the output value of the $j$-th fully connected layer into the $j$-th classifier, and calculating the cross-entropy loss function of the source topic compositions of the $j$-th classifier;

feeding the output value of the $j$-th fully connected layer into the $j$-th gradient reversal layer, and feeding the output value of the $j$-th gradient reversal layer into the $j$-th topic-level discriminator and the $j$-th group of class-level discriminators; calculating the topic-level adversarial loss function of the $j$-th topic-level discriminator; calculating the class-level adversarial loss function of the $j$-th group of class-level discriminators; calculating the loss function of all target topic compositions during pseudo-label generation for the target topic compositions of the training set; and calculating the classifier consistency constraint loss function;

calculating the total loss function value, and stopping training when the total loss no longer decreases, yielding the trained cross-topic composition automatic evaluation model; wherein the total loss function is the sum of: the cross-entropy loss functions of the source topic compositions of the N classifiers, the topic-level adversarial loss functions of the N topic-level discriminators, the class-level adversarial loss functions of the N groups of class-level discriminators, the loss function of all target topic compositions, and the classifier consistency constraint loss function; and taking the average of the output values of the N classifiers of the trained cross-topic composition automatic evaluation model as the predicted score of the target topic composition.
6. The cross-topic composition automatic evaluation method based on paired double-layer adversarial alignment according to claim 5, wherein the cross-entropy loss functions of the source topic compositions of the N classifiers specifically comprise:

for the compositions of the $j$-th source topic, calculating the cross-entropy loss $\mathcal{L}_{ce}^{j}$ between their predicted scores and true scores:

$\mathcal{L}_{ce}^{j} = -\frac{1}{n_j}\sum_{i=1}^{n_j}\sum_{c=0}^{C-1}\mathbb{1}\left[y_i^{s_j}=c\right]\log p_c\!\left(C_j\!\left(f_i^{s_j}\right)\right)$

where $f_i^{s_j}=F_j\!\left(x_i^{s_j}\right)$ is the representation of the $i$-th composition of the $j$-th source topic obtained through the $j$-th topic feature extractor, $p_c\!\left(C_j\!\left(f_i^{s_j}\right)\right)$ denotes the predicted probability of the $c$-th composition-label category produced by the $j$-th topic classifier from that representation, $y_i^{s_j}$ denotes the true label of the $i$-th composition of the $j$-th source topic, $c$ denotes a category ranging from $0$ to $C-1$, $n_j$ denotes the number of compositions within the $j$-th source topic, and $\mathcal{L}_{ce}$ denotes the cross-entropy loss;

thus, for the $N$ source-topic-specific classifiers, the overall source topic cross-entropy loss $\mathcal{L}_{cls}$ is:

$\mathcal{L}_{cls} = \sum_{j=1}^{N}\mathcal{L}_{ce}^{j}$

where $\mathcal{L}_{ce}^{j}$ denotes the cross-entropy loss of the $j$-th source topic; with $N$ source topics in total, the $N$ per-topic losses are summed.
7. The cross-topic composition automatic evaluation method based on paired double-layer adversarial alignment according to claim 5, wherein the topic-level adversarial loss functions of the N topic-level discriminators specifically comprise:

for each pair of source and target topic compositions, calculating the cross-entropy loss between their predicted and true topic labels:

$\mathcal{L}_{d}^{j} = -\frac{1}{n_j}\sum_{i=1}^{n_j}\log D_j\!\left(f_i^{s_j}\right)-\frac{1}{n_t}\sum_{i=1}^{n_t}\log\!\left(1-D_j\!\left(f_i^{t}\right)\right)$

where $n_j$ denotes the number of compositions in the $j$-th source topic, $f_i^{s_j}$ and $f_i^{t}$ denote the feature representations of the source and target topic compositions obtained through the $j$-th topic feature extractor, $D_j$ denotes the $j$-th topic discriminator, which is a softmax classifier used to determine whether the input composition comes from the current source topic or from the target topic, and $n_t$ denotes the number of compositions of the target topic;

thus, the total loss of all $N$ topic-specific discriminators, called the topic-level adversarial loss function $\mathcal{L}_{topic}$, is obtained by:

$\mathcal{L}_{topic} = \sum_{j=1}^{N}\mathcal{L}_{d}^{j}$

there are $N$ source topics in total; $\mathcal{L}_{d}^{j}$ calculates the topic-level adversarial loss of only the $j$-th source topic, while $\mathcal{L}_{topic}$ is the resulting sum of the $N$ losses.
8. The cross-topic composition automatic evaluation method based on paired double-layer adversarial alignment according to claim 5, wherein the loss function of all target topic compositions specifically comprises:

$\mathcal{L}_{t} = -\frac{1}{n_t}\sum_{j=1}^{N}\sum_{i=1}^{n_t} w_i \log p_{\hat{y}_i}\!\left(C_j\!\left(f_i^{t}\right)\right)$

where $\hat{y}_i$ denotes the category pseudo-label; the corresponding highest probability within the soft label used in calculating the pseudo-label is set as the confidence $w_i$ of the pseudo-label $\hat{y}_i$; $p_{\hat{y}_i}\!\left(C_j\!\left(f_i^{t}\right)\right)$ denotes the prediction result obtained by the $j$-th classifier for the target topic composition; the value of $\hat{y}_i$ ranges from 0 to C-1; and $n_t$ is the number of target topic compositions.
9. The cross-topic composition automatic evaluation method based on paired double-layer adversarial alignment according to claim 5, wherein the class-level adversarial loss functions of the N groups of class-level discriminators specifically comprise:

the cross-entropy loss of the $k$-th class discriminator:

$\mathcal{L}_{d,k}^{j} = -\frac{1}{\left|S_j^{k}\right|}\sum_{x\in S_j^{k}}\log D_j^{k}\!\left(F_j(x)\right)-\frac{1}{\left|T^{k}\right|}\sum_{x\in T^{k}}\log\!\left(1-D_j^{k}\!\left(F_j(x)\right)\right)$

where $S_j^{k}$ and $T^{k}$ respectively denote the $k$-th category of the composition corpus of the $j$-th source topic and of the target topic; $D_j^{k}\!\left(F_j(x^{s_j})\right)$ denotes the result of passing a source topic composition through the $k$-th class discriminator, and $D_j^{k}\!\left(F_j(x^{t})\right)$ denotes the result of passing a target topic composition through the $k$-th class discriminator;

the overall loss of the class-level discriminators of the $N$ source and target topic pairs is referred to as the class-level adversarial loss:

$\mathcal{L}_{class} = \sum_{j=1}^{N}\sum_{k=1}^{K}\mathcal{L}_{d,k}^{j}$

where $\mathcal{L}_{d,k}^{j}$ calculates the loss of one category of one source topic, there being $N$ source topics and $K$ categories;

the classifier consistency constraint loss function specifically comprises: calculating the absolute value of the difference between the prediction probabilities generated by each pair of topic-specific classifiers:

$\mathcal{L}_{con} = \frac{1}{|T|}\sum_{i=1}^{|T|}\sum_{j=1}^{N}\sum_{j'=j+1}^{N}\left|p_j\!\left(x_i^{t}\right)-p_{j'}\!\left(x_i^{t}\right)\right|$

where $N$ represents the number of source topics, $|T|$ represents the number of target topic compositions, and $\left|p_j\!\left(x_i^{t}\right)-p_{j'}\!\left(x_i^{t}\right)\right|$ denotes the absolute value of the difference between the probabilities predicted by the $j$-th and $j'$-th classifiers.
10. A cross-topic composition automatic evaluation system based on paired double-layer adversarial alignment, characterized by comprising:

an acquisition module configured to: acquire text data of a composition to be evaluated;

an evaluation module configured to: input the text data of the composition to be evaluated into a trained cross-topic composition automatic evaluation model, and output an evaluation result; the trained cross-topic composition automatic evaluation model being obtained by training on compositions of different topics with known evaluation results;

wherein, during the training stage of the cross-topic composition automatic evaluation model, the composition representations of each pair of source topic composition and target topic composition are extracted, the two representations are mapped into a feature space, and an alignment operation is performed on the topic-level feature distributions of the two representations in the feature space; meanwhile, an alignment operation is performed on the category-level feature distributions of the two representations in the feature space; and, under a consistency constraint, the differences between the outputs of all classifiers are minimized.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202410114378.8A CN117648921B (en) | 2024-01-29 | 2024-01-29 | Cross-theme composition automatic evaluation method and system based on paired double-layer countermeasure alignment |
Publications (2)
Publication Number | Publication Date |
---|---|
CN117648921A true CN117648921A (en) | 2024-03-05 |
CN117648921B CN117648921B (en) | 2024-05-03 |
Family
ID=90045376
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112784920A (en) * | 2021-02-03 | 2021-05-11 | 湖南科技大学 | Cloud-side-end-coordinated dual-anti-domain self-adaptive fault diagnosis method for rotating part |
CN112818159A (en) * | 2021-02-24 | 2021-05-18 | 上海交通大学 | Image description text generation method based on generation countermeasure network |
CN113836306A (en) * | 2021-09-30 | 2021-12-24 | 首都师范大学 | Composition automatic evaluation method and equipment based on discourse component identification and storage medium |
CN113901208A (en) * | 2021-09-15 | 2022-01-07 | 昆明理工大学 | Method for analyzing emotion tendentiousness of intermediate-crossing language comments blended with theme characteristics |
WO2023280065A1 (en) * | 2021-07-09 | 2023-01-12 | 南京邮电大学 | Image reconstruction method and apparatus for cross-modal communication system |
CN116187339A (en) * | 2023-02-13 | 2023-05-30 | 首都师范大学 | Automatic composition scoring method based on feature semantic fusion of double-tower model |
CN116263785A (en) * | 2022-11-16 | 2023-06-16 | 中移(苏州)软件技术有限公司 | Training method, classification method and device of cross-domain text classification model |
CN116756690A (en) * | 2023-06-24 | 2023-09-15 | 复旦大学 | Cross-language multi-mode information fusion method and device |
Non-Patent Citations (4)
Title |
---|
ZHANG Chunyun et al.: "Temporal-Relational hypergraph tri-Attention networks for stock trend prediction", Pattern Recognition, 29 July 2023 (2023-07-29) * |
ZHANG Chunyun et al.: "Stock trend prediction method based on temporal hypergraph convolutional neural networks", Journal of Computer Applications, 31 March 2022 (2022-03-31) * |
LUO Xuan: "Design and implementation of an SVM-based composition scoring system for secondary vocational school students", Information Technology, no. 06, 16 June 2020 (2020-06-16) * |
ZHAO Yudi: "Research on automatic scoring algorithm models for English compositions in cross-prompt scenarios", Wanfang Data, 12 September 2023 (2023-09-12) * |
Also Published As
Publication number | Publication date |
---|---|
CN117648921B (en) | 2024-05-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106469560B (en) | Voice emotion recognition method based on unsupervised domain adaptation | |
CN109635280A (en) | A kind of event extraction method based on mark | |
Siano et al. | Transfer learning and textual analysis of accounting disclosures: Applying big data methods to small (er) datasets | |
CN113673254A (en) | Knowledge distillation position detection method based on similarity maintenance | |
CN116383399A (en) | Event public opinion risk prediction method and system | |
CN114757183B (en) | Cross-domain emotion classification method based on comparison alignment network | |
CN116737581A (en) | Test text generation method and device, storage medium and electronic equipment | |
Han et al. | LUNA: language understanding with number augmentations on transformers via number plugins and pre-training | |
Thompson et al. | Deep learning in employee selection: Evaluation of algorithms to automate the scoring of open-ended assessments | |
Brito et al. | Subjective machines: Probabilistic risk assessment based on deep learning of soft information | |
Zhan | [Retracted] A Convolutional Network‐Based Intelligent Evaluation Algorithm for the Quality of Spoken English Pronunciation | |
Vilas et al. | Analyzing Vision Transformers for image classification in class embedding space | |
Lin et al. | Robust educational dialogue act classifiers with low-resource and imbalanced datasets | |
CN117034941A (en) | Method for identifying named entity of Internet enterprise equipment | |
Yurtkan et al. | Student Success Prediction Using Feedforward Neural Networks | |
CN114692615B (en) | Small sample intention recognition method for small languages | |
CN117648921B (en) | Cross-theme composition automatic evaluation method and system based on paired double-layer countermeasure alignment | |
Ruan et al. | Deep Learning Based on Hierarchical Self‐Attention for Finance Distress Prediction Incorporating Text | |
KR20230093797A (en) | Learning method for paraphrase generation model based on classification model, augmentation method for text data using paraphrase generation model, and text processing apparatus using the same | |
Sangani et al. | Comparing deep sentiment models using quantified local explanations | |
Yang | Deep automated text scoring model based on memory network | |
Meng et al. | Nonlinear network speech recognition structure in a deep learning algorithm | |
Li | Data-Driven Prediction of Students' Online Learning Needs and Optimization of Knowledge Library Management. | |
CN117708336B (en) | Multi-strategy emotion analysis method based on theme enhancement and knowledge distillation | |
Fang et al. | Improving Speaker Verification with Noise-Aware Label Ensembling and Sample Selection: Learning and Correcting Noisy Speaker Labels |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||