CN117634431A

CN117634431A - Method and system for evaluating text style conversion quality

Info

Publication number: CN117634431A
Application number: CN202410108976.4A
Authority: CN
Inventors: 展恩昊; 郭冬升; 段强; 姜凯
Original assignee: Shandong Inspur Science Research Institute Co Ltd
Current assignee: Shandong Inspur Science Research Institute Co Ltd
Priority date: 2024-01-26
Filing date: 2024-01-26
Publication date: 2024-03-01

Abstract

The invention relates to the technical field of artificial intelligence, in particular to a method and a system for evaluating text style conversion quality, which comprises the following steps: the text style classifier is used for classifying and evaluating, and textCNN is introduced as a text classifier, so that the textCNN effectively captures local characteristics of the text in a convolutional neural network structure; semantic similarity assessment, introducing BERTSCore similarity indexes; statement fluency assessment, namely calculating statement confusion degree by using a GPT-2 language model; the beneficial effects are as follows: the method and the system for evaluating the text style conversion quality provided by the invention utilize the machine learning text classifier, the BERTSCore similarity index and the statement confusion index to provide comprehensive and objective quality evaluation for the text style conversion task.

Description

Method and system for evaluating text style conversion quality

Technical Field

The invention relates to the technical field of artificial intelligence, in particular to a method and a system for evaluating text style conversion quality.

Background

In the information age today, natural Language Processing (NLP) technology is evolving rapidly, offering more text processing and application possibilities to people.

Text style conversion is one of the important tasks in the NLP field in the prior art, aiming at converting text from one style to another, such as formal documents to informal spoken language. The technology has wide application prospect in the fields of advertisement, social media content generation, text creation and the like.

However, quality assessment of text style conversion has been a challenging problem. Traditionally, assessing text quality has focused mainly on grammar correctness and smoothness, however, as text generation techniques evolve, quality assessment requires more attention to semantic preservation and accuracy of style conversion. The existing assessment method usually only covers part of aspects, and lacks comprehensiveness and objectivity.

Disclosure of Invention

The invention aims to provide a method and a system for evaluating text style conversion quality, which are used for solving the problems in the background technology.

In order to achieve the above purpose, the present invention provides the following technical solutions: a method of assessing text style conversion quality, the method comprising the steps of:

the text style classifier is used for classifying and evaluating, and textCNN is introduced as a text classifier, so that the textCNN effectively captures local characteristics of the text in a convolutional neural network structure;

semantic similarity assessment, introducing BERTSCore similarity indexes;

statement fluency assessment, statement confusion is calculated using a GPT-2 language model.

Preferably, the specific operations of the text style classifier classification evaluation include:

training by using a preprocessed text data set, manually marking the style category of the converted text, using the converted text and the original text as training data, and learning the characteristic representation of different layers by the textCNN through the combination of a plurality of convolution layers and pooling layers so as to realize the training of the style classifier;

in the classification stage, the trained TextCNN model is used for distributing the converted text to different style categories, and through forward propagation of the converted text, the model distributes a confidence score for each category, selects the category with the highest confidence as the style category of the converted text, and realizes the evaluation of the accuracy of the text category.

Preferably, the specific operation of semantic similarity evaluation includes:

pre-training model selection, namely using a pre-trained BERT model as a basis for calculating similarity, and respectively encoding an original text and a converted text to obtain hidden representations of the original text and the converted text;

and calculating similarity, namely calculating BERTSCore between the original text and the converted text by using hidden representation, quantitatively evaluating whether the text conversion process keeps semantic consistency by adopting cosine similarity as similarity measurement, and indicating that the converted text is more similar to the original text in terms of semantics by higher BERTSCore score.

Preferably, the specific operation of statement fluency assessment includes:

the GPT-2 model is applied, the generated conversion text is further processed by adopting a GPT-2 language model, and the GPT-2 model generates a text with fluent language expression by inputting the converted text;

confusion computation, using the confusion of the generated text to evaluate its fluency and understandability, sentence confusion is measured by computing the uncertainty of the model's prediction of the next word in a given context, with lower confusion values indicating that the generated text is more fluent in language expression and easy to understand.

A system for evaluating text style conversion quality comprises a textCNN text classifier module, a BERTSCore similarity index module and a GPT-2 statement confusion module;

the textCNN text classifier module is used for collecting enough original texts and converted texts thereof, manually labeling style types of the original texts and dividing the data into a training set and a verification set; constructing a textCNN model, wherein the textCNN model comprises a convolution layer and a pooling layer, the convolution layer captures text features of different scales by using a plurality of convolution kernels of different sizes, and the pooling layer is used for reducing and retaining key information;

the BERTSCore similarity index module comprises BERT model selection, text coding and similarity calculation;

GPT-2 statement confusion module, including GPT-2 model application and statement confusion computation.

Preferably, the TextCNN text classifier module minimizes the loss function by performing multiple rounds of training on the training set, optimizes model parameters, monitors the performance of the model using the validation set, and stops training when performance stagnates to avoid overfitting; in the classification stage, the trained model is applied to the converted text, the confidence coefficient of each category is calculated through forward propagation, and the category with the highest confidence coefficient is selected as the style category of the converted text, so that the accuracy evaluation of the text category is realized.

Preferably, the BERT model selection: selecting a pre-trained BERT model, such as a BERT-Chinese model, as a basis for calculating the similarity;

text encoding: performing BERT coding on the original text and the converted text respectively to obtain hidden representation of the original text and the converted text, wherein the BERT coding captures context information and semantic features of the text;

similarity calculation: based on BERT coding, cosine similarity between the original text and the converted text is calculated as BERTSCore similarity score, and whether semantic consistency is maintained in the conversion process is quantitatively evaluated.

Preferably, statement confusion calculation: by inputting the generated text, calculating statement confusion of the generated text by using a GPT-2 model, wherein the lower the confusion value is, the smoother and easy to understand the generated text in language expression is represented;

and comprehensively applying a textCNN classifier, a BERTSCore similarity index and a GPT-2 confusion index.

Compared with the prior art, the invention has the beneficial effects that:

the method and the system for evaluating the text style conversion quality provided by the invention utilize the machine learning text classifier, the BERTSCore similarity index and the statement confusion index to provide comprehensive and objective quality evaluation for the text style conversion task.

Drawings

FIG. 1 is a flow chart of the method of the present invention.

Detailed Description

In order to make the objects, technical solutions, and advantages of the present invention more apparent, the embodiments of the present invention will be further described in detail with reference to the accompanying drawings. It should be understood that the specific embodiments described herein are some, but not all, embodiments of the present invention, are intended to be illustrative only and not limiting of the embodiments of the present invention, and that all other embodiments obtained by persons of ordinary skill in the art without making any inventive effort are within the scope of the present invention.

Embodiment one:

referring to fig. 1, the present invention provides a technical solution: a method of assessing text style conversion quality, the method comprising the steps of:

1. text style classifier classification evaluation

In the text style conversion quality evaluation method of the present invention, first, we introduce TextCNN (Convolutional Neural Network) as a text classifier. TextCNN can effectively capture local features of text in convolutional neural network structures. The training and use of the module comprises the following steps:

training phase: we use a pre-processed text data set for training. For the converted text, we manually annotate its style category and use it as training data along with the original text. The textCNN can learn different levels of characteristic representation through the combination of a plurality of convolution layers and pooling layers, so that training of the style classifier is realized.

Classification stage: the trained TextCNN model is used to assign the converted text into different style categories. By forward propagating the converted text, the model will assign a confidence score to each category. Finally, we select the category with the highest confidence as the style category of the converted text, thereby enabling an assessment of text category accuracy.

2. Semantic similarity assessment

To measure the semantic similarity between the converted text and the original text, we introduced the BERTScore similarity index. The following is a detailed description of the module:

pre-training model selection: we use a pre-trained BERT model (e.g., BERT-Chinese) as the basis for computing similarity. By encoding the original text and the converted text separately, we obtain a hidden representation of both.

Similarity calculation: using the hidden representation, we calculate BERTSCore between the original text and the converted text. By using cosine similarity as a similarity measure, we can quantitatively evaluate whether the text conversion process maintains semantic consistency. A higher BERTScore score indicates that the converted text is semantically more similar to the original text.

3. Statement fluency assessment

To evaluate the fluency and understandability of the generated converted text, we use the GPT-2 language model to calculate statement confusion. The following is a detailed description of the module:

GPT-2 model application: we employ a GPT-2 language model to further process the generated converted text. By entering the converted text, the GPT-2 model generates a text with a fluent linguistic representation.

And (5) calculating confusion degree: we use the confusion of the generated text to evaluate its fluency and understandability. Statement confusion is measured by calculating the uncertainty of the model's prediction of the next word in a given context. A lower confusion value indicates that the generated text is more fluent in language expression and easy to understand.

The invention can provide a comprehensive and objective text style conversion quality assessment method and provides an innovative solution for the field of text style conversion by comprehensively analyzing the textCNN classifier, the BERTSCore similarity index and the GPT-2 confusion index.

Embodiment two:

on the basis of the first embodiment, a system for evaluating text style conversion quality is provided, wherein the system consists of a text classifier module, a BERTSCore similarity index module and a GPT-2 statement confusion degree module;

1. textCNN text classifier module

Training data set preparation: enough original text and converted text thereof are collected, and style categories thereof are manually marked. These data are divided into training and validation sets.

TextCNN architecture: the TextCNN model is built, including a convolution layer and a pooling layer. The convolution layer uses a plurality of different sized convolution kernels to capture text features of different dimensions. The pooling layer is used for reducing and retaining key information.

Training process: the model parameters are optimized by performing multiple rounds of training on the training set using a gradient descent algorithm to minimize the loss function. The performance of the model is monitored using a validation set and training is stopped when performance stagnates to avoid overfitting.

Classification stage: in the classification stage, a trained model is applied to the converted text, and confidence of each category is calculated through forward propagation. And selecting the category with the highest confidence as the style category of the converted text, and realizing the evaluation of the accuracy of the text category.

2. BERTSCore similarity index module

BERT model selection: a pre-trained BERT model, such as BERT-Chinese, is selected as the basis for computing the similarity.

Text encoding: and performing BERT coding on the original text and the converted text respectively to obtain hidden representations of the original text and the converted text. BERT encoding is capable of capturing contextual information and semantic features of text.

3. GPT-2 statement confusion degree module

GPT-2 model application: the converted text is generated using a pre-trained GPT-2 language model, resulting in text with fluent language expression.

Statement confusion calculation: by inputting the generated text, the statement confusion of the generated text is calculated using the GPT-2 model. The lower the confusion value, the more fluent and easy to understand in language the generated text is represented.

The method and the system can comprehensively and objectively evaluate the quality of text style conversion by comprehensively applying the textCNN classifier, the BERTSCore similarity index and the GPT-2 confusion index.

Although embodiments of the present invention have been shown and described, it will be understood by those skilled in the art that various changes, modifications, substitutions and alterations can be made therein without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims

1. A method of evaluating text style conversion quality, characterized by: the method comprises the following steps:

semantic similarity assessment, introducing BERTSCore similarity indexes;

2. A method of assessing a quality of text style conversion in accordance with claim 1 wherein: specific operations of the text style classifier classification evaluation include:

3. A method of assessing a quality of text style conversion in accordance with claim 1 wherein: specific operations of semantic similarity evaluation include:

4. A method of assessing a quality of text style conversion in accordance with claim 1 wherein: the concrete operation of statement fluency assessment comprises the following steps:

5. A system for assessing the quality of text style conversion, applied to a method for assessing the quality of text style conversion as claimed in any one of claims 1 to 4, characterized in that: the system comprises a text classifier module, a BERTSCore similarity index module and a GPT-2 statement confusion module;

6. The system for assessing the quality of text style conversion of claim 5 wherein: the textCNN text classifier module is used for minimizing a loss function by performing multi-round training on a training set, optimizing model parameters, monitoring the performance of a model by using a verification set, and stopping training when the performance is stagnant so as to avoid overfitting; in the classification stage, the trained model is applied to the converted text, the confidence coefficient of each category is calculated through forward propagation, and the category with the highest confidence coefficient is selected as the style category of the converted text, so that the accuracy evaluation of the text category is realized.

7. The system for assessing the quality of text style conversion of claim 5 wherein: BERT model selection: selecting a pre-trained BERT model, such as a BERT-Chinese model, as a basis for calculating the similarity;

8. The system for assessing the quality of text style conversion of claim 5 wherein: statement confusion calculation: by inputting the generated text, calculating statement confusion of the generated text by using a GPT-2 model, wherein the lower the confusion value is, the smoother and easy to understand the generated text in language expression is represented;