CN117056451A - New energy automobile complaint text aspect-viewpoint pair extraction method based on context enhancement - Google Patents
- Publication number: CN117056451A
- Application number: CN202311024888.8A
- Authority: CN (China)
- Prior art keywords: text, representation, context, complaint, viewpoint
- Legal status: Pending (the legal status is an assumption and is not a legal conclusion; Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed)
Classifications
- G06F16/313: Information retrieval of unstructured textual data; selection or weighting of terms for indexing
- G06F40/279: Natural language analysis; recognition of textual entities
- G06F40/284: Lexical analysis, e.g. tokenisation or collocates
- G06F40/30: Semantic analysis
- G06N3/0442: Recurrent networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
- G06N3/048: Activation functions
- G06N7/01: Probabilistic graphical models, e.g. probabilistic networks
- G06Q30/016: Customer relationship services; after-sales
Abstract
The invention discloses a context-enhancement-based method for extracting aspect-viewpoint pairs from new energy automobile complaint texts, belonging to the field of natural language processing and comprising the following steps. S1: acquire new energy automobile complaint data and perform part-of-speech tagging and text screening on it. S2: obtain text feature vectors with a conditional-random-field-based entity extraction method, and use the CRF for sequence labeling to obtain an entity set. S3: fuse text features and topic features to obtain an enhanced text context representation. S4: based on the tri-affine mechanism, use the context representation as auxiliary information to strengthen the model's ability to perceive the relation between aspect words and viewpoint words, and thereby screen out the matching aspect-viewpoint pairs. By fusing topic features with text features, the method acquires an enhanced context representation and uses it as auxiliary information to improve the model's perception of the relation between aspect words and viewpoint words, providing a reliable data reference for new energy automobile manufacturers to improve their products.
Description
Technical Field
The invention relates to the field of natural language processing, in particular to a new energy automobile complaint text aspect-viewpoint pair extraction method based on context enhancement.
Background
With the rapid development of new energy products, more and more people choose new energy automobiles as a means of travel. While these new products bring convenience, they also raise many new problems, resulting in a large volume of customer complaints. Customer complaint information often exposes defects in the product design process, so mining customers' comments on specific aspects of products from complaint texts can provide a reliable data reference for business adjustment and decision-making and help enterprises design products that meet public demand.
At present, research on aspect-viewpoint pair extraction from new energy automobile complaint texts is lacking. Existing work on aspect-viewpoint extraction mainly relies on word-distance information or syntactic dependencies between target words for relation detection. However, because new energy automobile complaint texts feature high entity density and long sentence patterns, such approaches consider only word-level information and ignore the influence of the textual context on word matching, easily leading to mismatches between aspect words and viewpoint words. Since the topic of a complaint text is a highly condensed summary of its content and is unlikely to be disturbed by redundant information, the invention fuses topic features with text features to obtain an enhanced text context representation, and uses that context representation as auxiliary information for relation detection to strengthen the model's ability to perceive the relation between aspect words and viewpoint words and thereby improve extraction accuracy.
Disclosure of Invention
The invention aims to provide a context-enhancement-based method for extracting aspect-viewpoint pairs from new energy automobile complaint texts, characterized by fusing topic features and text features into a context representation and using it as auxiliary information to strengthen the model's ability to identify relations between entities. In the entity extraction stage, an entity extraction model captures context information to obtain text features, which are input to a CRF to obtain an entity set. In the relation detection stage, an enhanced context representation is obtained by fusing topic features with text features, and a tri-affine mechanism, with the context representation as an aid, screens out the matching aspect-viewpoint pairs.
The invention adopts the following technical scheme for realizing the purpose:
a new energy automobile complaint text aspect-viewpoint pair extraction method based on context enhancement comprises the following steps:
(1) Acquiring new energy automobile complaint data and preprocessing a text;
acquiring new energy automobile complaint data and performing part-of-speech tagging and text screening on it.
(2) Acquiring an aspect word set and a viewpoint word set in a complaint text based on an entity extraction method of a conditional random field;
The text word vectors are taken as the input of a bidirectional long short-term memory network, the feature-vector weights are updated through a multi-head attention mechanism, and the CRF performs sequence labeling to obtain an entity set.
(3) Fusing text features and topic features to obtain an enhanced text context representation;
The topic of the complaint text is taken as the input of the BERT pre-training model to obtain a topic feature representation, and the topic features are fused with the text features to obtain the enhanced text context representation.
(4) Based on the tri-affine mechanism, using the context representation as auxiliary information to strengthen the model's ability to perceive the relation between aspect words and viewpoint words;
The context representation, aspect word representation, and viewpoint word representation are taken as inputs to the tri-affine mechanism, word-pair matching scores are computed, and the matching aspect-viewpoint pairs are screened out.
The specific operations of acquiring the new energy automobile complaint data and preprocessing the text in step (1) are as follows:
(1.1) Acquire complaint data from the Car Quality Network platform through crawler technology.
(1.2) Remove stop words. Meaningless phrases that serve only a structural function and provide no useful information are removed, for example, "the complainant bought … in a certain year and month", "the complainant spent …", "let (him) continue to use …", and so on.
(1.3) Screen the data. Complaint texts in the crawled data with fewer than 30 or more than 200 words are removed to keep the dataset balanced.
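The length screening in step (1.3) can be sketched as follows; the 30/200-word bounds come from the text above, while the whitespace tokenisation, function name, and English examples are illustrative assumptions (the actual corpus is Chinese and would need a Chinese word segmenter).

```python
# A minimal sketch of the length-based screening in step (1.3).
def screen_complaints(texts, min_words=30, max_words=200):
    """Keep only complaint texts whose word count lies in [min_words, max_words]."""
    return [t for t in texts if min_words <= len(t.split()) <= max_words]

too_short = "battery drains fast"        # 3 words, removed
kept = " ".join(["word"] * 50)           # 50 words, kept
print(len(screen_complaints([too_short, kept])))
```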
(1.4) Tag the parts of speech of the new energy automobile complaint texts with the NLTK part-of-speech tagging tool, and annotate the data with the BIO tagging scheme, thereby constructing a new energy automobile complaint text dataset containing 4680 items.
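The BIO annotation used in step (1.4) labels the first token of each entity span B-, subsequent tokens I-, and everything else O. The sketch below illustrates this; the tag names (ASP for aspect terms, OPI for viewpoint terms) and the English example sentence are assumptions for illustration only.

```python
# Toy BIO labelling over token spans, as used in step (1.4).
def bio_tags(tokens, spans):
    """spans: list of (start, end_exclusive, entity_type) over token indices."""
    tags = ["O"] * len(tokens)
    for start, end, etype in spans:
        tags[start] = "B-" + etype          # begin of entity
        for i in range(start + 1, end):
            tags[i] = "I-" + etype          # inside of entity
    return tags

tokens = ["the", "brake", "pedal", "feels", "spongy"]
spans = [(1, 3, "ASP"), (4, 5, "OPI")]      # aspect "brake pedal", viewpoint "spongy"
print(bio_tags(tokens, spans))
```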
In step (2), the specific steps of obtaining the aspect word set and viewpoint word set in the complaint text through entity relation extraction are as follows:
and (2.1) acquiring word vectors of each word of the text by using the BERT pre-training model, and inputting the acquired vector representations into a bidirectional long-short-time memory network to obtain characteristic representations of the text.
(2.2) Through a multi-head attention mechanism, multiple groups of attention scores are considered to increase the weights of aspect words and viewpoint words, yielding a richer text feature representation while reducing interference from redundant information in the new energy automobile complaint text.
The multi-head attention mechanism is computed as follows:

Attention(Q, K, V) = softmax(QK^T / sqrt(d_k)) V
head_j = Attention(Q W_j^Q, K W_j^K, V W_j^V)
MHA(Q, K, V) = Concat(head_1, ..., head_h) W

where Q is the query vector, K the key vector, and V the value vector matrix; d_k is the word-vector dimension; Attention(Q, K, V) denotes a single self-attention score; head_j denotes the result of the j-th attention head; W is a weight matrix; and MHA(Q, K, V) denotes the final attention score.
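The single-head score inside the multi-head mechanism described above can be sketched without dependencies; matrices are plain nested lists and the tiny sizes are illustrative. A full multi-head version would add the learned per-head projections and the final concatenation, which are omitted here.

```python
import math

# Single-head scaled dot-product attention: softmax(QK^T / sqrt(d_k)) V.
def attention(Q, K, V):
    d_k = len(K[0])
    out = []
    for q in Q:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in K]
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]      # numerically stable softmax
        z = sum(exps)
        weights = [e / z for e in exps]
        out.append([sum(w * v[i] for w, v in zip(weights, V))
                    for i in range(len(V[0]))])
    return out

# With identical keys every position gets equal weight, so the output
# row is the average of the value vectors.
print(attention([[1.0, 0.0]], [[1.0, 1.0], [1.0, 1.0]], [[2.0, 0.0], [0.0, 2.0]]))
```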
The feature-vector representation of the complaint text is then input to the CRF for sequence labeling; aspect words and viewpoint words are marked, yielding an aspect word set and a viewpoint word set.
A globally optimal solution is sought by computing the joint probability distribution of the whole sequence, which effectively handles the nested, compound structure of most terms in the new energy automobile field. Let the current observation sequence be x = {x_1, x_2, x_3, ..., x_n} and the tag sequence be y = {y_1, y_2, y_3, ..., y_n}; the computation is:

s(x, y) = sum_t W_1 · f_state(y_t, x) + sum_t W_2 · f_trans(y_{t-1}, y_t)
P(y|x) = exp(s(x, y)) / sum_{y'} exp(s(x, y'))

where W_1 and W_2 are trainable weight vectors, and F(y, x) denotes the combination of the transition feature function and the state feature function.
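The CRF's global normalisation over whole tag sequences can be illustrated with a brute-force sketch: a sequence score sums state (emission) and transition terms and is normalised over every possible tag sequence. All scores below are toy values, and a real implementation would use the forward/Viterbi algorithms instead of enumeration.

```python
import itertools
import math

def crf_log_prob(emissions, transitions, tags):
    """log P(y|x) for a linear-chain CRF (brute force, fine for tiny examples).
    emissions[t][y]: state score of tag y at position t;
    transitions[a][b]: score of moving from tag a to tag b."""
    def score(seq):
        s = sum(emissions[t][y] for t, y in enumerate(seq))
        s += sum(transitions[a][b] for a, b in zip(seq, seq[1:]))
        return s
    n_tags = len(transitions)
    log_z = math.log(sum(math.exp(score(seq)) for seq in
                         itertools.product(range(n_tags), repeat=len(emissions))))
    return score(tags) - log_z

emissions = [[2.0, 0.0], [0.0, 1.5]]      # 2 positions, 2 tags
transitions = [[0.5, 0.0], [0.0, 0.5]]
best = max(itertools.product(range(2), repeat=2),
           key=lambda y: crf_log_prob(emissions, transitions, y))
print(best)
```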
In step (3), the specific steps of fusing text features with topic features to obtain the enhanced text context representation are as follows:
(3.1) Obtain a sentence representation of the topic using the BERT pre-training model as the encoder. Special tokens [CLS] and [SEP] are added to the head and tail of the topic sentence as input, and the hidden-state representation of each position is obtained after the encoding layer:

T = BERT([CLS]; s; [SEP]) = [h_CLS, h_1, ..., h_n]

where the CLS vector is the hidden-state representation of the first position in the input sequence and represents the sentence feature of the complaint topic, denoted h_CLS.
(3.2) To strengthen the connection between the entity recognition and relation detection modules, a pooling operation is adopted to select the parts that matter most for the extraction task; the new feature representation is then passed through a max pooling operation. The computation is as follows:
(3.2.1) Average-pool the text feature representation h_t to obtain a vector with the same dimension as the topic feature vector, then splice the new text feature vector with the topic feature vector to obtain a new feature vector, so that each feature vector is weighed uniformly and the semantic information of the entities is better expressed:

V_A = avgpool(h_t)
V = [V_A; h_CLS]

where V_A denotes the averaged information of the text features and V denotes the context representation after splicing the topic features with the text features.
(3.2.2) Max-pool the newly spliced feature vector to obtain the final context feature representation. Max pooling reduces the model's sensitivity to noise and enhances the representational power of the context information; in addition, it reduces the training parameters and speeds up training for the whole model.

h_t = maxpool(V)

where h_t denotes the enhanced context feature representation.
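One plausible reading of the pooling-and-splicing steps is sketched below: token features are average-pooled into V_A, spliced with the topic vector h_CLS, and the splice is max-pooled back to the original dimension. The element-wise interpretation of the final max pooling is an assumption, since the text does not pin down the pooling window.

```python
# Sketch of steps (3.2.1)-(3.2.2): average pooling, splicing, max pooling.
def enhanced_context(token_feats, h_cls):
    dim = len(h_cls)
    v_a = [sum(tok[i] for tok in token_feats) / len(token_feats)
           for i in range(dim)]                 # V_A = avgpool(h_t)
    spliced = [v_a, h_cls]                      # V = [V_A ; h_CLS]
    return [max(col) for col in zip(*spliced)]  # h_t = maxpool(V), per dimension

h_t = enhanced_context([[1.0, 3.0], [3.0, 1.0]], [0.0, 4.0])
print(h_t)  # average [2.0, 2.0] maxed element-wise with topic [0.0, 4.0]
```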
In step (4), the context representation serves as auxiliary information for the tri-affine mechanism, strengthening the model's ability to perceive the relation between aspect words and viewpoint words. The process is as follows:
The tri-affine mechanism projects the query vector, key vector, and value vector into different low-dimensional spaces and computes the association among the three by introducing an additional linear transformation.
Aspect terms and viewpoint terms are paired pairwise to obtain a set of aspect-viewpoint pairs, and each target word pair (S_a, S_o) is assigned a relation label, thereby transforming the extraction into a binary classification task.
Aspect vector representations a_t = {a_i, a_{i+1}, ..., a_{i+q}} and viewpoint vector representations o_t = {o_i, o_{i+1}, ..., o_{i+p}} are obtained through position encoding; finally, the aspect representations, viewpoint representations, and context feature representation are taken as the TriAffine inputs. The specific process is:

Phi_{a,o,t} = sigma(sum_{i,j,k} W_{ijk} · h_a[i] · h_o[j] · h_c[k])
r_{a,o} = 1 if Phi_{a,o,t} > 0.5, else 0

where W denotes a learnable parameter, h_a and h_o denote the aspect and viewpoint representations from the entity recognition module, h_c denotes the context feature vector, Phi_{a,o,t} denotes the probability that the word pair matches, and r_{a,o} denotes the classification result.
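The tri-affine scoring can be sketched as a trilinear form over the aspect, viewpoint, and context vectors followed by a sigmoid, which matches the text's description of the matching probability and the Sigmoid-based output. The random weight tensor below stands in for the learned parameters and is an assumption.

```python
import math
import random

def triaffine_prob(h_a, h_o, h_c, W):
    """Trilinear interaction sum_{i,j,k} W[i][j][k]*h_a[i]*h_o[j]*h_c[k],
    squashed by a sigmoid into a word-pair matching probability."""
    s = sum(W[i][j][k] * h_a[i] * h_o[j] * h_c[k]
            for i in range(len(h_a))
            for j in range(len(h_o))
            for k in range(len(h_c)))
    return 1.0 / (1.0 + math.exp(-s))           # sigmoid

random.seed(0)
d = 3
W = [[[random.uniform(-1, 1) for _ in range(d)] for _ in range(d)] for _ in range(d)]
phi = triaffine_prob([0.2, 0.1, 0.5], [0.4, 0.3, 0.1], [0.9, 0.2, 0.6], W)
r = int(phi > 0.5)                              # relation label for the pair
print(0.0 < phi < 1.0)
```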
The context-enhanced new energy automobile complaint text aspect-viewpoint pair extraction method provided by the invention has the following advantages:
(1) The invention acquires text vectors through a long short-term memory network, obtaining a more accurate and comprehensive text feature representation, and then reduces noise interference caused by redundant information in the complaint text through a multi-head attention mechanism, providing technical support for subsequent relation detection.
(2) The method fuses topic features with text features, so that the context semantics obtained for the complaint text are more accurate; meanwhile, a tri-affine mechanism uses the context representation as an aid in computing the association between entity words, and the target aspect-viewpoint pairs are obtained through a Sigmoid function.
Drawings
FIG. 1 is a flow chart of the context-enhancement-based new energy automobile complaint text aspect-viewpoint pair extraction method;
FIG. 2 is a schematic diagram of a process for obtaining entity word sets by an entity extraction method based on a conditional random field;
FIG. 3 is a process diagram of fusing text features with topic features to obtain the enhanced context representation;
FIG. 4 is a process diagram of the relation detection method based on the tri-affine mechanism.
Detailed Description
The invention is further illustrated by the following examples.
Embodiment one: the invention provides a context-enhancement-based new energy automobile complaint text aspect-viewpoint pair extraction method, as shown in figure 1. The specific steps are as follows:
S1, acquiring the new energy automobile complaint texts and preprocessing the text;
S1.1, first acquire the new energy automobile complaint texts. 5000 complaint records of different styles related to new energy automobiles are crawled from the Car Quality Network platform through crawler technology, and at the same time the topic sentence corresponding to each complaint text is obtained and stored as an Excel file.
S1.2, remove stop words. Meaningless phrases that serve only a structural function and provide no useful information are removed, for example, "the complainant bought … in a certain year and month", "the complainant spent …", "let (him) continue to use …", and so on.
S1.3, screen the data. Complaint texts in the crawled data with fewer than 30 or more than 200 words are removed to keep the dataset balanced.
S1.4, use the NLTK part-of-speech tagging tool to tag the parts of speech of the new energy automobile complaint texts and obtain the vectorized representation V = [v_1, v_2, ..., v_n]; then annotate the data with the BIO tagging scheme, thereby constructing a new energy automobile complaint text dataset containing 4680 items.
S2, acquiring an aspect word set and a viewpoint word set in the complaint text by using an entity extraction method based on a conditional random field. The following description is made in connection with fig. 2:
S2.1, a bidirectional long short-term memory network (BiLSTM) obtains the forward hidden states and backward hidden states of the text and splices them into a complete sequence to capture long-range dependencies in the sequence.
S2.1.1, the complaint text data is P = {C_1, C_2, ..., C_n}, n >= 2; each sentence contains m words, C_p denotes the p-th sentence, and w_{p,j} denotes the j-th word of the p-th sentence.
S2.1.2, the BERT pre-training model is used as the encoder to encode the sentences and obtain text word vectors with semantic representations, expressed as W_S = {w_1, w_2, w_3, ..., w_n}, where w_k denotes the vector representation of the k-th word.
S2.1.3, capturing bidirectional information through the bidirectional long short-term memory network allows context information about candidate aspect terms to be passed to the corresponding aspect terms and, at the same time, context information about candidate viewpoint terms to be passed to the corresponding viewpoint terms, improving the interaction between target entities.
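The forward/backward splicing in S2.1 can be illustrated with a toy recurrence: one pass scans left-to-right, the other right-to-left, and the two hidden states are spliced per position. The scalar `step` function is a stand-in for the LSTM cell (an assumption); a real BiLSTM uses learned gates and vector-valued states.

```python
# Toy sketch of a bidirectional recurrent pass with spliced states.
def bidirectional_states(tokens, step, h0=0.0):
    fwd, h = [], h0
    for t in tokens:                 # forward scan
        h = step(h, t)
        fwd.append(h)
    bwd, h = [], h0
    for t in reversed(tokens):       # backward scan
        h = step(h, t)
        bwd.append(h)
    bwd.reverse()                    # re-align to token order
    return list(zip(fwd, bwd))       # spliced (forward, backward) per token

states = bidirectional_states([1.0, 2.0, 3.0], lambda h, x: 0.5 * h + x)
print(states[0])  # forward state has seen only token 1; backward state saw 3 and 2
```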
S2.2, through a multi-head attention mechanism, multiple groups of attention scores are considered to increase the weights of aspect words and viewpoint words, yielding a richer text feature representation while reducing interference from redundant information in the new energy automobile complaint texts.
S2.3, a globally optimal solution is sought by computing the joint probability distribution of the whole sequence, which effectively handles the nested, compound structure of most terms in the new energy automobile field. Let the current observation sequence be x = {x_1, x_2, x_3, ..., x_n} and the tag sequence be y = {y_1, y_2, y_3, ..., y_n}; the computation is:

s(x, y) = sum_t W_1 · f_state(y_t, x) + sum_t W_2 · f_trans(y_{t-1}, y_t)
P(y|x) = exp(s(x, y)) / sum_{y'} exp(s(x, y'))

where W_1 and W_2 are trainable weight vectors, and F(y, x) denotes the combination of the transition feature function and the state feature function.
S3, fusing the text features and the theme features to obtain the enhanced text context representation. The description is given in connection with fig. 3:
S3.1, the BERT pre-training model is adopted as the encoder to obtain the sentence representation of the topic. Special tokens [CLS] and [SEP] are added to the head and tail of the topic sentence as input, and the hidden-state representation of each position is obtained after the encoding layer, T = [h_t^1, h_t^2, h_t^3, ..., h_t^n]:

T = BERT([CLS]; s; [SEP])

where the CLS vector is the hidden-state representation of the first position in the input sequence and represents the sentence feature of the complaint topic, denoted h_CLS.
S3.2, after the topic features are acquired, the text features and topic features are fused to obtain the enhanced context representation.
S3.2.1, to strengthen the connection between the entity recognition and relation detection modules, the invention adopts a pooling operation to select the parts that matter most for the extraction task. The text feature representation is average-pooled and spliced with the topic feature vector:

V_A = avgpool(h_t)
V = [V_A; h_CLS]

where V_A denotes the averaged information of the text features and V denotes the context representation after splicing the topic features with the text features.
S3.2.2, the newly spliced feature vector is max-pooled to obtain the final context feature vector. Max pooling reduces the model's sensitivity to noise and enhances the representational power of the context information; in addition, it reduces the training parameters and speeds up training for the whole model.

h_t = maxpool(V)

where h_t denotes the enhanced context feature representation.
S4, based on a tri-affine mechanism, the context is expressed as auxiliary information, and the capability of the model for sensing the relation between the aspect words and the viewpoint words is enhanced. The following description is made in connection with fig. 4:
S4.1, the tri-affine mechanism projects the query vector, key vector, and value vector into different low-dimensional spaces and computes the association among the three by introducing an additional linear transformation. Position encoding yields the aspect vector representation a_t = {a_i, a_{i+1}, ..., a_{i+q}} and the viewpoint vector representation o_t = {o_i, o_{i+1}, ..., o_{i+p}}.
S4.2, the aspect representation, viewpoint representation, and context feature representation are taken as the TriAffine inputs. The specific process is:

Phi_{a,o,t} = sigma(sum_{i,j,k} W_{ijk} · h_a[i] · h_o[j] · h_c[k])
r_{a,o} = 1 if Phi_{a,o,t} > 0.5, else 0

where W denotes a learnable parameter, h_a and h_o denote the aspect and viewpoint representations from the entity recognition module, h_c denotes the context feature vector, Phi_{a,o,t} denotes the probability that the word pair matches, and r_{a,o} denotes the classification result.
The foregoing embodiments merely illustrate specific implementations of the invention and are not limiting; those skilled in the art will understand that some of the techniques may be replaced by equivalents, and such modifications and substitutions fall within the scope of the invention.
Claims (6)
1. A context-enhancement-based new energy automobile complaint text aspect-viewpoint pair extraction method, characterized by comprising the following steps:
step 1: acquiring new energy automobile complaint data and preprocessing the text; acquiring new energy automobile complaint data and performing part-of-speech tagging and text screening on it;
step 2: obtaining the aspect word set and viewpoint word set in the complaint text with a conditional-random-field-based entity relation extraction method; taking the text word vectors as the input of a bidirectional long short-term memory network, updating the feature-vector weights through a multi-head attention mechanism, and using the CRF for sequence labeling to obtain an entity set;
step 3: fusing text features and topic features to obtain an enhanced text context representation; taking the topic of the complaint text as the input of the BERT pre-training model to obtain a topic feature representation, and fusing the topic features with the text features to obtain the enhanced text context representation;
step 4: based on the tri-affine mechanism, using the context representation as auxiliary information and computing the probability that aspect words and viewpoint words match; taking the context representation, aspect word representation, and viewpoint word representation as inputs to the tri-affine mechanism, computing word-pair matching scores, and screening out the correct aspect-viewpoint pairs.
2. The context-enhanced new energy automobile complaint text aspect-viewpoint pair extraction method according to claim 1, wherein step 1 comprises:
step 1.1, screening the data; removing meaningless phrases that serve only a structural function and provide no useful information, so as to reduce the interference of redundant information with the model's perception of the relation between aspect words and viewpoint words, and removing complaint texts in the crawled data with fewer than 30 or more than 200 words to keep the dataset balanced;
step 1.2, part-of-speech tagging; tagging the parts of speech of the new energy automobile complaint texts with the NLTK part-of-speech tagging tool and annotating the data with the BIO tagging scheme, so that the model can learn more vivid part-of-speech features.
3. The context-enhanced new energy automobile complaint text aspect-viewpoint pair extraction method according to claim 1, wherein step 2 comprises:
obtaining the forward hidden states and backward hidden states of the text through a bidirectional long short-term memory network (BiLSTM) and splicing them into a complete sequence to capture long-range dependencies in the sequence;
step 2.1, the complaint text data being P = {C_1, C_2, ..., C_n}, n >= 2, where each sentence contains m words, C_p denotes the p-th sentence, and w_{p,j} denotes the j-th word of the p-th sentence;
step 2.2, adopting the BERT pre-training model as the encoder to encode the sentences and obtain text word vectors with semantic representations, expressed as W_S = {w_1, w_2, w_3, ..., w_n}, where w_k denotes the vector representation of the k-th word;
step 2.3, capturing bidirectional information through the bidirectional long short-term memory network so that context information about candidate aspect terms is passed to the corresponding aspect terms and, at the same time, context information about candidate viewpoint terms is passed to the corresponding viewpoint terms, improving the interaction between target entities.
4. The context-enhanced new energy automobile complaint text aspect-viewpoint pair extraction method according to claim 1, wherein step 2 further comprises:
step 2.4, using a multi-head attention mechanism that considers multiple groups of attention scores to increase the weights of aspect words and viewpoint words, obtaining a richer text feature representation while reducing interference from redundant information in the new energy automobile complaint texts; the attention mechanism is expressed as:

Attention(Q, K, V) = softmax(QK^T / sqrt(d_k)) V
head_j = Attention(Q W_j^Q, K W_j^K, V W_j^V)
MHA(Q, K, V) = Concat(head_1, ..., head_h) W

where Q is the query vector, K the key vector, and V the value vector matrix; d_k is the word-vector dimension; Attention(Q, K, V) denotes a single self-attention score; head_j denotes the result of the j-th attention head; W is a weight matrix; and MHA(Q, K, V) denotes the final attention score;
step 2.5, inputting the obtained feature-vector representation of the complaint text into the CRF for sequence labeling, marking aspect words and viewpoint words, and obtaining the aspect word set and viewpoint word set; letting the current observation sequence be x = {x_1, x_2, x_3, ..., x_n} and the tag sequence be y = {y_1, y_2, y_3, ..., y_n}, the computation is:

s(x, y) = sum_t W_1 · f_state(y_t, x) + sum_t W_2 · f_trans(y_{t-1}, y_t)
P(y|x) = exp(s(x, y)) / sum_{y'} exp(s(x, y'))

where W_1 and W_2 are trainable weight vectors, and F(y, x) denotes the combination of the transition feature function and the state feature function.
5. A method of fusing text features with topic features to obtain an enhanced text context representation as in claim 1, wherein step 3 comprises:
step 3.1, using a BERT pre-trained model as the encoder to obtain the sentence representation of the topic: the special tokens [CLS] and [SEP] are added to the head and the tail of the topic sentence, respectively, as input, and the hidden-state representation of each position is obtained after the encoding layer; the acquisition process is as follows:
wherein the [CLS] vector is the hidden-state representation of the first position in the input sequence, characterizes the sentence feature of the complaint topic, and is denoted h_CLS;
step 3.2, in order to promote the relevance between the entity recognition module and the relation detection module, the invention adopts a pooling operation to select the parts that play an important role in the extraction task; the new feature representation is then processed by a max pooling operation, and the process is as follows:
step 3.2.1, denoting the text feature representation as h_t, performing average pooling to obtain a vector of the same dimension as the topic feature representation, and splicing the new text feature vector with the topic feature vector to obtain a new feature representation, the purpose being to consider each feature vector uniformly and better express the semantic information of the entity; the calculation process is as follows:

V_A = avgpooling(h_t)
V = [V_A ; h_CLS]

wherein V_A denotes the average information of the text features, and V denotes the context representation after the topic features are spliced with the text features;
step 3.2.2, performing max pooling on the spliced new feature vector to obtain the final context feature representation; the purpose of max pooling is to reduce the sensitivity of the model to noise and enhance the representation capability of the context information; in addition, for the entire model, max pooling reduces the training parameters and improves the training speed; wherein h_t denotes the enhanced context feature representation:

h_t = maxpooling(V) (5).
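Steps 3.2.1-3.2.2 can be sketched as follows; the claim does not spell out how the spliced vector is max-pooled, so this toy stacks the averaged text feature and the topic feature and takes an elementwise maximum (one plausible reading, with made-up dimensions):

```python
import numpy as np

rng = np.random.default_rng(2)
n, d = 7, 8
h_text = rng.standard_normal((n, d))    # token-level text features
h_cls = rng.standard_normal(d)          # topic sentence feature ([CLS])

V_A = h_text.mean(axis=0)               # average pooling: same dim as topic
V = np.stack([V_A, h_cls])              # splice text and topic features
h_t = V.max(axis=0)                     # max pooling: enhanced context rep

print(V_A.shape, V.shape, h_t.shape)
assert (h_t >= V_A).all() and (h_t >= h_cls).all()  # elementwise maximum
```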
6. The method according to claim 1, wherein the probability of a match between an aspect word and a viewpoint word is calculated based on a triaffine mechanism using the context representation as side information, and step 4 comprises:
step 4.1, adopting a triaffine mechanism to project the query vector, the key vector and the value vector into different low-dimensional spaces, and calculating the association relation among the three by introducing an additional linear transformation; the aspect vector representation a_t = {a_i, a_{i+1}, ..., a_{i+q}} and the viewpoint vector representation o_t = {o_i, o_{i+1}, ..., o_{i+p}} are obtained by position coding;
step 4.2, taking the aspect representation, the viewpoint representation and the context feature representation as the input of the triaffine mechanism; the specific process is as follows:
wherein W denotes a learnable parameter, the aspect representation and the viewpoint representation come from the entity recognition module, h_c denotes the context feature vector, Φ_{a,o,t} denotes the probability score of a word pair match, and r_{a,o} denotes the classification result.
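One common form of a triaffine score contracts a third-order weight tensor with the three input vectors; the sketch below illustrates that idea for step 4.2 (the dimensions, the single-vector inputs, and the sigmoid readout are illustrative assumptions, not the patent's exact formulation):

```python
import numpy as np

def triaffine(h_a, h_o, h_c, W):
    """Triaffine score: contract a third-order weight tensor W with the
    aspect, viewpoint and context vectors (bias terms omitted)."""
    return np.einsum('i,j,k,ijk->', h_a, h_o, h_c, W)

rng = np.random.default_rng(3)
d = 4
h_a, h_o, h_c = (rng.standard_normal(d) for _ in range(3))
W = rng.standard_normal((d, d, d)) * 0.1

score = triaffine(h_a, h_o, h_c, W)
prob = 1.0 / (1.0 + np.exp(-score))     # word-pair match probability
print(round(float(prob), 3))
```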
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311024888.8A CN117056451A (en) | 2023-08-12 | 2023-08-12 | New energy automobile complaint text aspect-viewpoint pair extraction method based on context enhancement |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117056451A true CN117056451A (en) | 2023-11-14 |
Family
ID=88660340
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311024888.8A Pending CN117056451A (en) | 2023-08-12 | 2023-08-12 | New energy automobile complaint text aspect-viewpoint pair extraction method based on context enhancement |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117056451A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117520786A (en) * | 2024-01-03 | 2024-02-06 | 卓世科技(海南)有限公司 | Large language model construction method based on NLP and cyclic neural network |
CN117520786B (en) * | 2024-01-03 | 2024-04-02 | 卓世科技(海南)有限公司 | Large language model construction method based on NLP and cyclic neural network |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112115238B (en) | Question-answering method and system based on BERT and knowledge base | |
CN112069811B (en) | Electronic text event extraction method with multi-task interaction enhancement | |
CN110633409A (en) | Rule and deep learning fused automobile news event extraction method | |
CN110866542B (en) | Depth representation learning method based on feature controllable fusion | |
WO2018028077A1 (en) | Deep learning based method and device for chinese semantics analysis | |
CN108932342A (en) | A kind of method of semantic matches, the learning method of model and server | |
CN110298043B (en) | Vehicle named entity identification method and system | |
CN112163089B (en) | High-technology text classification method and system integrating named entity recognition | |
CN114548099B (en) | Method for extracting and detecting aspect words and aspect categories jointly based on multitasking framework | |
CN113743119B (en) | Chinese named entity recognition module, method and device and electronic equipment | |
CN114648031A (en) | Text aspect level emotion recognition method based on bidirectional LSTM and multi-head attention mechanism | |
CN111651973A (en) | Text matching method based on syntax perception | |
CN111400478A (en) | Knowledge graph question-answering method and device based on word embedding matching technology | |
CN110297986A (en) | A kind of Sentiment orientation analysis method of hot microblog topic | |
CN114818717A (en) | Chinese named entity recognition method and system fusing vocabulary and syntax information | |
CN112800184A (en) | Short text comment emotion analysis method based on Target-Aspect-Opinion joint extraction | |
CN113297375B (en) | Document classification method, system, device and storage medium based on label | |
Tarride et al. | A comparative study of information extraction strategies using an attention-based neural network | |
CN111309849B (en) | Fine-grained value information extraction method based on joint learning model | |
CN112862569A (en) | Product appearance style evaluation method and system based on image and text multi-modal data | |
CN116958677A (en) | Internet short video classification method based on multi-mode big data | |
CN116758558A (en) | Cross-modal generation countermeasure network-based image-text emotion classification method and system | |
CN116562291A (en) | Chinese nested named entity recognition method based on boundary detection | |
CN115906824A (en) | Text fine-grained emotion analysis method, system, medium and computing equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||