CN115270805A - Semantic information extraction method of service resources - Google Patents
Semantic information extraction method of service resources
- Publication number
- CN115270805A (application CN202110483937.9A)
- Authority
- CN
- China
- Prior art keywords
- model
- service resources
- extracting
- neural network
- deep neural
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis

- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
- G06F40/295—Named entity recognition

- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Machine Translation (AREA)
Abstract
The invention discloses a semantic information extraction method for service resources. Named entity information is extracted based on a BERT model; a deep neural network model is constructed in combination with an attention mechanism, the embedded vectors output by the BERT model are taken as input, and semantic relations are extracted by the deep neural network model. When the deep neural network model is constructed, the attention mechanism assigns different attention to different features. The invention solves the semantic information extraction problem with an end-to-end model that relies on neither hand-crafted features nor external NLP tools; the model trains quickly and, because most of its parameters are pre-trained, avoids repeated learning.
Description
Technical Field
The invention belongs to the technical field of natural language understanding, and particularly relates to a semantic information extraction method for service resources.
Background
With the continuous advance of office informatization and the networking of everyday life, typical online service resources such as patent services, legal services, policy services, scientific services, and research consultation are growing rapidly and accumulating continuously.
This poses new requirements and challenges for accurately and efficiently retrieving the service resources that match a user's real intention. Meanwhile, the current classification management of service resources is too coarse to reflect subtle semantic differences between services, and an effective semantic analysis and description system is lacking.
At present, a large number of emerging methods are being integrated into semantic representation models for text: pre-training models such as BERT and GPT are combined with feature extraction methods such as RNN, LSTM, CNN, and the Transformer to improve semantic representation, making the models themselves increasingly complicated. However, when these models are applied directly to the functional descriptions of service resources, a specialized kind of natural language, fine-grained semantic analysis and strict matching remain problematic: the computed semantic similarity of Chinese vocabulary is not accurate, sentence-level representations express incomplete information, and the real functional semantics of resources cannot be matched.
Disclosure of Invention
The invention aims to provide a semantic information extraction method for service resources that can extract named entity information and semantic relations simultaneously.
The invention is realized by adopting the following technical scheme:
a semantic information extraction method of service resources is provided, which comprises the following steps: named entity information is extracted based on a BERT model; constructing a deep neural network model by combining an attention mechanism, taking an embedded vector output by a BERT model as input, and extracting a semantic relation by adopting the deep neural network model; when the deep neural network model is constructed, different attentions are generated for different features by combining an attentiveness mechanism.
Further, for static service resources, attention weights are obtained through an activation unit: the activation unit performs element-wise subtraction on the input embedded vectors, concatenates the result with the original embedded vectors, feeds this into a fully connected layer, and obtains the attention weight from a single-neuron output layer. For dynamic service resources, semantic relations are extracted in combination with a service-relation migration network: the network negatively samples the user behaviors associated with the embedded vectors, mines the pattern of change by simulating the relation's evolution with a GRU network, and adds an attention score to the GRU update gate to obtain the evolved embedded representation.
Further, the method further comprises: outputting the embedded representations of the extracted semantic relations in the form of triples; a triple comprises a concept, a parent concept, and a child concept.
Further, each triple is converted into a three-line paragraph, one concept name per line.
Further, in the training step for extracting named entity information, the contribution of the NER module to the total loss is weighted during the first stage of training.
Further, the embedded vectors output by the BERT model are fed into the deep neural network model after an average pooling operation.
Compared with the prior art, the invention has the following advantages and positive effects: the semantic information extraction method for service resources provided by the invention extracts named entity information based on a BERT model; a deep neural network model is constructed in combination with an attention mechanism, the embedded vectors output by the BERT model are taken as input, and semantic relations are extracted by the deep neural network model, with the attention mechanism assigning different attention to different features. The invention solves the semantic information extraction problem with an end-to-end model that relies on neither hand-crafted features nor external NLP tools; the model trains quickly and, because most of its parameters are pre-trained, avoids repeated learning.
Other features and advantages of the present invention will become more apparent from the detailed description of the embodiments of the present invention when taken in conjunction with the accompanying drawings.
Drawings
Fig. 1 is a flowchart of a semantic information extraction method for service resources according to the present invention.
Detailed Description
The following describes embodiments of the present invention in further detail with reference to the accompanying drawings.
The semantic information extraction method for service resources, as shown in Fig. 1, includes the following steps:
step S11: named entity information is extracted based on the BERT model.
For a given input sequence S of N Chinese characters split into tokens, a pre-trained BERT-base model generates a sequence of vectors, which is then fed into a feed-forward neural network for classification. The output size of this layer is determined by the NLP task being trained, such as named entity extraction or relation extraction.
The weights of the BERT model are fine-tuned during training, and the whole model is trained end to end; specifically, in the first stage of training, the contribution of the NER module to the total loss is weighted.
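A minimal sketch of this step, using the HuggingFace transformers library with the bert-base-chinese checkpoint; the label count, the example sentence, and the stage-one loss weight of 0.5 are illustrative assumptions, not values from the patent.

```python
import torch
from torch import nn
from transformers import BertModel, BertTokenizerFast

class BertNer(nn.Module):
    """BERT-base encoder with a feed-forward token-classification head."""
    def __init__(self, num_labels: int, ner_loss_weight: float = 0.5):
        super().__init__()
        self.bert = BertModel.from_pretrained("bert-base-chinese")
        # Output size of the head is set by the task (here: NER label count).
        self.classifier = nn.Linear(self.bert.config.hidden_size, num_labels)
        self.ner_loss_weight = ner_loss_weight  # stage-one weighting (assumed value)

    def forward(self, input_ids, attention_mask, labels=None):
        out = self.bert(input_ids=input_ids, attention_mask=attention_mask)
        logits = self.classifier(out.last_hidden_state)  # (batch, seq_len, num_labels)
        if labels is None:
            return logits
        loss = nn.functional.cross_entropy(
            logits.view(-1, logits.size(-1)), labels.view(-1), ignore_index=-100)
        # BERT weights stay trainable, so the whole model fine-tunes end to end;
        # the NER term enters the total loss with a reduced weight in stage one.
        return self.ner_loss_weight * loss, logits

tokenizer = BertTokenizerFast.from_pretrained("bert-base-chinese")
model = BertNer(num_labels=9)  # e.g. BIO tags over four entity types plus "O" (assumed)
enc = tokenizer("面向服务资源的语义信息抽取", return_tensors="pt")
logits = model(enc["input_ids"], enc["attention_mask"])
```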
Step S12: a deep neural network model is constructed in combination with an attention mechanism.
A deep neural network model is constructed in combination with an attention mechanism, adopting different models and methods for different application scenarios: when the relations among static service resources are stable, a deep neural network combined with an attention mechanism learns the embedded representations of the contextual relations; when service-resource relations change dynamically, a temporal sub-network based on GRU units is added to capture the pattern of relation changes.
Step S13: the embedded vectors output by the BERT model are taken as input, and semantic relations are extracted with the deep neural network model.
When constructing the deep neural network model, the attention mechanism assigns different attention to different features.
For static service resources, attention weights are obtained through an activation unit: the activation unit performs element-wise subtraction on the input embedded vectors, concatenates the result with the original embedded vectors, feeds this into a fully connected layer, and obtains the attention weight from a single-neuron output layer. The embedded vectors output by the BERT model are fed into the deep neural network model after an average pooling operation.
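As a concrete illustration, the sketch below implements such an activation unit under the assumption that it scores one embedded vector against another; the hidden width and the final sigmoid are assumptions. The mean_pool helper shows the average pooling applied to the BERT output before it enters the network.

```python
import torch
from torch import nn

class ActivationUnit(nn.Module):
    """Scores a pair of embedded vectors and returns an attention weight."""
    def __init__(self, dim: int, hidden: int = 64):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(3 * dim, hidden),  # takes [a, b, a - b] concatenated
            nn.ReLU(),
            nn.Linear(hidden, 1),        # single-neuron output layer
        )

    def forward(self, a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
        diff = a - b                     # element-wise subtraction
        score = self.fc(torch.cat([a, b, diff], dim=-1))
        return torch.sigmoid(score)      # attention weight in (0, 1)

def mean_pool(hidden_states: torch.Tensor, attention_mask: torch.Tensor) -> torch.Tensor:
    """Average pooling of BERT token embeddings over non-padding positions."""
    mask = attention_mask.unsqueeze(-1).float()
    return (hidden_states * mask).sum(dim=1) / mask.sum(dim=1).clamp(min=1e-9)
```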
For dynamic service resources, semantic relations are extracted in combination with a service-relation migration network: the network negatively samples the user behaviors associated with the embedded vectors, mines the pattern of change by simulating the relation's evolution with a GRU network, and adds an attention score to the GRU update gate to obtain the evolved embedded representation.
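The patent gives no equations for this gate, but one plausible reading is a GRU cell whose update gate is shifted by an external attention score before the sigmoid, sketched below; the gate arithmetic beyond that shift is the standard GRU formulation.

```python
import torch
from torch import nn

class AttentiveGRUCell(nn.Module):
    """GRU cell whose update gate is shifted by an external attention score."""
    def __init__(self, input_size: int, hidden_size: int):
        super().__init__()
        self.W_z = nn.Linear(input_size + hidden_size, hidden_size)
        self.W_r = nn.Linear(input_size + hidden_size, hidden_size)
        self.W_h = nn.Linear(input_size + hidden_size, hidden_size)

    def forward(self, x: torch.Tensor, h: torch.Tensor,
                attn_score: torch.Tensor) -> torch.Tensor:
        xh = torch.cat([x, h], dim=-1)
        # Update gate with the attention score added before the sigmoid (assumed form).
        z = torch.sigmoid(self.W_z(xh) + attn_score)
        r = torch.sigmoid(self.W_r(xh))                   # standard reset gate
        h_tilde = torch.tanh(self.W_h(torch.cat([x, r * h], dim=-1)))
        return (1 - z) * h + z * h_tilde                  # evolved embedded representation
```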
Step S14: the embedded representations of the extracted semantic relations are output in the form of triples.
A triple includes a concept, a parent concept, and a child concept.
For a focal concept, assuming it has parent concepts and child concepts, a triple can be converted into a three-line paragraph, one concept name per line, with each paragraph separated from the next by an empty line.
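A short sketch of this serialization, with an invented example triple for illustration:

```python
def triples_to_text(triples):
    """Serialize (concept, parent, child) triples, one concept name per line,
    with an empty line between paragraphs."""
    paragraphs = ["\n".join(triple) for triple in triples]
    return "\n\n".join(paragraphs)

print(triples_to_text([("patent retrieval service", "patent service", "novelty search")]))
```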
The overall training process for the relationship extraction model of the invention is as follows:
the method comprises the steps of firstly dividing a service resource data set, preprocessing a training set, then passing through a BERT network to obtain vector representation of resources, then training a relationship extraction and classification network, giving weights of different characteristics by using an attention mechanism as a classification basis, storing a model into a model base after verification is carried out by using a test set, and issuing extraction for service resource relationship.
It should be noted that the above description is not intended to limit the invention, which is not restricted to the above examples; variations, modifications, additions, or substitutions that those skilled in the art may make within the essential scope of the invention also fall within its protection scope.
Claims (6)
1. A semantic information extraction method for service resources, characterized by comprising the following steps:
named entity information is extracted based on a BERT model;
constructing a deep neural network model in combination with an attention mechanism, taking the embedded vectors output by the BERT model as input, and extracting semantic relations with the deep neural network model; when the deep neural network model is constructed, the attention mechanism assigns different attention to different features.
2. The method for extracting semantic information of service resources according to claim 1,
for static service resources, attention weights are obtained through an activation unit: the activation unit performs element-wise subtraction on the input embedded vectors, concatenates the result with the original embedded vectors, feeds this into a fully connected layer, and obtains the attention weight from a single-neuron output layer;
for dynamic service resources, semantic relations are extracted in combination with a service-relation migration network: the network negatively samples the user behaviors associated with the embedded vectors, mines the pattern of change by simulating the relation's evolution with a GRU network, and adds an attention score to the GRU update gate to obtain the evolved embedded representation.
3. The method for extracting semantic information of service resources according to claim 1, further comprising:
outputting the embedded representations of the extracted semantic relations in the form of triples; a triple comprises a concept, a parent concept, and a child concept.
4. The method for extracting semantic information of service resources according to claim 3, wherein each triple is converted into a three-line paragraph, one concept name per line.
5. The method for extracting semantic information of service resources according to claim 1, characterized in that, in the training step for extracting named entity information, the contribution of the NER module to the total loss is weighted during the first stage of training.
6. The method for extracting semantic information of service resources according to claim 1, wherein the embedded vectors output by the BERT model are fed into the deep neural network model after an average pooling operation.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title
---|---|---|---
CN202110483937.9A | 2021-04-30 | 2021-04-30 | Semantic information extraction method of service resources
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title
---|---|---|---
CN202110483937.9A | 2021-04-30 | 2021-04-30 | Semantic information extraction method of service resources
Publications (1)
Publication Number | Publication Date
---|---
CN115270805A | 2022-11-01
Family
ID=83744740
Family Applications (1)
Application Number | Title | Priority Date | Filing Date
---|---|---|---
CN202110483937.9A (published as CN115270805A, withdrawn) | Semantic information extraction method of service resources | 2021-04-30 | 2021-04-30
Country Status (1)
Country | Link
---|---
CN | CN115270805A (en)
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title
---|---|---|---|---
CN115994668A | 2023-02-16 | 2023-04-21 | 浙江非线数联科技股份有限公司 | Intelligent community resource management system
Legal Events
Date | Code | Title | Description
---|---|---|---
 | PB01 | Publication |
 | WW01 | Invention patent application withdrawn after publication | Application publication date: 20221101