CN113961687A

CN113961687A - Multi-turn question-answering system intention classification and named entity identification research method

Info

Publication number: CN113961687A
Application number: CN202111238190.7A
Authority: CN
Inventors: 冯卫森; 冯落落; 尹青山; 高明; 王建华
Original assignee: Shandong New Generation Information Industry Technology Research Institute Co Ltd
Current assignee: Shandong New Generation Information Industry Technology Research Institute Co Ltd
Priority date: 2021-10-25
Filing date: 2021-10-25
Publication date: 2022-01-21

Abstract

The invention provides a research method for intention classification and named entity identification of a multi-turn question-answering system. Which comprises the following steps: 1) when the intention classification is carried out, the historical information of the context is used as input through a dialogue state tracking module, and an intention classification prediction model is transmitted; 2) during named entity recognition and prediction, the intention recognition result of the round is used as a feature and is transmitted into a named entity prediction model, and 3) the named entity recognition and intention classification are trained through a multi-task model, and the model is improved according to the requirements of the first two points. According to the invention, NLU and DM are directly changed into bidirectional models, the accuracy of intention classification can be improved by context information, named entity identification and intention classification are fused through a multi-task model Bert model, and deployment can be conveniently carried out.

Description

Multi-turn question-answering system intention classification and named entity identification research method

Technical Field

The invention relates to an intelligent question-answering system, in particular to a research method for intention classification and named entity identification of a multi-turn question-answering system, and belongs to the technical field of artificial intelligence.

Background

Referring to fig. 1, a conventional task-based question-answering system adopts a pipeline structure, a natural speech understanding module (NLU) is a main component of the task-based question-answering system, named entity recognition and intention classification are respectively used for extracting entities in a dialog and classifying dialog intentions in the NLU, and the direction from a natural language understanding module (NLU) to a dialog management module (DM) is unidirectional.

In addition, the existing intelligent dialogue platform has two defects when the NLU is realized. First, existing platforms do not consider the intent of the text as a feature when doing named entity recognition. Second, the impact of the contextual information on intent classification is not considered when doing intent classification.

Disclosure of Invention

The invention aims to provide a multi-turn question-answering system intention classification and named entity identification research method.

In order to achieve the purpose, the invention is realized by the following technical scheme:

a multi-turn question-answering system intention classification and named entity identification research method comprises the following steps:

1) when the intention classification is carried out, the historical information of the context is used as input through a dialogue state tracking module, and an intention classification prediction model is transmitted;

2) when the named entity is identified and predicted, the intention identification result of the round is used as a characteristic and is transmitted into a named entity prediction model;

3) named entity recognition and intent classification are trained through a multi-tasking model, and the model is refined as required by the first two points.

The optimal scheme of the research method for intention classification and named entity recognition of the multi-turn question-answering system is characterized in that a multi-task framework of a Bert model is used, a [ CLS ] mark bit is added at the beginning of each input sentence of the Bert, the mark bit serves as a prediction result of intention classification, a conditional random field CRF is added on an output layer of the Bert to serve as prediction output of named entity recognition, and hidden layer information of the intention classification serves as input to serve as a part of named entity recognition prediction.

The preferable scheme of the research method for intention classification and named entity identification of the multi-turn question-answering system specifically comprises the following steps:

s1, taking DST context data in a DM module as one of inputs of a Bert model;

s2, fusing the named entity identification module and the intention classification module by using a Bert multitask;

and S3, transmitting the intention classification result state as an input to a CRF model for prediction.

The invention has the advantages that:

NLU and DM are directly changed into a bidirectional model, the accuracy of intention classification can be improved by context information, named entity identification and intention classification are fused through a multi-task model Bert model, and deployment can be conveniently carried out.

Drawings

The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention.

Fig. 1 is a flow chart of an intelligent dialog in an embodiment of the present invention.

Fig. 2 is a diagram of an intelligent dialog improvement in an embodiment of the present invention.

FIG. 3 is a graph of multitasking berts in an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

As described in the background art, the conventional task-based question-answering system adopts a pipeline structure, and is unidirectional from a natural language understanding module (NLU) to a dialogue management module (DM), so that the problem of inaccurate intention classification exists; the named entity recognition task (NER) and the Intent classification task (Intent Classifier) are separate and complex to lay out.

In order to solve the problems, the invention adopts the following scheme:

The embodiment of the invention specifically comprises the following steps:

1. directly changing NLU and DM into bidirectional models, so that an intention classification model can acquire context information of Dialog State Tracking (DST) in classification;

2. adopting a multi-classification task of Bert, fusing a named entity recognition task (NER) and an intention classification task (Intent Classifier) through a model, and classifying the CLS by softmax through the Attention layer of Bert after the fused model passes through the Attention layer of Bert to obtain a final intention;

3. and the hidden state CLS of the intention is used as input and transmitted to a CRF layer, so that when the CRF layer identifies the named entity, the CRF layer outputs the identification result of the named entity in consideration of the intention of the corresponding round.

Finally, it should be noted that: although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that changes may be made in the embodiments and/or equivalents thereof without departing from the spirit and scope of the invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

1. A multi-turn question-answering system intention classification and named entity recognition research method is characterized in that: the method comprises the following steps:

2. The multi-turn question-answering system intention classification and named entity identification research method of claim 1, wherein: the method comprises the steps of using a multitask architecture of a Bert model, adding a [ CLS ] identification bit at the beginning of each sentence input by the Bert, using the identification bit as a prediction result of an intention classification, adding a conditional random field CRF on an output layer of the Bert as a prediction output of named entity recognition, and using hidden layer information of the intention classification as input to serve as a part of named entity recognition prediction.

3. The multi-turn question-answering system intention classification and named entity identification research method of claim 2, wherein: the method specifically comprises the following steps:

s1, taking DST context data in a DM module as one of inputs of a Bert model;