CN114153955B

CN114153955B - Construction method of multi-skill task type dialogue system fusing chatting and common knowledge

Info

Publication number: CN114153955B
Application number: CN202111334457.2A
Authority: CN
Inventors: 陈楷; 熊京萍; 廖奇; 王辉
Original assignee: Kexun Jialian Information Technology Co ltd
Current assignee: Kexun Jialian Information Technology Co ltd
Priority date: 2021-11-11
Filing date: 2021-11-11
Publication date: 2023-04-07
Anticipated expiration: 2041-11-11
Also published as: CN114153955A

Abstract

The invention relates to a dialogue system, in particular to a multi-skill task type dialogue system construction method fusing chatting and common knowledge, which comprises the steps of collecting large-scale open-source Chinese chatting linguistic data, taking adjacent chats as chatting linguistic data pairs, constructing a chatting model, collecting common sense question-answer linguistic data and triple data in various large-scale fields, constructing a knowledge graph, scoring a model based on question-answer linguistic data training similarity, processing an original problem to obtain a candidate entity common sense set, obtaining all out-degree and in-degree relations in related two-hop relations from the knowledge graph, obtaining a final candidate entity, screening out a relation path by using the final candidate entity, preferably selecting the entity relation path to obtain a final path, inquiring a common sense answer from the knowledge graph based on the final path, and completing construction of a common sense dialogue model; the technical scheme provided by the invention can effectively overcome the defects that the prior art cannot have the common sense reasoning capability and the chatting function and has lower conversation reply recall accuracy.

Description

Construction method of multi-skill task type dialogue system fusing chatting and common knowledge

Technical Field

The invention relates to a dialogue system, in particular to a construction method of a multi-skill task type dialogue system fusing chatting and common knowledge.

Background

The dialog system may be classified into a chatting type dialog system, a question-and-answer type dialog system, and a task type dialog system according to task types.

The chatting dialogue system mainly performs emotion interactive communication with a user to help the user relieve worries and stuffiness, generally utilizes a large amount of chatting linguistic data to train data, for example, dialoDG, T5 and the like adopt a generative model, so that the model has the capability of generating corresponding output according to input, but the generated result is random, and the whole process is difficult to evaluate and control. The chatty type dialogue system does not have the common sense reasoning ability and can not provide vertical tasks in a specific field.

The question-answering type dialogue system is also called an FAQ question-answering system, and is mainly used for providing vertical consulting services, such as government affair consultation and legal consultation, for the user. The current mainstream question-answer dialogue systems are all based on a retrieval scheme, namely an ES question-answer pair library is constructed in advance according to questions and answers, when a user asks questions, the questions in the library are retrieved, topN candidate questions most similar to the user questions are returned, then the most similar standard questions are selected by using a grading rearrangement mechanism, and the answers corresponding to the standard questions are returned as answers to the questions. At present, a common FAQ question-answering system fully utilizes text and semantic feature vectors by constructing a large number of similar questions or high-quality knowledge maps and adopts a recall and scoring strategy to ensure controllability and precision, but the question-answering type dialogue system does not have the common knowledge reasoning capability and cannot support the chatting function.

The task type dialogue system, also called an intelligent customer service system, has wide application prospects in various fields of logistics, finance, insurance, manufacturing, e-commerce and the like, can replace part of manual customer service to provide 24h service for users, greatly reduces the pressure of the manual customer service, and reduces the enterprise cost. At present, mainstream task type conversation systems in the market are realized in a pipeline mode, the whole system is divided into a plurality of modules, for example, a conversation understanding NLU module mainly identifies user intentions and extracts key slot positions in conversations, a conversation tracking DST module mainly records conversation states and slot positions, a conversation strategy DPL module mainly decides what response should be made to a user in the conversation according to current input and historical records, and a conversation generation NLG module mainly generates contents of final responses to the user. In view of the complexity of the task-based dialog system and the high standard requirement for task completion, the system does not have the common sense reasoning capability and cannot well support the chatting function.

Disclosure of Invention

Technical problem to be solved

Aiming at the defects in the prior art, the invention provides the construction method of the multi-skill task type dialogue system integrating chatting and common knowledge, which can effectively overcome the defects that the prior art cannot combine the function of common sense reasoning and chatting and has lower dialogue reply recall accuracy.

(II) technical scheme

In order to achieve the purpose, the invention is realized by the following technical scheme:

a multi-skill task type dialogue system construction method fusing chatting and common knowledge comprises the following steps:

s1, collecting large-scale open-source Chinese chatting linguistic data, taking adjacent dialogues as chatting linguistic data pairs, and constructing a chatting model;

s2, collecting the common sense question and answer corpus and the triple data of each large-scale field, constructing a knowledge map, and training a similarity scoring model based on the common sense question and answer corpus;

s3, processing the original problem to obtain a candidate entity set, acquiring all out-degree and in-degree relations in the relevant two-hop relation from the knowledge graph, and obtaining a final candidate entity;

s4, screening out entity relation paths by using the final candidate entities, optimizing the entity relation paths to obtain final paths, and inquiring common sense answers from the knowledge graph based on the final paths to complete construction of a common sense dialogue model;

s5, analyzing the task dialogue corpus, abstracting out fixed body data, storing the fixed body data into a database, and constructing a task dialogue model at a session level;

and S6, generating a topic discrimination model by utilizing common sense reasoning and task dialogue corpus training.

Preferably, the step S4 of screening out the entity relationship path by using the final candidate entity, and performing optimization on the entity relationship path to obtain a final path, and querying a common sense answer from the knowledge graph based on the final path includes:

splicing the final candidate entities with all out-degree and in-degree relations in the corresponding two-hop relations to form complete sentences, respectively calculating the similarity between the original problem and each complete sentence by using a similarity scoring model, and screening out entity relation paths;

and scoring each entity relationship path and the original problem, selecting the entity relationship path with the highest score as a final path, and inquiring the common sense answer from the knowledge graph based on the final path.

Preferably, the step of splicing the final candidate entities with all the out-degree and in-degree relations in the corresponding two-hop relation into a complete sentence includes:

all final candidate entities correspond to a spliced complete sentence which comprises the following steps:

wherein the content of the first and second substances,

sentences composed of all relations in representative degree and the ith final candidate entity, I _i Is the final candidate entity of the ith input, R _r1 Being an in-degree relationship, T, within a two-hop relationship _r1 Is an in-degree entity within a two-hop relationship,

represents all relations in the degree and the sentence composed by the ith final candidate entity, R _r2 For out-of-degree relations within a two-hop relation, T _r2 And M is the number of final candidate entities.

Preferably, the calculating the similarity between the original problem and each complete sentence by using the similarity scoring model and screening out the entity relationship path includes:

and respectively passing the original problem and the complete sentence through a similarity scoring model, adding the last layer of N-dimensional feature vectors, splicing the last layer of N-dimensional feature vectors to form a 3 x N vector, outputting the 3 x N vector after passing through two full-connection layers, and finally scoring by using a softmax function to screen out an entity relationship path corresponding to a preset number of complete sentences with the highest score.

Preferably, in S2, collecting the general knowledge question and answer corpus and the triple data in each large-scale field, constructing a knowledge graph, and training a similarity scoring model based on the general knowledge question and answer corpus, including:

the method comprises the steps of expanding diversity of the common sense question-answer corpus by utilizing triple data, cleaning and preprocessing the triple data, storing the triple data into a neo4j database to construct a knowledge map, and training an SimCSE-RoBerta similarity scoring model based on the common sense question-answer corpus.

Preferably, the scoring each entity relationship path and the original question, selecting the entity relationship path with the highest score as the final path, and querying a common sense answer from the knowledge graph based on the final path includes:

calculating the total score source between the ith entity relationship path and the original problem by adopting the following formula _i ：

source _i ＝L _i +D _i +R _i +C _i

Wherein L is _i The intersection length, D, obtained after the duplication of the ith entity relationship path and the original problem is removed on all word granularities _i Is the inverse of the length of the ith entity relationship path, R _i Is the reciprocal of the hop count of the ith entity relationship path, C _i The frequency of the final candidate entity appearing in the ith entity relationship path is obtained;

selecting a Total score source _i And taking the highest entity relationship path as a final path, and assembling cypher query sentences based on the final path to query the common sense answers from the knowledge graph.

Preferably, the task dialog model is built at a session level in S5, including:

the method for performing fine adjustment on the dialog neural network by combining the words, the belief states, the database results, the system actions and the system replies of the users in each dialogue turn into a dialogue sequence comprises the following steps:

in the first dialog turn, the user inputs the utterance as U ₀ According to the speech U ₀ The generated belief state is B ₀ Belief state B ₀ For database retrieval to retrieve satisfaction of belief state B ₀ The number of entities under the constraint and the search result are D ₀ According to { U ₀ ，B ₀ ，D ₀ Generation System action A ₀ And system reply R ₀ ；

At the tth dialog turn, based on the user utterance U _t And all previously generated outputs U ₀ ，B ₀ ，D ₀ ，A ₀ ，R ₀ ，…，U _t-1 ，B _t-1 ，D _t-1 ，A _t-1 ，R _t-1 ，U _t Is multiplied by the sum of the coefficients to generate B _t 、A _t And R _t 。

Preferably, the processing the original problem in S3 to obtain a candidate entity set includes:

and performing word segmentation and part-of-speech extraction on the original problem through a lac tool, reserving words and phrases with the part-of-speech, removing stop words, and respectively obtaining substrings of the words and phrases with the part-of-speech from 2 to the original problem length to obtain a candidate entity set.

Preferably, in S3, obtaining all the out-degree and in-degree relations in the relevant two-hop relation from the knowledge graph, and obtaining a final candidate entity, includes:

obtaining all out-degree and in-degree relations in the related two-hop relation from the knowledge graph, and counting the relation data amount count _qr The similarity s between the original problem and each candidate entity is calculated using the following formula _qe ：

The similarity s between the original problem and each relation in the two-hop relation is calculated by using the following formula _qc ：

Wherein S is _q For the original set of problems, S _e As a set of candidate entities, S _c Is a set of relationships;

calculating the relation data amount count using the following equation _qr The similarity s between the original question and each candidate entity _qe Similarity s of original problem and each relation in two-hop relation _qc Weighted score candidate between _s ：

candidate _s ＝count _qr *W _qr +s _qr *W _qe +s _qc *W _qc

Wherein, W _qr 、W _qe 、W _qc Are all fixed adjustment values;

and reserving the preset number of candidate entities with the highest scores as final candidate entities.

Preferably, the method for constructing the Chinese chatting model includes the steps of collecting Chinese chatting corpus of a large-scale open source in S1, taking adjacent chatting as a chatting corpus pair, and constructing the chatting model, wherein the steps include:

collecting large-scale open-source Chinese chatting linguistic data, performing noise cleaning and quality inspection on the Chinese chatting linguistic data, removing sensitive subject samples, taking the upper part of an adjacent conversation as a question query and the lower part as an Answer, and training a generating chatting model by utilizing a Dialdo DG neural network.

(III) advantageous effects

Compared with the prior art, the construction method of the multi-skill task type dialogue system fusing the chatting and the common knowledge, provided by the invention, has the following beneficial effects:

1) In the construction process of a common sense dialogue model, a unique candidate entity extraction strategy is designed, an additional entity link model is not needed, the cost of manual labeling and model training is reduced, feature vectors are extracted by using an SimCSE-RoBerta similarity scoring model at an entity relation path stage for differential fusion, feature information is enhanced, and recall scoring is carried out by combining with the original problem when a final path is optimized, so that the recall accuracy is effectively improved;

2) In the construction process of the task dialogue model, a pipeline multi-module scheme is abandoned, a unified data input format is constructed, mutual information of dialogue texts and system responses is improved, and model training is carried out in a DialoDG end-to-end mode.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below. It is obvious that the drawings in the following description are only some embodiments of the invention, and that for a person skilled in the art, other drawings can be derived from them without inventive effort.

FIG. 1 is a schematic flow diagram of the present invention;

FIG. 2 is a schematic flow chart of the method for constructing a chat model according to the present invention;

FIG. 3 is a schematic flow chart of the construction of a common sense dialogue model in the present invention;

FIG. 4 is a schematic flow chart of the task dialogue model construction method of the present invention;

fig. 5 is a working diagram of the dialog neural network in the process of constructing the task dialogue model in the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention. It is to be understood that the embodiments described are only a few embodiments of the present invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

A multi-skill task type dialogue system construction method fusing chat and common knowledge is disclosed, as shown in FIG. 2, collecting large-scale open-source Chinese chat linguistic data, taking adjacent dialogues as chat linguistic data pairs, and constructing a chat model, and specifically comprises the following steps:

collecting large-scale open-source Chinese chatting linguistic data, performing noise cleaning and quality inspection on the Chinese chatting linguistic data, removing sensitive subject samples, taking the upper part of an adjacent conversation as a question query and the lower part as an Answer, and training a generating chatting model by using a DiaoloDG neural network.

As shown in fig. 3, collecting the common sense question and answer corpus and the triple data in various large-scale fields, constructing a knowledge graph, and training a similarity scoring model based on the common sense question and answer corpus, specifically including:

expanding the diversity of the common sense question and answer corpus by using 1.4 hundred million Chinese entity triple data sourced from ownthink, cleaning and preprocessing the triple data, storing the triple data into a neo4j database to construct a knowledge map, and training a SimCSE-RoBerta similarity scoring model based on the common sense question and answer corpus.

As shown in fig. 3, the original problem is processed to obtain a candidate entity set, all the relationships between out-degree and in-degree in the relevant two-hop relationship are obtained from the knowledge graph, and a final candidate entity is obtained.

(1) Processing the original problem to obtain a candidate entity set, including:

(2) Obtaining all the relationships of out-degree and in-degree in the related two-hop relationship from the knowledge graph, and obtaining a final candidate entity, wherein the steps comprise:

The similarity s between the original problem and each relation in the two-hop relation is calculated by adopting the following formula _qc ：

Wherein S is _q For the original set of problems, S _e As a set of candidate entities S _c Is a set of relationships;

candidate _s ＝count _qr *W _qr +s _qr *W _qe +s _qc *W _qc

Wherein, W _qr 、W _qe 、W _qc Are all fixed adjustment values;

As shown in fig. 3, the entity relationship path is screened out by using the final candidate entity, the entity relationship path is optimized to obtain a final path, and a common sense answer is queried from the knowledge graph based on the final path to complete the construction of the common sense dialogue model, which specifically includes:

and scoring each entity relationship path and the original question, selecting the entity relationship path with the highest score as a final path, and inquiring the common sense answer from the knowledge graph based on the final path.

(1) And splicing the final candidate entities with all the out-degree and in-degree relations in the corresponding two-hop relations respectively to form a complete sentence, wherein the steps comprise:

all final candidate entities correspond to the complete sentence spliced by:

/>

wherein, the first and the second end of the pipe are connected with each other,

represents all relationships in the degree of entry and a sentence composed of the ith final candidate entity, I _i Is the final candidate entity of the ith input, R _r1 Is an in-degree relationship, T, within a two-hop relationship _r1 Is an in-degree entity within a two-hop relationship,

represents all relations in the degree and the sentence composed by the ith final candidate entity, R _r2 In a two-hop relationship, T _r2 And M is the number of final candidate entities.

(2) Respectively calculating the similarity between the original problem and each complete sentence by using a similarity scoring model, and screening an entity relationship path, wherein the similarity scoring model comprises the following steps:

(3) Scoring each entity relationship path and the original question, selecting the entity relationship path with the highest score as a final path, and inquiring a common sense answer from the knowledge graph based on the final path, wherein the method comprises the following steps:

source _i ＝L _i +D _i +R _i +C _i

selecting a Total score source _i And taking the highest entity relationship path as a final path, and assembling cypher query sentences based on the final path to query common sense answers from the knowledge graph.

As shown in fig. 4, the task dialogue corpus is analyzed, the fixed ontology data is abstracted and stored in the database, and the task dialogue model is constructed at the session level.

(1) Analyzing the task dialogue corpus, abstracting out fixed ontology data and storing the fixed ontology data into a database, wherein the method comprises the following steps:

take restaurant information as an example, [ { "pname": anhui province, "city": compost city, "adname": "address" in the mountainous region of hollyhock: innovation Daotai No. 2800 in Sichuan province, "location": [117.26104,31.85117], "name": old and rural chicken innovation industry park shop, "quality":5, "license range": cheap } ], generally if a third party has more accurate external information, the third party can be butted in the form of an HTTP interface, then the diversity of language expressions in linguistic data from different sources is removed, mutual information between a dialogue semantic structure and response is fully utilized, all data are differentiated, specific slot positions in dialogue data are replaced in a uniform alias mode, and the words, the belief states, database results (also can be results of the external interface), system actions and system replies of a user are all surrounded in a specific coding format as input;

the specific data format of the conventional scheme is as follows:

user i want to see a cheap, good and good restaurant near the lake park of Sichuan

[ Slot ] [ Shuxi lake park, cheap, good public praise ]

[ System ] the old and rural chicken innovation industry park store is good, and the popular comment is a 5-star good comment

User is good, help me order today's 4-person compartment

The specific data format in the invention is as follows:

< sos _ u > ] I want to see the restaurant at the privacy quality around address [ < eos _ u > ]

[<sos_a>][address][pricerange][quality][restaurant][<eos_u>]

[ < sos _ r > ] name is good, and popular is quality. [ < eos _ r > ]

[sos_db][tb_restaurant][address][pricerange][quality][eos_db]

Good [ < sos _ u > ] person's compartment [ < eos _ u > ] helping me to schedule time

[<sos_a>][booking][address][pricerange][quality][time][person][<eos_u>]

[ < sos _ r > ] good, has succeeded in booking you [ old and rural chicken innovation industry park store ] [ today ] people's booth [ 4 ]. Ask what can help you? [ < eos _ r > ]

(2) Constructing a task dialog model at the session level, as shown in FIG. 5, includes:

in the first dialog turn, the user inputs the utterance as U ₀ According to the speech U ₀ The generated belief state is B ₀ Belief state B ₀ For database retrieval to retrieve satisfaction of belief state B ₀ The number of entities under the constraint and the search result are D ₀ According to { U ₀ ，B ₀ ，D ₀ Generate system actions A ₀ And system reply R ₀ ；

At the tth dialog turn, based on the user utterance U _t And all previously generated outputs { U } ₀ ，B ₀ ，D ₀ ，A ₀ ，R ₀ ，…，U _t-1 ，B _t-1 ，D _t-1 ，A _t-1 ，R _t-1 ，U _t Is multiplied by the sum of the coefficients to generate B _t 、A _t And R _t 。

And generating a topic discrimination model by using common sense reasoning and task dialogue corpus training. The method comprises the following steps that a user inputs a topic type judgment through a topic judgment model, and if a result output by the topic judgment model is a common sense type, a common sense conversation model is called; if the result output by the topic judging model is the task type, calling a task dialogue model; and if the topic discrimination model judges that the input does not belong to the common sense type or the task type, calling the chatting model.

The above examples are only intended to illustrate the technical solution of the present invention, and not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims

1. A multi-skill task type dialogue system construction method fusing chatting and common knowledge is characterized in that: the method comprises the following steps:

s4, screening out entity relationship paths by using the final candidate entities, optimizing the entity relationship paths to obtain final paths, and inquiring common sense answers from the knowledge graph based on the final paths to complete the construction of a common sense dialogue model;

s6, generating a topic discrimination model by utilizing common sense reasoning and task dialogue corpus training;

the method includes the steps of collecting large-scale open-source Chinese chatting corpus in S1, taking adjacent dialogues as chatting corpus pairs, and constructing a chatting model, wherein the steps include:

collecting large-scale open-source Chinese chatting linguistic data, performing noise cleaning and quality inspection on the Chinese chatting linguistic data, removing sensitive subject samples, taking the upper part of an adjacent conversation as a question query and the lower part as an Answer, and training a generating chatting model by utilizing a dialogue neural network;

in S2, collecting the common sense question and answer corpus and the triple data of each large-scale field, constructing a knowledge map, and training a similarity scoring model based on the common sense question and answer corpus, wherein the model comprises the following steps:

the triple data is used for expanding the diversity of the common sense question and answer corpus, cleaning and preprocessing the triple data, storing the triple data into a neo4j database to construct a knowledge graph, and training a SimCSE-RoBerta similarity scoring model based on the common sense question and answer corpus.

2. The method for building a multi-skill task type dialogue system fusing chatting and general knowledge according to claim 1, wherein: in S4, screening out entity relationship paths by using the final candidate entities, optimizing the entity relationship paths to obtain final paths, and inquiring common sense answers from the knowledge graph based on the final paths, wherein the steps comprise:

3. The method for building a multi-skill task type dialogue system fusing chatting and general knowledge according to claim 2, wherein: the step of splicing the final candidate entities with all the out-degree and in-degree relations in the corresponding two-hop relations respectively into a complete sentence comprises the following steps:

all final candidate entities correspond to the complete sentence spliced by:

wherein the content of the first and second substances,

represents all relationships in the degree of entry and a sentence composed of the ith final candidate entity, I _i Is the final candidate entity of the ith input, R _r1 Being an in-degree relationship, T, within a two-hop relationship _r1 Is an in-degree entity within a two-hop relationship,

represents all relations in the degree and the sentence composed of the ith final candidate entity, R _r2 For out-of-degree relations within a two-hop relation, T _r2 The entity number is the out degree entity in the two-hop relationship, M is the number of the final candidate entities, R represents the total number of the relationship, E represents the total number of the entities, i represents the final candidate mark, R1 represents the in degree correlation coefficient, E1 represents the number of the in degree entities, R2 represents the out degree correlation coefficient, and E2 represents the number of the out degree entities.

4. The method for building a multi-skill task type dialogue system fusing chatting and general knowledge according to claim 2, wherein: the method for respectively calculating the similarity between an original problem and each complete sentence by utilizing the similarity scoring model and screening out an entity relationship path comprises the following steps:

5. The method for building a multi-skill task type dialogue system fusing chatting and general knowledge according to claim 2, wherein: the scoring of each entity relationship path and the original problem, selecting the entity relationship path with the highest score as the final path, and inquiring the common sense answer from the knowledge graph based on the final path comprises the following steps:

calculating the total score source between the ith entity relationship path and the original question by adopting the following formula:

source _i ＝L _i +D _i +R _i +C _i

wherein L is _i The intersection length, D, obtained by removing the duplication of the ith entity relationship path and the original problem on all word granularities _i Is the inverse of the length of the ith entity relationship path, R _i Is the reciprocal of the hop count of the ith entity relationship path, C _i The frequency of the final candidate entity appearing in the ith entity relationship path is obtained;

6. The method for building a multi-skill task type dialogue system fusing chatting and general knowledge according to claim 1, wherein: and S5, constructing a task dialog model at a session level, wherein the task dialog model comprises the following steps:

the method comprises the following steps of forming a dialogue sequence by the words, the belief states, the database results, the system actions and the system replies of each dialogue turn user, and finely adjusting the dialogue sequence on a dialogue neural network, wherein the dialogue sequence specifically comprises the following steps:

in the first dialog turn, the user inputs the utterance as U ₀ According to the speech U ₀ The generated belief state is B ₀ Belief state B ₀ For database retrieval to satisfy belief state B ₀ The number of entities under the constraint and the search result are D ₀ According to { U ₀ ，B ₀ ，D ₀ Generation System action A ₀ And system reply R ₀ ；

At the tth dialog turn, based on the user utterance U _t And all previously generated outputs { U } ₀ ，B ₀ ，D ₀ ，A ₀ ，R ₀ ，…，U _t-1 ，B _t-1 ，D _t-1 ，A _t-1 ，R _t-1 ，U _t Is multiplied by the sum of the coefficients to generate B _t 、A _t And R _t ；

Wherein, U _t-1 Representing utterances entered by the user during the t-1 st dialog turn, B _t-1 Representing the belief states generated from the utterances input by the user at the t-1 st dialogue turn, D _t-1 Is represented by the formula A and B _t-1 Corresponding search result, A _t-1 Represents the system action at the t-1 st dialogue turn, R _t-1 The system replies at t-1 dialog turns on the presentation.

7. The method for building a multi-skill task type dialogue system fusing chatting and general knowledge according to claim 1, wherein: processing the original problem in S3 to obtain a candidate entity set, which comprises:

extracting words and parts of speech of the original problem by a lac tool, reserving words and phrases with parts of speech, removing stop words, and respectively obtaining substrings of the words and phrases with parts of speech from 2 to the original problem length so as to obtain a candidate entity set.

8. The method for building a multi-skill task type dialogue system fusing chatting and general knowledge according to claim 7, wherein: s3, acquiring all out-degree and in-degree relations in the related two-hop relations from the knowledge graph, and obtaining a final candidate entity, wherein the steps comprise:

obtaining all out-degree and in-degree relations in the related two-hop relation from the knowledge graph, and counting the relation data amount count _qr The similarity s between the original problem and each candidate entity is calculated using the following formula _qe :

candidate _s ＝count _qr *W _qr +s _qr *W _qe +s _qc *W _qc

Wherein, W _qr 、W _qe 、W _qc Are all fixed adjustment values;