CN112199482A - Dialog generation method, device, equipment and readable storage medium - Google Patents

Dialog generation method, device, equipment and readable storage medium

Info

Publication number
CN112199482A
CN112199482A (application CN202011059826.7A)
Authority
CN
China
Prior art keywords
vector
reply
query
question
common sense
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011059826.7A
Other languages
Chinese (zh)
Other versions
CN112199482B (en)
Inventor
李雅峥
杨海钦
姚晓远
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Technology Shenzhen Co Ltd
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Technology Shenzhen Co Ltd filed Critical Ping An Technology Shenzhen Co Ltd
Priority to CN202011059826.7A priority Critical patent/CN112199482B/en
Publication of CN112199482A publication Critical patent/CN112199482A/en
Priority to PCT/CN2021/091292 priority patent/WO2022068197A1/en
Application granted granted Critical
Publication of CN112199482B publication Critical patent/CN112199482B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F 16/30 Information retrieval of unstructured textual data
    • G06F 16/33 Querying
    • G06F 16/332 Query formulation
    • G06F 16/3329 Natural language query formulation or dialogue systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 Computing arrangements based on biological models
    • G06N 3/02 Neural networks
    • G06N 3/04 Architecture, e.g. interconnection topology
    • G06N 3/045 Combinations of networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Mathematical Physics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a dialog generation method, a device, equipment and a readable storage medium, wherein the method comprises the following steps: obtaining question information, and converting the question information into a first query vector by using a preset first gated recurrent unit GRU model; determining a common sense vector associated with the first query vector by using a preset first end-to-end memory network MemN2N model according to the first query vector, and forming a question vector according to the first query vector and the common sense vector; converting the question vector into a plurality of second query vectors by using a preset second gated recurrent unit GRU model, and sequentially inputting the second query vectors into a preset second end-to-end memory network MemN2N model to obtain a plurality of reply vectors; and respectively converting each reply vector into reply words, and combining all the reply words into reply information. The invention can quickly and accurately form the reply information in a remote consultation session, thereby improving the user experience.

Description

Dialog generation method, device, equipment and readable storage medium
Technical Field
The invention relates to the technical field of remote consultation dialogue in digital healthcare, and in particular to a dialog generation method, a device, equipment and a readable storage medium.
Background
With the continuous development of artificial intelligence, man-machine dialogue is applied to more and more scenes. For example, in a customer service scene, the reply information corresponding to the question information input by a user can be formed by identifying that question information, thereby reducing labor cost. However, a conventional open-domain man-machine dialogue system lacks an understanding of the background knowledge and related common sense information behind a user's question; starting from the dialogue data alone, it tends to generate generic answers that lack effective information, which impairs the readability of the reply information. Therefore, how to quickly and accurately form reply information from user question information has become a technical problem to be urgently solved by those skilled in the art.
Disclosure of Invention
The invention aims to provide a dialog generation method, a device, equipment and a readable storage medium, which can quickly and accurately form reply information in a remote consultation dialogue and improve the user experience.
According to an aspect of the present invention, there is provided a dialog generation method, including:
obtaining question information, and converting the question information into a first query vector by using a preset first gated recurrent unit GRU model;
determining a common sense vector associated with the first query vector by utilizing a preset first end-to-end memory network MemN2N model according to the first query vector, and forming a question vector according to the first query vector and the common sense vector;
converting the question vector into a plurality of second query vectors by using a preset second gated recurrent unit GRU model, and sequentially inputting the second query vectors into a preset second end-to-end memory network MemN2N model to obtain a plurality of reply vectors;
and respectively converting each reply vector into reply words, and combining all the reply words into reply information.
Optionally, the obtaining question information and converting the question information into a first query vector by using a preset first gated recurrent unit GRU model includes:
performing word segmentation on the question information, and forming a word sequence from the plurality of keywords obtained after the word segmentation;
for each target keyword in the word sequence, calculating, by using the first gated recurrent unit GRU model, the hidden influence factor that the target keyword passes to the keyword after it in the word sequence, according to the hidden influence factor passed to the target keyword from the keyword before it in the word sequence;
taking the hidden influence factor calculated for the last keyword in the word sequence as the first query vector u_1 corresponding to the question information.
Optionally, the determining, according to the first query vector, a common sense vector associated with the first query vector by using a preset first end-to-end memory network MemN2N model, and forming a question vector according to the first query vector and the common sense vector, includes:
in the 1st cycle of the first end-to-end memory network MemN2N model, respectively calculating the correlation value p_i between the first query vector u_1 and the i-th common sense head vector x_i in the preset common sense head group;
calculating the question sub-vector a_1 of the 1st cycle according to the correlation value p_i of the i-th common sense head vector x_i and the i-th common sense tail vector y_i in the preset common sense tail group;
adding the first query vector u_1 and the question sub-vector a_1 to obtain the first query vector u_2 of the 2nd cycle;
according to the first query vector u_2 of the 2nd cycle, recalculating the question sub-vector a_2 of the 2nd cycle and the first query vector u_3 of the 3rd cycle, and so on, until the question sub-vector a_M of the M-th cycle is calculated;
taking the question sub-vector a_M of the M-th cycle as the question vector.
Optionally, the method further includes:
acquiring a common sense information base; wherein the common sense information base comprises a plurality of pieces of common sense information expressed in the form of knowledge triples, and each piece of common sense information comprises: a head, a relation and a tail;
converting the head in each piece of common sense information into a common sense head vector through a preset first hidden layer matrix, thereby forming a common sense head group;
converting the tail in each piece of common sense information into a common sense tail vector through a preset second hidden layer matrix, thereby forming a common sense tail group;
and establishing the corresponding relation between the common sense head vector and the common sense tail vector according to the relation part in each common sense information.
Optionally, the converting the question vector into a plurality of second query vectors by using a preset second gated recurrent unit GRU model, and sequentially inputting each of the second query vectors into a preset second end-to-end memory network MemN2N model to obtain a plurality of reply vectors, includes:
taking the question vector as the hidden influence factor h_0 of the first layer, and inputting it together with a preset start character vector s_0 into the second gated recurrent unit GRU model to obtain an output vector s_1 and a hidden influence factor h_1 passed to the second layer;
inputting the output vector s_1 into the second end-to-end memory network MemN2N model as a second query vector to obtain a reply vector r_1;
inputting the output vector s_1 and the hidden influence factor h_1 of the second layer into the second gated recurrent unit GRU model again to obtain an output vector s_2 and a hidden influence factor h_2 passed to the third layer, inputting the output vector s_2 into the second end-to-end memory network MemN2N model again to obtain a reply vector r_2, and so on, until the output vector of the second gated recurrent unit GRU model is a preset end character vector.
Optionally, the inputting the output vector s_1 into the second end-to-end memory network MemN2N model as a second query vector to obtain a reply vector r_1 includes:
in the 1st cycle of the second end-to-end memory network MemN2N model, respectively calculating the correlation value p_i between the second query vector s_1 and the i-th reply head vector k_i in the preset reply head group;
calculating the reply sub-vector o_1 of the 1st cycle according to the correlation value p_i of the i-th reply head vector k_i and the i-th reply tail vector l_i in the preset reply tail group;
adding the second query vector s_1 and the reply sub-vector o_1 of the 1st cycle to obtain the second query vector s_2 of the 2nd cycle;
according to the second query vector s_2 of the 2nd cycle, recalculating the reply sub-vector o_2 of the 2nd cycle and the second query vector s_3 of the 3rd cycle, and so on, until the reply sub-vector o_N of the N-th cycle is calculated;
taking the reply sub-vector o_N of the N-th cycle as the reply vector r_1.
Optionally, the method further includes:
acquiring a reply information base; wherein the reply information base comprises a plurality of pieces of reply information expressed in the form of knowledge triples, and each piece of reply information comprises: a head, a relation and a tail;
converting the head in each piece of reply information into a reply head vector through a preset translation embedding TransE algorithm, thereby forming a reply head group;
converting the tail in each piece of reply information into a reply tail vector through a preset translation embedding TransE algorithm, thereby forming a reply tail group;
and establishing the corresponding relation between the reply head vector and the reply tail vector according to the relation part in each reply message.
In order to achieve the above object, the present invention also provides a dialog generating device, including:
the acquisition module is used for obtaining question information and converting the question information into a first query vector by using a preset first gated recurrent unit GRU model;
the question module is used for determining a common sense vector associated with the first query vector by utilizing a preset first end-to-end memory network MemN2N model according to the first query vector and forming a question vector according to the first query vector and the common sense vector;
the reply module is used for converting the question vector into a plurality of second query vectors by using a preset second gated recurrent unit GRU model, and sequentially inputting the second query vectors into a preset second end-to-end memory network MemN2N model to obtain a plurality of reply vectors;
and the conversion module is used for respectively converting each reply vector into reply words and combining all the reply words into reply information.
In order to achieve the above object, the present invention further provides a computer device, which specifically includes: a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the steps of the dialog generation method introduced above when executing the computer program.
In order to achieve the above object, the present invention also provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the dialog generating method described above.
The invention provides a dialog generation method, a device, equipment and a readable storage medium, which combine the end-to-end memory network MemN2N architecture with a GRU network so as to find the related common sense information according to the question information, and determine the reply information by considering the question information and the common sense information together. In the process of encoding the question information into the question vector, the question information is encoded in the form of GRU + MemN2N: for the question information input by the user, a GRU network replaces the embedding matrix EmbeddingB in the MemN2N network, and the final hidden layer state of the GRU network is input into the MemN2N network as the query vector. In the process of decoding the question vector into the reply information, the reply information is likewise generated in the form of GRU + MemN2N. The invention can quickly and accurately form the reply information in a remote consultation session, thereby improving the user experience.
Drawings
Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for purposes of illustrating the preferred embodiments and are not to be construed as limiting the invention. Also, like reference numerals are used to refer to like parts throughout the drawings. In the drawings:
fig. 1 is an alternative flowchart of a dialog generation method according to an embodiment;
fig. 2 is a schematic diagram of an alternative structure of the dialog generating device according to the second embodiment;
fig. 3 is a schematic diagram of an alternative hardware architecture of the computer device according to the third embodiment.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Example one
An embodiment of the present invention provides a dialog generation method, as shown in fig. 1, the method specifically includes the following steps:
step S101: the method comprises the steps of obtaining question information, and converting the question information into a first query vector by utilizing a preset first GRU (Gate recursive Unit) model.
Specifically, step S101 includes:
step A1: performing word segmentation on the question information, and forming a word sequence from the plurality of keywords obtained after the word segmentation; wherein the word sequence comprises N keywords;
step A2: for each target keyword in the word sequence, calculating, by using the first gated recurrent unit GRU model, the hidden influence factor that the target keyword passes to the keyword after it in the word sequence, according to the hidden influence factor passed to the target keyword from the keyword before it in the word sequence;
step A3: taking the hidden influence factor calculated for the last keyword in the word sequence as the first query vector u_1 corresponding to the question information.
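By way of illustration only, the following is a minimal Python/PyTorch sketch of step S101; the vocabulary, the embedding and hidden sizes, and the randomly initialised weights are assumptions for demonstration and are not specified by this embodiment.

```python
# A minimal sketch of step S101, assuming PyTorch and a toy vocabulary.
import torch
import torch.nn as nn

vocab = {"<pad>": 0, "what": 1, "is": 2, "a": 3, "cat": 4}  # hypothetical vocabulary
embed = nn.Embedding(len(vocab), 64)   # keyword -> dense vector
gru = nn.GRU(input_size=64, hidden_size=64, batch_first=True)

def encode_question(keywords):
    """Run the first GRU over the word sequence; the final hidden state
    plays the role of the first query vector u_1."""
    ids = torch.tensor([[vocab[w] for w in keywords]])   # shape (1, seq_len)
    _, h_last = gru(embed(ids))                          # h_last: (1, 1, 64)
    return h_last.squeeze(0)                             # u_1, shape (1, 64)

u1 = encode_question(["what", "is", "a", "cat"])
```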
Step S102: according to the first query vector, determining a common sense vector associated with the first query vector by using a preset first MemN2N (End-to-End Memory Networks) model, and forming a question vector according to the first query vector and the common sense vector.
Specifically, step S102 includes:
step B1: in the 1st cycle (hop) of the first end-to-end memory network MemN2N model, respectively calculating the correlation value p_i between the first query vector u_1 and the i-th common sense head vector x_i in the preset common sense head group;
wherein p_i = Softmax((u_1)^T * x_i), and T denotes transposition.
step B2: calculating the question sub-vector a_1 of the 1st cycle according to the correlation value p_i of the i-th common sense head vector x_i and the i-th common sense tail vector y_i in the preset common sense tail group;
wherein a_1 = Σ_i p_i * y_i.
step B3: adding the first query vector u_1 and the question sub-vector a_1 to obtain the first query vector u_2 of the 2nd cycle;
step B4: repeating steps B1 to B3 until the question sub-vector a_M of the M-th cycle is calculated;
step B5: taking the question sub-vector a_M of the M-th cycle as the question vector.
Further, the method further comprises:
step C1: acquiring a common sense information base; wherein the common sense information base comprises a plurality of pieces of common sense information expressed in the form of knowledge triples, and each piece of common sense information comprises: a head, a relation and a tail;
taking "a cat is an animal" as an example, the knowledge triple is represented as (h: cat, r: belongs-to, t: animal), where h represents the head, t represents the tail, and r represents the head-tail relation.
Step C2: converting the head in each common sense information into a common sense head vector by presetting a first hidden layer matrix EmbeddingA, thereby forming a common sense head group;
step C3: converting the tail in each common sense information into a common sense tail vector by presetting a second hidden layer matrix EmbeddingC, thereby forming a common sense tail group;
step C4: and establishing the corresponding relation between the common sense head vector and the common sense tail vector according to the relation part in each common sense information.
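The construction of the two memory groups in steps C1 to C4 can be sketched as below; the triples, entity ids and dimensions are illustrative, and the two embedding tables stand in for the hidden layer matrices EmbeddingA and EmbeddingC.

```python
# A sketch of steps C1-C4: turning (head, relation, tail) triples into the
# common sense head/tail groups (entities and triples are illustrative).
import torch
import torch.nn as nn

triples = [("cat", "belongs-to", "animal"), ("dog", "belongs-to", "animal")]
entities = {"cat": 0, "dog": 1, "animal": 2}

embedding_a = nn.Embedding(len(entities), 64)   # first hidden layer matrix (heads)
embedding_c = nn.Embedding(len(entities), 64)   # second hidden layer matrix (tails)

head_ids = torch.tensor([entities[h] for h, _, _ in triples])
tail_ids = torch.tensor([entities[t] for _, _, t in triples])
X = embedding_a(head_ids)   # common sense head group, x_i
Y = embedding_c(tail_ids)   # common sense tail group, y_i
# Row i of X corresponds to row i of Y, which realises the head-tail
# correspondence of step C4.
```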
In the encoding (Encoder) process, that is, the process of encoding the question information into the question vector, the present embodiment encodes the question information in the form of GRU + MemN2N: for the question information input by the user, a GRU network replaces the embedding matrix EmbeddingB in the MemN2N network, and the final hidden layer state of the GRU network is input into the MemN2N network as the query vector. The whole MemN2N network is a stack of several hops, and in each hop the correlation between the query vector and each piece of common sense information in the Memory is calculated. Because the Encoder is implemented as GRU + MemN2N, common sense information highly relevant to the question information as a whole can be added continuously on the premise that the GRU has extracted the complete question information, which avoids the information deviation caused by searching for a single entity word. In addition, the common sense information in the Memory is aggregated as a weighted sum, which avoids selecting a single knowledge triple as the supplementary information, so that the obtained common sense information is more comprehensive.
Step S103: Converting the question vector into a plurality of second query vectors by using a preset second gated recurrent unit GRU model, and sequentially inputting the second query vectors into a preset second end-to-end memory network MemN2N model to obtain a plurality of reply vectors.
Specifically, step S103 includes:
step D1: taking the question vector as the hidden influence factor h_0 of the first layer, and inputting it together with a preset start character vector s_0 into the second gated recurrent unit GRU model to obtain an output vector s_1 and a hidden influence factor h_1 passed to the second layer;
wherein (s_1, h_1) = GRU(s_0, h_0).
step D2: inputting the output vector s_1 into the second end-to-end memory network MemN2N model as a second query vector to obtain a reply vector r_1.
Further, step D2 includes:
step D21: in the 1st cycle (hop) of the second end-to-end memory network MemN2N model, respectively calculating the correlation value p_i between the second query vector s_1 and the i-th reply head vector k_i in the preset reply head group;
wherein p_i = Softmax((s_1)^T * k_i), and T denotes transposition;
step D22: calculating the reply sub-vector o_1 of the 1st cycle according to the correlation value p_i of the i-th reply head vector k_i and the i-th reply tail vector l_i in the preset reply tail group;
wherein o_1 = Σ_i p_i * l_i.
step D23: adding the second query vector s_1 and the reply sub-vector o_1 of the 1st cycle to obtain the second query vector s_2 of the 2nd cycle;
step D24: repeating steps D21 to D23 until the reply sub-vector o_N of the N-th cycle is calculated;
step D25: taking the reply sub-vector o_N of the N-th cycle as the reply vector r_1.
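Steps D21 to D25 amount to a key-value style memory read per decoding step, sketched below under the same illustrative assumptions; K and L stand in for the reply head and tail groups.

```python
# A sketch of steps D21-D25: one key-value memory read. The query is matched
# against the keys (reply head vectors k_i) while the output is a weighted
# sum over the values (reply tail vectors l_i).
import torch

def reply_memory_read(s1, K, L, N=3):
    s = s1                                     # (1, d)
    o = torch.zeros_like(s1)
    for _ in range(N):
        p = torch.softmax(s @ K.T, dim=-1)     # p_i = Softmax(s^T k_i)  (D21)
        o = p @ L                              # o = sum_i p_i * l_i     (D22)
        s = s + o                              # query for the next hop  (D23)
    return o                                   # o_N is the reply vector (D25)
```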
Still further, the method further comprises:
step E1: acquiring a reply information base; wherein the reply information base comprises a plurality of pieces of reply information expressed in the form of knowledge triples, and each piece of reply information comprises: a head, a relation and a tail;
step E2: converting the head in each piece of reply information into a reply head vector through a preset translation embedding TransE algorithm, thereby forming a reply head group;
step E3: converting the tail in each piece of reply information into a reply tail vector through a preset translation embedding TransE algorithm, thereby forming a reply tail group;
wherein (k_i, l_i) = MLP(TransE(h, r, t)); k_i corresponds to the head h, and l_i corresponds to the tail t.
step E4: and establishing the corresponding relation between the reply head vector and the reply tail vector according to the relation part in each reply message.
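One possible reading of the formula above is sketched below, with a stand-in for pretrained TransE entity embeddings and a single linear layer as the MLP; the shapes and the split into k_i and l_i are assumptions, since the embodiment only names MLP(TransE(h, r, t)).

```python
# A sketch of steps E1-E4 under stated assumptions: a placeholder table for
# pretrained TransE entity embeddings and a linear MLP head mapping them
# into the memory space.
import torch
import torch.nn as nn

transe_entity = nn.Embedding(1000, 64)   # stand-in for pretrained TransE entities
mlp = nn.Linear(64, 64)                  # maps TransE space into memory space

def embed_reply_triple(h_id, t_id):
    k_i = mlp(transe_entity(torch.tensor([h_id])))   # reply head vector from h
    l_i = mlp(transe_entity(torch.tensor([t_id])))   # reply tail vector from t
    return k_i, l_i
```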
step D3: inputting the output vector s_1 and the hidden influence factor h_1 of the second layer into the second gated recurrent unit GRU model again to obtain an output vector s_2 and a hidden influence factor h_2 passed to the third layer, inputting the output vector s_2 into the second end-to-end memory network MemN2N model again to obtain a reply vector r_2, and so on, until the output vector of the second gated recurrent unit GRU model is a preset end character vector.
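Steps D1 to D3 then form the outer decoding loop. The sketch below reuses reply_memory_read from the earlier sketch; the stop test against a preset end character vector and the step limit are illustrative.

```python
# A sketch of steps D1-D3: a GRUCell-driven decoding loop in which every
# hidden state is used as a MemN2N query (names and stop test illustrative).
import torch
import torch.nn as nn

gru_cell = nn.GRUCell(64, 64)
END = torch.zeros(1, 64)            # stand-in for the preset end character vector

def decode(question_vector, K, L, max_steps=20):
    h = question_vector             # h_0 is the Encoder output            (D1)
    s = torch.zeros(1, 64)          # s_0, the preset start character vector
    replies = []
    for _ in range(max_steps):
        h = gru_cell(s, h)          # (s_t, h_t) = GRU(s_{t-1}, h_{t-1})
        s = h                       # in a single-layer GRU, output == new hidden
        replies.append(reply_memory_read(s, K, L))   # r_t from the memory (D2)
        if torch.allclose(s, END):  # stop at the end character vector     (D3)
            break
    return replies
```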
Step S104: and respectively converting each reply vector into reply words, and combining all the reply words into reply information.
Specifically, step S104 includes:
the answer vector r is obtained according to the following formulaiCorresponding reply word wi
P(ri=wi)=softmax(Wri);
Wherein W is a preset matrix containing a plurality of answer words, and the calculated word with the maximum P value in the matrix W is taken as RiCorresponding reply word wi
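Step S104 can be sketched as an argmax over the preset word matrix W, continuing the decoding sketch above; the word list and W are placeholders.

```python
# A sketch of step S104: projecting each reply vector onto a word matrix W
# and taking the most probable word (vocabulary and weights are placeholders).
import torch

words = ["cat", "animal", "yes", "no"]   # hypothetical reply vocabulary
W = torch.randn(len(words), 64)          # preset matrix of reply words

def to_word(r_i):
    probs = torch.softmax(W @ r_i.squeeze(0), dim=-1)  # P(r_i = w_i) = softmax(W r_i)
    return words[int(probs.argmax())]

# `replies` is the list of reply vectors from the decoding sketch above.
reply_text = " ".join(to_word(r) for r in replies)     # combine into reply information
```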
In the decoding (Decoder) process, that is, the process of decoding the question vector into the reply information, the reply information is likewise generated in the form of GRU + MemN2N; the initial hidden layer state of the GRU network is the output of the Encoder part. For the Memory, unlike the Encoder part, a TransE algorithm is used to encode the knowledge triples, replacing EmbeddingA and EmbeddingC in the MemN2N model. Further, unlike the Encoder, which takes the output of the last moment of the GRU network as the input of the MemN2N, the Decoder part takes every hidden state of the GRU as the query vector of the MemN2N.
The Decoder part of this embodiment is implemented so that entity words need not be distinguished from common words when generating the reply, and all reply words can be obtained from the vocabulary. In addition, borrowing the idea of the Key-Value Memory Network, the method separates the similarity calculation between the Memory and the query from the weighted-sum output, so that the query is matched against the head entities of the knowledge triples while the output is drawn from the tail entities, which reduces the repetition rate between the generated reply and the question.
Example two
An embodiment of the present invention provides a dialog generating device, as shown in fig. 2, the dialog generating device specifically includes the following components:
an obtaining module 201, configured to obtain question information and convert the question information into a first query vector by using a preset first gated recurrent unit GRU model;
the question module 202 is configured to determine, according to the first query vector, a common sense vector associated with the first query vector by using a preset first end-to-end memory network MemN2N model, and form a question vector according to the first query vector and the common sense vector;
a reply module 203, configured to convert the question vector into a plurality of second query vectors by using a preset second gated recurrent unit GRU model, and sequentially input each second query vector into a preset second end-to-end memory network MemN2N model to obtain a plurality of reply vectors;
the conversion module 204 is configured to convert each reply vector into a reply word, and combine all reply words into reply information.
Specifically, the obtaining module 201 is configured to:
performing word segmentation on the question information, and forming a word sequence from the plurality of keywords obtained after the word segmentation; for each target keyword in the word sequence, calculating, by using the first gated recurrent unit GRU model, the hidden influence factor that the target keyword passes to the keyword after it in the word sequence, according to the hidden influence factor passed to the target keyword from the keyword before it in the word sequence; and taking the hidden influence factor calculated for the last keyword in the word sequence as the first query vector u_1 corresponding to the question information.
Further, question module 202 is specifically configured to:
in the 1st cycle of the first end-to-end memory network MemN2N model, respectively calculate the correlation value p_i between the first query vector u_1 and the i-th common sense head vector x_i in the preset common sense head group; calculate the question sub-vector a_1 of the 1st cycle according to the correlation value p_i of the i-th common sense head vector x_i and the i-th common sense tail vector y_i in the preset common sense tail group; add the first query vector u_1 and the question sub-vector a_1 to obtain the first query vector u_2 of the 2nd cycle; according to the first query vector u_2 of the 2nd cycle, recalculate the question sub-vector a_2 of the 2nd cycle and the first query vector u_3 of the 3rd cycle, and so on, until the question sub-vector a_M of the M-th cycle is calculated; and take the question sub-vector a_M of the M-th cycle as the question vector.
Further, the apparatus further comprises:
the processing module is used for acquiring a common sense information base, wherein the common sense information base comprises a plurality of pieces of common sense information expressed in the form of knowledge triples, and each piece of common sense information comprises: a head, a relation and a tail; converting the head in each piece of common sense information into a common sense head vector through a preset first hidden layer matrix, thereby forming a common sense head group; converting the tail in each piece of common sense information into a common sense tail vector through a preset second hidden layer matrix, thereby forming a common sense tail group; and establishing the corresponding relation between the common sense head vectors and the common sense tail vectors according to the relation part of each piece of common sense information.
Further, the reply module 203 is specifically configured to:
take the question vector as the hidden influence factor h_0 of the first layer, and input it together with a preset start character vector s_0 into the second gated recurrent unit GRU model to obtain an output vector s_1 and a hidden influence factor h_1 passed to the second layer; input the output vector s_1 into the second end-to-end memory network MemN2N model as a second query vector to obtain a reply vector r_1; input the output vector s_1 and the hidden influence factor h_1 of the second layer into the second gated recurrent unit GRU model again to obtain an output vector s_2 and a hidden influence factor h_2 passed to the third layer, input the output vector s_2 into the second end-to-end memory network MemN2N model again to obtain a reply vector r_2, and so on, until the output vector of the second gated recurrent unit GRU model is a preset end character vector.
Further, when inputting the output vector s_1 into the second end-to-end memory network MemN2N model as a second query vector to obtain the reply vector r_1, the reply module 203 is specifically configured to:
in the 1st cycle of the second end-to-end memory network MemN2N model, respectively calculate the correlation value p_i between the second query vector s_1 and the i-th reply head vector k_i in the preset reply head group; calculate the reply sub-vector o_1 of the 1st cycle according to the correlation value p_i of the i-th reply head vector k_i and the i-th reply tail vector l_i in the preset reply tail group; add the second query vector s_1 and the reply sub-vector o_1 of the 1st cycle to obtain the second query vector s_2 of the 2nd cycle; according to the second query vector s_2 of the 2nd cycle, recalculate the reply sub-vector o_2 of the 2nd cycle and the second query vector s_3 of the 3rd cycle, and so on, until the reply sub-vector o_N of the N-th cycle is calculated; and take the reply sub-vector o_N of the N-th cycle as the reply vector r_1.
Further, the processing module is further configured to:
acquiring a reply information base, wherein the reply information base comprises a plurality of pieces of reply information expressed in the form of knowledge triples, and each piece of reply information comprises: a head, a relation and a tail; converting the head in each piece of reply information into a reply head vector through a preset translation embedding TransE algorithm, thereby forming a reply head group; converting the tail in each piece of reply information into a reply tail vector through a preset translation embedding TransE algorithm, thereby forming a reply tail group; and establishing the corresponding relation between the reply head vectors and the reply tail vectors according to the relation part of each piece of reply information.
Example three
This embodiment also provides a computer device, such as a smart phone, a tablet computer, a notebook computer, a desktop computer, a rack server, a blade server or a tower server (including an independent server, or a server cluster composed of a plurality of servers) capable of executing programs, and the like. As shown in fig. 3, the computer device 30 of this embodiment includes at least, but is not limited to: a memory 301 and a processor 302 that are communicatively coupled to each other via a system bus. It is noted that fig. 3 only shows the computer device 30 with components 301 and 302, but it should be understood that not all of the shown components are required, and more or fewer components may be implemented instead.
In this embodiment, the memory 301 (i.e., the readable storage medium) includes a flash memory, a hard disk, a multimedia card, a card-type memory (e.g., SD or DX memory, etc.), a Random Access Memory (RAM), a Static Random Access Memory (SRAM), a read-only memory (ROM), an electrically erasable programmable read-only memory (EEPROM), a programmable read-only memory (PROM), a magnetic memory, a magnetic disk, an optical disk, and the like. In some embodiments, the storage 301 may be an internal storage unit of the computer device 30, such as a hard disk or a memory of the computer device 30. In other embodiments, the memory 301 may also be an external storage device of the computer device 30, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), or the like, provided on the computer device 30. Of course, the memory 301 may also include both internal and external storage devices for the computer device 30. In the present embodiment, the memory 301 is generally used for storing an operating system and various types of application software installed in the computer device 30. In addition, the memory 301 may also be used to temporarily store various types of data that have been output or are to be output.
Processor 302 may be a Central Processing Unit (CPU), controller, microcontroller, microprocessor, or other data Processing chip in some embodiments. The processor 302 generally serves to control the overall operation of the computer device 30.
Specifically, in this embodiment, the processor 302 is configured to execute the program of the dialog generation method stored in the memory 301, and when the program of the dialog generation method is executed, the following steps are implemented:
obtaining question information, and converting the question information into a first query vector by using a preset first gated recurrent unit GRU model;
determining a common sense vector associated with the first query vector by utilizing a preset first end-to-end memory network MemN2N model according to the first query vector, and forming a question vector according to the first query vector and the common sense vector;
converting the question vector into a plurality of second query vectors by using a preset second gated recurrent unit GRU model, and sequentially inputting the second query vectors into a preset second end-to-end memory network MemN2N model to obtain a plurality of reply vectors;
and respectively converting each reply vector into reply words, and combining all the reply words into reply information.
The specific embodiment process of the above method steps can be referred to in the first embodiment, and the detailed description of this embodiment is not repeated here.
Example four
The present embodiments also provide a computer readable storage medium, such as a flash memory, a hard disk, a multimedia card, a card type memory (e.g., SD or DX memory, etc.), a Random Access Memory (RAM), a Static Random Access Memory (SRAM), a Read Only Memory (ROM), an Electrically Erasable Programmable Read Only Memory (EEPROM), a Programmable Read Only Memory (PROM), a magnetic memory, a magnetic disk, an optical disk, a server, an App application mall, etc., having stored thereon a computer program that when executed by a processor implements the method steps of:
obtaining question information, and converting the question information into a first query vector by using a preset first gated recurrent unit GRU model;
determining a common sense vector associated with the first query vector by utilizing a preset first end-to-end memory network MemN2N model according to the first query vector, and forming a question vector according to the first query vector and the common sense vector;
converting the question vector into a plurality of second query vectors by using a preset second gated recurrent unit GRU model, and sequentially inputting the second query vectors into a preset second end-to-end memory network MemN2N model to obtain a plurality of reply vectors;
and respectively converting each reply vector into reply words, and combining all the reply words into reply information.
The specific embodiment process of the above method steps can be referred to in the first embodiment, and the detailed description of this embodiment is not repeated here.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
The above-mentioned serial numbers of the embodiments of the present invention are merely for description and do not represent the merits of the embodiments.
Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner.
The above description is only a preferred embodiment of the present invention, and not intended to limit the scope of the present invention, and all modifications of equivalent structures and equivalent processes, which are made by using the contents of the present specification and the accompanying drawings, or directly or indirectly applied to other related technical fields, are included in the scope of the present invention.

Claims (10)

1. A dialog generation method, characterized in that the method comprises:
obtaining question information, and converting the question information into a first query vector by using a preset first gated recurrent unit GRU model;
determining a common sense vector associated with the first query vector by utilizing a preset first end-to-end memory network MemN2N model according to the first query vector, and forming a question vector according to the first query vector and the common sense vector;
converting the question vector into a plurality of second query vectors by using a preset second gated recurrent unit GRU model, and sequentially inputting the second query vectors into a preset second end-to-end memory network MemN2N model to obtain a plurality of reply vectors;
and respectively converting each reply vector into reply words, and combining all the reply words into reply information.
2. The dialog generation method according to claim 1, wherein the obtaining question information and converting the question information into a first query vector by using a preset first gated recurrent unit GRU model includes:
performing word segmentation on the question information, and forming a word sequence from the plurality of keywords obtained after the word segmentation;
for each target keyword in the word sequence, calculating, by using the first gated recurrent unit GRU model, the hidden influence factor that the target keyword passes to the keyword after it in the word sequence, according to the hidden influence factor passed to the target keyword from the keyword before it in the word sequence;
taking the hidden influence factor calculated for the last keyword in the word sequence as the first query vector u_1 corresponding to the question information.
3. The dialog generation method according to claim 2, wherein the determining, based on the first query vector and using a preset first end-to-end memory network MemN2N model, a common sense vector associated with the first query vector and forming a question vector based on the first query vector and the common sense vector comprises:
in the 1st cycle of the first end-to-end memory network MemN2N model, respectively calculating the correlation value p_i between the first query vector u_1 and the i-th common sense head vector x_i in the preset common sense head group;
calculating the question sub-vector a_1 of the 1st cycle according to the correlation value p_i of the i-th common sense head vector x_i and the i-th common sense tail vector y_i in the preset common sense tail group;
adding the first query vector u_1 and the question sub-vector a_1 to obtain the first query vector u_2 of the 2nd cycle;
according to the first query vector u_2 of the 2nd cycle, recalculating the question sub-vector a_2 of the 2nd cycle and the first query vector u_3 of the 3rd cycle, and so on, until the question sub-vector a_M of the M-th cycle is calculated;
taking the question sub-vector a_M of the M-th cycle as the question vector.
4. The dialog generation method of claim 3, further comprising:
acquiring a common sense information base; wherein the common sense information base comprises a plurality of pieces of common sense information expressed in the form of knowledge triples, and each piece of common sense information comprises: a head, a relation and a tail;
converting the head in each piece of common sense information into a common sense head vector through a preset first hidden layer matrix, thereby forming a common sense head group;
converting the tail in each piece of common sense information into a common sense tail vector through a preset second hidden layer matrix, thereby forming a common sense tail group;
and establishing the corresponding relation between the common sense head vector and the common sense tail vector according to the relation part in each common sense information.
5. The dialog generation method according to claim 1, wherein the converting the question vector into a plurality of second query vectors by using a preset second gated recurrent unit GRU model, and sequentially inputting the second query vectors into a preset second end-to-end memory network MemN2N model to obtain a plurality of reply vectors, includes:
taking the question vector as the hidden influence factor h_0 of the first layer, and inputting it together with a preset start character vector s_0 into the second gated recurrent unit GRU model to obtain an output vector s_1 and a hidden influence factor h_1 passed to the second layer;
inputting the output vector s_1 into the second end-to-end memory network MemN2N model as a second query vector to obtain a reply vector r_1;
inputting the output vector s_1 and the hidden influence factor h_1 of the second layer into the second gated recurrent unit GRU model again to obtain an output vector s_2 and a hidden influence factor h_2 passed to the third layer, inputting the output vector s_2 into the second end-to-end memory network MemN2N model again to obtain a reply vector r_2, and so on, until the output vector of the second gated recurrent unit GRU model is a preset end character vector.
6. The dialog generation method according to claim 5, wherein the inputting the output vector s_1 into the second end-to-end memory network MemN2N model as a second query vector to obtain a reply vector r_1 includes:
in the 1st cycle of the second end-to-end memory network MemN2N model, respectively calculating the correlation value p_i between the second query vector s_1 and the i-th reply head vector k_i in the preset reply head group;
calculating the reply sub-vector o_1 of the 1st cycle according to the correlation value p_i of the i-th reply head vector k_i and the i-th reply tail vector l_i in the preset reply tail group;
adding the second query vector s_1 and the reply sub-vector o_1 of the 1st cycle to obtain the second query vector s_2 of the 2nd cycle;
according to the second query vector s_2 of the 2nd cycle, recalculating the reply sub-vector o_2 of the 2nd cycle and the second query vector s_3 of the 3rd cycle, and so on, until the reply sub-vector o_N of the N-th cycle is calculated;
taking the reply sub-vector o_N of the N-th cycle as the reply vector r_1.
7. The dialog generation method of claim 6, further comprising:
acquiring a reply information base; wherein the reply information base comprises a plurality of pieces of reply information expressed in the form of knowledge triples, and each piece of reply information comprises: a head, a relation and a tail;
converting the head in each piece of reply information into a reply head vector through a preset translation embedding TransE algorithm, thereby forming a reply head group;
converting the tail in each piece of reply information into a reply tail vector through a preset translation embedding TransE algorithm, thereby forming a reply tail group;
and establishing the corresponding relation between the reply head vector and the reply tail vector according to the relation part in each reply message.
8. A dialog generation apparatus, characterized in that the apparatus comprises:
the acquisition module is used for obtaining question information and converting the question information into a first query vector by using a preset first gated recurrent unit GRU model;
the question module is used for determining a common sense vector associated with the first query vector by utilizing a preset first end-to-end memory network MemN2N model according to the first query vector and forming a question vector according to the first query vector and the common sense vector;
the reply module is used for converting the question vector into a plurality of second query vectors by using a preset second gated recurrent unit GRU model, and sequentially inputting the second query vectors into a preset second end-to-end memory network MemN2N model to obtain a plurality of reply vectors;
and the conversion module is used for respectively converting each reply vector into reply words and combining all the reply words into reply information.
9. A computer device, the computer device comprising: memory, processor and computer program stored on the memory and executable on the processor, characterized in that the processor implements the steps of the method of any of claims 1 to 7 when executing the computer program.
10. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method according to any one of claims 1 to 7.
CN202011059826.7A 2020-09-30 2020-09-30 Dialogue generation method, device, equipment and readable storage medium Active CN112199482B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202011059826.7A CN112199482B (en) 2020-09-30 2020-09-30 Dialogue generation method, device, equipment and readable storage medium
PCT/CN2021/091292 WO2022068197A1 (en) 2020-09-30 2021-04-30 Conversation generation method and apparatus, device, and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011059826.7A CN112199482B (en) 2020-09-30 2020-09-30 Dialogue generation method, device, equipment and readable storage medium

Publications (2)

Publication Number Publication Date
CN112199482A (en) 2021-01-08
CN112199482B CN112199482B (en) 2023-07-21

Family

ID=74007267

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011059826.7A Active CN112199482B (en) 2020-09-30 2020-09-30 Dialogue generation method, device, equipment and readable storage medium

Country Status (2)

Country Link
CN (1) CN112199482B (en)
WO (1) WO2022068197A1 (en)

Cited By (1)

Publication number Priority date Publication date Assignee Title
WO2022068197A1 (en) * 2020-09-30 2022-04-07 平安科技(深圳)有限公司 Conversation generation method and apparatus, device, and readable storage medium

Citations (6)

Publication number Priority date Publication date Assignee Title
US20170293919A1 (en) * 2016-04-11 2017-10-12 International Business Machines Corporation Feature learning on customer journey using categorical sequence data
US20180276525A1 (en) * 2015-12-03 2018-09-27 Huawei Technologies Co., Ltd. Method and neural network system for human-computer interaction, and user equipment
CN110704588A (en) * 2019-09-04 2020-01-17 平安科技(深圳)有限公司 Multi-round dialogue semantic analysis method and system based on long-term and short-term memory network
CN111291534A (en) * 2020-02-03 2020-06-16 苏州科技大学 Global coding method for automatic summarization of Chinese long text
CN111400468A (en) * 2020-03-11 2020-07-10 苏州思必驰信息科技有限公司 Conversation state tracking system and method, and man-machine conversation device and method
CN111414460A (en) * 2019-02-03 2020-07-14 北京邮电大学 Multi-round dialogue management method and device combining memory storage and neural network

Family Cites Families (5)

Publication number Priority date Publication date Assignee Title
US10803252B2 (en) * 2018-06-30 2020-10-13 Wipro Limited Method and device for extracting attributes associated with centre of interest from natural language sentences
CN109840255B (en) * 2019-01-09 2023-09-19 平安科技(深圳)有限公司 Reply text generation method, device, equipment and storage medium
CN110377719B (en) * 2019-07-25 2022-02-15 广东工业大学 Medical question and answer method and device
CN111143530B (en) * 2019-12-24 2024-04-05 平安健康保险股份有限公司 Intelligent reply method and device
CN112199482B (en) * 2020-09-30 2023-07-21 平安科技(深圳)有限公司 Dialogue generation method, device, equipment and readable storage medium


Also Published As

Publication number Publication date
CN112199482B (en) 2023-07-21
WO2022068197A1 (en) 2022-04-07


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant