CN112199482B - Dialogue generation method, device, equipment and readable storage medium - Google Patents


Info

Publication number
CN112199482B
CN112199482B (application CN202011059826.7A)
Authority
CN
China
Prior art keywords
vector
reply
query
preset
common sense
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011059826.7A
Other languages
Chinese (zh)
Other versions
CN112199482A (en)
Inventor
李雅峥
杨海钦
姚晓远
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Technology Shenzhen Co Ltd
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Technology Shenzhen Co Ltd filed Critical Ping An Technology Shenzhen Co Ltd
Priority to CN202011059826.7A priority Critical patent/CN112199482B/en
Publication of CN112199482A publication Critical patent/CN112199482A/en
Priority to PCT/CN2021/091292 priority patent/WO2022068197A1/en
Application granted granted Critical
Publication of CN112199482B publication Critical patent/CN112199482B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Mathematical Physics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a dialogue generation method, device, equipment and readable storage medium, wherein the method comprises the following steps: acquiring question information, and converting the question information into a first query vector by using a preset first gated recurrent unit (GRU) model; determining a common sense vector associated with the first query vector by using a preset first end-to-end memory network (MemN2N) model, and forming a question vector from the first query vector and the common sense vector; converting the question vector into a plurality of second query vectors by using a preset second GRU model, and sequentially inputting each second query vector into a preset second MemN2N model to obtain a plurality of reply vectors; and converting each reply vector into a reply word, and combining all the reply words into reply information. The invention can quickly and accurately form reply information in a remote consultation dialogue and improves the user experience.

Description

Dialogue generation method, device, equipment and readable storage medium
Technical Field
The present invention relates to the field of remote medical consultation dialogues, and in particular to a dialogue generation method, apparatus, device and readable storage medium.
Background
With the continuous development of artificial intelligence, man-machine dialogue is applied to more and more scenarios. For example, in a customer service scenario, reply information corresponding to question information input by a user is formed by recognizing that question information, which reduces labor cost. However, a conventional open-domain man-machine dialogue system starts only from the dialogue data; without understanding the background knowledge and common sense behind the user's question, it tends to generate generic replies that lack effective information, which affects the readability of the reply information. Therefore, how to quickly and accurately form reply information from the user's question information is a technical problem to be solved by those skilled in the art.
Disclosure of Invention
The invention aims to provide a dialogue generation method, device, equipment and readable storage medium, which can quickly and accurately form reply information in a remote consultation dialogue and improve the user experience.
According to an aspect of the present invention, there is provided a dialog generation method, the method comprising:
acquiring question information, and converting the question information into a first query vector by using a preset first gated recurrent unit (GRU) model;
determining a common sense vector associated with the first query vector by using a preset first end-to-end memory network (MemN2N) model, and forming a question vector from the first query vector and the common sense vector;
converting the question vector into a plurality of second query vectors by using a preset second GRU model, and sequentially inputting each second query vector into a preset second MemN2N model to obtain a plurality of reply vectors;
converting each reply vector into a reply word, and combining all the reply words into reply information.
Optionally, the acquiring of the question information and the converting of it into a first query vector by using the preset first GRU model include:
performing word segmentation on the question information, and forming a word sequence from the keywords obtained after segmentation;
for a target keyword in the word sequence, calculating, with the first GRU model, the hidden influence factor that the target keyword passes to the keyword after it in the word sequence, according to the hidden influence factor passed to the target keyword by the keyword before it;
taking the hidden influence factor calculated for the last keyword in the word sequence as the first query vector u_1 corresponding to the question information.
Optionally, the determining of the common sense vector associated with the first query vector by using the preset first MemN2N model, and the forming of the question vector from the first query vector and the common sense vector, include:
in the 1st hop of the first MemN2N model, calculating the correlation value p_i between the first query vector u_1 and the i-th common sense head vector x_i in a preset common sense head group;
calculating the question sub-vector a_1 of the 1st hop from the correlation value p_i of the i-th common sense head vector x_i and the i-th common sense tail vector y_i in a preset common sense tail group;
adding the first query vector u_1 and the question sub-vector a_1 to obtain the first query vector u_2 of the 2nd hop;
recalculating, from the first query vector u_2 of the 2nd hop, the question sub-vector a_2 of the 2nd hop and the first query vector u_3 of the 3rd hop, and so on, until the question sub-vector a_M of the M-th hop is calculated;
taking the question sub-vector a_M of the M-th hop as the question vector.
Optionally, the method further comprises:
obtaining a common sense information base; wherein the common sense information base includes a plurality of common sense information represented in the form of a knowledge triplet, and the common sense information includes: a head portion, a relationship portion, and a tail portion;
converting the head in each piece of common sense information into a common sense head vector through a preset first hidden-layer matrix, thereby forming a common sense head group;
converting the tail in each piece of common sense information into a common sense tail vector through a preset second hidden-layer matrix, thereby forming a common sense tail group;
and establishing a corresponding relation between the common sense head vector and the common sense tail vector according to the relation part in each common sense information.
Optionally, the converting of the question vector into a plurality of second query vectors by using the preset second GRU model, and the sequential inputting of each second query vector into the preset second MemN2N model to obtain a plurality of reply vectors, include:
taking the question vector as the hidden influence factor h_0 of the first layer, and inputting a preset start-character vector s_0 into the second GRU model to obtain an output vector s_1 and a hidden influence factor h_1 passed to the second layer;
inputting the output vector s_1 as a second query vector into the second MemN2N model to obtain a reply vector r_1;
inputting the output vector s_1 and the hidden influence factor h_1 of the second layer back into the second GRU model to obtain an output vector s_2 and a hidden influence factor h_2 passed to the third layer, inputting the output vector s_2 into the second MemN2N model to obtain a reply vector r_2, and so on, until the output vector of the second GRU model is a preset end-character vector.
Optionally, the inputting of the output vector s_1 as a second query vector into the second MemN2N model to obtain the reply vector r_1 includes:
in the 1st hop of the second MemN2N model, calculating the correlation value p_i between the second query vector s_1 and the i-th reply head vector k_i in a preset reply head group;
calculating the reply sub-vector o_1 of the 1st hop from the correlation value p_i of the i-th reply head vector k_i and the i-th reply tail vector l_i in a preset reply tail group;
adding the second query vector s_1 and the reply sub-vector o_1 to obtain the second query vector s_2 of the 2nd hop;
recalculating, from the second query vector s_2 of the 2nd hop, the reply sub-vector o_2 of the 2nd hop and the second query vector s_3 of the 3rd hop, and so on, until the reply sub-vector o_N of the N-th hop is calculated;
taking the reply sub-vector o_N of the N-th hop as the reply vector r_1.
Optionally, the method further comprises:
obtaining a reply information base; wherein the reply information base includes a plurality of reply information expressed in the form of a knowledge triplet, and the reply information includes: a head portion, a relationship portion, and a tail portion;
converting the head in each piece of reply information into a reply head vector through a preset translation-embedding (TransE) algorithm, thereby forming a reply head group;
converting the tail in each piece of reply information into a reply tail vector through the preset TransE algorithm, thereby forming a reply tail group;
and establishing a corresponding relation between the reply head vector and the reply tail vector according to the relation part in each reply message.
In order to achieve the above object, the present invention also provides a dialog generating apparatus, including:
the acquisition module is used for acquiring question information and converting the question information into a first query vector by using a preset first gated recurrent unit (GRU) model;
the question module is used for determining, by using a preset first end-to-end memory network (MemN2N) model, a common sense vector associated with the first query vector, and for forming a question vector from the first query vector and the common sense vector;
the reply module is used for converting the question vector into a plurality of second query vectors by using a preset second GRU model, and for sequentially inputting each second query vector into a preset second MemN2N model to obtain a plurality of reply vectors;
and the conversion module is used for respectively converting each reply vector into reply words and combining all the reply words into reply information.
In order to achieve the above object, the present invention further provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and capable of running on the processor, wherein the processor implements the steps of the above dialogue generation method when executing the computer program.
In order to achieve the above object, the present invention also provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the dialog generation method described above.
The dialogue generation method, device, equipment and readable storage medium combine the end-to-end memory network MemN2N architecture with the GRU network: common sense information related to the question information is found from the question information, and the question information and the common sense information are considered together to determine the reply information. When encoding the question information into the question vector, encoding is performed in the form GRU+MemN2N: for the question information input by the user, a GRU network replaces Embedding B in the MemN2N network, and the final hidden layer of the GRU network is input into the MemN2N network as the query vector. When decoding the question vector into the reply information, the reply information is likewise generated in the form GRU+MemN2N. The invention can quickly and accurately form reply information in a remote consultation dialogue and improves the user experience.
Drawings
Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for purposes of illustrating the preferred embodiments and are not to be construed as limiting the invention. Also, like reference numerals are used to designate like parts throughout the figures. In the drawings:
FIG. 1 is a schematic flow chart of an alternative dialog generation method according to the first embodiment;
fig. 2 is a schematic diagram of an alternative composition structure of a dialogue generating device according to the second embodiment;
fig. 3 is a schematic diagram of an alternative hardware architecture of a computer device according to the third embodiment.
Detailed Description
The present invention will be described in further detail with reference to the drawings and examples, in order to make the objects, technical solutions and advantages of the present invention more apparent. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Example 1
The embodiment of the invention provides a dialogue generating method, as shown in fig. 1, which specifically comprises the following steps:
step S101: question information is obtained and converted into a first query vector using a preset first GRU (Gate Recurrent Unit, gate recursion unit) model.
Specifically, step S101 includes:
Step A1: performing word segmentation on the question information, and forming a word sequence from the keywords obtained after segmentation; wherein the word sequence includes N keywords;
Step A2: for a target keyword in the word sequence, calculating, with the first GRU model, the hidden influence factor that the target keyword passes to the keyword after it in the word sequence, according to the hidden influence factor passed to the target keyword by the keyword before it;
Step A3: taking the hidden influence factor calculated for the last keyword in the word sequence as the first query vector u_1 corresponding to the question information.
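Steps A1-A3 can be sketched as follows. This is a minimal illustration only: the hidden size, random weights, and keyword embeddings are hypothetical stand-ins (a real system would use learned GRU parameters and an actual word-segmentation step).

```python
import numpy as np

rng = np.random.default_rng(0)
d = 8  # illustrative hidden/embedding size

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Randomly initialised GRU cell parameters (learned in practice).
Wz, Uz = rng.normal(size=(d, d)), rng.normal(size=(d, d))
Wr, Ur = rng.normal(size=(d, d)), rng.normal(size=(d, d))
Wh, Uh = rng.normal(size=(d, d)), rng.normal(size=(d, d))

def gru_cell(x, h_prev):
    """One GRU step: combines the current keyword embedding x with the
    hidden influence factor h_prev passed from the preceding keyword."""
    z = sigmoid(Wz @ x + Uz @ h_prev)          # update gate
    r = sigmoid(Wr @ x + Ur @ h_prev)          # reset gate
    h_tilde = np.tanh(Wh @ x + Uh @ (r * h_prev))
    return (1 - z) * h_prev + z * h_tilde      # factor passed to the next keyword

# Word sequence after segmentation, as hypothetical keyword embeddings.
keywords = [rng.normal(size=d) for _ in range(5)]

h = np.zeros(d)
for x in keywords:
    h = gru_cell(x, h)

u1 = h  # the hidden factor after the last keyword is the first query vector u_1
```

The final hidden state `u1` is what step A3 hands to the MemN2N model as the query.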
Step S102: a common sense vector associated with the first query vector is determined using a preset first MemN2N (End-to-End Memory Network) model, and a question vector is formed from the first query vector and the common sense vector.
Specifically, step S102 includes:
Step B1: in the 1st hop of the first MemN2N model, calculating the correlation value p_i between the first query vector u_1 and the i-th common sense head vector x_i in the preset common sense head group;
wherein p_i = Softmax((u_1)^T * x_i), and ^T denotes the transpose.
Step B2: calculating the question sub-vector a_1 of the 1st hop from the correlation value p_i of the i-th common sense head vector x_i and the i-th common sense tail vector y_i in the preset common sense tail group;
wherein a_1 = Σ_i p_i * y_i.
Step B3: adding the first query vector u_1 and the question sub-vector a_1 to obtain the first query vector u_2 of the 2nd hop;
Step B4: repeating steps B1-B3 until the question sub-vector a_M of the M-th hop is calculated;
Step B5: taking the question sub-vector a_M of the M-th hop as the question vector.
Further, the method further comprises:
step C1: obtaining a common sense information base; wherein the common sense information base includes a plurality of common sense information represented in the form of a knowledge triplet, and the common sense information includes: a head portion, a relationship portion, and a tail portion;
Taking "a cat is an animal" as an example, the knowledge-triplet form is (h: cat, r: belongs to, t: animal), where h denotes the head, t denotes the tail, and r denotes the relationship between head and tail.
Step C2: converting the head in each piece of common sense information into a common sense head vector through a preset first hidden-layer matrix (Embedding A), thereby forming the common sense head group;
Step C3: converting the tail in each piece of common sense information into a common sense tail vector through a preset second hidden-layer matrix (Embedding C), thereby forming the common sense tail group;
step C4: and establishing a corresponding relation between the common sense head vector and the common sense tail vector according to the relation part in each common sense information.
In the Encoder stage, i.e., the process of encoding the question information into the question vector, encoding is performed in the form GRU+MemN2N: for the question information input by the user, a GRU network replaces Embedding B in the MemN2N network, and the final hidden layer of the GRU network is input into the MemN2N network as the query vector. The whole MemN2N network is stacked from several hops, and in each hop the correlation between the query vector and each piece of common sense information in the Memory is calculated. In this embodiment, implementing the Encoder with GRU+MemN2N means that, on the premise that the GRU has extracted the complete question information, common sense information highly relevant to the question as a whole can be added continuously, avoiding the information deviation caused by searching on a single entity word. In addition, the common sense information in the Memory is combined by weighted sum, which avoids selecting a single knowledge triplet as the compensation information and makes the acquired common sense information more comprehensive.
Step S103: according to the question vector, the question vector is converted into a plurality of second query vectors by using a preset second gate recursion unit GRU model, and each second query vector is sequentially input into a preset second end-to-end memory network MemN2N model to obtain a plurality of answer vectors.
Specifically, step S103 includes:
Step D1: taking the question vector as the hidden influence factor h_0 of the first layer, and inputting a preset start-character vector s_0 into the second GRU model to obtain an output vector s_1 and a hidden influence factor h_1 passed to the second layer;
wherein (s_1, h_1) = GRU(s_0, h_0).
Step D2: inputting the output vector s_1 as a second query vector into the second MemN2N model to obtain a reply vector r_1.
Further, step D2 includes:
Step D21: in the 1st hop of the second MemN2N model, calculating the correlation value p_i between the second query vector s_1 and the i-th reply head vector k_i in the preset reply head group;
wherein p_i = Softmax((s_1)^T * k_i), and ^T denotes the transpose;
Step D22: calculating the reply sub-vector o_1 of the 1st hop from the correlation value p_i of the i-th reply head vector k_i and the i-th reply tail vector l_i in the preset reply tail group;
wherein o_1 = Σ_i p_i * l_i.
Step D23: adding the second query vector s_1 and the reply sub-vector o_1 to obtain the second query vector s_2 of the 2nd hop;
Step D24: repeating steps D21-D23 until the reply sub-vector o_N of the N-th hop is calculated;
Step D25: taking the reply sub-vector o_N of the N-th hop as the reply vector r_1.
Still further, the method further comprises:
step E1: obtaining a reply information base; wherein the reply information base includes a plurality of reply information expressed in the form of a knowledge triplet, and the reply information includes: a head portion, a relationship portion, and a tail portion;
Step E2: converting the head in each piece of reply information into a reply head vector through a preset translation-embedding (TransE) algorithm, thereby forming the reply head group;
Step E3: converting the tail in each piece of reply information into a reply tail vector through the preset TransE algorithm, thereby forming the reply tail group;
wherein k = (h, r, t) = MLP(TransE(h, r, t)), with k_i = h and l_i = t; that is, the head component gives the reply head vector and the tail component gives the reply tail vector.
step E4: and establishing a corresponding relation between the reply head vector and the reply tail vector according to the relation part in each reply message.
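The TransE idea behind steps E2-E3 is that a valid triple should satisfy head + relation ≈ tail, so that the head embedding can serve as k_i and the tail embedding as l_i. The sketch below uses untrained, hypothetical embeddings purely to show the scoring convention; it is not the patent's trained model.

```python
import numpy as np

rng = np.random.default_rng(3)
d = 8

h = rng.normal(size=d)                  # head embedding
r = rng.normal(size=d)                  # relation embedding
t = h + r + 0.01 * rng.normal(size=d)   # tail chosen to roughly satisfy TransE

def transe_score(h, r, t):
    """TransE plausibility: smaller ||h + r - t|| means a more credible triple."""
    return float(np.linalg.norm(h + r - t))

k_i, l_i = h, t  # reply head vector and reply tail vector for the memory

good = transe_score(h, r, t)                     # near-zero for a valid triple
bad = transe_score(h, r, rng.normal(size=d))     # a random tail scores worse
```

Training TransE pushes `good`-style scores down and `bad`-style scores up, which is what makes the resulting head/tail vectors usable as the reply memory.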
Step D3: inputting the output vector s_1 and the hidden influence factor h_1 of the second layer back into the second GRU model to obtain an output vector s_2 and a hidden influence factor h_2 passed to the third layer, inputting the output vector s_2 into the second MemN2N model to obtain a reply vector r_2, and so on, until the output vector of the second GRU model is a preset end-character vector.
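The decoder loop of steps D1-D3 can be sketched as follows. The recurrent step and one-hop memory read are simplified stand-ins, and the fixed step limit replaces the end-character test; all weights and vectors are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(4)
d = 8

W, U = rng.normal(size=(d, d)), rng.normal(size=(d, d))

def gru_step(s_prev, h_prev):
    """Simplified recurrent step returning (output vector, hidden factor)."""
    h = np.tanh(W @ s_prev + U @ h_prev)
    return h, h  # the output, and the factor passed to the next layer

def memn2n_read(query, heads, tails):
    """One-hop memory read: attention over reply head/tail vectors."""
    p = np.exp(heads @ query)
    p /= p.sum()
    return p @ tails

heads = rng.normal(size=(5, d))  # reply head group k_i
tails = rng.normal(size=(5, d))  # reply tail group l_i

h = rng.normal(size=d)   # h_0: the question vector from the encoder
s = np.zeros(d)          # s_0: the preset start-character vector
replies, max_steps = [], 4

for _ in range(max_steps):                       # stand-in for the end-character test
    s, h = gru_step(s, h)                        # D1/D3: next output and hidden factor
    replies.append(memn2n_read(s, heads, tails)) # D2: reply vector r_i
```

Note that, as the description below emphasises, every decoder output `s` is used as a memory query, not just the final one.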
Step S104: each reply vector is converted into a reply word, and all the reply words are combined into reply information, respectively.
Specifically, step S104 includes:
obtaining the reply word w_i corresponding to the reply vector r_i according to the following formula:
P(r_i = w_i) = softmax(W * r_i);
wherein W is a preset matrix containing a plurality of reply words, and the word with the largest P value under the matrix W is taken as the reply word w_i corresponding to r_i.
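Step S104 is a projection onto the reply vocabulary followed by an argmax. A minimal sketch with a hypothetical four-word vocabulary and random weights:

```python
import numpy as np

rng = np.random.default_rng(5)
d = 8
vocab = ["yes", "no", "maybe", "hello"]  # hypothetical reply vocabulary

W = rng.normal(size=(len(vocab), d))  # preset matrix, one row per reply word
r_i = rng.normal(size=d)              # a reply vector from the decoder

logits = W @ r_i
probs = np.exp(logits - logits.max())
probs /= probs.sum()                  # P(r_i = w_i) = softmax(W r_i)

reply_word = vocab[int(np.argmax(probs))]  # word with the largest P value
```

Running this for every reply vector in order, then concatenating the chosen words, yields the final reply information.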
In the Decoder stage, i.e., the process of decoding the question vector into the reply information, the reply information is generated in the form GRU+MemN2N; the initial hidden state of the GRU network is the output of the Encoder part. For the Memory, unlike the Encoder part, the TransE algorithm replaces Embedding A and Embedding C in the MemN2N model to complete the encoding of the knowledge triples. Furthermore, unlike the Encoder, which takes the output at the last instant of the GRU network as the input to MemN2N, the Decoder part takes each hidden state of the GRU as the query vector of MemN2N.
In this embodiment, the implementation of the Decoder part avoids distinguishing entity words from common words when generating the reply, so that all reply words can be obtained from the vocabulary. In addition, borrowing the idea of the Key-Value Memory Network, this patent separates the similarity calculation between the Memory and the query from the weighted-sum output, so that the query is closer to the head entity of a knowledge triplet and the output is closer to the tail entity, which reduces the repetition rate between the generated reply and the question.
Example two
The embodiment of the invention provides a dialogue generating device, as shown in fig. 2, which specifically comprises the following components:
the acquisition module 201 is configured to acquire question information and convert the question information into a first query vector by using a preset first gated recurrent unit (GRU) model;
the question module 202 is configured to determine, by using a preset first end-to-end memory network (MemN2N) model, a common sense vector associated with the first query vector, and to form a question vector from the first query vector and the common sense vector;
the reply module 203 is configured to convert the question vector into a plurality of second query vectors by using a preset second GRU model, and to sequentially input each second query vector into a preset second MemN2N model to obtain a plurality of reply vectors;
the conversion module 204 is configured to convert each reply vector into a reply word, and combine all the reply words into reply information.
Specifically, the obtaining module 201 is configured to:
performing word segmentation on the question information, and forming a word sequence from the keywords obtained after segmentation; for a target keyword in the word sequence, calculating, with the first GRU model, the hidden influence factor that the target keyword passes to the keyword after it, according to the hidden influence factor passed to the target keyword by the keyword before it; and taking the hidden influence factor calculated for the last keyword in the word sequence as the first query vector u_1 corresponding to the question information.
Further, the questioning module 202 is specifically configured to:
in the 1st hop of the first MemN2N model, calculating the correlation value p_i between the first query vector u_1 and the i-th common sense head vector x_i in the preset common sense head group; calculating the question sub-vector a_1 of the 1st hop from the correlation value p_i of the i-th common sense head vector x_i and the i-th common sense tail vector y_i in the preset common sense tail group; adding the first query vector u_1 and the question sub-vector a_1 to obtain the first query vector u_2 of the 2nd hop; recalculating, from the first query vector u_2 of the 2nd hop, the question sub-vector a_2 of the 2nd hop and the first query vector u_3 of the 3rd hop, and so on, until the question sub-vector a_M of the M-th hop is calculated; and taking the question sub-vector a_M of the M-th hop as the question vector.
Further, the device further comprises:
the processing module is used for acquiring a common sense information base, wherein the common sense information base includes a plurality of pieces of common sense information represented in the form of knowledge triples, each including a head, a relationship and a tail; converting the head in each piece of common sense information into a common sense head vector through a preset first hidden-layer matrix, thereby forming a common sense head group; converting the tail in each piece of common sense information into a common sense tail vector through a preset second hidden-layer matrix, thereby forming a common sense tail group; and establishing the correspondence between the common sense head vectors and the common sense tail vectors according to the relationship part of each piece of common sense information.
Further, the reply module 203 is specifically configured to:
taking the question vector as the hidden influence factor h_0 of the first layer, and inputting a preset start-character vector s_0 into the second GRU model to obtain an output vector s_1 and a hidden influence factor h_1 passed to the second layer; inputting the output vector s_1 as a second query vector into the second MemN2N model to obtain a reply vector r_1; inputting the output vector s_1 and the hidden influence factor h_1 of the second layer back into the second GRU model to obtain an output vector s_2 and a hidden influence factor h_2 passed to the third layer, inputting the output vector s_2 into the second MemN2N model to obtain a reply vector r_2, and so on, until the output vector of the second GRU model is a preset end-character vector.
Further, the reply module 203 is configured to implement the step of outputting the vector s 1 Is input into the second end-to-end memory network MemN2N model as a second query vector to obtain a reply vector r 1 The method specifically comprises the following steps:
in the 1st cycle of the second end-to-end memory network MemN2N model, respectively calculating the correlation value pi between the second query vector s1 and the i-th reply head vector ki in the preset reply head group; calculating the reply sub-vector o1 of the 1st cycle according to the correlation value pi of the i-th reply head vector ki and the i-th reply tail vector li in the preset reply tail group; adding the second query vector s1 and the reply sub-vector o1 of the 1st cycle to obtain the second query vector s2 of the 2nd cycle; recalculating the reply sub-vector o2 of the 2nd cycle and the second query vector s3 of the 3rd cycle according to the second query vector s2 of the 2nd cycle, and so on, until the reply sub-vector oN of the N-th cycle is calculated; and taking the reply sub-vector oN of the N-th cycle as the reply vector r1.
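The multi-hop cycle inside the MemN2N model can be sketched as follows. The softmax normalization of the correlation values is an assumption borrowed from standard end-to-end memory networks; the patent itself only specifies a correlation value between the query and each head vector.

```python
import numpy as np

def memn2n_hops(s1, head_keys, tail_values, n_hops=3):
    """Multi-hop memory lookup: each cycle attends over the reply head group
    and folds the weighted reply tail vectors back into the query."""
    s = s1
    for _ in range(n_hops):
        p = np.exp(head_keys @ s)  # correlation of s with each head vector k_i
        p /= p.sum()               # softmax over memory slots (assumed normalization)
        o = p @ tail_values        # reply sub-vector o: weighted sum of tails l_i
        s = s + o                  # next cycle's query: s_{m+1} = s_m + o_m
    return o                       # o_N of the final cycle is the reply vector

rng = np.random.default_rng(2)
K = rng.normal(size=(5, 8))        # preset reply head group (5 slots, dim 8, both assumed)
L = rng.normal(size=(5, 8))        # preset reply tail group
r1 = memn2n_hops(rng.normal(size=8), K, L)
print(r1.shape)                    # (8,)
```

The same loop shape applies to the first MemN2N model, with the common sense head/tail groups in place of the reply groups.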
Still further, the processing module is further configured to:
obtaining a reply information base, wherein the reply information base includes a plurality of pieces of reply information expressed in the form of knowledge triplets, and each piece of reply information includes: a head portion, a relationship portion, and a tail portion; converting the head in each piece of reply information into a reply head vector through a preset translation embedding TransE algorithm, thereby forming a reply head group; converting the tail in each piece of reply information into a reply tail vector through the preset translation embedding TransE algorithm, thereby forming a reply tail group; and establishing a correspondence between the reply head vectors and the reply tail vectors according to the relationship part in each piece of reply information.
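TransE embeds each triplet so that head + relation ≈ tail in vector space. A minimal sketch follows; the entity and relation names are hypothetical, and the plain L2 pull below stands in for TransE's usual margin-based ranking loss with negative samples.

```python
import numpy as np

rng = np.random.default_rng(3)
DIM = 8
# TransE learns embeddings such that head + relation ≈ tail.
ent = {e: rng.normal(size=DIM) for e in ("greeting", "reply", "apology", "comfort")}
rel = {r: rng.normal(size=DIM) for r in ("followed_by",)}

def transe_score(h, r, t):
    """Lower is better: distance between (head + relation) and tail."""
    return np.linalg.norm(ent[h] + rel[r] - ent[t])

def sgd_step(h, r, t, lr=0.05):
    """One gradient step pulling head + relation toward tail (simplified L2 objective)."""
    diff = ent[h] + rel[r] - ent[t]
    ent[h] -= lr * diff
    rel[r] -= lr * diff
    ent[t] += lr * diff

before = transe_score("greeting", "followed_by", "reply")
for _ in range(50):
    sgd_step("greeting", "followed_by", "reply")
after = transe_score("greeting", "followed_by", "reply")
assert after < before  # training moved head + relation toward tail
```

After training, the entity vectors play the role of the reply head and tail groups that the second MemN2N model attends over.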
Example III
The present embodiment also provides a computer device capable of executing a program, such as a smart phone, a tablet computer, a notebook computer, a desktop computer, a rack-mounted server, a blade server, a tower server, or a cabinet server (including an independent server or a server cluster composed of a plurality of servers). As shown in fig. 3, the computer device 30 of the present embodiment includes at least, but is not limited to: a memory 301 and a processor 302, which may be communicatively connected to each other via a system bus. It is noted that FIG. 3 only shows the computer device 30 with components 301-302, but it should be understood that not all of the illustrated components are required to be implemented, and that more or fewer components may be implemented instead.
In this embodiment, the memory 301 (i.e., the readable storage medium) includes a flash memory, a hard disk, a multimedia card, a card-type memory (e.g., SD or DX memory), a random access memory (RAM), a static random access memory (SRAM), a read-only memory (ROM), an electrically erasable programmable read-only memory (EEPROM), a programmable read-only memory (PROM), a magnetic memory, a magnetic disk, an optical disk, and the like. In some embodiments, the memory 301 may be an internal storage unit of the computer device 30, such as a hard disk or memory of the computer device 30. In other embodiments, the memory 301 may also be an external storage device of the computer device 30, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) card, or a Flash memory card (Flash Card) equipped on the computer device 30. Of course, the memory 301 may also include both an internal storage unit and an external storage device of the computer device 30. In this embodiment, the memory 301 is typically used to store the operating system and the various types of application software installed on the computer device 30. In addition, the memory 301 can also be used to temporarily store various types of data that have been output or are to be output.
The processor 302 may be a central processing unit (Central Processing Unit, CPU), controller, microcontroller, microprocessor, or other data processing chip in some embodiments. The processor 302 is generally used to control the overall operation of the computer device 30.
Specifically, in the present embodiment, the processor 302 is configured to execute a program of a dialog generation method stored in the memory 301, and the program, when executed, implements the following steps:
acquiring questioning information, and converting the questioning information into a first query vector by using a preset first gate recursion unit GRU model;
determining a common sense vector associated with the first query vector by using a preset first end-to-end memory network MemN2N model according to the first query vector, and forming a question vector according to the first query vector and the common sense vector;
according to the question vector, converting the question vector into a plurality of second query vectors by using a preset second gate recursion unit GRU model, and sequentially inputting each second query vector into a preset second end-to-end memory network MemN2N model to obtain a plurality of reply vectors;
converting each reply vector into a reply word, and combining all the reply words into reply information.
The specific embodiment of the above method steps may refer to the first embodiment, and this embodiment is not repeated here.
Example IV
The present embodiment also provides a computer readable storage medium, such as a flash memory, a hard disk, a multimedia card, a card-type memory (e.g., SD or DX memory), a random access memory (RAM), a static random access memory (SRAM), a read-only memory (ROM), an electrically erasable programmable read-only memory (EEPROM), a programmable read-only memory (PROM), a magnetic memory, a magnetic disk, an optical disk, a server, an App application store, or the like, having stored thereon a computer program which, when executed by a processor, performs the following method steps:
acquiring questioning information, and converting the questioning information into a first query vector by using a preset first gate recursion unit GRU model;
determining a common sense vector associated with the first query vector by using a preset first end-to-end memory network MemN2N model according to the first query vector, and forming a question vector according to the first query vector and the common sense vector;
according to the question vector, converting the question vector into a plurality of second query vectors by using a preset second gate recursion unit GRU model, and sequentially inputting each second query vector into a preset second end-to-end memory network MemN2N model to obtain a plurality of reply vectors;
converting each reply vector into a reply word, and combining all the reply words into reply information.
The specific embodiment of the above method steps may refer to the first embodiment, and this embodiment is not repeated here.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element introduced by the phrase "comprising a …" does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises that element.
The foregoing embodiment numbers of the present invention are merely for the purpose of description, and do not represent the advantages or disadvantages of the embodiments.
From the above description of the embodiments, it will be clear to those skilled in the art that the methods of the above embodiments may be implemented by software plus a necessary general-purpose hardware platform, or alternatively by hardware alone; in many cases, the former is the preferred implementation.
The foregoing description covers only the preferred embodiments of the present invention and is not intended to limit the scope of the invention; any equivalent structure or equivalent process transformation made using the contents of this specification, whether applied directly or indirectly in other related technical fields, likewise falls within the scope of patent protection of the present invention.

Claims (9)

1. A method of dialog generation, the method comprising:
acquiring questioning information, and converting the questioning information into a first query vector by using a preset first gate recursion unit GRU model;
determining a common sense vector associated with the first query vector by using a preset first end-to-end memory network MemN2N model according to the first query vector, and forming a question vector according to the first query vector and the common sense vector;
according to the question vector, converting the question vector into a plurality of second query vectors by using a preset second gate recursion unit GRU model, and sequentially inputting each second query vector into a preset second end-to-end memory network MemN2N model to obtain a plurality of reply vectors;
converting each reply vector into reply words respectively, and combining all the reply words into reply information;
the method for obtaining the question vector comprises the steps of converting the question vector into a plurality of second query vectors by using a preset second gate recursion unit GRU model according to the question vector, sequentially inputting each second query vector into a preset second end-to-end memory network MemN2N model to obtain a plurality of answer vectors, and comprises the following steps:
taking the question vector as the hidden influence factor h0 of the first layer, and inputting a preset start character vector s0 into the second gate recursion unit GRU model to obtain an output vector s1 and a hidden influence factor h1 passed to the second layer;
inputting the output vector s1 into the second end-to-end memory network MemN2N model as a second query vector to obtain a reply vector r1;
inputting the output vector s1 and the hidden influence factor h1 of the second layer back into the second gate recursion unit GRU model to obtain an output vector s2 and a hidden influence factor h2 passed to the third layer, and inputting the output vector s2 back into the second end-to-end memory network MemN2N model to obtain a reply vector r2; and so on, until the output vector of the second gate recursion unit GRU model is a preset ending character vector.
2. The dialog generation method of claim 1, wherein the acquiring the question information and converting the question information into the first query vector using a preset first gate recursion unit GRU model includes:
performing word segmentation on the question information, and forming a word sequence from the plurality of keywords obtained after the word segmentation;
for a target keyword in the word sequence, calculating, by using the first gate recursion unit GRU model, the hidden influence factor that the target keyword passes to the keyword located after it in the word sequence, according to the hidden influence factor passed to the target keyword by the keyword located before it in the word sequence;
taking the hidden influence factor calculated for the last keyword in the word sequence as the first query vector corresponding to the question information.
3. The method of claim 2, wherein determining a common sense vector associated with the first query vector according to the first query vector using a preset first end-to-end memory network MemN2N model, and forming a question vector according to the first query vector and the common sense vector comprises:
in the 1st cycle of the first end-to-end memory network MemN2N model, respectively calculating the correlation value between the first query vector and the i-th common sense head vector in the preset common sense head group;
calculating the question sub-vector of the 1st cycle according to the correlation value of the i-th common sense head vector and the i-th common sense tail vector in the preset common sense tail group;
adding the first query vector and the question sub-vector of the 1st cycle to obtain the first query vector of the 2nd cycle;
recalculating the question sub-vector of the 2nd cycle and the first query vector of the 3rd cycle according to the first query vector of the 2nd cycle, and so on, until the question sub-vector of the M-th cycle is calculated; and
taking the question sub-vector of the M-th cycle as the question vector.
4. A dialog generation method according to claim 3, characterized in that the method further comprises:
obtaining a common sense information base; wherein the common sense information base includes a plurality of pieces of common sense information represented in the form of knowledge triplets, and each piece of common sense information includes: a head portion, a relationship portion, and a tail portion;
converting the head in each piece of common sense information into a common sense head vector through a preset first hidden layer matrix, thereby forming a common sense head group;
converting the tail in each piece of common sense information into a common sense tail vector through a preset second hidden layer matrix, thereby forming a common sense tail group;
and establishing a correspondence between the common sense head vectors and the common sense tail vectors according to the relationship part in each piece of common sense information.
5. The dialog generation method of claim 1, wherein the inputting the output vector s1 into the second end-to-end memory network MemN2N model as a second query vector to obtain the reply vector r1 comprises:
in the 1st cycle of the second end-to-end memory network MemN2N model, respectively calculating the correlation value pi between the second query vector s1 and the i-th reply head vector ki in the preset reply head group;
calculating the reply sub-vector o1 of the 1st cycle according to the correlation value pi of the i-th reply head vector ki and the i-th reply tail vector li in the preset reply tail group;
adding the second query vector s1 and the reply sub-vector o1 of the 1st cycle to obtain the second query vector s2 of the 2nd cycle;
recalculating the reply sub-vector o2 of the 2nd cycle and the second query vector s3 of the 3rd cycle according to the second query vector s2 of the 2nd cycle, and so on, until the reply sub-vector oN of the N-th cycle is calculated; and
taking the reply sub-vector oN of the N-th cycle as the reply vector r1.
6. The dialog generation method of claim 5, wherein the method further comprises:
obtaining a reply information base; wherein the reply information base includes a plurality of reply information expressed in the form of a knowledge triplet, and the reply information includes: a head portion, a relationship portion, and a tail portion;
converting the head in each piece of reply information into a reply head vector through a preset translation embedding TransE algorithm, thereby forming a reply head group;
converting the tail in each piece of reply information into a reply tail vector through the preset translation embedding TransE algorithm, thereby forming a reply tail group;
and establishing a corresponding relation between the reply head vector and the reply tail vector according to the relation part in each reply message.
7. A dialog generation device, the device comprising:
the acquisition module is used for acquiring the questioning information and converting the questioning information into a first query vector by utilizing a preset first gate recursion unit GRU model;
the questioning module is used for determining a common sense vector associated with the first query vector by using a preset first end-to-end memory network MemN2N model according to the first query vector, and forming a question vector according to the first query vector and the common sense vector;
the reply module is used for converting the question vector into a plurality of second query vectors by using a preset second gate recursion unit GRU model according to the question vector, and sequentially inputting each second query vector into a preset second end-to-end memory network MemN2N model to obtain a plurality of reply vectors;
the conversion module is used for respectively converting each reply vector into reply words and combining all the reply words into reply information;
wherein, the reply module is used for:
taking the question vector as the hidden influence factor h0 of the first layer, and inputting a preset start character vector s0 into the second gate recursion unit GRU model to obtain an output vector s1 and a hidden influence factor h1 passed to the second layer;
inputting the output vector s1 into the second end-to-end memory network MemN2N model as a second query vector to obtain a reply vector r1;
inputting the output vector s1 and the hidden influence factor h1 of the second layer back into the second gate recursion unit GRU model to obtain an output vector s2 and a hidden influence factor h2 passed to the third layer, and inputting the output vector s2 back into the second end-to-end memory network MemN2N model to obtain a reply vector r2; and so on, until the output vector of the second gate recursion unit GRU model is a preset ending character vector.
8. A computer device, the computer device comprising: memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the steps of the method according to any one of claims 1 to 6 when the computer program is executed.
9. A computer readable storage medium, on which a computer program is stored, characterized in that the computer program, when being executed by a processor, implements the steps of the method according to any one of claims 1 to 6.
CN202011059826.7A 2020-09-30 2020-09-30 Dialogue generation method, device, equipment and readable storage medium Active CN112199482B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202011059826.7A CN112199482B (en) 2020-09-30 2020-09-30 Dialogue generation method, device, equipment and readable storage medium
PCT/CN2021/091292 WO2022068197A1 (en) 2020-09-30 2021-04-30 Conversation generation method and apparatus, device, and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011059826.7A CN112199482B (en) 2020-09-30 2020-09-30 Dialogue generation method, device, equipment and readable storage medium

Publications (2)

Publication Number Publication Date
CN112199482A CN112199482A (en) 2021-01-08
CN112199482B true CN112199482B (en) 2023-07-21

Family

ID=74007267

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011059826.7A Active CN112199482B (en) 2020-09-30 2020-09-30 Dialogue generation method, device, equipment and readable storage medium

Country Status (2)

Country Link
CN (1) CN112199482B (en)
WO (1) WO2022068197A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112199482B (en) * 2020-09-30 2023-07-21 平安科技(深圳)有限公司 Dialogue generation method, device, equipment and readable storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110704588A (en) * 2019-09-04 2020-01-17 平安科技(深圳)有限公司 Multi-round dialogue semantic analysis method and system based on long-term and short-term memory network
CN111291534A (en) * 2020-02-03 2020-06-16 苏州科技大学 Global coding method for automatic summarization of Chinese long text
CN111400468A (en) * 2020-03-11 2020-07-10 苏州思必驰信息科技有限公司 Conversation state tracking system and method, and man-machine conversation device and method
CN111414460A (en) * 2019-02-03 2020-07-14 北京邮电大学 Multi-round dialogue management method and device combining memory storage and neural network

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106844368B (en) * 2015-12-03 2020-06-16 华为技术有限公司 Method for man-machine conversation, neural network system and user equipment
US10692099B2 (en) * 2016-04-11 2020-06-23 International Business Machines Corporation Feature learning on customer journey using categorical sequence data
US10803252B2 (en) * 2018-06-30 2020-10-13 Wipro Limited Method and device for extracting attributes associated with centre of interest from natural language sentences
CN109840255B (en) * 2019-01-09 2023-09-19 平安科技(深圳)有限公司 Reply text generation method, device, equipment and storage medium
CN110377719B (en) * 2019-07-25 2022-02-15 广东工业大学 Medical question and answer method and device
CN111143530B (en) * 2019-12-24 2024-04-05 平安健康保险股份有限公司 Intelligent reply method and device
CN112199482B (en) * 2020-09-30 2023-07-21 平安科技(深圳)有限公司 Dialogue generation method, device, equipment and readable storage medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111414460A (en) * 2019-02-03 2020-07-14 北京邮电大学 Multi-round dialogue management method and device combining memory storage and neural network
CN110704588A (en) * 2019-09-04 2020-01-17 平安科技(深圳)有限公司 Multi-round dialogue semantic analysis method and system based on long-term and short-term memory network
CN111291534A (en) * 2020-02-03 2020-06-16 苏州科技大学 Global coding method for automatic summarization of Chinese long text
CN111400468A (en) * 2020-03-11 2020-07-10 苏州思必驰信息科技有限公司 Conversation state tracking system and method, and man-machine conversation device and method

Also Published As

Publication number Publication date
WO2022068197A1 (en) 2022-04-07
CN112199482A (en) 2021-01-08

Similar Documents

Publication Publication Date Title
CN113591902B (en) Cross-modal understanding and generating method and device based on multi-modal pre-training model
CN110704588A (en) Multi-round dialogue semantic analysis method and system based on long-term and short-term memory network
CN110825857B (en) Multi-round question and answer identification method and device, computer equipment and storage medium
CN109344242B (en) Dialogue question-answering method, device, equipment and storage medium
CN107832300A (en) Towards minimally invasive medical field text snippet generation method and device
CN110619124A (en) Named entity identification method and system combining attention mechanism and bidirectional LSTM
CN116737938A (en) Fine granularity emotion detection method and device based on fine tuning large model online data network
CN112084301B (en) Training method and device for text correction model, text correction method and device
CN112199482B (en) Dialogue generation method, device, equipment and readable storage medium
CN112669215A (en) Training text image generation model, text image generation method and device
CN114445832A (en) Character image recognition method and device based on global semantics and computer equipment
CN113435210A (en) Social image text recognition method and device, computer equipment and storage medium
CN112988967A (en) Dialog generation method and device based on two-stage decoding, medium and computing equipment
CN112364602B (en) Multi-style text generation method, device, equipment and readable storage medium
WO2023108981A1 (en) Method and apparatus for training text generation model, and storage medium and computer device
CN112509559B (en) Audio recognition method, model training method, device, equipment and storage medium
CN116469359A (en) Music style migration method, device, computer equipment and storage medium
CN113420869B (en) Translation method based on omnidirectional attention and related equipment thereof
JP6633556B2 (en) Acoustic model learning device, speech recognition device, acoustic model learning method, speech recognition method, and program
CN115204366A (en) Model generation method and device, computer equipment and storage medium
CN112990434B (en) Training method of machine translation model and related device
CN115048926A (en) Entity relationship extraction method and device, electronic equipment and storage medium
CN113283241B (en) Text recognition method and device, electronic equipment and computer readable storage medium
CN111325068B (en) Video description method and device based on convolutional neural network
CN113095435A (en) Video description generation method, device, equipment and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant