CN106847271A

CN106847271A - A kind of data processing method and device for talking with interactive system

Info

Publication number: CN106847271A
Application number: CN201611139722.0A
Authority: CN
Inventors: 高参
Original assignee: Beijing Guangnian Wuxian Technology Co Ltd
Current assignee: Beijing Guangnian Wuxian Technology Co Ltd
Priority date: 2016-12-12
Filing date: 2016-12-12
Publication date: 2017-06-13

Abstract

A kind of data processing method and device for talking with interactive system, wherein, the method includes：Interaction data obtaining step, obtains the dialogue interaction data of user input；Dialog model result generation step, is parsed to dialogue interaction data, and analysis result is input in default dialogue generation model, obtains dialog model result, wherein, dialogue generation model is built based on NRM and enhancing learning algorithm；Feedback data exports step, generates corresponding feedback data according to dialog model result and exports.Compared to existing method, this method can solve the problem that the more conservative problem of the feedback result of the dialogue generation model generation of the coding layer decoding layer framework for being currently based on MLE object functions, so also allow for ultimately generating and exporting the feedback data more hommization to user and the actual interaction scenarios of laminating, so as to improve the Consumer's Experience of dialogue interactive system.

Description

A kind of data processing method and device for talking with interactive system

Technical field

The present invention relates to human-computer interaction technique field, specifically, it is related at a kind of data for talking with interactive system Reason method and device.

Background technology

With continuing to develop for science and technology, the introducing of information technology, computer technology and artificial intelligence technology, machine Industrial circle is progressively walked out in the research of people, gradually extend to the neck such as medical treatment, health care, family, amusement and service industry Domain.And people for the requirement of robot also conform to the principle of simplicity the multiple mechanical action of substance be promoted to anthropomorphic question and answer, independence and with The intelligent robot that other robot is interacted, man-machine interaction also just turns into the key factor for determining intelligent robot development.

Dialogue interactive system plays act foot light as the nucleus module in man-machine interactive system in man-machine interactive system The role of weight.In daily interactive process, the dialogue between user and robot is interacted and occupies most of ratio Example.Therefore, the quality of dialogue interactive system is directly connected to the Consumer's Experience of intelligent robot product, and then influences going for user Stay, the dialogue interaction capabilities for how lifting intelligent robot are then problem demanding prompt solutions.

The content of the invention

To solve the above problems, the invention provides a kind of data processing method for talking with interactive system, it includes：

Interaction data obtaining step, obtains the dialogue interaction data of user input；

Dialog model result generation step, parses to the dialogue interaction data, analysis result is input to default In dialogue generation model, dialog model result is obtained, wherein, the dialogue generation model is calculated based on NRM and enhancing study What method built；

Feedback data exports step, generates corresponding feedback data according to the dialog model result and exports.

According to one embodiment of present invention, the default dialogue generation model includes coding layer and decoding layer, wherein, institute The whole vector tables for stating the word segmentation result generation word segmentation result of the coding layer for being obtained according to parsing reach and part vector table Reach, and whole vector tables are reached and part vector table is up to splicing, obtain splicing result, the decoding layer is used to be based on Notice mechanism generates the dialog model result according to the splicing result.

According to one embodiment of present invention, the decoding layer is based on the neutral net of enhancing learning algorithm and engages in the dialogue mould The generation of type result.

According to one embodiment of present invention, in the enhancing learning algorithm, using MMI functions as reward function.

According to one embodiment of present invention, methods described also includes：

Modifying model step, obtains input data of the user for the feedback data, and according to the input number It is modified according to the default dialogue generation model.

Present invention also offers a kind of data processing equipment for talking with interactive system, it includes：

Interaction data acquisition module, its dialogue interaction data for being used to obtain user input；

Dialog model result-generation module, it is used to parse the dialogue interaction data, analysis result is input into To in default dialogue generation model, dialog model result is obtained, wherein, the dialogue generation model is based on NRM and enhancing What learning algorithm built；

Feedback data output module, it is used to generate corresponding feedback data according to the dialog model result and export.

According to one embodiment of present invention, the default dialogue generation model includes coding layer and decoding layer, wherein, institute Coding layer is stated to be configured to be reached and partly vectorial according to the whole vector tables for parsing the word segmentation result generation word segmentation result for obtaining Expression, and whole vector tables are reached and part vector table is up to splicing, splicing result is obtained, the decoding layer is configured to The dialog model result is generated according to the splicing result based on notice mechanism.

According to one embodiment of present invention, the decoding layer be configured to strengthen learning algorithm neutral net carry out The generation of feedback data.

According to one embodiment of present invention, described device also includes：

Modifying model module, it is used to obtain input data of the user for the feedback data, and according to described Input data is modified to the default dialogue generation model.

Compared to existing method, the data processing method for talking with interactive system provided by the present invention can be based on use The enquirement at family, generates more reasonably answer, meanwhile, this method can also solve to be currently based on the coding layer-solution of MLE object functions The more conservative problem of feedback result of the dialogue generation model generation of code layer framework, so also allows for ultimately generating and exporting Feedback data more hommization and the actual interaction scenarios of laminating to user, so as to improve the user's body of dialogue interactive system Test.

Additionally, this method can also by object function it is self-defined come cause dialogue interactive system and user between Dialogue interaction according to more have interest, diversity, interactivity and continuation, this be favorably improved dialogue interaction be User's viscosity of system.

Other features and advantages of the present invention will be illustrated in the following description, also, the partly change from specification Obtain it is clear that or being understood by implementing the present invention.The purpose of the present invention and other advantages can be by specification, rights Specifically noted structure is realized and obtained in claim and accompanying drawing.

Brief description of the drawings

In order to illustrate more clearly about the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existing There is the accompanying drawing wanted needed for technology description to do simple introduction：

Fig. 1 is according to an embodiment of the invention to realize that flow is shown for talk with the data processing method of interactive system It is intended to；

Fig. 2 is that dialog model result generation step according to an embodiment of the invention implements schematic flow sheet；

Fig. 3 is the structural representation of dialogue generation model according to an embodiment of the invention；

Fig. 4 is the structural representation for talking with the data processing equipment of interactive system according to an embodiment of the invention Figure.

Specific embodiment

Describe embodiments of the present invention in detail below with reference to drawings and Examples, how the present invention is applied whereby Technological means solves technical problem, and reaches the implementation process of technique effect and can fully understand and implement according to this.Need explanation As long as not constituting conflict, each embodiment in the present invention and each feature in each embodiment can be combined with each other, The technical scheme for being formed is within protection scope of the present invention.

Meanwhile, in the following description, many details are elaborated for illustrative purposes, to provide to of the invention real Apply the thorough understanding of example.It will be apparent, however, to one skilled in the art, that the present invention can be without tool here Body details or described ad hoc fashion are implemented.

In addition, can be in the such as one group department of computer science of computer executable instructions the step of the flow of accompanying drawing is illustrated Performed in system, and, although logical order is shown in flow charts, but in some cases, can be with different from herein Order perform shown or described step.

The data object of dialogue interactive system treatment mainly includes customer problem and answer.According to the affiliated number of customer problem According to field, dialogue interactive system is commonly divided into the question answering system towards defined domain, the question answering system towards open field and face To the dialogue interactive system of common problem collection.Originated according to the different pieces of information of answer, dialogue interactive system can be divided into based on knot The dialogue interactive system of structure data, the dialogue interactive system based on free text and the dialogue based on question and answer pair interaction system System.

Additionally, being divided according to the generation feedback mechanism of answer, it is right based on retrieval type that dialogue interactive system is further divided into Words interactive system and the dialogue interactive system based on production.Wherein, retrieval type dialogue interactive system is from the language for having existed The answer information with problem information best match, the degree of accuracy of the answer information that retrieval type dialogue interactive system is obtained are searched in material It is higher but adaptability is poor.In contrast, the answer information of production dialogue interactive system is by substantial amounts of language material and machine Learning algorithm parsing obtain, production dialogue interactive system obtained by answer information have good system suitability but at present Method there is no to ensure accuracy rate higher.

For the above mentioned problem in the presence of prior art, a kind of number for talking with interactive system is present embodiments provided According to processing method.

In order to clearly illustrate the reality of the data processing method for talking with interactive system that the present embodiment is provided Existing principle, implementation process and advantage, further say below in conjunction with schematic flow sheet is realized shown in Fig. 1 to the method It is bright.

As shown in figure 1, the data processing method for talking with interactive system that the present embodiment is provided is first in step The dialogue interaction data of user input is obtained in S101.It is pointed out that according to actual needs, the method is in step S101 Accessed dialogue interaction data both can be that the voice collected using audio collecting device (such as microphone etc.) is handed over Mutual data, or the text interaction data collected using text collection equipment (such as keyboard etc.), the present invention are not limited In this.

After above-mentioned dialogue interaction data is obtained, the method can be solved to above-mentioned dialogue interaction data in step s 102 Analysis, so as to obtain analysis result.Finally, can be input into for analysis result resulting in step S102 in step s 103 by the method To in default dialogue generation model, so as to generate corresponding dialog model according to above-mentioned analysis result by the dialogue generation model As a result.

In order to overcome existing retrieval type dialog generation system to generate the problem of answer information poor accuracy, the present embodiment In, the method employs the default dialogue generation model built based on NRM and enhancing learning algorithm in step s 103.

Fig. 2 is parsed and corresponding using generation model generation is talked with showing the present embodiment to dialogue interaction data Dialog model result implement schematic flow sheet, Fig. 3 shows the structural representation for talking with generation model in the present embodiment Figure, is further elaborated below in conjunction with Fig. 2 and Fig. 3.

As shown in Fig. 2 when the method is parsed to above-mentioned dialogue interaction data, preferably in step s 201 to above-mentioned Dialogue interaction data carries out word segmentation processing, so as to obtain talking with the word segmentation result of interaction data.It is pointed out that in the present invention Different embodiments in, according to actual needs, the method to dialogue interaction data carry out word segmentation processing when, can both use base In the mode of string matching, it would however also be possible to employ the mode based on statistics and machine learning, or it is using other reasonable sides Formula, the invention is not restricted to this.

After the analysis result for obtaining talking with interaction data, the method can be in step S202 according to gained in step S201 To word segmentation result come utilize default dialogue generation model generate whole vector tables of above-mentioned word segmentation result up to and part vector table Reach.

In the present embodiment, the dialogue generation model that the method is used includes coding layer and decoding layer.Wherein, coding layer RNN models are employed with decoding layer.Different from traditional FNN (Feed-forward Neural Networks, feed-forward Neutral net) model, RNN models introduce directed circulation, its problem that can process forward-backward correlation between those inputs.

It, again to output layer, is between layers full connection that traditional neural network model is from input layer to hidden layer, Node between every layer is connectionless.But this common neutral net is for many problems but helpless.For example, you What the next word for predicting sentence is, generally requires the word used above, because front and rear word is simultaneously in a sentence It is not independent.

In recent years, deep neural network is furtherd investigate and achieved in the tasks such as image classification, speech recognition Prominent effect, shows excellent expression learning ability.At the same time, language learning is represented by deep neural network It has been increasingly becoming a new research tendency.

However, flexible and changeable and semantic information complicated abstract due to human language so that deep neural network mould Type shows that application in study is faced than in the bigger challenge of image, voice in language table.First, compared to voice and image, language Speech is tone artifacts, is entirely that the notation for being produced by brain and being processed, is human civilization wisdom in human civilization process Height embody, the variability and flexibility ratio of language are considerably beyond image and voice signal.Second, image and voice have clearly Mathematical notation, such as gray level image is numerical matrix mathematically, and its represent minimum particle size element have determination Physical significance, the value of each point of image pixel represents certain gray scale color-values.

Therefore the method that the present embodiment is provided employs NRM combination RNN neutral nets to be analyzed, and why is RNN Neural network is referred to as circulated, because the current output of sequence is also relevant with output above.Specifically the form of expression is Network can be remembered and be applied in the calculating of current output to information above, i.e., the node between hidden layer is no longer without even Connect but have connection, and the input of hidden layer not only exports also defeated including last moment hidden layer including input layer Go out.In theory, RNN can be processed the sequence data of any length.

As shown in figure 3, in the present embodiment, the input data of each neuron is to divide in the input layer Input X of coding layer Each participle in word result, such as xi represents i-th participle in word segmentation result.It is each in the hidden layer Layer H of coding layer Individual neuron is preferably a nonlinear function (such as logistic function, general using LSTM/GRU cores), right For i-th neuron hi in hidden layer, the input of the neuron is hi-1 and xi.In the present embodiment, coding layer it is hidden Tibetan layer is output as the whole vector tables to analysis result and reaches.

In the present embodiment, based on NRM, coding layer employs mixed mechanism, specifically, similar with hidden layer Layer H, Each neuron is preferably a nonlinear function in neuronal layers Layer G, comes for i-th neuron gi therein Say, the input of the neuron is xi and gi-1.Neuronal layers Layer G are output as a part vector to analysis result Expression.

From foregoing description as can be seen that the present embodiment in, coding layer by using mixed mechanism come according to step S201 Analysis result obtain corresponding vector table and reach, this mode can not only realize the dialogue interaction data being input into user All hold, be also just so the feedback for finally being generated while the detailed information of dialogue interaction data can also be sufficiently reserved The degree of accuracy of data and relevance grade provide the foundation.

As shown in Figures 2 and 3, in the present embodiment, reached and part vector in the whole vector tables for obtaining above-mentioned analysis result After expression, the method can reach to whole vector tables of above-mentioned analysis result in step S203 and part vector table is up to spelling Connect, so as to obtain splicing result.

After splicing result is obtained, the method can be come according to resulting in step S203 in step S204 by decoding layer Splicing result generates dialog model result.Specifically, in the present embodiment, based on NRM, decoding layer is preferably by based on notice The mode of mechanism to generate corresponding dialog model result according to splicing result resulting in step S203.

To sum up, NRM uses a kind of mixed mechanism in coded portion, so that the sequence that coding obtains intermediate representation can not only The overall assurance of user's sentence information is enough realized, while the detailed information of sentence can also be sufficiently reserved.And adopted in decoded portion With notice mechanism, so that the complex interactive mode in the grasp question answering process that generation model can be relatively easy.

Further, in the present embodiment, decoding layer generates dialogue using the circulation neural network based on enhancing learning algorithm Model result.Enhancing study is a kind of learning method different from supervised learning, unsupervised learning, and it can solve the problem that similar playing chess In to do the problem that a series of decision-making just can determine that final result, and this exactly supervised learning and unsupervised learning institute solution be never .

Enhancing study is made up of intelligent body agent and the parts of environment environment two, it comprises several big elements, i.e., State set state (i.e. all states being likely to occur, corresponding to the location sets of chess piece in the problem of playing chess), action collection action (i.e. all possible action, corresponding to the scheme of beginning of chess piece in the problem of playing chess), decision function are (i.e. in certain state state Generate the process of some action action) and reward function reward function (i.e. for rewarding the target letter of action Number).So, the whole decision process of intelligent body agent is just analogous to " state-(action)-state-(action)-state-... " Flow.

For existing enhancing learning algorithm, it is generally used as target letter using the negative logarithm of maximum likelihood function Number.Maximal possibility estimation (MLE) knowledge-chosen makes data i.e. (i.e. decoding layer the exports elongated sentence) probability of occurrence observed The value of maximum parameter (parameter i.e. in coding layer-decoding layer).However, existing this enhancing learning algorithm is easily caused institute The answer information for being directed to problem information for obtaining is too conservative.That is, in the case where a problem information is given, using maximum seemingly The negative logarithm of right function is tended to generate a too conservative answer information as the enhancing learning algorithm of object function, and this is answered Case information is not easy mistake semantically occur, but simultaneously also without actual meaning.

For example, for the problem information of " headache of coming to school is caught a cold ", using the negative logarithm of maximum likelihood function as mesh The enhancing learning algorithm of scalar functions would generally be obtained such as " I is closed the border in practice, and we giocoso play again after a while ".

Regarding to the issue above, the method that the present embodiment is provided is in step S204 based on enhancing study according to step S203 In resulting result when generating corresponding dialog model result, its object function employs maximum mutual information (MMI) function.With Based on unlike MLE object functions, the enhancing learning algorithm based on MMI functions can set up input sentence and output sentence it Between mutual information.

As shown in figure 3, in the present embodiment, used as an intelligent body, the parameter in decoding layer is used as decision-making for decoding layer , used as a state, the output data of each neuron is used as action for each neuron in policy, Layer Y Action, reward function is MMI functions.It is pointed out that in other the present embodiment of the invention, reward function can also It is the linear combination of more multiple objective function to increase, and realizes talking with the difference in functionality purpose of generation model with this.

In the present embodiment, decoding layer preferably includes three layers of neuronal structure, wherein, each god of input layer Layer C The correspondence vector in above-mentioned splicing result can be received through unit.It is pointed out that in the present embodiment, input layer Layer C's is every The vector that individual neuron is received is differed.

Layer H as decoding layer hidden layer, the input of its i-th neuron is (si-1, yi-1, ci), is output as si.Wherein, si-1 represents the i-th -1 output of neuron in hidden layer, and yi-1 represents in output layer the defeated of the i-th -1 neuron Go out, ci represents i-th output of neuron in input layer.

Output Y as decoding layer output layer, the input of its i-th neuron is (si, yi-1, ci), is output as yi.The output data of the output layer dialog model result that namely entirely dialogue generation model is generated according to analysis result, The result can be represented with (y1, y2 ... yT ').Wherein, yi is i-th word in dialog model result.It is defeated in the present embodiment Go out layer preferably by softmax nonlinear functions to realize.Certainly, in other embodiments of the invention, output layer may be used also Realized with other rational functions, the invention is not restricted to this.

Again as shown in figure 1, in the present embodiment, after dialog model result is obtained, the method can in step S104 root Corresponding feedback data is generated according to the dialog model result obtained by step S103 and is exported.It is pointed out that in this hair In bright different embodiments, according to actual needs, feedback data of the method generated in step S104 can be using difference Appearance form (such as text, voice, limb action or image etc.), the invention is not restricted to this.

For example, for the dialogue interaction data " headache of coming to school is caught a cold " of user input, the method is in step S104 The voice feedback data of input " I is also headache " can be generated according to dialog model result resulting in step S103.

Again for example, for the dialogue interaction data " annual New Year's Day all carrys out high fever " of user input, existing data processing side Method would generally obtain such as " annual New Year's Day all goes to a hospital to see a doctor ", and the method that the present embodiment is provided is added by coding layer Hidden layer Layer H and/or Layer G, so that dialogue interactive system can more accurately parse above-mentioned dialogue interaction The semanteme of data, and then such as " best wishes for a speedy recovery, healthy " can be generated.

It is pointed out that in the present embodiment, in order that interactive system must be talked with can preferably be interacted with user, The method can also be modified to above-mentioned default dialogue generation model.Specifically, as shown in figure 1, the method can be in step Input data of the user for above-mentioned feedback data is obtained in S105, and is entered data to default dialogue generation model according to this It is modified.

As can be seen that compared to existing method from foregoing description, the present embodiment provided for talking with interactive system Data processing method enable to the elongated sentence can preferably to be expressed by neutral net.Meanwhile, this method can also be solved The feedback result for being certainly currently based on the dialogue generation model generation of the coding layer-decoding layer framework of MLE object functions is more guarded Problem, so also allow for ultimately generating and export to the actual interaction field of feedback data more hommization and laminating of user Scape, so as to improve the Consumer's Experience of dialogue interactive system.

Present invention also offers a kind of data processing equipment for talking with interactive system, Fig. 4 is shown in the present embodiment The structural representation of the data processing equipment.

As shown in figure 4, being preferably included for talking with the data processing equipment of interactive system of being provided of the present embodiment：Hand over Mutual data acquisition module 401, dialog model generation module 402, feedback data output module 403 and Modifying model module 404. Wherein, interaction data acquisition module 401 is used to obtain the dialogue interaction data of user input.It is pointed out that according to reality Need the difference of the form of the dialogue interaction data for obtaining, interaction data acquisition module 401 using different circuit or can set It is standby to realize, the invention is not restricted to this.For example, interaction data acquisition module 401 can be by audio collecting device (such as Mike Wind etc.) realize, gathering corresponding interactive voice data；Interaction data acquisition module 401 can be with text collection equipment (example Such as keyboard) realize, to gather corresponding text interaction data.

After dialogue interaction data is obtained, interaction data acquisition module 401 can be transmitted to right above-mentioned dialogue interaction data Words model result generation module 402.Dialog model result-generation module 402 can be solved to the dialogue interaction data for receiving Analysis, and the default dialogue generation module corresponding dialog model result of generation utilize according to analysis result.

The dialog model result that dialog model result-generation module 402 can generate itself is transmitted to feedback data and exports mould Block 403, with by feedback data output module 403 is according to the corresponding feedback data of above-mentioned dialog model result generation and exports.

In the present embodiment, in order that interactive system must be talked with can preferably be interacted with user, data processing dress Putting can also be modified using Modifying model module 404 to above-mentioned default dialogue generation model.Specifically, Modifying model module 404 can obtain input data of the user for above-mentioned feedback data, and be entered data to default dialogue generation model according to this It is modified.

It is pointed out that in the present embodiment, dialog model result-generation module 402, feedback data output module 403 with And Modifying model module 404 realizes its each concrete principle and flow of function and step S102 in above-mentioned Fig. 1 to step Content that S105 is illustrated is similar to, thus herein no longer dialog model result-generation module 402, feedback data output module 403 with And the related content of Modifying model module 404 is repeated.

It should be understood that disclosed embodiment of this invention is not limited to ad hoc structure disclosed herein or treatment step Suddenly, the equivalent substitute of these features that those of ordinary skill in the related art are understood should be extended to.It should also be understood that It is that term as used herein is only used for describing the purpose of specific embodiment, and is not intended to limit.

" one embodiment " or " embodiment " mentioned in specification means special characteristic, the structure for describing in conjunction with the embodiments Or characteristic is included at least one embodiment of the present invention.Therefore, the phrase " reality that specification various places throughout occurs Apply example " or " embodiment " same embodiment might not be referred both to.

Although above-mentioned example is used to illustrate principle of the present invention in one or more applications, for the technology of this area For personnel, in the case of without departing substantially from principle of the invention and thought, hence it is evident that can in form, the details of usage and implementation It is upper various modifications may be made and without paying creative work.Therefore, the present invention is defined by the appended claims.

Claims

1. a kind of data processing method for talking with interactive system, it is characterised in that including：

Dialog model result generation step, is parsed to the dialogue interaction data, and analysis result is input into default dialogue In generation model, dialog model result is obtained, wherein, the dialogue generation model is based on NRM and enhancing learning algorithm structure Build；

2. the method for claim 1, it is characterised in that the default dialogue generation model includes coding layer and decoding Layer, wherein, the coding layer be used for according to the word segmentation result that obtains of parsing generate whole vector tables of the word segmentation result up to and Part vector table reaches, and whole vector tables is reached and part vector table is up to splicing, and obtains splicing result, the decoding Layer according to the splicing result based on notice mechanism for generating the dialog model result.

3. method as claimed in claim 2, it is characterised in that the neutral net that the decoding layer is based on enhancing learning algorithm is entered The generation of row dialog model result.

4. method as claimed in claim 3, it is characterised in that in the enhancing learning algorithm, using MMI functions as reward Function.

5. the method as any one of Claims 1 to 4, it is characterised in that methods described also includes：

Modifying model step, obtains input data of the user for the feedback data, and according to the input data pair The default dialogue generation model is modified.

6. a kind of data processing equipment for talking with interactive system, it is characterised in that including：

Dialog model result-generation module, it is used to parse the dialogue interaction data, analysis result is input to pre- If in dialogue generation model, obtaining dialog model result, wherein, the dialogue generation model is based on NRM and enhancing study What algorithm built；

7. the device that such as claim 6 is stated, it is characterised in that the default dialogue generation model includes coding layer and decoding layer, Wherein, the coding layer is configured to be reached and portion according to the whole vector tables for parsing the word segmentation result generation word segmentation result for obtaining Point vector table reaches, and whole vector tables is reached and part vector table is up to splicing, and obtains splicing result, the decoding layer It is configured to notice mechanism and the dialog model result is generated according to the splicing result.

8. device as claimed in claim 7, it is characterised in that the decoding layer is configured to strengthen the nerve of learning algorithm Network carries out the generation of feedback data.

9. device as claimed in claim 8, it is characterised in that in the enhancing learning algorithm, using MMI functions as reward Function.

10. the device as any one of claim 6~9, it is characterised in that described device also includes：

Modifying model module, it is used to obtain input data of the user for the feedback data, and according to the input Data are modified to the default dialogue generation model.