CN106445147B - The behavior management method and device of conversational system based on artificial intelligence - Google Patents
The behavior management method and device of conversational system based on artificial intelligence Download PDFInfo
- Publication number
- CN106445147B CN106445147B CN201610862653.XA CN201610862653A CN106445147B CN 106445147 B CN106445147 B CN 106445147B CN 201610862653 A CN201610862653 A CN 201610862653A CN 106445147 B CN106445147 B CN 106445147B
- Authority
- CN
- China
- Prior art keywords
- behavior
- user
- candidate
- system action
- module
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
Abstract
The embodiment of the invention discloses the behavior management method and devices of the conversational system based on artificial intelligence.This method comprises: generating current session feature according to current system interaction mode, current user state and system action sequence;According to the system action trigger model that the current session feature and training obtain, the candidate system behavior of user-association is selected from the system action sequence;It is interacted according to the candidate system behavior with the user.In the technical solution of the embodiment of the present invention, candidate system behavior is determined according to system action trigger model, candidate system behavior is determined by the system action triggering rule according to static configuration in compared with the prior art, improves flexibility and the generalization ability of conversational system.
Description
Technical field
The present invention relates to human-computer interaction technique fields, more particularly to the behavior management side of the conversational system based on artificial intelligence
Method and device.
Background technique
Artificial intelligence (Artificial Intelligence), english abbreviation AI.It is research, develop for simulating,
Extend and the theory of the intelligence of extension people, method, a new technological sciences of technology and application system.Artificial intelligence is to calculate
One branch of machine science, it attempts to understand essence of intelligence, and produce it is a kind of new can be in such a way that human intelligence be similar
The intelligence machine made a response, the research in the field include robot, language identification, image recognition, natural language processing and specially
Family's system etc..
The behavior management scheme of existing conversational system is the side for using static rule to configure by relevant product experience
Formula come generate candidate system behavior and selection optimizer system behavior.Namely according to current specific products application, configuring
This is filled in file using relevant system action triggering and ordering rule, and is used when selected behavior executes and matched in advance
The static rule set predicts subsequent user behavior.
The characteristics of due to existing rule-based candidate behavior triggering and ordering rule being all according to specific products, is artificial
Manual configuration.Thus, the prior art has some shortcomings below: 1) being compared using the vertical class conversation process that static rule configures
It is fixed, it is merely able to complete specific logic in rule, it is inflexible;2) static rule be the specific logic based on specific products come
It is configured, does not have generalization ability, it can not be by these rules in others vertical class and product.
Summary of the invention
In view of this, the embodiment of the present invention provides the behavior management method and device of the conversational system based on artificial intelligence,
To improve flexibility and the generalization ability of the conversational system based on artificial intelligence.
In a first aspect, the embodiment of the invention provides the behavior management methods of the conversational system based on artificial intelligence, comprising:
According to current system interaction mode, current user state and system action sequence, current session feature is generated;
According to the system action trigger model that the current session feature and training obtain, from the system action sequence
The candidate system behavior of middle selection user-association;
It is interacted according to the candidate system behavior with the user.
Second aspect, the embodiment of the invention provides the behavior management devices of the conversational system based on artificial intelligence, comprising:
Current signature generation module is used for according to current system interaction mode, current user state and system action sequence,
Generate current session feature;
Candidate behavior selecting module, the system action trigger mode for being obtained according to the current session feature and training
Type selects the candidate system behavior of user-association from the system action sequence;
System action decision-making module, for being interacted according to the candidate system behavior with the user.
Technical solution provided in an embodiment of the present invention first passes through the rule that machine learning triggers system action in advance and builds
Mould obtains system action trigger model, then foundation current system interaction mode, current user state and system action sequence,
Current session feature is generated, and determines candidate system behavior according to current session feature and system action trigger model, that is, we
Candidate system behavior is determined according to system action trigger model in case, compared with the prior art in by according to static configuration
System action triggering rule determine candidate system behavior, improve the flexibility of the conversational system based on artificial intelligence and general
Change ability.
Detailed description of the invention
Fig. 1 is the process of the behavior management method for the conversational system based on artificial intelligence that the embodiment of the present invention one provides
Figure;
Fig. 2 is the process of the behavior management method of the conversational system provided by Embodiment 2 of the present invention based on artificial intelligence
Figure;
Fig. 3 is the process of the behavior management method for the conversational system based on artificial intelligence that the embodiment of the present invention three provides
Figure;
Fig. 4 is the signal of the behavior management method for the conversational system based on artificial intelligence that the embodiment of the present invention three provides
Figure;
Fig. 5 is the structure of the behavior management device for the conversational system based on artificial intelligence that the embodiment of the present invention four provides
Figure.
Specific embodiment
The present invention is described in further detail with reference to the accompanying drawings and examples.It is understood that this place is retouched
The specific embodiment stated is used only for explaining the present invention rather than limiting the invention.It also should be noted that in order to just
Only the parts related to the present invention are shown in description, attached drawing rather than entire infrastructure.
Embodiment one
Fig. 1 is the process of the behavior management method for the conversational system based on artificial intelligence that the embodiment of the present invention one provides
Figure.The method of the present embodiment can be executed by the behavior management device of the conversational system based on artificial intelligence, which can pass through
The mode of hardware and/or software is realized.The method of the present embodiment is generally applicable to conversational system and user carries out human-computer interaction
Situation.With reference to Fig. 1, the behavior management method of the conversational system provided in this embodiment based on artificial intelligence be can specifically include
It is as follows:
S11, foundation current system interaction mode, current user state and system action sequence, generate current session feature.
Conversational system is interacted using chat robots interactive frame with user, which includes: NLU
((Natural Language Understanding, natural language understanding) module, for understanding the natural language of user as used
The natural language of user is converted to the structured representation that machine is understood that by the query language at family;UST(User Status
Updates, User Status update) module, the dialog state information of user is updated for the output according to NLU module, wherein using
The dialog state information at family includes system interaction state, user's intention and User Status etc.;System action triggers (Action-
Trigger) module picks out a series of subsequent may hold for the dialog state information according to the updated user of UST module
Capable candidate system behavior constitutes candidate system behavior list;Behaviour decision making (Policy) module, for system trigger module
The candidate system behavior of triggering is ranked up and selects an optimizer system behavior, and predicts subsequent user behavior;
Best behavior executes (Action-Exe) module: the optimizer system behavior of process performing decision-making module selection;NLG(Natural
Language Generation, spatial term) module, for being carried out according to the implementing result of best behavior execution module
Spatial term generates the natural language result for being finally presented to user.
In the present embodiment, current system interaction mode is used to characterize user's currently some locating system stages of interaction,
Such as start state phase, clear state stage and recommends state phase.Current user state may include the need of user's epicycle
Intention is sought, such as obtaining the intention of making a reservation of restaurant information, more wheel demand values of the user in different demands dimension are such as being looked for
Under the scene of restaurant, flavor that user can go Sichuan cuisine and Guangdong dishes etc. different in this demand slot position of flavor at the restaurant.System action
Sequence refers to the sequence being made of system action, and system action refers to the behavior that conversational system is able to carry out, such as system row
To be that recommendation movement, clarification movement and information meet etc..
Specifically, NLU module obtains the current natural language of user, and by the current natural language processing of user at structure
Changing indicates, UST module determines current user state according to structured representation information, and then, the acquisition of system action trigger module is worked as
Preceding User Status, and determine current system interaction mode and system action sequence, and generate current session feature.Due to current right
Words are characterized according to current system interaction mode, current user state and the generation of system action sequence, thus current session is special
Sign contains the feature of current system interaction mode, current user state and each system action.
S12, the system action trigger model obtained according to the current session feature and training, from the system action
The candidate system behavior of user-association is selected in sequence.
In the present embodiment, system action trigger model can be it is pre- first pass through what robot learning off-line training obtained,
For picking out a series of subsequent possible candidate system behaviors executed.
Illustratively, the system action trigger model can be trained in the following way and be obtained: based on artificial mark number
According to determining the first incidence relation between business scenario and system action and second between User Status and system action
Incidence relation;According to first incidence relation and second incidence relation, universal interaction feature is extracted, wherein described general
Interaction feature includes system interaction state, User Status, user is intended to and the implementing result of upper wheel system behavior;According to extraction
Universal interaction feature, construct the system action trigger model.
Wherein, user is intended to refer to that the demand of user is intended to, and such as obtains the intention of making a reservation of restaurant information.Specifically,
Marked when needing to be implemented system action in dialog procedure personnel artificially mark between business scenario and system action the
The second incidence relation between one incidence relation and User Status and system action, mark personnel can also be to business scenario
Under dialog logic be labeled, wherein business scenario can be tourism scene, scene of making a reservation, ticket booking scene or amusement and recreation field
Scape etc..Universal interaction feature is extracted subsequently, based on labor standard information such as incidence relation, and using machine learning model to system
Behavior triggering rule carries out off-line modeling, can such as be based on decision-tree model, touch according to the system action of universal interaction feature construction
Send out model.
It is that the system action trigger model obtained according to machine learning determines by candidate system behavior in this present embodiment,
The machine learning mode makes behavior triggering logic have flexibility and generalization ability compared to pure static configuration, so as to push away
The wide vertical class for arriving different field.
S13, it is interacted according to the candidate system behavior with the user.
Technical solution provided in this embodiment first passes through the rule that machine learning triggers system action in advance and models,
System action trigger model is obtained, then according to current system interaction mode, current user state and system action sequence, is generated
Current session feature, and candidate system behavior is determined according to current session feature and system action trigger model, that is, in this programme
Candidate system behavior determines according to system action trigger model, compared with the prior art in by being according to static configuration
System behavior triggering rule determines candidate system behavior, improves flexibility and the generalization ability of conversational system.
Illustratively, may include: before the candidate system behavior for selecting user-association in the system action sequence
Based on preset behavior configuration rule, prescreening processing is carried out to the system action for including in the system action sequence.
Illustratively, may include: after the candidate system behavior for selecting user-association in the system action sequence
Based on preset behavior configuration rule, additions and deletions intervention processing is carried out to the candidate system behavior.
The triggering of candidate behavior is modeled by the way of machine learning, and the static behavior with human configuration
Configuration rule completes the triggering work of candidate behavior jointly, and triggering logic has certain flexibility and general compared to pure static configuration
Change ability can be generalized to the vertical class in different fields.
Embodiment two
The present embodiment provides the new conversational system based on artificial intelligence of one kind on the basis of the above embodiment 1
Behavior management method.Fig. 2 is the behavior management method of the conversational system provided by Embodiment 2 of the present invention based on artificial intelligence
Flow chart.With reference to Fig. 2, the behavior management method of the conversational system provided in this embodiment based on artificial intelligence be can specifically include
It is as follows:
S21, foundation current system interaction mode, current user state and system action sequence, generate current session feature.
S22, the system action trigger model obtained according to the current session feature and training, from the system action
The candidate system behavior of user-association is selected in sequence.
S23, the enhancing study order models obtained according to online incremental training arrange the candidate system behavior
Sequence, and optimizer system behavior is determined according to ranking results.
After candidate behavior triggering, enhancing learns each candidate system behavior that order models are triggered, and foundation
Current system conditions and current user state are ranked up each candidate system behavior, obtain optimizer system behavior.
Illustratively, on-line training obtains the enhancing study order models in the following way: according to system interaction shape
State, User Status, user's intention, the environmental feedback information of the candidate system behavior and the candidate system behavior, pass through
Online incremental training obtains the enhancing study order models.
In the present embodiment, the environmental feedback information of the candidate system behavior may include that user clicks behavior, user
Lower single act, user's return information and user's evaluation information.For example, being used as epicycle if epicycle interactive user has click behavior
Interactive positive feedback is used as the negative-feedback of epicycle interaction if epicycle interactive user does not click behavior.
Enhancing study (Reinforcement Learning) model is selected to model the sequence of candidate behavior.Enhancing
Study is also referred to as intensified learning, is one of the research hotspot of machine learning in recent years and field of intelligent control.Enhancing study is intended to
By in the case where being participated in without extraneous " teacher ", intelligence system (Agent) itself constantly with environmental interaction or trial and error, according to
Feedback Evaluation signal adjustment movement obtains optimal strategy to adapt to environment.Compared to supervised learning, enhance the process packet of study
Containing several elements: 1) adaptability, i.e. intelligence system constantly improve model performance using environmental feedback information;2) reactive, i.e.,
Intelligence system can directly acquire state action rule from experience;3) incremental nature, i.e. intensified learning are a kind of increment types
It practises, can use online.
To sum up, conversational system is obtained user and is believed the environmental feedback of candidate system behavior by the continuous dialogue with user
Breath carries out self-teaching and adjustment, completes the study of online increment type, obtains enhancing study order models.With quantity of study
Increase, the effect of order models is constantly promoted.
S24, it is interacted according to the optimizer system behavior with the user.
Specifically, behaviour decision making module after selecting optimizer system behavior in candidate system behavior, executes optimizer system
Behavior, NLG module generate the natural language result for being finally presented to user according to the implementing result of optimizer system behavior.
In technical solution provided in this embodiment, conversational system trains obtained system action trigger mode according to machine learning
Type determines candidate system behavior, and by the enhancing that online incremental training obtains learn order models to candidate system behavior into
Row sequence, obtains optimizer system behavior, and interact with user according to optimizer system behavior.Since enhancing learns order models
It is that conversational system obtains environmental feedback information by the continuous dialogue with user, and carries out self according to environmental feedback information and learn
Practise and adjustment obtain, thus the sort method flexibly, it is accurate and there is versatility.
Embodiment three
The present embodiment provides a kind of new based on artificial intelligence on the basis of above-described embodiment one and embodiment two
The behavior management method of conversational system.Fig. 3 is the behavior for the conversational system based on artificial intelligence that the embodiment of the present invention three provides
The flow chart of management method.With reference to Fig. 3, the behavior management method of the conversational system provided in this embodiment based on artificial intelligence has
Body may include as follows:
S31, foundation current system interaction mode, current user state and system action sequence, generate current session feature.
S32, the system action trigger model obtained according to the current session feature and training, from the system action
The candidate system behavior of user-association is selected in sequence.
S33, the enhancing study order models obtained according to online incremental training arrange the candidate system behavior
Sequence, and optimizer system behavior is determined according to ranking results.
S34, it is interacted according to the optimizer system behavior with the user.
S35, the corresponding candidate boot options of the optimizer system behavior are determined.
In the present embodiment, candidate boot options are used to guide the candidate actions of user.
S36, the enhancing learning behavior prediction model obtained according to online incremental training, from the candidate boot options
Select best boot options.
Specifically, enhancing learning behavior prediction model is according to current system conditions and works as after obtaining candidate boot options
Preceding User Status is ranked up each candidate boot options, obtains best boot options, that is, predicts lower whorl user behavior.
Illustratively, on-line training obtains the enhancing learning behavior prediction model in the following way: handing over according to system
The mutually environmental feedback information of state, User Status, user's intention, the candidate boot options and the candidate boot options,
The enhancing learning behavior prediction model is obtained by online incremental training.
In the present embodiment, the environmental feedback information of the candidate boot options may include user's return information and use
Family evaluation information, for example, use click behavior of the user for the boot options of displaying as the positive feedback of the boot options, if
It is then negative-feedback that the boot options, which are not clicked,.
To sum up, with reference to Fig. 4, off-line training obtains system action in the following way before conversational system is interacted with user
Trigger model: obtaining the labeled data set manually marked, and characteristic extracting module extracts universal interaction feature from normal data,
And system action trigger model is obtained according to training under universal interaction characteristic curve.It is in user interaction process, NLU module obtains
And the natural language of structured representation user, processing result is transferred to UST module, UST module updates the status information of user,
And the status information of user is transferred to system action trigger module, system action trigger module is according to system action trigger mode
Type, the preconfigured static rule of the status information of user and intention and the pre-execution information of system action determine candidate system
System behavior, and the candidate actions list comprising candidate system behavior is transferred to behaviour decision making module.On the one hand, behaviour decision making mould
Block obtains order models to the feedback information on-line study of candidate system behavior according to user, according to order models to candidate system
Behavior is ranked up, and determines optimizer system behavior according to ranking results;On the other hand, behaviour decision making module determines optimizer system
The candidate path of navigation of behavior obtains behavior prediction model to the feedback information on-line study of candidate path of navigation according to user,
And lower whorl user's behavior prediction is carried out according to behavior prediction model.Best behavior executes optimizer system behavior;NLG module is according to most
The implementing result of good behavior execution module generates the natural language result for being finally presented to user.
In technical solution provided in this embodiment, conversational system trains obtained system action trigger mode according to machine learning
Type determines candidate system behavior, and by the enhancing that online incremental training obtains learn order models to candidate system behavior into
Row sequence, obtains optimizer system behavior, and interact with user according to optimizer system behavior.Also, pass through online increment type
The enhancing learning behavior prediction model that training obtains is ranked up candidate boot options to obtain best boot options.Due to enhancing
Study order models are that conversational system obtains environmental feedback information by the continuous dialogue with user, and believes according to environmental feedback
Breath carries out self-teaching and adjustment obtains, thus behavior prediction method flexibly, it is accurate and there is versatility.
Example IV
Fig. 5 is the structure of the behavior management device for the conversational system based on artificial intelligence that the embodiment of the present invention four provides
Figure.The device is generally applicable to the conversational system based on artificial intelligence and the situation of user's progress human-computer interaction.Referring to Fig. 5,
The specific structure of the behavior management device of conversational system provided in this embodiment based on artificial intelligence is as follows:
Current signature generation module 41, for according to current system interaction mode, current user state and system action sequence
Column generate current session feature;
Candidate behavior selecting module 42, the system action triggering for being obtained according to the current session feature and training
Model selects the candidate system behavior of user-association from the system action sequence;
System action decision-making module 43, for being interacted according to the candidate system behavior with the user.
Illustratively, above-mentioned apparatus includes behavior trigger model training module, and the behavior trigger model training module can
To be used for:
Based on artificial labeled data, the first incidence relation and user's shape between business scenario and system action are determined
The second incidence relation between state and system action;
According to first incidence relation and second incidence relation, universal interaction feature is extracted, wherein described general
Interaction feature includes system interaction state, User Status, user is intended to and the implementing result of upper wheel system behavior;
According to the universal interaction feature extracted, the system action trigger model is constructed.
Illustratively, above-mentioned apparatus may include:
Pre-screening module, for from the system action sequence select user-association candidate system behavior before,
Based on preset behavior configuration rule, prescreening processing is carried out to the system action for including in the system action sequence;Alternatively,
Additions and deletions intervention module, for from the system action sequence select user-association candidate system behavior it
Afterwards, it is based on preset behavior configuration rule, additions and deletions intervention processing is carried out to the candidate system behavior.
Illustratively, the system action decision-making module 43 may include:
Optimizer system behavior determination unit, the enhancing study order models for obtaining according to online incremental training are to institute
It states candidate system behavior to be ranked up, and determines optimizer system behavior according to ranking results;
System dialog unit, for being interacted according to the optimizer system behavior with the user.
Illustratively, above-mentioned apparatus includes order models training module, and the order models training module can be used for:
According to system interaction state, User Status, user's intention, the candidate system behavior and the candidate system row
For environmental feedback information, the enhancing is obtained by online incremental training and learns order models.
Illustratively, the environmental feedback information of the candidate system behavior may include that user clicks behavior, user places an order
Behavior, user's return information and user's evaluation information.
Illustratively, above-mentioned apparatus may include:
Candidate boot options determining module, described in determining after determining optimizer system behavior according to ranking results
The corresponding candidate boot options of optimizer system behavior;
Best boot options selecting module, the enhancing learning behavior for obtaining according to online incremental training predict mould
Type selects best boot options from the candidate boot options.
Illustratively, above-mentioned apparatus may include behavior prediction model training module, the behavior prediction model training mould
Block can be used for:
It is intended to according to system interaction state, User Status, user, the candidate boot options and the candidate guide choosing
The environmental feedback information of item, obtains the enhancing learning behavior prediction model by online incremental training.
The behavior management device of conversational system provided in this embodiment based on artificial intelligence, with any embodiment of that present invention
The behavior management method of the provided conversational system based on artificial intelligence belongs to same inventive concept, and it is any that the present invention can be performed
The behavior management method of conversational system based on artificial intelligence provided by embodiment has and executes the dialogue based on artificial intelligence
The corresponding functional module of behavior management method and beneficial effect of system.The not technical detail of detailed description in the present embodiment,
It can be found in the behavior management method for the conversational system based on artificial intelligence that any embodiment of that present invention provides.
Note that the above is only a better embodiment of the present invention and the applied technical principle.It will be appreciated by those skilled in the art that
The invention is not limited to the specific embodiments described herein, be able to carry out for a person skilled in the art it is various it is apparent variation,
It readjusts and substitutes without departing from protection scope of the present invention.Therefore, although being carried out by above embodiments to the present invention
It is described in further detail, but the present invention is not limited to the above embodiments only, without departing from the inventive concept, also
It may include more other equivalent embodiments, and the scope of the invention is determined by the scope of the appended claims.
Claims (14)
1. the behavior management method of the conversational system based on artificial intelligence characterized by comprising
According to current system interaction mode, current user state and system action sequence, current session feature is generated;
According to the system action trigger model that the current session feature and training obtain, selected from the system action sequence
Select the candidate system behavior of user-association;
It is interacted according to the candidate system behavior with the user;
Wherein, training obtains the system action trigger model in the following way:
Based on artificial labeled data, determine the first incidence relation between business scenario and system action and User Status with
The second incidence relation between system action;
According to first incidence relation and second incidence relation, universal interaction feature is extracted, wherein the universal interaction
Feature includes system interaction state, User Status, user is intended to and the implementing result of upper wheel system behavior;
According to the universal interaction feature extracted, the system action trigger model is constructed.
2. the method according to claim 1, wherein
It include: to be configured based on preset behavior before the candidate system behavior for selecting user-association in the system action sequence
Rule carries out prescreening processing to the system action for including in the system action sequence;Alternatively,
It include: to be configured based on preset behavior after the candidate system behavior for selecting user-association in the system action sequence
Rule carries out additions and deletions intervention processing to the candidate system behavior.
3. the method according to claim 1, wherein being handed over according to the candidate system behavior with the user
Mutually, comprising:
The enhancing study order models obtained according to online incremental training are ranked up the candidate system behavior, and foundation
Ranking results determine optimizer system behavior;
It is interacted according to the optimizer system behavior with the user.
4. according to the method described in claim 3, it is characterized in that, enhancing study order models are online in the following way
Training obtains:
According to system interaction state, User Status, user's intention, the candidate system behavior and the candidate system behavior
Environmental feedback information obtains the enhancing by online incremental training and learns order models.
5. according to method described in claim requirement 4, which is characterized in that the environmental feedback packet of the candidate system behavior
It includes user and clicks single act, user's return information and user's evaluation information under behavior, user.
6. according to the method described in claim 3, it is characterized in that, being wrapped after determining optimizer system behavior according to ranking results
It includes:
Determine the corresponding candidate boot options of the optimizer system behavior;
According to the enhancing learning behavior prediction model that online incremental training obtains, selection is best from the candidate boot options
Boot options.
7. according to the method described in claim 6, it is characterized in that, the enhancing learning behavior prediction model in the following way
On-line training obtains:
According to system interaction state, User Status, user's intention, the candidate boot options and the candidate boot options
Environmental feedback information obtains the enhancing learning behavior prediction model by online incremental training.
8. the behavior management device of the conversational system based on artificial intelligence characterized by comprising
Current signature generation module, for generating according to current system interaction mode, current user state and system action sequence
Current session feature;
Candidate behavior selecting module, the system action trigger model for being obtained according to the current session feature and training,
The candidate system behavior of user-association is selected from the system action sequence;
System action decision-making module, for being interacted according to the candidate system behavior with the user;
Wherein, described device includes behavior trigger model training module, and the behavior trigger model training module is used for:
Based on artificial labeled data, determine the first incidence relation between business scenario and system action and User Status with
The second incidence relation between system action;
According to first incidence relation and second incidence relation, universal interaction feature is extracted, wherein the universal interaction
Feature includes system interaction state, User Status, user is intended to and the implementing result of upper wheel system behavior;
According to the universal interaction feature extracted, the system action trigger model is constructed.
9. device according to claim 8 characterized by comprising
Pre-screening module, for being based on before the candidate system behavior for selecting user-association in the system action sequence
Preset behavior configuration rule carries out prescreening processing to the system action for including in the system action sequence;Alternatively,
Additions and deletions intervention module, for from the system action sequence select user-association candidate system behavior after, base
In preset behavior configuration rule, additions and deletions intervention processing is carried out to the candidate system behavior.
10. device according to claim 8, which is characterized in that the system action decision-making module includes:
Optimizer system behavior determination unit, the enhancing study order models for obtaining according to online incremental training are to the time
It selects system action to be ranked up, and determines optimizer system behavior according to ranking results;
System dialog unit, for being interacted according to the optimizer system behavior with the user.
11. device according to claim 10, which is characterized in that including order models training module, the order models
Training module is used for:
According to system interaction state, User Status, user's intention, the candidate system behavior and the candidate system behavior
Environmental feedback information obtains the enhancing by online incremental training and learns order models.
12. according to device described in claim requirement 11, which is characterized in that the environmental feedback information of the candidate system behavior
Single act, user's return information and user's evaluation information under behavior, user are clicked including user.
13. device according to claim 10 characterized by comprising
Candidate boot options determining module, it is described best for determining after determining optimizer system behavior according to ranking results
The corresponding candidate boot options of system action;
Best boot options selecting module, the enhancing learning behavior prediction model for being obtained according to online incremental training, from
Best boot options are selected in candidate's boot options.
14. device according to claim 13, which is characterized in that including behavior prediction model training module, the behavior
Prediction model training module is used for:
According to system interaction state, User Status, user's intention, the candidate boot options and the candidate boot options
Environmental feedback information obtains the enhancing learning behavior prediction model by online incremental training.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610862653.XA CN106445147B (en) | 2016-09-28 | 2016-09-28 | The behavior management method and device of conversational system based on artificial intelligence |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610862653.XA CN106445147B (en) | 2016-09-28 | 2016-09-28 | The behavior management method and device of conversational system based on artificial intelligence |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106445147A CN106445147A (en) | 2017-02-22 |
CN106445147B true CN106445147B (en) | 2019-05-10 |
Family
ID=58169687
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610862653.XA Active CN106445147B (en) | 2016-09-28 | 2016-09-28 | The behavior management method and device of conversational system based on artificial intelligence |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106445147B (en) |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107169586A (en) * | 2017-03-29 | 2017-09-15 | 北京百度网讯科技有限公司 | Resource optimization method, device and storage medium based on artificial intelligence |
CN107291867B (en) * | 2017-06-13 | 2021-07-20 | 北京百度网讯科技有限公司 | Dialog processing method, device and equipment based on artificial intelligence and computer readable storage medium |
CN107463601B (en) | 2017-06-13 | 2021-02-12 | 北京百度网讯科技有限公司 | Dialog understanding system construction method, device and equipment based on artificial intelligence and computer readable storage medium |
CN109086282A (en) * | 2017-06-14 | 2018-12-25 | 杭州方得智能科技有限公司 | A kind of method and system for the more wheels dialogue having multitask driving capability |
CN107463301A (en) * | 2017-06-28 | 2017-12-12 | 北京百度网讯科技有限公司 | Conversational system construction method, device, equipment and computer-readable recording medium based on artificial intelligence |
CN107369443B (en) * | 2017-06-29 | 2020-09-25 | 北京百度网讯科技有限公司 | Dialog management method and device based on artificial intelligence |
CN111104585B (en) * | 2018-10-25 | 2023-06-02 | 北京嘀嘀无限科技发展有限公司 | Question recommending method and device |
CN109635010B (en) * | 2018-12-26 | 2021-10-08 | 深圳市梦网视讯有限公司 | User characteristic and characteristic factor extraction and query method and system |
CN110211572B (en) * | 2019-05-14 | 2021-12-10 | 北京来也网络科技有限公司 | Dialogue control method and device based on reinforcement learning |
CN110569339B (en) * | 2019-07-22 | 2022-04-19 | 清华大学 | Dialogue method, medium, device and computing equipment |
CN110413756B (en) * | 2019-07-29 | 2022-02-15 | 北京小米智能科技有限公司 | Method, device and equipment for processing natural language |
CN111061366A (en) * | 2019-11-29 | 2020-04-24 | 深圳市杰思谷科技有限公司 | Method for robot to autonomously decide current behavior decision |
CN112307188B (en) * | 2020-12-30 | 2021-06-11 | 北京百度网讯科技有限公司 | Dialog generation method, system, electronic device and readable storage medium |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104350541A (en) * | 2012-04-04 | 2015-02-11 | 奥尔德巴伦机器人公司 | Robot capable of incorporating natural dialogues with a user into the behaviour of same, and methods of programming and using said robot |
CN105068661A (en) * | 2015-09-07 | 2015-11-18 | 百度在线网络技术(北京)有限公司 | Man-machine interaction method and system based on artificial intelligence |
-
2016
- 2016-09-28 CN CN201610862653.XA patent/CN106445147B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104350541A (en) * | 2012-04-04 | 2015-02-11 | 奥尔德巴伦机器人公司 | Robot capable of incorporating natural dialogues with a user into the behaviour of same, and methods of programming and using said robot |
CN105068661A (en) * | 2015-09-07 | 2015-11-18 | 百度在线网络技术(北京)有限公司 | Man-machine interaction method and system based on artificial intelligence |
Non-Patent Citations (1)
Title |
---|
"面向智能客服机器人的交互式问句理解研究";文博;《中国优秀硕士学位论文全文数据库》;20160315(第3期);第13-47页 |
Also Published As
Publication number | Publication date |
---|---|
CN106445147A (en) | 2017-02-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106445147B (en) | The behavior management method and device of conversational system based on artificial intelligence | |
Shi et al. | Virtual-taobao: Virtualizing real-world online retail environment for reinforcement learning | |
CN110168495B (en) | Searchable database of trained artificial intelligence objects | |
US20180357543A1 (en) | Artificial intelligence system configured to measure performance of artificial intelligence over time | |
Radwan et al. | Neutrosophic AHP multi criteria decision making method applied on the selection of learning management system | |
CN110188331A (en) | Model training method, conversational system evaluation method, device, equipment and storage medium | |
CN107150347A (en) | Robot perception and understanding method based on man-machine collaboration | |
CN110033022A (en) | Processing method, device and the storage medium of text | |
KR102540185B1 (en) | Recruitment position description text generation method, device, apparatus and medium | |
CN110399476A (en) | Generation method, device, equipment and the storage medium of talent's portrait | |
Wang et al. | Policy learning for domain selection in an extensible multi-domain spoken dialogue system | |
CN110472594A (en) | Method for tracking target, information insertion method and equipment | |
Botea et al. | Generating dialogue agents via automated planning | |
CN110390110A (en) | The method and apparatus that pre-training for semantic matches generates sentence vector | |
CN108115678A (en) | Robot and its method of controlling operation and device | |
Hafez et al. | Improving robot dual-system motor learning with intrinsically motivated meta-control and latent-space experience imagination | |
CN112000793B (en) | Man-machine interaction oriented dialogue target planning method | |
Li et al. | Simulation driven AI: From artificial to actual and vice versa | |
Xiao et al. | Robot learning in the era of foundation models: A survey | |
CN106709829B (en) | Learning situation diagnosis method and system based on online question bank | |
Saha et al. | Reinforcement learning based dialogue management strategy | |
JP2017513110A (en) | Contextual real-time feedback for neuromorphic model development | |
Coronado et al. | A personal agents hybrid architecture for question answering featuring social dialog | |
Zhan et al. | Dueling network architecture for multi-agent deep deterministic policy gradient | |
Lei et al. | Multi-skeleton structures graph convolutional network for action quality assessment in long videos |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |