CN107357838B - 基于多任务学习的对话策略在线实现方法 - Google Patents
基于多任务学习的对话策略在线实现方法 Download PDFInfo
- Publication number
- CN107357838B CN107357838B CN201710483734.3A CN201710483734A CN107357838B CN 107357838 B CN107357838 B CN 107357838B CN 201710483734 A CN201710483734 A CN 201710483734A CN 107357838 B CN107357838 B CN 107357838B
- Authority
- CN
- China
- Prior art keywords
- conversation
- value
- reward value
- learning
- user
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 40
- 230000009471 action Effects 0.000 claims abstract description 61
- 238000012549 training Methods 0.000 claims abstract description 41
- 230000008569 process Effects 0.000 claims abstract description 14
- 230000002787 reinforcement Effects 0.000 claims abstract description 9
- 230000001186 cumulative effect Effects 0.000 claims description 9
- 230000006870 function Effects 0.000 claims description 9
- KDYFGRWQOYBRFD-UHFFFAOYSA-N succinic acid Chemical compound OC(=O)CCC(O)=O KDYFGRWQOYBRFD-UHFFFAOYSA-N 0.000 claims 1
- 238000013461 design Methods 0.000 abstract description 2
- 230000015572 biosynthetic process Effects 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 238000013528 artificial neural network Methods 0.000 description 2
- 241000282414 Homo sapiens Species 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3325—Reformulation based on results of preceding query
- G06F16/3326—Reformulation based on results of preceding query using relevance feedback from the user, e.g. relevance feedback on documents, documents sets, document terms or passages
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation or dialogue systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Artificial Intelligence (AREA)
- Theoretical Computer Science (AREA)
- Mathematical Physics (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710483734.3A CN107357838B (zh) | 2017-06-23 | 2017-06-23 | 基于多任务学习的对话策略在线实现方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710483734.3A CN107357838B (zh) | 2017-06-23 | 2017-06-23 | 基于多任务学习的对话策略在线实现方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107357838A CN107357838A (zh) | 2017-11-17 |
CN107357838B true CN107357838B (zh) | 2020-09-01 |
Family
ID=60273492
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710483734.3A Active CN107357838B (zh) | 2017-06-23 | 2017-06-23 | 基于多任务学习的对话策略在线实现方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107357838B (zh) |
Families Citing this family (33)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110444214B (zh) | 2017-11-24 | 2021-08-17 | 深圳市腾讯计算机系统有限公司 | 语音信号处理模型训练方法、装置、电子设备及存储介质 |
CN108268616B (zh) * | 2018-01-04 | 2020-09-01 | 中国科学院自动化研究所 | 融合规则信息的可控制性对话管理扩展方法 |
CN108304489B (zh) * | 2018-01-05 | 2021-12-28 | 广东工业大学 | 一种基于强化学习网络的目标引导型个性化对话方法与系统 |
CN108282587B (zh) * | 2018-01-19 | 2020-05-26 | 重庆邮电大学 | 基于状态跟踪与策略导向下的移动客服对话管理方法 |
US20210042584A1 (en) * | 2018-01-30 | 2021-02-11 | Nec Corporation | Information processing apparatus, control method, and non-transitory storage medium |
US11501076B2 (en) * | 2018-02-09 | 2022-11-15 | Salesforce.Com, Inc. | Multitask learning as question answering |
CN108491380B (zh) * | 2018-03-12 | 2021-11-23 | 思必驰科技股份有限公司 | 用于口语理解的对抗多任务训练方法 |
CN108962238B (zh) * | 2018-04-25 | 2020-08-07 | 苏州思必驰信息科技有限公司 | 基于结构化神经网络的对话方法、系统、设备及存储介质 |
CN112135716B (zh) * | 2018-05-18 | 2023-11-03 | 谷歌有限责任公司 | 数据高效的分层强化学习 |
CN108804611B (zh) * | 2018-05-30 | 2021-11-19 | 浙江大学 | 一种基于自我评论序列学习的对话回复生成方法及系统 |
CN108959412B (zh) * | 2018-06-07 | 2021-09-14 | 出门问问信息科技有限公司 | 标注数据的生成方法、装置、设备及存储介质 |
CN108962224B (zh) * | 2018-07-19 | 2020-06-26 | 苏州思必驰信息科技有限公司 | 口语理解和语言模型联合建模方法、对话方法及系统 |
CN109227558A (zh) * | 2018-10-09 | 2019-01-18 | 北京智合大方科技有限公司 | 可实时调校的智能外呼机器人 |
US11100407B2 (en) | 2018-10-10 | 2021-08-24 | International Business Machines Corporation | Building domain models from dialog interactions |
CN109388698A (zh) * | 2018-10-22 | 2019-02-26 | 北京工业大学 | 一种基于深度强化学习的指导性自动聊天方法 |
CN110018722B (zh) * | 2018-11-06 | 2022-12-23 | 联想企业解决方案(新加坡)有限公司 | 用于热控制的机器学习装置、系统和方法 |
CN109817329B (zh) * | 2019-01-21 | 2021-06-29 | 暗物智能科技(广州)有限公司 | 一种医疗问诊对话系统以及应用于该系统的强化学习方法 |
CN109961152B (zh) * | 2019-03-14 | 2021-03-02 | 广州多益网络股份有限公司 | 虚拟偶像的个性化互动方法、系统、终端设备及存储介质 |
CN109977208B (zh) * | 2019-03-22 | 2021-04-09 | 北京中科汇联科技股份有限公司 | 一种融合faq和任务及主动引导的对话系统 |
US11681923B2 (en) * | 2019-04-19 | 2023-06-20 | Samsung Electronics Co., Ltd. | Multi-model structures for classification and intent determination |
CN110111766A (zh) * | 2019-04-22 | 2019-08-09 | 南京硅基智能科技有限公司 | 一种多领域任务型对话系统和终端 |
CN110245221B (zh) * | 2019-05-13 | 2023-05-23 | 华为技术有限公司 | 训练对话状态跟踪分类器的方法和计算机设备 |
CN110347815A (zh) * | 2019-07-11 | 2019-10-18 | 上海蔚来汽车有限公司 | 语音对话系统中的多任务处理方法以及多任务处理系统 |
CN110569339B (zh) * | 2019-07-22 | 2022-04-19 | 清华大学 | 对话方法、介质、装置和计算设备 |
US11423235B2 (en) | 2019-11-08 | 2022-08-23 | International Business Machines Corporation | Cognitive orchestration of multi-task dialogue system |
CN112884501B (zh) * | 2019-11-29 | 2023-10-10 | 百度在线网络技术(北京)有限公司 | 数据处理方法、装置、电子设备及存储介质 |
CN111104502A (zh) * | 2019-12-24 | 2020-05-05 | 携程计算机技术(上海)有限公司 | 外呼系统的对话管理方法、系统、电子设备和存储介质 |
CN111274438B (zh) * | 2020-01-15 | 2023-06-23 | 中山大学 | 一种语言描述引导的视频时序定位方法 |
CN112100354B (zh) * | 2020-09-16 | 2023-07-25 | 北京奇艺世纪科技有限公司 | 人机对话方法、装置、设备及存储介质 |
CN112800192B (zh) * | 2021-01-14 | 2022-02-08 | 云从科技集团股份有限公司 | 多轮对话方法、系统、介质及装置 |
CN112818097A (zh) * | 2021-01-26 | 2021-05-18 | 山西三友和智慧信息技术股份有限公司 | 一种基于对话框状态跟踪模型的任务外训练系统 |
CN113239171B (zh) * | 2021-06-07 | 2023-08-01 | 平安科技(深圳)有限公司 | 对话管理系统更新方法、装置、计算机设备及存储介质 |
CN114418119A (zh) * | 2022-01-21 | 2022-04-29 | 深圳市神州云海智能科技有限公司 | 一种基于结构深度嵌入的对话策略优化方法及系统 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103473087A (zh) * | 2013-08-30 | 2013-12-25 | 福建升腾资讯有限公司 | 一种多任务系统中软件开关机的关机控制方法 |
CN104462024A (zh) * | 2014-10-29 | 2015-03-25 | 百度在线网络技术(北京)有限公司 | 生成对话动作策略模型的方法和装置 |
CN105630960A (zh) * | 2015-12-24 | 2016-06-01 | 百度在线网络技术(北京)有限公司 | 测试领域任务型对话系统的方法和装置 |
CN105788593A (zh) * | 2016-02-29 | 2016-07-20 | 中国科学院声学研究所 | 生成对话策略的方法及系统 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7664644B1 (en) * | 2006-06-09 | 2010-02-16 | At&T Intellectual Property Ii, L.P. | Multitask learning for spoken language understanding |
US9299081B2 (en) * | 2012-09-10 | 2016-03-29 | Yahoo! Inc. | Deriving a user profile from questions |
US10088972B2 (en) * | 2013-12-31 | 2018-10-02 | Verint Americas Inc. | Virtual assistant conversations |
-
2017
- 2017-06-23 CN CN201710483734.3A patent/CN107357838B/zh active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103473087A (zh) * | 2013-08-30 | 2013-12-25 | 福建升腾资讯有限公司 | 一种多任务系统中软件开关机的关机控制方法 |
CN104462024A (zh) * | 2014-10-29 | 2015-03-25 | 百度在线网络技术(北京)有限公司 | 生成对话动作策略模型的方法和装置 |
CN105630960A (zh) * | 2015-12-24 | 2016-06-01 | 百度在线网络技术(北京)有限公司 | 测试领域任务型对话系统的方法和装置 |
CN105788593A (zh) * | 2016-02-29 | 2016-07-20 | 中国科学院声学研究所 | 生成对话策略的方法及系统 |
Non-Patent Citations (2)
Title |
---|
On-line Dialogue Policy Learning with Companion Teaching;Lu Chen et.al;《Proceedings of the 15th Conference of European Chapter of the association for Computational Linguistics》;20170407;正文第2节,图1 * |
口语对话系统中对话管理方法研究综述;王玉 等;《计算机科学》;20150630;全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN107357838A (zh) | 2017-11-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107357838B (zh) | 基于多任务学习的对话策略在线实现方法 | |
CN109299237B (zh) | 基于行动者评论家强化学习算法的循环网络人机对话方法 | |
CN111159368B (zh) | 一种个性化对话的回复生成方法 | |
CN110837548B (zh) | 答案匹配方法、装置、电子设备及存储介质 | |
CN110059170B (zh) | 基于用户交互的多轮对话在线训练方法及系统 | |
CN111460833A (zh) | 文本生成方法、装置和设备 | |
CN114691852B (zh) | 人机对话系统及方法 | |
CN112633010A (zh) | 基于多头注意力和图卷积网络的方面级情感分析方法及系统 | |
CN113435211B (zh) | 一种结合外部知识的文本隐式情感分析方法 | |
CN113239167A (zh) | 一种可自动生成对话策略的任务型对话管理方法和系统 | |
CN114911932A (zh) | 基于主题语义增强的异构图结构多会话者情感分析方法 | |
CN111046178A (zh) | 一种文本序列生成方法及其系统 | |
CN110069611A (zh) | 一种主题增强的聊天机器人回复生成方法及装置 | |
CN110096516A (zh) | 自定义的数据库交互的对话生成方法及系统 | |
CN110297894B (zh) | 一种基于辅助网络的智能对话生成方法 | |
CN115392261A (zh) | 模型训练及任务型对话方法、电子设备 | |
CN115062606A (zh) | 对话数据分析及其模型训练方法、及电子设备 | |
CN113326367B (zh) | 基于端到端文本生成的任务型对话方法和系统 | |
CN117252161A (zh) | 一种特定领域的模型训练和文本生成方法 | |
CN116777568A (zh) | 金融市场交易事前智能对话下单方法、装置及存储介质 | |
CN112364659A (zh) | 一种无监督的语义表示自动识别方法及装置 | |
CN111414466A (zh) | 一种基于深度模型融合的多轮对话建模方法 | |
CN116701566A (zh) | 一种基于情感的多轮对话模型及对话方法 | |
CN116303930A (zh) | 一种基于语义匹配与生成模型的会话智能生成方法 | |
CN116204623A (zh) | 一种会话主题主动引导式会话方法和系统 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20200623 Address after: Room 223, old administration building, 800 Dongchuan Road, Minhang District, Shanghai, 200240 Applicant after: Shanghai Jiaotong University Intellectual Property Management Co.,Ltd. Applicant after: AI SPEECH Co.,Ltd. Address before: 200240 Dongchuan Road, Shanghai, No. 800, No. Applicant before: SHANGHAI JIAO TONG University Applicant before: AI SPEECH Co.,Ltd. |
|
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20201021 Address after: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Patentee after: AI SPEECH Co.,Ltd. Address before: Room 223, old administration building, 800 Dongchuan Road, Minhang District, Shanghai, 200240 Patentee before: Shanghai Jiaotong University Intellectual Property Management Co.,Ltd. Patentee before: AI SPEECH Co.,Ltd. |
|
CP01 | Change in the name or title of a patent holder | ||
CP01 | Change in the name or title of a patent holder |
Address after: 215123 building 14, Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou City, Jiangsu Province Patentee after: Sipic Technology Co.,Ltd. Address before: 215123 building 14, Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou City, Jiangsu Province Patentee before: AI SPEECH Co.,Ltd. |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: Online Implementation Method of Dialogue Strategy Based on Multitask Learning Effective date of registration: 20230726 Granted publication date: 20200901 Pledgee: CITIC Bank Limited by Share Ltd. Suzhou branch Pledgor: Sipic Technology Co.,Ltd. Registration number: Y2023980049433 |