CN108491380B - Adversarial multi-task training method for spoken language understanding - Google Patents
Adversarial multi-task training method for spoken language understanding
- Publication number
- CN108491380B (application CN201810200343.0A)
- Authority
- CN
- China
- Prior art keywords
- model
- task
- training
- shared space
- spoken language
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Machine Translation (AREA)
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810200343.0A CN108491380B (zh) | 2018-03-12 | 2018-03-12 | Adversarial multi-task training method for spoken language understanding |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810200343.0A CN108491380B (zh) | 2018-03-12 | 2018-03-12 | Adversarial multi-task training method for spoken language understanding |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108491380A (zh) | 2018-09-04 |
CN108491380B (zh) | 2021-11-23 |
Family
ID=63338789
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810200343.0A Active CN108491380B (zh) | Adversarial multi-task training method for spoken language understanding | 2018-03-12 | 2018-03-12 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108491380B (zh) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
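CN111523952B (zh) * | 2019-01-17 | 2023-05-05 | Alibaba Group Holding Ltd. (阿里巴巴集团控股有限公司) | Information extraction method and device, storage medium, and processor |
CN109947931B (zh) * | 2019-03-20 | 2021-05-14 | South China University of Technology (华南理工大学) | Automatic text summarization method, system, device, and medium based on unsupervised learning |
CN110795945B (zh) * | 2019-10-30 | 2023-11-14 | Tencent Technology (Shenzhen) Co., Ltd. (腾讯科技(深圳)有限公司) | Semantic understanding model training method, semantic understanding method, device, and storage medium |
CN113743111B (zh) * | 2020-08-25 | 2024-06-04 | National Computer Network and Information Security Management Center (国家计算机网络与信息安全管理中心) | Financial risk prediction method and device based on text pre-training and multi-task learning |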
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
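CN1326567A (zh) * | 1998-11-16 | 2001-12-12 | Telefonaktiebolaget LM Ericsson (艾利森电话股份有限公司) | Processing system scheduling |
CN107197475A (zh) * | 2016-03-14 | 2017-09-22 | Chongqing University of Posts and Telecommunications (重庆邮电大学) | Multithreading-based sensor node identifier resolution test method and system |
CN107341146A (zh) * | 2017-06-23 | 2017-11-10 | Shanghai Jiao Tong University (上海交通大学) | Transferable spoken language semantic parsing system based on the internal structure of semantic slots, and its implementation method |
CN107357838A (zh) * | 2017-06-23 | 2017-11-17 | Shanghai Jiao Tong University (上海交通大学) | Online implementation method for dialogue policy based on multi-task learning |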
WO2017223009A1 (en) * | 2016-06-23 | 2017-12-28 | Microsoft Technology Licensing, Llc | Multi-domain joint semantic frame parsing |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
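CN106844346B (zh) * | 2017-02-09 | 2020-08-25 | Beijing Hongma Media Culture Development Co., Ltd. (北京红马传媒文化发展有限公司) | Short-text semantic similarity discrimination method and system based on the deep learning model Word2Vec |
CN107085716B (zh) * | 2017-05-24 | 2021-06-04 | Fudan University (复旦大学) | Cross-view gait recognition method based on a multi-task generative adversarial network |
CN107230401A (zh) * | 2017-06-02 | 2017-10-03 | Meng Xin (孟昕) | Interactive writing-instruction system using Internet and speech technology, and implementation method |
CN107240395B (zh) * | 2017-06-16 | 2020-04-28 | Baidu Online Network Technology (Beijing) Co., Ltd. (百度在线网络技术(北京)有限公司) | Acoustic model training method and device, computer equipment, and storage medium |
CN107463951A (zh) * | 2017-07-19 | 2017-12-12 | Tsinghua University (清华大学) | Method and device for improving the robustness of deep learning models |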
Non-Patent Citations (4)
Title |
---|
Adversarial Multi-Criteria Learning for Chinese Word Segmentation; Xinchi Chen et al.; Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics; 2017-08-04; pp. 1193-1203 * |
Semi-supervised sequence tagging with bidirectional language models; Matthew E. Peters et al.; http://export.arxiv.org/abs/1705.00108; 2017-04-29; pp. 1-10 * |
Semi-Supervised Training Using Adversarial Multi-Task Learning for Spoken Language Understanding; Ouyu Lan et al.; ICASSP 2018; 2018-04-20; pp. 6049-6053 * |
Research on execution strategies for statistical Chinese spoken language understanding (统计中文口语理解执行策略的研究); Li Yanling (李艳玲) et al.; Journal of Frontiers of Computer Science and Technology (计算机科学与探索); 2016-04-08; pp. 980-987 * |
Also Published As
Publication number | Publication date |
---|---|
CN108491380A (zh) | 2018-09-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
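CN109637546B (zh) | | Knowledge distillation method and device |
CN108962224B (zh) | | Joint modeling method for spoken language understanding and language model, dialogue method, and system |
US11568000B2 (en) | | System and method for automatic task-oriented dialog system |
CN108920666B (zh) | | Search method, system, electronic device, and storage medium based on semantic understanding |
Cohn-Gordon et al. | | Pragmatically informative image captioning with character-level inference |
EP3516591B1 (en) | | Neural machine translation systems |
CN108491380B (zh) | | Adversarial multi-task training method for spoken language understanding |
US10268671B2 (en) | | Generating parse trees of text segments using neural networks |
CN107680580B (zh) | | Text conversion model training method and device, and text conversion method and device |
CN108417205B (zh) | | Semantic understanding training method and system |
EP3218854B1 (en) | | Generating natural language descriptions of images |
CN110516253B (zh) | | Chinese spoken language semantic understanding method and system |
US10083169B1 (en) | | Topic-based sequence modeling neural networks |
CN109074517B (zh) | | Globally normalized neural networks |
JP2021524623A (ja) | | Multi-task learning as question answering |
US20160372118A1 (en) | | Context-dependent modeling of phonemes |
CN110234018B (zh) | | Multimedia content description generation method, training method, apparatus, device, and medium |
CN110349572A (zh) | | Voice keyword recognition method, apparatus, terminal, and server |
CN106663092A (zh) | | Neural machine translation system with rare word processing |
CN111816160A (zh) | | Training method and system for a mixed Mandarin and Cantonese speech recognition model |
Nguyen et al. | | From film to video: Multi-turn question answering with multi-modal context |
US20230034414A1 (en) | | Dialogue processing apparatus, learning apparatus, dialogue processing method, learning method and program |
CN111667728B (zh) | | Speech post-processing module training method and device |
CN110457674B (zh) | | Topic-guided text prediction method |
CN111522925A (zh) | | Dialogue state generation method and device |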
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
TA01 | Transfer of patent application right | Effective date of registration: 2020-06-18. Address after: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Applicants after: AI SPEECH Co.,Ltd.; Shanghai Jiaotong University Intellectual Property Management Co.,Ltd. Address before: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Applicants before: AI SPEECH Co.,Ltd.; SHANGHAI JIAO TONG University |
TA01 | Transfer of patent application right | Effective date of registration: 2020-10-26. Address after: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Applicant after: AI SPEECH Co.,Ltd. Address before: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Applicants before: AI SPEECH Co.,Ltd.; Shanghai Jiaotong University Intellectual Property Management Co.,Ltd. |
CB02 | Change of applicant information | Address after: 215123 building 14, Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou City, Jiangsu Province. Applicant after: Sipic Technology Co.,Ltd. Address before: 215123 building 14, Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou City, Jiangsu Province. Applicant before: AI SPEECH Co.,Ltd. |
GR01 | Patent grant | |
PE01 | Entry into force of the registration of the contract for pledge of patent right | Denomination of invention: Adversarial multitasking training method for oral comprehension. Effective date of registration: 2023-07-26. Granted publication date: 2021-11-23. Pledgee: CITIC Bank Limited by Share Ltd. Suzhou branch. Pledgor: Sipic Technology Co.,Ltd. Registration number: Y2023980049433 |