CN117350304B - 一种多轮对话上下文向量增强方法及系统 - Google Patents
一种多轮对话上下文向量增强方法及系统 Download PDFInfo
- Publication number
- CN117350304B CN117350304B CN202311639567.9A CN202311639567A CN117350304B CN 117350304 B CN117350304 B CN 117350304B CN 202311639567 A CN202311639567 A CN 202311639567A CN 117350304 B CN117350304 B CN 117350304B
- Authority
- CN
- China
- Prior art keywords
- vector
- dialogue
- model
- sub
- loss
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 239000013598 vector Substances 0.000 title claims abstract description 167
- 238000000034 method Methods 0.000 title claims abstract description 59
- 238000009826 distribution Methods 0.000 claims abstract description 12
- 238000012549 training Methods 0.000 claims description 33
- 238000012545 processing Methods 0.000 claims description 22
- 230000007246 mechanism Effects 0.000 claims description 18
- 238000005457 optimization Methods 0.000 claims description 17
- 230000008451 emotion Effects 0.000 claims description 16
- 238000000605 extraction Methods 0.000 claims description 15
- 238000004458 analytical method Methods 0.000 claims description 14
- 230000006870 function Effects 0.000 claims description 12
- 238000005520 cutting process Methods 0.000 claims description 11
- 238000007781 pre-processing Methods 0.000 claims description 11
- 238000013528 artificial neural network Methods 0.000 claims description 9
- 238000004590 computer program Methods 0.000 claims description 9
- 238000003860 storage Methods 0.000 claims description 3
- 238000004364 calculation method Methods 0.000 abstract description 5
- 230000000694 effects Effects 0.000 abstract description 4
- 230000008569 process Effects 0.000 description 11
- 238000010586 diagram Methods 0.000 description 9
- 102100033814 Alanine aminotransferase 2 Human genes 0.000 description 7
- 101710096000 Alanine aminotransferase 2 Proteins 0.000 description 6
- 238000013459 approach Methods 0.000 description 5
- 238000011156 evaluation Methods 0.000 description 5
- 238000011176 pooling Methods 0.000 description 5
- 230000000875 corresponding effect Effects 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 230000002708 enhancing effect Effects 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- ORILYTVJVMAKLC-UHFFFAOYSA-N Adamantane Natural products C1C(C2)CC3CC1CC2C3 ORILYTVJVMAKLC-UHFFFAOYSA-N 0.000 description 2
- 238000004140 cleaning Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 238000002372 labelling Methods 0.000 description 2
- 101000779415 Homo sapiens Alanine aminotransferase 2 Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000002996 emotional effect Effects 0.000 description 1
- 238000013209 evaluation strategy Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 238000009966 trimming Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
- G06F40/35—Discourse or dialogue representation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/216—Parsing using statistical methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0499—Feedforward networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02T—CLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
- Y02T10/00—Road transport of goods or passengers
- Y02T10/10—Internal combustion engine [ICE] based vehicles
- Y02T10/40—Engine management systems
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Data Mining & Analysis (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Probability & Statistics with Applications (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311639567.9A CN117350304B (zh) | 2023-12-04 | 2023-12-04 | 一种多轮对话上下文向量增强方法及系统 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311639567.9A CN117350304B (zh) | 2023-12-04 | 2023-12-04 | 一种多轮对话上下文向量增强方法及系统 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN117350304A CN117350304A (zh) | 2024-01-05 |
CN117350304B true CN117350304B (zh) | 2024-02-02 |
Family
ID=89365238
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311639567.9A Active CN117350304B (zh) | 2023-12-04 | 2023-12-04 | 一种多轮对话上下文向量增强方法及系统 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117350304B (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI746214B (zh) * | 2020-10-19 | 2021-11-11 | 財團法人資訊工業策進會 | 機器閱讀理解方法、機器閱讀理解裝置及非暫態電腦可讀取媒體 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115956261A (zh) * | 2021-04-06 | 2023-04-11 | 辉达公司 | 神经网络中分布外输入数据的识别技术 |
CN116415154A (zh) * | 2023-06-12 | 2023-07-11 | 江西五十铃汽车有限公司 | 一种基于gpt的车辆故障解决方案生成方法及装置 |
CN116501861A (zh) * | 2023-06-25 | 2023-07-28 | 知呱呱(天津)大数据技术有限公司 | 基于层级bert模型与标签迁移的长文本摘要生成方法 |
CN116992833A (zh) * | 2023-06-30 | 2023-11-03 | 平安科技(深圳)有限公司 | 对话生成模型的训练方法、装置、设备及介质 |
WO2023222887A1 (en) * | 2022-05-19 | 2023-11-23 | Deepmind Technologies Limited | Intra-agent speech to facilitate task learning |
-
2023
- 2023-12-04 CN CN202311639567.9A patent/CN117350304B/zh active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115956261A (zh) * | 2021-04-06 | 2023-04-11 | 辉达公司 | 神经网络中分布外输入数据的识别技术 |
WO2023222887A1 (en) * | 2022-05-19 | 2023-11-23 | Deepmind Technologies Limited | Intra-agent speech to facilitate task learning |
CN116415154A (zh) * | 2023-06-12 | 2023-07-11 | 江西五十铃汽车有限公司 | 一种基于gpt的车辆故障解决方案生成方法及装置 |
CN116501861A (zh) * | 2023-06-25 | 2023-07-28 | 知呱呱(天津)大数据技术有限公司 | 基于层级bert模型与标签迁移的长文本摘要生成方法 |
CN116992833A (zh) * | 2023-06-30 | 2023-11-03 | 平安科技(深圳)有限公司 | 对话生成模型的训练方法、装置、设备及介质 |
Non-Patent Citations (4)
Title |
---|
ERNIE 3.0 Tiny: Frustratingly Simple Method to Improve Task-Agnostic Distillation Generalization;Weixin Liu etc.;AghaarXiv: 2301.03416v1 [cs.CL];全文 * |
ERNIE 3.0: LARGE-SCALE KNOWLEDGE ENHANCED PRE-TRAINING FOR LANGUAGE UNDERSTANDING AND GENERATION;Yu Sun etc.;arXiv:2107.02137v1 [cs.CL];全文 * |
Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training;Hong Liu etc.;arXiv:2305.14342v3 [cs.LG];全文 * |
一种基于多任务学习的多模态情感识别方法;林子杰 ,等;北京大学学报(自然科学版);第57卷(第1期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN117350304A (zh) | 2024-01-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110046221B (zh) | 一种机器对话方法、装置、计算机设备及存储介质 | |
US20230028944A1 (en) | Dialogue generation method and network training method and apparatus, storage medium, and device | |
CN108427771B (zh) | 摘要文本生成方法、装置和计算机设备 | |
CN106448670B (zh) | 基于深度学习和强化学习的自动回复对话系统 | |
CN110032630B (zh) | 话术推荐设备、方法及模型训练设备 | |
CN109977201B (zh) | 带情感的机器聊天方法、装置、计算机设备及存储介质 | |
CN111966800B (zh) | 情感对话生成方法、装置及情感对话模型训练方法、装置 | |
CN117350304B (zh) | 一种多轮对话上下文向量增强方法及系统 | |
CN106682387A (zh) | 用于输出信息的方法和装置 | |
CN112115246A (zh) | 基于对话的内容推荐方法、装置、计算机设备及存储介质 | |
CN112307168A (zh) | 基于人工智能的问诊会话处理方法、装置和计算机设备 | |
CN113988086A (zh) | 对话处理方法及装置 | |
CN114168707A (zh) | 一种面向推荐的情绪型对话方法 | |
Lee et al. | Deep representation learning for affective speech signal analysis and processing: Preventing unwanted signal disparities | |
CN110955765A (zh) | 智能助理的语料构建方法、装置、计算机设备和存储介质 | |
CN115269836A (zh) | 意图识别方法及装置 | |
CN113806564A (zh) | 多模态信息性推文检测方法及系统 | |
CN109727091A (zh) | 基于对话机器人的产品推荐方法、装置、介质及服务器 | |
CN113656542A (zh) | 一种基于信息检索与排序的话术推荐方法 | |
CN117271745A (zh) | 一种信息处理方法、装置及计算设备、存储介质 | |
CN117494762A (zh) | 学生模型的训练方法、素材处理方法、装置及电子设备 | |
US11941508B2 (en) | Dialog system with adaptive recurrent hopping and dual context encoding | |
CN113849641B (zh) | 一种跨领域层次关系的知识蒸馏方法和系统 | |
CN111078854B (zh) | 问答预测模型的训练方法及装置、问答预测方法及装置 | |
CN114547276A (zh) | 基于三通道图神经网络的会话推荐方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20240226 Address after: No. 401-1, 4th floor, podium, building 3 and 4, No. 11, Changchun Bridge Road, Haidian District, Beijing 100089 Patentee after: Beijing Zhiguagua Technology Co.,Ltd. Country or region after: China Address before: 806A, Building 1, Sixin Building, South Side of Heiniucheng Road, Hexi District, Tianjin, 300221 Patentee before: Zhiguagua (Tianjin) Big Data Technology Co.,Ltd. Country or region before: China |
|
TR01 | Transfer of patent right | ||
CP03 | Change of name, title or address |
Address after: No. 401-1, 4th floor, podium, building 3 and 4, No. 11, Changchun Bridge Road, Haidian District, Beijing 100089 Patentee after: Beijing Xinghe Zhiyuan Technology Co.,Ltd. Country or region after: China Address before: No. 401-1, 4th floor, podium, building 3 and 4, No. 11, Changchun Bridge Road, Haidian District, Beijing 100089 Patentee before: Beijing Zhiguagua Technology Co.,Ltd. Country or region before: China |
|
CP03 | Change of name, title or address |