CN117236410A - 一种可信的电子文件大语言模型训练、推理方法和装置 - Google Patents
一种可信的电子文件大语言模型训练、推理方法和装置 Download PDFInfo
- Publication number
- CN117236410A CN117236410A CN202311500582.5A CN202311500582A CN117236410A CN 117236410 A CN117236410 A CN 117236410A CN 202311500582 A CN202311500582 A CN 202311500582A CN 117236410 A CN117236410 A CN 117236410A
- Authority
- CN
- China
- Prior art keywords
- trusted
- language model
- training
- evaluation index
- sample data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000012549 training Methods 0.000 title claims abstract description 187
- 238000000034 method Methods 0.000 title claims abstract description 59
- 238000011156 evaluation Methods 0.000 claims abstract description 116
- 238000010276 construction Methods 0.000 claims description 27
- 230000002787 reinforcement Effects 0.000 claims description 24
- 239000013598 vector Substances 0.000 claims description 16
- 238000009825 accumulation Methods 0.000 claims description 7
- 238000000354 decomposition reaction Methods 0.000 claims description 4
- 238000010586 diagram Methods 0.000 description 12
- 230000006870 function Effects 0.000 description 12
- 238000003860 storage Methods 0.000 description 11
- 238000004590 computer program Methods 0.000 description 10
- 238000012545 processing Methods 0.000 description 10
- 230000008569 process Effects 0.000 description 5
- 238000004364 calculation method Methods 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 3
- 238000009826 distribution Methods 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 238000007476 Maximum Likelihood Methods 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 230000001186 cumulative effect Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 208000004547 Hallucinations Diseases 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000019771 cognition Effects 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 238000003066 decision tree Methods 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000008451 emotion Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000003058 natural language processing Methods 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 238000012805 post-processing Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
Classifications
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Abstract
Description
Claims (11)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311500582.5A CN117236410B (zh) | 2023-11-13 | 2023-11-13 | 一种可信的电子文件大语言模型训练、推理方法和装置 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311500582.5A CN117236410B (zh) | 2023-11-13 | 2023-11-13 | 一种可信的电子文件大语言模型训练、推理方法和装置 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN117236410A true CN117236410A (zh) | 2023-12-15 |
CN117236410B CN117236410B (zh) | 2024-01-23 |
Family
ID=89086498
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311500582.5A Active CN117236410B (zh) | 2023-11-13 | 2023-11-13 | 一种可信的电子文件大语言模型训练、推理方法和装置 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117236410B (zh) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20210374338A1 (en) * | 2020-05-26 | 2021-12-02 | Mastercard International Incorporated | Methods and systems for generating domain-specific text summarizations |
CN116127020A (zh) * | 2023-03-03 | 2023-05-16 | 北京百度网讯科技有限公司 | 生成式大语言模型训练方法以及基于模型的搜索方法 |
CN116226334A (zh) * | 2023-03-03 | 2023-06-06 | 北京百度网讯科技有限公司 | 生成式大语言模型训练方法以及基于模型的搜索方法 |
CN116244416A (zh) * | 2023-03-03 | 2023-06-09 | 北京百度网讯科技有限公司 | 生成式大语言模型训练方法、基于模型的人机语音交互方法 |
CN116564330A (zh) * | 2023-05-24 | 2023-08-08 | 思必驰科技股份有限公司 | 弱监督语音预训练方法、电子设备和存储介质 |
CN116611110A (zh) * | 2023-05-31 | 2023-08-18 | 山东浪潮科学研究院有限公司 | 一种数据隐私脱敏方法、装置、设备及存储介质 |
-
2023
- 2023-11-13 CN CN202311500582.5A patent/CN117236410B/zh active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20210374338A1 (en) * | 2020-05-26 | 2021-12-02 | Mastercard International Incorporated | Methods and systems for generating domain-specific text summarizations |
CN116127020A (zh) * | 2023-03-03 | 2023-05-16 | 北京百度网讯科技有限公司 | 生成式大语言模型训练方法以及基于模型的搜索方法 |
CN116226334A (zh) * | 2023-03-03 | 2023-06-06 | 北京百度网讯科技有限公司 | 生成式大语言模型训练方法以及基于模型的搜索方法 |
CN116244416A (zh) * | 2023-03-03 | 2023-06-09 | 北京百度网讯科技有限公司 | 生成式大语言模型训练方法、基于模型的人机语音交互方法 |
CN116564330A (zh) * | 2023-05-24 | 2023-08-08 | 思必驰科技股份有限公司 | 弱监督语音预训练方法、电子设备和存储介质 |
CN116611110A (zh) * | 2023-05-31 | 2023-08-18 | 山东浪潮科学研究院有限公司 | 一种数据隐私脱敏方法、装置、设备及存储介质 |
Non-Patent Citations (1)
Title |
---|
徐;熊健;范金宇;: "语言结构词性流量的冲量过程", 数理统计与管理, no. 04 * |
Also Published As
Publication number | Publication date |
---|---|
CN117236410B (zh) | 2024-01-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111368996B (zh) | 可传递自然语言表示的重新训练投影网络 | |
CN112131350B (zh) | 文本标签确定方法、装置、终端及可读存储介质 | |
CN109376222B (zh) | 问答匹配度计算方法、问答自动匹配方法及装置 | |
CN109614471B (zh) | 一种基于生成式对抗网络的开放式问题自动生成方法 | |
JP2019504413A (ja) | 絵文字を提案するためのシステムおよび方法 | |
CN112069302A (zh) | 会话意图识别模型的训练方法、会话意图识别方法及装置 | |
WO2022234543A1 (en) | Systems and methods for active curriculum learning | |
CN113934835A (zh) | 结合关键词和语义理解表征的检索式回复对话方法及系统 | |
US20220198149A1 (en) | Method and system for machine reading comprehension | |
CN111507108B (zh) | 别名生成方法、装置、电子设备及计算机可读存储介质 | |
Bachrach et al. | An attention mechanism for answer selection using a combined global and local view | |
CN113704393A (zh) | 关键词提取方法、装置、设备及介质 | |
KR20230117716A (ko) | 검색어 추천 장치, 방법 및 기록매체 | |
CN111666375A (zh) | 文本相似度的匹配方法、电子设备和计算机可读介质 | |
Bachrach et al. | An attention mechanism for neural answer selection using a combined global and local view | |
CN117236410B (zh) | 一种可信的电子文件大语言模型训练、推理方法和装置 | |
CN113516094B (zh) | 一种用于为文档匹配评议专家的系统以及方法 | |
CN112989024B (zh) | 文本内容的关系提取方法、装置、设备及存储介质 | |
CN115510326A (zh) | 基于文本特征和情感倾向的网络论坛用户兴趣推荐算法 | |
CN114997155A (zh) | 一种基于表格检索和实体图推理的事实验证方法与装置 | |
CN114328820A (zh) | 信息搜索方法以及相关设备 | |
CN111444338A (zh) | 文本处理、装置、存储介质及设备 | |
Alwaneen et al. | Stacked dynamic memory-coattention network for answering why-questions in Arabic | |
CN117494815A (zh) | 面向档案的可信大语言模型训练、推理方法和装置 | |
CN110929527B (zh) | 一种确定语义相似度方法及装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CB03 | Change of inventor or designer information |
Inventor after: Qian Minghui Inventor after: Yang Guancan Inventor after: Gou Jiajie Inventor after: Sun Ke Inventor after: Yang Jianliang Inventor after: Pan Fei Inventor after: Ju Xiang Inventor after: Li Hurong Inventor after: Kuang Fu Inventor after: Xu Jiayuan Inventor after: Xu Zhixuan Inventor after: Fan Anyi Inventor before: Qian Minghui Inventor before: Yang Guancan Inventor before: Gou Jiajie Inventor before: Sun Ke Inventor before: Yang Jianliang Inventor before: Pan Fei Inventor before: Ju Xiang Inventor before: Li Hurong Inventor before: Kuang Fu Inventor before: Xu Jiayuan Inventor before: Xu Zhixuan Inventor before: Fan Anyi |
|
CB03 | Change of inventor or designer information |