CN116490879A - Method and system for over-prediction in neural networks - Google Patents
Method and system for over-prediction in neural networks
- Publication number
- CN116490879A
- Authority
- CN
- China
- Prior art keywords
- machine learning
- layer
- learning model
- prediction
- layers
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/048—Activation functions
- G06N3/045—Combinations of networks
- G06N3/0455—Auto-encoder networks; Encoder-decoder networks
- G06N3/0499—Feedforward networks
- G06N3/08—Learning methods
- G06N3/09—Supervised learning
- G06N3/0985—Hyperparameter optimisation; Meta-learning; Learning-to-learn
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Computing Systems (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Molecular Biology (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Machine Translation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202063119566P | 2020-11-30 | 2020-11-30 | |
| US63/119,566 | 2020-11-30 | | |
| US17/455,181 | 2021-11-16 | | |
| US17/455,181 US12518129B2 (en) | 2020-11-30 | 2021-11-16 | Method and system for over-prediction in neural networks |
| PCT/US2021/059686 WO2022115291A1 (en) | 2020-11-30 | 2021-11-17 | Method and system for over-prediction in neural networks |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN116490879A (zh) | 2023-07-25 |
Family
ID=81751544
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202180077947.0A (Pending; published as CN116490879A (zh)) | 2020-11-30 | 2021-11-17 | Method and system for over-prediction in neural networks |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US12518129B2 |
| EP (1) | EP4252149A1 |
| JP (1) | JP7692482B2 |
| CN (1) | CN116490879A |
| WO (1) | WO2022115291A1 |
Families Citing this family (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US12499313B2 (en) * | 2021-01-21 | 2025-12-16 | Servicenow, Inc. | Ensemble scoring system for a natural language understanding (NLU) framework |
| US11842737B2 (en) | 2021-03-24 | 2023-12-12 | Google Llc | Automated assistant interaction prediction using fusion of visual and audio input |
| US20230237589A1 (en) * | 2022-01-21 | 2023-07-27 | Intuit Inc. | Model output calibration |
| US12010075B2 (en) * | 2022-06-29 | 2024-06-11 | Chime Financial, Inc. | Utilizing machine learning models to generate interactive digital text threads with personalized digital text reply options |
| US12608373B2 (en) | 2022-08-22 | 2026-04-21 | Oracle International Corporation | Detecting out-of-domain, out-of-scope, and confusion-span (OOCS) input for a natural language to logical form model |
| US12430330B2 (en) * | 2022-08-22 | 2025-09-30 | Oracle International Corporation | Calibrating confidence scores of a machine learning model trained as a natural language interface |
| US12536283B2 (en) * | 2022-11-09 | 2026-01-27 | Saudi Arabian Oil Company | Multi-layered machine learning model and use thereof |
| US11936814B1 (en) | 2022-11-22 | 2024-03-19 | Chime Financial, Inc. | Utilizing machine learning models to generate interactive digital text threads with personalized agent escalation digital text reply options |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN108874972A (zh) * | 2018-06-08 | 2018-11-23 | 青岛里奥机器人技术有限公司 | Multi-turn emotional dialogue method based on deep learning |
| US20190217206A1 (en) * | 2018-01-18 | 2019-07-18 | Moveworks, Inc. | Method and system for training a chatbot |
| US20200342850A1 (en) * | 2019-04-26 | 2020-10-29 | Oracle International Corporation | Routing for chatbots |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103679185B (zh) | 2012-08-31 | 2017-06-16 | 富士通株式会社 | Convolutional neural network classifier system, and its training method, classification method, and use |
| US10353905B2 (en) * | 2015-04-24 | 2019-07-16 | Salesforce.Com, Inc. | Identifying entities in semi-structured content |
| CN107590153B (zh) * | 2016-07-08 | 2021-04-27 | 微软技术许可有限责任公司 | Conversational relevance modeling using convolutional neural networks |
| JP7269778B2 (ja) | 2019-04-04 | 2023-05-09 | 富士フイルムヘルスケア株式会社 | Ultrasound imaging apparatus and image processing apparatus |
| US10693872B1 (en) * | 2019-05-17 | 2020-06-23 | Q5ID, Inc. | Identity verification system |
| WO2020241772A1 (ja) * | 2019-05-31 | 2020-12-03 | 国立大学法人京都大学 | Information processing device, screening device, information processing method, screening method, and program |
| KR102814913B1 (ko) * | 2019-10-02 | 2025-05-30 | 삼성전자주식회사 | Response inference method and apparatus |
2021
- 2021-11-16: US application US17/455,181 (US12518129B2), status: Active
- 2021-11-17: JP application JP2023532791A (JP7692482B2), status: Active
- 2021-11-17: EP application EP21824219.6A (EP4252149A1), status: Pending
- 2021-11-17: CN application CN202180077947.0A (CN116490879A), status: Pending
- 2021-11-17: WO application PCT/US2021/059686 (WO2022115291A1), status: Not active (Ceased)
Non-Patent Citations (1)
| Title |
|---|
| KIMIN LEE, KIBOK LEE, HONGLAK LEE, JINWOO SHIN: "A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks", ARXIV:1807.03888V2, 27 October 2018 (2018-10-27), pages 1 - 20 * |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2022115291A1 (en) | 2022-06-02 |
| EP4252149A1 (en) | 2023-10-04 |
| US20220172021A1 (en) | 2022-06-02 |
| JP7692482B2 (ja) | 2025-06-13 |
| US12518129B2 (en) | 2026-01-06 |
| JP2023551325A (ja) | 2023-12-07 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
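| CN114424185B (zh) | | Stop word data augmentation for natural language processing |
| JP7851913B2 (ja) | | Techniques for providing explanations for text classification |
| CN116724305B (zh) | | Integration of context tags with named entity recognition models |
| CN115398437B (zh) | | Improved out-of-domain (OOD) detection techniques |
| CN116802629B (zh) | | Multi-factor modeling for natural language processing |
| CN116583837B (zh) | | Distance-based logit values for natural language processing |
| CN115917553A (zh) | | Entity-level data augmentation for robust named entity recognition in chatbots |
| EP4128011A1 (en) | | Batching techniques for handling unbalanced training data for a chatbot |
| CN115398419A (zh) | | Method and system for objective-based hyperparameter tuning |
| US12518129B2 (en) | | Method and system for over-prediction in neural networks |
| KR20240089615A (ko) | | Fine-tuning of a multi-head network from a single transformer layer of a pre-trained language model |
| CN116547676A (zh) | | Enhanced logits for natural language processing |
| KR102821062B1 (ko) | | Systems and techniques for handling long text for pre-trained language models |
| CN118202344A (zh) | | Deep learning techniques for extracting embedded data from documents |
| KR20240111760A (ko) | | Path dropout for natural language processing |
| CN119183573A (zh) | | Entity-aware data augmentation techniques |
| US20230136965A1 (en) | | Prohibiting inconsistent named entity recognition tag sequences |
| CN116235164B (zh) | | Automatic out-of-scope transition for chatbots |
| CN120092248A (zh) | | Objective function optimization in objective-based hyperparameter tuning |
| CN119768794A (zh) | | Adaptive training data augmentation to facilitate training of named entity recognition models |
| CN116235164A (zh) | | Automatic out-of-scope transition for chatbots |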
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PB01 | Publication | |
| | SE01 | Entry into force of request for substantive examination | |