CN116490879A - Method and system for over-prediction in neural networks - Google Patents

Method and system for over-prediction in neural networks

Info

Publication number
CN116490879A
Authority
CN
China
Prior art keywords
machine learning
layer
learning model
prediction
layers
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202180077947.0A
Other languages
English (en)
Chinese (zh)
Inventor
C·D·V·黄
T·T·乌
P·扎雷莫迪
Y·许
V·布利诺夫
Y-H·洪
Y·D·T·S·达马西里
V·韦氏诺一
E·L·贾拉勒丁
M·帕里克
T·L·董
M·E·约翰逊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Oracle International Corp
Original Assignee
Oracle International Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oracle International Corp
Publication of CN116490879A
Current legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/048 Activation functions
    • G06N3/045 Combinations of networks
    • G06N3/0455 Auto-encoder networks; Encoder-decoder networks
    • G06N3/0499 Feedforward networks
    • G06N3/08 Learning methods
    • G06N3/09 Supervised learning
    • G06N3/0985 Hyperparameter optimisation; Meta-learning; Learning-to-learn

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
CN202180077947.0A 2020-11-30 2021-11-17 Method and system for over-prediction in neural networks Pending CN116490879A (zh)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US202063119566P 2020-11-30 2020-11-30
US63/119,566 2020-11-30
US17/455,181 2021-11-16
US17/455,181 US12518129B2 (en) 2020-11-30 2021-11-16 Method and system for over-prediction in neural networks
PCT/US2021/059686 WO2022115291A1 (en) 2020-11-30 2021-11-17 Method and system for over-prediction in neural networks

Publications (1)

Publication Number Publication Date
CN116490879A (zh) 2023-07-25

Family

ID=81751544

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202180077947.0A Pending CN116490879A (zh) 2020-11-30 2021-11-17 Method and system for over-prediction in neural networks

Country Status (5)

Country Link
US (1) US12518129B2
EP (1) EP4252149A1
JP (1) JP7692482B2
CN (1) CN116490879A
WO (1) WO2022115291A1

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12499313B2 (en) * 2021-01-21 2025-12-16 Servicenow, Inc. Ensemble scoring system for a natural language understanding (NLU) framework
US11842737B2 (en) 2021-03-24 2023-12-12 Google Llc Automated assistant interaction prediction using fusion of visual and audio input
US20230237589A1 (en) * 2022-01-21 2023-07-27 Intuit Inc. Model output calibration
US12010075B2 (en) * 2022-06-29 2024-06-11 Chime Financial, Inc. Utilizing machine learning models to generate interactive digital text threads with personalized digital text reply options
US12608373B2 (en) 2022-08-22 2026-04-21 Oracle International Corporation Detecting out-of-domain, out-of-scope, and confusion-span (OOCS) input for a natural language to logical form model
US12430330B2 (en) * 2022-08-22 2025-09-30 Oracle International Corporation Calibrating confidence scores of a machine learning model trained as a natural language interface
US12536283B2 (en) * 2022-11-09 2026-01-27 Saudi Arabian Oil Company Multi-layered machine learning model and use thereof
US11936814B1 (en) 2022-11-22 2024-03-19 Chime Financial, Inc. Utilizing machine learning models to generate interactive digital text threads with personalized agent escalation digital text reply options

Citations (3)

Publication number Priority date Publication date Assignee Title
CN108874972A (zh) * 2018-06-08 2018-11-23 青岛里奥机器人技术有限公司 Multi-round emotional dialogue method based on deep learning
US20190217206A1 (en) * 2018-01-18 2019-07-18 Moveworks, Inc. Method and system for training a chatbot
US20200342850A1 (en) * 2019-04-26 2020-10-29 Oracle International Corporation Routing for chatbots

Family Cites Families (7)

Publication number Priority date Publication date Assignee Title
CN103679185B (zh) 2012-08-31 2017-06-16 富士通株式会社 Convolutional neural network classifier system, training method therefor, classification method, and use
US10353905B2 (en) * 2015-04-24 2019-07-16 Salesforce.Com, Inc. Identifying entities in semi-structured content
CN107590153B (zh) * 2016-07-08 2021-04-27 微软技术许可有限责任公司 Conversational relevance modeling using convolutional neural networks
JP7269778B2 (ja) 2019-04-04 2023-05-09 富士フイルムヘルスケア株式会社 Ultrasonic imaging device and image processing device
US10693872B1 (en) * 2019-05-17 2020-06-23 Q5ID, Inc. Identity verification system
WO2020241772A1 (ja) * 2019-05-31 2020-12-03 国立大学法人京都大学 Information processing device, screening device, information processing method, screening method, and program
KR102814913B1 (ko) * 2019-10-02 2025-05-30 삼성전자주식회사 Response inference method and apparatus

Non-Patent Citations (1)

Title
KIMIN LEE, KIBOK LEE, HONGLAK LEE, JINWOO SHIN: "A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks", ARXIV:1807.03888V2, 27 October 2018 (2018-10-27), pages 1 - 20 *

Also Published As

Publication number Publication date
WO2022115291A1 (en) 2022-06-02
EP4252149A1 (en) 2023-10-04
US20220172021A1 (en) 2022-06-02
JP7692482B2 (ja) 2025-06-13
US12518129B2 (en) 2026-01-06
JP2023551325A (ja) 2023-12-07

Similar Documents

Publication Publication Date Title
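CN114424185B (zh) Stop word data augmentation for natural language processing
JP7851913B2 (ja) Techniques for providing explanations for text classification
CN116724305B (zh) Integration of context tags with named entity recognition models
CN115398437B (zh) Improved out-of-domain (OOD) detection techniques
CN116802629B (zh) Multi-factor modeling for natural language processing
CN116583837B (zh) Distance-based logit values for natural language processing
CN115917553A (zh) Entity-level data augmentation in chatbots for robust named entity recognition
EP4128011A1 (en) Batching techniques for handling unbalanced training data for a chatbot
CN115398419A (zh) Method and system for objective-based hyperparameter tuning
US12518129B2 (en) Method and system for over-prediction in neural networks
KR20240089615A (ko) Fine-tuning of multi-head networks from a single transformer layer of a pre-trained language model
CN116547676A (zh) Enhanced logits for natural language processing
KR102821062B1 (ko) Systems and techniques for handling long text for pre-trained language models
CN118202344A (zh) Deep learning techniques for extracting embedded data from documents
KR20240111760A (ko) Path dropout for natural language processing
CN119183573A (zh) Entity-aware data augmentation techniques
US20230136965A1 (en) Prohibiting inconsistent named entity recognition tag sequences
CN116235164B (zh) Out-of-scope automatic transitions for chatbots
CN120092248A (zh) Objective function optimization in objective-based hyperparameter tuning
CN119768794A (zh) Adaptive training data augmentation to facilitate training of named entity recognition models
CN116235164A (zh) Out-of-scope automatic transitions for chatbots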

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination