JP2022548770A5 - - Google Patents

Info

Publication number
JP2022548770A5
JP2022548770A5 JP2022518169A JP2022518169A JP2022548770A5 JP 2022548770 A5 JP2022548770 A5 JP 2022548770A5 JP 2022518169 A JP2022518169 A JP 2022518169A JP 2022518169 A JP2022518169 A JP 2022518169A JP 2022548770 A5 JP2022548770 A5 JP 2022548770A5
Authority
JP
Japan
Prior art keywords
model
sanitized
training data
path
trained
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2022518169A
Other languages
English (en)
Japanese (ja)
Other versions
JP2022548770A (ja
JP7507851B2 (ja
Filing date
Publication date
Priority claimed from US16/702,817 external-priority patent/US11568282B2/en
Application filed filed Critical
Publication of JP2022548770A publication Critical patent/JP2022548770A/ja
Publication of JP2022548770A5 publication Critical patent/JP2022548770A5/ja
Application granted granted Critical
Publication of JP7507851B2 publication Critical patent/JP7507851B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

JP2022518169A 2019-09-24 2020-09-21 機械学習システムにおける敵対的影響の軽減 Active JP7507851B2 (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201962904869P 2019-09-24 2019-09-24
US62/904,869 2019-09-24
US16/702,817 2019-12-04
US16/702,817 US11568282B2 (en) 2019-09-24 2019-12-04 Mitigating adversarial effects in machine learning systems
PCT/IB2020/058763 WO2021059106A1 (en) 2019-09-24 2020-09-21 Mitigating adversarial effects in machine learning systems

Publications (3)

Publication Number Publication Date
JP2022548770A JP2022548770A (ja) 2022-11-21
JP2022548770A5 true JP2022548770A5 (https=) 2022-12-16
JP7507851B2 JP7507851B2 (ja) 2024-06-28

Family

ID=74881033

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2022518169A Active JP7507851B2 (ja) 2019-09-24 2020-09-21 機械学習システムにおける敵対的影響の軽減

Country Status (6)

Country Link
US (1) US11568282B2 (https=)
JP (1) JP7507851B2 (https=)
CN (1) CN114450695B (https=)
DE (1) DE112020003829T5 (https=)
GB (1) GB2603391B (https=)
WO (1) WO2021059106A1 (https=)

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11609990B2 (en) * 2019-05-29 2023-03-21 Anomalee Inc. Post-training detection and identification of human-imperceptible backdoor-poisoning attacks
US11514297B2 (en) * 2019-05-29 2022-11-29 Anomalee Inc. Post-training detection and identification of human-imperceptible backdoor-poisoning attacks
US11601463B2 (en) * 2020-07-28 2023-03-07 The Boeing Company Cybersecurity threat modeling and analysis with text miner and data flow diagram editor
US12001553B2 (en) * 2020-08-20 2024-06-04 Red Bend Ltd. Detecting vehicle malfunctions and cyber attacks using machine learning
US20250240658A1 (en) * 2021-10-20 2025-07-24 Nokia Technologies Oy Criteria-based measurement reporting
US12182258B2 (en) * 2021-11-08 2024-12-31 Microsoft Technology Licensing, Llc Adversarial training to minimize data poisoning attacks
EP4430513A1 (en) * 2021-11-08 2024-09-18 Microsoft Technology Licensing, LLC Adversarial training to minimize data poisoning attacks
US12326940B2 (en) * 2021-11-28 2025-06-10 International Business Machines Corporation Graph exploration framework for adversarial example generation
US20230205872A1 (en) * 2021-12-23 2023-06-29 Advanced Micro Devices, Inc. Method and apparatus to address row hammer attacks at a host processor
CN114283341B (zh) * 2022-03-04 2022-05-17 西南石油大学 一种高转移性对抗样本生成方法、系统及终端
CN114610885B (zh) * 2022-03-09 2022-11-08 江南大学 一种文本分类后门攻击方法、系统及设备
JP2024047424A (ja) * 2022-09-26 2024-04-05 株式会社Screenホールディングス 学習装置、学習方法および学習プログラムに関する。
US12475235B2 (en) 2023-01-19 2025-11-18 Citibank, N.A. Generative cybersecurity exploit discovery and evaluation
US12282565B2 (en) 2023-01-19 2025-04-22 Citibank, N.A. Generative cybersecurity exploit synthesis and mitigation
US12596813B2 (en) 2023-01-19 2026-04-07 Citibank, N.A Autonomous agent observation and control
US12314406B1 (en) 2023-01-19 2025-05-27 Citibank, N.A. Generative cybersecurity exploit discovery and evaluation
US11874934B1 (en) 2023-01-19 2024-01-16 Citibank, N.A. Providing user-induced variable identification of end-to-end computing system security impact information systems and methods
US12271491B2 (en) * 2023-01-19 2025-04-08 Citibank, N.A. Detection and mitigation of machine learning model adversarial attacks
US12608486B2 (en) 2023-01-19 2026-04-21 Citibank, N.A. Generating predicted end-to-end cyber-security attack characteristics via bifurcated machine learning-based processing of multi-modal data systems and methods
US20240296219A1 (en) * 2023-03-05 2024-09-05 Microsoft Technology Licensing, Llc Adverse or malicious input mitigation for large language models
CN121532780A (zh) * 2023-07-05 2026-02-13 软银集团股份有限公司 信息提供装置、信息提供方法以及信息提供程序
US12489798B2 (en) 2023-08-31 2025-12-02 Dell Products L.P. Managing artificial intelligence models to identify goals of malicious attackers
US20250077656A1 (en) * 2023-08-31 2025-03-06 Dell Products L.P. Managing impact of poisoned inferences on inference consumers using digital twins
US12602624B2 (en) 2023-12-11 2026-04-14 Citibank, N.A. Anomaly detection method for model outputs
US12602418B2 (en) 2024-04-11 2026-04-14 Citibank, N.A. Intelligent query decomposition, specialized model routing, and hierarchical aggregation with conflict resolution
US12596738B2 (en) 2024-04-11 2026-04-07 Citibank, N.A. Explainable large language model routing with immutable audit trails
US12346820B1 (en) 2024-04-11 2025-07-01 Citibank, N. A. Identifying and remediating gaps in artificial intelligence use cases using a generative artificial intelligence model

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2010067033A (ja) * 2008-09-11 2010-03-25 Sony Corp データ処理装置、データ処理方法、及び、プログラム
US9558176B2 (en) 2013-12-06 2017-01-31 Microsoft Technology Licensing, Llc Discriminating between natural language and keyword language items
US9697469B2 (en) 2014-08-13 2017-07-04 Andrew McMahon Method and system for generating and aggregating models based on disparate data from insurance, financial services, and public industries
KR102494139B1 (ko) * 2015-11-06 2023-01-31 삼성전자주식회사 뉴럴 네트워크 학습 장치 및 방법과, 음성 인식 장치 및 방법
US11080616B2 (en) * 2016-09-27 2021-08-03 Clarifai, Inc. Artificial intelligence model and data collection/development platform
CN106934462A (zh) 2017-02-09 2017-07-07 华南理工大学 基于迁移的对抗性环境下的防御毒化攻击的学习方法
CN108320026B (zh) * 2017-05-16 2022-02-11 腾讯科技(深圳)有限公司 机器学习模型训练方法和装置
CN107316083B (zh) * 2017-07-04 2021-05-25 北京百度网讯科技有限公司 用于更新深度学习模型的方法和装置
US10540578B2 (en) 2017-12-21 2020-01-21 International Business Machines Corporation Adapting a generative adversarial network to new data sources for image classification
CA3033014A1 (en) * 2018-02-07 2019-08-07 Royal Bank Of Canada Robust pruned neural networks via adversarial training
US11195120B2 (en) * 2018-02-09 2021-12-07 Cisco Technology, Inc. Detecting dataset poisoning attacks independent of a learning algorithm
US10643602B2 (en) * 2018-03-16 2020-05-05 Microsoft Technology Licensing, Llc Adversarial teacher-student learning for unsupervised domain adaptation
CN108932527A (zh) 2018-06-06 2018-12-04 上海交通大学 使用交叉训练模型检测对抗样本的方法
CN108712448A (zh) 2018-07-09 2018-10-26 四川大学 一种基于动态污点分析的注入式攻击检测模型
CN109101999B (zh) 2018-07-16 2021-06-25 华东师范大学 基于支持向量机的协神经网络可信决策方法
US11568211B2 (en) 2018-12-27 2023-01-31 Intel Corporation Defending neural networks by randomizing model weights
CN109886210B (zh) * 2019-02-25 2022-07-19 百度在线网络技术(北京)有限公司 一种交通图像识别方法、装置、计算机设备和介质
CN109948663B (zh) 2019-02-27 2022-03-15 天津大学 一种基于模型抽取的步长自适应的对抗攻击方法
US11657162B2 (en) 2019-03-22 2023-05-23 Intel Corporation Adversarial training of neural networks using information about activation path differentials
US20190272375A1 (en) 2019-03-28 2019-09-05 Intel Corporation Trust model for malware classification
CN110222762A (zh) 2019-06-04 2019-09-10 恒安嘉新(北京)科技股份公司 对象预测方法、装置、设备、及介质

Similar Documents

Publication Publication Date Title
JP2022548770A5 (https=)
CN108363745B (zh) 机器人客服转人工客服的方法和装置
KR102158683B1 (ko) 외부 메모리로 신경망들 증강
GB2603391A (en) Mitigating adversarial effects in machine learning systems
US9484018B2 (en) System and method for building and evaluating automatic speech recognition via an application programmer interface
JP2022518646A5 (https=)
CN114529010B (zh) 一种机器人自主学习方法、装置、设备及存储介质
CN109661672B (zh) 使用强化学习利用外部存储器增强神经网络
CN109313540B (zh) 口语对话系统的两阶段训练
CN106462803B (zh) 增强具有外部存储器的神经网络
CN106471525A (zh) 增强神经网络以生成附加输出
WO2019127231A1 (en) Training data generators and methods for machine learning
JP2023024987A5 (https=)
EP3944233A3 (en) Orchestrating execution of a series of actions requested to be performed via an automated assistant
US12026049B1 (en) Resolving technology issues
US11468880B2 (en) Dialog system training using a simulated user system
CN111506814A (zh) 一种基于变分自注意力网络的序列推荐方法
CN115713939B (zh) 语音识别方法、装置及电子设备
WO2019222656A1 (en) Systems and methods for debugging neural networks with coverage guided fuzzing
JP7179672B2 (ja) 計算機システム及び機械学習方法
CN107977909A (zh) 具个人化学习路径自动产生机制之学习规划方法与学习规划系统
WO2021255888A1 (ja) アセスメント装置、代替モデル作成装置、アセスメント方法及びプログラム
CN112288032B (zh) 一种基于生成对抗网络的量化模型训练的方法及装置
JP2023103928A5 (https=)
JP2011053565A (ja) 信号分析装置、信号分析方法、プログラム、及び記録媒体