CN113692594A - 通过强化学习的公平性改进 - Google Patents

通过强化学习的公平性改进 Download PDF

Info

Publication number
CN113692594A
CN113692594A CN202080027018.4A CN202080027018A CN113692594A CN 113692594 A CN113692594 A CN 113692594A CN 202080027018 A CN202080027018 A CN 202080027018A CN 113692594 A CN113692594 A CN 113692594A
Authority
CN
China
Prior art keywords
fairness
mlm
value
computer
program instructions
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202080027018.4A
Other languages
English (en)
Chinese (zh)
Inventor
G·查卢罗斯
F·弗勒特尔
F·格拉夫
P·勒斯滕伯格
S·勒维扎
E·斯洛特克
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Publication of CN113692594A publication Critical patent/CN113692594A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/082Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/048Activation functions
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/09Supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/092Reinforcement learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/0985Hyperparameter optimisation; Meta-learning; Learning-to-learn
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/01Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Acyclic And Carbocyclic Compounds In Medicinal Compositions (AREA)
CN202080027018.4A 2019-04-08 2020-03-18 通过强化学习的公平性改进 Pending CN113692594A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US16/377,727 US20200320428A1 (en) 2019-04-08 2019-04-08 Fairness improvement through reinforcement learning
US16/377,727 2019-04-08
PCT/IB2020/052465 WO2020208444A1 (en) 2019-04-08 2020-03-18 Fairness improvement through reinforcement learning

Publications (1)

Publication Number Publication Date
CN113692594A true CN113692594A (zh) 2021-11-23

Family

ID=72663093

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202080027018.4A Pending CN113692594A (zh) 2019-04-08 2020-03-18 通过强化学习的公平性改进

Country Status (6)

Country Link
US (1) US20200320428A1 (https=)
JP (1) JP2022527536A (https=)
CN (1) CN113692594A (https=)
DE (1) DE112020000537T5 (https=)
GB (1) GB2597406A (https=)
WO (1) WO2020208444A1 (https=)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115048425A (zh) * 2022-06-09 2022-09-13 深圳计算科学研究院 一种基于强化学习的数据筛选方法及其装置
WO2024120447A1 (zh) * 2022-12-09 2024-06-13 维沃移动通信有限公司 模型监督触发方法、装置、ue、网络侧设备、可读存储介质及通信系统

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11984199B2 (en) * 2019-08-02 2024-05-14 Kpn Innovations, Llc Methods and systems for generating compatible substance instruction sets using artificial intelligence
US11636386B2 (en) * 2019-11-21 2023-04-25 International Business Machines Corporation Determining data representative of bias within a model
US20210201400A1 (en) * 2019-12-27 2021-07-01 Lendingclub Corporation Intelligent servicing
US11556826B2 (en) * 2020-03-20 2023-01-17 Adobe Inc. Generating hyper-parameters for machine learning models using modified Bayesian optimization based on accuracy and training efficiency
US12050975B2 (en) 2020-05-06 2024-07-30 Discover Financial Services System and method for utilizing grouped partial dependence plots and shapley additive explanations in the generation of adverse action reason codes
US12321826B2 (en) 2020-05-06 2025-06-03 Discover Financial Services System and method for utilizing grouped partial dependence plots and game-theoretic concepts and their extensions in the generation of adverse action reason codes
US11551178B2 (en) * 2020-05-14 2023-01-10 Wells Fargo Bank, N.A. Apparatuses and methods for regulation offending model prevention
US12002258B2 (en) * 2020-06-03 2024-06-04 Discover Financial Services System and method for mitigating bias in classification scores generated by machine learning models
US12469075B2 (en) 2020-06-03 2025-11-11 Capital One Financial Corporation Computing system and method for creating a data science model having reduced bias
CN112163677B (zh) * 2020-10-14 2023-09-19 杭州海康威视数字技术股份有限公司 应用机器学习模型的方法、装置及设备
CN112257848B (zh) * 2020-10-22 2024-04-30 北京灵汐科技有限公司 确定逻辑核布局的方法、模型训练方法、电子设备、介质
US12554805B2 (en) 2020-11-27 2026-02-17 Amazon Technologies, Inc. Generating views for bias metrics and feature attribution captured in machine learning pipelines
US12547926B2 (en) 2020-11-27 2026-02-10 Amazon Technologies, Inc. Staged bias measurements in machine learning pipelines
WO2022115402A1 (en) * 2020-11-27 2022-06-02 Amazon Technologies, Inc. Staged bias measurements in machine learning pipelines
US12014287B2 (en) * 2020-12-04 2024-06-18 International Business Machines Corporation Batch scoring model fairness
CN112416602B (zh) * 2020-12-10 2022-09-16 清华大学 一种分布式数据流资源弹性伸缩增强插件及增强方法
CN112905465B (zh) * 2021-02-09 2022-07-22 中国科学院软件研究所 一种基于深度强化学习的机器学习模型黑盒公平性测试方法和系统
US12547673B2 (en) 2021-04-12 2026-02-10 International Business Machines Corporation Calculate fairness of machine learning model
US20220391683A1 (en) * 2021-06-07 2022-12-08 International Business Machines Corporation Bias reduction during artifical intelligence module training
EP4106231A1 (en) * 2021-06-14 2022-12-21 Google LLC Selection of physics-specific model for determination of characteristics of radio frequency signal propagation
US12443676B2 (en) 2021-10-13 2025-10-14 International Business Machines Corporation Controlling a bias of a machine learning module background
US20230222378A1 (en) * 2022-01-07 2023-07-13 Vittorio ROMANIELLO Method and system for evaluating fairness of machine learning model
US20230351172A1 (en) * 2022-04-29 2023-11-02 Intuit Inc. Supervised machine learning method for matching unsupervised data
US12561222B2 (en) * 2022-06-03 2026-02-24 Adobe Inc. Reducing bias in machine learning models utilizing a fairness deviation constraint and decision matrix
US20240020515A1 (en) * 2022-07-06 2024-01-18 University Of Southern California Systems and methods for a neural network database framework for answering database query types
US12475132B2 (en) 2023-02-20 2025-11-18 Capital One Financial Corporation Computing system and method for applying monte carlo estimation to determine the contribution of dependent input variable groups on the output of a data science model

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108062587A (zh) * 2017-12-15 2018-05-22 清华大学 一种无监督机器学习的超参数自动优化方法及系统
US20180341851A1 (en) * 2017-05-24 2018-11-29 International Business Machines Corporation Tuning of a machine learning system

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009237923A (ja) * 2008-03-27 2009-10-15 Nec Corp 学習方法およびシステム
US9008840B1 (en) 2013-04-19 2015-04-14 Brain Corporation Apparatus and methods for reinforcement-guided supervised learning
US20180082185A1 (en) * 2015-03-23 2018-03-22 Nec Corporation Predictive model updating system, predictive model updating method, and predictive model updating program
US10839302B2 (en) 2015-11-24 2020-11-17 The Research Foundation For The State University Of New York Approximate value iteration with complex returns by bounding
US11144616B2 (en) * 2017-02-22 2021-10-12 Cisco Technology, Inc. Training distributed machine learning with selective data transfers
JP6917004B2 (ja) * 2017-04-20 2021-08-11 オムロン株式会社 評価装置、評価方法及びそのプログラム
US11176487B2 (en) * 2017-09-28 2021-11-16 Oracle International Corporation Gradient-based auto-tuning for machine learning and deep learning models
CN109242105B (zh) * 2018-08-17 2024-03-15 第四范式(北京)技术有限公司 代码优化方法、装置、设备及介质

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180341851A1 (en) * 2017-05-24 2018-11-29 International Business Machines Corporation Tuning of a machine learning system
CN108062587A (zh) * 2017-12-15 2018-05-22 清华大学 一种无监督机器学习的超参数自动优化方法及系统

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
BRIAN HU ZHANG ET AL.: ""Mitigating Unwanted Biases with Adversarial Learning"", 《AIES\'18:PROCEEDINGS OF THE 2018 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY》, 27 December 2018 (2018-12-27), pages 335 - 340 *
CHRISTINA WADSWORTH ET AL.: ""Achieving Fairness through Adversarial Learning: an Application to Recidivism Prediction"", 《HTTPS://ARXIV.ORG/ABS/1807.00199》, 30 June 2018 (2018-06-30), pages 1 - 5 *
MATT OLFAT ET AL.: ""Spectral Algorithms for Computing Fair Support Vector Machines"", 《HTTPS://ARXIV.ORG/ABS/1710.05895》, 16 October 2017 (2017-10-16), pages 1 - 10 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115048425A (zh) * 2022-06-09 2022-09-13 深圳计算科学研究院 一种基于强化学习的数据筛选方法及其装置
WO2024120447A1 (zh) * 2022-12-09 2024-06-13 维沃移动通信有限公司 模型监督触发方法、装置、ue、网络侧设备、可读存储介质及通信系统

Also Published As

Publication number Publication date
WO2020208444A1 (en) 2020-10-15
GB2597406A (en) 2022-01-26
JP2022527536A (ja) 2022-06-02
DE112020000537T5 (de) 2021-10-21
US20200320428A1 (en) 2020-10-08

Similar Documents

Publication Publication Date Title
CN113692594A (zh) 通过强化学习的公平性改进
JP7521107B2 (ja) Aiモデルを更新する方法、装置、および計算デバイス、ならびに記憶媒体
US20220129791A1 (en) Systematic approach for explaining machine learning predictions
US20210158147A1 (en) Training approach determination for large deep learning models
US20210374544A1 (en) Leveraging lagging gradients in machine-learning model training
US12608232B2 (en) Determining available memory on a mobile platform
EP3745317B1 (en) Apparatus and method for analyzing time series data based on machine learning
WO2021012263A1 (en) Systems and methods for end-to-end deep reinforcement learning based coreference resolution
US20190311258A1 (en) Data dependent model initialization
KR102457893B1 (ko) 딥러닝 기반의 강수량 예측 방법
US20250272485A1 (en) Deep learning system for navigating feedback
US20220129789A1 (en) Code generation for deployment of a machine learning model
US20190228297A1 (en) Artificial Intelligence Modelling Engine
US20250323822A1 (en) Real-time monitoring ecosystem
CN119760655A (zh) 基于智能数据分析大模型的数据分析方法和装置
US11651276B2 (en) Artificial intelligence transparency
CN118113503A (zh) 一种智能运维系统故障预测方法、装置、设备和存储介质
WO2025072847A1 (en) Fraud analytics and modeling engine
US20220198320A1 (en) Minimizing processing machine learning pipelining
US20240029088A1 (en) System for customer churn prediction and prevention
US20250094801A1 (en) Neural network critical neuron selection
WO2025137770A1 (en) Apparatus and methods for determining a probability datum
WO2025101527A1 (en) Techniques for learning co-engagement and semantic relationships using graph neural networks
US20240152440A1 (en) Game performance prediction across a device ecosystem
US20250209308A1 (en) Risk Analysis and Visualization for Sequence Processing Models

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20211123