JPWO2020208444A5

Info

Publication number: JPWO2020208444A5
Application number: JP2021558964A
Authority: JP
Publication date: 2022-08-18

Claims

1. A computer-implemented method comprising:
receiving an initial version of a machine learning model (MLM) including a plurality of parameter values, a plurality of hyperparameter values, and an initial fairness value reflecting fairness with respect to the segmented relevant subgroups;
adjusting at least some of the parameter values and/or at least some of the hyperparameter values of the initial version of the MLM to create an interim version of the MLM;
determining an interim fairness value for the interim version of the MLM by operations comprising:
receiving a reinforcement learning meta-model (RLMM) defining a plurality of fairness-related goals and a reward function reflecting the plurality of fairness-related goals;
running the interim version of the MLM;
calculating, by the RLMM, a reward value based on the reward function while the interim version of the MLM is running; and
determining the interim fairness value for the interim version of the MLM based on the reward value;
determining that the interim fairness value is greater than the initial fairness value; and
responsive to the determination that the interim fairness value is greater than the initial fairness value, replacing the initial version of the MLM with the interim version of the MLM, and replacing the initial fairness value with the interim fairness value.
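
As an illustration only, one evaluate-and-replace pass of claim 1 might be sketched as follows. The claims do not specify a reward function or how the fairness value is derived from the reward value, so this sketch assumes a demographic-parity style reward and assumes the fairness value simply equals the reward value. The names `ModelVersion`, `parity_reward`, `accept_if_fairer`, and `run_model` are hypothetical and do not appear in the claims.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass
class ModelVersion:
    """Hypothetical container for one version of the MLM."""
    params: Dict[str, float]   # parameter and/or hyperparameter values
    fairness: float            # fairness value attached to this version


def parity_reward(predictions: Sequence[int], groups: Sequence[str]) -> float:
    """Assumed reward: 1 minus the largest gap in positive-prediction
    rate between subgroups (1.0 means all subgroups are treated alike)."""
    by_group: Dict[str, List[int]] = {}
    for pred, group in zip(predictions, groups):
        by_group.setdefault(group, []).append(pred)
    rates = [sum(preds) / len(preds) for preds in by_group.values()]
    return 1.0 - (max(rates) - min(rates))


def accept_if_fairer(
    initial: ModelVersion,
    interim_params: Dict[str, float],
    run_model: Callable[[Dict[str, float]], Sequence[int]],
    groups: Sequence[str],
) -> ModelVersion:
    """Run the interim version, score it, and keep it only if it is fairer."""
    predictions = run_model(interim_params)       # run the interim version
    reward = parity_reward(predictions, groups)   # reward value
    interim_fairness = reward                     # assumption: fairness == reward
    if interim_fairness > initial.fairness:
        # Replace both the model version and its fairness value.
        return ModelVersion(interim_params, interim_fairness)
    return initial


# Toy usage: a one-parameter threshold "model" over four records.
scores = [0.2, 0.8, 0.4, 0.9]
groups = ["a", "a", "b", "b"]


def run_model(params: Dict[str, float]) -> List[int]:
    return [int(score > params["threshold"]) for score in scores]


initial = ModelVersion({"threshold": 0.3}, parity_reward(run_model({"threshold": 0.3}), groups))
updated = accept_if_fairer(initial, {"threshold": 0.5}, run_model, groups)
```

In this toy data the interim threshold equalizes the positive-prediction rate across the two subgroups, so the interim version replaces the initial one. An interim version that scored no better would leave `initial` unchanged.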

2. The computer-implemented method of claim 1, further comprising iteratively repeating said operation until said initial fairness value exceeds a predetermined threshold.
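
The iteration in claim 2 might look like the following minimal sketch. The `adjust` and `evaluate` callables, the function name `improve_until_fair`, and the `max_rounds` safety cap are assumptions made for illustration; the claim itself only requires repeating the operations until the fairness value exceeds the predetermined threshold.

```python
from typing import Callable, Dict, Tuple

Params = Dict[str, float]


def improve_until_fair(
    params: Params,
    fairness: float,
    adjust: Callable[[Params], Params],
    evaluate: Callable[[Params], float],
    threshold: float,
    max_rounds: int = 1000,   # assumed safety cap, not part of the claim
) -> Tuple[Params, float]:
    """Repeat adjust -> evaluate -> conditional replace until the stored
    fairness value exceeds the threshold or the round budget runs out."""
    for _ in range(max_rounds):
        if fairness > threshold:
            break
        candidate = adjust(params)                # create an interim version
        candidate_fairness = evaluate(candidate)  # its interim fairness value
        if candidate_fairness > fairness:
            # The interim version becomes the new initial version.
            params, fairness = candidate, candidate_fairness
    return params, fairness


# Toy usage: fairness improves as a hypothetical "bias" parameter shrinks.
def halve_bias(p: Params) -> Params:
    return {"bias": p["bias"] * 0.5}


def toy_fairness(p: Params) -> float:
    return 1.0 - abs(p["bias"])


final_params, final_fairness = improve_until_fair(
    {"bias": 0.8}, toy_fairness({"bias": 0.8}), halve_bias, toy_fairness, threshold=0.9
)
```

The cap on rounds is a design choice for the sketch: without it, a run in which no adjustment ever improves fairness would never terminate.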

3. The computer-implemented method of claim 1 or 2, wherein the initial MLM is a supervised MLM.

4. The computer-implemented method of any one of claims 1-3, wherein the fairness-related goals include at least one of gender, age, nationality, religious beliefs, ethnicity, and orientation.

5. The computer-implemented method of any one of claims 1-4, further comprising linking the initial MLM to the reinforcement learning meta-model based on configuration and retrieval.

6. The computer-implemented method of any one of claims 1-5, wherein the plurality of parameter values comprises values of at least one of the following parameter types: weighting factors and activation function variables.

7. The computer-implemented method of any one of claims 1-6, wherein the plurality of hyperparameter values includes at least one value of the following hyperparameter types: activation function type, number of nodes per layer, number of layers of the neural network, and machine learning model.

8. A computer program product that causes a computer to perform the steps of the computer-implemented method of any one of claims 1-7.

9. A computer-readable storage medium recording the computer program according to claim 8.

10. A computer system comprising:
one or more computer processors;
one or more computer-readable storage media; and
program instructions stored on said one or more computer-readable storage media, said program instructions being configured to run on said one or more computer processors and to execute each step of the computer-implemented method of any one of claims 1-7.