EP3274929A1 - Procédé d'explication d'un score - Google Patents

Procédé d'explication d'un score

Info

Publication number
EP3274929A1
EP3274929A1 EP16714822.0A EP16714822A EP3274929A1 EP 3274929 A1 EP3274929 A1 EP 3274929A1 EP 16714822 A EP16714822 A EP 16714822A EP 3274929 A1 EP3274929 A1 EP 3274929A1
Authority
EP
European Patent Office
Prior art keywords
score
vector
parameter
list
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP16714822.0A
Other languages
German (de)
English (en)
French (fr)
Inventor
Hélia Pouyllau
Christophe Labreuche
Bénédicte GOUJON
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Thales SA
Original Assignee
Thales SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thales SA filed Critical Thales SA
Publication of EP3274929A1 publication Critical patent/EP3274929A1/fr
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • G06N5/045Explanation of inference; Explainable artificial intelligence [XAI]; Interpretable artificial intelligence
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/11Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computing arrangements based on specific mathematical models
    • G06N7/01Probabilistic graphical models, e.g. probabilistic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/03Credit; Loans; Processing thereof

Definitions

  • the present invention relates to a method of explaining a score associated with a vector by a function.
  • the invention also relates to an associated computer program product and to an information carrier on which is encoded such a computer program product.
  • the invention also relates to a system for explaining a score associated with a vector by a function.
  • decision support systems are used to evaluate choices available to decision makers. For example, a banker evaluates a client's repayment capacity based on information such as monthly salary, the existence of previous loans, or the capital contributed by the client. These data are grouped together by the decision support system, which calculates a score from these data. The banker then decides, based on the score obtained by the client, whether to grant him a loan and, if so, at what rate.
  • the decision support system uses, to calculate the score, a mathematical model drawn from the expert knowledge of the field of application concerned.
  • a mathematical model drawn from the expert knowledge of the field of application concerned.
  • the mathematical model used may not apply in the same way to all situations and its complexity may make the overall score difficult to understand for the decision maker. This is all the more true as the number of parameters considered is high. It may therefore be important for the decision-maker to know which parameters played the most important role in calculating the score, so that the decision can be adapted accordingly.
  • each list comprising a second number of indicators, the second number being an integer strictly less than the first number and strictly greater than zero,
  • the third value being equal to the first corresponding value when the list does not include an indicator of the corresponding parameter, and different from the first corresponding value in the opposite case
  • the method comprises one or more of the following characteristics, taken separately or in any technically possible combination:
  • the importance indicators calculated during step e are either values of
  • the first set comprises a total number of lists, the total number satisfying the mathematical inequality:
  • step b comprises generating at least part of the lists of the first set according to a probability law relating to either the second number or the indicators contained in the list;
  • step b comprises generating all the lists whose second number is less than or equal to a second predetermined integer number strictly smaller than the first number;
  • the method further comprises a step a2 of providing a second score associated with a second vector containing second values of the parameters, and wherein, in step c, the third value is equal to the second value; corresponding when the list includes an indicator of the corresponding parameter;
  • each list comprising a second number of indicators, the second number being an integer strictly less than the first number and strictly greater than zero,
  • the third value being equal to the first corresponding value when the list does not include an indicator of the corresponding parameter, and different from the first corresponding value in the opposite case
  • FIG. 1 is a schematic view of an exemplary system for implementing an explanation method
  • FIG. 2 is a flowchart of a first example of implementation of an explanation method
  • FIG. 3 is a graphical representation of vectors used and / or generated by the explanation method of FIG. 2;
  • FIG. 4 is a flow diagram of another example of implementation of an explanation method
  • FIG. 5 is a flow chart of another exemplary implementation of an explanation method
  • FIG. 10 A system 10 and a computer program product 12 are shown in FIG.
  • the system 10 is a computer.
  • the system 10 is an electronic calculator adapted to manipulate and / or transform data represented as electronic or physical quantities in the registers of the system 10 and / or memories in other similar data corresponding to physical data in the data. memories, registers or other types of display, transmission or storage devices.
  • the system 10 includes a data processing unit 14 comprising a processor 16, memories 18 and an information carrier reader 20.
  • the system 10 also comprises a keyboard 22 and a display unit 24.
  • the computer program product 12 comprises a readable information medium 20.
  • a readable information medium 20 is a medium readable by the system 10, usually by the information processing unit 14.
  • the readable information medium 20 is a medium suitable for storing electronic instructions and capable of being coupled. to a bus of a computer system.
  • the readable information medium 20 is an optical disk, a CD-ROM, a magneto-optical disk, a ROM memory, a RAM memory, an EPROM memory, an EEPROM memory, a magnetic card or a memory card. optical card.
  • the program comprising program instructions.
  • the computer program is loadable on the data processing unit 14 and is adapted to result in the implementation of a method of explaining a score associated with a vector by a function when the computer program is implemented on the processor 16.
  • FIG. 2 illustrates an exemplary implementation of a method of explaining a score associated with a vector by a function .
  • the function is noted U in the following.
  • the U function associates a score with a vector.
  • the function U is, for example, an integral of Choquet.
  • the Choquet integral is an integral defined by the mathematician Gustave Choquet in 1953. The Choquet integral was initially used in the framework of statistical mechanics and potential theory before being applied to the theory of decision in the years
  • the Choquet integral makes it possible to represent, among other things, complex decision strategies such as veto criteria, preference criteria, synergies between criteria and redundancy between criteria.
  • the function U is a generalized additive independence model.
  • the Generated Additive Independence Model builds a function U by addition of utility functions u each relating to a subset of parameters.
  • Generalized Additive Independence models allow the modeling of decisions where the parameters are not independent individually, but where we observe independence between subsets of parameters (not necessarily disjoint).
  • a parameter can be used by several utility functions u when the parameter is present in several subsets. The importance of a parameter in the score obtained is therefore difficult to establish.
  • the function U is a weighted sum.
  • Each score is noted Uv, where V is the name of the vector.
  • V is the name of the vector.
  • the first associated score is noted Ux
  • the second associated score is noted Uy.
  • the score Ux, Uy is a unique value associated with the vector X, Y by the function U.
  • the score Ux, Uy is preferably a real number.
  • Each vector X, Y is a set of values xi, yi, each value xi, yi being associated with a parameter.
  • each parameter is distinct.
  • Each vector X, Y contains a first number N- ⁇ of values xi, yi of parameters.
  • Each value xi, yi is indicated by an index i representative of the position of the value xi, yi in each vector X, Y.
  • each vector X, Y takes the form of an N1 -uplet of values xi, yi.
  • Each vector X, Y comprises a single value xi, yi of each parameter.
  • each vector X, Y takes the form of an N1 -uplet of values xi, yi, and the value xi, yi associated with the parameter Xi occupies the same position in each vector X, Y.
  • the first number N1 is an integer strictly greater than one.
  • the first number N1 is greater than or equal to 3, preferably greater than or equal to 10, preferably greater than or equal to 15, preferably greater than or equal to 20, preferably greater than or equal to 50, preferably greater than or equal to 50. equal to 100.
  • Each parameter is a feature of an entity.
  • the parameters are denoted Xi, where i is the index representative of the position of the value xi, yi associated with the parameter Xi in each vector X, Y.
  • the index i is an indicator of the parameter Xi. This means that each index i is associated with a single parameter Xi, and vice versa.
  • the parameters Xi are therefore ordered by the index i.
  • the index i is an integer.
  • the index i is, on the one hand, greater than or equal to one, and on the other hand less than or equal to the first number N1.
  • the entity is a physical object, a concept, an action, or a set of actions or a set of physical objects.
  • the entity comprises five parameters which are, in order: a first parameter X1, a second parameter X2, a third parameter X3, a fourth parameter X4, and a fifth parameter X5.
  • the entity is an automobile.
  • the first parameter X1 is the price in Euros of the car
  • the second parameter X2 is the number of seats in the car
  • the third parameter X3 is the number of doors of the car
  • the fourth parameter X4 is the fuel consumption in liters per hundred kilometers traveled
  • the fifth parameter X5 is the horsepower of the engine.
  • Each parameter X can take a plurality of values xi, yi included in a respective range Pi.
  • the beach Pi is, for example, continuous.
  • each value x4, y4 of the fourth parameter X4 (“consumption") is a real number strictly greater than 2 and strictly less than 15.
  • the beach Pi is a discrete beach.
  • each value x2, y2 of the second parameter X2 ("number of places") is an integer greater than or equal to 1 and less than or equal to 8.
  • the explanation method comprises a step 100 of supplying, a step 1 10 of generating a first set E1 of lists Lj, a step 120 of initialization, a step 130 of selecting a list Lj, a step 140 of generating a third vector, a calculation step 150, an evaluation step 160, a selection step 170 of a set of parameters Xi, and a step 180 of developing an explanation.
  • a first score Ux and a second score Uy are provided.
  • the first score Ux is associated with a first vector X by the function U.
  • the first vector X contains the first number N1 of the first values xi of the parameters Xi.
  • the second score Uy is associated with a second vector Y by the function U.
  • the first score Ux is, for example, strictly greater than the second score Uy.
  • the second vector Y contains the first number N1 of second values yi of the parameters Xi.
  • the first vector X and the second vector Y represent two automobiles A and B to be compared.
  • the first vector X (10,000; 5; 3; 4,5; 75) represents automobile A and the second vector Y (12,000; 6; 5; 4,8; 80) represents automobile B.
  • the first vector X and the second vector Y are generated by an acquisition device external to the system 10, and transmitted by the acquisition device to the system 10 which calculates the first score Ux from the first vector X, and the second score Uy from the second vector Y.
  • the first vector X is transmitted by a user to the system 10 via the keyboard 22.
  • the user first enters the first value x1, equal to 10000 and representing the price of the car A.
  • the user types the first value x2, equal to 5 and representing the number of seats of the automobile A.
  • the user types the first value x3 equal to 3 and represents the number of doors of automobile A.
  • the user transmits the first value x4 equal to 4.5 and representing the fuel consumption of the automobile A.
  • the user types the first value x5 equal to 75 and representing the engine power of the automobile A.
  • the first score Ux and the second score Uy are provided by a device external to the system 10.
  • a first set E1 of lists is generated.
  • Each list is denoted Lj, where j is an index locating the position of the list Lj in the first set E1.
  • Each list Lj comprises at least one indicator of a parameter Xi.
  • each indicator included in the list Lj is denoted ij. Indeed, each indicator ij is associated with a parameter Xi, and contained in a list Lj.
  • Each indicator is associated with a single parameter Xi, and vice versa.
  • Each list Lj is thus able to define a subset SE2j of parameters Xi. This means that a parameter Xi is in the subset SE2j associated with the list Lj if and only if the list Lj contains the indicator ij associated with the parameter Xi.
  • Each list L comprises a second number N2j of indicators ij of the parameters Xi .
  • the second number N2j is an integer strictly greater than zero.
  • the second number N2j is strictly less than the first number N1.
  • the first set E1 contains a first list L1, a second list L2 and a third list L3.
  • the first list L1 contains the indicator 1 1 of the first parameter X1, the indicator
  • the second list L2 contains the indicator 22 of the second parameter X2, and the indicator 42 of the fourth parameter X4.
  • the third list L3 contains the indicator 13 of the first parameter X1, and the indicator 23 of the second parameter X2.
  • the list Lj is a N2j-tuple of indicators ij, each indicator ij of the list Lj being an integer equal to the index i of the corresponding parameter Xi.
  • the first list L1 is (1, 3,5)
  • the second list L2 is (2,4)
  • the third list L3 is (1,2)
  • each indicator ij is an integer equal to 1.
  • the list Lj is an N1 -uplet containing N2j indicators ij and elements ei, j. Each element ei, j is a number equal to zero.
  • the indicator ij associated with the parameter Xi is located in the nth position in the list Lj.
  • the first list L1 is the 5-tuple (1; 0; 1; 0; 1)
  • the second list L2 is the 5-tuple (0; 1; 0; 1; 0)
  • the third list L3 is the 5-tuple (1; 1; 0; 0; 0).
  • each indicator ij is the name of the corresponding parameter Xi.
  • the first set E1 comprises a plurality of first subsets SE1 each comprising a third number N3 of lists Lj.
  • Each first subset SE1 groups all the lists Lj comprising the same second number N2j of indicators ij.
  • the first set E1 contains a total number Nt of lists Lj.
  • the total number Nt of lists Lj is greater than or equal to the square of the first number N1.
  • each list Lj generated contains at most two indicators ij.
  • the total number Nt of lists Lj is less than or equal to the product of a coefficient Co and the maximum number Nm of possible lists Lj.
  • the maximum number Nm is equal to two to the power of the first number N1.
  • the inequality is written mathematically: Nt ⁇ Co * 2 Nl
  • the coefficient Co is a predetermined real number.
  • the coefficient Co is, for example, strictly less than one.
  • the coefficient Co is equal to one tenth.
  • At least part of the lists Lj of the first set E1 are generated according to a probability law P.
  • the probability law P relates, for example, to the indicators ij contained in each list Lj.
  • lists Lj are generated randomly according to a Latin hypercube algorithm. This means that the third number N3 of each first subset SE1 is equal to a first predetermined number Np, the first predetermined number Np being common to each first subset SE1, and the lists Lj of each first subset SE1 are generated randomly.
  • each list are generated according to a probability law P in which each indicator ij has a probability equal to that of each other indicator ij to be contained in the list Lj.
  • the first set E1 comprises three lists Lj containing a single indicator ij, three lists containing two indicators ij, and so on.
  • the probability law P relates to the second number N2j of each list. This means that the probability law P is used by the system 10 to determine the number of lists Lj of each first subset SE1, and the lists Lj of each first subset SE1 are generated randomly.
  • the third number N3 of each first subset SE1 is calculated, during the step 1 10 of generating the first set E1, according to a Gauss law (also called "normal law").
  • the Gaussian law is a law of probability in which the probability density f (N3) is given by the equation:
  • the probability law P is a uniform or normal law relating to the total number of lists Nt.
  • the coefficient Co is equal to one. This means that all possible Lj lists are generated.
  • the importance indicator Ri is a magnitude representative of the importance that the parameter Xi has in calculating the first score Ux and the second score Uy.
  • the importance indicator Ri is representative of the importance of the parameter Xi in the difference between the first score Ux and the second score Uy.
  • the importance indicator Ri is an element of a totally ordered space. This means that each indicator of importance Ri is suitable to be compared with each other indicator of importance Ri.
  • the indicator of importance Ri is preferably a real number.
  • the indicator of importance Ri is, for example, a value of Shapley.
  • Shapley is a greatness developed within the framework of game theory. As part of a cooperative game in which players must come together to achieve a goal, Shapley's value is assessed for each player from the number of winning coalitions comprising the player and becoming losers if the player withdraws from Coalition. The value of Shapley therefore makes it possible to evaluate the power of a player (that is to say his ability to influence the final result), and is, in particular, used to determine the distribution of winnings in case of victory. .
  • Shapley Ri is evaluated for each parameter Xi and makes it possible to evaluate the importance of the parameter Xi in the difference between the first score Ux and the second score Uy.
  • the importance indicator Ri is a Sobol index.
  • Sobol indices are estimates of the sensitivity of a function of at least one parameter to a variation of this parameter.
  • the first-order sensitivity index of the function U to the parameter Xi is defined from the variance V (U) of the function U by the equation:
  • Xi) is the expectation of the function U when the parameter Xi is fixed.
  • each indicator of importance Ri is set equal to zero.
  • a list Lj is selected. For example, if no list Lj has been selected during a previous selection step 130, the list L1 of index j equal to one is selected.
  • the index j of the list Lj is, for example, stored in the memory 18.
  • the list L1 (1; 0; 1; 0; 1) is selected.
  • a third vector Cj and at least a fourth vector Di, j are generated.
  • the third vector Cj comprises third values ci, j parameters Xi.
  • the third vector Cj is generated from the selected list Lj and at least from the first vector X.
  • Each third value ci, j is equal to the first value xi corresponding when the list Lj does not include an indicator ij of the corresponding parameter Xi.
  • the third value ci, j is different from the first value xi.
  • the list Lj comprises an indicator ij of the parameter Xi
  • the third value ci, j corresponding is equal to the second value yi corresponding.
  • the third vector C1 (12,000; 5; 5; 4,5; 80) is generated from the list L1 (1; 0; 1; 0; 1), the first vector X (10,000; 5; 3; 4,5; 75); and second vector Y (12,000; 6; 5; 4,8; 80).
  • the first vector X, the second vector Y and the third vector C1 are represented graphically in three dimensions in FIG. 3. Only the first parameter X1, the second parameter X2 and the third parameter X3 have been represented, for the sake of clarity.
  • a fourth vector is also generated, for each parameter Xi, from the third vector Cj.
  • the fourth vector is denoted Di, j, and denoted by the index j of the third vector Cj and by the index i of the parameter Xi.
  • Each fourth vector Di, j comprises fourth values of the parameters Xi.
  • Each fourth value of the fourth vector Di, j is denoted di, j, m, where m is an integer index identifying the position of the fourth value di, j, m in the fourth vector Di, j.
  • the index m of the fourth value di, j, m is greater than or equal to one, and less than or equal to the first number N1.
  • Each fourth value di, j, m is equal to the third value ci, j corresponding if the index m of the fourth value di, j, m is different from the index i of the parameter Xi.
  • the fourth value di, j, m is different from the third value ci, j.
  • the fourth value di, j, m is equal to the first value xi if the third value ci, j is equal to the second value yi, and is equal to the second value yi if the third value ci, j is equal to at the first value xi.
  • the fourth vector Di, j associated with parameter Xi of index i is constructed from the third vector Cj, replacing the third value ci, j of index i, and only this one, by depending on the case, the first value xi or the second value yi corresponding so that the fourth vector Di, j is different from the third vector Cj.
  • the fourth vector D1, 1 (10,000; 5; 5; 4,5; 80) is generated for the first parameter X1. Only the fourth index value m, which is equal to one, is different from the corresponding third value c1, 1.
  • the fourth vector D2,1 (12,000; 6; 5; 4,5; 80) is generated for the second parameter X2. Only the fourth index value m, which is equal to two, is different from the corresponding third c3,2 value.
  • the fourth vector D3,1 (12,000; 5; 3; 4,5; 80) is generated for the third parameter X3. Only the fourth index value m, which is equal to three, is different from the third corresponding value c3.1.
  • the fourth vector D4.1 (12000; 5; 5; 4.8; 80) is generated for the fourth parameter X4. Only the fourth index value m, which is equal to four, is different from the third corresponding value c4.1.
  • the fourth vector D5,1 (12,000; 5; 5; 4,5; 75) is generated for the fifth parameter X5. Only the fourth index value m, which is equal to five, is different from the third corresponding value c5.1.
  • a third score Ucj is calculated from the third vector Cj and a fourth score Udi, j is calculated from each fourth vector Di, j
  • the third score Ucj and the fourth scores Udi, j are computed by the information processing unit 14.
  • the third vector Cj and the fourth vectors Di, j are transmitted to an external computing device that the information processing unit 14 is able to control.
  • the computing device transmits back to the system 10 the third score Ucj and the fourth scores Udi, j.
  • the importance indicator Ri of each parameter Xi is evaluated.
  • the importance indicator Ri is evaluated from the third score Ucj and the fourth scores Udi, j.
  • the absolute value of the difference between the third score Ucj and the fourth score Udi, j associated with the parameter Xi is calculated.
  • the value of the importance indicator Ri is then increased by a quantity Q equal to the calculated absolute value, divided by the total number Nt of lists Lj of the first set E1.
  • the indicator of importance Ri evaluated is then an approximation of a value of Shapley.
  • this evaluation step 160 only the parameters Xi for which the first value xi is different from the second value yi are considered. This means that, when the first value xi and the second value yi are equal, the importance indicator Ri of the corresponding parameter Xi is not evaluated during this evaluation step 160.
  • step 130 of selection is repeated.
  • the reiteration is represented in FIG. 2 by an arrow 165.
  • the step 130 of selecting a list Lj, the step 140 of generating a third vector Cj, the step 150 of calculating at least one score and the step 160 of evaluating at least one importance indicator Ri are then reiterated in the order presented above for another list Lj.
  • the index j stored is increased by one, and the step 130 of selecting a list Lj, the step 140 of generating a third vector Cj, the step 150 of calculating at least one score and the step 160 of evaluating at least one indicator of importance Ri are implemented for the corresponding list Lj.
  • the evaluation step 160 is followed by a step 170 of selecting at least one parameter Xi.
  • the evaluation step 160 is followed by the step 170 of selecting at least one parameter Xi.
  • importance R4 and a fifth indicator of importance R5 were evaluated.
  • the first importance indicator R1 is associated with the first parameter X1
  • the second importance indicator R2 is associated with the second parameter X2, and so on.
  • the first indicator of importance R1 is equal to 0.5
  • the second indicator of importance R2 is equal to 0.05
  • the third indicator of importance R3 is equal to 0.10
  • the fourth indicator of importance R4 is equal to 0.30
  • the fifth indicator of importance R5 is equal to 0.05.
  • the parameter Xi which has the greatest importance in the difference between the first score Ux and the second score Uy is the first parameter X1.
  • the parameters Xi which have the smallest importance in the difference between the first score Ux and the second score Uy are the second parameter X2 and the fifth parameter X5.
  • step 170 of selecting at least one parameter Xi a fourth set E4 of parameters Xi is selected.
  • the fourth set E4 comprises a fourth number N4 of parameters Xi.
  • the fourth number N4 is, for example, predetermined.
  • the fourth number N4 is determined, during the step 170 of selecting at least one parameter Xi, from the estimated importance indicators Ri.
  • the parameters Xi selected during the selection step 170 of at least one parameter Xi are the parameters Xi whose importance indicators Ri are the highest.
  • the fourth number N4 is equal to two, and the first parameter X1 associated with the first indicator of importance R1 equal to 0.5 and the fourth parameter X4 associated with the fourth indicator of importance R4 equal at 0.30 are selected.
  • the explanation Ex is, for example, a relative explanation of the first score Ux with respect to the second score Uy.
  • the Ex explanation is likely to explain to a decision maker the difference between the first Ux score and the second Uy score.
  • the Ex explanation is own to explain to a decision maker the reasons that a Ux score, Uy is greater than another Ux score, Uy.
  • the Ex explanation is elaborated from the selected Xi parameters.
  • the Ex explanation includes a list of selected Xi parameters.
  • the Ex explanation includes, for example, a hole phrase such as: "[1] is preferred to [2] because of", where [1] is the name of the entity associated with the highest score.
  • the hole phrase is followed by the list of selected Xi parameters.
  • the explanation is displayed on the display unit 24.
  • the method of explaining the score Ux does not assume that the function U is known.
  • the explanation method can therefore be used in conjunction with an external device for calculating a score whose operating details are not known. Only the first and second scores Ux, Uy and the first and second associated vectors x, y are known.
  • the evaluation selection step 160 is followed by the step 170 of selecting at least one parameter Xi if all the lists Lj have not yet been selected and, when the step 160 of evaluation the quantity Q is strictly lower than a predetermined threshold S.
  • the explanation method does not take into account some non-significant lists Lj (for which the quantity Q is less than the threshold S). The explanation process is therefore faster to implement.
  • FIG. 4 A flowchart of a third exemplary implementation of an explanation method is shown in FIG. 4.
  • the function U has a global maximum Ug associated with a first extreme vector G, and a global minimum Ub associated with a second extreme vector B.
  • the global maximum Ug and the overall minimum Ub of the function U are stored in the memory 18.
  • the global maximum Ug and the global minimum Ub of the function U have been obtained by a mathematical analysis of the function U.
  • the global maximum Ug and the overall minimum Ub of the function U are obtained by an experimental method. For example, all possible X, Y vectors are generated and the score Ux, Uy associated with each of the vectors X, Y is calculated.
  • Step 1 of generating a first set E1, step 120 of initialization, step 130 of selecting a list Lj, step 140 of generating a third vector Cj, the step 150 of calculating at least one score, the step 160 of evaluating at least one indicator of importance Ri, and the step 170 of selecting a set of parameters Xi are then implemented from the first vector x and the first extreme vector G.
  • step 1 of generating a first set E1 of step 120 of initialization, of step 130 of selecting a list Lj, of step 140 of generating a third vector Cj, step 150 of calculating at least one score, step 160 of evaluating at least one indicator of importance Ri, and step 170 of selecting a set of parameters Xi
  • the global maximum Ug is used instead of the second score Uy
  • the first extreme vector G is used in place of the second vector Y.
  • a first indicator of importance Rpi is obtained for each parameter Xi.
  • a fifth set E5 of parameters Xi is selected on the basis of the first estimated importance indicators Rpi.
  • step 1 of generating a first set E1 step 120 of initialization, step 130 of selecting a list Lj, step 140 of generating a third vector Cj, l step 150 of calculating at least one score, step 160 of evaluating at least one indicator of importance Ri, and step 170 of selecting a set of parameters Xi are implemented from of the first vector x and the second extreme vector B.
  • step 1 of generating a first set E1 of step 120 of initialization, of step 130 of selecting a list Lj, of step 140 of generating a third vector Cj, of step 150 of calculating at least one score, of step 160 of evaluating at least one indicator of importance Ri, and step 170 of selecting a set of parameters Xi
  • the global minimum Ub is used instead of the second score Uy
  • the second extreme vector B is used in place of the second vector Y.
  • a second indicator of importance Rdi is obtained for each parameter Xi.
  • a sixth set E6 of parameters Xi is selected on the basis of the second evaluated importance indicators Rdi.
  • the explanation method then makes it possible to explain the first score Ux absolutely. This means that the method is able to explain the first score Ux without comparing it to a second score Uy.
  • FIG. 1 A flowchart of a fourth example of implementation of an explanation method is shown in FIG.
  • the coefficients of Mobius are the elements obtained, from a first sequence, by the inversion formula of Mobius.
  • the inversion formula of Mobius was introduced into the theory of numbers during the nineteenth century by August Anthony Mobius.
  • m (Lj) ⁇ (- ⁇ ) m v (T) in which:
  • T is a secondary list containing indicators ij and included in the list Lj
  • v (T) is the difference between a third score Ut associated with a third vector Cj and the first score Ux, the third vector Cj being calculated from the secondary list T.
  • the function U is k-additive. This means that there exists a second predetermined number k for which the Mobius coefficient m (Lj) of each list Lj is equal to zero if the second number N2j of the list Lj is strictly greater than the second predetermined number k.
  • the second predetermined number k is an integer strictly greater than 1 and strictly less than the first number N1.
  • the second predetermined number k is equal to 3.
  • the second predetermined number k is equal to 2.
  • a third subset SE3i of lists Lj is also defined.
  • the third subset SE3i each comprises lists Lj including the indicator ij of the parameter Xi.
  • Each third subset SE3i is therefore included in the first set E1.
  • the initialization step 120 and the selection step 130 are not implemented.
  • the third vector associated with each list Lj is generated.
  • the evaluation step 160 comprises an initialization sub-step 162, then a sub-step 165 for calculating at least one Mobius coefficient, and then a sub-step 167 for evaluating at least one indicator. of importance Ri.
  • the coefficient m (Lj) is a real number equal to the difference between the third score Ucj associated with the third vector Cj generated from the list Lj and the first score Ux. This is written mathematically:
  • An iterator is also generated.
  • the iterator n is set equal to zero.
  • n: n + l
  • a reduced list Lrj is generated, for each list Lj, if the list Lj contains an indicator ij of the parameter Xn of index equal to the iterator n,
  • the reduced list Lrj is constructed, starting from the list Lj, by the deletion of the indicator ij associated with the parameter Xn of index equal to the iterator n. This means that the reduced list Lrj contains each of the indicators ij contained in the list Lj, with the exception of the indicator ij equal to the iterator n.
  • the reduced list Lr1 (0; 0; 1; 0; 1) is generated from the list L1 (1; 0; 1; 0; 1 ).
  • the coefficient m (Lj) of each list is set equal to the difference between the coefficient m (Lj) associated with the list Lj, and the coefficient m (Lr, j) associated with the reduced list Lrj. This is written mathematically:
  • the first calculation sub-step 165 is repeated.
  • the reiteration is represented in FIG. 5 by an arrow 190.
  • the first calculation sub-step 165 is followed by a second calculation sub-step 167.
  • the importance indicator Ri associated with each parameter Xi is evaluated.
  • Each indicator of importance Ri is equal to the sum, for each list Lj containing the indicator ij of the parameter Xi, of the ratio between the coefficient m (Lj) of index associated with the list Lj and the second number N2j of the list Lj.
  • the step 170 of selecting at least one parameter Xi is implemented.
  • the importance indicator Ri associated with each parameter is then a value of Shapley and not an approximation of a value of Shapley.
  • step 170 of selecting at least one parameter Xi at least two fourth subsets SE4m of parameters are selected.
  • a fifth number N5 of fourth subsets SE4 are generated from the importance indicators Ri received.
  • each fourth subset SE4 will be identified by an index m and noted SE4m.
  • Each fourth subset SE4m contains at least one parameter Xi.
  • the fourth subsets SE4m are generated according to a data partitioning method.
  • Data partitioning (or data clustering in English) is one of the statistical methods of data analysis. Data partitioning aims at dividing a set of data into different homogeneous subsets, in that the data of each subset share common characteristics, which most often correspond to proximity criteria between objects.
  • the proximity criterion used is the difference between the indicators of importance Ri.
  • the parameters Xi are classified in the fourth subsets SE4m in ascending order of the calculated importance indices Ri. This means that each of the parameters Xi of the fourth subset SE41 of index m equal to 1 is associated with an indicator of importance Ri greater than each of the indicators of importance Ri associated with the parameters Xi of the fourth subset SE42 of index m equal to 2, and so on.
  • the explanation Ex takes the form of a sentence with holes of the type: "The automobile A is preferred to the automobile B first because of the parameters [1], then parameters [2], and finally because of the parameters [3] ", where [1] is a list of the parameters Xi of the fourth subset SE41, [2] is a list of the parameters Xi of the fourth subset SE42 and [3] is a list parameters Xi of the fourth subset SE43.
  • the explanation method automatically adjusts the number of parameters Xi used in the explanation Ex to the importance indicators Ri evaluated. The explanation Ex elaborated is therefore more relevant.
  • the explanation process does not depend on the U function used.
  • the explanation method is therefore adaptable to a large number of U functions.
EP16714822.0A 2015-03-26 2016-03-25 Procédé d'explication d'un score Pending EP3274929A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FR1500602A FR3034230A1 (fr) 2015-03-26 2015-03-26 Procede d'explication d'un score
PCT/EP2016/056738 WO2016151145A1 (fr) 2015-03-26 2016-03-25 Procédé d'explication d'un score

Publications (1)

Publication Number Publication Date
EP3274929A1 true EP3274929A1 (fr) 2018-01-31

Family

ID=54365293

Family Applications (1)

Application Number Title Priority Date Filing Date
EP16714822.0A Pending EP3274929A1 (fr) 2015-03-26 2016-03-25 Procédé d'explication d'un score

Country Status (8)

Country Link
US (1) US11195110B2 (he)
EP (1) EP3274929A1 (he)
CN (1) CN107660288B (he)
CA (1) CA2980875A1 (he)
FR (1) FR3034230A1 (he)
IL (1) IL254658B (he)
SG (1) SG11201707927RA (he)
WO (1) WO2016151145A1 (he)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11354292B2 (en) * 2018-05-16 2022-06-07 Fair Isaac Corporation Method and apparatus for analyzing coverage, bias, and model explanations in large dimensional modeling data

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5995728A (en) * 1995-06-30 1999-11-30 Forman; Ernest H. Computer implemented method of generating and displaying objectives from pros and cons for evaluating decision alternatives
US5717865A (en) * 1995-09-25 1998-02-10 Stratmann; William C. Method for assisting individuals in decision making processes
US5983220A (en) * 1995-11-15 1999-11-09 Bizrate.Com Supporting intuitive decision in complex multi-attributive domains using fuzzy, hierarchical expert models
US5832465A (en) * 1997-04-07 1998-11-03 General Electric Company Method for building a self-learning evidential reasoning system
US20030078900A1 (en) * 2001-06-29 2003-04-24 Dool Jacques Van Den Distributed decision processing system with advanced comparison engine
FR2848006A1 (fr) * 2002-11-29 2004-06-04 Thales Sa Procede permettant d'expliquer une decision prise par un modele d'agregation multicritere compensatoire
EP2523149B1 (en) * 2011-05-11 2023-01-11 Tata Consultancy Services Ltd. A method and system for association and decision fusion of multimodal inputs
CN102521496A (zh) * 2011-12-02 2012-06-27 北京启明星辰信息安全技术有限公司 评估指标的重要性级别的获取方法和系统
CN102830441A (zh) * 2012-05-16 2012-12-19 陶冶 包含数据挖掘的油气层综合解释评价方法
CN102930350B (zh) * 2012-10-25 2016-08-03 合肥工业大学 绿色产品设计方案的不确定性优化决策方法
EP2948895B1 (en) * 2013-01-25 2019-06-12 Services Petroliers Schlumberger Pressure transient testing with sensitivity analysis
CN104239020A (zh) * 2013-06-21 2014-12-24 Sap欧洲公司 决策做出标准驱动的推荐
CN103345552A (zh) * 2013-06-28 2013-10-09 广东电网公司电力调度控制中心 电力ict通信网可靠性的评估方法及系统
US20150142717A1 (en) * 2013-11-19 2015-05-21 Microsoft Corporation Providing reasons for classification predictions and suggestions
CN103927602A (zh) * 2014-04-28 2014-07-16 河南云卓数据服务有限公司 一种面向高考志愿填报的辅助决策方法
US10019542B2 (en) * 2015-04-14 2018-07-10 Ptc Inc. Scoring a population of examples using a model

Also Published As

Publication number Publication date
US20180060746A1 (en) 2018-03-01
SG11201707927RA (en) 2017-10-30
IL254658B (he) 2022-05-01
US11195110B2 (en) 2021-12-07
CA2980875A1 (fr) 2016-09-29
FR3034230A1 (fr) 2016-09-30
CN107660288A (zh) 2018-02-02
CN107660288B (zh) 2021-12-07
WO2016151145A1 (fr) 2016-09-29
IL254658A0 (he) 2017-11-30

Similar Documents

Publication Publication Date Title
De Mendonça et al. Is inflation targeting a good remedy to control inflation?
EP1470501B1 (fr) Procedes et systemes de recherche et d'association de ressources d'information telles que des pages web
US20190035015A1 (en) Method and apparatus for obtaining a stable credit score
US10545938B2 (en) Scoring members of a set dependent on eliciting preference data amongst subsets selected according to a height-balanced tree
EP3274929A1 (fr) Procédé d'explication d'un score
EP3143595B1 (fr) Procédé d'affichage d'images
EP3200384B1 (fr) Procédé d'execution de calcul cryptographique et application a la classification par machines a vecteurs de support
EP3051508B1 (fr) Système de jeu informatisé
FR3031823A1 (fr) Lemmatisateur semantique base sur des dictionnaires ontologiques.
FR3110727A1 (fr) Procédé, dispositif et programme d’ordinateur pour identifier et ordonner des événements sélectionnables par un utilisateur
WO2020025892A1 (fr) Procede et dispositif d'enregistrement d'un nouvel objet dans un catalogue
FR3110268A1 (fr) Procédés d’utilisation sécurisée d’un premier réseau de neurones sur une donnée d’entrée, et d’apprentissage de paramètres d’un deuxième réseau de neurones
WO2019086782A1 (fr) Procédé, dispositif et programme de composition d'une liste ordonnée d'éléments recommandés
Laslier Solutions de tournois: un spicilège
EP3827342B1 (fr) Procédé de correction de données entre deux processus périodiques et système automatisé mettant en oeuvre un tel procédé
US11478715B1 (en) User-controllable model-driven matchmaking
Elmi Detection of abrupt changes and autoregressive models
FR3064104B1 (fr) Procede de reconnaissance vocale d'une adresse
WO2021055455A1 (en) Collecting and levelling up predictions as collectible cards
Loomes Taking process into account when modeling risky choice
Chandra A framework model to study fear of failure and decision making among entrepreneurs at different stages of an enterprise
EP3920106A1 (fr) Procédé, programme d'ordinateur et dispositif d évaluation de la robustesse d'un réseau de neurones contre des perturbations d'images
FR3139683A1 (fr) Procédé d’enrichissement, dispositif électronique et produit programme d’ordinateur correspondant
FR3089666A1 (fr) Procédé de détermination de plages temporelles de disponibilités communes à une pluralité d’agendas, produit programme d'ordinateur, médium de stockage et dispositif correspondant
FR3003966A1 (fr) Procede d'adaptation dynamique d'un environnement logiciel execute a partir d'un terminal de communication d'un utilisateur, au cours d'une communication entre l'utilisateur et au moins un interlocuteur.

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20170929

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME

RIN1 Information on inventor provided before grant (corrected)

Inventor name: LABREUCHE, CHRISTOPHE

Inventor name: POUYLLAU, HELIA

Inventor name: GOUJON, BENEDICTE

DAV Request for validation of the european patent (deleted)
DAX Request for extension of the european patent (deleted)
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

17Q First examination report despatched

Effective date: 20200527

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

P01 Opt-out of the competence of the unified patent court (upc) registered

Effective date: 20230526