Disclosure of Invention
In view of the foregoing, the present disclosure provides a method and apparatus for decision optimization. By utilizing the method and the device, personalized decision optimization can be realized for different users.
According to an aspect of the present disclosure, there is provided a method for decision optimization, comprising: determining the contribution degree of a prediction result of each decision characteristic variable in a prediction model under user characteristic data, wherein the prediction model is established based on a decision target and comprises the decision characteristic variable and a non-decision characteristic variable; constructing a decision characteristic variable combination to be optimized based on the determined prediction result contribution of each decision characteristic variable; optimizing the variable values of each decision characteristic variable in the constructed decision characteristic variable combination so as to ensure that the corresponding prediction result of the prediction model is optimal; and performing decision optimization processing according to the variable values of the decision characteristic variables in the decision characteristic variable combination obtained after optimization processing.
Optionally, in an example of the above aspect, determining a prediction result contribution degree of each decision characteristic variable in the prediction model under the user characteristic data includes: and determining the prediction result contribution degree of each decision characteristic variable in the prediction model under the user characteristic data by using the interpretation model.
Optionally, in an example of the above aspect, the interpretation model comprises one of the following interpretation models: the Shap value model, the LIME model, and the DeepLift model.
Optionally, in an example of the above aspect, constructing a decision feature variable combination to be optimized based on the determined prediction result contribution degrees of the respective decision feature variables may include: based on the determined contribution degree of the prediction result of each decision characteristic variable, sequencing each decision characteristic variable; and selecting a preset number of decision characteristic variables with higher contribution degree from the sorted decision characteristic variables as a decision characteristic variable combination to be optimized.
Optionally, in an example of the above aspect, the optimizing the variable values of the decision characteristic variables in the constructed decision characteristic variable combination may include: optimizing the variable values of each decision characteristic variable in the constructed decision characteristic variable combination by using one of the following optimization algorithms: particle swarm algorithm, genetic algorithm and annealing algorithm.
Optionally, in an example of the above aspect, the optimizing the variable values of the decision characteristic variables in the constructed decision characteristic variable combination may include: and in a preset decision variable value range, optimizing the variable value of each decision characteristic variable in the constructed decision characteristic variable combination.
According to another aspect of the present disclosure, there is provided an apparatus for decision optimization, comprising: a contribution degree determination unit configured to determine a prediction result contribution degree of each decision characteristic variable in a prediction model under the user characteristic data, wherein the prediction model is created based on a decision target, and the prediction model comprises decision characteristic variables and non-decision characteristic variables; the decision characteristic variable combination construction unit is configured to construct a decision characteristic variable combination to be optimized based on the determined prediction result contribution degree of each decision characteristic variable; an optimization processing unit configured to perform optimization processing on the variable values of each decision characteristic variable in the constructed decision characteristic variable combination so as to optimize the corresponding prediction result of the prediction model; and the decision optimization unit is configured to perform decision optimization processing according to the variable values of the decision characteristic variables in the decision characteristic variable combination obtained after optimization processing.
Optionally, in an example of the above aspect, the contribution degree determining unit is configured to: and determining the prediction result contribution degree of each decision characteristic variable in the prediction model under the user characteristic data by using the interpretation model.
Optionally, in an example of the above aspect, the interpretation model comprises one of the following interpretation models: the Shap value model, the LIME model, and the DeepLift model.
Optionally, in an example of the above aspect, the decision feature variable combination construction unit includes: the sequencing module is configured to sequence the decision characteristic variables based on the determined prediction result contribution degrees of the decision characteristic variables; and the characteristic selection module is configured to select a predetermined number of decision characteristic variables with higher contribution degree from the sorted decision characteristic variables as a decision characteristic variable combination to be optimized.
Optionally, in one example of the above aspect, the optimizing processing unit is configured to: and optimizing the variable values of the decision characteristic variables in the constructed decision characteristic variable combination by using one of the following optimization algorithms: particle swarm algorithm, genetic algorithm and annealing algorithm.
Optionally, in one example of the above aspect, the optimizing processing unit is configured to: and in a preset decision variable value range, optimizing the variable value of each decision characteristic variable in the constructed decision characteristic variable combination.
According to another aspect of the present disclosure, there is provided a computing device comprising: at least one processor, and a memory coupled with the at least one processor, the memory storing instructions that, when executed by the at least one processor, cause the at least one processor to perform a method for decision optimization as described above.
According to another aspect of the present disclosure, there is provided a non-transitory machine-readable storage medium storing executable instructions that, when executed, cause the machine to perform the method for decision optimization as described above.
Detailed Description
The subject matter described herein will now be discussed with reference to example embodiments. It should be understood that these embodiments are discussed only to enable those skilled in the art to better understand and thereby implement the subject matter described herein, and are not intended to limit the scope, applicability, or examples set forth in the claims. Changes may be made in the function and arrangement of elements discussed without departing from the scope of the disclosure. Various examples may omit, substitute, or add various procedures or components as needed. For example, the described methods may be performed in an order different from that described, and various steps may be added, omitted, or combined. In addition, features described with respect to some examples may also be combined in other examples.
As used herein, the term "include" and its variants mean open-ended terms in the sense of "including, but not limited to. The term "based on" means "based at least in part on". The terms "one embodiment" and "an embodiment" mean "at least one embodiment". The term "another embodiment" means "at least one other embodiment". The terms "first," "second," and the like may refer to different or the same object. Other definitions, whether explicit or implicit, may be included below. The definition of a term is consistent throughout the specification unless the context clearly dictates otherwise.
Decision optimization methods and apparatuses according to embodiments of the present disclosure will be described in detail below with reference to the accompanying drawings.
Fig. 1 shows a flow diagram of a decision optimization method according to an embodiment of the present disclosure.
As shown in FIG. 1, at block 110, the prediction result contribution of each decision feature variable in the prediction model under the user feature data is determined. Here, the predictive model is created based on a decision target, and includes decision-making characteristic variables and non-decision-making characteristic variables. The user characteristic data is user characteristic data corresponding to the decision characteristic variable and the non-decision characteristic variable.
For example, assume that in a marketing scenario, the decision target is the number of users registered for conversion, and in a transaction risk control scenario, the decision target is the sum of the amounts lost by the users due to fraudulent transactions. For this decision objective, an appropriate set of characteristic variables is created to build a predictive model to predict the decision objective based on the user characteristic data, such as predicting whether the user is converting or not, or predicting the amount of money lost by fraud in the user's transaction. In the present disclosure, the term "non-decision feature variable" refers to a feature variable that a decision party cannot interfere with, such as the age, the character, the past behavior history of the user, and the like. The term "decision characteristic variable" refers to a characteristic variable that a decision party can intervene in changing, for example, a characteristic variable such as a benefit and a profit issued to a user in a case where a decision target is the number of registered conversion persons of the user in a marketing scenario, or a characteristic variable such as a risk reminder output to the user in a transaction risk control scenario.
In addition, in the present disclosure, the prediction model may be created by using training data based on a decision target in advance, or may be created in real time by using training data after the decision target is obtained. Also, the decision target may be input in advance or in real time. In addition, in order to improve the accuracy of the prediction model, the training data used for prediction model training should cover the user population with as many features as possible, and try as many decision variable values as possible, thereby ensuring the richness of the training data dimension.
In one example of the present disclosure, an interpretation model may be used to determine the prediction result contribution of various decision feature variables in the predictive model under the user feature data. The interpretation model comprises one of the following interpretation models: the Shap value model, the LIME model, and the DeepLift model. How to use the interpretation model to determine the prediction result contribution of each decision characteristic variable in the prediction model under the user characteristic data will be described below with reference to fig. 2.
After the prediction result contribution of each decision characteristic variable is determined as described above, at block 120, a decision characteristic variable combination to be optimized is constructed based on the determined prediction result contribution of each decision characteristic variable. How to construct the decision feature variable combination to be optimized based on the determined prediction result contribution degrees of the respective decision feature variables will be explained below with reference to an example shown in fig. 3.
After the decision feature variable combination to be optimized is constructed, in block 130, optimization processing is performed on the variable values of each decision feature variable in the constructed decision feature variable combination, so that the corresponding prediction result of the prediction model is optimal.
For example, in one example, the optimizing the variable values of the decision characteristic variables in the constructed decision characteristic variable combination may include: optimizing the variable values of each decision characteristic variable in the constructed decision characteristic variable combination by using one of the following optimization algorithms: particle swarm algorithm, genetic algorithm and annealing algorithm. Here, the optimization algorithm used may be determined based on the type of decision feature variable to be subjected to the optimization process. For example, in the case that the decision characteristic variable to be optimized is a classification variable, the optimization algorithm preferably uses a particle swarm algorithm, such as a discrete particle swarm algorithm.
In addition, in another example of the present disclosure, the optimizing the variable values of the decision characteristic variables in the constructed decision characteristic variable combination may include: and in a preset decision variable value range, optimizing the variable value of each decision characteristic variable in the constructed decision characteristic variable combination. In addition, in order to make the optimization effect better, the value range of the decision variable can be properly adjusted according to the actual situation.
After determining the variable values of each decision characteristic variable in the decision characteristic variable combination, at block 140, a decision optimization process is performed according to the variable values of each decision characteristic variable in the decision characteristic variable combination obtained after the optimization process. For example, the decision mechanism used in the decision engine is optimally adjusted by using the variable values of the decision characteristic variables.
Fig. 2 shows a flowchart of one example of a prediction result contribution degree determination process of a decision feature variable according to an embodiment of the present disclosure.
As shown in fig. 2, for each decision feature variable in the predictive model, first, at block 210, one decision feature variable is selected as the initial current decision feature variable. For example, one decision feature variable may be randomly selected from the various decision feature variables as the initial current decision feature variable.
Then, for the current decision feature variable, the operations of blocks 220 to 270 are performed until the corresponding prediction result contribution is determined for all decision feature variables. Specifically, at block 220, the prediction results corresponding to any combination of the remaining decision characteristic variables in the decision characteristic variable combination other than the current decision characteristic variable are determined. For example, assuming that there are 4 decision feature variables V1, V2, V3, V4, and V1 as current decision feature variables in the prediction model, the prediction model is used to calculate a model prediction result in the case where the input of the prediction model is user feature data corresponding to any feature combination of V2, V3, and V4. Next, at block 230, the prediction results corresponding to any feature variable combination of the decision feature variable combinations that includes the current decision feature variable are determined. The feature variable combination described here is a feature variable combination obtained by adding the feature variable V1 for each feature combination in the block 220. Then, at block 240, the difference between the model prediction result determined at block 220 (without the current decision characteristic variable) and the corresponding model prediction result determined at block 230 (with the current decision characteristic variable) is calculated.
After the differences between the model predictions are determined, at block 350, the differences between all the model predictions are averaged to obtain the prediction contribution for the current decision feature variable. Then, at block 260, it is determined whether there are unprocessed decision feature variables. If so, then in block 270, a decision feature variable is selected from the unprocessed decision feature variables as the next current decision feature variable, and then back to block 220, the operations of blocks 220 through 270 are re-performed.
FIG. 3 shows a flow diagram of one example of a decision feature variable combination construction process, according to an embodiment of the present disclosure.
As shown in FIG. 3, first, at block 310, the decision feature variables are ranked based on their determined prediction result contribution. Then, at block 320, a predetermined number of decision characteristic variables with a greater contribution are selected from the sorted decision characteristic variables as the decision characteristic variable combinations to be optimized. Here, the predetermined number may be determined based on an actual application scenario or other suitable conditions. Furthermore, in other examples of the present disclosure, other suitable ways may also be employed to construct the decision feature variable combinations to be optimized.
With the decision optimization method of the present disclosure, for different users, the prediction result contribution of each decision characteristic variable in the prediction model is determined by using the user characteristic data of the user, a decision characteristic variable combination to be optimized is constructed based on the determined prediction results, and a variable value of each decision characteristic variable in the decision characteristic variable combination is determined using an optimization algorithm. In the decision optimization method, different prediction result contribution degrees of each decision characteristic variable can be obtained according to different user characteristic data, so that different decision characteristic variable combinations can be constructed, the optimized decision characteristic variable combinations are different according to different users, and further personalized decision optimization is realized.
By using the decision optimization method disclosed by the invention, the value range of the decision characteristic variable is properly adjusted according to the actual situation, and in the value range of the decision characteristic variable, the optimization processing is carried out on the variable value of each decision characteristic variable in the constructed decision characteristic variable combination, so that the optimization effect can be improved, and the decision optimization effect is further improved.
Fig. 4 shows a flow diagram of one example of an online switching process of a personalized decision mechanism, according to an embodiment of the disclosure.
As shown in fig. 4, after the optimized personalized decision making mechanism and engine are determined as above, when an online decision is made, the decision task is randomly allocated to the original decision making mechanism or the optimized personalized decision making mechanism for processing the incoming customer data to be decided, so as to make a decision, and record the corresponding decision making effect. After the operation is carried out for a period of time, the decision effect of the original decision mechanism or the optimized personalized decision mechanism is evaluated, if the decision effect of the original decision mechanism is better, the original decision mechanism is kept, and if the optimized personalized decision mechanism is better, the optimized personalized decision mechanism is used for replacing the original decision mechanism, so that the stability of decision service is ensured.
Fig. 5 shows a block diagram of a decision optimization apparatus 500 according to an embodiment of the present disclosure. As shown in fig. 5, the decision optimization apparatus 500 includes a contribution degree determination unit 510, a decision characteristic variable combination construction unit 520, an optimization processing unit 530, and a decision optimization unit 540.
The contribution degree determination unit 510 is configured to determine a prediction result contribution degree of each decision characteristic variable in a prediction model under the user characteristic data, the prediction model being created based on the decision target, the prediction model including the decision characteristic variable and the non-decision characteristic variable. The operation of the contribution degree determining unit 510 may refer to the operation of the block 110 described above with reference to fig. 1 and the operation described with reference to fig. 2.
The decision feature variable combination construction unit 520 is configured to construct a decision feature variable combination to be optimized based on the determined prediction result contribution degrees of the respective decision feature variables. The operation of the decision feature variable combination construction unit 520 may refer to the operation of the block 120 described above with reference to fig. 1 and the operation described with reference to fig. 3.
The optimization processing unit 530 is configured to perform optimization processing on the variable values of each decision characteristic variable in the constructed decision characteristic variable combination so as to optimize the corresponding prediction result of the prediction model. In one example of the present disclosure, the optimization processing unit 530 is configured to: and optimizing the variable values of the decision characteristic variables in the constructed decision characteristic variable combination by using one of the following optimization algorithms: particle swarm algorithm, genetic algorithm and annealing algorithm. In another example of the present disclosure, the optimization processing unit 530 is configured to: and in a preset decision variable value range, optimizing the variable value of each decision characteristic variable in the constructed decision characteristic variable combination. The operation of the optimization processing unit 530 may refer to the operation of block 130 described above with reference to fig. 1.
The decision optimization unit 540 is configured to perform decision optimization processing according to the variable values of the decision characteristic variables in the decision characteristic variable combination obtained after the optimization processing. The operation of the decision optimization unit 540 may refer to the operation of block 140 described above with reference to fig. 1.
In one example of the present disclosure, the contribution degree determining unit 510 may be configured to: and determining the prediction result contribution degree of each decision characteristic variable in the prediction model under the user characteristic data by using the interpretation model. The interpretation model may comprise one of the following interpretation models: the Shap value model, the LIME model, and the DeepLift model.
Fig. 6 shows a block diagram of one example of a decision feature variable combination construction unit 520 according to an embodiment of the present disclosure. As shown in fig. 6, the decision feature variable combination construction unit 520 includes a sorting module 521 and a feature selection module 523.
The ranking module 521 is configured to rank the respective decision characteristic variables based on the determined prediction result contribution degrees of the respective decision characteristic variables.
The feature selection module 523 is configured to select a predetermined number of decision feature variables with a larger contribution from the sorted decision feature variables as the decision feature variable combination to be optimized.
Embodiments of a decision optimization method and apparatus according to the present disclosure are described above with reference to fig. 1 to 6. The decision optimization device can be implemented by hardware, software, or a combination of hardware and software.
Fig. 7 illustrates a hardware block diagram of a computing device 700 for decision optimization, according to an embodiment of the disclosure. As shown in fig. 7, computing device 700 may include at least one processor 710, storage 720, memory 730, and communication interface 740, and at least one processor 710, storage 720, memory 730, and communication interface 740 are connected together via a bus 760. The at least one processor 710 executes at least one computer-readable instruction (i.e., the elements described above as being implemented in software) stored or encoded in memory.
In one embodiment, computer-executable instructions are stored in the memory that, when executed, cause the at least one processor 710 to: determining the contribution degree of a prediction result of each decision characteristic variable in a prediction model under user characteristic data, wherein the prediction model is established based on a decision target and comprises the decision characteristic variable and a non-decision characteristic variable; constructing a decision characteristic variable combination to be optimized based on the determined prediction result contribution of each decision characteristic variable; optimizing the variable values of each decision characteristic variable in the constructed decision characteristic variable combination so as to ensure that the corresponding prediction result of the prediction model is optimal; and performing decision optimization processing according to the variable values of the decision characteristic variables in the decision characteristic variable combination obtained after optimization processing.
It should be understood that the computer-executable instructions stored in the memory, when executed, cause the at least one processor 710 to perform the various operations and functions described above in connection with fig. 1-6 in the various embodiments of the present disclosure.
In the present disclosure, computing device 700 may include, but is not limited to: personal computers, server computers, workstations, desktop computers, laptop computers, notebook computers, mobile computing devices, smart phones, tablet computers, cellular phones, Personal Digital Assistants (PDAs), handheld devices, messaging devices, wearable computing devices, consumer electronics, and so forth.
According to one embodiment, a program product, such as a non-transitory machine-readable medium, is provided. A non-transitory machine-readable medium may have instructions (i.e., elements described above as being implemented in software) that, when executed by a machine, cause the machine to perform various operations and functions described above in connection with fig. 1-6 in various embodiments of the present disclosure. Specifically, a system or apparatus may be provided which is provided with a readable storage medium on which software program code implementing the functions of any of the above embodiments is stored, and causes a computer or processor of the system or apparatus to read out and execute instructions stored in the readable storage medium.
According to one embodiment, a program product, such as a non-transitory machine-readable medium, is provided. A non-transitory machine-readable medium may have instructions (i.e., elements described above as being implemented in software) that, when executed by a machine, cause the machine to perform various operations and functions described above in connection with fig. 1-6 in various embodiments of the present disclosure. Specifically, a system or apparatus may be provided which is provided with a readable storage medium on which software program code implementing the functions of any of the above embodiments is stored, and causes a computer or processor of the system or apparatus to read out and execute instructions stored in the readable storage medium.
In this case, the program code itself read from the readable medium can realize the functions of any of the above-described embodiments, and thus the machine-readable code and the readable storage medium storing the machine-readable code form part of the present invention.
Examples of the readable storage medium include floppy disks, hard disks, magneto-optical disks, optical disks (e.g., CD-ROMs, CD-R, CD-RWs, DVD-ROMs, DVD-RAMs, DVD-RWs), magnetic tapes, nonvolatile memory cards, and ROMs. Alternatively, the program code may be downloaded from a server computer or from the cloud via a communications network.
It will be understood by those skilled in the art that various changes and modifications may be made in the above-disclosed embodiments without departing from the spirit of the invention. Accordingly, the scope of the invention should be determined from the following claims.
It should be noted that not all steps and units in the above flows and system structure diagrams are necessary, and some steps or units may be omitted according to actual needs. The execution order of the steps is not fixed, and can be determined as required. The apparatus structures described in the above embodiments may be physical structures or logical structures, that is, some units may be implemented by the same physical entity, or some units may be implemented by a plurality of physical entities, or some units may be implemented by some components in a plurality of independent devices.
In the above embodiments, the hardware units or modules may be implemented mechanically or electrically. For example, a hardware unit, module or processor may comprise permanently dedicated circuitry or logic (such as a dedicated processor, FPGA or ASIC) to perform the corresponding operations. The hardware units or processors may also include programmable logic or circuitry (e.g., a general purpose processor or other programmable processor) that may be temporarily configured by software to perform the corresponding operations. The specific implementation (mechanical, or dedicated permanent, or temporarily set) may be determined based on cost and time considerations.
The detailed description set forth above in connection with the appended drawings describes exemplary embodiments but does not represent all embodiments that may be practiced or fall within the scope of the claims. The term "exemplary" used throughout this specification means "serving as an example, instance, or illustration," and does not mean "preferred" or "advantageous" over other embodiments. The detailed description includes specific details for the purpose of providing an understanding of the described technology. However, the techniques may be practiced without these specific details. In some instances, well-known structures and devices are shown in block diagram form in order to avoid obscuring the concepts of the described embodiments.
The previous description of the disclosure is provided to enable any person skilled in the art to make or use the disclosure. Various modifications to the disclosure will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other variations without departing from the scope of the disclosure. Thus, the disclosure is not intended to be limited to the examples and designs described herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.