WO2021121128A1 - Artificial intelligence-based sample evaluation method, apparatus, device, and storage medium - Google Patents

Artificial intelligence-based sample evaluation method, apparatus, device, and storage medium Download PDF

Info

Publication number
WO2021121128A1
WO2021121128A1 PCT/CN2020/135339 CN2020135339W WO2021121128A1 WO 2021121128 A1 WO2021121128 A1 WO 2021121128A1 CN 2020135339 W CN2020135339 W CN 2020135339W WO 2021121128 A1 WO2021121128 A1 WO 2021121128A1
Authority
WO
WIPO (PCT)
Prior art keywords
training
evaluated
model
samples
sample
Prior art date
Application number
PCT/CN2020/135339
Other languages
French (fr)
Chinese (zh)
Inventor
林春伟
刘莉红
刘玉宇
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2021121128A1 publication Critical patent/WO2021121128A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning

Definitions

  • This application relates to the field of artificial intelligence, and in particular to a sample evaluation method, device, computer equipment, and storage medium based on artificial intelligence.
  • the embodiments of the present application provide an artificial intelligence-based sample evaluation method, device, computer equipment, and storage medium to solve the problem of the inability to explain the impact of the training sample on the output result of the training model to be evaluated.
  • a sample evaluation method based on artificial intelligence including:
  • the training data set including N test training samples, where N is a positive integer;
  • An artificial intelligence-based sample evaluation device including:
  • a data acquisition module for acquiring a training data set includes N test training samples, where N is a positive integer;
  • the sample training module is configured to train the N test training samples by using the training model to be evaluated, and obtain the sample loss function corresponding to the training model to be evaluated;
  • the sample detection module is used to detect the N test training samples and select K target training samples, where K is a positive integer;
  • the first influence function module is configured to input NK test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the first loss function corresponding to the sample loss function.
  • the second influence function module is used to change the sample characteristics of the K target training samples, obtain K updated characteristic samples, and divide the K updated characteristic samples and the training data set by the K target training samples NK outside test training samples are input into the training model to be evaluated for training, and a second influence function corresponding to the sample loss function is obtained;
  • the result acquisition module is configured to acquire the sample influence results of the K target training samples on the training model to be evaluated based on the first influence function and the second influence function.
  • a computer device includes a memory, a processor, and computer-readable instructions that are stored in the memory and can run on the processor, and the processor implements the following steps when the processor executes the computer-readable instructions:
  • the training data set including N test training samples, where N is a positive integer;
  • One or more readable storage media storing computer readable instructions
  • the computer readable storage medium storing computer readable instructions
  • the one Or multiple processors perform the following steps:
  • the training data set including N test training samples, where N is a positive integer;
  • the server compares the output predicted value of the test training sample after training on the training model to be evaluated with the actual value corresponding to the N test training samples to obtain the corresponding
  • the sample loss function for the next step is to analyze the reasons that affect the output prediction value of the training model to be evaluated; through the selected K target training samples to further test the impact of the output prediction value of the training model to be evaluated, target training
  • the analysis or evaluation of the influence of the output result of the training model to be evaluated from the angle of the sample is helpful for subsequent optimization and improvement of the training model to be evaluated.
  • the server calculates the first influencing function and the second influencing function, while maintaining a relatively high fitting accuracy, so that the training model to be evaluated that cannot be derivable for calculus can be calculated at a lower computational cost.
  • the first influence function and the second influence function obtain K target training samples' sample influence results of the training model to be evaluated. By analyzing the sample influence results, the influence of the target training samples to the training model to be evaluated is obtained, and the target training samples are realized The interpretability of the impact of the output results of the training model is helpful for subsequent optimization and improvement of the training model to be evaluated.
  • FIG. 1 is a schematic diagram of an application environment of a sample evaluation method based on artificial intelligence in an embodiment of the present application
  • FIG. 2 is a flowchart of a sample evaluation method based on artificial intelligence in an embodiment of the present application
  • FIG. 3 is another flowchart of a sample evaluation method based on artificial intelligence in an embodiment of the present application
  • FIG. 4 is another flowchart of a sample evaluation method based on artificial intelligence in an embodiment of the present application
  • FIG. 5 is another flowchart of a sample evaluation method based on artificial intelligence in an embodiment of the present application
  • FIG. 6 is another flowchart of a sample evaluation method based on artificial intelligence in an embodiment of the present application.
  • FIG. 7 is a schematic diagram of a sample evaluation device based on artificial intelligence in an embodiment of the present application.
  • Fig. 8 is a schematic diagram of a computer device in an embodiment of the present application.
  • the artificial intelligence-based sample evaluation method can be applied to the application environment shown in FIG.
  • the artificial intelligence-based sample evaluation method is applied to a sample evaluation system, which includes a client and a server as shown in FIG. Intelligent sample evaluation method.
  • the client is also called the client, which refers to the program that corresponds to the server and provides local services to the client.
  • the client can be installed on, but not limited to, various personal computers, notebook computers, smart phones, tablet computers, and portable wearable devices.
  • the server can be implemented as an independent server or a server cluster composed of multiple servers.
  • the server compares the output prediction value of the test training sample after training on the training model to be evaluated with the actual value corresponding to the N test training samples, and obtains the corresponding sample loss function, so that the next step can affect the output prediction of the training model to be evaluated Analyze the cause of the value; by selecting K target training samples to further test the impact of the output prediction value of the training model to be evaluated, the angle of the target training sample can be analyzed or evaluated for the impact of the output result of the training model to be evaluated. Helps subsequent optimization and improvement of the training model to be evaluated. Further, the server calculates the first influencing function and the second influencing function, while maintaining a relatively high fitting accuracy, so that the training model to be evaluated that cannot be derivable for calculus can be calculated at a lower computational cost.
  • the first influence function and the second influence function obtain K target training samples' sample influence results of the training model to be evaluated. By analyzing the sample influence results, the influence of the target training samples to the training model to be evaluated is obtained, and the target training samples are realized. The interpretability of the impact of the output results of the training model is helpful for subsequent optimization and improvement of the training model to be evaluated.
  • FIG. 2 a sample evaluation method based on artificial intelligence is provided.
  • the application of the method to the server in FIG. 1 is taken as an example for description, including the following steps:
  • the training data set includes N test training samples, where N is a positive integer.
  • the training data set is a user-defined set for storing test training samples.
  • the training data set stores N test training samples for analysis based on the N test training samples.
  • the test training sample includes the training data and the label corresponding to the training data.
  • the test training sample may be a car damage training sample, where a car damage training sample specifically includes a car damage image and a label corresponding to the car damage image. In this case, the car damage image is training data.
  • S20 Use the training model to be evaluated to train N test training samples, and obtain a sample loss function corresponding to the training model to be evaluated.
  • the training model to be evaluated is a model that needs to be evaluated and analyzed, and specifically may be a deep learning model for training a test training sample.
  • the training model to be evaluated includes but is not limited to the Faster RCNN model or the SS D model.
  • the sample loss function is a function that calculates the difference between the output predicted value of the training model to be evaluated and the actual value of the test training sample.
  • the output prediction value is the value obtained after the training model to be evaluated trains the test training sample.
  • the actual value is the actual value corresponding to the test training sample, and the actual value here can be understood as the label of the training data.
  • the actual value corresponding to the test training sample is A
  • the output prediction value obtained by the training model to be evaluated on the test training sample is B
  • the sample loss function is a function that measures the difference between A and B
  • the server inputs N test training samples to the training model to be evaluated for training. After the training model to be evaluated trains the N test training samples, the output prediction values corresponding to the N test training samples are obtained.
  • the server compares the N output predicted values with the actual values corresponding to the N test training samples, obtains the sample loss values corresponding to the N test training samples, and then builds the sample loss function based on the N sample loss values for the next step The reasons that affect the output prediction value of the training model to be evaluated are analyzed.
  • S30 Detect N test training samples, and select K target training samples, where K is a positive integer.
  • the target training sample is a sample used to test the influence of the output prediction value of the training model to be evaluated.
  • the server detects N test training samples, and selects K target training samples from the training data set.
  • selecting the target training samples may be randomly selecting K target training samples from the training data set, or selecting K target training samples determined by screening N test training samples according to a preset selection method.
  • the preset selection method is a user-defined selection method, which is used to select the target training sample.
  • the server further tests the influence of the output prediction value of the training model to be evaluated by selecting K target training samples, which can improve the efficiency of analyzing the influence of the target training sample on the output prediction value of the training model to be evaluated.
  • S40 Input the N-K test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the first influence function corresponding to the sample loss function.
  • the first influence function is a function for calculating the influence of N-K test training samples on the output prediction value of the training model to be evaluated.
  • the server passes the selected K target training samples, and then inputs the remaining NK test training samples in the training data set into the training model to be evaluated for testing.
  • the server calculates the output prediction value of the training model to be evaluated and the sample loss function to obtain the first influence function, so as to follow the output of the training model to be evaluated according to the first influence function
  • the predicted value is analyzed, and the sample impact result is obtained to realize the analysis of the training samples that affect the output predicted value of the training model to be evaluated, which is helpful for optimizing and improving the training model to be evaluated from the perspective of training samples.
  • S50 Perform sample feature changes on K target training samples, obtain K updated feature samples, and input K updated feature samples and NK test training samples in the training data set except for K target training samples into the training model to be evaluated for training , Obtain the second influence function corresponding to the sample loss function.
  • the second influence function is a function for calculating the influence of K update feature samples and N-K test training samples on the output prediction value of the training model to be evaluated.
  • the server obtains K updated feature samples by changing the sample characteristics of the target training sample, and trains the remaining NK test samples in the training data set.
  • the samples and K updated feature samples are input to the training model to be evaluated for testing, and the second model parameters are obtained according to the output predicted value of the training model to be evaluated.
  • the server obtains the second influence function by calculating the second model parameters and the sample loss function, so as to subsequently analyze the output prediction value of the training model to be evaluated according to the second influence function, and obtain the influence of the sample influence result. Realize the analysis of the reasons that affect the output prediction value of the training model to be evaluated.
  • sample feature changes are performed on the sample features ⁇ of K target training samples to obtain K updated feature samples whose sample features are ⁇ .
  • the sample influence result is the result of evaluating or analyzing the influence of the output prediction value of the training model to be evaluated.
  • the server performs comprehensive analysis and processing on the first influence function and the second influence function based on the preset processing logic, and obtains the sample influence results of the K target training samples of the training model to be evaluated.
  • the preset processing logic is to perform weighting or difference processing on the first influencing function and the second influencing function. That is, after acquiring the first influence function and the second influence function, the server can analyze the sample influence result of the output prediction value of the training model to be evaluated by the target training sample through the first influence function and the second influence function. The server obtains the impact of the target training sample on the training model to be evaluated by analyzing the sample impact results, and realizes the evaluation or analysis of the impact of the output predicted value of the training model to be evaluated.
  • the server compares the output predicted value of the test training sample after the training model to be evaluated is trained with the actual value corresponding to the N test training samples, and obtains the corresponding sample loss function, so that the next step can affect the expected value. Analyze the reason for the output prediction value of the evaluation training model; further test the impact of the output prediction value of the evaluation training model by selecting K target training samples, and the impact of the target training sample angle on the output prediction value of the evaluation training model Analysis or evaluation is helpful for subsequent optimization and improvement of the training model to be evaluated. Further, the server calculates the first influencing function and the second influencing function, while maintaining a relatively high fitting accuracy, so that the training model to be evaluated that cannot be derivable for calculus can be calculated at a lower computational cost.
  • the first influence function and the second influence function obtain K target training samples' sample influence results of the training model to be evaluated. By analyzing the sample influence results, the influence of the target training samples to the training model to be evaluated is obtained, and the target training samples are realized. The interpretability of the impact of the output results of the training model is helpful for subsequent optimization and improvement of the training model to be evaluated.
  • step S20 that is, using the training model to be evaluated to train N test training samples to obtain the sample loss function corresponding to the training model to be evaluated includes:
  • S21 Use the training model to be evaluated to train N test training samples, and obtain output prediction values corresponding to the N test training samples.
  • the server trains N test training samples through the training model to be evaluated, and obtains the output prediction value of the training model to be evaluated. Understandably, the server can further calculate the sample loss function of the difference between the output prediction value of the training model to be evaluated and the actual value of the test training sample through the N test training samples and the predicted output value of the training model to be evaluated.
  • the server obtains N sample loss values by calculating the actual values of N test training samples and the output prediction values of the training model to be evaluated, and obtains the corresponding sample loss function based on the N sample loss values for subsequent
  • the sample loss function is used to analyze the reasons that affect the output prediction value of the training model to be evaluated.
  • X is the input space car damage image
  • Y is the output space such as the label corresponding to the car damage image
  • Z i (X i ,Y i ) ⁇ X ⁇ Y.
  • the server obtains the test training sample after the training model to be evaluated is trained, and obtains the sample loss function used to calculate the difference between the output prediction value of the training model to be evaluated and the actual value corresponding to the test training sample, through
  • the sample loss function further analyzes the reasons that affect the output prediction value of the training model to be evaluated to ensure the accuracy and validity of the analysis result.
  • step S30 that is, detecting N test training samples and selecting K target training samples includes:
  • S31 Obtain current sample parameters corresponding to N test training samples, and determine whether the current sample parameters meet the screening parameter threshold.
  • the current sample parameter is the data parameter in the test training sample.
  • the filter parameter threshold is a value set by the user and is used to filter the current sample parameters.
  • the server After the server obtains the current sample parameters corresponding to the N test training samples, it judges the current sample parameters and judges whether the current sample parameters are sufficient to meet the screening parameter threshold, so as to filter the test training samples by the screening parameter threshold, so that K target training samples that meet the screening parameter threshold are selected from N test training samples, and the K target training samples are used to analyze the reasons that affect the output prediction value of the training model to be evaluated, so that the subsequent K target training samples are to be evaluated.
  • the samples of the training model affect the results, and the corresponding training samples are updated to improve the accuracy of the training model to be evaluated.
  • the test training samples are car damage training samples
  • the server obtains the current sample parameters corresponding to the car damage images in each car damage training sample in the training data set.
  • the current sample parameters can be the image resolution size and the image level resolution. At least one of the evaluation features of the image rate, image vertical resolution, image brightness, and contrast.
  • the screening parameter threshold corresponding to each evaluation feature is set to X. If the current sample parameter obtained is Y, and X ⁇ Y, the current sample parameter Determined as the target training sample.
  • test training sample is determined as the target training sample.
  • the conditions for filtering parameter thresholds can be user-defined thresholds.
  • determining that the current sample parameter satisfies the screening parameter threshold is a screening condition for screening K target training samples from N test training samples.
  • the current sample parameter may be greater than or equal to the screening parameter threshold.
  • the test training sample corresponding to the current sample parameter that meets the screening parameter threshold is determined as the target training sample.
  • the server screens the test training samples through the screening parameter threshold, so as to screen out K target training samples that meet the screening parameter threshold from the N test training samples, and use K target training samples to influence
  • the reason for the output prediction value of the training model to be evaluated is analyzed, so that the subsequent K target training samples affect the results of the sample of the training model to be evaluated, and the corresponding training samples are updated to improve the accuracy of the training model to be evaluated.
  • step S40 NK test training samples in the training data set except the K target training samples are input into the training model to be evaluated for training, and the first training model corresponding to the sample loss function is obtained.
  • An influence function including:
  • S41 Input the N-K test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the first change weight of the training model to be evaluated.
  • the first change weight is the weight of each model parameter in the training model to be evaluated after training the training model to be evaluated using N-K test training samples in addition to the K target training samples.
  • the server inputs N-K test training samples in the training data set excluding the K target training samples into the training model to be evaluated for training, obtains the empirical risk based on the sample loss function, and obtains the first change weight based on the empirical risk.
  • the empirical risk is the average value obtained by accumulating the target training sample through the sample loss function. It is understandable that after the K target training samples are eliminated, NK test training samples other than the K target training samples are trained in the training model to be evaluated, and the weights of the model parameters are changed accordingly, which is obtained through empirical risk
  • the first change weight can further obtain the change of the sample weight of the target training sample to analyze the reasons that affect the output prediction value of the training model to be evaluated.
  • S42 Acquire the first model parameter of the training model to be evaluated according to the initial model parameters and the first change weight corresponding to the training model to be evaluated.
  • the initial model parameter is the initial parameter that minimizes the difference in the calculation of the sample loss function, and is a parameter obtained based on empirical risk.
  • the first model parameter is a test parameter that minimizes the calculation difference between the sample loss function and the N-K test training samples.
  • the empirical risk is specifically
  • the initial model parameters are obtained by calculating the empirical risk among them, Is the initial model parameter, ⁇ is the collection of all models in the database, ⁇ is the test model to be evaluated, and L(Z i , ⁇ ) is the sample loss function.
  • S43 Obtain a first influence function corresponding to the sample loss function based on the first model parameter and the sample loss function corresponding to the training model to be evaluated.
  • the server deletes the selected K target training samples, and then inputs the remaining NK test training samples in the training data set into the training model to be evaluated for testing.
  • the initial model parameters and the first change weight corresponding to the training model are updated to obtain the first model parameters.
  • the server obtains the first influence function through calculations based on the amount of change between the initial model parameters and the first model parameters and the sample loss function, and analyzes the output predicted value of the training model to be evaluated through the first influence function, and obtains the sample influence The influence of the result to realize the analysis of the reason that influences the output prediction value of the training model to be evaluated.
  • the server obtains the change amount J up,params (Z) between the initial model parameter and the first model parameter, which can realize that without testing the training samples, it can evaluate the first model parameter after removing certain K target training samples. Impact.
  • the server calculates the first model parameters and the sample loss function to maintain a relatively good fitting accuracy, so that the training model to be evaluated that cannot be derived and calculus can be evaluated at a lower computational cost. Calculate, obtain the first influence function, analyze the output prediction value of the training model to be evaluated through the first influence function, and obtain the influence of the sample on the result, so as to realize the analysis of the reasons that affect the output prediction value of the training model to be evaluated, and improve the The efficiency of the artificial intelligence sample evaluation method.
  • step S43 is to obtain the first influence function corresponding to the sample loss function based on the first model parameter and the sample loss function corresponding to the training model to be evaluated, including: based on the Hesse vector product and the preset number of iterations, The first model parameter and the sample loss function corresponding to the training model to be evaluated are processed, and the first influence function corresponding to the sample loss function is obtained.
  • the Hesse vector product is a method used to calculate the first influential function and the second influential function.
  • the preset number of iterations is the number of iterative calculations on the first influencing function and the second influencing function set by the user.
  • this embodiment uses the Hessel vector product (H To avoid directly calculating the inverse of the Hessian matrix, by efficiently estimating the Hessian vector product (H To calculate the first influencing function J up,loss (Z,Z test ). and The calculation of, can be achieved by stochastic estimation.
  • the random parameter estimation method only needs to sample one sample point in each iteration, so it can greatly increase the calculation speed and reduce the computing resources; at the same time middle And use Means Taylor's estimate of the first j items: From the nature of Taylor expansion, we know that when j ⁇ , therefore Unbiased estimate of Still have Here, the embodiment is based on To calculate the first influencing function J up,loss (Z,Z test ).
  • the first influence function corresponding to the selected K target training samples is iteratively calculated according to the preset number of iterations Determine the result of the last iteration calculation as the first influence function
  • the server calculates the first influence function through the Hessian vector product to improve the calculation efficiency, so as to improve the efficiency of the server to evaluate or analyze the influence of the output prediction value of the training model to be evaluated through the sample influence result. And through the high-efficiency calculation of the Hessian vector product, the first influence function is obtained to realize the evaluation or analysis of the influence of the output prediction value of the training model to be evaluated.
  • step S50 the K update feature samples and NK test training samples in the training data set except the K target training samples are input into the training model to be evaluated for training, and the samples are obtained
  • the second influence function corresponding to the loss function includes:
  • S51 Input the K update feature samples and N-K test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the second change weight corresponding to the training model to be evaluated.
  • the second change weight is that after the sample feature of the target training sample is changed, the weight of the feature sample is updated. Since the sample features of the K target training samples are changed, the weights of the K updated feature samples in the training model to be evaluated are changed accordingly, and the second change weight can further obtain the change of the sample weight and affect the training model to be evaluated The reason for the output predicted value is analyzed.
  • the server can further obtain the sample weight change through the second change weight and affect the output of the training model to be evaluated.
  • the reason for the predicted value is analyzed.
  • S52 Obtain a second model parameter corresponding to the training model to be evaluated according to the initial model parameter and the second change weight corresponding to the training model to be evaluated.
  • the second model parameter is a test parameter that minimizes the calculation difference between the K update feature samples and the N-K test training samples of the sample loss function.
  • the server calculates the initial, initial model parameters and the second change weight, and obtains the training model to be evaluated.
  • S53 Obtain a second influence function corresponding to the sample loss function according to the second model parameter and the sample loss function corresponding to the training model to be evaluated.
  • the server changes the sample characteristics of the target training sample, based on the second change weight, obtains the second model parameters corresponding to the training model to be evaluated, and calculates the second model parameters and the sample loss function to obtain the second impact function.
  • the server first calculates the second model parameters based on the second change weight to obtain the initial model The influence of type parameters. Further, the second influence function is used to update the feature by the sample loss function of the test training sample
  • the server can further obtain the change in sample weight through the second change weight to analyze the reasons that affect the output prediction value of the training model to be evaluated; then, use the second change weight to obtain the corresponding training model to be evaluated.
  • the second model parameter is to calculate the second model parameter and the sample loss function, while maintaining a better fitting accuracy, so that the training model to be evaluated that cannot be derived can be calculated and obtained at a lower computational cost.
  • the influence function so that the server can evaluate or analyze the influence of the output prediction value of the training model to be evaluated through the second influence function.
  • step S53 is to obtain the second influence function corresponding to the sample loss function based on the second model parameter and the sample loss function corresponding to the training model to be evaluated, including: based on the Hessian vector product and the preset iteration The number of times, the second model parameter and the sample loss function corresponding to the training model to be evaluated, and the second influence function corresponding to the sample loss function is obtained.
  • the corresponding second influence function is iteratively calculated according to the preset number of iterations Determine the result of the last iteration calculation as the second influence function
  • the server calculates the second influence function through the Hessian vector product to improve the calculation efficiency, so as to improve the efficiency of the server to evaluate or analyze the influence of the output prediction value of the training model to be evaluated through the sample influence result. And through the high-efficiency calculation of the Hessian vector product, the second influence function is obtained to realize the evaluation or analysis of the influence of the output prediction value of the training model to be evaluated.
  • an artificial intelligence-based sample evaluation device corresponds to the artificial intelligence-based sample evaluation method in the above-mentioned embodiment in a one-to-one correspondence.
  • the artificial intelligence-based sample evaluation device includes a data acquisition module 10, a sample training module 20, a sample detection module 30, a first influence function module 40, a second influence function module 50 and a result acquisition module 60.
  • the detailed description of each functional module is as follows:
  • the data acquisition module 10 is used to acquire a training data set, the training data set includes N test training samples, where N is a positive integer;
  • the sample training module 20 is used to train N test training samples using the training model to be evaluated, and obtain the sample loss function corresponding to the training model to be evaluated;
  • the sample detection module 30 is used to detect N test training samples and select K target training samples, where K is a positive integer;
  • the first influence function module 40 is configured to input N-K test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the first influence function corresponding to the sample loss function;
  • the second influence function module 50 is used to change the sample characteristics of the K target training samples, obtain K updated feature samples, and combine the K updated feature samples and the training data set with NK test training samples other than the K target training samples Input the training model to be evaluated for training, and obtain the second influence function corresponding to the sample loss function;
  • the result acquisition module 60 is configured to acquire K target training samples based on the first influence function and the second influence function, and sample influence results of the training model to be evaluated.
  • sample training module 20 includes:
  • the prediction value acquisition sub-module is used to train N test training samples using the training model to be evaluated, and obtain the output prediction values corresponding to the N test training samples;
  • the loss function sub-module is used to obtain the sample loss function based on the test training sample and the output prediction value.
  • sample detection module 30 includes:
  • the threshold judgment sub-module is used to obtain the current sample parameters corresponding to the N test training samples, and judge whether the current sample parameters meet the screening parameter threshold;
  • the sample determination sub-module is used to determine the test training sample as the target training sample when the current sample parameter meets the screening parameter threshold.
  • the first influencing function module 40 includes:
  • the first weight sub-module is used to input N-K test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the first change weight of the training model to be evaluated;
  • the first parameter sub-module is used to obtain the first model parameter of the training model to be evaluated according to the initial model parameters and the first change weight corresponding to the training model to be evaluated;
  • the first function sub-module is used to obtain the first influence function corresponding to the sample loss function based on the first model parameter and the sample loss function corresponding to the training model to be evaluated.
  • the first influencing function module 40 also includes:
  • the first parameter processing sub-module is used to process the first model parameter and sample loss function corresponding to the training model to be evaluated based on the Hessian vector product and the preset number of iterations to obtain the first influence function corresponding to the sample loss function.
  • the second influence function module 50 includes:
  • the second weight sub-module is used to input the K update feature samples and NK test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the second change weight corresponding to the training model to be evaluated ;
  • the second parameter sub-module is used to obtain the second model parameter corresponding to the training model to be evaluated according to the initial model parameter and the second change weight corresponding to the training model to be evaluated;
  • the second function sub-module is used to obtain the second influence function corresponding to the sample loss function based on the second model parameter and the sample loss function corresponding to the training model to be evaluated.
  • the second influencing function module 50 further includes:
  • the second parameter processing sub-module is used to obtain the second influence function corresponding to the sample loss function based on the Hessian vector product and the preset number of iterations, the second model parameter and the sample loss function corresponding to the training model to be evaluated.
  • Each module in the above-mentioned artificial intelligence-based sample evaluation device can be implemented in whole or in part by software, hardware, and a combination thereof.
  • the above-mentioned modules may be embedded in the form of hardware or independent of the processor in the computer equipment, or may be stored in the memory of the computer equipment in the form of software, so that the processor can call and execute the operations corresponding to the above-mentioned modules.
  • a computer device is provided.
  • the computer device may be a server, and its internal structure diagram may be as shown in FIG. 8.
  • the computer equipment includes a processor, a memory, a network interface, and a database connected through a system bus. Among them, the processor of the computer device is used to provide calculation and control capabilities.
  • the memory of the computer device includes a non-volatile storage medium and an internal memory.
  • the non-volatile storage medium stores an operating system, computer readable instructions, and a database.
  • the internal memory provides an environment for the operation of the operating system and computer-readable instructions in the non-volatile storage medium.
  • the computer equipment database is used for sample evaluation.
  • the network interface of the computer device is used to communicate with an external terminal through a network connection.
  • the computer-readable instructions are executed by the processor to realize an artificial intelligence-based sample evaluation method.
  • one or more readable storage media storing computer readable instructions are provided, the computer readable storage medium storing computer readable instructions, and the computer readable instructions are processed by one or more
  • the processor executes the one or more processors execute the computer-readable instructions to implement the artificial intelligence-based sample evaluation method in the foregoing embodiment, such as steps S10 to S60.
  • the processor executes the computer-readable instructions the functions of the modules/units in the embodiment of the artificial intelligence-based sample evaluation device, such as modules 10 to 60, are implemented.
  • the readable storage medium in this embodiment includes a non-volatile readable storage medium and a volatile readable storage medium.
  • a computer-readable storage medium is provided, and computer-readable instructions are stored on the computer-readable storage medium.
  • the computer-readable instructions are executed by a processor, the artificial intelligence-based sample evaluation method in the above-mentioned embodiment is implemented. For example, step S10 to step S60, in order to avoid repetition, the details are not repeated here.
  • the computer-readable instruction is executed by the processor, the function of each module/unit in the embodiment of the artificial intelligence-based sample evaluation device, such as module 10 to module 60, is realized. In order to avoid repetition, details are not described herein again.
  • the computer-readable instructions can be stored in a non-volatile computer.
  • the computer readable instruction may be stored in a non-volatile readable storage medium or may be stored in a volatile readable storage medium.
  • the computer readable instruction When executed, it may include The flow of the embodiment of each method.
  • any reference to memory, storage, database, or other media used in the embodiments provided in this application may include non-volatile and/or volatile memory.
  • Non-volatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory.
  • Volatile memory may include random access memory (RAM) or external cache memory.
  • RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Channel (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Medical Informatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

An artificial intelligence-based sample evaluation method, an apparatus, a computer device, and a storage medium. The method comprises: obtaining a training data set; using a training model to be evaluated to train N test training samples so as to obtain the sample loss function corresponding to the training model to be evaluated; selecting K target training samples; inputting the N-K test training samples other than the K target training samples in the training data set into the training model to be evaluated for training, so as to obtain a first influence function; inputting K feature updated samples and the N-K test training samples other than the K target training samples in the training data set into the training model to be evaluated for training, so as obtain a second influence function; obtaining the sample influence result on the basis of the first influence function and the second influence function. The present invention allows for interpretability of the influence of a target training sample on the output result of a training model to be evaluated, thereby facilitating ensuing optimization and improvement of the training model to be evaluated.

Description

基于人工智能的样本评估方法、装置、设备及存储介质Artificial intelligence-based sample evaluation method, device, equipment and storage medium
本申请以2020年06月08日提交的申请号为202010514014.0,名称为“基于人工智能的样本评估方法、装置、计算机设备及存储介质”的中国发明申请为基础,并要求其优先权。This application is based on the Chinese invention application with the application number 202010514014.0 filed on June 8, 2020 and titled "Artificial Intelligence-based Sample Evaluation Method, Apparatus, Computer Equipment and Storage Medium", and claims its priority.
技术领域Technical field
本申请涉及人工智能领域,尤其涉及一种基于人工智能的样本评估方法、装置、计算机设备及存储介质。This application relates to the field of artificial intelligence, and in particular to a sample evaluation method, device, computer equipment, and storage medium based on artificial intelligence.
背景技术Background technique
在人工智能领域中,待评估训练模型的预测性能是一项重要的性能指标。但是待评估训练模型输出的样本影响结果的可解释性同样是一项重要的性能指标。因为通过理解待评估训练模型输出样本影响结果的原因,可以直接改变影响待评估训练模型输出样本影响结果的因素,来提升待评估训练模型的性能,同时也能向用户提供样本影响结果的解释,这在业务涉及用户敏感信息时显得尤为重要。In the field of artificial intelligence, the predictive performance of the training model to be evaluated is an important performance indicator. However, the sample output of the training model to be evaluated affects the interpretability of the result is also an important performance indicator. Because by understanding the reasons why the output samples of the training model to be evaluated affect the results, the factors that affect the output samples of the training model to be evaluated affect the results can be directly changed to improve the performance of the training model to be evaluated, and at the same time, it can also provide users with an explanation of the sample’s impact on the results. This is particularly important when the business involves sensitive user information.
然而现有许多领域中的待评估训练模型,例如用于图像和语音识别的深度神经网络模型,是一种复杂的黑箱模型,难以对输出的样本影响结果作出解释。现有技术主要着重于理解固定的待评估训练模型是如何与特定的样本影响结果对应,例如通过在测试数据点周围局部拟合更简单的待评估训练模型或通过对测试数据添加干扰来观测输出的样本影响结果。发明人意识到,现有技术只是从待评估训练模型的角度解释了待评估训练模型输出的样本影响结果,但是没有从训练样本角度对待评估训练模型输出结果的影响,不利于对待评估训练样本进行后续优化改进。However, existing training models to be evaluated in many fields, such as deep neural network models for image and speech recognition, are a complex black box model, and it is difficult to explain the impact of the output samples on the results. The prior art mainly focuses on understanding how the fixed training model to be evaluated corresponds to a specific sample impact result, for example, by locally fitting a simpler training model to be evaluated around the test data points or observing the output by adding interference to the test data The sample affects the results. The inventor realizes that the prior art only explains the impact of the sample output of the training model to be evaluated from the perspective of the training model to be evaluated, but does not have the impact of the output result of the training model to be evaluated from the perspective of the training sample, which is not conducive to the evaluation of the training sample. Follow-up optimization and improvement.
发明内容Summary of the invention
本申请实施例提供一种基于人工智能的样本评估方法、装置、计算机设备及存储介质,以解决无法解释训练样本对待评估训练模型输出结果的影响的问题。The embodiments of the present application provide an artificial intelligence-based sample evaluation method, device, computer equipment, and storage medium to solve the problem of the inability to explain the impact of the training sample on the output result of the training model to be evaluated.
一种基于人工智能的样本评估方法,包括:A sample evaluation method based on artificial intelligence, including:
获取训练数据集,所述训练数据集包括N个测试训练样本,其中,N为正整数;Acquiring a training data set, the training data set including N test training samples, where N is a positive integer;
采用待评估训练模型对N个所述测试训练样本进行训练,获取所述待评估训练模型对应的样本损失函数;Training the N test training samples with the training model to be evaluated, and obtain the sample loss function corresponding to the training model to be evaluated;
对N个所述测试训练样本进行检测,选取K个目标训练样本,其中,K为正整数;Perform detection on the N test training samples, and select K target training samples, where K is a positive integer;
将所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第一影响函数;Inputting N-K test training samples in the training data set excluding the K target training samples into the training model to be evaluated for training, and obtaining a first influence function corresponding to the sample loss function;
对K个所述目标训练样本进行样本特征变更,获取K个更新特征样本,将K个所述更新特征样本和所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第二影响函数;Perform sample feature changes on the K target training samples, obtain K updated feature samples, and input the K updated feature samples and NK test training samples in the training data set except the K target training samples Training the training model to be evaluated to obtain a second influence function corresponding to the sample loss function;
基于所述第一影响函数和所述第二影响函数,获取K个所述目标训练样本对所述待评估训练模型的样本影响结果。Based on the first influence function and the second influence function, obtain sample influence results of the K target training samples on the training model to be evaluated.
一种基于人工智能的样本评估装置,包括:An artificial intelligence-based sample evaluation device, including:
数据获取模块,用于获取训练数据集,所述训练数据集包括N个测试训练样本,其中,N为正整数;A data acquisition module for acquiring a training data set, the training data set includes N test training samples, where N is a positive integer;
样本训练模块,用于采用待评估训练模型对N个所述测试训练样本进行训练,获取所述待评估训练模型对应的样本损失函数;The sample training module is configured to train the N test training samples by using the training model to be evaluated, and obtain the sample loss function corresponding to the training model to be evaluated;
样本检测模块,用于对N个所述测试训练样本进行检测,选取K个目标训练样本,其中,K为正整数;The sample detection module is used to detect the N test training samples and select K target training samples, where K is a positive integer;
第一影响函数模块,用于将所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第一影响函数;The first influence function module is configured to input NK test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the first loss function corresponding to the sample loss function. An influence function;
第二影响函数模块,用于对K个所述目标训练样本进行样本特征变更,获取K个更新特征样本,将K个所述更新特征样本和所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第二影响函数;The second influence function module is used to change the sample characteristics of the K target training samples, obtain K updated characteristic samples, and divide the K updated characteristic samples and the training data set by the K target training samples NK outside test training samples are input into the training model to be evaluated for training, and a second influence function corresponding to the sample loss function is obtained;
结果获取模块,用于基于所述第一影响函数和所述第二影响函数,获取K个所述目标训练样本对所述待评估训练模型的样本影响结果。The result acquisition module is configured to acquire the sample influence results of the K target training samples on the training model to be evaluated based on the first influence function and the second influence function.
一种计算机设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时实现如下步骤:A computer device includes a memory, a processor, and computer-readable instructions that are stored in the memory and can run on the processor, and the processor implements the following steps when the processor executes the computer-readable instructions:
获取训练数据集,所述训练数据集包括N个测试训练样本,其中,N为正整数;Acquiring a training data set, the training data set including N test training samples, where N is a positive integer;
采用待评估训练模型对N个所述测试训练样本进行训练,获取所述待评估训练模型对应的样本损失函数;Training the N test training samples with the training model to be evaluated, and obtain the sample loss function corresponding to the training model to be evaluated;
对N个所述测试训练样本进行检测,选取K个目标训练样本,其中,K为正整数;Perform detection on the N test training samples, and select K target training samples, where K is a positive integer;
将所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第一影响函数;Inputting N-K test training samples in the training data set excluding the K target training samples into the training model to be evaluated for training, and obtaining a first influence function corresponding to the sample loss function;
对K个所述目标训练样本进行样本特征变更,获取K个更新特征样本,将K个所述更新特征样本和所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第二影响函数;Perform sample feature changes on the K target training samples, obtain K updated feature samples, and input the K updated feature samples and NK test training samples in the training data set except the K target training samples Training the training model to be evaluated to obtain a second influence function corresponding to the sample loss function;
基于所述第一影响函数和所述第二影响函数,获取K个所述目标训练样本对所述待评估训练模型的样本影响结果。Based on the first influence function and the second influence function, obtain sample influence results of the K target training samples on the training model to be evaluated.
一个或多个存储有计算机可读指令的可读存储介质,所述计算机可读存储介质存储有计算机可读指令,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行如下步骤:One or more readable storage media storing computer readable instructions, the computer readable storage medium storing computer readable instructions, and when the computer readable instructions are executed by one or more processors, the one Or multiple processors perform the following steps:
获取训练数据集,所述训练数据集包括N个测试训练样本,其中,N为正整数;Acquiring a training data set, the training data set including N test training samples, where N is a positive integer;
采用待评估训练模型对N个所述测试训练样本进行训练,获取所述待评估训练模型对应的样本损失函数;Training the N test training samples with the training model to be evaluated, and obtain the sample loss function corresponding to the training model to be evaluated;
对N个所述测试训练样本进行检测,选取K个目标训练样本,其中,K为正整数;Perform detection on the N test training samples, and select K target training samples, where K is a positive integer;
将所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第一影响函数;Inputting N-K test training samples in the training data set excluding the K target training samples into the training model to be evaluated for training, and obtaining a first influence function corresponding to the sample loss function;
对K个所述目标训练样本进行样本特征变更,获取K个更新特征样本,将K个所述更新特征样本和所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第二影响函数;Perform sample feature changes on the K target training samples, obtain K updated feature samples, and input the K updated feature samples and NK test training samples in the training data set except the K target training samples Training the training model to be evaluated to obtain a second influence function corresponding to the sample loss function;
基于所述第一影响函数和所述第二影响函数,获取K个所述目标训练样本对所述待评估训练模型的样本影响结果。Based on the first influence function and the second influence function, obtain sample influence results of the K target training samples on the training model to be evaluated.
上述基于人工智能的样本评估方法、装置、计算机设备及存储介质,服务端将测试训练样本在待评估训练模型进行训练后的输出预测值与N个测试训练样本对应的实际值进行对比,获取相应的样本损失函数,以便下一步对影响待评估训练模型的输出预测值的原因进行分析;通过对选取的K个目标训练样本来进一步对待评估训练模型的输出预测值的影响进行测试,能够目标训练样本的角度对待评估训练模型输出结果的影响进行分析或评估,有助于后续优化改进待评估训练模型。进一步地,服务端通过对第一影响函数和第二影响函数进行计算,保持较拟合精度的情况下,使得对于不能求导能微微积分的待评估训练模型能够以较低的计算成本计算第一影响函数和第二影响函数,得到K个目标训练样本对待评估训练模型的样本影响结果,通过对样本影响结果进行分析,得到目标训练样本对待评估训练模型的影响,实现了目标训练样本对待评估训练模型输出结果的影响的可解释性,有助于后续优化改进待评估训练模型。In the above-mentioned artificial intelligence-based sample evaluation method, device, computer equipment and storage medium, the server compares the output predicted value of the test training sample after training on the training model to be evaluated with the actual value corresponding to the N test training samples to obtain the corresponding The sample loss function for the next step is to analyze the reasons that affect the output prediction value of the training model to be evaluated; through the selected K target training samples to further test the impact of the output prediction value of the training model to be evaluated, target training The analysis or evaluation of the influence of the output result of the training model to be evaluated from the angle of the sample is helpful for subsequent optimization and improvement of the training model to be evaluated. Further, the server calculates the first influencing function and the second influencing function, while maintaining a relatively high fitting accuracy, so that the training model to be evaluated that cannot be derivable for calculus can be calculated at a lower computational cost. The first influence function and the second influence function obtain K target training samples' sample influence results of the training model to be evaluated. By analyzing the sample influence results, the influence of the target training samples to the training model to be evaluated is obtained, and the target training samples are realized The interpretability of the impact of the output results of the training model is helpful for subsequent optimization and improvement of the training model to be evaluated.
附图说明Description of the drawings
为了更清楚地说明本申请实施例的技术方案,下面将对本申请实施例的描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to explain the technical solutions of the embodiments of the present application more clearly, the following will briefly introduce the drawings that need to be used in the description of the embodiments of the present application. Obviously, the drawings in the following description are only some embodiments of the present application. For those of ordinary skill in the art, other drawings can be obtained based on these drawings without creative labor.
图1是本申请一实施例中基于人工智能的样本评估方法的一应用环境示意图;FIG. 1 is a schematic diagram of an application environment of a sample evaluation method based on artificial intelligence in an embodiment of the present application;
图2是本申请一实施例中基于人工智能的样本评估方法的一流程图;2 is a flowchart of a sample evaluation method based on artificial intelligence in an embodiment of the present application;
图3是本申请一实施例中基于人工智能的样本评估方法的另一流程图;FIG. 3 is another flowchart of a sample evaluation method based on artificial intelligence in an embodiment of the present application;
图4是本申请一实施例中基于人工智能的样本评估方法的另一流程图;FIG. 4 is another flowchart of a sample evaluation method based on artificial intelligence in an embodiment of the present application;
图5是本申请一实施例中基于人工智能的样本评估方法的另一流程图;FIG. 5 is another flowchart of a sample evaluation method based on artificial intelligence in an embodiment of the present application;
图6是本申请一实施例中基于人工智能的样本评估方法的另一流程图;FIG. 6 is another flowchart of a sample evaluation method based on artificial intelligence in an embodiment of the present application;
图7是本申请一实施例中基于人工智能的样本评估装置的一示意图;FIG. 7 is a schematic diagram of a sample evaluation device based on artificial intelligence in an embodiment of the present application;
图8是本申请一实施例中计算机设备的一示意图。Fig. 8 is a schematic diagram of a computer device in an embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be described clearly and completely in conjunction with the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, rather than all of them. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of this application.
本申请实施例提供的基于人工智能的样本评估方法,该基于人工智能的样本评估方法可应用如图1所示的应用环境中。具体地,该基于人工智能的样本评估方法应用在样本评估系统中,该样本评估系统包括如图1所示的客户端和服务端,客户端与服务端通过网络进行通信,用于实现基于人工智能的样本评估方法。其中,客户端又称为用户端,是指与服务端相对应,为客户提供本地服务的程序。客户端可安装在但不限于各种个人计算机、笔记本电脑、智能手机、平板电脑和便携式可穿戴设备上。服务端可以用独立的服务端或者是多个服务端组成的服务端集群来实现。服务端将测试训练样本在待评估训练模型进行训练后的输出预测值与N个测试训练样本对应的实际值进行对比,获取相应的样本损失函数,以便下一步对影响待评估训练模型的输出预测值的原因进行分析;通过对选取的K个目标训练样本来进一步对待评估训练模型的输出预测值的影响进行测试,能够目标训练样本的角度对待评估训练模型输出结果的影响进行分析或评估,有助于后续优化改进待评估训练模型。进一步地,服务端通过对第一影响函数和第二影响函数进行计算,保持较拟合精度的情况下,使得对于不能求导能微微积分的待评估训练模型能够以较低的计算成本计算第一影响函数和第二影响函数,得到K个目标训练样本对待评估训练模型的样本影响结果,通过对样本影响结果进行分析,得到目标训练样本对待评估训练模型的影响,实现了目标训练样本对待评估训练模型输出结果的影响的可解释性,有助于后续优化改进待评估训练模型。According to the artificial intelligence-based sample evaluation method provided by the embodiments of the present application, the artificial intelligence-based sample evaluation method can be applied to the application environment shown in FIG. Specifically, the artificial intelligence-based sample evaluation method is applied to a sample evaluation system, which includes a client and a server as shown in FIG. Intelligent sample evaluation method. Among them, the client is also called the client, which refers to the program that corresponds to the server and provides local services to the client. The client can be installed on, but not limited to, various personal computers, notebook computers, smart phones, tablet computers, and portable wearable devices. The server can be implemented as an independent server or a server cluster composed of multiple servers. The server compares the output prediction value of the test training sample after training on the training model to be evaluated with the actual value corresponding to the N test training samples, and obtains the corresponding sample loss function, so that the next step can affect the output prediction of the training model to be evaluated Analyze the cause of the value; by selecting K target training samples to further test the impact of the output prediction value of the training model to be evaluated, the angle of the target training sample can be analyzed or evaluated for the impact of the output result of the training model to be evaluated. Helps subsequent optimization and improvement of the training model to be evaluated. Further, the server calculates the first influencing function and the second influencing function, while maintaining a relatively high fitting accuracy, so that the training model to be evaluated that cannot be derivable for calculus can be calculated at a lower computational cost. The first influence function and the second influence function obtain K target training samples' sample influence results of the training model to be evaluated. By analyzing the sample influence results, the influence of the target training samples to the training model to be evaluated is obtained, and the target training samples are realized The interpretability of the impact of the output results of the training model is helpful for subsequent optimization and improvement of the training model to be evaluated.
在一实施例中,如图2所示,提供一种基于人工智能的样本评估方法,以该方法应用在图1中的服务端为例进行说明,包括如下步骤:In an embodiment, as shown in FIG. 2, a sample evaluation method based on artificial intelligence is provided. The application of the method to the server in FIG. 1 is taken as an example for description, including the following steps:
S10:获取训练数据集,训练数据集包括N个测试训练样本,其中,N为正整数。S10: Obtain a training data set. The training data set includes N test training samples, where N is a positive integer.
其中,训练数据集为用户自定义设置用于存储测试训练样本的集合,本示例中,训练数据集存储有N个测试训练样本,以便基于N个测试训练样本分析。测试训练样本包括训练数据和训练数据对应的标签。作为一示例,测试训练样本可以为车损训练样本,其中,一个车损训练样本具体包括车损图像和车损图像对应的标签,此时,车损图像为训练数据。Among them, the training data set is a user-defined set for storing test training samples. In this example, the training data set stores N test training samples for analysis based on the N test training samples. The test training sample includes the training data and the label corresponding to the training data. As an example, the test training sample may be a car damage training sample, where a car damage training sample specifically includes a car damage image and a label corresponding to the car damage image. In this case, the car damage image is training data.
S20:采用待评估训练模型对N个测试训练样本进行训练,获取待评估训练模型对应的样本损失函数。S20: Use the training model to be evaluated to train N test training samples, and obtain a sample loss function corresponding to the training model to be evaluated.
其中,待评估训练模型为需要进行评估分析的模型,具体可以为对测试训练样本进行训练的深度学习模型。可选地,待评估训练模型包括但不限于为Faster RCNN模型或SS D模型。样本损失函数为计算待评估训练模型的输出预测值与测试训练样本的实际值之间 的差异的函数。输出预测值为待评估训练模型对测试训练样本进行训练后得到的值。实际值为测试训练样本实际对应的值,此处的实际值可以理解为训练数据的标签。例如,测试训练样本实际对应的值为A,待评估训练模型对测试训练样本进行训练后得到的输出预测值为B,样本损失函数为度量A和B之间差异的函数Among them, the training model to be evaluated is a model that needs to be evaluated and analyzed, and specifically may be a deep learning model for training a test training sample. Optionally, the training model to be evaluated includes but is not limited to the Faster RCNN model or the SS D model. The sample loss function is a function that calculates the difference between the output predicted value of the training model to be evaluated and the actual value of the test training sample. The output prediction value is the value obtained after the training model to be evaluated trains the test training sample. The actual value is the actual value corresponding to the test training sample, and the actual value here can be understood as the label of the training data. For example, the actual value corresponding to the test training sample is A, the output prediction value obtained by the training model to be evaluated on the test training sample is B, and the sample loss function is a function that measures the difference between A and B
具体地,服务端将N个测试训练样本输入到待评估训练模型进行训练。待评估训练模型对N个测试训练样本进行训练后,得到与N个测试训练样本对应的输出预测值。服务端将N个输出预测值与N个测试训练样本对应的实际值进行对比,获取N个测试训练样本对应的样本损失值,再基于N个样本损失值构建样本损失函数,用于下一步对影响待评估训练模型的输出预测值的原因进行分析。Specifically, the server inputs N test training samples to the training model to be evaluated for training. After the training model to be evaluated trains the N test training samples, the output prediction values corresponding to the N test training samples are obtained. The server compares the N output predicted values with the actual values corresponding to the N test training samples, obtains the sample loss values corresponding to the N test training samples, and then builds the sample loss function based on the N sample loss values for the next step The reasons that affect the output prediction value of the training model to be evaluated are analyzed.
S30:对N个测试训练样本进行检测,选取K个目标训练样本,其中,K为正整数。S30: Detect N test training samples, and select K target training samples, where K is a positive integer.
其中,目标训练样本为用于对待评估训练模型的输出预测值的影响进行测试的样本。Wherein, the target training sample is a sample used to test the influence of the output prediction value of the training model to be evaluated.
具体地,服务端对N个测试训练样本进行检测,并从训练数据集中选取K个目标训练样本。可选地,选取目标训练样本的可以是从训练数据集中随机选取K个目标训练样本,或者按照预设选取方式从N个测试训练样本进行筛选所确定的K个目标训练样本。其中,预设选取方式为用户自定义设置的选取方式,用于对目标训练样本进行选取。Specifically, the server detects N test training samples, and selects K target training samples from the training data set. Optionally, selecting the target training samples may be randomly selecting K target training samples from the training data set, or selecting K target training samples determined by screening N test training samples according to a preset selection method. Among them, the preset selection method is a user-defined selection method, which is used to select the target training sample.
可以理解地,服务端通过对选取的K个目标训练样本来进一步对待评估训练模型的输出预测值的影响进行测试,能够提高目标训练样本对待评估训练模型的输出预测值的影响进行分析的效率。Understandably, the server further tests the influence of the output prediction value of the training model to be evaluated by selecting K target training samples, which can improve the efficiency of analyzing the influence of the target training sample on the output prediction value of the training model to be evaluated.
S40:将训练数据集中除K个目标训练样本外的N-K个测试训练样本输入待评估训练模型进行训练,获取与样本损失函数相对应的第一影响函数。S40: Input the N-K test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the first influence function corresponding to the sample loss function.
其中,第一影响函数为计算N-K个测试训练样本对待评估训练模型的输出预测值影响的函数。Among them, the first influence function is a function for calculating the influence of N-K test training samples on the output prediction value of the training model to be evaluated.
具体地,为了分析目标训练样本对待评估训练模型的输出预测值的影响,服务端通过选取的K个目标训练样本后,将训练数据集中剩余的N-K个测试训练样本输入待评估训练模型进行检测,根据待评估训练模型的输出预测值,进一步地,服务端通过待评估训练模型的输出预测值和样本损失函数进行计算,获取第一影响函数,以便后续根据第一影响函数对待评估训练模型的输出预测值进行分析,获取样本影响结果,以实现对影响待评估训练模型的输出预测值的训练样本进行分析,有助于从训练样本角度对待评估训练模型进行优化改进。Specifically, in order to analyze the influence of the target training sample on the output prediction value of the training model to be evaluated, the server passes the selected K target training samples, and then inputs the remaining NK test training samples in the training data set into the training model to be evaluated for testing. According to the output prediction value of the training model to be evaluated, further, the server calculates the output prediction value of the training model to be evaluated and the sample loss function to obtain the first influence function, so as to follow the output of the training model to be evaluated according to the first influence function The predicted value is analyzed, and the sample impact result is obtained to realize the analysis of the training samples that affect the output predicted value of the training model to be evaluated, which is helpful for optimizing and improving the training model to be evaluated from the perspective of training samples.
S50:对K个目标训练样本进行样本特征变更,获取K个更新特征样本,将K个更新特征样本和训练数据集中除K个目标训练样本外的N-K个测试训练样本输入待评估训练模型进行训练,获取与样本损失函数相对应的第二影响函数。S50: Perform sample feature changes on K target training samples, obtain K updated feature samples, and input K updated feature samples and NK test training samples in the training data set except for K target training samples into the training model to be evaluated for training , Obtain the second influence function corresponding to the sample loss function.
其中,第二影响函数为计算K个更新特征样本和N-K个测试训练样本对待评估训练模型的输出预测值影响的函数。Wherein, the second influence function is a function for calculating the influence of K update feature samples and N-K test training samples on the output prediction value of the training model to be evaluated.
具体地,为了分析目标训练样本对待评估训练模型的输出预测值的影响,服务端通过 对目标训练样本的样本特征进行变更后,得到K个更新特征样本,将训练数据集中剩余的N-K个测试训练样本和K个更新特征样本输入待评估训练模型进行检测,根据待评估训练模型的输出预测值,获取第二模型参数。进一步地,服务端通过对第二模型参数和样本损失函数进行计算,获取第二影响函数,以便后续根据第二影响函数对待评估训练模型的输出预测值进行分析,获取样本影响结果的影响,以实现对影响待评估训练模型的输出预测值的原因进行分析。Specifically, in order to analyze the impact of the target training sample on the output prediction value of the training model to be evaluated, the server obtains K updated feature samples by changing the sample characteristics of the target training sample, and trains the remaining NK test samples in the training data set. The samples and K updated feature samples are input to the training model to be evaluated for testing, and the second model parameters are obtained according to the output predicted value of the training model to be evaluated. Further, the server obtains the second influence function by calculating the second model parameters and the sample loss function, so as to subsequently analyze the output prediction value of the training model to be evaluated according to the second influence function, and obtain the influence of the sample influence result. Realize the analysis of the reasons that affect the output prediction value of the training model to be evaluated.
作为一示例,对K个目标训练样本的样本特征β进行样本特征变更,得到K个样本特征为δ的更新特征样本。As an example, sample feature changes are performed on the sample features β of K target training samples to obtain K updated feature samples whose sample features are δ.
S60:基于第一影响函数和第二影响函数,获取K个目标训练样本对待评估训练模型的样本影响结果。S60: Based on the first influence function and the second influence function, obtain K target training samples sample influence results of the training model to be evaluated.
其中,样本影响结果为对待评估训练模型的输出预测值的影响进行评估或分析的结果。Wherein, the sample influence result is the result of evaluating or analyzing the influence of the output prediction value of the training model to be evaluated.
具体地,服务端基于预设处理逻辑对第一影响函数和第二影响函数进行综合分析处理,获取K个目标训练样本对待评估训练模型的样本影响结果。其中,预设处理逻辑为对第一影响函数和第二影响函数进行加权或求差处理。即服务端在获取第一影响函数和第二影响函数后,可通过第一影响函数和第二影响函数,分析出目标训练样本对待评估训练模型的输出预测值的样本影响结果。服务端通过对样本影响结果进行分析,得到目标训练样本对待评估训练模型的影响,实现对待评估训练模型的输出预测值的影响进行评估或分析。Specifically, the server performs comprehensive analysis and processing on the first influence function and the second influence function based on the preset processing logic, and obtains the sample influence results of the K target training samples of the training model to be evaluated. Wherein, the preset processing logic is to perform weighting or difference processing on the first influencing function and the second influencing function. That is, after acquiring the first influence function and the second influence function, the server can analyze the sample influence result of the output prediction value of the training model to be evaluated by the target training sample through the first influence function and the second influence function. The server obtains the impact of the target training sample on the training model to be evaluated by analyzing the sample impact results, and realizes the evaluation or analysis of the impact of the output predicted value of the training model to be evaluated.
在本实施例中,服务端将测试训练样本在待评估训练模型进行训练后的输出预测值与N个测试训练样本对应的实际值进行对比,获取相应的样本损失函数,以便下一步对影响待评估训练模型的输出预测值的原因进行分析;通过对选取的K个目标训练样本来进一步对待评估训练模型的输出预测值的影响进行测试,能够目标训练样本的角度对待评估训练模型输出结果的影响进行分析或评估,有助于后续优化改进待评估训练模型。进一步地,服务端通过对第一影响函数和第二影响函数进行计算,保持较拟合精度的情况下,使得对于不能求导能微微积分的待评估训练模型能够以较低的计算成本计算第一影响函数和第二影响函数,得到K个目标训练样本对待评估训练模型的样本影响结果,通过对样本影响结果进行分析,得到目标训练样本对待评估训练模型的影响,实现了目标训练样本对待评估训练模型输出结果的影响的可解释性,有助于后续优化改进待评估训练模型。In this embodiment, the server compares the output predicted value of the test training sample after the training model to be evaluated is trained with the actual value corresponding to the N test training samples, and obtains the corresponding sample loss function, so that the next step can affect the expected value. Analyze the reason for the output prediction value of the evaluation training model; further test the impact of the output prediction value of the evaluation training model by selecting K target training samples, and the impact of the target training sample angle on the output prediction value of the evaluation training model Analysis or evaluation is helpful for subsequent optimization and improvement of the training model to be evaluated. Further, the server calculates the first influencing function and the second influencing function, while maintaining a relatively high fitting accuracy, so that the training model to be evaluated that cannot be derivable for calculus can be calculated at a lower computational cost. The first influence function and the second influence function obtain K target training samples' sample influence results of the training model to be evaluated. By analyzing the sample influence results, the influence of the target training samples to the training model to be evaluated is obtained, and the target training samples are realized The interpretability of the impact of the output results of the training model is helpful for subsequent optimization and improvement of the training model to be evaluated.
在一实施例中,如图3所示,步骤S20,即采用待评估训练模型对N个测试训练样本进行训练,获取待评估训练模型对应的样本损失函数,包括:In one embodiment, as shown in FIG. 3, step S20, that is, using the training model to be evaluated to train N test training samples to obtain the sample loss function corresponding to the training model to be evaluated includes:
S21:采用待评估训练模型对N个测试训练样本进行训练,获取N个测试训练样本对应的输出预测值。S21: Use the training model to be evaluated to train N test training samples, and obtain output prediction values corresponding to the N test training samples.
具体地,服务端通过待评估训练模型对N个测试训练样本进行训练,获取待评估训练模型的输出预测值。可以理解地,服务端通过N个测试训练样本和待评估训练模型的输出 预测值能够进一步计算待评估训练模型的输出预测值与测试训练样本的实际值之间的差异的样本损失函数函数。Specifically, the server trains N test training samples through the training model to be evaluated, and obtains the output prediction value of the training model to be evaluated. Understandably, the server can further calculate the sample loss function of the difference between the output prediction value of the training model to be evaluated and the actual value of the test training sample through the N test training samples and the predicted output value of the training model to be evaluated.
S22:基于测试训练样本和输出预测值,获取样本损失函数。S22: Obtain a sample loss function based on the test training sample and the output prediction value.
具体地,服务端通过对N个测试训练样本的实际值和待评估训练模型的输出预测值进行计算,获取N个样本损失值,并基于N个样本损失值获取对应的样本损失函数,以便后续利用样本损失函数对影响待评估训练模型的输出预测值的原因进行分析。Specifically, the server obtains N sample loss values by calculating the actual values of N test training samples and the output prediction values of the training model to be evaluated, and obtains the corresponding sample loss function based on the N sample loss values for subsequent The sample loss function is used to analyze the reasons that affect the output prediction value of the training model to be evaluated.
作为一示例,X是输入空间车损图像,Y是输出空间如车损图像对应的标签,而每一个测试训练样本则定义为Z 1,...,Z n,其中Z i=(X i,Y i)∈X×Y。对于一个测试训练样本Z和待
Figure PCTCN2020135339-appb-000001
设经验风险可二次微积分并且对于初始模型参数θ是凸函数。
As an example, X is the input space car damage image, Y is the output space such as the label corresponding to the car damage image, and each test training sample is defined as Z 1 ,...,Z n , where Z i =(X i ,Y i )∈X×Y. For a test training sample Z and wait
Figure PCTCN2020135339-appb-000001
Suppose the empirical risk can be quadratic calculus and the initial model parameter θ is a convex function.
本实施例中,服务端获取测试训练样本在待评估训练模型进行训练后,获取用于计算待评估训练模型的输出预测值与测试训练样本对应的实际值之间的差异的样本损失函数,通过样本损失函数对影响待评估训练模型的输出预测值的原因进行进一步分析,保证分析结果的准确性和有效性。In this embodiment, the server obtains the test training sample after the training model to be evaluated is trained, and obtains the sample loss function used to calculate the difference between the output prediction value of the training model to be evaluated and the actual value corresponding to the test training sample, through The sample loss function further analyzes the reasons that affect the output prediction value of the training model to be evaluated to ensure the accuracy and validity of the analysis result.
在一实施例中,如图4所示,步骤S30,即对N个测试训练样本进行检测,选取K个目标训练样本,包括:In one embodiment, as shown in FIG. 4, step S30, that is, detecting N test training samples and selecting K target training samples includes:
S31:获取N个测试训练样本对应的当前样本参数,判断当前样本参数是否满足筛选参数阈值。S31: Obtain current sample parameters corresponding to N test training samples, and determine whether the current sample parameters meet the screening parameter threshold.
其中,当前样本参数为测试训练样本中的数据参数。筛选参数阈值为用户自定义设置的数值,用于对当前样本参数进行筛选。Among them, the current sample parameter is the data parameter in the test training sample. The filter parameter threshold is a value set by the user and is used to filter the current sample parameters.
具体地,服务端获取N个测试训练样本对应的当前样本参数后,对当前样本参数进行判断,判断当前样本参数是够满足筛选参数阈值,以通过筛选参数阈值对测试训练样本进行筛选,以便从N个测试训练样本中筛选出满足筛选参数阈值的K个目标训练样本,利用K个目标训练样本对影响待评估训练模型的输出预测值的原因进行分析,以便后续根据K个目标训练样本对待评估训练模型的样本影响结果,更新相应的训练样本,以提高待评估训练模型的准确率。Specifically, after the server obtains the current sample parameters corresponding to the N test training samples, it judges the current sample parameters and judges whether the current sample parameters are sufficient to meet the screening parameter threshold, so as to filter the test training samples by the screening parameter threshold, so that K target training samples that meet the screening parameter threshold are selected from N test training samples, and the K target training samples are used to analyze the reasons that affect the output prediction value of the training model to be evaluated, so that the subsequent K target training samples are to be evaluated The samples of the training model affect the results, and the corresponding training samples are updated to improve the accuracy of the training model to be evaluated.
作为一示例,测试训练样本为车损训练样本,服务端获取训练数据集中每一车损训练样本中的车损图像对应的当前样本参数,该当前样本参数可以为图像分辨率大小、图像水平分辨率、图像垂直分辨率、图像亮度和对比度中的至少一个评估特征,每一评估特征对应的筛选参数阈值设置为X,若获取的当前样本参数为Y,X<Y时,则将当前样本参数确定为目标训练样本。As an example, the test training samples are car damage training samples, and the server obtains the current sample parameters corresponding to the car damage images in each car damage training sample in the training data set. The current sample parameters can be the image resolution size and the image level resolution. At least one of the evaluation features of the image rate, image vertical resolution, image brightness, and contrast. The screening parameter threshold corresponding to each evaluation feature is set to X. If the current sample parameter obtained is Y, and X<Y, the current sample parameter Determined as the target training sample.
S32:若当前样本参数满足筛选参数阈值,则将测试训练样本确定为目标训练样本。S32: If the current sample parameter meets the screening parameter threshold, the test training sample is determined as the target training sample.
本示例中,筛选参数阈值的条件可以由用户自定义设置的阈值。作为一示例,将当前样本参数满足筛选参数阈值确定为从N个测试训练样本中筛选K个目标训练样本的筛选条 件,具体可以是当前样本参数大于或等于筛选参数阈值。In this example, the conditions for filtering parameter thresholds can be user-defined thresholds. As an example, determining that the current sample parameter satisfies the screening parameter threshold is a screening condition for screening K target training samples from N test training samples. Specifically, the current sample parameter may be greater than or equal to the screening parameter threshold.
具体地,当N个测试训练样本对应的当前样本参数中,若存在满足筛选参数阈值的当前样本参数,则将满足筛选参数阈值的当前样本参数对应的测试训练样本确定为目标训练样本。Specifically, when among the current sample parameters corresponding to the N test training samples, if there is a current sample parameter that meets the screening parameter threshold, the test training sample corresponding to the current sample parameter that meets the screening parameter threshold is determined as the target training sample.
在本实施例中,服务端通以通过筛选参数阈值对测试训练样本进行筛选,以便从N个测试训练样本中筛选出满足筛选参数阈值的K个目标训练样本,利用K个目标训练样本对影响待评估训练模型的输出预测值的原因进行分析,以便后续根据K个目标训练样本对待评估训练模型的样本影响结果,更新相应的训练样本,以提高待评估训练模型的准确率。In this embodiment, the server screens the test training samples through the screening parameter threshold, so as to screen out K target training samples that meet the screening parameter threshold from the N test training samples, and use K target training samples to influence The reason for the output prediction value of the training model to be evaluated is analyzed, so that the subsequent K target training samples affect the results of the sample of the training model to be evaluated, and the corresponding training samples are updated to improve the accuracy of the training model to be evaluated.
在一实施例中,如图5所示,步骤S40中,将训练数据集中除K个目标训练样本外的N-K个测试训练样本输入待评估训练模型进行训练,获取与样本损失函数相对应的第一影响函数,包括:In one embodiment, as shown in FIG. 5, in step S40, NK test training samples in the training data set except the K target training samples are input into the training model to be evaluated for training, and the first training model corresponding to the sample loss function is obtained. An influence function, including:
S41:将训练数据集中除K个目标训练样本外的N-K个测试训练样本输入待评估训练模型进行训练,获取待评估训练模型的第一变更权重。S41: Input the N-K test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the first change weight of the training model to be evaluated.
其中,第一变更权重为利用除K个目标训练样本外的N-K个测试训练样本对待评估训练模型进行训练之后,确定的待评估训练模型中各模型参数的权重。Wherein, the first change weight is the weight of each model parameter in the training model to be evaluated after training the training model to be evaluated using N-K test training samples in addition to the K target training samples.
具体地,服务端将训练数据集中除K个目标训练样本外的N-K个测试训练样本输入待评估训练模型进行训练,基于样本损失函数获取经验风险,基于经验风险获取第一变更权重。其中,经验风险为通过样本损失函数对目标训练样本进行累加计算后的平均值。可以理解地,由于将K个目标训练样本剔除后,除K个目标训练样本外的N-K个测试训练样本在待评估训练模型中训练,其模型参数的权重相应的发生改变,通过经验风险获取的第一变更权重,能够进一步获取目标训练样本的样本权重的变化对影响待评估训练模型的输出预测值的原因进行分析。Specifically, the server inputs N-K test training samples in the training data set excluding the K target training samples into the training model to be evaluated for training, obtains the empirical risk based on the sample loss function, and obtains the first change weight based on the empirical risk. Among them, the empirical risk is the average value obtained by accumulating the target training sample through the sample loss function. It is understandable that after the K target training samples are eliminated, NK test training samples other than the K target training samples are trained in the training model to be evaluated, and the weights of the model parameters are changed accordingly, which is obtained through empirical risk The first change weight can further obtain the change of the sample weight of the target training sample to analyze the reasons that affect the output prediction value of the training model to be evaluated.
Figure PCTCN2020135339-appb-000002
Figure PCTCN2020135339-appb-000002
S42:根据待评估训练模型对应的初始模型参数和第一变更权重,获取待评估训练模型的第一模型参数。S42: Acquire the first model parameter of the training model to be evaluated according to the initial model parameters and the first change weight corresponding to the training model to be evaluated.
其中,初始模型参数为将样本损失函数计算的差异降为最小的初始参数,是基于经验风险获取的参数。第一模型参数为将样本损失函数对N-K个测试训练样本的计算差异降为最小的测试参数。Among them, the initial model parameter is the initial parameter that minimizes the difference in the calculation of the sample loss function, and is a parameter obtained based on empirical risk. The first model parameter is a test parameter that minimizes the calculation difference between the sample loss function and the N-K test training samples.
作为一示例,经验风险具体为
Figure PCTCN2020135339-appb-000003
通过对经验风险进行计算得到初始模型 参数
Figure PCTCN2020135339-appb-000004
其中,
Figure PCTCN2020135339-appb-000005
为初始模型参数,Θ为数据库中所有模型的集合,θ为待评估检测模型,L(Z i,θ)为样本损失函数。
As an example, the empirical risk is specifically
Figure PCTCN2020135339-appb-000003
The initial model parameters are obtained by calculating the empirical risk
Figure PCTCN2020135339-appb-000004
among them,
Figure PCTCN2020135339-appb-000005
Is the initial model parameter, Θ is the collection of all models in the database, θ is the test model to be evaluated, and L(Z i , θ) is the sample loss function.
Figure PCTCN2020135339-appb-000006
Figure PCTCN2020135339-appb-000006
S43:基于待评估训练模型对应的第一模型参数和样本损失函数,获取与样本损失函数相对应的第一影响函数。S43: Obtain a first influence function corresponding to the sample loss function based on the first model parameter and the sample loss function corresponding to the training model to be evaluated.
具体的,为了分析目标训练样本对待评估训练模型输出结果的影响,服务端删除选取的K个目标训练样本后,将训练数据集中剩余的N-K个测试训练样本输入待评估训练模型进行检测,对待评估训练模型对应的初始模型参数和第一变更权重进行更新,获取第一模型参数。进一步地,服务端通过基于初始模型参数与第一模型参数的变化量和样本损失函数进行计算,获取第一影响函数,通过第一影响函数对待评估训练模型的输出预测值进行分析,获取样本影响结果的影响,以实现对影响待评估训练模型的输出预测值的原因进行分析。Specifically, in order to analyze the impact of the target training sample on the output result of the training model to be evaluated, the server deletes the selected K target training samples, and then inputs the remaining NK test training samples in the training data set into the training model to be evaluated for testing. The initial model parameters and the first change weight corresponding to the training model are updated to obtain the first model parameters. Further, the server obtains the first influence function through calculations based on the amount of change between the initial model parameters and the first model parameters and the sample loss function, and analyzes the output predicted value of the training model to be evaluated through the first influence function, and obtains the sample influence The influence of the result to realize the analysis of the reason that influences the output prediction value of the training model to be evaluated.
Figure PCTCN2020135339-appb-000007
求导。
Figure PCTCN2020135339-appb-000008
为损失函数
Figure PCTCN2020135339-appb-000009
的黑塞矩阵的二阶导。
Figure PCTCN2020135339-appb-000010
为黑塞矩阵的一阶导。可以理解地,服务端获取初始模型参数与第一模型参数的变化量J up,params(Z),能够实现无需测试训练样本,即可评估出去除某K个目标训练样本后对第一模型参数的影响。
Figure PCTCN2020135339-appb-000007
Seek guidance.
Figure PCTCN2020135339-appb-000008
Is the loss function
Figure PCTCN2020135339-appb-000009
The second derivative of the Hesse matrix.
Figure PCTCN2020135339-appb-000010
Is the first derivative of the Hessian matrix. Understandably, the server obtains the change amount J up,params (Z) between the initial model parameter and the first model parameter, which can realize that without testing the training samples, it can evaluate the first model parameter after removing certain K target training samples. Impact.
Figure PCTCN2020135339-appb-000011
Figure PCTCN2020135339-appb-000011
在本实施例中,服务端通过对第一模型参数和样本损失函数进行计算,保持较拟合精度的情况下,使得对于不能求导能微微积分的待评估训练模型能够以较低的计算成本计算,获取第一影响函数,通过第一影响函数对待评估训练模型的输出预测值进行分析,获取样本影响结果的影响,以实现对影响待评估训练模型的输出预测值的原因进行分析,提高基于人工智能的样本评估方法的效率。In this embodiment, the server calculates the first model parameters and the sample loss function to maintain a relatively good fitting accuracy, so that the training model to be evaluated that cannot be derived and calculus can be evaluated at a lower computational cost. Calculate, obtain the first influence function, analyze the output prediction value of the training model to be evaluated through the first influence function, and obtain the influence of the sample on the result, so as to realize the analysis of the reasons that affect the output prediction value of the training model to be evaluated, and improve the The efficiency of the artificial intelligence sample evaluation method.
作为一示例,步骤S43,即基于待评估训练模型对应的第一模型参数和样本损失函数,获取与样本损失函数相对应的第一影响函数,包括:基于黑塞向量乘积和预设迭代次数,对待评估训练模型对应的第一模型参数和样本损失函数进行处理,获取与样本损失函数相对应的第一影响函数。As an example, step S43 is to obtain the first influence function corresponding to the sample loss function based on the first model parameter and the sample loss function corresponding to the training model to be evaluated, including: based on the Hesse vector product and the preset number of iterations, The first model parameter and the sample loss function corresponding to the training model to be evaluated are processed, and the first influence function corresponding to the sample loss function is obtained.
其中,黑塞向量乘积为用于对第一影响函数和第二影响函数进行计算的方法。预设迭代次数为用户自定义设置的对第一影响函数和第二影响函数进行迭代计算的次数。Among them, the Hesse vector product is a method used to calculate the first influential function and the second influential function. The preset number of iterations is the number of iterative calculations on the first influencing function and the second influencing function set by the user.
具体地,获取改变样本权重对目标训练样本Z test的第一影响函数J up,loss(Z,Z test),需要计算黑塞矩阵的逆,而这将消耗巨大的运算资源。为此,本实施例利用了黑塞向量乘积(H
Figure PCTCN2020135339-appb-000012
来避免直接计算黑塞矩阵的逆,通过高效地估计黑塞向量乘积(H
Figure PCTCN2020135339-appb-000013
来计算第一影响函数J up,loss(Z,Z test)。而
Figure PCTCN2020135339-appb-000014
的计算,则可以通过随机参数估计(stochastic estimation)的方法来实现。随机参数估计方法每一次迭代中只需采样一个样本点,因此能够极大地提高计算速度,降低运算资源;同时
Figure PCTCN2020135339-appb-000015
中的
Figure PCTCN2020135339-appb-000016
并用
Figure PCTCN2020135339-appb-000017
表示
Figure PCTCN2020135339-appb-000018
泰勒展开的前j项估计:
Figure PCTCN2020135339-appb-000019
从泰勒展开的性质可知,当j→∞时,
Figure PCTCN2020135339-appb-000020
因此
Figure PCTCN2020135339-appb-000021
的无偏估计
Figure PCTCN2020135339-appb-000022
仍有
Figure PCTCN2020135339-appb-000023
在此,实施例基于
Figure PCTCN2020135339-appb-000024
来计算第一影响函数J up,loss(Z,Z test)。
Specifically, to obtain the first influence function J up,loss (Z,Z test ) of changing the sample weight on the target training sample Z test , it is necessary to calculate the inverse of the Hessian matrix, which will consume huge computing resources. For this reason, this embodiment uses the Hessel vector product (H
Figure PCTCN2020135339-appb-000012
To avoid directly calculating the inverse of the Hessian matrix, by efficiently estimating the Hessian vector product (H
Figure PCTCN2020135339-appb-000013
To calculate the first influencing function J up,loss (Z,Z test ). and
Figure PCTCN2020135339-appb-000014
The calculation of, can be achieved by stochastic estimation. The random parameter estimation method only needs to sample one sample point in each iteration, so it can greatly increase the calculation speed and reduce the computing resources; at the same time
Figure PCTCN2020135339-appb-000015
middle
Figure PCTCN2020135339-appb-000016
And use
Figure PCTCN2020135339-appb-000017
Means
Figure PCTCN2020135339-appb-000018
Taylor's estimate of the first j items:
Figure PCTCN2020135339-appb-000019
From the nature of Taylor expansion, we know that when j→∞,
Figure PCTCN2020135339-appb-000020
therefore
Figure PCTCN2020135339-appb-000021
Unbiased estimate of
Figure PCTCN2020135339-appb-000022
Still have
Figure PCTCN2020135339-appb-000023
Here, the embodiment is based on
Figure PCTCN2020135339-appb-000024
To calculate the first influencing function J up,loss (Z,Z test ).
具体地,从训练数据集中选取K个目标训练样本,Z 1...Z k,并定义
Figure PCTCN2020135339-appb-000025
黑塞向量乘积初始值为
Figure PCTCN2020135339-appb-000026
将选取的K个目标训练样本对应的第一影响函数根据预设迭代次数进行迭代计算
Figure PCTCN2020135339-appb-000027
将最后一次迭代计算的结果确定为第一影响函数
Figure PCTCN2020135339-appb-000028
Specifically, select K target training samples from the training data set, Z 1 ... Z k , and define
Figure PCTCN2020135339-appb-000025
The initial value of the Hesse vector product is
Figure PCTCN2020135339-appb-000026
The first influence function corresponding to the selected K target training samples is iteratively calculated according to the preset number of iterations
Figure PCTCN2020135339-appb-000027
Determine the result of the last iteration calculation as the first influence function
Figure PCTCN2020135339-appb-000028
在本实施例中,服务端通过黑塞向量乘积对第一影响函数进行计算能提高计算效率,以提高服务端通过样本影响结果对待评估训练模型的输出预测值的影响进行评估或分析的效率,并通过黑塞向量乘积的高效率计算得到第一影响函数实现对待评估训练模型的输出预测值的影响进行评估或分析。In this embodiment, the server calculates the first influence function through the Hessian vector product to improve the calculation efficiency, so as to improve the efficiency of the server to evaluate or analyze the influence of the output prediction value of the training model to be evaluated through the sample influence result. And through the high-efficiency calculation of the Hessian vector product, the first influence function is obtained to realize the evaluation or analysis of the influence of the output prediction value of the training model to be evaluated.
在一实施例中,如图6所示,步骤S50中,将K个更新特征样本和训练数据集中除K个目标训练样本外的N-K个测试训练样本输入待评估训练模型进行训练,获取与样本损失函数相对应的第二影响函数,包括:In one embodiment, as shown in FIG. 6, in step S50, the K update feature samples and NK test training samples in the training data set except the K target training samples are input into the training model to be evaluated for training, and the samples are obtained The second influence function corresponding to the loss function includes:
S51:将K个更新特征样本和训练数据集中除K个目标训练样本外的N-K个测试训练样本输入待评估训练模型进行训练,获取待评估训练模型对应的第二变更权重。S51: Input the K update feature samples and N-K test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the second change weight corresponding to the training model to be evaluated.
其中,第二变更权重为目标训练样本的样本特征进行变更后,更新特征样本的权重。由于将K个目标训练样本的样本特征进行变更后,K个更新特征样本在待评估训练模型中的权重相应的发生改变,通过第二变更权重能够进一步获取样本权重的变化对影响待评估训练模型的输出预测值的原因进行分析。Wherein, the second change weight is that after the sample feature of the target training sample is changed, the weight of the feature sample is updated. Since the sample features of the K target training samples are changed, the weights of the K updated feature samples in the training model to be evaluated are changed accordingly, and the second change weight can further obtain the change of the sample weight and affect the training model to be evaluated The reason for the output predicted value is analyzed.
可以理解地,目标训练样本的样本特征进行变更后,对于更新特征样本Z δ权重∈响应的也进行变更,服务端通过第二变更权重能够进一步获取样本权重的变化对影响待评估训练模型的输出预测值的原因进行分析。 Understandably, after the sample feature of the target training sample is changed, the response to the updated feature sample Z δ weight ∈ is also changed. The server can further obtain the sample weight change through the second change weight and affect the output of the training model to be evaluated. The reason for the predicted value is analyzed.
S52:根据待评估训练模型对应的初始模型参数和第二变更权重,获取待评估训练模型对应的第二模型参数。S52: Obtain a second model parameter corresponding to the training model to be evaluated according to the initial model parameter and the second change weight corresponding to the training model to be evaluated.
其中,第二模型参数为将样本损失函数对K个更新特征样本和N-K个测试训练样本的计算差异降为最小的测试参数。Wherein, the second model parameter is a test parameter that minimizes the calculation difference between the K update feature samples and the N-K test training samples of the sample loss function.
具体地,服务端对初始,初始模型参数和第二变更权重进行计算,获取待评估训练模
Figure PCTCN2020135339-appb-000029
Specifically, the server calculates the initial, initial model parameters and the second change weight, and obtains the training model to be evaluated.
Figure PCTCN2020135339-appb-000029
S53:根据待评估训练模型对应的第二模型参数和样本损失函数,获取与样本损失函数相对应的第二影响函数。S53: Obtain a second influence function corresponding to the sample loss function according to the second model parameter and the sample loss function corresponding to the training model to be evaluated.
具体地,服务端对目标训练样本的样本特征进行变更后,基于第二变更权重,获取待评估训练模型对应第二模型参数,并对第二模型参数和样本损失函数进行计算,获取第二影响函数。Specifically, after the server changes the sample characteristics of the target training sample, based on the second change weight, obtains the second model parameters corresponding to the training model to be evaluated, and calculates the second model parameters and the sample loss function to obtain the second impact function.
作为一示例,服务端先基于第二变更权重,对第二模型参数进行计算,获取初始模型
Figure PCTCN2020135339-appb-000030
型参数的影响。进一步地,第二影响函数通过对测试训练样本的样本损失函数对更新特征
Figure PCTCN2020135339-appb-000031
As an example, the server first calculates the second model parameters based on the second change weight to obtain the initial model
Figure PCTCN2020135339-appb-000030
The influence of type parameters. Further, the second influence function is used to update the feature by the sample loss function of the test training sample
Figure PCTCN2020135339-appb-000031
在本实施例中,服务端通过第二变更权重能够进一步获取样本权重的变化对影响待评估训练模型的输出预测值的原因进行分析;接着,通过第二变更权重,获取待评估训练模型对应的第二模型参数,对第二模型参数和样本损失函数进行计算,保持较拟合精度的情况下,使得对于不能求导能微微积分的待评估训练模型能够以较低的计算成本计算获取第二影响函数,以使服务端能够通过第二影响函数对待评估训练模型的输出预测值的影响进行评估或分析。In this embodiment, the server can further obtain the change in sample weight through the second change weight to analyze the reasons that affect the output prediction value of the training model to be evaluated; then, use the second change weight to obtain the corresponding training model to be evaluated. The second model parameter is to calculate the second model parameter and the sample loss function, while maintaining a better fitting accuracy, so that the training model to be evaluated that cannot be derived can be calculated and obtained at a lower computational cost. The influence function, so that the server can evaluate or analyze the influence of the output prediction value of the training model to be evaluated through the second influence function.
在一实施例中,步骤S53,即基于待评估训练模型对应的第二模型参数和样本损失函数,获取与样本损失函数相对应的第二影响函数,包括:基于黑塞向量乘积和预设迭代次数,对待评估训练模型对应的第二模型参数和样本损失函数,获取与样本损失函数相对应的第二影响函数。In one embodiment, step S53 is to obtain the second influence function corresponding to the sample loss function based on the second model parameter and the sample loss function corresponding to the training model to be evaluated, including: based on the Hessian vector product and the preset iteration The number of times, the second model parameter and the sample loss function corresponding to the training model to be evaluated, and the second influence function corresponding to the sample loss function is obtained.
具体地,获取改变样本特征对目标训练样本Z test的第二影响函数J pert,loss(Z,Z test) T,需要
Figure PCTCN2020135339-appb-000032
对应的第二影响函数根据预设迭代次数进行迭代计算
Figure PCTCN2020135339-appb-000033
将最后一次迭代计算的结果确定为第二影响函数
Figure PCTCN2020135339-appb-000034
Specifically, to obtain the second influence function J pert,loss (Z,Z test ) T of the target training sample Z test by changing the sample characteristics, it needs
Figure PCTCN2020135339-appb-000032
The corresponding second influence function is iteratively calculated according to the preset number of iterations
Figure PCTCN2020135339-appb-000033
Determine the result of the last iteration calculation as the second influence function
Figure PCTCN2020135339-appb-000034
在本实施例中,服务端通过黑塞向量乘积对第二影响函数进行计算能提高计算效率, 以提高服务端通过样本影响结果对待评估训练模型的输出预测值的影响进行评估或分析的效率,并通过黑塞向量乘积的高效率计算得到第二影响函数实现对待评估训练模型的输出预测值的影响进行评估或分析。In this embodiment, the server calculates the second influence function through the Hessian vector product to improve the calculation efficiency, so as to improve the efficiency of the server to evaluate or analyze the influence of the output prediction value of the training model to be evaluated through the sample influence result. And through the high-efficiency calculation of the Hessian vector product, the second influence function is obtained to realize the evaluation or analysis of the influence of the output prediction value of the training model to be evaluated.
应理解,上述实施例中各步骤的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本申请实施例的实施过程构成任何限定。It should be understood that the size of the sequence number of each step in the foregoing embodiment does not mean the order of execution. The execution sequence of each process should be determined by its function and internal logic, and should not constitute any limitation on the implementation process of the embodiment of the present application.
在一实施例中,提供一种基于人工智能的样本评估装置,该基于人工智能的样本评估装置与上述实施例中基于人工智能的样本评估方法一一对应。如图7所示,该基于人工智能的样本评估装置包括数据获取模块10、样本训练模块20、样本检测模块30、第一影响函数模块40、第二影响函数模块50和结果获取模块60。各功能模块详细说明如下:In one embodiment, an artificial intelligence-based sample evaluation device is provided, and the artificial intelligence-based sample evaluation device corresponds to the artificial intelligence-based sample evaluation method in the above-mentioned embodiment in a one-to-one correspondence. As shown in FIG. 7, the artificial intelligence-based sample evaluation device includes a data acquisition module 10, a sample training module 20, a sample detection module 30, a first influence function module 40, a second influence function module 50 and a result acquisition module 60. The detailed description of each functional module is as follows:
数据获取模块10,用于获取训练数据集,训练数据集包括N个测试训练样本,其中,N为正整数;The data acquisition module 10 is used to acquire a training data set, the training data set includes N test training samples, where N is a positive integer;
样本训练模块20,用于采用待评估训练模型对N个测试训练样本进行训练,获取待评估训练模型对应的样本损失函数;The sample training module 20 is used to train N test training samples using the training model to be evaluated, and obtain the sample loss function corresponding to the training model to be evaluated;
样本检测模块30,用于对N个测试训练样本进行检测,选取K个目标训练样本,其中,K为正整数;The sample detection module 30 is used to detect N test training samples and select K target training samples, where K is a positive integer;
第一影响函数模块40,用于将训练数据集中除K个目标训练样本外的N-K个测试训练样本输入待评估训练模型进行训练,获取与样本损失函数相对应的第一影响函数;The first influence function module 40 is configured to input N-K test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the first influence function corresponding to the sample loss function;
第二影响函数模块50,用于对K个目标训练样本进行样本特征变更,获取K个更新特征样本,将K个更新特征样本和训练数据集中除K个目标训练样本外的N-K个测试训练样本输入待评估训练模型进行训练,获取与样本损失函数相对应的第二影响函数;The second influence function module 50 is used to change the sample characteristics of the K target training samples, obtain K updated feature samples, and combine the K updated feature samples and the training data set with NK test training samples other than the K target training samples Input the training model to be evaluated for training, and obtain the second influence function corresponding to the sample loss function;
结果获取模块60,用于基于第一影响函数和第二影响函数,获取K个目标训练样本对待评估训练模型的样本影响结果。The result acquisition module 60 is configured to acquire K target training samples based on the first influence function and the second influence function, and sample influence results of the training model to be evaluated.
进一步地,样本训练模块20包括:Further, the sample training module 20 includes:
预测值获取子模块,用于采用待评估训练模型对N个测试训练样本进行训练,获取N个测试训练样本对应的输出预测值;The prediction value acquisition sub-module is used to train N test training samples using the training model to be evaluated, and obtain the output prediction values corresponding to the N test training samples;
损失函数子模块,用于基于测试训练样本和输出预测值,获取样本损失函数。The loss function sub-module is used to obtain the sample loss function based on the test training sample and the output prediction value.
进一步地,样本检测模块30包括:Further, the sample detection module 30 includes:
阈值判断子模块,用于获取N个测试训练样本对应的当前样本参数,判断当前样本参数是否满足筛选参数阈值;The threshold judgment sub-module is used to obtain the current sample parameters corresponding to the N test training samples, and judge whether the current sample parameters meet the screening parameter threshold;
样本确定子模块,用于当当前样本参数满足筛选参数阈值时,则将测试训练样本确定为目标训练样本。The sample determination sub-module is used to determine the test training sample as the target training sample when the current sample parameter meets the screening parameter threshold.
进一步地,第一影响函数模块40包括:Further, the first influencing function module 40 includes:
第一权重子模块,用于将训练数据集中除K个目标训练样本外的N-K个测试训练样本输入待评估训练模型进行训练,获取待评估训练模型的第一变更权重;The first weight sub-module is used to input N-K test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the first change weight of the training model to be evaluated;
第一参数子模块,用于根据待评估训练模型对应的初始模型参数和第一变更权重,获 取待评估训练模型的第一模型参数;The first parameter sub-module is used to obtain the first model parameter of the training model to be evaluated according to the initial model parameters and the first change weight corresponding to the training model to be evaluated;
第一函数子模块,用于基于待评估训练模型对应的第一模型参数和样本损失函数,获取与样本损失函数相对应的第一影响函数。The first function sub-module is used to obtain the first influence function corresponding to the sample loss function based on the first model parameter and the sample loss function corresponding to the training model to be evaluated.
进一步地,第一影响函数模块40还包括:Further, the first influencing function module 40 also includes:
第一参数处理子模块,用于基于黑塞向量乘积和预设迭代次数,对待评估训练模型对应的第一模型参数和样本损失函数进行处理,获取与样本损失函数相对应的第一影响函数。The first parameter processing sub-module is used to process the first model parameter and sample loss function corresponding to the training model to be evaluated based on the Hessian vector product and the preset number of iterations to obtain the first influence function corresponding to the sample loss function.
进一步地,第二影响函数模块50包括:Further, the second influence function module 50 includes:
第二权重子模块,用于将K个更新特征样本和训练数据集中除K个目标训练样本外的N-K个测试训练样本输入待评估训练模型进行训练,获取待评估训练模型对应的第二变更权重;The second weight sub-module is used to input the K update feature samples and NK test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the second change weight corresponding to the training model to be evaluated ;
第二参数子模块,用于根据待评估训练模型对应的初始模型参数和第二变更权重,获取待评估训练模型对应的第二模型参数;The second parameter sub-module is used to obtain the second model parameter corresponding to the training model to be evaluated according to the initial model parameter and the second change weight corresponding to the training model to be evaluated;
第二函数子模块,用于基于待评估训练模型对应的第二模型参数和样本损失函数,获取与样本损失函数相对应的第二影响函数。The second function sub-module is used to obtain the second influence function corresponding to the sample loss function based on the second model parameter and the sample loss function corresponding to the training model to be evaluated.
进一步地,第二影响函数模块50还包括:Further, the second influencing function module 50 further includes:
第二参数处理子模块,用于基于黑塞向量乘积和预设迭代次数,对待评估训练模型对应的第二模型参数和样本损失函数,获取与样本损失函数相对应的第二影响函数。The second parameter processing sub-module is used to obtain the second influence function corresponding to the sample loss function based on the Hessian vector product and the preset number of iterations, the second model parameter and the sample loss function corresponding to the training model to be evaluated.
关于基于人工智能的样本评估装置的具体限定可以参见上文中对于基于人工智能的样本评估方法的限定,在此不再赘述。上述基于人工智能的样本评估装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中,也可以以软件形式存储于计算机设备中的存储器中,以便于处理器调用执行以上各个模块对应的操作。Regarding the specific limitations of the artificial intelligence-based sample evaluation device, please refer to the above limitations on the artificial intelligence-based sample evaluation method, which will not be repeated here. Each module in the above-mentioned artificial intelligence-based sample evaluation device can be implemented in whole or in part by software, hardware, and a combination thereof. The above-mentioned modules may be embedded in the form of hardware or independent of the processor in the computer equipment, or may be stored in the memory of the computer equipment in the form of software, so that the processor can call and execute the operations corresponding to the above-mentioned modules.
在一个实施例中,提供了一种计算机设备,该计算机设备可以是服务端,其内部结构图可以如图8所示。该计算机设备包括通过系统总线连接的处理器、存储器、网络接口和数据库。其中,该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器包括非易失性存储介质、内存储器。该非易失性存储介质存储有操作系统、计算机可读指令和数据库。该内存储器为非易失性存储介质中的操作系统和计算机可读指令的运行提供环境。该计算机设备的数据库用于样本评估。该计算机设备的网络接口用于与外部的终端通过网络连接通信。该计算机可读指令被处理器执行时以实现一种基于人工智能的样本评估方法。In one embodiment, a computer device is provided. The computer device may be a server, and its internal structure diagram may be as shown in FIG. 8. The computer equipment includes a processor, a memory, a network interface, and a database connected through a system bus. Among them, the processor of the computer device is used to provide calculation and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, computer readable instructions, and a database. The internal memory provides an environment for the operation of the operating system and computer-readable instructions in the non-volatile storage medium. The computer equipment database is used for sample evaluation. The network interface of the computer device is used to communicate with an external terminal through a network connection. The computer-readable instructions are executed by the processor to realize an artificial intelligence-based sample evaluation method.
在一个实施例中,提供了一个或多个存储有计算机可读指令的可读存储介质,所述计算机可读存储介质存储有计算机可读指令,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行计算机可读指令时实现上述实施例中基于人工智能的样本评估方法,例如步骤S10至步骤S60,为避免重复,这里不再赘述。或者,处理器执 行计算机可读指令时实现基于人工智能的样本评估装置这一实施例中的各模块/单元的功能,例如模块10至模块60,为避免重复,这里不再赘述。本实施例中的可读存储介质包括非易失性可读存储介质和易失性可读存储介质。In one embodiment, one or more readable storage media storing computer readable instructions are provided, the computer readable storage medium storing computer readable instructions, and the computer readable instructions are processed by one or more When the processor executes, the one or more processors execute the computer-readable instructions to implement the artificial intelligence-based sample evaluation method in the foregoing embodiment, such as steps S10 to S60. To avoid repetition, details are not described herein again. Or, when the processor executes the computer-readable instructions, the functions of the modules/units in the embodiment of the artificial intelligence-based sample evaluation device, such as modules 10 to 60, are implemented. To avoid repetition, details are not described herein again. The readable storage medium in this embodiment includes a non-volatile readable storage medium and a volatile readable storage medium.
在一实施例中,提供一计算机可读存储介质,该计算机可读存储介质上存储有计算机可读指令,该计算机可读指令被处理器执行时实现上述实施例中基于人工智能的样本评估方法,例如例如步骤S10至步骤S60,为避免重复,这里不再赘述。或者,该计算机可读指令被处理器执行时实现上述基于人工智能的样本评估装置这一实施例中的各模块/单元的功能,例如模块10至模块60,为避免重复,这里不再赘述。In one embodiment, a computer-readable storage medium is provided, and computer-readable instructions are stored on the computer-readable storage medium. When the computer-readable instructions are executed by a processor, the artificial intelligence-based sample evaluation method in the above-mentioned embodiment is implemented. For example, step S10 to step S60, in order to avoid repetition, the details are not repeated here. Alternatively, when the computer-readable instruction is executed by the processor, the function of each module/unit in the embodiment of the artificial intelligence-based sample evaluation device, such as module 10 to module 60, is realized. In order to avoid repetition, details are not described herein again.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机可读指令来指令相关的硬件来完成,所述的计算机可读指令可存储于一非易失性计算机可读取存储介质中,该计算机可读指令可存储于一非易失性可读存储介质也可以存储在易失性可读存储介质中,该计算机可读指令在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器(ROM)、可编程ROM(PROM)、电可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM(SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双数据率SDRAM(DDRSDRAM)、增强型SDRAM(ESDRAM)、同步链路(Synchlink)DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。A person of ordinary skill in the art can understand that all or part of the processes in the methods of the foregoing embodiments can be implemented by instructing relevant hardware through computer-readable instructions. The computer-readable instructions can be stored in a non-volatile computer. In a readable storage medium, the computer readable instruction may be stored in a non-volatile readable storage medium or may be stored in a volatile readable storage medium. When the computer readable instruction is executed, it may include The flow of the embodiment of each method. Wherein, any reference to memory, storage, database, or other media used in the embodiments provided in this application may include non-volatile and/or volatile memory. Non-volatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory may include random access memory (RAM) or external cache memory. As an illustration and not a limitation, RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Channel (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,仅以上述各功能单元、模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能单元、模块完成,即将所述装置的内部结构划分成不同的功能单元或模块,以完成以上描述的全部或者部分功能。Those skilled in the art can clearly understand that, for the convenience and conciseness of description, only the division of the above functional units and modules is used as an example. In practical applications, the above functions can be allocated to different functional units and modules as needed. Module completion, that is, the internal structure of the device is divided into different functional units or modules to complete all or part of the functions described above.
以上所述实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围,均应包含在本申请的保护范围之内。The above-mentioned embodiments are only used to illustrate the technical solutions of the present application, not to limit them; although the present application has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art should understand that it can still implement the foregoing The technical solutions recorded in the examples are modified, or some of the technical features are equivalently replaced; these modifications or replacements do not cause the essence of the corresponding technical solutions to deviate from the spirit and scope of the technical solutions of the embodiments of the application, and should be included in Within the scope of protection of this application.

Claims (20)

  1. 一种基于人工智能的样本评估方法,其中,包括:A sample evaluation method based on artificial intelligence, which includes:
    获取训练数据集,所述训练数据集包括N个测试训练样本,其中,N为正整数;Acquiring a training data set, the training data set including N test training samples, where N is a positive integer;
    采用待评估训练模型对N个所述测试训练样本进行训练,获取所述待评估训练模型对应的样本损失函数;Training the N test training samples with the training model to be evaluated, and obtain the sample loss function corresponding to the training model to be evaluated;
    对N个所述测试训练样本进行检测,选取K个目标训练样本,其中,K为正整数;Perform detection on the N test training samples, and select K target training samples, where K is a positive integer;
    将所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第一影响函数;Inputting N-K test training samples in the training data set excluding the K target training samples into the training model to be evaluated for training, and obtaining a first influence function corresponding to the sample loss function;
    对K个所述目标训练样本进行样本特征变更,获取K个更新特征样本,将K个所述更新特征样本和所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第二影响函数;Perform sample feature changes on the K target training samples, obtain K updated feature samples, and input the K updated feature samples and NK test training samples in the training data set except the K target training samples Training the training model to be evaluated to obtain a second influence function corresponding to the sample loss function;
    基于所述第一影响函数和所述第二影响函数,获取K个所述目标训练样本对所述待评估训练模型的样本影响结果。Based on the first influence function and the second influence function, obtain sample influence results of the K target training samples on the training model to be evaluated.
  2. 如权利要求1所述的基于人工智能的样本评估方法,其中,所述采用待评估训练模型对N个所述测试训练样本进行训练,获取所述待评估训练模型对应的样本损失函数,包括:The artificial intelligence-based sample evaluation method according to claim 1, wherein the training the N test training samples with the training model to be evaluated, and obtaining the sample loss function corresponding to the training model to be evaluated, comprises:
    采用待评估训练模型对N个所述测试训练样本进行训练,获取N个所述测试训练样本对应的输出预测值;Training the N test training samples with the training model to be evaluated, and obtain the output prediction values corresponding to the N test training samples;
    基于所述测试训练样本和所述输出预测值,获取所述样本损失函数。Obtain the sample loss function based on the test training sample and the output prediction value.
  3. 如权利要求1所述的基于人工智能的样本评估方法,其中,所述对N个所述测试训练样本进行检测,选取K个目标训练样本,包括:The method for sample evaluation based on artificial intelligence according to claim 1, wherein said detecting the N test training samples and selecting K target training samples comprises:
    获取N个所述测试训练样本对应的当前样本参数,判断所述当前样本参数是否满足筛选参数阈值;Acquiring current sample parameters corresponding to the N test training samples, and judging whether the current sample parameters meet the screening parameter threshold;
    若所述当前样本参数满足所述筛选参数阈值,则将所述测试训练样本确定为所述目标训练样本。If the current sample parameter meets the screening parameter threshold, the test training sample is determined as the target training sample.
  4. 如权利要求1所述的基于人工智能的样本评估方法,其中,所述将所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第一影响函数,包括:The artificial intelligence-based sample evaluation method according to claim 1, wherein said inputting NK test training samples in said training data set except K said target training samples into said training model to be evaluated for training, Obtaining the first influence function corresponding to the sample loss function includes:
    将所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取所述待评估训练模型的第一变更权重;Inputting N-K test training samples in the training data set excluding the K target training samples into the training model to be evaluated for training, and obtaining the first change weight of the training model to be evaluated;
    根据所述待评估训练模型对应的初始模型参数和所述第一变更权重,获取所述待评估训练模型的第一模型参数;Acquiring the first model parameter of the training model to be evaluated according to the initial model parameters corresponding to the training model to be evaluated and the first change weight;
    基于所述待评估训练模型对应的第一模型参数和所述样本损失函数,获取与所述样本损失函数相对应的第一影响函数。Based on the first model parameter corresponding to the training model to be evaluated and the sample loss function, a first influence function corresponding to the sample loss function is obtained.
  5. 如权利要求4所述的基于人工智能的样本评估方法,其中,所述基于所述待评估训练模型对应的第一模型参数和所述样本损失函数,获取与所述样本损失函数相对应的第一影响函数,包括:The artificial intelligence-based sample evaluation method according to claim 4, wherein the first model parameter corresponding to the training model to be evaluated and the sample loss function are obtained based on the first model parameter corresponding to the sample loss function. An influence function, including:
    基于黑塞向量乘积和预设迭代次数,对所述待评估训练模型对应的第一模型参数和所述样本损失函数进行处理,获取与所述样本损失函数相对应的第一影响函数。Based on the Hessian vector product and the preset number of iterations, the first model parameter corresponding to the training model to be evaluated and the sample loss function are processed to obtain the first influence function corresponding to the sample loss function.
  6. 如权利要求3所述的基于人工智能的样本评估方法,其中,所述将K个所述更新特征样本和所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第二影响函数,包括:The artificial intelligence-based sample evaluation method according to claim 3, wherein the K update feature samples and NK test training samples other than the K target training samples in the training data set are input into the training data set. The training model to be evaluated is trained to obtain the second influence function corresponding to the sample loss function, including:
    将K个所述更新特征样本和所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取所述待评估训练模型对应的第二变更权重;Input the K update feature samples and the NK test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the second training model corresponding to the training model to be evaluated. Change weight
    根据所述待评估训练模型对应的初始模型参数和所述第二变更权重,获取所述待评估训练模型对应的第二模型参数;Acquiring the second model parameter corresponding to the training model to be evaluated according to the initial model parameter corresponding to the training model to be evaluated and the second change weight;
    基于所述待评估训练模型对应的第二模型参数和所述样本损失函数,获取与所述样本损失函数相对应的第二影响函数。Based on the second model parameter corresponding to the training model to be evaluated and the sample loss function, a second influence function corresponding to the sample loss function is obtained.
  7. 如权利要求6所述的基于人工智能的样本评估方法,其中,所述基于所述待评估训练模型对应的第二模型参数和所述样本损失函数,获取与所述样本损失函数相对应的第二影响函数,包括:The artificial intelligence-based sample evaluation method of claim 6, wherein the second model parameter corresponding to the training model to be evaluated and the sample loss function are used to obtain the first sample loss function corresponding to the sample loss function. Two influence functions, including:
    基于黑塞向量乘积和预设迭代次数,对所述待评估训练模型对应的第二模型参数和所述样本损失函数,获取与所述样本损失函数相对应的第二影响函数。Based on the Hessian vector product and the preset number of iterations, for the second model parameter corresponding to the training model to be evaluated and the sample loss function, a second influence function corresponding to the sample loss function is obtained.
  8. 一种基于人工智能的样本评估装置,其中,包括:A sample evaluation device based on artificial intelligence, which includes:
    数据获取模块,用于获取训练数据集,所述训练数据集包括N个测试训练样本,其中,N为正整数;A data acquisition module for acquiring a training data set, the training data set includes N test training samples, where N is a positive integer;
    样本训练模块,用于采用待评估训练模型对N个所述测试训练样本进行训练,获取所述待评估训练模型对应的样本损失函数;The sample training module is configured to train the N test training samples by using the training model to be evaluated, and obtain the sample loss function corresponding to the training model to be evaluated;
    样本检测模块,用于对N个所述测试训练样本进行检测,选取K个目标训练样本,其中,K为正整数;The sample detection module is used to detect the N test training samples and select K target training samples, where K is a positive integer;
    第一影响函数模块,用于将所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第一影响函数;The first influence function module is configured to input NK test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the first loss function corresponding to the sample loss function. An influence function;
    第二影响函数模块,用于对K个所述目标训练样本进行样本特征变更,获取K个更新特征样本,将K个所述更新特征样本和所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第二影响函数;The second influence function module is used to change the sample characteristics of the K target training samples, obtain K updated characteristic samples, and divide the K updated characteristic samples and the training data set by the K target training samples NK outside test training samples are input into the training model to be evaluated for training, and a second influence function corresponding to the sample loss function is obtained;
    结果获取模块,用于基于所述第一影响函数和所述第二影响函数,获取K个所述目标 训练样本对所述待评估训练模型的样本影响结果。The result acquisition module is configured to acquire the sample influence results of the K target training samples on the training model to be evaluated based on the first influence function and the second influence function.
  9. 一种计算机设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,其中,所述处理器执行所述计算机可读指令时实现如下步骤:A computer device includes a memory, a processor, and computer-readable instructions that are stored in the memory and can run on the processor, wherein the processor implements the following steps when the processor executes the computer-readable instructions:
    获取训练数据集,所述训练数据集包括N个测试训练样本,其中,N为正整数;Acquiring a training data set, the training data set including N test training samples, where N is a positive integer;
    采用待评估训练模型对N个所述测试训练样本进行训练,获取所述待评估训练模型对应的样本损失函数;Training the N test training samples with the training model to be evaluated, and obtain the sample loss function corresponding to the training model to be evaluated;
    对N个所述测试训练样本进行检测,选取K个目标训练样本,其中,K为正整数;Perform detection on the N test training samples, and select K target training samples, where K is a positive integer;
    将所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第一影响函数;Inputting N-K test training samples in the training data set excluding the K target training samples into the training model to be evaluated for training, and obtaining a first influence function corresponding to the sample loss function;
    对K个所述目标训练样本进行样本特征变更,获取K个更新特征样本,将K个所述更新特征样本和所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第二影响函数;Perform sample feature changes on the K target training samples, obtain K updated feature samples, and input the K updated feature samples and NK test training samples in the training data set except the K target training samples Training the training model to be evaluated to obtain a second influence function corresponding to the sample loss function;
    基于所述第一影响函数和所述第二影响函数,获取K个所述目标训练样本对所述待评估训练模型的样本影响结果。Based on the first influence function and the second influence function, obtain sample influence results of the K target training samples on the training model to be evaluated.
  10. 如权利要求9所述的计算机设备,其中,所述采用待评估训练模型对N个所述测试训练样本进行训练,获取所述待评估训练模型对应的样本损失函数,包括:9. The computer device according to claim 9, wherein the training the N test training samples with the training model to be evaluated, and obtaining the sample loss function corresponding to the training model to be evaluated, comprises:
    采用待评估训练模型对N个所述测试训练样本进行训练,获取N个所述测试训练样本对应的输出预测值;Training the N test training samples with the training model to be evaluated, and obtain the output prediction values corresponding to the N test training samples;
    基于所述测试训练样本和所述输出预测值,获取所述样本损失函数。Obtain the sample loss function based on the test training sample and the output prediction value.
  11. 如权利要求9所述的计算机设备,其中,所述对N个所述测试训练样本进行检测,选取K个目标训练样本,包括:9. The computer device according to claim 9, wherein said detecting the N test training samples and selecting K target training samples comprises:
    获取N个所述测试训练样本对应的当前样本参数,判断所述当前样本参数是否满足筛选参数阈值;Acquiring current sample parameters corresponding to the N test training samples, and judging whether the current sample parameters meet the screening parameter threshold;
    若所述当前样本参数满足所述筛选参数阈值,则将所述测试训练样本确定为所述目标训练样本。If the current sample parameter meets the screening parameter threshold, the test training sample is determined as the target training sample.
  12. 如权利要求9所述的计算机设备,其中,所述将所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第一影响函数,包括:The computer device according to claim 9, wherein the NK test training samples except the K target training samples in the training data set are input into the training model to be evaluated for training, and the samples are obtained The first influence function corresponding to the loss function includes:
    将所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取所述待评估训练模型的第一变更权重;Inputting N-K test training samples in the training data set excluding the K target training samples into the training model to be evaluated for training, and obtaining the first change weight of the training model to be evaluated;
    根据所述待评估训练模型对应的初始模型参数和所述第一变更权重,获取所述待评估训练模型的第一模型参数;Acquiring the first model parameter of the training model to be evaluated according to the initial model parameters corresponding to the training model to be evaluated and the first change weight;
    基于所述待评估训练模型对应的第一模型参数和所述样本损失函数,获取与所述样本损失函数相对应的第一影响函数。Based on the first model parameter corresponding to the training model to be evaluated and the sample loss function, a first influence function corresponding to the sample loss function is obtained.
  13. 如权利要求12所述的计算机设备,其中,所述基于所述待评估训练模型对应的第一模型参数和所述样本损失函数,获取与所述样本损失函数相对应的第一影响函数,包括:The computer device according to claim 12, wherein the obtaining the first influence function corresponding to the sample loss function based on the first model parameter corresponding to the training model to be evaluated and the sample loss function comprises :
    基于黑塞向量乘积和预设迭代次数,对所述待评估训练模型对应的第一模型参数和所述样本损失函数进行处理,获取与所述样本损失函数相对应的第一影响函数。Based on the Hessian vector product and the preset number of iterations, the first model parameter corresponding to the training model to be evaluated and the sample loss function are processed to obtain the first influence function corresponding to the sample loss function.
  14. 如权利要求11所述的计算机设备,其中,所述将K个所述更新特征样本和所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第二影响函数,包括:The computer device according to claim 11, wherein the K update feature samples and NK test training samples in the training data set except the K target training samples are input into the training model to be evaluated Training to obtain the second influence function corresponding to the sample loss function includes:
    将K个所述更新特征样本和所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取所述待评估训练模型对应的第二变更权重;Input the K update feature samples and the NK test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the second training model corresponding to the training model to be evaluated. Change weight
    根据所述待评估训练模型对应的初始模型参数和所述第二变更权重,获取所述待评估训练模型对应的第二模型参数;Acquiring the second model parameter corresponding to the training model to be evaluated according to the initial model parameter corresponding to the training model to be evaluated and the second change weight;
    基于所述待评估训练模型对应的第二模型参数和所述样本损失函数,获取与所述样本损失函数相对应的第二影响函数。Based on the second model parameter corresponding to the training model to be evaluated and the sample loss function, a second influence function corresponding to the sample loss function is obtained.
  15. 一个或多个存储有计算机可读指令的可读存储介质,所述计算机可读存储介质存储有计算机可读指令,其中,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行如下步骤:One or more readable storage media storing computer readable instructions, the computer readable storage medium storing computer readable instructions, where the computer readable instructions when executed by one or more processors cause all The one or more processors execute the following steps:
    获取训练数据集,所述训练数据集包括N个测试训练样本,其中,N为正整数;Acquiring a training data set, the training data set including N test training samples, where N is a positive integer;
    采用待评估训练模型对N个所述测试训练样本进行训练,获取所述待评估训练模型对应的样本损失函数;Training the N test training samples with the training model to be evaluated, and obtain the sample loss function corresponding to the training model to be evaluated;
    对N个所述测试训练样本进行检测,选取K个目标训练样本,其中,K为正整数;Perform detection on the N test training samples, and select K target training samples, where K is a positive integer;
    将所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第一影响函数;Inputting N-K test training samples in the training data set excluding the K target training samples into the training model to be evaluated for training, and obtaining a first influence function corresponding to the sample loss function;
    对K个所述目标训练样本进行样本特征变更,获取K个更新特征样本,将K个所述更新特征样本和所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第二影响函数;Perform sample feature changes on the K target training samples, obtain K updated feature samples, and input the K updated feature samples and NK test training samples in the training data set except the K target training samples Training the training model to be evaluated to obtain a second influence function corresponding to the sample loss function;
    基于所述第一影响函数和所述第二影响函数,获取K个所述目标训练样本对所述待评估训练模型的样本影响结果。Based on the first influence function and the second influence function, obtain sample influence results of the K target training samples on the training model to be evaluated.
  16. 如权利要求15所述的可读存储介质,其中,所述采用待评估训练模型对N个所述测试训练样本进行训练,获取所述待评估训练模型对应的样本损失函数,包括:15. The readable storage medium according to claim 15, wherein the training the N test training samples using the training model to be evaluated and obtaining the sample loss function corresponding to the training model to be evaluated comprises:
    采用待评估训练模型对N个所述测试训练样本进行训练,获取N个所述测试训练样本对应的输出预测值;Training the N test training samples with the training model to be evaluated, and obtain the output prediction values corresponding to the N test training samples;
    基于所述测试训练样本和所述输出预测值,获取所述样本损失函数。Obtain the sample loss function based on the test training sample and the output prediction value.
  17. 如权利要求15所述的可读存储介质,其中,所述对N个所述测试训练样本进行检测,选取K个目标训练样本,包括:15. The readable storage medium according to claim 15, wherein said detecting the N test training samples and selecting K target training samples comprises:
    获取N个所述测试训练样本对应的当前样本参数,判断所述当前样本参数是否满足筛选参数阈值;Acquiring current sample parameters corresponding to the N test training samples, and judging whether the current sample parameters meet the screening parameter threshold;
    若所述当前样本参数满足所述筛选参数阈值,则将所述测试训练样本确定为所述目标训练样本。If the current sample parameter meets the screening parameter threshold, the test training sample is determined as the target training sample.
  18. 如权利要求15所述的可读存储介质,其中,所述将所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第一影响函数,包括:The readable storage medium according to claim 15, wherein the NK test training samples except the K target training samples in the training data set are input into the training model to be evaluated for training, and the training data is obtained The first influence function corresponding to the sample loss function includes:
    将所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取所述待评估训练模型的第一变更权重;Inputting N-K test training samples in the training data set excluding the K target training samples into the training model to be evaluated for training, and obtaining the first change weight of the training model to be evaluated;
    根据所述待评估训练模型对应的初始模型参数和所述第一变更权重,获取所述待评估训练模型的第一模型参数;Acquiring the first model parameter of the training model to be evaluated according to the initial model parameters corresponding to the training model to be evaluated and the first change weight;
    基于所述待评估训练模型对应的第一模型参数和所述样本损失函数,获取与所述样本损失函数相对应的第一影响函数。Based on the first model parameter corresponding to the training model to be evaluated and the sample loss function, a first influence function corresponding to the sample loss function is obtained.
  19. 如权利要求18所述的可读存储介质,其中,所述基于所述待评估训练模型对应的第一模型参数和所述样本损失函数,获取与所述样本损失函数相对应的第一影响函数,包括:The readable storage medium of claim 18, wherein the first model parameter corresponding to the training model to be evaluated and the sample loss function are used to obtain a first influence function corresponding to the sample loss function ,include:
    基于黑塞向量乘积和预设迭代次数,对所述待评估训练模型对应的第一模型参数和所述样本损失函数进行处理,获取与所述样本损失函数相对应的第一影响函数。Based on the Hessian vector product and the preset number of iterations, the first model parameter corresponding to the training model to be evaluated and the sample loss function are processed to obtain the first influence function corresponding to the sample loss function.
  20. 如权利要求17所述的可读存储介质,其中,所述将K个所述更新特征样本和所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取与所述样本损失函数相对应的第二影响函数,包括:The readable storage medium according to claim 17, wherein the K pieces of the updated feature samples and the NK pieces of test training samples in the training data set excluding the K pieces of the target training samples are input into the to-be-evaluated The training model is trained to obtain the second influence function corresponding to the sample loss function, including:
    将K个所述更新特征样本和所述训练数据集中除K个所述目标训练样本外的N-K个测试训练样本输入所述待评估训练模型进行训练,获取所述待评估训练模型对应的第二变更权重;Input the K update feature samples and the NK test training samples in the training data set except the K target training samples into the training model to be evaluated for training, and obtain the second training model corresponding to the training model to be evaluated. Change weight
    根据所述待评估训练模型对应的初始模型参数和所述第二变更权重,获取所述待评估训练模型对应的第二模型参数;Acquiring the second model parameter corresponding to the training model to be evaluated according to the initial model parameter corresponding to the training model to be evaluated and the second change weight;
    基于所述待评估训练模型对应的第二模型参数和所述样本损失函数,获取与所述样本损失函数相对应的第二影响函数。Based on the second model parameter corresponding to the training model to be evaluated and the sample loss function, a second influence function corresponding to the sample loss function is obtained.
PCT/CN2020/135339 2020-06-08 2020-12-10 Artificial intelligence-based sample evaluation method, apparatus, device, and storage medium WO2021121128A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010514014.0 2020-06-08
CN202010514014.0A CN111667010A (en) 2020-06-08 2020-06-08 Sample evaluation method, device and equipment based on artificial intelligence and storage medium

Publications (1)

Publication Number Publication Date
WO2021121128A1 true WO2021121128A1 (en) 2021-06-24

Family

ID=72385723

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/135339 WO2021121128A1 (en) 2020-06-08 2020-12-10 Artificial intelligence-based sample evaluation method, apparatus, device, and storage medium

Country Status (2)

Country Link
CN (1) CN111667010A (en)
WO (1) WO2021121128A1 (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113723835A (en) * 2021-09-02 2021-11-30 国网河北省电力有限公司电力科学研究院 Thermal power plant water use evaluation method and terminal equipment
CN115935761A (en) * 2023-01-10 2023-04-07 中国电子产品可靠性与环境试验研究所((工业和信息化部电子第五研究所)(中国赛宝实验室)) Method and device for simulating reliability of equipment, computer equipment and storage medium
CN116070469A (en) * 2023-04-04 2023-05-05 合肥通用机械研究院有限公司 Equipment operation parameter optimization method and risk management method
CN116388884A (en) * 2023-06-05 2023-07-04 浙江大学 Method, system and device for designing anti-eavesdrop ultrasonic interference sample
CN116842853A (en) * 2023-09-01 2023-10-03 中国空气动力研究与发展中心计算空气动力研究所 Missile aerodynamic characteristic prediction model construction method for uncertainty quantization
CN116911662A (en) * 2023-06-25 2023-10-20 施勒智能科技(上海)股份有限公司 Air quality optimization method and system based on artificial intelligence
CN116957052A (en) * 2023-06-09 2023-10-27 清华大学 Training method of risk assessment model based on similarity of construction samples
CN117057027A (en) * 2023-10-13 2023-11-14 江苏通创现代建筑产业技术研究院有限公司 Wall heat preservation and insulation performance evaluation method and system
CN117170991A (en) * 2023-07-21 2023-12-05 国网信息通信产业集团有限公司 Verification method based on artificial intelligent platform service interface
CN117315466A (en) * 2023-09-20 2023-12-29 北京佳格天地科技有限公司 Growth monitoring management method and system

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111667010A (en) * 2020-06-08 2020-09-15 平安科技(深圳)有限公司 Sample evaluation method, device and equipment based on artificial intelligence and storage medium
CN112528159B (en) * 2020-12-24 2024-03-26 北京百度网讯科技有限公司 Feature quality assessment method and device, electronic equipment and storage medium
CN113066549B (en) * 2021-04-06 2022-07-26 青岛瑞斯凯尔生物科技有限公司 Clinical effectiveness evaluation method and system of medical instrument based on artificial intelligence

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150153476A1 (en) * 2012-01-12 2015-06-04 Schlumberger Technology Corporation Method for constrained history matching coupled with optimization
CN109859039A (en) * 2018-12-28 2019-06-07 北京邮电大学 A kind of prediction technique and device
CN110363243A (en) * 2019-07-12 2019-10-22 腾讯科技(深圳)有限公司 The appraisal procedure and device of disaggregated model
CN111667010A (en) * 2020-06-08 2020-09-15 平安科技(深圳)有限公司 Sample evaluation method, device and equipment based on artificial intelligence and storage medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150153476A1 (en) * 2012-01-12 2015-06-04 Schlumberger Technology Corporation Method for constrained history matching coupled with optimization
CN109859039A (en) * 2018-12-28 2019-06-07 北京邮电大学 A kind of prediction technique and device
CN110363243A (en) * 2019-07-12 2019-10-22 腾讯科技(深圳)有限公司 The appraisal procedure and device of disaggregated model
CN111667010A (en) * 2020-06-08 2020-09-15 平安科技(深圳)有限公司 Sample evaluation method, device and equipment based on artificial intelligence and storage medium

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113723835B (en) * 2021-09-02 2024-02-06 国网河北省电力有限公司电力科学研究院 Water consumption evaluation method and terminal equipment for thermal power plant
CN113723835A (en) * 2021-09-02 2021-11-30 国网河北省电力有限公司电力科学研究院 Thermal power plant water use evaluation method and terminal equipment
CN115935761A (en) * 2023-01-10 2023-04-07 中国电子产品可靠性与环境试验研究所((工业和信息化部电子第五研究所)(中国赛宝实验室)) Method and device for simulating reliability of equipment, computer equipment and storage medium
CN115935761B (en) * 2023-01-10 2023-06-20 中国电子产品可靠性与环境试验研究所((工业和信息化部电子第五研究所)(中国赛宝实验室)) Reliability simulation method and device for equipment, computer equipment and storage medium
CN116070469A (en) * 2023-04-04 2023-05-05 合肥通用机械研究院有限公司 Equipment operation parameter optimization method and risk management method
CN116388884B (en) * 2023-06-05 2023-10-20 浙江大学 Method, system and device for designing anti-eavesdrop ultrasonic interference sample
CN116388884A (en) * 2023-06-05 2023-07-04 浙江大学 Method, system and device for designing anti-eavesdrop ultrasonic interference sample
CN116957052A (en) * 2023-06-09 2023-10-27 清华大学 Training method of risk assessment model based on similarity of construction samples
CN116957052B (en) * 2023-06-09 2024-06-04 清华大学 Training method of risk assessment model based on similarity of construction samples
CN116911662A (en) * 2023-06-25 2023-10-20 施勒智能科技(上海)股份有限公司 Air quality optimization method and system based on artificial intelligence
CN116911662B (en) * 2023-06-25 2024-04-16 施勒智能科技(上海)股份有限公司 Air quality optimization method and system based on artificial intelligence
CN117170991A (en) * 2023-07-21 2023-12-05 国网信息通信产业集团有限公司 Verification method based on artificial intelligent platform service interface
CN116842853B (en) * 2023-09-01 2023-11-28 中国空气动力研究与发展中心计算空气动力研究所 Missile aerodynamic characteristic prediction model construction method for uncertainty quantization
CN116842853A (en) * 2023-09-01 2023-10-03 中国空气动力研究与发展中心计算空气动力研究所 Missile aerodynamic characteristic prediction model construction method for uncertainty quantization
CN117315466A (en) * 2023-09-20 2023-12-29 北京佳格天地科技有限公司 Growth monitoring management method and system
CN117315466B (en) * 2023-09-20 2024-04-09 北京佳格天地科技有限公司 Growth monitoring management method and system
CN117057027A (en) * 2023-10-13 2023-11-14 江苏通创现代建筑产业技术研究院有限公司 Wall heat preservation and insulation performance evaluation method and system
CN117057027B (en) * 2023-10-13 2023-12-22 江苏通创现代建筑产业技术研究院有限公司 Wall heat preservation and insulation performance evaluation method and system

Also Published As

Publication number Publication date
CN111667010A (en) 2020-09-15

Similar Documents

Publication Publication Date Title
WO2021121128A1 (en) Artificial intelligence-based sample evaluation method, apparatus, device, and storage medium
CN111950329B (en) Target detection and model training method, device, computer equipment and storage medium
JP6312630B2 (en) How to detect anomalies in time series data
US20190392252A1 (en) Systems and methods for selecting a forecast model for analyzing time series data
KR102219346B1 (en) Systems and methods for performing bayesian optimization
US11561954B2 (en) Method and system to estimate the cardinality of sets and set operation results from single and multiple HyperLogLog sketches
US11030246B2 (en) Fast and accurate graphlet estimation
US20140032450A1 (en) Classifying unclassified samples
JP2017224027A (en) Machine learning method related to data labeling model, computer and program
CN115376518B (en) Voiceprint recognition method, system, equipment and medium for real-time noise big data
US20220092407A1 (en) Transfer learning with machine learning systems
WO2021159748A1 (en) Model compression method and apparatus, computer device, and storage medium
CN115905450A (en) Unmanned aerial vehicle monitoring-based water quality abnormity tracing method and system
CN111160394A (en) Training method and device of classification network, computer equipment and storage medium
CN114169460A (en) Sample screening method, sample screening device, computer equipment and storage medium
JP2023162140A (en) Versatile anomaly detection system for industrial systems
Colgan et al. Architectural optimization and feature learning for high-dimensional time series datasets
US20230259756A1 (en) Graph explainable artificial intelligence correlation
Huo et al. Traffic anomaly detection method based on improved GRU and EFMS-Kmeans clustering
CN112905166B (en) Artificial intelligence programming system, computer device, and computer-readable storage medium
JP2019160240A (en) Information processing device and information processing method
CN114398228A (en) Method and device for predicting equipment resource use condition and electronic equipment
CN114880363A (en) Data center flow prediction system, training method and prediction method
CN110008100B (en) Method and device for detecting abnormal access volume of web page
CN112420135A (en) Virtual sample generation method based on sample method and quantile regression

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20903797

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20903797

Country of ref document: EP

Kind code of ref document: A1