WO2021139465A1 - Dispositif et procédé de sélection de modèle arrière, et support d'enregistrement lisible - Google Patents

Dispositif et procédé de sélection de modèle arrière, et support d'enregistrement lisible Download PDF

Info

Publication number
WO2021139465A1
WO2021139465A1 PCT/CN2020/134736 CN2020134736W WO2021139465A1 WO 2021139465 A1 WO2021139465 A1 WO 2021139465A1 CN 2020134736 W CN2020134736 W CN 2020134736W WO 2021139465 A1 WO2021139465 A1 WO 2021139465A1
Authority
WO
WIPO (PCT)
Prior art keywords
model
trained
features
training
feature
Prior art date
Application number
PCT/CN2020/134736
Other languages
English (en)
Chinese (zh)
Inventor
唐兴兴
黄启军
陈瑞钦
林冰垠
李诗琦
Original Assignee
深圳前海微众银行股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳前海微众银行股份有限公司 filed Critical 深圳前海微众银行股份有限公司
Publication of WO2021139465A1 publication Critical patent/WO2021139465A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning

Definitions

  • This application relates to the artificial intelligence technology field of Fintech, and in particular to a backward model selection method, device and readable storage medium.
  • the backward selection mode is an important model selection strategy. Compared with all the features added to the model training, it can effectively prevent the model from overfitting.
  • the current backward selection mode usually requires the modeler to have High code development capabilities, and can only be implemented in a single machine, that is, the current implementation of the backward selection mode has higher threshold requirements for modelers, and because it can only be implemented in a single machine, it leads to the backward selection mode.
  • the modeling time is long and the modeling efficiency is low. Therefore, the prior art has the technical problems of high modeling threshold and low efficiency of the backward selection mode.
  • the main purpose of this application is to provide a backward model selection method, device and readable storage medium, aiming to solve the technical problems of high modeling threshold and low efficiency of backward selection mode in the prior art.
  • the present application provides a backward model selection method, the backward model selection method is applied to the server, and the backward model selection method includes:
  • each of the features to be trained calculates the first saliency corresponding to each of the features to be trained, and based on each of the first saliency, eliminate the features to be removed that meet the preset saliency requirements for removal from the features to be trained, so as to be based on the removed features.
  • Each of the features to be trained performs cyclic training on the first initial training model to obtain a cyclic training model set;
  • the cyclic training model set includes one or more model elements, and each of the model elements includes a second initial training model,
  • the feature to be removed that meets the preset removal saliency requirement is removed from the features to be trained, so as to compare the first initial saliency based on the removed features to be trained.
  • the training model performs cyclic training, and the steps to obtain the cyclic training model set include:
  • the second initial training model is cyclically trained to obtain one or more of the model elements until the feature to be removed does not exist in each of the features to be trained.
  • the step of selecting the feature to be removed from the features to be trained based on each of the first saliency and the preset removal saliency requirement includes:
  • the target significance is less than the preset rejection significance threshold, it is determined that the target feature meets the preset rejection significance requirement, and the target feature is taken as the feature to be rejected.
  • the step of calculating the first saliency corresponding to each of the features to be trained includes:
  • each of the first saliences is calculated.
  • the configuration parameter includes a training completion determination condition, and the feature to be trained includes one or more pieces of feature data;
  • the step of training a preset model to be trained based on each of the features to be trained and the configuration parameters, and obtaining a first initial training model includes:
  • the updated preset to-be-trained model does not meet the training completion judgment condition, continue to perform iterative training updates on the preset to-be-trained model until the updated preset to-train model satisfies the training Complete the judgment condition.
  • the step of selecting a target training model from the first initial training model and the cyclic training model set based on the configuration parameters includes:
  • model selection strategy includes AUC (Area Under Curve, the area under the receiver operating characteristic curve and the coordinate axis) value and AIC (Akaike information criterion, Akaike information) Quantity criterion) value;
  • model selection strategy is the AUC value
  • the model selection strategy is the AIC value
  • the AIC values of the elements in the cyclic training model set are compared, and the element corresponding to the smallest AIC value is selected as the target training model.
  • the client includes a visual interface
  • the step of generating visualization data corresponding to the target training model and feeding back the visualization data to the client includes:
  • the present application also provides a backward model selection method.
  • the backward model selection method is applied to the client, and the backward model selection method includes:
  • Receive a model selection task and send the configuration parameters corresponding to the model selection task to the server associated with the client, so that the server performs model selection based on the configuration parameters and the acquired features to be trained to obtain A target training model, and obtaining visualization data corresponding to the target training model, so as to send the visualization data to the client;
  • the visualization data fed back by the server is received, and the visualization data is displayed on a preset visualization interface.
  • the present application also provides a backward model selection device, which is applied to a backward model selection device, and the backward model selection device includes:
  • the first training module is configured to receive the configuration parameters sent by the client associated with the server and obtain the features to be trained, and train a preset model to be trained based on each of the features to be trained and the configuration parameters , To obtain the first initial training model;
  • the second training module is used to calculate the first saliency corresponding to each of the features to be trained, and based on each of the first saliences, to remove the features that meet the preset removal saliency requirements from the features to be trained The features to be eliminated, to perform cyclic training on the first initial training model based on each of the features to be trained after culling, to obtain a cyclic training model set;
  • a selection module for selecting a target training model from the first initial training model and a set of cyclic training models based on the configuration parameters
  • the feedback module is used for generating the visualization data corresponding to the target training model, and feeding back the visualization data to the client.
  • the second training module includes:
  • the first culling sub-module is configured to select the feature to be removed among the features to be trained based on each of the first saliency and the preset saliency removal requirement, and to remove the feature to be removed ;
  • a training sub-module configured to train the first initial training model based on the eliminated features to be trained to obtain the second initial training model
  • the second culling sub-module is used to calculate the second saliency of each feature to be trained after being removed, and based on each of the second saliency, remove the coincidence again from each feature to be trained after being removed Other features to be removed that are required to be removed by the preset saliency;
  • the cyclic training sub-module is used to perform cyclic training on the second initial training model based on each of the features to be trained after being removed again, to obtain one or more of the model elements, until each feature to be trained The feature to be removed does not exist in.
  • the selection submodule includes:
  • the first comparison unit is configured to compare each of the first saliency, and select the feature with the lowest saliency among the features to be trained as the target feature;
  • the second comparison unit is used to compare the target significance of the target feature with a preset significance threshold for rejection
  • the determining unit is configured to determine that if the target significance is less than the preset rejection significance threshold, determine that the target feature meets the preset rejection significance requirement, and use the target feature as the pending Remove features.
  • the second training module further includes:
  • the first calculation sub-module is used to calculate the chi-square value wald of each of the features to be trained
  • the second calculation sub-module is used for calculating each of the first saliency based on each of the chi-square value wald and the degrees of freedom of each of the features to be trained.
  • the first training module includes:
  • a training update sub-module for inputting the feature data corresponding to each of the features to be trained into the preset model to be trained, so as to train and update the preset model to be trained;
  • the first judging sub-module is used to judge whether the updated preset model to be trained satisfies the training completion judging condition, and if the updated preset to be trained model satisfies the training completion judging condition, then Obtaining the first initial training model;
  • the second judgment sub-module is configured to continue to perform iterative training updates on the preset to-be-trained model if the updated preset to-be-trained model does not satisfy the training completion judgment condition until the updated all-in-one model The preset model to be trained satisfies the training completion judgment condition.
  • the selection module includes:
  • the first obtaining sub-module is configured to obtain the model selection strategy in the parameter configuration, wherein the model selection strategy includes an AUC value and an AIC value;
  • the first comparison sub-module is configured to compare the AUC value of each element in the cyclic training model set if the model selection strategy is the AUC value to select the largest corresponding AUC value As the target training model;
  • the second comparison sub-module is used to compare the AIC value of each element in the cyclic training model set if the model selection strategy is the AIC value to select the smallest corresponding AIC value As the target training model.
  • the feedback module includes:
  • the second acquisition sub-module is used to acquire the candidate feature data, selection summary data, and training process data corresponding to the backward model selection process of the target training model;
  • a generating sub-module is used to generate the visualization data corresponding to the candidate feature data, the selection summary data, and the training process data, and feed back the visualization data to the visualization interface in real time.
  • the present application also provides a backward model selection device.
  • the backward model selection device is applied to a client, and the backward selection device includes:
  • the sending module is configured to receive the model selection task, and send the configuration parameters corresponding to the model selection task to the server associated with the client, so that the server can use the configuration parameters and the acquired configuration parameters.
  • model selection on training features obtaining a target training model, and obtaining visualization data corresponding to the target training model, so as to send the visualization data to the client;
  • the receiving module is configured to receive the visualization data fed back by the server, and display the visualization data on a preset visualization interface.
  • the present application also provides a backward model selection device.
  • the backward model selection device includes a memory, a processor, and a device for the backward model selection method that is stored on the memory and can run on the processor.
  • a program when the program of the backward model selection method is executed by a processor, the steps of the backward model selection method as described above can be realized.
  • the present application also provides a readable storage medium, the readable storage medium stores a program for implementing the backward model selection method, and when the program for the backward model selection method is executed by a processor, the backward model as described above is implemented Select the steps of the method.
  • This application receives the configuration parameters sent by the client associated with the server and obtains the features to be trained, and trains a preset model to be trained based on each of the features to be trained and the configuration parameters to obtain the first initial training
  • the model further calculates the first saliency corresponding to each of the features to be trained, and based on each of the first salience, removes the features to be removed that meet the preset removal saliency requirements from the features to be trained, and then based on
  • the first initial training model is cyclically trained to obtain a cyclic training model set, and then based on the configuration parameters, from the first initial training model and the cyclic training model set
  • the target training model is selected, and then the visualization data corresponding to the target training model is generated, and the visualization data is fed back to the client.
  • this application first sends the configuration parameters sent by the client associated with the server and acquires the features to be trained, and based on each of the features to be trained and the configuration parameters, performs a comparison of the preset model to be trained Training, obtain the first initial training model, and then perform the calculation of the first saliency corresponding to each of the features to be trained, and then based on each of the first saliency, the elimination of the features to be trained meets the preset elimination
  • the features to be removed with the saliency requirement are further based on the removed features to be trained, the cyclic training of the first initial training model is performed to obtain the cyclic training model set, and then based on the configuration parameters, from the first
  • a target training model is selected from an initial training model and a cyclic training model set, and then the visualization data corresponding to the target training model is generated, and the visualization data is fed back to the client.
  • this application provides a model selection method of backward selection mode of codeless distributed modeling and visual modeling.
  • the user only needs to set and send the necessary configuration parameters to the server through the client, and the server is It can feed back the visual data and the result of the backward model selection process corresponding to the corresponding backward model selection process, that is, through the communication connection between the client and the server for model modeling, distributed modeling is realized, and compared with a single machine
  • the modeling of the backward selection mode performed improves the modeling efficiency of the backward selection mode.
  • the visualization modeling is realized and the construction is reduced.
  • the ability threshold of the model personnel is required and the modeling efficiency of the backward selection mode is further improved.
  • the user only needs to enter the necessary model parameters in the visual interface of the client to obtain the corresponding backward model selection results.
  • code development ability which realizes no-code modeling, and further reduces the requirement for the ability threshold of modelers. Therefore, it solves the technology of high modeling threshold and low efficiency of backward selection mode in the existing technology. problem.
  • FIG. 1 is a schematic flowchart of the first embodiment of the backward model selection method of this application
  • FIG. 2 is a schematic diagram of a visual interface for configuring the parameters in the backward model selection method of this application;
  • FIG. 3 is a schematic flowchart of a second embodiment of the backward model selection method of this application.
  • FIG. 4 is a schematic diagram of the process of performing backward model selection in combination with the first embodiment in the second embodiment of the backward model selection method of this application;
  • FIG. 5 is a schematic flowchart of a third embodiment of a backward model selection method according to this application.
  • FIG. 6 is a schematic diagram of the device structure of the hardware operating environment involved in the solution of the embodiment of the application.
  • the embodiment of the present application provides a method for selecting a backward model.
  • the method for selecting a backward model is applied to the server.
  • the backward model selection is Methods include:
  • Step S10 receiving configuration parameters sent by the client associated with the server and acquiring features to be trained, and training a preset model to be trained based on each of the features to be trained and the configuration parameters to obtain a first initial training model.
  • the client includes a visualization interface, and the user can configure parameters of a preset model to be trained on the visualization interface for model training, as shown in FIG.
  • the parameters such as the maximum iteration coefficient, minimum convergence error, and category weight are all parameters that need to be set before model training.
  • the backward model selection mode includes backward selection mode and stepwise selection mode.
  • the feature to be trained includes one or more features, and each feature includes one to obtain multiple pieces of feature data.
  • the preset model to be trained includes a logistic regression model.
  • the configuration parameters sent by the client associated with the server are received and the features to be trained are acquired, and a preset model to be trained is trained based on each of the features to be trained and the configuration parameters to obtain a first initial training model.
  • the configuration parameters sent by the client are received, and training completion judgment conditions are extracted from the configuration parameters, and then each feature to be trained is obtained from the local database of the backward model selection server, and each feature to be trained is
  • the feature data corresponding to the feature is input into the preset to-be-trained model to perform iterative training updates on the preset to-be-trained model, until the preset to-be-trained model reaches the preset training completion judgment condition, then the iterative training is completed, and
  • the updated preset model to be trained that is, the first initial training model is obtained, wherein the preset training completion judgment condition includes reaching the minimum convergence error, reaching the maximum number of iterations, and so on.
  • the configuration parameters include training completion judgment conditions, and the features to be trained include one or more pieces of feature data;
  • the step of training a preset model to be trained based on each of the features to be trained and the configuration parameters, and obtaining a first initial training model includes:
  • Step S11 input the feature data corresponding to each of the features to be trained into the preset model to be trained, so as to train and update the preset model to be trained;
  • the preset model to be trained is updated once, wherein the preset model to be trained is trained and updated The gradient descent method and so on.
  • Step S12 Determine whether the updated preset to-be-trained model satisfies the training completion judgment condition, and if the updated preset to-be-trained model satisfies the training completion judgment condition, obtain the first initial training model;
  • the training completion judgment condition includes reaching the minimum convergence error, reaching the maximum number of iterations, and so on.
  • the first initial training model is obtained, specifically To determine whether the updated preset to-be-trained model satisfies the training completion judgment condition, and if the updated preset to-be-trained model satisfies the training completion judgment condition, the updated model obtained in this training
  • the preset model to be trained is used as the first initial training model, that is, the first initial training model is obtained.
  • Step S13 If the updated preset to-be-trained model does not meet the training completion judgment condition, continue to perform iterative training updates on the preset to-be-trained model until the updated preset to-be-trained model satisfies The training completion judgment condition.
  • the updated preset to-be-trained model does not meet the training completion determination condition, then iterative training and update of the preset to-be-trained model continues until the updated preset to-be-trained model The training model satisfies the training completion judgment condition. Specifically, if the updated preset model to be trained does not satisfy the training completion judgment condition, it indicates that the updated preset model to be trained obtained in this training Cannot be used as the first initial training model, and then input the feature data corresponding to each of the features to be trained into the updated preset model to be trained, so as to perform iterative training updates on the preset model to be trained, Until the updated preset to-be-trained model satisfies the training completion judgment condition.
  • Step S20 Calculate the first saliency corresponding to each of the features to be trained, and based on each of the first saliency, remove the features to be removed that meet the preset saliency removal requirements from the features to be trained, so as to be based on After removing each of the features to be trained, performing cyclic training on the first initial training model to obtain a cyclic training model set;
  • the first saliency corresponding to each of the features to be trained is calculated, and based on each of the first saliency, the features to be removed that meet the preset removal saliency requirements are eliminated from the features to be trained ,
  • To perform cyclic training on the first initial training model based on the removed features to be trained to obtain a cyclic training model set specifically, based on each of the features to be trained and the features corresponding to each of the features to be trained
  • the chi-square value wald of each feature to be trained is calculated by the preset chi-square value wald calculation formula, and then based on each chi-square value wald and the degrees of freedom of each feature to be trained, the corresponding to each feature to be trained is calculated
  • the first saliency of, and then based on each of the first saliency find and remove the feature to be removed in each of the features to be trained, and then based on the feature to be trained after removing the feature to be removed, the The first initial training model is re
  • step S20 the step of calculating the first saliency corresponding to each of the features to be trained includes:
  • Step S21 Calculate the chi-square value wald of each of the features to be trained
  • the chi-square value wald of each feature to be trained is calculated, specifically, the feature data representation matrix corresponding to each feature to be trained is substituted into the preset chi-square value wald calculation formula, and each of the features is calculated in parallel.
  • S is the chi-square value wald
  • X includes n pieces of data
  • each piece of data includes k values
  • X can be represented by a feature data representation matrix
  • the feature data indicates that each column of the matrix is a piece of data and corresponds to the feature to be trained, and the model parameter obtained by training the preset model to be trained corresponding to X is ⁇ , where ⁇ is a k-dimensional vector ( ⁇ 1 , ⁇ 2 , ..., ⁇ k-1 , ⁇ k ), and the feature set X to be trained can be divided into a first model feature set and a second model feature set, wherein the feature corresponding to the first model feature set
  • the data representation matrix is X0
  • the feature data representation matrix corresponding to the second model feature set is X1
  • X 0 includes n pieces of data
  • each piece of data includes (kt) values
  • X 0 trains the preset model to be trained
  • the model parameter obtained is ⁇ 0 , where ⁇ 0 is a (kt)-dimensional vector ( ⁇ 1 , ⁇ 2 ,..., ⁇ kt ), X 1 includes n pieces of data, and each piece of data
  • the non-saliency features refer to the features of the features to be trained that are significantly less than a preset significance threshold , wherein the saliency can be obtained based on the chi-square value wald and the degree of freedom of the feature to be trained, wherein the degree of freedom is related to the value of the feature to be trained, for example, suppose the feature to be trained Including bank deposits, credit card consumption records, and loan records, then the feature to be trained includes 3 variables, and the degree of freedom is 2.
  • Step S22 Calculate each of the first saliency based on each of the chi-square value wald and the degrees of freedom of each of the features to be trained.
  • the first significance can be determined based on the Pearson correlation value, and when the Pearson correlation value is less than or equal to the preset Pearson correlation threshold, the determination is The feature corresponding to the first saliency does not meet the preset saliency removal requirement, that is, the feature corresponding to the first saliency appears to be significant, when the Pearson correlation value is greater than the preset Pearson correlation threshold
  • the degree of freedom corresponds to the number of feature data corresponding to the feature Correlation, for example, assuming that there are 100 different pieces of data in the feature data, the degree of freedom is 99.
  • the Pearson correlation value of each feature to be trained is calculated by a preset Pearson correlation value calculation formula, and then the significance of each feature to be trained is calculated by each Pearson correlation value, for example, assuming that each The Pearson correlation values are 0.0001, 0.01, and 0.05, respectively, and the corresponding measurement values for determining each of the significance are 100, 1, and 0.2. The larger the measurement value, the more significant the significance.
  • Step S30 based on the configuration parameters, select a target training model from the first initial training model and the cyclic training model set;
  • the configuration parameters include a model selection strategy.
  • a target training model is selected from the first initial training model and the cyclic training model set. Specifically, based on the model selection strategy, the first initial training model and the cyclic training model From each element of the training model set, a model that best meets the model selection strategy is selected as the target training model.
  • the step of selecting a target training model from the first initial training model and cyclic training model set based on the configuration parameters includes:
  • Step S31 Obtain a model selection strategy in the parameter configuration, where the model selection strategy includes an AUC value and an AIC value;
  • the AUC value is the criterion for evaluating the training model, and the larger the AUC value is, the better the training model is.
  • the AUC value is the area enclosed by the coordinate axis under the ROC (receiver operating characteristic curve) curve, and the value of this area will not be greater than 1, where the ROC curve is based on a A series of different binary classification methods (cutoff value or decision threshold), the true positive rate (sensitivity) is the ordinate, the false positive rate (1-specificity) is the curve drawn on the abscissa, the AIC value is calculated based on the AIC criterion Among them, the AIC criterion is a standard for measuring the goodness of the statistical model.
  • Step A32 if the model selection strategy is the AUC value, compare the AUC values of the elements in the cyclic training model set, and select the element corresponding to the largest AUC value as the target training model .
  • the model selection strategy is the AUC value
  • the AUC values of the elements in the cyclic training model set are compared, and the element corresponding to the largest AUC value is selected as the The target training model, specifically, if the model selection strategy is the AUC value, compare the AUC values to obtain the maximum AUC value, and use the training model corresponding to the maximum AUC value as the target training A model, wherein the training model includes a first initial training model and each element in the cyclic training model set.
  • Step S33 if the model selection strategy is the AIC value, compare the AIC values of the elements in the cyclic training model set, and select the element corresponding to the smallest AIC value as the target training model .
  • the model selection strategy is the AIC value
  • the AIC value of each element in the cyclic training model set is compared, and the element corresponding to the smallest AIC value is selected as the The target training model, specifically, if the model selection strategy is the AIC value, the AIC values are compared to obtain the minimum AIC value, and the training model corresponding to the minimum AIC value is used as the target training A model, wherein the training model includes a first initial training model and each element in the cyclic training model set.
  • Step S40 Generate visualization data corresponding to the target training model, and feed back the visualization data to the client.
  • the visualization data includes candidate feature visualization data, model selection summary visualization data, and training process visualization data, where the candidate feature is a feature in the feature set to be trained
  • the model selection summary data includes summary data for model selection of the first initial training model and the model elements in the cyclic training model set.
  • Generate visualization data corresponding to the target training model and feed back the visualization data to the client, specifically, generate visualization data corresponding to the acquisition process corresponding to the target training model, wherein the acquisition process includes features Selection process, model training process, model selection process, etc., and then feedback the visualization data to the visualization interface of the client for display to the customer, wherein the feature selection process is the process of selecting features in the feature set to be trained
  • the model training process is a process of training a target model, wherein the target model includes a preset model to be trained, a first initial training model, model elements, etc., and the model selection process is based on a preset model selection strategy The process of selecting the target training model.
  • the client includes a visual interface
  • the step of generating visualization data corresponding to the target training model and feeding back the visualization data to the client includes:
  • Step S41 Obtain candidate feature data, selection summary data, and training process data corresponding to the model selection process of the target training model;
  • the model selection process of the target training model includes a model iterative training process, a feature selection process, a model selection process, etc., wherein the feature selection process is a process of removing the feature to be removed, and the model selection process The process of selecting a target training model based on a preset model selection strategy.
  • Obtain candidate feature data, selection summary data, and training process data corresponding to the model selection process of the target training model specifically, acquire candidate feature data of the feature selection process and selection summary data of the model selection process in real time And training process data of the model iterative training process.
  • Step S42 Generate visualization data corresponding to the candidate feature data, the selection summary data, and the training process data, and feed back the visualization data to the visualization interface in real time.
  • the visualization data includes graphic data, table data, and the like.
  • the selection of the visualization data corresponding to the summary data and the training process data, and the real-time feedback of the visualization data to the visualization interface in real time, wherein the time interval for real-time feedback of the visualization data to the visualization interface The user of the server can be selected by the backward model to set it, and the user of the client can query the visualization data in real time on the client.
  • the preset model to be trained is trained based on each of the features to be trained and the configuration parameters to obtain the first initial Training the model, and then calculate the first saliency corresponding to each of the features to be trained, and based on each of the first salience, remove the features to be removed from the features to be trained that meet the preset removal saliency requirements, and then Based on the eliminated features to be trained, the first initial training model is cyclically trained to obtain a cyclic training model set, and then based on the configuration parameters, from the first initial training model and the cyclic training model set Selecting a target training model in, then generating visualization data corresponding to the target training model, and feeding back the visualization data to the client.
  • this embodiment first sends the configuration parameters sent by the client associated with the server and acquires the features to be trained, and based on each of the features to be trained and the configuration parameters, performs a comparison of the preset to be trained
  • the training of the model, the first initial training model is obtained, and the first saliency corresponding to each of the features to be trained is calculated, and then based on each of the first salience, the features to be trained are eliminated in accordance with the preset Remove the features to be removed that require saliency, and then perform cyclic training on the first initial training model based on the removed features to be trained to obtain a cyclic training model set, and then based on the configuration parameters, from the
  • the target training model is selected from the first initial training model and the cyclic training model set, and then the visualization data corresponding to the target training model is generated, and the visualization data is fed back to the client.
  • this embodiment provides a model selection method for the backward selection mode of codeless distributed modeling and visual modeling.
  • the user only needs to set and send the necessary configuration parameters to the server through the client. That is to say, the visual data corresponding to the backward model selection process and the backward model selection result can be fed back, that is, the client and the server are connected to communicate with each other for model modeling, which realizes distributed modeling, which is compared with The modeling of the backward selection mode performed by a stand-alone machine improves the modeling efficiency of the backward selection mode.
  • the visualization modeling is realized, which reduces The ability threshold of modelers is required and the modeling efficiency of the backward selection mode is further improved.
  • the user only needs to input the necessary model parameters in the visual interface of the client to obtain the corresponding backward model selection results.
  • the cyclic training model set includes one or more model elements, each of which The model element includes the second initial training model,
  • the feature to be removed that meets the preset removal saliency requirement is removed from the features to be trained, so as to compare the first initial saliency based on the removed features to be trained.
  • the training model performs cyclic training, and the steps to obtain the cyclic training model set include:
  • Step C10 based on each of the first saliency and the preset removal saliency requirements, select the feature to be removed among the features to be trained, and remove the feature to be removed;
  • the first significance can be determined based on the Pearson correlation value, and when the Pearson correlation value is less than or equal to the preset Pearson correlation threshold, the determination is The feature corresponding to the first saliency does not meet the preset saliency removal requirement, that is, the feature corresponding to the first saliency appears to be significant, when the Pearson correlation value is greater than the preset Pearson correlation threshold When, it is determined that the feature corresponding to the first saliency satisfies the preset saliency removal requirement, that is, the feature corresponding to the first saliency is not significant.
  • the feature to be removed among the features to be trained Based on each of the first saliency and the preset removal saliency requirements, select the feature to be removed among the features to be trained, and remove the feature to be removed, specifically, combine each of the first The saliency is compared, the feature with the lowest saliency among the features to be trained is selected as the target feature, and it is judged whether the target feature satisfies the pre-determined saliency requirement, if the target feature meets the pre-determined removal If the saliency requirement is required, the target feature is used as the feature to be eliminated, and the feature to be eliminated is eliminated. If the target feature does not meet the pre-determined saliency requirement for elimination, the current cycle training is ended.
  • the step of selecting the feature to be removed among the features to be trained based on each of the first saliency and the preset removal saliency requirement includes:
  • Step C11 comparing each of the first saliency, and selecting the feature with the lowest saliency among the features to be trained as the target feature;
  • each of the first saliency is compared, and the feature with the lowest saliency is selected as the target feature among the features to be trained. Specifically, the first saliency is selected as a target feature.
  • a comparison to obtain the least significant feature of each of the features to be trained corresponding to each of the saliency, that is, to obtain the feature with the highest Pearson correlation value, that is, in each of the features to be trained The least significant feature is selected as the target feature.
  • Step C12 comparing the target saliency of the target feature with a preset saliency rejection threshold
  • Step C13 If the target significance is less than the preset rejection significance threshold, it is determined that the target feature meets the preset rejection significance requirement, and the target feature is taken as the feature to be rejected.
  • the target saliency of the target feature is compared with a preset rejection saliency threshold, and if the target saliency is less than the preset rejection saliency threshold, the target feature is determined Meet the preset saliency requirement for rejection, and use the target feature as the feature to be rejected.
  • the target saliency of the target feature is compared with a preset saliency threshold, wherein the target The saliency is the first saliency of the target feature. If the target saliency is lower than the preset saliency threshold, the target feature meets the preset saliency removal requirement, that is, the The target feature is not significant, and then the target feature is used as the feature to be eliminated. If the target significance is higher than or equal to the preset significance threshold, the target feature does not satisfy the preset Excluding the significance requirement, that is, the target feature is significant, then this cycle training is ended.
  • Step C20 training the first initial training model based on the eliminated features to be trained to obtain the second initial training model.
  • the cyclic training model set includes one or more model elements.
  • the first initial training model is trained to obtain the second initial training model.
  • the feature data of the eliminated features to be trained is input into the A first initial training model to perform an iterative training update on the first initial training model until the updated first initial training model satisfies a preset training completion judgment condition to obtain the updated first initial training model That is, the second initial training model is obtained, wherein the preset training completion judgment condition includes reaching the maximum number of iterations and reaching the minimum convergence error.
  • Step C30 Calculate the second saliency of each feature to be trained after culling, and based on each of the second saliency, remove again from each feature to be trained after culling that meets the preset removal saliency Other required features to be removed;
  • the second saliency of each feature to be trained after being removed is calculated, and based on each of the second saliency, the removal of each feature to be trained after removal is again consistent with the preset
  • the other features to be removed that require saliency are removed, specifically, the chi-square value wald of each feature to be trained after removal is recalculated, and based on the recalculated chi-square value wald and each removed feature.
  • the degrees of freedom of the features to be trained are calculated, and the second saliency of each feature to be trained after being removed is calculated, and based on each of the second saliency, it is determined whether there is any feature that satisfies the preset after being removed. Remove the feature to be removed that requires saliency.
  • Step C40 Perform cyclic training on the second initial training model based on each of the features to be trained after being removed again, to obtain one or more of the model elements, until the feature to be trained does not exist in each of the features to be trained. Remove features.
  • the second initial training model is cyclically trained to obtain one or more of the model elements until there is no feature in each of the features to be trained
  • the features to be removed specifically, based on the features to be trained after being removed again, the second initial training model is iteratively trained and updated until the second initial training model reaches the training completion judgment condition, and the update is obtained
  • the latter second initial training model that is, one of the model elements is obtained, and the search and elimination of the features to be eliminated are re-circulated, and the bone-setting training update of the cyclically updated second initial training model is performed, Obtain one or more model elements, until there is no feature to be removed that meets the preset removal significance requirement among the features to be trained, then this cyclic training is ended, and then a cyclic training model set is obtained, as shown in Figure 4
  • This embodiment is a schematic diagram of the flow of backward model selection in combination with the first embodiment, where the features in the model are each of the features to be trained
  • the feature to be removed from the features to be trained is selected, and the feature to be removed is removed, and then based on each removed feature
  • the first initial training model is trained to obtain the second initial training model, and then the second saliency of each feature to be trained after being eliminated is calculated, and based on each of the second Saliency, among the features to be trained after being removed, other features to be removed that meet the pre-determined saliency requirement are removed again, and then based on the features to be trained after being removed again, the first 2.
  • the initial training model performs cyclic training to obtain one or more of the model elements until the feature to be removed does not exist in each feature to be trained.
  • the features to be eliminated in each feature to be trained are eliminated one by one, and the first initial training model is analyzed based on the features to be trained after each elimination.
  • the training update is performed until the feature to be removed does not exist in each feature to be trained, the cyclic training model set is obtained, and the model selection of the backward selection mode can be performed based on the cyclic training model set, that is,
  • the model selection of the backward selection mode of distributed modeling and visual modeling lays the foundation, that is, it lays a foundation for solving the technical problems of high threshold and low efficiency of backward selection mode modeling in the prior art.
  • the forward model selection method is applied to the client, and the forward model selection method includes:
  • Step A10 Receive a model selection task, and send configuration parameters corresponding to the model selection task to the server associated with the client, so that the server can perform model selection based on the configuration parameters to obtain a target training model , And obtain the visualization data corresponding to the target training model, so as to send the visualization data to the client;
  • the model selection task includes target model requirements
  • the target model requirements are determined by the configuration parameters
  • the configuration parameters include large iteration coefficients, minimum convergence errors, model selection modes, etc. parameter.
  • the visualization data corresponding to the target training model is sent to the client, specifically, the model selection task is received, and the configuration parameters corresponding to the model selection task are matched in a preset local database or determined by The user sets the configuration parameters by himself based on the model selection task, and further, sends the configuration parameters to the server associated with the client, so that the server can perform a preset initialization based on the configuration parameters.
  • the training update of the model, the model to be trained is obtained, and the cyclic training update is performed on the model to be trained to obtain one or more models to be selected, that is, the cyclic training model set is obtained, and the model to be selected is selected in each of the models to be selected.
  • the model of the preset model selection strategy is used as the target training model, and the process data corresponding to the target training model is converted into the visualization data and fed back to the client, where the visualization data includes candidate feature visualization data and models Select and summarize visualization data and model training process visualization data, where the candidate features are each of the features to be trained, and the model selection summary data includes performing model elements in the cyclic training model set based on a preset model selection strategy. Summary data for model selection.
  • Step A20 Receive the visualization data fed back by the server, and display the visualization data on a preset visualization interface.
  • the client can query the visualization data corresponding to the process data of the server in real time on the preset visualization interface, and it can be in the process of model selection or model selection. After the selection is completed, the process data is inquired, and the client is in communication with the server.
  • a model selection task is received, and the configuration parameters corresponding to the model selection task are sent to the server associated with the client, so that the server can perform model selection based on the configuration parameters to obtain target training Model, and obtain the visualization data corresponding to the target training model to send the visualization data to the client, and then receive the visualization data fed back by the server, and set the visualization data in a preset visualization
  • the interface is displayed. That is, this implementation provides a model selection method for codeless distributed modeling and visual modeling.
  • this embodiment implements distributed modeling, improves the modeling efficiency during model selection, and the model selection process does not have any code development capability requirements for users, which reduces the ability threshold requirements for modelers.
  • the server can convert the process data corresponding to the target training model into visualized data and feed it back to the client, it further reduces the ability threshold requirements for modelers, and the visualized data is convenient for modelers to understand and read.
  • the modeling efficiency of modelers can be further improved, and therefore, the technical problems of high threshold and low efficiency of forward selection model modeling in the prior art are solved.
  • FIG. 6 is a schematic diagram of the device structure of the hardware operating environment involved in the solution of the embodiment of the present application.
  • the backward model selection device may include a processor 1001, such as a CPU, a memory 1005, and a communication bus 1002.
  • the communication bus 1002 is used to implement connection and communication between the processor 1001 and the memory 1005.
  • the memory 1005 may be a high-speed RAM memory, or a non-volatile memory (non-volatile memory), such as a magnetic disk memory.
  • the memory 1005 may also be a storage device independent of the aforementioned processor 1001.
  • the backward model selection device may further include a rectangular user interface, a network interface, a camera, an RF (Radio Frequency) circuit, a sensor, an audio circuit, a WiFi module, and so on.
  • the rectangular user interface may include a display screen (Display) and an input sub-module such as a keyboard (Keyboard), and the optional rectangular user interface may also include a standard wired interface and a wireless interface.
  • the network interface can optionally include a standard wired interface and a wireless interface (such as a WI-FI interface).
  • the structure of the backward model selection device shown in FIG. 6 does not constitute a limitation on the backward model selection device, and may include more or less components than shown in the figure, or a combination of certain components, Or different component arrangements.
  • the memory 1005 which is a computer-readable storage medium, may include an operating system, a network communication module, and a backward model selection program.
  • the operating system is a program that manages and controls the hardware and software resources of the backward model selection device, and supports the operation of the backward model selection program and other software and/or programs.
  • the network communication module is used to realize the communication between the components in the memory 1005 and the communication with other hardware and software in the backward model selection system.
  • the processor 1001 is configured to execute the backward model selection program stored in the memory 1005 to implement the steps of the backward model selection method described in any one of the foregoing items.
  • the specific implementation of the backward model selection device of the present application is basically the same as the foregoing embodiments of the backward model selection method, and will not be repeated here.
  • An embodiment of the present application also provides a backward model selection device.
  • the backward model selection device is applied to a server, and the backward model selection device includes:
  • the first training module is configured to receive the configuration parameters sent by the client associated with the server and obtain the features to be trained, and train a preset model to be trained based on each of the features to be trained and the configuration parameters , To obtain the first initial training model;
  • the second training module is used to calculate the first saliency corresponding to each of the features to be trained, and based on each of the first saliences, to remove the features that meet the preset removal saliency requirements from the features to be trained The features to be eliminated, to perform cyclic training on the first initial training model based on each of the features to be trained after culling, to obtain a cyclic training model set;
  • a selection module for selecting a target training model from the first initial training model and a set of cyclic training models based on the configuration parameters
  • the feedback module is used for generating the visualization data corresponding to the target training model, and feeding back the visualization data to the client.
  • the second training module includes:
  • the first culling sub-module is configured to select the feature to be removed among the features to be trained based on each of the first saliency and the preset saliency removal requirement, and to remove the feature to be removed ;
  • a training sub-module configured to train the first initial training model based on the eliminated features to be trained to obtain the second initial training model
  • the second culling sub-module is used to calculate the second saliency of each feature to be trained after being removed, and based on each of the second saliency, remove the coincidence again from each feature to be trained after being removed Other features to be removed that are required to be removed by the preset saliency;
  • the cyclic training sub-module is used to perform cyclic training on the second initial training model based on each of the features to be trained after being removed again, to obtain one or more of the model elements, until each feature to be trained The feature to be removed does not exist in.
  • the selection submodule includes:
  • the first comparison unit is configured to compare each of the first saliency, and select the feature with the lowest saliency among the features to be trained as the target feature;
  • the second comparison unit is used to compare the target significance of the target feature with a preset significance threshold for rejection
  • the determining unit is configured to determine that if the target significance is less than the preset rejection significance threshold, determine that the target feature meets the preset rejection significance requirement, and use the target feature as the pending Remove features.
  • the second training module further includes:
  • the first calculation sub-module is used to calculate the chi-square value wald of each of the features to be trained
  • the second calculation sub-module is used for calculating each of the first saliency based on each of the chi-square value wald and the degrees of freedom of each of the features to be trained.
  • the first training module includes:
  • a training update sub-module for inputting the feature data corresponding to each of the features to be trained into the preset model to be trained, so as to train and update the preset model to be trained;
  • the first judging sub-module is used to judge whether the updated preset model to be trained satisfies the training completion judging condition, and if the updated preset to be trained model satisfies the training completion judging condition, then Obtaining the first initial training model;
  • the second judgment sub-module is configured to continue to perform iterative training updates on the preset to-be-trained model if the updated preset to-be-trained model does not satisfy the training completion judgment condition until the updated all-in-one model The preset model to be trained satisfies the training completion judgment condition.
  • the selection module includes:
  • the first obtaining sub-module is configured to obtain the model selection strategy in the parameter configuration, wherein the model selection strategy includes an AUC value and an AIC value;
  • the first comparison sub-module is configured to compare the AUC value of each element in the cyclic training model set if the model selection strategy is the AUC value to select the largest corresponding AUC value As the target training model;
  • the second comparison sub-module is used to compare the AIC value of each element in the cyclic training model set if the model selection strategy is the AIC value to select the smallest corresponding AIC value As the target training model.
  • the feedback module includes:
  • the second acquisition sub-module is used to acquire the candidate feature data, selection summary data, and training process data corresponding to the backward model selection process of the target training model;
  • a generating sub-module is used to generate the visualization data corresponding to the candidate feature data, the selection summary data, and the training process data, and feed back the visualization data to the visualization interface in real time.
  • the specific implementation of the backward model selection device of the present application is basically the same as the foregoing embodiments of the backward model selection method, and will not be repeated here.
  • an embodiment of the present application also provides a backward model selection device, the backward model selection device is applied to a client, and the backward model selection device includes:
  • the sending module is configured to receive the model selection task, and send the configuration parameters corresponding to the model selection task to the server associated with the client, so that the server can use the configuration parameters and the acquired configuration parameters.
  • model selection on training features obtaining a target training model, and obtaining visualization data corresponding to the target training model, so as to send the visualization data to the client;
  • the receiving module is configured to receive the visualization data fed back by the server, and display the visualization data on a preset visualization interface.
  • the specific implementation of the backward model selection device of the present application is basically the same as the foregoing embodiments of the backward model selection method, and will not be repeated here.
  • the embodiments of the present application provide a readable storage medium, and the readable storage medium stores one or more programs, and the one or more programs may also be executed by one or more processors for implementation The steps of the backward model selection method described in any one of the above.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

L'invention concerne un procédé et un dispositif de sélection de modèle arrière, et un support d'informations lisible. Le procédé de sélection de modèle arrière consiste à : recevoir des paramètres de configuration envoyés par un client associé à un serveur et acquérir des caractéristiques destinées à un entraînement, et entraîner, sur la base desdites caractéristiques et des paramètres de configuration, un modèle prédéfini à entraîner en vue d'obtenir un premier modèle d'apprentissage initial (S10) ; calculer des premières significations correspondant auxdites caractéristiques, et éliminer, sur la base des premières significations, des caractéristiques à éliminer satisfaisant une exigence d'importance d'élimination prédéfinie parmi les caractéristiques destinées à un entraînement, de façon à effectuer un apprentissage en boucle sur le premier modèle d'apprentissage initial sur la base des caractéristiques éliminées destinées à un entraînement pour obtenir un ensemble de modèles d'apprentissage en boucle (S20) ; sélectionner un modèle d'apprentissage cible parmi le premier modèle d'apprentissage initial et l'ensemble de modèles d'apprentissage en boucle sur la base des paramètres de configuration (S30) ; et générer des données visuelles correspondant au modèle d'apprentissage cible, et renvoyer les données visuelles au client (S40).
PCT/CN2020/134736 2020-01-09 2020-12-09 Dispositif et procédé de sélection de modèle arrière, et support d'enregistrement lisible WO2021139465A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010024439.3A CN111210022B (zh) 2020-01-09 2020-01-09 向后模型选择方法、设备及可读存储介质
CN202010024439.3 2020-01-09

Publications (1)

Publication Number Publication Date
WO2021139465A1 true WO2021139465A1 (fr) 2021-07-15

Family

ID=70786101

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/134736 WO2021139465A1 (fr) 2020-01-09 2020-12-09 Dispositif et procédé de sélection de modèle arrière, et support d'enregistrement lisible

Country Status (2)

Country Link
CN (1) CN111210022B (fr)
WO (1) WO2021139465A1 (fr)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111241746B (zh) * 2020-01-09 2024-01-26 深圳前海微众银行股份有限公司 向前模型选择方法、设备和可读存储介质
CN111210022B (zh) * 2020-01-09 2024-05-17 深圳前海微众银行股份有限公司 向后模型选择方法、设备及可读存储介质
CN112434620B (zh) * 2020-11-26 2024-03-01 新奥新智科技有限公司 场景文字识别方法、装置、设备和计算机可读介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108830380A (zh) * 2018-04-11 2018-11-16 开放智能机器(上海)有限公司 一种基于云服务的训练模型生成方法及系统
US20180374104A1 (en) * 2017-06-26 2018-12-27 Sap Se Automated learning of data aggregation for analytics
CN110298389A (zh) * 2019-06-11 2019-10-01 上海冰鉴信息科技有限公司 训练模型时的多轮循环特征选择方法和装置
CN110543946A (zh) * 2018-05-29 2019-12-06 百度在线网络技术(北京)有限公司 用于训练模型的方法和装置
CN111210022A (zh) * 2020-01-09 2020-05-29 深圳前海微众银行股份有限公司 向后模型选择方法、设备及可读存储介质

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108875289B (zh) * 2017-05-08 2021-12-14 腾讯科技(深圳)有限公司 一种算法调试方法、客户端、后台服务器及系统
US10600005B2 (en) * 2018-06-01 2020-03-24 Sas Institute Inc. System for automatic, simultaneous feature selection and hyperparameter tuning for a machine learning model
CN110378472A (zh) * 2019-07-24 2019-10-25 苏州浪潮智能科技有限公司 一种深度神经网络模型的数据并行训练方法、装置及设备

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180374104A1 (en) * 2017-06-26 2018-12-27 Sap Se Automated learning of data aggregation for analytics
CN108830380A (zh) * 2018-04-11 2018-11-16 开放智能机器(上海)有限公司 一种基于云服务的训练模型生成方法及系统
CN110543946A (zh) * 2018-05-29 2019-12-06 百度在线网络技术(北京)有限公司 用于训练模型的方法和装置
CN110298389A (zh) * 2019-06-11 2019-10-01 上海冰鉴信息科技有限公司 训练模型时的多轮循环特征选择方法和装置
CN111210022A (zh) * 2020-01-09 2020-05-29 深圳前海微众银行股份有限公司 向后模型选择方法、设备及可读存储介质

Also Published As

Publication number Publication date
CN111210022B (zh) 2024-05-17
CN111210022A (zh) 2020-05-29

Similar Documents

Publication Publication Date Title
WO2021139465A1 (fr) Dispositif et procédé de sélection de modèle arrière, et support d'enregistrement lisible
WO2021139462A1 (fr) Procédé et dispositif de sélection progressive de modèles et support lisible de stockage
WO2019233421A1 (fr) Procédé et dispositif de traitement d'image, appareil informatique, et support de stockage
US20190378044A1 (en) Processing dynamic data within an adaptive oracle-trained learning system using curated training data for incremental re-training of a predictive model
WO2022077646A1 (fr) Procédé et appareil d'entraînement d'un modèle d'étudiant pour le traitement d'image
WO2019214344A1 (fr) Procédé et appareil d'apprentissage par renforcement de système, dispositif électronique et support de stockage informatique
WO2020007177A1 (fr) Procédé de proposition de prix exécuté par ordinateur, dispositif de proposition de prix, dispositif électronique et support de stockage
US11687804B2 (en) Latent feature dimensionality bounds for robust machine learning on high dimensional datasets
Hvarfner et al. Joint entropy search for maximally-informed Bayesian optimization
CN113326852A (zh) 模型训练方法、装置、设备、存储介质及程序产品
WO2021139483A1 (fr) Procédé et dispositif de sélection de modèles de transfert et support lisible de stockage
US20220309779A1 (en) Neural network training and application method, device and storage medium
Maire et al. Adaptive incremental mixture markov chain monte carlo
CN111161238A (zh) 图像质量评价方法及装置、电子设备、存储介质
Lin et al. Plug-in performative optimization
US11847599B1 (en) Computing system for automated evaluation of process workflows
US20220351533A1 (en) Methods and systems for the automated quality assurance of annotated images
US11762562B2 (en) Performance analysis apparatus and performance analysis method
CN113793298A (zh) 肺结节检测模型构建优化方法、设备、存储介质及产品
CN112070162A (zh) 多类别处理任务训练样本构建方法、设备及介质
CN113361402B (zh) 识别模型的训练方法、确定准确率的方法、装置和设备
US20230195838A1 (en) Discovering distribution shifts in embeddings
Liu et al. A fuzzy density peak optimization initial centers selection for k-medoids clustering algorithm
CN113269431B (zh) 库存风险预测方法、设备、介质及计算机程序产品
CN114581751B (zh) 图像识别模型的训练方法和图像识别方法、装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20911889

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20911889

Country of ref document: EP

Kind code of ref document: A1