WO2021143067A1 - Method and apparatus for predicting workpiece quality, and computer device - Google Patents

Method and apparatus for predicting workpiece quality, and computer device Download PDF

Info

Publication number
WO2021143067A1
WO2021143067A1 PCT/CN2020/099475 CN2020099475W WO2021143067A1 WO 2021143067 A1 WO2021143067 A1 WO 2021143067A1 CN 2020099475 W CN2020099475 W CN 2020099475W WO 2021143067 A1 WO2021143067 A1 WO 2021143067A1
Authority
WO
WIPO (PCT)
Prior art keywords
model
xgboost
neural network
deep learning
sample data
Prior art date
Application number
PCT/CN2020/099475
Other languages
French (fr)
Chinese (zh)
Inventor
林宏达
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2021143067A1 publication Critical patent/WO2021143067A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • G06F30/27Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/048Activation functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Definitions

  • This application relates to the medical field, in particular to methods, devices and computer equipment for predicting the quality of processed parts.
  • the main purpose of this application is to provide a method for predicting the quality of a machined part, which aims to solve the technical problem that the existing method for predicting the quality of a machined part cannot be industrialized and widely promoted.
  • This application proposes a method for predicting the quality of processed parts, including:
  • the prediction model includes at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model, and the Xgboost model adopts a linear regression model Made corrections and adjustments;
  • This application also provides a device for predicting the quality of a processed part, including:
  • the first judging module is used to judge whether the current data to be analyzed contains the mark feature corresponding to the quality parameter of the defective processed part;
  • the input module is used to input the current to-be-analyzed data into a predictive model for predictive analysis if the current to-be-analyzed data contains the mark characteristics corresponding to the quality parameters of the defective workpiece, wherein the predictive model includes at least the Xgboost model and RandomForest Two of the model and the deep learning neural network model, the Xgboost model has been modified and adjusted through a linear regression model;
  • the summary module is configured to summarize the analysis results of at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model according to a preset method to obtain a summary result;
  • the second judgment module is used for judging the probability that the quality of the processed part corresponding to the current data to be analyzed does not meet the standard according to the summary result.
  • the present application also provides a computer device, including a memory and a processor, the memory stores a computer program, and the method for realizing the quality of a workpiece when the processor executes the computer program includes:
  • the prediction model includes at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model, and the Xgboost model adopts a linear regression model Made corrections and adjustments;
  • the present application also provides a computer-readable storage medium on which a computer program is stored.
  • the method for realizing the quality of a workpiece when the computer program is executed by a processor includes:
  • the prediction model includes at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model, and the Xgboost model adopts a linear regression model Made corrections and adjustments;
  • This application designs the loss function of the Xgboost model based on the characteristics of the Xgboost model and the sparse data characteristics when the Xgboost model is used to predict the quality of processed parts.
  • the logarithmic maximum similarity is used as the loss function and corrected by linear regression, so that The Xgboost model is more suitable for feature analysis of sparse data, so that the method of predicting the quality of processed parts can be widely promoted in industrialization.
  • Fig. 1 is a schematic flow chart of a method for predicting the quality of a processed part according to an embodiment of the present application
  • Fig. 2 is a schematic structural diagram of a device for predicting the quality of a processed part according to an embodiment of the present application
  • Fig. 3 is a schematic diagram of the internal structure of a computer device according to an embodiment of the present application.
  • a method for predicting the quality of a processed part according to an embodiment of the present application includes:
  • the current data to be analyzed in this embodiment is the feature data of the processed part, including product feature data such as height, width, length, weight, density, color uniformity, surface flatness, hardness, etc., as well as production time, production environment parameters, and batches of raw materials.
  • Feature data of secondary processing information By judging whether the current to-be-analyzed data contains the mark features corresponding to the quality parameters of the defective workpiece, and when the current to-be-analyzed data contains the mark features corresponding to the quality parameters of the defective workpiece, the predictive model is triggered to perform predictive analysis.
  • the quality parameters of the above-mentioned defective workpiece are included in the characteristic data of the workpiece.
  • the density of the processed part has the greatest impact on its quality compliance, so it is necessary to search whether the current data to be analyzed includes low-density signature features. If there is a low-density signature feature, it is considered that there is a risk that the quality of the processed part does not meet the standard, which will trigger the prediction.
  • the model analyzes and predicts all characteristic data.
  • the above-mentioned flag characteristics associated with the quality parameters of defective processed parts also include uneven surface, uneven color, and substandard size, etc., which vary according to the quality requirements of the processed parts to be tested.
  • a confidential small gear processing workpiece has very high requirements for its strength, but the special strength testing equipment will scrap the gear after inspecting the gear, so it needs to pass the length, width, height, weight and thermodynamics of the small gear Use characteristic data such as imaging to determine whether the gear's strength is up to standard.
  • the sample data in the model training of this embodiment is thousands of processed parts data, defective processed parts account for a small number, and the data structure features are sparse features, that is, most of the data is assigned a value of zero, which results in the failure to reflect the data during model training.
  • the degree of discrimination affects the discriminative analysis effect of model training.
  • the Xgboost model of this embodiment undergoes a specific correction process to make it meet the differential analysis of sparse data, and the deep learning neural network model meets the differential analysis of sparse data by designing a specific construction structure.
  • the analysis results of at least two of the above-mentioned Xgboost model, RandomForest model and deep learning neural network model are fused to highlight the main results of the machine learning model, and the error analysis content of the machine learning model is carried out in-depth Learn to correct.
  • the above-mentioned three models simultaneously analyze the current data to be analyzed in parallel, and then merge the analysis.
  • the above fusion result is an evaluation of the risk score of the current data to be analyzed. The higher the score, the higher the probability that the quality of the processed parts predicted by the current data to be analyzed will not meet the standard.
  • the loss function of the Xgboost model is designed to use the logarithmic maximum similarity as the loss function and correct it through linear regression to make the Xgboost model It is more suitable for feature analysis of sparse data, so that the method of predicting the quality of processed parts can be widely promoted in industrialization.
  • the loss function of the Xgboost model is constructed based on the logarithm maximum similarity, and the step S1 of judging whether the current data to be analyzed contains the mark feature corresponding to the quality parameter of the defective workpiece includes:
  • S12 Perform gradient optimization on the gradient of the Xgboost model according to the objective function, where the optimization direction of the gradient optimization is X refers to the input sample data, ⁇ is the weight of each variable in the linear regression model, J( ⁇ ; X, y) is the difference between the predicted result and the actual result obtained by the Xgboost model through the input sample data, ⁇ is Weight vector configuration ratio;
  • too large learning parameters will cause the problem of too fast stride in the Xgboost optimization approaching process.
  • the above loss function uses a linear regression model to form the objective function of the Xgboost model to achieve adaptive adjustments to the learning parameters in the Xgboost model, so as to select the most suitable learning parameters for the above sparse data analysis results through the linear regression model to ensure that the Xgboost model is suitable Optimization speed. Since the optimization approach method used by the Xgboost model is the second-order Taylor expansion of the loss function and the node value as the objective function, the shape of the loss function will largely determine the limit of the performance of the Xgboost model.
  • each optimization direction and optimization amount are mainly determined by the first derivative and the second derivative of the loss function .
  • Using the logarithmic maximum similarity as the loss function will prevent the approaching stride of each optimization from being too large, avoiding the situation of f'->infi, that is, overfitting, and f'->0 will not appear In the case of gradient explosion, to ensure the smooth progress of the optimization.
  • the above-mentioned pre-processed sample data refers to the data after the sample data has passed the specified pre-processing method.
  • the above-mentioned pre-processing method includes classifying the characteristic data of the processed parts in the sample data, using Monte Carlo tree search and other statistical methods to filter
  • the feature data in model training is used to improve the development trend path that determines the quality of processed parts through various features and improve the accuracy of prediction.
  • the Random Froes model is used to filter the factors related to the quality of the processed parts, and the two are searched for intersection, and the quality parameter groups of the processed parts related to the substandard quality are obtained, and each quality parameter group corresponding to the development of the substandard product category is formed.
  • the trend path such as the trend path of non-compliant workpieces developed through density feature data, and the trend path of non-compliant workpieces developed through color feature data, weight feature data, and so on.
  • the pre-processed sample data includes the above-mentioned characteristic data and various trend paths that develop into substandard product categories.
  • the sample data is the characteristic data of the turbine blades.
  • the turbine blades are huge processed parts. Whether the center of gravity of the turbine blades meets the requirements of the standard is the required prediction standard.
  • the deep learning neural network model includes an amplification layer, a deconstruction layer, and a learning layer.
  • S1a Select construction elements to respectively construct the amplification layer, deconstruction layer, and learning layer of the deep learning neural network model, wherein the amplification layer includes multiple layers of hidden layers accumulated in sequence, and the deconstruction layer includes multiple layers connected in sequence Multilayer, the learning layer includes multiple hidden layers accumulated in sequence;
  • S1b Connect the amplification layer, the deconstruction layer, and the learning layer in sequence to form the deep learning neural network model;
  • S1c Input the preprocessed sample data into the deep learning neural network model to determine the model parameters of the deep learning neural network model.
  • the sample data is sparse structure data
  • Data realizes differentiated analysis.
  • the above-mentioned deep learning neural network model uses the amplification layer as the starting structure, the deconstruction layer as the intermediate structure, and the learning layer as the end structure.
  • the above-mentioned amplification layer is composed of 4 hidden layers, which are 2 ⁇ 10ReLu, 2 ⁇ 11ReLu, 2 ⁇ 12ReLu and 2 ⁇ 13ReLu which are connected in sequence, 2 ⁇ 10ReLu is connected to the sample data input terminal, and 2 ⁇ 13ReLu is connected to the deconstruction layer. Since the sample data entering each hidden layer of the amplification layer is not subjected to convolution processing, only the sample data is partially enlarged to determine the distinguishing characteristics between the sample data, and the sample data is originally assigned to 0 local features for refinement distinguish.
  • the above-mentioned deconstruction layer is composed of two convolutional layers, and each convolutional layer includes Bach Normalization*2 ⁇ 10 and Average Pooling which are sequentially connected.
  • the sample data is rearranged and deconstructed through the deconstruction layer to obtain the local features with the original value of zero, and the association relationship to the entire sample data is distinguished after the magnification and refinement, and the value of non-zero is obtained.
  • the above-mentioned learning layer is composed of three hidden layers, respectively 2 ⁇ 10ReLu, 2 ⁇ 5ReLu and 2 ⁇ 4ReLu connected in sequence, 2 ⁇ 10ReLu is connected to the deconstruction layer, and 2 ⁇ 4ReLu is connected to the Softmax classifier.
  • the preprocessing process of the above sample data is the same as above, and will not be repeated.
  • the number of the aforementioned convolutional layers and hidden layers is determined according to the degree of optimization and the amount of calculation in the specific training process.
  • step S13 or S1c it includes:
  • the preprocessed sample data is partially enlarged and displayed according to the area selection instruction, where the area selection instruction is issued according to the mapping area when the user clicks on the screen and displayed at the sample data of the mapping area ,
  • the area selection instruction includes at least add and delete;
  • S104 Restore the display state of the corrected sample data to the state before the partial enlarged display.
  • the sample data preprocessed in this embodiment can be manually revised.
  • the preprocessed sample data is sample data processed through statistical processing or model processing, and it is presumed to be data that is beneficial to improving the prediction accuracy of the prediction model. , Improve the prediction accuracy after training the prediction model with sample data.
  • the above-mentioned partial enlargement means that the connection relationship of the feature in the trend path of the product category that is developing into the substandard product category is not changed, and only the partial enlargement of the feature is performed, so that the feature can be accurately corrected.
  • step S3 of collecting the analysis results of at least two of the Xgboost model, the RandomForest model and the deep learning neural network model according to a preset method to obtain the summary result includes:
  • the analysis results of the same input sample data are analyzed by the Xgboost model, the RandomForest model, and the deep learning neural network model, and the weighted average is used to obtain the summary result, so that the summary result can avoid the influence of the defects of each model, and the result is merged It is more in line with objective reality and the forecast results are more accurate.
  • the weights of the Xgboost model, the RandomForest model, and the deep learning neural network model are W1, W2, and W3, respectively.
  • performing a weighted average of the matrix data corresponding to each of the analysis results according to the weights corresponding to the Xgboost model, the RandomForest model, and the deep learning neural network model to obtain the summary result before step S31 includes:
  • S311 Input sample data carrying tags into the Xgboost model, RandomForest model, and deep learning neural network model for training;
  • S312 Obtain feedback results of the Xgboost model, the RandomForest model, and the deep learning neural network model on the labeled sample data;
  • S313 Calculate the respective weights corresponding to the Xgboost model, RandomForest model, and deep learning neural network model through a linear regression model according to each of the feedback results and the assignment of the carrying tags.
  • a device for predicting the quality of a processed part includes:
  • the first judgment module 1 is used for judging whether the current data to be analyzed contains the mark features corresponding to the quality parameters of the defective processed parts;
  • the input module 2 is used to input the current to-be-analyzed data into a predictive model for predictive analysis if the current to-be-analyzed data contains the mark features corresponding to the quality parameters of the defective workpiece, wherein the predictive model includes at least the Xgboost model, Two of the RandomForest model and the deep learning neural network model, the Xgboost model has been modified and adjusted through a linear regression model;
  • the summary module 3 is configured to summarize the analysis results of at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model according to a preset method to obtain a summary result;
  • the second judgment module 4 is configured to judge the probability that the quality of the processed part corresponding to the current data to be analyzed does not meet the standard according to the summary result.
  • the current data to be analyzed in this embodiment is the feature data of the processed part, including product feature data such as height, width, length, weight, density, color uniformity, surface flatness, hardness, etc., as well as production time, production environment parameters, and batches of raw materials.
  • Feature data of secondary processing information By judging whether the current to-be-analyzed data contains the mark features corresponding to the quality parameters of the defective workpiece, and when the current to-be-analyzed data contains the mark features corresponding to the quality parameters of the defective workpiece, the predictive model is triggered to perform predictive analysis.
  • the quality parameters of the above-mentioned defective workpiece are included in the characteristic data of the workpiece.
  • the density of the processed part has the greatest impact on its quality compliance, so it is necessary to search whether the current data to be analyzed includes low-density signature features. If there is a low-density signature feature, it is considered that there is a risk that the quality of the processed part does not meet the standard, which will trigger the prediction.
  • the model analyzes and predicts all characteristic data.
  • the above-mentioned flag characteristics associated with the quality parameters of defective processed parts also include uneven surface, uneven color, and substandard size, etc., which vary according to the quality requirements of the processed parts to be tested.
  • a confidential small gear processing workpiece has very high requirements for its strength, but the special strength testing equipment will scrap the gear after inspecting the gear, so it needs to pass the length, width, height, weight and thermodynamics of the small gear Use characteristic data such as imaging to determine whether the gear's strength is up to standard.
  • the sample data in the model training of this embodiment is thousands of processed parts data, defective processed parts account for a small number, and the data structure features are sparse features, that is, most of the data is assigned a value of zero, which results in the failure to reflect the data during model training.
  • the degree of discrimination affects the discriminative analysis effect of model training.
  • the Xgboost model of this embodiment undergoes a specific correction process to make it meet the differential analysis of sparse data, and the deep learning neural network model meets the differential analysis of sparse data by designing a specific construction structure.
  • the analysis results of at least two of the above-mentioned Xgboost model, RandomForest model and deep learning neural network model are fused to highlight the main results of the machine learning model, and the error analysis content of the machine learning model is carried out in-depth Learn to correct.
  • the above-mentioned three models simultaneously analyze the current data to be analyzed in parallel, and then merge the analysis.
  • the above fusion result is an evaluation of the risk score of the current data to be analyzed. The higher the score, the higher the probability that the quality of the processed parts predicted by the current data to be analyzed will not meet the standard.
  • the loss function of the Xgboost model is designed to use the logarithmic maximum similarity as the loss function and correct it through linear regression to make the Xgboost model It is more suitable for feature analysis of sparse data, so that the method of predicting the quality of processed parts can be widely promoted in industrialization.
  • the loss function of the Xgboost model is constructed according to the logarithmic maximum similarity, and the device for predicting the quality of the processed part includes:
  • Forming module used to calculate the two-dimensional norm of the loss function gradient matrix
  • a linear regression model is used to form the objective function of the Xgboost model
  • the loss function is y refers to the real result
  • x refers to the input sample data
  • refers to the weight of each function in the Xgboost model
  • ) is the conditional probability
  • is the weight of each variable in the linear regression model
  • J( ⁇ ; X , Y) is the difference between the predicted result and the actual result obtained by the Xgboost model by inputting sample data
  • is the weight vector configuration ratio
  • the optimization module is used to perform gradient optimization on the gradient of the Xgboost model according to the objective function, wherein the optimization direction of the gradient optimization is X refers to the input sample data, ⁇ is the weight of each variable in the linear regression model, J( ⁇ ; X, y) is the difference between the predicted result and the actual result obtained by the Xgboost model through the input sample data, ⁇ is Weight vector configuration ratio;
  • the training module is used to input the preprocessed sample data into the Xgboost model for gradient optimization for training, so as to determine the learning parameters of the optimized Xgboost model.
  • too large learning parameters will cause the problem of too fast stride in the Xgboost optimization approaching process.
  • the above loss function uses a linear regression model to form the objective function of the Xgboost model to achieve adaptive adjustments to the learning parameters in the Xgboost model, so as to select the most suitable learning parameters for the above sparse data analysis results through the linear regression model to ensure that the Xgboost model is suitable Optimization speed. Since the optimization approach method used by the Xgboost model is the second-order Taylor expansion of the loss function and the node value as the objective function, the shape of the loss function will largely determine the limit of the performance of the Xgboost model.
  • each optimization direction and optimization amount are mainly determined by the first derivative and the second derivative of the loss function .
  • Using the logarithmic maximum similarity as the loss function will prevent the approaching stride of each optimization from being too large, avoiding the situation of f'->infi, that is, overfitting, and f'->0 will not appear In the case of gradient explosion, to ensure the smooth progress of the optimization.
  • the above-mentioned pre-processed sample data refers to the data after the sample data has passed the specified pre-processing method.
  • the above-mentioned pre-processing method includes classifying the characteristic data of the processed parts in the sample data, using Monte Carlo tree search and other statistical methods to filter
  • the feature data in model training is used to improve the development trend path that determines the quality of processed parts through various features and improve the accuracy of prediction.
  • the Random Froes model is used to filter the factors related to the quality of the processed parts, and the two are searched for intersection, and the quality parameter groups of the processed parts related to the substandard quality are obtained, and each quality parameter group corresponding to the development of the substandard product category is formed.
  • the trend path such as the trend path of non-compliant workpieces developed through density feature data, and the trend path of non-compliant workpieces developed through color feature data, weight feature data, and so on.
  • the pre-processed sample data includes the above-mentioned characteristic data and various trend paths that develop into substandard product categories.
  • the sample data is the characteristic data of the turbine blades.
  • the turbine blades are huge processed parts. Whether the center of gravity of the turbine blades meets the requirements of the standard is the required prediction standard.
  • the deep learning neural network model includes an amplification layer, a deconstruction layer, and a learning layer.
  • the device for predicting the quality of the processed part includes:
  • the selection module is used to select construction elements to respectively construct the amplification layer, deconstruction layer, and learning layer of the deep learning neural network model, wherein the amplification layer includes multiple layers of hidden layers accumulated in sequence, and the deconstruction layer includes multiple layers in sequence A connected convolutional layer, where the learning layer includes multiple hidden layers accumulated in sequence;
  • connection module is configured to sequentially connect the amplification layer, the deconstruction layer, and the learning layer to form the deep learning neural network model
  • the determining module is used to input the preprocessed sample data into the deep learning neural network model to determine the model parameters of the deep learning neural network model.
  • the sample data is sparse structure data
  • Data realizes differentiated analysis.
  • the above-mentioned deep learning neural network model uses the amplification layer as the starting structure, the deconstruction layer as the intermediate structure, and the learning layer as the end structure.
  • the above-mentioned amplification layer is composed of 4 hidden layers, which are 2 ⁇ 10ReLu, 2 ⁇ 11ReLu, 2 ⁇ 12ReLu and 2 ⁇ 13ReLu which are connected in sequence, 2 ⁇ 10ReLu is connected to the sample data input terminal, and 2 ⁇ 13ReLu is connected to the deconstruction layer. Since the sample data entering each hidden layer of the amplification layer is not subjected to convolution processing, only the sample data is partially enlarged to determine the distinguishing characteristics between the sample data, and the sample data is originally assigned to 0 local features for refinement distinguish.
  • the above-mentioned deconstruction layer is composed of two convolutional layers, and each convolutional layer includes Bach Normalization*2 ⁇ 10 and Average Pooling which are sequentially connected.
  • the sample data is rearranged and deconstructed through the deconstruction layer to obtain the local features with the original value of zero, and the association relationship to the entire sample data is distinguished after the magnification and refinement, and the value of non-zero is obtained.
  • the above-mentioned learning layer is composed of three hidden layers, respectively 2 ⁇ 10ReLu, 2 ⁇ 5ReLu and 2 ⁇ 4ReLu connected in sequence, 2 ⁇ 10ReLu is connected to the deconstruction layer, and 2 ⁇ 4ReLu is connected to the Softmax classifier.
  • the preprocessing process of the above sample data is the same as above, and will not be repeated.
  • the number of the aforementioned convolutional layers and hidden layers is determined according to the degree of optimization and the amount of calculation in the specific training process.
  • the device for predicting the quality of processed parts includes:
  • the third judgment module is used to judge whether a correction instruction to the preprocessed sample data is received
  • the enlargement module is configured to, if a correction instruction for the preprocessed sample data is received, the preprocessed sample data is partially enlarged and displayed according to the area selection instruction, wherein the area selection instruction is based on the user's click
  • the mapping area on the screen is sent out and displayed at the sample data of the mapping area, and the area selection instruction includes at least adding and deleting;
  • the correction module is configured to correct the sample data corresponding to the mapping area according to the type of the received area selection instruction
  • the restoration module is used to restore the display state of the modified sample data to the state before the partial magnification display.
  • the sample data preprocessed in this embodiment can be manually revised.
  • the preprocessed sample data is sample data processed through statistical processing or model processing, and it is presumed to be data that is beneficial to improving the prediction accuracy of the prediction model. , Improve the prediction accuracy after training the prediction model with sample data.
  • the above-mentioned partial enlargement means that the connection relationship of the feature in the trend path of the product category that is developing into the substandard product category is not changed, and only the partial enlargement of the feature is performed, so that the feature can be accurately corrected.
  • summary module 3 includes:
  • the first input unit is configured to input the current data to be analyzed into the Xgboost model, the RandomForest model, and the deep learning neural network model to obtain analysis results respectively;
  • the summary unit is configured to perform a weighted average of the matrix data corresponding to each analysis result according to the weights corresponding to the Xgboost model, the RandomForest model and the deep learning neural network model to obtain the summary result.
  • the analysis results of the same input sample data are analyzed by the Xgboost model, the RandomForest model, and the deep learning neural network model, and the weighted average is used to obtain the summary result, so that the summary result can avoid the influence of the defects of each model, and the result is merged It is more in line with objective reality and the forecast results are more accurate.
  • the weights of the Xgboost model, the RandomForest model, and the deep learning neural network model are W1, W2, and W3, respectively.
  • summary module 3 includes:
  • the second input unit is used to input the sample data carrying the label into the Xgboost model, the RandomForest model and the deep learning neural network model for training;
  • An obtaining unit configured to obtain feedback results of the Xgboost model, the RandomForest model, and the deep learning neural network model on the labeled sample data;
  • the calculation unit is used to calculate the respective weights of the Xgboost model, RandomForest model and deep learning neural network model through a linear regression model according to each of the feedback results and the assignment of the carrying tags.
  • an embodiment of the present application also provides a computer device.
  • the computer device may be a server, and its internal structure may be as shown in FIG. 3.
  • the computer equipment includes a processor, a memory, a network interface, and a database connected through a system bus. Among them, the processor designed by the computer is used to provide calculation and control capabilities.
  • the memory of the computer device includes a non-volatile storage medium and an internal memory.
  • the non-volatile storage medium stores an operating system, a computer program, and a database.
  • the memory provides an environment for the operation of the operating system and the computer program in the non-volatile storage medium.
  • the database of the computer equipment is used to store all the data needed in the process of predicting the quality of the workpiece.
  • the network interface of the computer device is used to communicate with an external terminal through a network connection.
  • the computer program is executed by the processor to realize the method of predicting the quality of the workpiece.
  • the processor executes the method for predicting the quality of the processed part, including: judging whether the current data to be analyzed contains a mark feature corresponding to the quality parameter of the defective processed part; if so, inputting the current to-be-analyzed data into a predictive model for predictive analysis,
  • the prediction model includes at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model.
  • the Xgboost model is modified and adjusted by a linear regression model; the Xgboost model, the RandomForest model, and the The analysis results of at least two models in the deep learning neural network model are summarized and processed to obtain a summary result; according to the summary result, the probability that the quality of the processed part corresponding to the current data to be analyzed is not up to standard is determined.
  • the above-mentioned computer equipment based on the characteristics of the Xgboost model and the sparse data characteristics of the Xgboost model when used for the quality prediction of the processed parts, designed the loss function of the Xgboost model to use the logarithmic maximum similarity as the loss function and correct it through linear regression , which makes the Xgboost model more suitable for feature analysis of sparse data, and enables the method of predicting the quality of processed parts to be widely promoted in industrialization.
  • the Random Forest model, the deep learning neural network model, and the modified Xgboost model After training the Random Forest model, the deep learning neural network model, and the modified Xgboost model through the preprocessed feature data, respectively, the data samples to be tested are input into the above three models for analysis, and three analysis results are obtained. Integrate the above three results through the stacking model to realize the quality prediction of the processed parts corresponding to the sample data, and change the negative impact of the sparse feature of the data on the model.
  • the loss function of the Xgboost model is constructed according to the logarithm maximum similarity, and the above-mentioned processor determines whether the current data to be analyzed contains the mark feature corresponding to the quality parameter of the defective workpiece before the step of:
  • the two-dimensional norm of the function gradient matrix As a benchmark, a linear regression model is used to form the objective function of the Xgboost model
  • the loss function is y refers to the real result
  • x refers to the input sample data
  • refers to the weight of each function in the Xgboost model
  • ) is the conditional probability
  • is the weight of each variable in the linear regression model
  • J( ⁇ ; X , Y) is the difference between the Xgboost model's predicted result and the actual result obtained by inputting sample data
  • is the weight vector configuration ratio
  • the gradient of the Xgboost model is optimized according to the objective function, where the gradient is optimized
  • the optimization direction
  • the deep learning neural network model includes an amplification layer, a deconstruction layer, and a learning layer.
  • the above-mentioned processor includes: selecting The construction elements respectively construct the amplification layer, deconstruction layer, and learning layer of the deep learning neural network model, wherein the amplification layer includes multiple layers of hidden layers accumulated in sequence, and the deconstruction layer includes multiple layers of convolutional layers connected in sequence, The learning layer includes multiple layers of hidden layers accumulated in sequence; sequentially connecting the amplification layer, deconstruction layer, and learning layer to form the deep learning neural network model; and inputting preprocessed sample data to the deep learning neural network model , To determine the model parameters of the deep learning neural network model.
  • the processor inputs preprocessed sample data into the Xgboost model for gradient optimization for training, so as to determine the learning parameters of the optimized Xgboost model before the step, or preprocess
  • the method includes: determining whether a correction instruction for the preprocessed sample data is received; if so , The preprocessed sample data is partially enlarged and displayed according to the area selection instruction, where the area selection instruction is issued according to the mapping area when the user clicks on the screen and is displayed at the sample data of the mapping area.
  • the area selection instruction includes at least add and delete; according to the type of the received area selection instruction, the sample data corresponding to the mapping area is corrected; the display state of the corrected sample data is restored to the state before the partial enlarged display .
  • the above-mentioned processor summarizes the analysis results of at least two of the Xgboost model, the RandomForest model and the deep learning neural network model according to a preset method, and the step of obtaining the summary result includes: combining all the analysis results of the Xgboost model, the RandomForest model, and the deep learning neural network model.
  • the analysis results obtained after the current data to be analyzed are respectively input to the Xgboost model, RandomForest model, and deep learning neural network model; the matrix data corresponding to each analysis result is calculated according to the Xgboost model, RandomForest model and depth
  • the weights corresponding to the neural network models are learned, and the weighted average is performed to obtain the summary result.
  • the above-mentioned processor performs a weighted average on the matrix data corresponding to each of the analysis results according to the weights corresponding to the Xgboost model, the RandomForest model, and the deep learning neural network model to obtain the summary result. , Including: inputting sample data carrying tags into the Xgboost model, RandomForest model, and deep learning neural network model for training; acquiring the Xgboost model, RandomForest model, and deep learning neural network model to perform training on the carrying tag The feedback results of the sample data; according to each of the feedback results and the assignment of the tags, the weights corresponding to the Xgboost model, the RandomForest model and the deep learning neural network model are calculated through a linear regression model.
  • FIG. 3 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied.
  • the present application also provides a computer-readable storage medium.
  • the computer-readable storage medium may be non-volatile or volatile.
  • a computer program is stored thereon, and the computer program is executed by the processor to realize the prediction of the workpiece
  • the quality method includes: judging whether the current data to be analyzed contains the mark features corresponding to the quality parameters of the defective processed parts; if so, inputting the current data to be analyzed into a predictive model for predictive analysis, wherein the predictive model at least includes Two of the Xgboost model, the RandomForest model and the deep learning neural network model.
  • the Xgboost model is modified and adjusted by a linear regression model; at least one of the Xgboost model, the RandomForest model, and the deep learning neural network model is adjusted according to a preset method
  • the analysis results of the two models are summarized and processed to obtain a summary result; according to the summary result, the probability that the quality of the processed part corresponding to the current data to be analyzed is not up to standard is judged.
  • the above-mentioned computer-readable storage medium based on the characteristics of the Xgboost model and the sparse data characteristics when the Xgboost model is used for the quality prediction of processed parts, the loss function of the Xgboost model is designed to pass the logarithmic maximum similarity as the loss function and pass the linear Regression is modified to make the Xgboost model more suitable for feature analysis of sparse data, so that the method of predicting the quality of processed parts can be widely promoted in industrialization.
  • the Random Forest model, the deep learning neural network model, and the modified Xgboost model through the preprocessed feature data, respectively the data samples to be tested are input into the above three models for analysis, and three analysis results are obtained. Integrate the above three results through the stacking model to realize the quality prediction of the processed parts corresponding to the sample data, and change the negative impact of the sparse feature of the data on the model.
  • the loss function of the Xgboost model is constructed according to the logarithm maximum similarity, and the above-mentioned processor determines whether the current data to be analyzed contains the mark feature corresponding to the quality parameter of the defective workpiece before the step of:
  • the two-dimensional norm of the function gradient matrix As a benchmark, a linear regression model is used to form the objective function of the Xgboost model
  • the loss function is y refers to the real result
  • x refers to the input sample data
  • refers to the weight of each function in the Xgboost model
  • ) is the conditional probability
  • is the weight of each variable in the linear regression model
  • J( ⁇ ; X , Y) is the difference between the predicted result of the Xgboost model obtained by inputting sample data and the actual result
  • is the weight vector configuration ratio
  • the gradient of the Xgboost model is optimized according to the objective function, where the gradient is optimized
  • the optimization direction
  • the deep learning neural network model includes an amplification layer, a deconstruction layer, and a learning layer.
  • the above-mentioned processor includes: selecting The construction elements respectively construct the amplification layer, deconstruction layer, and learning layer of the deep learning neural network model, wherein the amplification layer includes multiple layers of hidden layers accumulated in sequence, and the deconstruction layer includes multiple layers of convolutional layers connected in sequence, The learning layer includes multiple layers of hidden layers accumulated in sequence; sequentially connecting the amplification layer, deconstruction layer, and learning layer to form the deep learning neural network model; and inputting preprocessed sample data to the deep learning neural network model , To determine the model parameters of the deep learning neural network model.
  • the processor inputs preprocessed sample data into the Xgboost model for gradient optimization for training, so as to determine the learning parameters of the optimized Xgboost model before the step, or preprocess
  • the method includes: determining whether a correction instruction for the preprocessed sample data is received; if so , The preprocessed sample data is partially enlarged and displayed according to the area selection instruction, where the area selection instruction is issued according to the mapping area when the user clicks on the screen and is displayed at the sample data of the mapping area.
  • the area selection instruction includes at least add and delete; according to the type of the received area selection instruction, the sample data corresponding to the mapping area is corrected; the display state of the corrected sample data is restored to the state before the partial enlarged display .
  • the above-mentioned processor summarizes the analysis results of at least two of the Xgboost model, the RandomForest model and the deep learning neural network model according to a preset method, and the step of obtaining the summary result includes: combining all the analysis results of the Xgboost model, the RandomForest model, and the deep learning neural network model.
  • the analysis results obtained after the current data to be analyzed are respectively input to the Xgboost model, RandomForest model, and deep learning neural network model; the matrix data corresponding to each analysis result is calculated according to the Xgboost model, RandomForest model and depth
  • the weights corresponding to the neural network models are learned, and the weighted average is performed to obtain the summary result.
  • the above-mentioned processor performs a weighted average on the matrix data corresponding to each of the analysis results according to the weights corresponding to the Xgboost model, the RandomForest model, and the deep learning neural network model to obtain the summary result. , Including: inputting sample data carrying tags into the Xgboost model, RandomForest model, and deep learning neural network model for training; acquiring the Xgboost model, RandomForest model, and deep learning neural network model to perform training on the carrying tag The feedback results of the sample data; according to each of the feedback results and the assignment of the tags, the weights corresponding to the Xgboost model, the RandomForest model and the deep learning neural network model are calculated through a linear regression model.
  • Non-volatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory.
  • Volatile memory may include random access memory (RAM) or external cache memory.
  • RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), dual-rate data rate SDRAM (SSRSDRAM), enhanced SDRAM (ESDRAM), synchronous Link (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Medical Informatics (AREA)
  • Computer Hardware Design (AREA)
  • Geometry (AREA)
  • General Factory Administration (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A method for predicting workpiece quality, relating to the field of medicine, and comprising: determining whether current data to be analyzed contains an indicator feature corresponding to a quality parameter of a defective workpiece (S1); if such a feature is contained, then inputting the current data to be analyzed into a prediction model for predictive analysis, the prediction model at least comprising two from among an Xgboost model, a RandomForest model, or a deep learning neural network model, the Xgboost model performing correction adjustment by means of a linear regression model (S2); according to a preset method, performing aggregation processing on the analysis results of the at least two models from among the Xgboost model, the RandomForest model, or the deep learning neural network model to obtain an aggregated result (S3); and on the basis of the aggregated result, determining the probability that the workpiece quality corresponding to the current data to be analyzed is substandard (S4). A loss function for an Xgboost model is designed on the basis of features of the Xgboost model and sparse data features during quality prediction, causing the Xgboost model to be more suitable for feature analysis of sparse data, and causing the method for predicting workpiece quality in the field of medicine to be industrially applicable.

Description

预测加工件质量的方法、装置和计算机设备Method, device and computer equipment for predicting the quality of processed parts
本申请要求于2020年05月28日提交中国专利局、申请号为202010469626.2,发明名称为“预测加工件质量的方法、装置和计算机设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on May 28, 2020, the application number is 202010469626.2, and the invention title is "Methods, Apparatus and Computer Equipment for Predicting the Quality of Workpieces", the entire contents of which are incorporated by reference In this application.
技术领域Technical field
本申请涉及到医疗领域,特别是涉及到预测加工件质量的方法、装置和计算机设备。This application relates to the medical field, in particular to methods, devices and computer equipment for predicting the quality of processed parts.
背景技术Background technique
医疗领域的加工件种类繁多,包括诊断类的医疗器械上的加工件和治疗类的医疗器械上的加工件,质量精度要求都比较高。上述各加工件的质量预估需要参考的因素也多,现有预测加工件质量是否达标,或加工件是否有瑕疵缺陷不够准确,需要借助专业的质量评审员评审,发明人意识到这样不仅需要评审人对加工件进行各种相关检查,导致耗时长,且不能够进行工业化普遍推广。There are many types of processed parts in the medical field, including processed parts on diagnostic medical devices and processed parts on therapeutic medical devices. The quality and accuracy requirements are relatively high. There are also many factors that need to be referred to in the quality estimation of the above-mentioned processed parts. The existing prediction of whether the quality of the processed parts meets the standard, or whether the processed parts has defects or defects is not accurate enough, needs to be reviewed by a professional quality reviewer. The inventor realizes that this is not only necessary The reviewers carried out various related inspections on the processed parts, which resulted in a long time-consuming process and was unable to carry out general industrialization promotion.
技术问题technical problem
本申请的主要目的为提供预测加工件质量的方法,旨在解决现有预测加工件质量的方法不能够进行工业化普遍推广技术问题。The main purpose of this application is to provide a method for predicting the quality of a machined part, which aims to solve the technical problem that the existing method for predicting the quality of a machined part cannot be industrialized and widely promoted.
技术解决方案Technical solutions
本申请提出一种预测加工件质量的方法,包括:This application proposes a method for predicting the quality of processed parts, including:
判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征;Determine whether the current data to be analyzed contains the mark characteristics corresponding to the quality parameters of the defective workpiece;
若是,则将所述当前待分析数据输入预测模型中进行预测分析,其中,所述预测模型至少包括Xgboost模型、RandomForest模型和深度学习神经网络模型中的两种,所述Xgboost模型通过线性回归模型进行了修正调整;If yes, input the current data to be analyzed into a prediction model for prediction analysis, where the prediction model includes at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model, and the Xgboost model adopts a linear regression model Made corrections and adjustments;
按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果;Summarizing the analysis results of at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model according to a preset method, to obtain a summary result;
根据所述汇总结果判断所述当前待分析数据对应的加工件质量不达标的几率。Determine the probability that the quality of the processed part corresponding to the current data to be analyzed does not meet the standard according to the summary result.
本申请还提供了一种预测加工件质量的装置,包括:This application also provides a device for predicting the quality of a processed part, including:
第一判断模块,用于判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征;The first judging module is used to judge whether the current data to be analyzed contains the mark feature corresponding to the quality parameter of the defective processed part;
输入模块,用于若当前待分析数据含有瑕疵加工件的质量参数对应的标志特征,则将所述当前待分析数据输入预测模型中进行预测分析,其中,所述预测模型至少包括Xgboost模型、RandomForest模型和深度学习神经网络模型中的两种,所述Xgboost模型通过线性回归模型进行了修正调整;The input module is used to input the current to-be-analyzed data into a predictive model for predictive analysis if the current to-be-analyzed data contains the mark characteristics corresponding to the quality parameters of the defective workpiece, wherein the predictive model includes at least the Xgboost model and RandomForest Two of the model and the deep learning neural network model, the Xgboost model has been modified and adjusted through a linear regression model;
汇总模块,用于按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果;The summary module is configured to summarize the analysis results of at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model according to a preset method to obtain a summary result;
第二判断模块,用于根据所述汇总结果判断所述当前待分析数据对应的加工件质量不达标的几率。The second judgment module is used for judging the probability that the quality of the processed part corresponding to the current data to be analyzed does not meet the standard according to the summary result.
本申请还提供了一种计算机设备,包括存储器和处理器,所述存储器存储有计算机程序,所述处理器执行所述计算机程序时实现预测加工件质量的方法,包括:The present application also provides a computer device, including a memory and a processor, the memory stores a computer program, and the method for realizing the quality of a workpiece when the processor executes the computer program includes:
判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征;Determine whether the current data to be analyzed contains the mark characteristics corresponding to the quality parameters of the defective workpiece;
若是,则将所述当前待分析数据输入预测模型中进行预测分析,其中,所述预测模型至少包括Xgboost模型、RandomForest模型和深度学习神经网络模型中的两种,所述Xgboost模型通过线性回归模型进行了修正调整;If yes, input the current data to be analyzed into a prediction model for prediction analysis, where the prediction model includes at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model, and the Xgboost model adopts a linear regression model Made corrections and adjustments;
按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果;Summarizing the analysis results of at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model according to a preset method, to obtain a summary result;
根据所述汇总结果判断所述当前待分析数据对应的加工件质量不达标的几率。Determine the probability that the quality of the processed part corresponding to the current data to be analyzed does not meet the standard according to the summary result.
本申请还提供了一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序 被处理器执行时实现预测加工件质量的方法,包括:The present application also provides a computer-readable storage medium on which a computer program is stored. The method for realizing the quality of a workpiece when the computer program is executed by a processor includes:
判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征;Determine whether the current data to be analyzed contains the mark characteristics corresponding to the quality parameters of the defective workpiece;
若是,则将所述当前待分析数据输入预测模型中进行预测分析,其中,所述预测模型至少包括Xgboost模型、RandomForest模型和深度学习神经网络模型中的两种,所述Xgboost模型通过线性回归模型进行了修正调整;If yes, input the current data to be analyzed into a prediction model for prediction analysis, where the prediction model includes at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model, and the Xgboost model adopts a linear regression model Made corrections and adjustments;
按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果;Summarizing the analysis results of at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model according to a preset method, to obtain a summary result;
根据所述汇总结果判断所述当前待分析数据对应的加工件质量不达标的几率。Determine the probability that the quality of the processed part corresponding to the current data to be analyzed does not meet the standard according to the summary result.
有益效果Beneficial effect
本申请依据Xgboost模型的特征以及Xgboost模型用于加工件质量预测时的稀疏数据特征,设计了Xgboost模型的损失函数,以通过将对数最大相似性作为损失函数,并通过线性回归进行修正,使Xgboost模型更适用于稀疏数据的特征分析,使预测加工件质量的方法能够进行工业化普遍推广。通过将Random Forest模型、深度学习神经网络模型以及修正后的Xgboost模型分别通过预处理后的特征数据各自训练后,通过将待测数据样本分别输入到上述三模型中进行分析,得到三个分析结果,通过stacking模型融合上述三个结果,实现对样本数据对应加工件的质量预测,改变数据的稀疏特征对模型的不良影响。This application designs the loss function of the Xgboost model based on the characteristics of the Xgboost model and the sparse data characteristics when the Xgboost model is used to predict the quality of processed parts. The logarithmic maximum similarity is used as the loss function and corrected by linear regression, so that The Xgboost model is more suitable for feature analysis of sparse data, so that the method of predicting the quality of processed parts can be widely promoted in industrialization. After training the Random Forest model, the deep learning neural network model, and the modified Xgboost model through the preprocessed feature data, respectively, the data samples to be tested are input into the above three models for analysis, and three analysis results are obtained. Integrate the above three results through the stacking model to realize the quality prediction of the processed parts corresponding to the sample data, and change the negative impact of the sparse feature of the data on the model.
附图说明Description of the drawings
图1本申请一实施例的预测加工件质量的方法流程示意图;Fig. 1 is a schematic flow chart of a method for predicting the quality of a processed part according to an embodiment of the present application;
图2本申请一实施例的预测加工件质量的装置结构示意图;Fig. 2 is a schematic structural diagram of a device for predicting the quality of a processed part according to an embodiment of the present application;
图3本申请一实施例的计算机设备内部结构示意图。Fig. 3 is a schematic diagram of the internal structure of a computer device according to an embodiment of the present application.
本发明的最佳实施方式The best mode of the present invention
参照图1,本申请一实施例的预测加工件质量的方法,包括:1, a method for predicting the quality of a processed part according to an embodiment of the present application includes:
S1:判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征;S1: Determine whether the current data to be analyzed contains the mark features corresponding to the quality parameters of the defective processed parts;
S2:若是,则将所述当前待分析数据输入预测模型中进行预测分析,其中,所述预测模型至少包括Xgboost模型、RandomForest模型和深度学习神经网络模型中的两种,所述Xgboost模型通过线性回归模型进行了修正调整;S2: If yes, input the current to-be-analyzed data into a predictive model for predictive analysis, where the predictive model includes at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model, and the Xgboost model passes linearity The regression model has been revised and adjusted;
S3:按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果;S3: According to a preset method, the analysis results of at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model are summarized and processed to obtain a summary result;
S4:根据所述汇总结果判断所述当前待分析数据对应的加工件质量不达标的几率。S4: Determine the probability that the quality of the processed part corresponding to the current data to be analyzed does not meet the standard according to the summary result.
本实施例的当前待分析数据为加工件的特征数据,包括高度、宽度、长度、重量、密度、色彩均一度、表面平整度、硬度等产品特征数据,以及生产时间、生产环境参数、原料批次等加工信息特征数据。通过判断当前待分析数据中是否含有瑕疵加工件的质量参数对应的标志特征,当前待分析数据中含有瑕疵加工件的质量参数分别对应的标志特征时,则会触发预测模型进行预测分析。上述瑕疵加工件的质量参数包含于加工件的特征数据。比如加工件的密度对其质量达标的影响最大,则搜索当前待分析数据是否包括密度低的标志特征,若存在密度低的标志特征,则认为存在加工件质量不达标的风险,则会触发预设模型对所有特征数据进行分析预测。上述瑕疵加工件的质量参数相关联的标志特征除密度低外,还包括表面不平整、色彩不均一、尺寸不达标等,根据待测的加工件的质量要求不同而不同。比如,机密的小型齿轮加工工件,对其强度有非常高的要求,但是特殊的强度检测设备在检查完这个齿轮之后就会报废齿轮,所以需要通过小型齿轮的长、宽、高、重量和热力学成像等特征数据来判断齿轮的强度是否达标。The current data to be analyzed in this embodiment is the feature data of the processed part, including product feature data such as height, width, length, weight, density, color uniformity, surface flatness, hardness, etc., as well as production time, production environment parameters, and batches of raw materials. Feature data of secondary processing information. By judging whether the current to-be-analyzed data contains the mark features corresponding to the quality parameters of the defective workpiece, and when the current to-be-analyzed data contains the mark features corresponding to the quality parameters of the defective workpiece, the predictive model is triggered to perform predictive analysis. The quality parameters of the above-mentioned defective workpiece are included in the characteristic data of the workpiece. For example, the density of the processed part has the greatest impact on its quality compliance, so it is necessary to search whether the current data to be analyzed includes low-density signature features. If there is a low-density signature feature, it is considered that there is a risk that the quality of the processed part does not meet the standard, which will trigger the prediction. Suppose the model analyzes and predicts all characteristic data. In addition to low density, the above-mentioned flag characteristics associated with the quality parameters of defective processed parts also include uneven surface, uneven color, and substandard size, etc., which vary according to the quality requirements of the processed parts to be tested. For example, a confidential small gear processing workpiece has very high requirements for its strength, but the special strength testing equipment will scrap the gear after inspecting the gear, so it needs to pass the length, width, height, weight and thermodynamics of the small gear Use characteristic data such as imaging to determine whether the gear's strength is up to standard.
由于本实施例的模型训练中的样本数据为成千上万的加工件数据,瑕疵加工件占少数,表现为数据结构特征为稀疏特征,即多数数据赋值为零,导致模型训练时无法体现数据区分度,影响模型训练的区别分析效果。本实施例的Xgboost模型经过特定的修正过程,使其满足对稀疏数据的区别分析,深度学习神经网络模型通过设计特定的构建结构使满足对 稀疏数据的区别分析。然后根据stacking模型对上述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行融合处理,以突出机器学习模型的主体结果,同时对于机器学习模型的错误分析内容进行深度学习修正。本实施例为上述三种模型同时并行对当前待分析数据分别进行分析后,再融合分析。上述融合结果即对当前待分析数据的风险得分评价,得分越高说明当前待分析数据预示的加工件质量不达标的几率大。依据Xgboost模型的特征以及Xgboost模型用于加工件质量预测时的稀疏数据特征,设计了Xgboost模型的损失函数,以通过将对数最大相似性作为损失函数,并通过线性回归进行修正,使Xgboost模型更适用于稀疏数据的特征分析,使预测加工件质量的方法能够进行工业化普遍推广。通过将Random Forest模型、深度学习神经网络模型以及修正后的Xgboost模型分别通过预处理后的特征数据各自训练后,通过将待测数据样本分别输入到上述三模型中进行分析,得到三个分析结果,通过stacking模型融合上述三个结果,实现对样本数据对应加工件的质量预测,改变数据的稀疏特征对模型的不良影响。Since the sample data in the model training of this embodiment is thousands of processed parts data, defective processed parts account for a small number, and the data structure features are sparse features, that is, most of the data is assigned a value of zero, which results in the failure to reflect the data during model training. The degree of discrimination affects the discriminative analysis effect of model training. The Xgboost model of this embodiment undergoes a specific correction process to make it meet the differential analysis of sparse data, and the deep learning neural network model meets the differential analysis of sparse data by designing a specific construction structure. Then, according to the stacking model, the analysis results of at least two of the above-mentioned Xgboost model, RandomForest model and deep learning neural network model are fused to highlight the main results of the machine learning model, and the error analysis content of the machine learning model is carried out in-depth Learn to correct. In this embodiment, the above-mentioned three models simultaneously analyze the current data to be analyzed in parallel, and then merge the analysis. The above fusion result is an evaluation of the risk score of the current data to be analyzed. The higher the score, the higher the probability that the quality of the processed parts predicted by the current data to be analyzed will not meet the standard. According to the characteristics of the Xgboost model and the sparse data characteristics when the Xgboost model is used to predict the quality of processed parts, the loss function of the Xgboost model is designed to use the logarithmic maximum similarity as the loss function and correct it through linear regression to make the Xgboost model It is more suitable for feature analysis of sparse data, so that the method of predicting the quality of processed parts can be widely promoted in industrialization. After training the Random Forest model, the deep learning neural network model, and the modified Xgboost model through the preprocessed feature data, respectively, the data samples to be tested are input into the above three models for analysis, and three analysis results are obtained. Integrate the above three results through the stacking model to realize the quality prediction of the processed parts corresponding to the sample data, and change the negative impact of the sparse feature of the data on the model.
进一步地,所述Xgboost模型的损失函数依据对数最大相似性构建得到,所述判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征的步骤S1之前,包括:Further, the loss function of the Xgboost model is constructed based on the logarithm maximum similarity, and the step S1 of judging whether the current data to be analyzed contains the mark feature corresponding to the quality parameter of the defective workpiece includes:
S11:以损失函数梯度矩阵的二维范数
Figure PCTCN2020099475-appb-000001
为基准,运用线性回归模型构成所述Xgboost模型的目标函数
Figure PCTCN2020099475-appb-000002
其中,所述损失函数为
Figure PCTCN2020099475-appb-000003
y是指真实的结果,x是指输入的样本数据,θ是指Xgboost模型中各函数权重,P(|)是条件概率,ω为线性回归模型中每个变量的权重,J(ω;X,y)是Xgboost模型通过输入样本数据得到的预测结果和实际结果之间的差值,α为权重向量配置比例;
S11: Take the two-dimensional norm of the gradient matrix of the loss function
Figure PCTCN2020099475-appb-000001
As a benchmark, a linear regression model is used to form the objective function of the Xgboost model
Figure PCTCN2020099475-appb-000002
Wherein, the loss function is
Figure PCTCN2020099475-appb-000003
y refers to the real result, x refers to the input sample data, θ refers to the weight of each function in the Xgboost model, P(|) is the conditional probability, ω is the weight of each variable in the linear regression model, J(ω; X , Y) is the difference between the predicted result and the actual result obtained by the Xgboost model by inputting sample data, and α is the weight vector configuration ratio;
S12:根据所述目标函数对所述Xgboost模型的梯度进行梯度优化,其中,梯度优化的优化方向为
Figure PCTCN2020099475-appb-000004
X是指输入的样本数据,ω为线性回归模型中每个变量的权重,J(ω;X,y)是Xgboost模型通过输入样本数据得到的预测结果和实际结果之间的差值,α为权重向量配置比例;
S12: Perform gradient optimization on the gradient of the Xgboost model according to the objective function, where the optimization direction of the gradient optimization is
Figure PCTCN2020099475-appb-000004
X refers to the input sample data, ω is the weight of each variable in the linear regression model, J(ω; X, y) is the difference between the predicted result and the actual result obtained by the Xgboost model through the input sample data, α is Weight vector configuration ratio;
S13:将预处理后的样本数据输入到进行梯度优化的所述Xgboost模型中进行训练,以确定优化后的所述Xgboost模型的学习参数。S13: Input the preprocessed sample data into the Xgboost model for gradient optimization for training, so as to determine the learning parameters of the optimized Xgboost model.
本实施例中,过大的学习参数会导致Xgboost优化趋近过程中出现步幅过快的问题,这种情况下很容易在优化到一个程度之后让优化方向和优化数量变成无穷大,表现为梯度爆炸;如果学习参数过小,会让Xgboost缓慢趋近最优结果,但是过慢的优化趋近可能会让它陷入局部最小值的陷阱,如果陷入局部最小值则表现为过拟合,通过上述损失函数运用线性回归模型构成Xgboost模型的目标函数,实现对Xgboost模型中的学习参数进行适应性调整,以通过线性回归模型甄选出最适宜上述稀疏数据分析结果的学习参数,确保Xgboost模型具有合适的优化速度。由于Xgboost模型使用的优化趋近方式是损失函数的二阶泰勒展开和节点值作为目标函数,所以损失函数的形态会很大程度上决定Xgboost模型性能的极限。由于Xgboost模型的节点激活方式是sigmoid,对于输入的高维度稀疏结构数据情况下,Xgboost模型优化的过程中,每一次优化方向和优化量主要是来自于损失函数的一阶导数和二阶导数决定,使用对数最大相似性作为损失函数,会让每次优化的趋近步幅不会过大,避免出现f’->infi的情况,即过拟合,也不会出现f’->0的情况,即梯度爆炸,以保证优化顺利进行。In this embodiment, too large learning parameters will cause the problem of too fast stride in the Xgboost optimization approaching process. In this case, it is easy to make the optimization direction and optimization number become infinite after optimization to a certain degree, which is expressed as Gradient explosion; if the learning parameter is too small, Xgboost will slowly approach the optimal result, but too slow optimization approach may make it fall into the trap of local minimum. If it falls into the local minimum, it will be overfitting. Pass The above loss function uses a linear regression model to form the objective function of the Xgboost model to achieve adaptive adjustments to the learning parameters in the Xgboost model, so as to select the most suitable learning parameters for the above sparse data analysis results through the linear regression model to ensure that the Xgboost model is suitable Optimization speed. Since the optimization approach method used by the Xgboost model is the second-order Taylor expansion of the loss function and the node value as the objective function, the shape of the loss function will largely determine the limit of the performance of the Xgboost model. Since the node activation method of the Xgboost model is sigmoid, for the input high-dimensional sparse structure data, during the optimization process of the Xgboost model, each optimization direction and optimization amount are mainly determined by the first derivative and the second derivative of the loss function , Using the logarithmic maximum similarity as the loss function will prevent the approaching stride of each optimization from being too large, avoiding the situation of f'->infi, that is, overfitting, and f'->0 will not appear In the case of gradient explosion, to ensure the smooth progress of the optimization.
上述预处理的样本数据指对样本数据通过指定预处理方式后的数据,上述预处理方式 包括对样本数据中的加工件的特征数据进行分类,运用蒙特卡洛树搜索等统计学方法筛选用于模型训练中的特征数据,以提高通过各种特征确定加工件走向质量不达标的发展趋势路径,提高预测精准度。同时运用Random Froes模型筛选与加工件质量相关的因素,两者寻找交集,得到与质量不达标相关联的加工件的质量参数分组,形成各质量参数分组对应的发展为不达标的产品种类的各趋势路径,比如通过密度特征数据发展为不达标加工件的趋势路径、通过色彩特征数据、重量特征数据等发展为不达标加工件的趋势路径等等。通过进一步深挖与不达标的产品种类或质量参数分组相关的标志特征的特征数据,通过归一化、正则化等操作,以达到降维的目的。预处理的样本数据包括上述特征数据以及发展为不达标的产品种类的各趋势路径。举例地,样本数据为水轮机叶片的特征数据,水轮机叶片是巨大的加工件,水轮机叶片的重心是否符合标准要求,是需要的预测标准,但现实生产中无法实现对每个水轮机叶片进行重心达标检测。但若获得了水轮机叶片的长、宽、高、重量等特征数据后,输入上述模型进行预测分析。比如分别通过长、宽、高、重量等特征数据发展为水轮机叶片的重心不是在需要的位置的趋势路径。比如通过上述两者筛选到密度低很容易导致重心出现问题,则会特别关注密度这个特征数据,并将密度特征数据用于加工件不达标的几率模型预测,当分析有多个特征数据均有明显的影响趋势,则将多个特征数据同时输入到预测模型中进行预测。The above-mentioned pre-processed sample data refers to the data after the sample data has passed the specified pre-processing method. The above-mentioned pre-processing method includes classifying the characteristic data of the processed parts in the sample data, using Monte Carlo tree search and other statistical methods to filter The feature data in model training is used to improve the development trend path that determines the quality of processed parts through various features and improve the accuracy of prediction. At the same time, the Random Froes model is used to filter the factors related to the quality of the processed parts, and the two are searched for intersection, and the quality parameter groups of the processed parts related to the substandard quality are obtained, and each quality parameter group corresponding to the development of the substandard product category is formed. The trend path, such as the trend path of non-compliant workpieces developed through density feature data, and the trend path of non-compliant workpieces developed through color feature data, weight feature data, and so on. Through further digging into the feature data of the logo features related to the substandard product types or quality parameter groupings, through normalization, regularization and other operations, the purpose of dimensionality reduction can be achieved. The pre-processed sample data includes the above-mentioned characteristic data and various trend paths that develop into substandard product categories. For example, the sample data is the characteristic data of the turbine blades. The turbine blades are huge processed parts. Whether the center of gravity of the turbine blades meets the requirements of the standard is the required prediction standard. However, in actual production, it is impossible to detect the center of gravity compliance of each turbine blade. . However, if the characteristic data such as the length, width, height and weight of the turbine blade are obtained, input the above model for predictive analysis. For example, through characteristic data such as length, width, height, weight, etc., it is developed into a trend path that the center of gravity of the turbine blade is not at the required position. For example, if low density is selected through the above two, it is easy to cause problems in the center of gravity, and the characteristic data of density will be paid special attention, and the density characteristic data will be used to predict the probability of the machined part not meeting the standard. When the analysis has multiple characteristic data For obvious influence trends, input multiple feature data into the prediction model for prediction at the same time.
进一步地,所述深度学习神经网络模型包括放大层、解构层和学习层,所述判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征的步骤S1之前,包括:Further, the deep learning neural network model includes an amplification layer, a deconstruction layer, and a learning layer. Before step S1 of judging whether the current data to be analyzed contains the mark feature corresponding to the quality parameter of the defective workpiece, it includes:
S1a:选择构建元素分别构建所述深度学习神经网络模型的放大层、解构层和学习层,其中,所述放大层包括多层依次累积的隐藏层,所述解构层包括多层依次连接的卷积层,所述学习层包括多层依次累积的隐藏层;S1a: Select construction elements to respectively construct the amplification layer, deconstruction layer, and learning layer of the deep learning neural network model, wherein the amplification layer includes multiple layers of hidden layers accumulated in sequence, and the deconstruction layer includes multiple layers connected in sequence Multilayer, the learning layer includes multiple hidden layers accumulated in sequence;
S1b:依次连接所述放大层、解构层和学习层形成所述深度学习神经网络模型;S1b: Connect the amplification layer, the deconstruction layer, and the learning layer in sequence to form the deep learning neural network model;
S1c:将预处理后的样本数据输入到所述深度学习神经网络模型,以确定所述深度学习神经网络模型的模型参数。S1c: Input the preprocessed sample data into the deep learning neural network model to determine the model parameters of the deep learning neural network model.
本实施例中,由于样本数据为稀疏结构数据,需要对赋值为0的局部特征进行深层的细节放大、重新排列和解构,以便样本数据对于模型训练是有效的,即训练的模型对不同的样本数据实现区分分析。上述的深度学习神经网络模型以放大层为开始结构,以解构层为中间结构,以学习层为结尾结构。上述放大层由4层隐藏层构成,分别为依次连接的2^10ReLu、2^11ReLu、2^12ReLu和2^13ReLu,2^10ReLu与样本数据输入端相连,2^13ReLu与解构层相连。由于进入放大层各隐藏层的样本数据未进行卷积处理,因此,只对样本数据进行局部放大,以确定各样本数据之间的区别特征,实现将样本数据原本赋值为0局部特征进行细化区分。上述解构层由两层卷积层组成,各卷积层包括顺次连接的Bach Normalization*2^10和Average Pooling组成。通过解构层对样本数据进行重新排列和解构,获取原赋值为零的局部特征,经过放大后的细化区分对于整个样本数据的关联关系,得到不为0的赋值。将通过处理后的样本数据输入到学习层,进行学习训练,学习记忆通过处理后的样本数据的各特征。上述学习层由三层隐藏层组成,分别为依次连接的2^10ReLu、2^5ReLu和2^4ReLu,2^10ReLu连接解构层,2^4ReLu连接Softmax分类器相连。上述样本数据的预处理过程同上,不赘述。上述卷积层和隐藏层的数量根据具体训练过程中的优化程度以及计算量进行确定。In this embodiment, since the sample data is sparse structure data, it is necessary to perform deep detail enlargement, rearrangement, and deconstruction of the local features assigned to 0, so that the sample data is effective for model training, that is, the trained model responds to different samples. Data realizes differentiated analysis. The above-mentioned deep learning neural network model uses the amplification layer as the starting structure, the deconstruction layer as the intermediate structure, and the learning layer as the end structure. The above-mentioned amplification layer is composed of 4 hidden layers, which are 2^10ReLu, 2^11ReLu, 2^12ReLu and 2^13ReLu which are connected in sequence, 2^10ReLu is connected to the sample data input terminal, and 2^13ReLu is connected to the deconstruction layer. Since the sample data entering each hidden layer of the amplification layer is not subjected to convolution processing, only the sample data is partially enlarged to determine the distinguishing characteristics between the sample data, and the sample data is originally assigned to 0 local features for refinement distinguish. The above-mentioned deconstruction layer is composed of two convolutional layers, and each convolutional layer includes Bach Normalization*2^10 and Average Pooling which are sequentially connected. The sample data is rearranged and deconstructed through the deconstruction layer to obtain the local features with the original value of zero, and the association relationship to the entire sample data is distinguished after the magnification and refinement, and the value of non-zero is obtained. Input the processed sample data to the learning layer for learning and training, and learn and memorize the features of the processed sample data. The above-mentioned learning layer is composed of three hidden layers, respectively 2^10ReLu, 2^5ReLu and 2^4ReLu connected in sequence, 2^10ReLu is connected to the deconstruction layer, and 2^4ReLu is connected to the Softmax classifier. The preprocessing process of the above sample data is the same as above, and will not be repeated. The number of the aforementioned convolutional layers and hidden layers is determined according to the degree of optimization and the amount of calculation in the specific training process.
进一步地,S13或S1c步骤之前,包括:Further, before step S13 or S1c, it includes:
S101:判断是否接收到对所述预处理后的样本数据的修正指令;S101: Determine whether a correction instruction for the preprocessed sample data is received;
S102:若是,则将所述预处理后的样本数据按照区域选择指令进行局部放大显示,其中,所述区域选择指令根据用户点击屏幕时的映射区域发出并显示在所述映射区域的样本数据处,所述区域选择指令至少包括添加和删除;S102: If yes, the preprocessed sample data is partially enlarged and displayed according to the area selection instruction, where the area selection instruction is issued according to the mapping area when the user clicks on the screen and displayed at the sample data of the mapping area , The area selection instruction includes at least add and delete;
S103:根据接收到的所述区域选择指令的类型,对所述映射区域对应的样本数据进行 修正;S103: Correct the sample data corresponding to the mapping area according to the type of the received area selection instruction;
S104:将修正后的样本数据的显示状态恢复至局部放大显示前的状态。S104: Restore the display state of the corrected sample data to the state before the partial enlarged display.
本实施例经过预处理后的样本数据可接收人工修订,上述预处理后的样本数据为经过统计处理或模型处理等方式处理后的样本数据,推定为有利于提高预测模型的预测准确度的数据,提高通过样本数据训练预测模型后的预测精准度。上述局部放大是指不改变该特征所处于发展为不达标的产品种类的趋势路径中的连接关系,仅局部放大特征,以便对该特征进行精准修正。The sample data preprocessed in this embodiment can be manually revised. The preprocessed sample data is sample data processed through statistical processing or model processing, and it is presumed to be data that is beneficial to improving the prediction accuracy of the prediction model. , Improve the prediction accuracy after training the prediction model with sample data. The above-mentioned partial enlargement means that the connection relationship of the feature in the trend path of the product category that is developing into the substandard product category is not changed, and only the partial enlargement of the feature is performed, so that the feature can be accurately corrected.
进一步地,所述按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果的步骤S3,包括:Further, the step S3 of collecting the analysis results of at least two of the Xgboost model, the RandomForest model and the deep learning neural network model according to a preset method to obtain the summary result includes:
S30:将所述当前待分析数据分别输入至所述Xgboost模型、RandomForest模型和深度学习神经网络模型后,分别得到的分析结果;S30: After inputting the current data to be analyzed into the Xgboost model, RandomForest model, and deep learning neural network model, respectively, analysis results obtained respectively;
S31:将各所述分析结果对应的矩阵数据,按照所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重,进行加权平均得到所述汇总结果。S31: Perform a weighted average on the matrix data corresponding to each analysis result according to the weights corresponding to the Xgboost model, the RandomForest model, and the deep learning neural network model to obtain the summary result.
本实施例通过对Xgboost模型、RandomForest模型和深度学习神经网络模型对同一输入样本数据进行分析的分析结果,进行加权平均得到汇总结果,使得汇总结果能避开各模型的自身的缺陷影响,融合结果更符合客观实际,预测结果更精准。举例地,Xgboost模型、RandomForest模型和深度学习神经网络模型的权重分别为W1、W2和W3。Xgboost模型、RandomForest模型和深度学习神经网络模型对当前待分析数据的预测概率分别为n1、n2和n3,则汇总结果为M,汇总结果M=W1*n1+W2*n2+W3*n3,其中,n1、n2、n3和汇总结果M均为0到1之间的小数。In this embodiment, the analysis results of the same input sample data are analyzed by the Xgboost model, the RandomForest model, and the deep learning neural network model, and the weighted average is used to obtain the summary result, so that the summary result can avoid the influence of the defects of each model, and the result is merged It is more in line with objective reality and the forecast results are more accurate. For example, the weights of the Xgboost model, the RandomForest model, and the deep learning neural network model are W1, W2, and W3, respectively. The prediction probabilities of the Xgboost model, RandomForest model, and deep learning neural network model for the current data to be analyzed are n1, n2, and n3, respectively, then the summary result is M, and the summary result M=W1*n1+W2*n2+W3*n3, where , N1, n2, n3 and the summary result M are all decimals between 0 and 1.
进一步地,将各所述分析结果对应的矩阵数据,按照所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重,进行加权平均得到所述汇总结果的步骤S31之前,包括:Further, performing a weighted average of the matrix data corresponding to each of the analysis results according to the weights corresponding to the Xgboost model, the RandomForest model, and the deep learning neural network model to obtain the summary result before step S31 includes:
S311:将携带标签的样本数据,分别输入到所述Xgboost模型、RandomForest模型和深度学习神经网络模型中进行训练;S311: Input sample data carrying tags into the Xgboost model, RandomForest model, and deep learning neural network model for training;
S312:获取所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对所述携带标签的样本数据的反馈结果;S312: Obtain feedback results of the Xgboost model, the RandomForest model, and the deep learning neural network model on the labeled sample data;
S313:根据各所述反馈结果以及携带标签的赋值,通过线性回归模型,计算所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重。S313: Calculate the respective weights corresponding to the Xgboost model, RandomForest model, and deep learning neural network model through a linear regression model according to each of the feedback results and the assignment of the carrying tags.
本实施例中,所述Xgboost模型、RandomForest模型和深度学习神经网络模型,分别对所述携带标签的样本数据的反馈结果分别为x、y、z,多个携带标签的样本数据,则对应多组x、y、z以及每个标签对应的赋值t,标签对应的赋值t为0或1,当t为0时为不达标的标签,当t为1时为达标的标签。有多少个样本数据,就存在多少个x、y、z和t的组合,组成W1*x+W2*y+W3*z=t的组合,通过线性回归模型计算得到权重W1、W2和W3。本申请其他实施例中使用两个模型进行汇总分析,两个模型的汇总过程和原理与上述三个模型的汇总过程和原理近似,不赘述。In this embodiment, the Xgboost model, the RandomForest model, and the deep learning neural network model respectively give the feedback results of the sample data with tags x, y, and z. If there are multiple sample data with tags, the corresponding multiple Group x, y, z and the assignment t corresponding to each label, the assignment t corresponding to the label is 0 or 1, when t is 0, it is a label that does not meet the standard, and when t is 1, it is a label that meets the standard. There are as many sample data as there are combinations of x, y, z, and t to form a combination of W1*x+W2*y+W3*z=t, and the weights W1, W2, and W3 are calculated by linear regression model. In other embodiments of the present application, two models are used for summary analysis, and the summary process and principle of the two models are similar to the summary process and principle of the above three models, and will not be repeated.
参照图2,本申请一实施例的预测加工件质量的装置,包括:Referring to Fig. 2, a device for predicting the quality of a processed part according to an embodiment of the present application includes:
第一判断模块1,用于判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征;The first judgment module 1 is used for judging whether the current data to be analyzed contains the mark features corresponding to the quality parameters of the defective processed parts;
输入模块2,用于若当前待分析数据含有瑕疵加工件的质量参数对应的标志特征,则将所述当前待分析数据输入预测模型中进行预测分析,其中,所述预测模型至少包括Xgboost模型、RandomForest模型和深度学习神经网络模型中的两种,所述Xgboost模型通过线性回归模型进行了修正调整;The input module 2 is used to input the current to-be-analyzed data into a predictive model for predictive analysis if the current to-be-analyzed data contains the mark features corresponding to the quality parameters of the defective workpiece, wherein the predictive model includes at least the Xgboost model, Two of the RandomForest model and the deep learning neural network model, the Xgboost model has been modified and adjusted through a linear regression model;
汇总模块3,用于按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果;The summary module 3 is configured to summarize the analysis results of at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model according to a preset method to obtain a summary result;
第二判断模块4,用于根据所述汇总结果判断所述当前待分析数据对应的加工件质量不达标的几率。The second judgment module 4 is configured to judge the probability that the quality of the processed part corresponding to the current data to be analyzed does not meet the standard according to the summary result.
本实施例的当前待分析数据为加工件的特征数据,包括高度、宽度、长度、重量、密度、色彩均一度、表面平整度、硬度等产品特征数据,以及生产时间、生产环境参数、原料批次等加工信息特征数据。通过判断当前待分析数据中是否含有瑕疵加工件的质量参数对应的标志特征,当前待分析数据中含有瑕疵加工件的质量参数分别对应的标志特征时,则会触发预测模型进行预测分析。上述瑕疵加工件的质量参数包含于加工件的特征数据。比如加工件的密度对其质量达标的影响最大,则搜索当前待分析数据是否包括密度低的标志特征,若存在密度低的标志特征,则认为存在加工件质量不达标的风险,则会触发预设模型对所有特征数据进行分析预测。上述瑕疵加工件的质量参数相关联的标志特征除密度低外,还包括表面不平整、色彩不均一、尺寸不达标等,根据待测的加工件的质量要求不同而不同。比如,机密的小型齿轮加工工件,对其强度有非常高的要求,但是特殊的强度检测设备在检查完这个齿轮之后就会报废齿轮,所以需要通过小型齿轮的长、宽、高、重量和热力学成像等特征数据来判断齿轮的强度是否达标。The current data to be analyzed in this embodiment is the feature data of the processed part, including product feature data such as height, width, length, weight, density, color uniformity, surface flatness, hardness, etc., as well as production time, production environment parameters, and batches of raw materials. Feature data of secondary processing information. By judging whether the current to-be-analyzed data contains the mark features corresponding to the quality parameters of the defective workpiece, and when the current to-be-analyzed data contains the mark features corresponding to the quality parameters of the defective workpiece, the predictive model is triggered to perform predictive analysis. The quality parameters of the above-mentioned defective workpiece are included in the characteristic data of the workpiece. For example, the density of the processed part has the greatest impact on its quality compliance, so it is necessary to search whether the current data to be analyzed includes low-density signature features. If there is a low-density signature feature, it is considered that there is a risk that the quality of the processed part does not meet the standard, which will trigger the prediction. Suppose the model analyzes and predicts all characteristic data. In addition to low density, the above-mentioned flag characteristics associated with the quality parameters of defective processed parts also include uneven surface, uneven color, and substandard size, etc., which vary according to the quality requirements of the processed parts to be tested. For example, a confidential small gear processing workpiece has very high requirements for its strength, but the special strength testing equipment will scrap the gear after inspecting the gear, so it needs to pass the length, width, height, weight and thermodynamics of the small gear Use characteristic data such as imaging to determine whether the gear's strength is up to standard.
由于本实施例的模型训练中的样本数据为成千上万的加工件数据,瑕疵加工件占少数,表现为数据结构特征为稀疏特征,即多数数据赋值为零,导致模型训练时无法体现数据区分度,影响模型训练的区别分析效果。本实施例的Xgboost模型经过特定的修正过程,使其满足对稀疏数据的区别分析,深度学习神经网络模型通过设计特定的构建结构使满足对稀疏数据的区别分析。然后根据stacking模型对上述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行融合处理,以突出机器学习模型的主体结果,同时对于机器学习模型的错误分析内容进行深度学习修正。本实施例为上述三种模型同时并行对当前待分析数据分别进行分析后,再融合分析。上述融合结果即对当前待分析数据的风险得分评价,得分越高说明当前待分析数据预示的加工件质量不达标的几率大。依据Xgboost模型的特征以及Xgboost模型用于加工件质量预测时的稀疏数据特征,设计了Xgboost模型的损失函数,以通过将对数最大相似性作为损失函数,并通过线性回归进行修正,使Xgboost模型更适用于稀疏数据的特征分析,使预测加工件质量的方法能够进行工业化普遍推广。通过将Random Forest模型、深度学习神经网络模型以及修正后的Xgboost模型分别通过预处理后的特征数据各自训练后,通过将待测数据样本分别输入到上述三模型中进行分析,得到三个分析结果,通过stacking模型融合上述三个结果,实现对样本数据对应加工件的质量预测,改变数据的稀疏特征对模型的不良影响。Since the sample data in the model training of this embodiment is thousands of processed parts data, defective processed parts account for a small number, and the data structure features are sparse features, that is, most of the data is assigned a value of zero, which results in the failure to reflect the data during model training. The degree of discrimination affects the discriminative analysis effect of model training. The Xgboost model of this embodiment undergoes a specific correction process to make it meet the differential analysis of sparse data, and the deep learning neural network model meets the differential analysis of sparse data by designing a specific construction structure. Then, according to the stacking model, the analysis results of at least two of the above-mentioned Xgboost model, RandomForest model and deep learning neural network model are fused to highlight the main results of the machine learning model, and the error analysis content of the machine learning model is carried out in-depth Learn to correct. In this embodiment, the above-mentioned three models simultaneously analyze the current data to be analyzed in parallel, and then merge the analysis. The above fusion result is an evaluation of the risk score of the current data to be analyzed. The higher the score, the higher the probability that the quality of the processed parts predicted by the current data to be analyzed will not meet the standard. According to the characteristics of the Xgboost model and the sparse data characteristics when the Xgboost model is used to predict the quality of processed parts, the loss function of the Xgboost model is designed to use the logarithmic maximum similarity as the loss function and correct it through linear regression to make the Xgboost model It is more suitable for feature analysis of sparse data, so that the method of predicting the quality of processed parts can be widely promoted in industrialization. After training the Random Forest model, the deep learning neural network model, and the modified Xgboost model through the preprocessed feature data, respectively, the data samples to be tested are input into the above three models for analysis, and three analysis results are obtained. Integrate the above three results through the stacking model to realize the quality prediction of the processed parts corresponding to the sample data, and change the negative impact of the sparse feature of the data on the model.
进一步地,所述Xgboost模型的损失函数依据对数最大相似性构建得到,预测加工件质量的装置,包括:Further, the loss function of the Xgboost model is constructed according to the logarithmic maximum similarity, and the device for predicting the quality of the processed part includes:
构成模块,用于以损失函数梯度矩阵的二维范数
Figure PCTCN2020099475-appb-000005
为基准,运用线性回归模型构成所述Xgboost模型的目标函数
Figure PCTCN2020099475-appb-000006
其中,所述损失函数为
Figure PCTCN2020099475-appb-000007
y是指真实的结果,x是指输入的样本数据,θ是指Xgboost模型中各函数权重,P(|)是条件概率,ω为线性回归模型中每个变量的权重,J(ω;X,y)是Xgboost模型通过输入样本数据得到的预测结果和实际结果之间的差值,α为权重向量配置比例;
Forming module, used to calculate the two-dimensional norm of the loss function gradient matrix
Figure PCTCN2020099475-appb-000005
As a benchmark, a linear regression model is used to form the objective function of the Xgboost model
Figure PCTCN2020099475-appb-000006
Wherein, the loss function is
Figure PCTCN2020099475-appb-000007
y refers to the real result, x refers to the input sample data, θ refers to the weight of each function in the Xgboost model, P(|) is the conditional probability, ω is the weight of each variable in the linear regression model, J(ω; X , Y) is the difference between the predicted result and the actual result obtained by the Xgboost model by inputting sample data, and α is the weight vector configuration ratio;
优化模块,用于根据所述目标函数对所述Xgboost模型的梯度进行梯度优化,其中,梯度优化的优化方向为
Figure PCTCN2020099475-appb-000008
X是指输入的样本数据,ω 为线性回归模型中每个变量的权重,J(ω;X,y)是Xgboost模型通过输入样本数据得到的预测结果和实际结果之间的差值,α为权重向量配置比例;
The optimization module is used to perform gradient optimization on the gradient of the Xgboost model according to the objective function, wherein the optimization direction of the gradient optimization is
Figure PCTCN2020099475-appb-000008
X refers to the input sample data, ω is the weight of each variable in the linear regression model, J(ω; X, y) is the difference between the predicted result and the actual result obtained by the Xgboost model through the input sample data, α is Weight vector configuration ratio;
训练模块,用于将预处理后的样本数据输入到进行梯度优化的所述Xgboost模型中进行训练,以确定优化后的所述Xgboost模型的学习参数。The training module is used to input the preprocessed sample data into the Xgboost model for gradient optimization for training, so as to determine the learning parameters of the optimized Xgboost model.
本实施例中,过大的学习参数会导致Xgboost优化趋近过程中出现步幅过快的问题,这种情况下很容易在优化到一个程度之后让优化方向和优化数量变成无穷大,表现为梯度爆炸;如果学习参数过小,会让Xgboost缓慢趋近最优结果,但是过慢的优化趋近可能会让它陷入局部最小值的陷阱,如果陷入局部最小值则表现为过拟合,通过上述损失函数运用线性回归模型构成Xgboost模型的目标函数,实现对Xgboost模型中的学习参数进行适应性调整,以通过线性回归模型甄选出最适宜上述稀疏数据分析结果的学习参数,确保Xgboost模型具有合适的优化速度。由于Xgboost模型使用的优化趋近方式是损失函数的二阶泰勒展开和节点值作为目标函数,所以损失函数的形态会很大程度上决定Xgboost模型性能的极限。由于Xgboost模型的节点激活方式是sigmoid,对于输入的高维度稀疏结构数据情况下,Xgboost模型优化的过程中,每一次优化方向和优化量主要是来自于损失函数的一阶导数和二阶导数决定,使用对数最大相似性作为损失函数,会让每次优化的趋近步幅不会过大,避免出现f’->infi的情况,即过拟合,也不会出现f’->0的情况,即梯度爆炸,以保证优化顺利进行。In this embodiment, too large learning parameters will cause the problem of too fast stride in the Xgboost optimization approaching process. In this case, it is easy to make the optimization direction and optimization number become infinite after optimization to a certain degree, which is expressed as Gradient explosion; if the learning parameter is too small, Xgboost will slowly approach the optimal result, but too slow optimization approach may make it fall into the trap of local minimum. If it falls into the local minimum, it will be overfitting. Pass The above loss function uses a linear regression model to form the objective function of the Xgboost model to achieve adaptive adjustments to the learning parameters in the Xgboost model, so as to select the most suitable learning parameters for the above sparse data analysis results through the linear regression model to ensure that the Xgboost model is suitable Optimization speed. Since the optimization approach method used by the Xgboost model is the second-order Taylor expansion of the loss function and the node value as the objective function, the shape of the loss function will largely determine the limit of the performance of the Xgboost model. Since the node activation method of the Xgboost model is sigmoid, for the input high-dimensional sparse structure data, during the optimization process of the Xgboost model, each optimization direction and optimization amount are mainly determined by the first derivative and the second derivative of the loss function , Using the logarithmic maximum similarity as the loss function will prevent the approaching stride of each optimization from being too large, avoiding the situation of f'->infi, that is, overfitting, and f'->0 will not appear In the case of gradient explosion, to ensure the smooth progress of the optimization.
上述预处理的样本数据指对样本数据通过指定预处理方式后的数据,上述预处理方式包括对样本数据中的加工件的特征数据进行分类,运用蒙特卡洛树搜索等统计学方法筛选用于模型训练中的特征数据,以提高通过各种特征确定加工件走向质量不达标的发展趋势路径,提高预测精准度。同时运用Random Froes模型筛选与加工件质量相关的因素,两者寻找交集,得到与质量不达标相关联的加工件的质量参数分组,形成各质量参数分组对应的发展为不达标的产品种类的各趋势路径,比如通过密度特征数据发展为不达标加工件的趋势路径、通过色彩特征数据、重量特征数据等发展为不达标加工件的趋势路径等等。通过进一步深挖与不达标的产品种类或质量参数分组相关的标志特征的特征数据,通过归一化、正则化等操作,以达到降维的目的。预处理的样本数据包括上述特征数据以及发展为不达标的产品种类的各趋势路径。举例地,样本数据为水轮机叶片的特征数据,水轮机叶片是巨大的加工件,水轮机叶片的重心是否符合标准要求,是需要的预测标准,但现实生产中无法实现对每个水轮机叶片进行重心达标检测。但若获得了水轮机叶片的长、宽、高、重量等特征数据后,输入上述模型进行预测分析。比如分别通过长、宽、高、重量等特征数据发展为水轮机叶片的重心不是在需要的位置的趋势路径。比如通过上述两者筛选到密度低很容易导致重心出现问题,则会特别关注密度这个特征数据,并将密度特征数据用于加工件不达标的几率模型预测,当分析有多个特征数据均有明显的影响趋势,则将多个特征数据同时输入到预测模型中进行预测。The above-mentioned pre-processed sample data refers to the data after the sample data has passed the specified pre-processing method. The above-mentioned pre-processing method includes classifying the characteristic data of the processed parts in the sample data, using Monte Carlo tree search and other statistical methods to filter The feature data in model training is used to improve the development trend path that determines the quality of processed parts through various features and improve the accuracy of prediction. At the same time, the Random Froes model is used to filter the factors related to the quality of the processed parts, and the two are searched for intersection, and the quality parameter groups of the processed parts related to the substandard quality are obtained, and each quality parameter group corresponding to the development of the substandard product category is formed. The trend path, such as the trend path of non-compliant workpieces developed through density feature data, and the trend path of non-compliant workpieces developed through color feature data, weight feature data, and so on. Through further digging into the feature data of the logo features related to the substandard product types or quality parameter groupings, through normalization, regularization and other operations, the purpose of dimensionality reduction can be achieved. The pre-processed sample data includes the above-mentioned characteristic data and various trend paths that develop into substandard product categories. For example, the sample data is the characteristic data of the turbine blades. The turbine blades are huge processed parts. Whether the center of gravity of the turbine blades meets the requirements of the standard is the required prediction standard. However, in actual production, it is impossible to detect the center of gravity compliance of each turbine blade. . However, if the characteristic data such as the length, width, height and weight of the turbine blade are obtained, input the above model for predictive analysis. For example, through characteristic data such as length, width, height, weight, etc., it is developed into a trend path that the center of gravity of the turbine blade is not at the required position. For example, if low density is selected through the above two, it is easy to cause problems in the center of gravity, and the characteristic data of density will be paid special attention, and the density characteristic data will be used to predict the probability of the machined part not meeting the standard. When the analysis has multiple characteristic data For obvious influence trends, input multiple feature data into the prediction model for prediction at the same time.
进一步地,所述深度学习神经网络模型包括放大层、解构层和学习层,预测加工件质量的装置,包括:Further, the deep learning neural network model includes an amplification layer, a deconstruction layer, and a learning layer. The device for predicting the quality of the processed part includes:
选择模块,用于选择构建元素分别构建所述深度学习神经网络模型的放大层、解构层和学习层,其中,所述放大层包括多层依次累积的隐藏层,所述解构层包括多层依次连接的卷积层,所述学习层包括多层依次累积的隐藏层;The selection module is used to select construction elements to respectively construct the amplification layer, deconstruction layer, and learning layer of the deep learning neural network model, wherein the amplification layer includes multiple layers of hidden layers accumulated in sequence, and the deconstruction layer includes multiple layers in sequence A connected convolutional layer, where the learning layer includes multiple hidden layers accumulated in sequence;
连接模块,用于依次连接所述放大层、解构层和学习层形成所述深度学习神经网络模型;The connection module is configured to sequentially connect the amplification layer, the deconstruction layer, and the learning layer to form the deep learning neural network model;
确定模块,用于将预处理后的样本数据输入到所述深度学习神经网络模型,以确定所述深度学习神经网络模型的模型参数。The determining module is used to input the preprocessed sample data into the deep learning neural network model to determine the model parameters of the deep learning neural network model.
本实施例中,由于样本数据为稀疏结构数据,需要对赋值为0的局部特征进行深层的细节放大、重新排列和解构,以便样本数据对于模型训练是有效的,即训练的模型对不同 的样本数据实现区分分析。上述的深度学习神经网络模型以放大层为开始结构,以解构层为中间结构,以学习层为结尾结构。上述放大层由4层隐藏层构成,分别为依次连接的2^10ReLu、2^11ReLu、2^12ReLu和2^13ReLu,2^10ReLu与样本数据输入端相连,2^13ReLu与解构层相连。由于进入放大层各隐藏层的样本数据未进行卷积处理,因此,只对样本数据进行局部放大,以确定各样本数据之间的区别特征,实现将样本数据原本赋值为0局部特征进行细化区分。上述解构层由两层卷积层组成,各卷积层包括顺次连接的Bach Normalization*2^10和Average Pooling组成。通过解构层对样本数据进行重新排列和解构,获取原赋值为零的局部特征,经过放大后的细化区分对于整个样本数据的关联关系,得到不为0的赋值。将通过处理后的样本数据输入到学习层,进行学习训练,学习记忆通过处理后的样本数据的各特征。上述学习层由三层隐藏层组成,分别为依次连接的2^10ReLu、2^5ReLu和2^4ReLu,2^10ReLu连接解构层,2^4ReLu连接Softmax分类器相连。上述样本数据的预处理过程同上,不赘述。上述卷积层和隐藏层的数量根据具体训练过程中的优化程度以及计算量进行确定。In this embodiment, since the sample data is sparse structure data, it is necessary to perform deep detail enlargement, rearrangement, and deconstruction of the local features assigned to 0, so that the sample data is effective for model training, that is, the trained model responds to different samples. Data realizes differentiated analysis. The above-mentioned deep learning neural network model uses the amplification layer as the starting structure, the deconstruction layer as the intermediate structure, and the learning layer as the end structure. The above-mentioned amplification layer is composed of 4 hidden layers, which are 2^10ReLu, 2^11ReLu, 2^12ReLu and 2^13ReLu which are connected in sequence, 2^10ReLu is connected to the sample data input terminal, and 2^13ReLu is connected to the deconstruction layer. Since the sample data entering each hidden layer of the amplification layer is not subjected to convolution processing, only the sample data is partially enlarged to determine the distinguishing characteristics between the sample data, and the sample data is originally assigned to 0 local features for refinement distinguish. The above-mentioned deconstruction layer is composed of two convolutional layers, and each convolutional layer includes Bach Normalization*2^10 and Average Pooling which are sequentially connected. The sample data is rearranged and deconstructed through the deconstruction layer to obtain the local features with the original value of zero, and the association relationship to the entire sample data is distinguished after the magnification and refinement, and the value of non-zero is obtained. Input the processed sample data to the learning layer for learning and training, and learn and memorize the features of the processed sample data. The above-mentioned learning layer is composed of three hidden layers, respectively 2^10ReLu, 2^5ReLu and 2^4ReLu connected in sequence, 2^10ReLu is connected to the deconstruction layer, and 2^4ReLu is connected to the Softmax classifier. The preprocessing process of the above sample data is the same as above, and will not be repeated. The number of the aforementioned convolutional layers and hidden layers is determined according to the degree of optimization and the amount of calculation in the specific training process.
进一步地,预测加工件质量的装置,包括:Further, the device for predicting the quality of processed parts includes:
第三判断模块,用于判断是否接收到对所述预处理后的样本数据的修正指令;The third judgment module is used to judge whether a correction instruction to the preprocessed sample data is received;
放大模块,用于若接收到对所述预处理后的样本数据的修正指令,则将所述预处理后的样本数据按照区域选择指令进行局部放大显示,其中,所述区域选择指令根据用户点击屏幕时的映射区域发出并显示在所述映射区域的样本数据处,所述区域选择指令至少包括添加和删除;The enlargement module is configured to, if a correction instruction for the preprocessed sample data is received, the preprocessed sample data is partially enlarged and displayed according to the area selection instruction, wherein the area selection instruction is based on the user's click The mapping area on the screen is sent out and displayed at the sample data of the mapping area, and the area selection instruction includes at least adding and deleting;
修正模块,用于根据接收到的所述区域选择指令的类型,对所述映射区域对应的样本数据进行修正;The correction module is configured to correct the sample data corresponding to the mapping area according to the type of the received area selection instruction;
恢复模块,用于将修正后的样本数据的显示状态恢复至局部放大显示前的状态。The restoration module is used to restore the display state of the modified sample data to the state before the partial magnification display.
本实施例经过预处理后的样本数据可接收人工修订,上述预处理后的样本数据为经过统计处理或模型处理等方式处理后的样本数据,推定为有利于提高预测模型的预测准确度的数据,提高通过样本数据训练预测模型后的预测精准度。上述局部放大是指不改变该特征所处于发展为不达标的产品种类的趋势路径中的连接关系,仅局部放大特征,以便对该特征进行精准修正。The sample data preprocessed in this embodiment can be manually revised. The preprocessed sample data is sample data processed through statistical processing or model processing, and it is presumed to be data that is beneficial to improving the prediction accuracy of the prediction model. , Improve the prediction accuracy after training the prediction model with sample data. The above-mentioned partial enlargement means that the connection relationship of the feature in the trend path of the product category that is developing into the substandard product category is not changed, and only the partial enlargement of the feature is performed, so that the feature can be accurately corrected.
进一步地,所述汇总模块3,包括:Further, the summary module 3 includes:
第一输入单元,用于将所述当前待分析数据分别输入至所述Xgboost模型、RandomForest模型和深度学习神经网络模型后,分别得到的分析结果;The first input unit is configured to input the current data to be analyzed into the Xgboost model, the RandomForest model, and the deep learning neural network model to obtain analysis results respectively;
汇总单元,用于将各所述分析结果对应的矩阵数据,按照所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重,进行加权平均得到所述汇总结果。The summary unit is configured to perform a weighted average of the matrix data corresponding to each analysis result according to the weights corresponding to the Xgboost model, the RandomForest model and the deep learning neural network model to obtain the summary result.
本实施例通过对Xgboost模型、RandomForest模型和深度学习神经网络模型对同一输入样本数据进行分析的分析结果,进行加权平均得到汇总结果,使得汇总结果能避开各模型的自身的缺陷影响,融合结果更符合客观实际,预测结果更精准。举例地,Xgboost模型、RandomForest模型和深度学习神经网络模型的权重分别为W1、W2和W3。Xgboost模型、RandomForest模型和深度学习神经网络模型对当前待分析数据的预测概率分别为n1、n2和n3,则汇总结果为M,汇总结果M=W1*n1+W2*n2+W3*n3,其中,n1、n2、n3和汇总结果M均为0到1之间的小数。In this embodiment, the analysis results of the same input sample data are analyzed by the Xgboost model, the RandomForest model, and the deep learning neural network model, and the weighted average is used to obtain the summary result, so that the summary result can avoid the influence of the defects of each model, and the result is merged It is more in line with objective reality and the forecast results are more accurate. For example, the weights of the Xgboost model, the RandomForest model, and the deep learning neural network model are W1, W2, and W3, respectively. The prediction probabilities of the Xgboost model, RandomForest model, and deep learning neural network model for the current data to be analyzed are n1, n2, and n3, respectively, then the summary result is M, and the summary result M=W1*n1+W2*n2+W3*n3, where , N1, n2, n3 and the summary result M are all decimals between 0 and 1.
进一步地,所述汇总模块3,包括:Further, the summary module 3 includes:
第二输入单元,用于将携带标签的样本数据,分别输入到所述Xgboost模型、RandomForest模型和深度学习神经网络模型中进行训练;The second input unit is used to input the sample data carrying the label into the Xgboost model, the RandomForest model and the deep learning neural network model for training;
获取单元,用于获取所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对所述携带标签的样本数据的反馈结果;An obtaining unit, configured to obtain feedback results of the Xgboost model, the RandomForest model, and the deep learning neural network model on the labeled sample data;
计算单元,用于根据各所述反馈结果以及携带标签的赋值,通过线性回归模型,计算所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重。The calculation unit is used to calculate the respective weights of the Xgboost model, RandomForest model and deep learning neural network model through a linear regression model according to each of the feedback results and the assignment of the carrying tags.
本实施例中,所述Xgboost模型、RandomForest模型和深度学习神经网络模型,分别对所述携带标签的样本数据的反馈结果分别为x、y、z,多个携带标签的样本数据,则对应多组x、y、z以及每个标签对应的赋值t,标签对应的赋值t为0或1,当t为0时为不达标的标签,当t为1时为达标的标签。有多少个样本数据,就存在多少个x、y、z和t的组合,组成W1*x+W2*y+W3*z=t的组合,通过线性回归模型计算得到权重W1、W2和W3。本申请其他实施例中使用两个模型进行汇总分析,两个模型的汇总过程和原理与上述三个模型的汇总过程和原理近似,不赘述。In this embodiment, the Xgboost model, the RandomForest model, and the deep learning neural network model respectively give the feedback results of the sample data with tags x, y, and z. If there are multiple sample data with tags, the corresponding multiple Group x, y, z and the assignment t corresponding to each label, the assignment t corresponding to the label is 0 or 1, when t is 0, it is a label that does not meet the standard, and when t is 1, it is a label that meets the standard. There are as many sample data as there are combinations of x, y, z, and t to form a combination of W1*x+W2*y+W3*z=t, and the weights W1, W2, and W3 are calculated by linear regression model. In other embodiments of the present application, two models are used for summary analysis, and the summary process and principle of the two models are similar to the summary process and principle of the above three models, and will not be repeated.
参照图3,本申请实施例中还提供一种计算机设备,该计算机设备可以是服务器,其内部结构可以如图3所示。该计算机设备包括通过系统总线连接的处理器、存储器、网络接口和数据库。其中,该计算机设计的处理器用于提供计算和控制能力。该计算机设备的存储器包括非易失性存储介质、内存储器。该非易失性存储介质存储有操作系统、计算机程序和数据库。该内存器为非易失性存储介质中的操作系统和计算机程序的运行提供环境。该计算机设备的数据库用于存储预测加工件质量的过程需要的所有数据。该计算机设备的网络接口用于与外部的终端通过网络连接通信。该计算机程序被处理器执行时以实现预测加工件质量的方法。Referring to FIG. 3, an embodiment of the present application also provides a computer device. The computer device may be a server, and its internal structure may be as shown in FIG. 3. The computer equipment includes a processor, a memory, a network interface, and a database connected through a system bus. Among them, the processor designed by the computer is used to provide calculation and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, a computer program, and a database. The memory provides an environment for the operation of the operating system and the computer program in the non-volatile storage medium. The database of the computer equipment is used to store all the data needed in the process of predicting the quality of the workpiece. The network interface of the computer device is used to communicate with an external terminal through a network connection. The computer program is executed by the processor to realize the method of predicting the quality of the workpiece.
上述处理器执行上述预测加工件质量的方法,包括:判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征;若是,则将所述当前待分析数据输入预测模型中进行预测分析,其中,所述预测模型至少包括Xgboost模型、RandomForest模型和深度学习神经网络模型中的两种,所述Xgboost模型通过线性回归模型进行了修正调整;按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果;根据所述汇总结果判断所述当前待分析数据对应的加工件质量不达标的几率。The processor executes the method for predicting the quality of the processed part, including: judging whether the current data to be analyzed contains a mark feature corresponding to the quality parameter of the defective processed part; if so, inputting the current to-be-analyzed data into a predictive model for predictive analysis, Wherein, the prediction model includes at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model. The Xgboost model is modified and adjusted by a linear regression model; the Xgboost model, the RandomForest model, and the The analysis results of at least two models in the deep learning neural network model are summarized and processed to obtain a summary result; according to the summary result, the probability that the quality of the processed part corresponding to the current data to be analyzed is not up to standard is determined.
上述计算机设备,依据Xgboost模型的特征以及Xgboost模型用于加工件质量预测时的稀疏数据特征,设计了Xgboost模型的损失函数,以通过将对数最大相似性作为损失函数,并通过线性回归进行修正,使Xgboost模型更适用于稀疏数据的特征分析,使预测加工件质量的方法能够进行工业化普遍推广。通过将Random Forest模型、深度学习神经网络模型以及修正后的Xgboost模型分别通过预处理后的特征数据各自训练后,通过将待测数据样本分别输入到上述三模型中进行分析,得到三个分析结果,通过stacking模型融合上述三个结果,实现对样本数据对应加工件的质量预测,改变数据的稀疏特征对模型的不良影响。The above-mentioned computer equipment, based on the characteristics of the Xgboost model and the sparse data characteristics of the Xgboost model when used for the quality prediction of the processed parts, designed the loss function of the Xgboost model to use the logarithmic maximum similarity as the loss function and correct it through linear regression , Which makes the Xgboost model more suitable for feature analysis of sparse data, and enables the method of predicting the quality of processed parts to be widely promoted in industrialization. After training the Random Forest model, the deep learning neural network model, and the modified Xgboost model through the preprocessed feature data, respectively, the data samples to be tested are input into the above three models for analysis, and three analysis results are obtained. Integrate the above three results through the stacking model to realize the quality prediction of the processed parts corresponding to the sample data, and change the negative impact of the sparse feature of the data on the model.
在一个实施例中,所述Xgboost模型的损失函数依据对数最大相似性构建得到,上述处理器判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征的步骤之前,包括:以损失函数梯度矩阵的二维范数
Figure PCTCN2020099475-appb-000009
为基准,运用线性回归模型构成所述Xgboost模型的目标函数
Figure PCTCN2020099475-appb-000010
其中,所述损失函数为
Figure PCTCN2020099475-appb-000011
y是指真实的结果,x是指输入的样本数据,θ是指Xgboost模型中各函数权重,P(|)是条件概率,ω为线性回归模型中每个变量的权重,J(ω;X,y)是Xgboost模型通过输入样本数据得到的预测结果和实际结果之间的差值,α为权重向量配置比例;根据所述目标函数对所述Xgboost模型的梯度进行梯度优化,其 中,梯度优化的优化方向为
Figure PCTCN2020099475-appb-000012
X是指输入的样本数据,ω为线性回归模型中每个变量的权重,J(ω;X,y)是Xgboost模型通过输入样本数据得到的预测结果和实际结果之间的差值,α为权重向量配置比例;将预处理后的样本数据输入到进行梯度优化的所述Xgboost模型中进行训练,以确定优化后的所述Xgboost模型的学习参数。
In one embodiment, the loss function of the Xgboost model is constructed according to the logarithm maximum similarity, and the above-mentioned processor determines whether the current data to be analyzed contains the mark feature corresponding to the quality parameter of the defective workpiece before the step of: The two-dimensional norm of the function gradient matrix
Figure PCTCN2020099475-appb-000009
As a benchmark, a linear regression model is used to form the objective function of the Xgboost model
Figure PCTCN2020099475-appb-000010
Wherein, the loss function is
Figure PCTCN2020099475-appb-000011
y refers to the real result, x refers to the input sample data, θ refers to the weight of each function in the Xgboost model, P(|) is the conditional probability, ω is the weight of each variable in the linear regression model, J(ω; X , Y) is the difference between the Xgboost model's predicted result and the actual result obtained by inputting sample data, and α is the weight vector configuration ratio; the gradient of the Xgboost model is optimized according to the objective function, where the gradient is optimized The optimization direction is
Figure PCTCN2020099475-appb-000012
X refers to the input sample data, ω is the weight of each variable in the linear regression model, J(ω; X, y) is the difference between the predicted result and the actual result obtained by the Xgboost model through the input sample data, α is Weight vector configuration ratio; input pre-processed sample data into the Xgboost model for gradient optimization for training, so as to determine the learning parameters of the optimized Xgboost model.
在一个实施例中,所述深度学习神经网络模型包括放大层、解构层和学习层,上述处理器判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征的步骤之前,包括:选择构建元素分别构建所述深度学习神经网络模型的放大层、解构层和学习层,其中,所述放大层包括多层依次累积的隐藏层,所述解构层包括多层依次连接的卷积层,所述学习层包括多层依次累积的隐藏层;依次连接所述放大层、解构层和学习层形成所述深度学习神经网络模型;将预处理后的样本数据输入到所述深度学习神经网络模型,以确定所述深度学习神经网络模型的模型参数。In an embodiment, the deep learning neural network model includes an amplification layer, a deconstruction layer, and a learning layer. Before the step of determining whether the current data to be analyzed contains the flag characteristics corresponding to the quality parameters of the defective workpiece, the above-mentioned processor includes: selecting The construction elements respectively construct the amplification layer, deconstruction layer, and learning layer of the deep learning neural network model, wherein the amplification layer includes multiple layers of hidden layers accumulated in sequence, and the deconstruction layer includes multiple layers of convolutional layers connected in sequence, The learning layer includes multiple layers of hidden layers accumulated in sequence; sequentially connecting the amplification layer, deconstruction layer, and learning layer to form the deep learning neural network model; and inputting preprocessed sample data to the deep learning neural network model , To determine the model parameters of the deep learning neural network model.
在一个实施例中,上述处理器将预处理后的样本数据输入到进行梯度优化的所述Xgboost模型中进行训练,以确定优化后的所述Xgboost模型的学习参数的步骤之前,或将预处理后的样本数据输入到所述深度学习神经网络模型,以确定所述深度学习神经网络模型的模型参数的步骤之前,包括:判断是否接收到对所述预处理后的样本数据的修正指令;若是,则将所述预处理后的样本数据按照区域选择指令进行局部放大显示,其中,所述区域选择指令根据用户点击屏幕时的映射区域发出并显示在所述映射区域的样本数据处,所述区域选择指令至少包括添加和删除;根据接收到的所述区域选择指令的类型,对所述映射区域对应的样本数据进行修正;将修正后的样本数据的显示状态恢复至局部放大显示前的状态。In one embodiment, the processor inputs preprocessed sample data into the Xgboost model for gradient optimization for training, so as to determine the learning parameters of the optimized Xgboost model before the step, or preprocess Before the step of inputting the latter sample data into the deep learning neural network model to determine the model parameters of the deep learning neural network model, the method includes: determining whether a correction instruction for the preprocessed sample data is received; if so , The preprocessed sample data is partially enlarged and displayed according to the area selection instruction, where the area selection instruction is issued according to the mapping area when the user clicks on the screen and is displayed at the sample data of the mapping area. The area selection instruction includes at least add and delete; according to the type of the received area selection instruction, the sample data corresponding to the mapping area is corrected; the display state of the corrected sample data is restored to the state before the partial enlarged display .
在一个实施例中,上述处理器按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果的步骤,包括:将所述当前待分析数据分别输入至所述Xgboost模型、RandomForest模型和深度学习神经网络模型后,分别得到的分析结果;将各所述分析结果对应的矩阵数据,按照所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重,进行加权平均得到所述汇总结果。In one embodiment, the above-mentioned processor summarizes the analysis results of at least two of the Xgboost model, the RandomForest model and the deep learning neural network model according to a preset method, and the step of obtaining the summary result includes: combining all the analysis results of the Xgboost model, the RandomForest model, and the deep learning neural network model. The analysis results obtained after the current data to be analyzed are respectively input to the Xgboost model, RandomForest model, and deep learning neural network model; the matrix data corresponding to each analysis result is calculated according to the Xgboost model, RandomForest model and depth The weights corresponding to the neural network models are learned, and the weighted average is performed to obtain the summary result.
在一个实施例中,上述处理器将各所述分析结果对应的矩阵数据,按照所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重,进行加权平均得到所述汇总结果的步骤之前,包括:将携带标签的样本数据,分别输入到所述Xgboost模型、RandomForest模型和深度学习神经网络模型中进行训练;获取所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对所述携带标签的样本数据的反馈结果;根据各所述反馈结果以及携带标签的赋值,通过线性回归模型,计算所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重。In one embodiment, the above-mentioned processor performs a weighted average on the matrix data corresponding to each of the analysis results according to the weights corresponding to the Xgboost model, the RandomForest model, and the deep learning neural network model to obtain the summary result. , Including: inputting sample data carrying tags into the Xgboost model, RandomForest model, and deep learning neural network model for training; acquiring the Xgboost model, RandomForest model, and deep learning neural network model to perform training on the carrying tag The feedback results of the sample data; according to each of the feedback results and the assignment of the tags, the weights corresponding to the Xgboost model, the RandomForest model and the deep learning neural network model are calculated through a linear regression model.
本领域技术人员可以理解,图3中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备的限定。Those skilled in the art can understand that the structure shown in FIG. 3 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device to which the solution of the present application is applied.
本申请还提供一种计算机可读存储介质,所述计算机可读存储介质可以是非易失性,也可以是易失性,其上存储有计算机程序,计算机程序被处理器执行时实现预测加工件质量的方法,包括:判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征;若是,则将所述当前待分析数据输入预测模型中进行预测分析,其中,所述预测模型至少包括Xgboost模型、RandomForest模型和深度学习神经网络模型中的两种,所述Xgboost模型通过线性回归模型进行了修正调整;按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果;根据所述汇总结果判断所述当前待分析数据对应的加工件质量不达标的几率。The present application also provides a computer-readable storage medium. The computer-readable storage medium may be non-volatile or volatile. A computer program is stored thereon, and the computer program is executed by the processor to realize the prediction of the workpiece The quality method includes: judging whether the current data to be analyzed contains the mark features corresponding to the quality parameters of the defective processed parts; if so, inputting the current data to be analyzed into a predictive model for predictive analysis, wherein the predictive model at least includes Two of the Xgboost model, the RandomForest model and the deep learning neural network model. The Xgboost model is modified and adjusted by a linear regression model; at least one of the Xgboost model, the RandomForest model, and the deep learning neural network model is adjusted according to a preset method The analysis results of the two models are summarized and processed to obtain a summary result; according to the summary result, the probability that the quality of the processed part corresponding to the current data to be analyzed is not up to standard is judged.
上述计算机可读存储介质,依据Xgboost模型的特征以及Xgboost模型用于加工件质量预测时的稀疏数据特征,设计了Xgboost模型的损失函数,以通过将对数最大相似性作为损失函数,并通过线性回归进行修正,使Xgboost模型更适用于稀疏数据的特征分析,使预测加工件质量的方法能够进行工业化普遍推广。通过将Random Forest模型、深度学习神经网络模型以及修正后的Xgboost模型分别通过预处理后的特征数据各自训练后,通过将待测数据样本分别输入到上述三模型中进行分析,得到三个分析结果,通过stacking模型融合上述三个结果,实现对样本数据对应加工件的质量预测,改变数据的稀疏特征对模型的不良影响。The above-mentioned computer-readable storage medium, based on the characteristics of the Xgboost model and the sparse data characteristics when the Xgboost model is used for the quality prediction of processed parts, the loss function of the Xgboost model is designed to pass the logarithmic maximum similarity as the loss function and pass the linear Regression is modified to make the Xgboost model more suitable for feature analysis of sparse data, so that the method of predicting the quality of processed parts can be widely promoted in industrialization. After training the Random Forest model, the deep learning neural network model, and the modified Xgboost model through the preprocessed feature data, respectively, the data samples to be tested are input into the above three models for analysis, and three analysis results are obtained. Integrate the above three results through the stacking model to realize the quality prediction of the processed parts corresponding to the sample data, and change the negative impact of the sparse feature of the data on the model.
在一个实施例中,所述Xgboost模型的损失函数依据对数最大相似性构建得到,上述处理器判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征的步骤之前,包括:以损失函数梯度矩阵的二维范数
Figure PCTCN2020099475-appb-000013
为基准,运用线性回归模型构成所述Xgboost模型的目标函数
Figure PCTCN2020099475-appb-000014
其中,所述损失函数为
Figure PCTCN2020099475-appb-000015
y是指真实的结果,x是指输入的样本数据,θ是指Xgboost模型中各函数权重,P(|)是条件概率,ω为线性回归模型中每个变量的权重,J(ω;X,y)是Xgboost模型通过输入样本数据得到的预测结果和实际结果之间的差值,α为权重向量配置比例;根据所述目标函数对所述Xgboost模型的梯度进行梯度优化,其中,梯度优化的优化方向为
Figure PCTCN2020099475-appb-000016
X是指输入的样本数据,ω为线性回归模型中每个变量的权重,J(ω;X,y)是Xgboost模型通过输入样本数据得到的预测结果和实际结果之间的差值,α为权重向量配置比例;将预处理后的样本数据输入到进行梯度优化的所述Xgboost模型中进行训练,以确定优化后的所述Xgboost模型的学习参数。
In an embodiment, the loss function of the Xgboost model is constructed according to the logarithm maximum similarity, and the above-mentioned processor determines whether the current data to be analyzed contains the mark feature corresponding to the quality parameter of the defective workpiece before the step of: The two-dimensional norm of the function gradient matrix
Figure PCTCN2020099475-appb-000013
As a benchmark, a linear regression model is used to form the objective function of the Xgboost model
Figure PCTCN2020099475-appb-000014
Wherein, the loss function is
Figure PCTCN2020099475-appb-000015
y refers to the real result, x refers to the input sample data, θ refers to the weight of each function in the Xgboost model, P(|) is the conditional probability, ω is the weight of each variable in the linear regression model, J(ω; X , Y) is the difference between the predicted result of the Xgboost model obtained by inputting sample data and the actual result, α is the weight vector configuration ratio; the gradient of the Xgboost model is optimized according to the objective function, where the gradient is optimized The optimization direction is
Figure PCTCN2020099475-appb-000016
X refers to the input sample data, ω is the weight of each variable in the linear regression model, J(ω; X, y) is the difference between the predicted result and the actual result obtained by the Xgboost model through the input sample data, α is Weight vector configuration ratio; input pre-processed sample data into the Xgboost model for gradient optimization for training, so as to determine the learning parameters of the optimized Xgboost model.
在一个实施例中,所述深度学习神经网络模型包括放大层、解构层和学习层,上述处理器判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征的步骤之前,包括:选择构建元素分别构建所述深度学习神经网络模型的放大层、解构层和学习层,其中,所述放大层包括多层依次累积的隐藏层,所述解构层包括多层依次连接的卷积层,所述学习层包括多层依次累积的隐藏层;依次连接所述放大层、解构层和学习层形成所述深度学习神经网络模型;将预处理后的样本数据输入到所述深度学习神经网络模型,以确定所述深度学习神经网络模型的模型参数。In an embodiment, the deep learning neural network model includes an amplification layer, a deconstruction layer, and a learning layer. Before the step of determining whether the current data to be analyzed contains the flag characteristics corresponding to the quality parameters of the defective workpiece, the above-mentioned processor includes: selecting The construction elements respectively construct the amplification layer, deconstruction layer, and learning layer of the deep learning neural network model, wherein the amplification layer includes multiple layers of hidden layers accumulated in sequence, and the deconstruction layer includes multiple layers of convolutional layers connected in sequence, The learning layer includes multiple layers of hidden layers accumulated in sequence; sequentially connecting the amplification layer, deconstruction layer, and learning layer to form the deep learning neural network model; and inputting preprocessed sample data to the deep learning neural network model , To determine the model parameters of the deep learning neural network model.
在一个实施例中,上述处理器将预处理后的样本数据输入到进行梯度优化的所述Xgboost模型中进行训练,以确定优化后的所述Xgboost模型的学习参数的步骤之前,或将预处理后的样本数据输入到所述深度学习神经网络模型,以确定所述深度学习神经网络模型的模型参数的步骤之前,包括:判断是否接收到对所述预处理后的样本数据的修正指令;若是,则将所述预处理后的样本数据按照区域选择指令进行局部放大显示,其中,所述区域选择指令根据用户点击屏幕时的映射区域发出并显示在所述映射区域的样本数据处,所述区域选择指令至少包括添加和删除;根据接收到的所述区域选择指令的类型,对所述映射区域对应的样本数据进行修正;将修正后的样本数据的显示状态恢复至局部放大显示前的状态。In one embodiment, the processor inputs preprocessed sample data into the Xgboost model for gradient optimization for training, so as to determine the learning parameters of the optimized Xgboost model before the step, or preprocess Before the step of inputting the latter sample data into the deep learning neural network model to determine the model parameters of the deep learning neural network model, the method includes: determining whether a correction instruction for the preprocessed sample data is received; if so , The preprocessed sample data is partially enlarged and displayed according to the area selection instruction, where the area selection instruction is issued according to the mapping area when the user clicks on the screen and is displayed at the sample data of the mapping area. The area selection instruction includes at least add and delete; according to the type of the received area selection instruction, the sample data corresponding to the mapping area is corrected; the display state of the corrected sample data is restored to the state before the partial enlarged display .
在一个实施例中,上述处理器按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果的步骤,包括:将所述当前待分析数据分别输入至所述Xgboost模型、RandomForest模型和深 度学习神经网络模型后,分别得到的分析结果;将各所述分析结果对应的矩阵数据,按照所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重,进行加权平均得到所述汇总结果。In one embodiment, the above-mentioned processor summarizes the analysis results of at least two of the Xgboost model, the RandomForest model and the deep learning neural network model according to a preset method, and the step of obtaining the summary result includes: combining all the analysis results of the Xgboost model, the RandomForest model, and the deep learning neural network model. The analysis results obtained after the current data to be analyzed are respectively input to the Xgboost model, RandomForest model, and deep learning neural network model; the matrix data corresponding to each analysis result is calculated according to the Xgboost model, RandomForest model and depth The weights corresponding to the neural network models are learned, and the weighted average is performed to obtain the summary result.
在一个实施例中,上述处理器将各所述分析结果对应的矩阵数据,按照所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重,进行加权平均得到所述汇总结果的步骤之前,包括:将携带标签的样本数据,分别输入到所述Xgboost模型、RandomForest模型和深度学习神经网络模型中进行训练;获取所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对所述携带标签的样本数据的反馈结果;根据各所述反馈结果以及携带标签的赋值,通过线性回归模型,计算所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重。In one embodiment, the above-mentioned processor performs a weighted average on the matrix data corresponding to each of the analysis results according to the weights corresponding to the Xgboost model, the RandomForest model, and the deep learning neural network model to obtain the summary result. , Including: inputting sample data carrying tags into the Xgboost model, RandomForest model, and deep learning neural network model for training; acquiring the Xgboost model, RandomForest model, and deep learning neural network model to perform training on the carrying tag The feedback results of the sample data; according to each of the feedback results and the assignment of the tags, the weights corresponding to the Xgboost model, the RandomForest model and the deep learning neural network model are calculated through a linear regression model.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成,上述的计算机程序可存储于一非易失性计算机可读取存储介质中,该计算机程序在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的和实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。非易失性存储器可以包括只读存储器(ROM)、可编程ROM(PROM)、电可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM(SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双速据率SDRAM(SSRSDRAM)、增强型SDRAM(ESDRAM)、同步链路(Synchlink)DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。A person of ordinary skill in the art can understand that all or part of the processes in the above-mentioned embodiment methods can be implemented by computer programs instructing relevant hardware. The above-mentioned computer programs can be stored in a non-volatile computer readable storage medium. Here, when the computer program is executed, it may include the procedures of the above-mentioned method embodiments. Wherein, any reference to memory, storage, database or other media provided in this application and used in the embodiments may include non-volatile and/or volatile memory. Non-volatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory may include random access memory (RAM) or external cache memory. As an illustration and not a limitation, RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), dual-rate data rate SDRAM (SSRSDRAM), enhanced SDRAM (ESDRAM), synchronous Link (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、装置、物品或者方法不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、装置、物品或者方法所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、装置、物品或者方法中还存在另外的相同要素。It should be noted that in this article, the terms "include", "include" or any other variants thereof are intended to cover non-exclusive inclusion, so that a process, device, article or method including a series of elements not only includes those elements, It also includes other elements not explicitly listed, or elements inherent to the process, device, article, or method. If there are no more restrictions, the element defined by the sentence "including a..." does not exclude the existence of other identical elements in the process, device, article, or method that includes the element.
以上所述仅为本申请的优选实施例,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利保护范围内。The above are only the preferred embodiments of this application, and do not limit the scope of this application. Any equivalent structure or equivalent process transformation made using the content of the specification and drawings of this application, or directly or indirectly applied to other related The technical field is equally included in the scope of patent protection of this application.

Claims (20)

  1. 一种预测加工件质量的方法,其中,包括:A method for predicting the quality of machined parts, which includes:
    判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征;Determine whether the current data to be analyzed contains the mark characteristics corresponding to the quality parameters of the defective workpiece;
    若是,则将所述当前待分析数据输入预测模型中进行预测分析,其中,所述预测模型至少包括Xgboost模型、RandomForest模型和深度学习神经网络模型中的两种,所述Xgboost模型通过线性回归模型进行了修正调整;If yes, input the current data to be analyzed into a prediction model for prediction analysis, where the prediction model includes at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model, and the Xgboost model adopts a linear regression model Made corrections and adjustments;
    按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果;Summarizing the analysis results of at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model according to a preset method, to obtain a summary result;
    根据所述汇总结果判断所述当前待分析数据对应的加工件质量不达标的几率。The probability that the quality of the processed part corresponding to the current data to be analyzed does not meet the standard is determined according to the summary result.
  2. 根据权利要求1所述的预测加工件质量的方法,其中,所述Xgboost模型的损失函数依据对数最大相似性构建得到,所述判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征的步骤之前,包括:The method for predicting the quality of a machined part according to claim 1, wherein the loss function of the Xgboost model is constructed based on logarithmic maximum similarity, and the judgment whether the current data to be analyzed contains a mark corresponding to the quality parameter of the defective machined part Before the characteristic steps, include:
    以损失函数梯度矩阵的二维范数
    Figure PCTCN2020099475-appb-100001
    为基准,运用线性回归模型构成所述Xgboost模型的目标函数
    Figure PCTCN2020099475-appb-100002
    其中,所述损失函数为
    Figure PCTCN2020099475-appb-100003
    y是指真实的结果,x是指输入的样本数据,θ是指Xgboost模型中各函数权重,P(|)是条件概率,ω为线性回归模型中每个变量的权重,J(ω;X,y)是Xgboost模型通过输入样本数据得到的预测结果和实际结果之间的差值,α为权重向量配置比例;
    Take the two-dimensional norm of the gradient matrix of the loss function
    Figure PCTCN2020099475-appb-100001
    As a benchmark, a linear regression model is used to form the objective function of the Xgboost model
    Figure PCTCN2020099475-appb-100002
    Wherein, the loss function is
    Figure PCTCN2020099475-appb-100003
    y refers to the real result, x refers to the input sample data, θ refers to the weight of each function in the Xgboost model, P(|) is the conditional probability, ω is the weight of each variable in the linear regression model, J(ω; X , Y) is the difference between the predicted result and the actual result obtained by the Xgboost model by inputting sample data, and α is the weight vector configuration ratio;
    根据所述目标函数对所述Xgboost模型的梯度进行梯度优化,其中,梯度优化的优化方向为
    Figure PCTCN2020099475-appb-100004
    X是指输入的样本数据,ω为线性回归模型中每个变量的权重,J(ω;X,y)是Xgboost模型通过输入样本数据得到的预测结果和实际结果之间的差值,α为权重向量配置比例;
    Perform gradient optimization on the gradient of the Xgboost model according to the objective function, where the optimization direction of the gradient optimization is
    Figure PCTCN2020099475-appb-100004
    X refers to the input sample data, ω is the weight of each variable in the linear regression model, J(ω; X, y) is the difference between the predicted result and the actual result obtained by the Xgboost model through the input sample data, α is Weight vector configuration ratio;
    将预处理后的样本数据输入到进行梯度优化的所述Xgboost模型中进行训练,以确定优化后的所述Xgboost模型的学习参数。The preprocessed sample data is input into the Xgboost model for gradient optimization for training, so as to determine the learning parameters of the optimized Xgboost model.
  3. 根据权利要求1所述的预测加工件质量的方法,其中,所述深度学习神经网络模型包括放大层、解构层和学习层,所述判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征的步骤之前,包括:The method for predicting the quality of a machined part according to claim 1, wherein the deep learning neural network model includes an amplification layer, a deconstruction layer and a learning layer, and the judging whether the current data to be analyzed contains the quality parameter corresponding to the defective machined part Before the steps to mark features, include:
    选择构建元素分别构建所述深度学习神经网络模型的放大层、解构层和学习层,其中,所述放大层包括多层依次累积的隐藏层,所述解构层包括多层依次连接的卷积层,所述学习层包括多层依次累积的隐藏层;The construction elements are selected to construct the amplification layer, the deconstruction layer, and the learning layer of the deep learning neural network model, wherein the amplification layer includes multiple layers of hidden layers accumulated in sequence, and the deconstruction layer includes multiple layers of convolutional layers connected in sequence , The learning layer includes multiple hidden layers accumulated in sequence;
    依次连接所述放大层、解构层和学习层形成所述深度学习神经网络模型;Sequentially connecting the amplification layer, the deconstruction layer, and the learning layer to form the deep learning neural network model;
    将预处理后的样本数据输入到所述深度学习神经网络模型,以确定所述深度学习神经网络模型的模型参数。The preprocessed sample data is input to the deep learning neural network model to determine the model parameters of the deep learning neural network model.
  4. 根据权利要求2或3所述的预测加工件质量的方法,其中,所述将预处理后的样本数据输入到进行梯度优化的所述Xgboost模型中进行训练,以确定优化后的所述Xgboost模型的学习参数的步骤之前,或将预处理后的样本数据输入到所述深度学习神经网络模型,以确定所述深度学习神经网络模型的模型参数的步骤之前,包括:The method for predicting the quality of a workpiece according to claim 2 or 3, wherein the pre-processed sample data is input into the Xgboost model for gradient optimization for training, so as to determine the optimized Xgboost model Before the step of learning parameters, or before the step of inputting pre-processed sample data into the deep learning neural network model to determine the model parameters of the deep learning neural network model, it includes:
    判断是否接收到对所述预处理后的样本数据的修正指令;Judging whether a correction instruction to the preprocessed sample data is received;
    若是,则将所述预处理后的样本数据按照区域选择指令进行局部放大显示,其中,所述区域选择指令根据用户点击屏幕时的映射区域发出并显示在所述映射区域的样本数据 处,所述区域选择指令至少包括添加和删除;If yes, the preprocessed sample data is partially enlarged and displayed according to the area selection instruction, where the area selection instruction is issued according to the mapping area when the user clicks on the screen and displayed at the sample data of the mapping area, so The area selection instructions include at least add and delete;
    根据接收到的所述区域选择指令的类型,对所述映射区域对应的样本数据进行修正;Correcting the sample data corresponding to the mapping area according to the type of the received area selection instruction;
    将修正后的样本数据的显示状态恢复至局部放大显示前的状态。Restore the display state of the corrected sample data to the state before the partial enlarged display.
  5. 根据权利要求1所述的预测加工件质量的方法,其中,所述按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果的步骤,包括:The method for predicting the quality of a machined part according to claim 1, wherein the analysis results of at least two of the Xgboost model, RandomForest model, and deep learning neural network model are summarized and processed according to a preset method to obtain The steps to summarize the results include:
    将所述当前待分析数据分别输入至所述Xgboost模型、RandomForest模型和深度学习神经网络模型后,分别得到的分析结果;The analysis results obtained after inputting the current data to be analyzed into the Xgboost model, RandomForest model and deep learning neural network model respectively;
    将各所述分析结果对应的矩阵数据,按照所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重,进行加权平均得到所述汇总结果。The matrix data corresponding to each analysis result is weighted and averaged according to the weights corresponding to the Xgboost model, the RandomForest model, and the deep learning neural network model to obtain the summary result.
  6. 根据权利要求5所述的预测加工件质量的方法,其中,将各所述分析结果对应的矩阵数据,按照所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重,进行加权平均得到所述汇总结果的步骤之前,包括:The method for predicting the quality of a workpiece according to claim 5, wherein the matrix data corresponding to each of the analysis results is weighted and averaged according to the weights corresponding to the Xgboost model, the RandomForest model, and the deep learning neural network model. Before the step of summarizing the results, it includes:
    将携带标签的样本数据,分别输入到所述Xgboost模型、RandomForest模型和深度学习神经网络模型中进行训练;Input the sample data carrying labels into the Xgboost model, RandomForest model and deep learning neural network model for training;
    获取所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对所述携带标签的样本数据的反馈结果;Acquiring the feedback results of the Xgboost model, the RandomForest model, and the deep learning neural network model on the sample data with tags;
    根据各所述反馈结果以及携带标签的赋值,通过线性回归模型,计算所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重。According to each of the feedback results and the assignment of the tags, the weights corresponding to the Xgboost model, the RandomForest model and the deep learning neural network model are calculated through a linear regression model.
  7. 一种预测加工件质量的装置,其中,包括:A device for predicting the quality of processed parts, which includes:
    第一判断模块,用于判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征;The first judging module is used to judge whether the current data to be analyzed contains the mark feature corresponding to the quality parameter of the defective processed part;
    输入模块,用于若当前待分析数据含有瑕疵加工件的质量参数对应的标志特征,则将所述当前待分析数据输入预测模型中进行预测分析,其中,所述预测模型至少包括Xgboost模型、RandomForest模型和深度学习神经网络模型中的两种,所述Xgboost模型通过线性回归模型进行了修正调整;The input module is used to input the current to-be-analyzed data into a predictive model for predictive analysis if the current to-be-analyzed data contains the mark characteristics corresponding to the quality parameters of the defective workpiece, wherein the predictive model includes at least the Xgboost model and RandomForest Two of the model and the deep learning neural network model, the Xgboost model has been modified and adjusted through a linear regression model;
    汇总模块,用于按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果;The summary module is configured to summarize the analysis results of at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model according to a preset method to obtain a summary result;
    第二判断模块,用于根据所述汇总结果判断所述当前待分析数据对应的加工件质量不达标的几率。The second judgment module is used for judging the probability that the quality of the processed part corresponding to the current data to be analyzed does not meet the standard according to the summary result.
  8. 根据权利要求7所述的预测加工件质量的装置,其中,所述Xgboost模型的损失函数依据对数最大相似性构建得到,装置包括:The device for predicting the quality of a processed part according to claim 7, wherein the loss function of the Xgboost model is constructed based on logarithmic maximum similarity, and the device comprises:
    构成模块,用于以损失函数梯度矩阵的二维范数
    Figure PCTCN2020099475-appb-100005
    为基准,运用线性回归模型构成所述Xgboost模型的目标函数
    Figure PCTCN2020099475-appb-100006
    其中,所述损失函数为
    Figure PCTCN2020099475-appb-100007
    y是指真实的结果,x是指输入的样本数据,θ是指Xgboost模型中各函数权重,P(|)是条件概率,ω为线性回归模型中每个变量的权重,J(ω;X,y)是Xgboost模型通过输入样本数据得到的预测结果和实际结果之间的差值,α为权重向量配置比例;
    Forming module, used to calculate the two-dimensional norm of the gradient matrix of the loss function
    Figure PCTCN2020099475-appb-100005
    As a benchmark, a linear regression model is used to form the objective function of the Xgboost model
    Figure PCTCN2020099475-appb-100006
    Wherein, the loss function is
    Figure PCTCN2020099475-appb-100007
    y refers to the real result, x refers to the input sample data, θ refers to the weight of each function in the Xgboost model, P(|) is the conditional probability, ω is the weight of each variable in the linear regression model, J(ω; X , Y) is the difference between the predicted result and the actual result obtained by the Xgboost model by inputting sample data, and α is the weight vector configuration ratio;
    优化模块,用于根据所述目标函数对所述Xgboost模型的梯度进行梯度优化,其中,梯度优化的优化方向为
    Figure PCTCN2020099475-appb-100008
    X是指输入的样本数据,ω 为线性回归模型中每个变量的权重,J(ω;X,y)是Xgboost模型通过输入样本数据得到的预测结果和实际结果之间的差值,α为权重向量配置比例;
    The optimization module is used to perform gradient optimization on the gradient of the Xgboost model according to the objective function, wherein the optimization direction of the gradient optimization is
    Figure PCTCN2020099475-appb-100008
    X refers to the input sample data, ω is the weight of each variable in the linear regression model, J(ω; X, y) is the difference between the predicted result and the actual result obtained by the Xgboost model through the input sample data, α is Weight vector configuration ratio;
    训练模块,用于将预处理后的样本数据输入到进行梯度优化的所述Xgboost模型中进行训练,以确定优化后的所述Xgboost模型的学习参数。The training module is used to input the preprocessed sample data into the Xgboost model for gradient optimization for training, so as to determine the learning parameters of the optimized Xgboost model.
  9. 一种计算机设备,包括存储器和处理器,所述存储器存储有计算机程序,其中,所述处理器执行所述计算机程序时实现预测加工件质量的方法,包括:A computer device includes a memory and a processor, the memory stores a computer program, wherein the method for realizing the quality of a workpiece when the processor executes the computer program includes:
    判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征;Determine whether the current data to be analyzed contains the mark characteristics corresponding to the quality parameters of the defective workpiece;
    若是,则将所述当前待分析数据输入预测模型中进行预测分析,其中,所述预测模型至少包括Xgboost模型、RandomForest模型和深度学习神经网络模型中的两种,所述Xgboost模型通过线性回归模型进行了修正调整;If yes, input the current data to be analyzed into a prediction model for prediction analysis, where the prediction model includes at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model, and the Xgboost model adopts a linear regression model Made corrections and adjustments;
    按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果;Summarizing the analysis results of at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model according to a preset method, to obtain a summary result;
    根据所述汇总结果判断所述当前待分析数据对应的加工件质量不达标的几率。The probability that the quality of the processed part corresponding to the current data to be analyzed does not meet the standard is determined according to the summary result.
  10. 根据权利要求9所述的计算机设备,其中,所述Xgboost模型的损失函数依据对数最大相似性构建得到,所述判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征的步骤之前,包括:The computer device according to claim 9, wherein the loss function of the Xgboost model is constructed according to the logarithmic maximum similarity, and the step of judging whether the current data to be analyzed contains the mark feature corresponding to the quality parameter of the defective workpiece ,include:
    以损失函数梯度矩阵的二维范数
    Figure PCTCN2020099475-appb-100009
    为基准,运用线性回归模型构成所述Xgboost模型的目标函数
    Figure PCTCN2020099475-appb-100010
    其中,所述损失函数为
    Figure PCTCN2020099475-appb-100011
    y是指真实的结果,x是指输入的样本数据,θ是指Xgboost模型中各函数权重,P(|)是条件概率,ω为线性回归模型中每个变量的权重,J(ω;X,y)是Xgboost模型通过输入样本数据得到的预测结果和实际结果之间的差值,α为权重向量配置比例;
    Take the two-dimensional norm of the gradient matrix of the loss function
    Figure PCTCN2020099475-appb-100009
    As a benchmark, a linear regression model is used to form the objective function of the Xgboost model
    Figure PCTCN2020099475-appb-100010
    Wherein, the loss function is
    Figure PCTCN2020099475-appb-100011
    y refers to the real result, x refers to the input sample data, θ refers to the weight of each function in the Xgboost model, P(|) is the conditional probability, ω is the weight of each variable in the linear regression model, J(ω; X , Y) is the difference between the predicted result and the actual result obtained by the Xgboost model by inputting sample data, and α is the weight vector configuration ratio;
    根据所述目标函数对所述Xgboost模型的梯度进行梯度优化,其中,梯度优化的优化方向为
    Figure PCTCN2020099475-appb-100012
    X是指输入的样本数据,ω为线性回归模型中每个变量的权重,J(ω;X,y)是Xgboost模型通过输入样本数据得到的预测结果和实际结果之间的差值,α为权重向量配置比例;
    Perform gradient optimization on the gradient of the Xgboost model according to the objective function, where the optimization direction of the gradient optimization is
    Figure PCTCN2020099475-appb-100012
    X refers to the input sample data, ω is the weight of each variable in the linear regression model, J(ω; X, y) is the difference between the predicted result and the actual result obtained by the Xgboost model through the input sample data, α is Weight vector configuration ratio;
    将预处理后的样本数据输入到进行梯度优化的所述Xgboost模型中进行训练,以确定优化后的所述Xgboost模型的学习参数。The preprocessed sample data is input into the Xgboost model for gradient optimization for training, so as to determine the learning parameters of the optimized Xgboost model.
  11. 根据权利要求9所述的计算机设备,其中,所述深度学习神经网络模型包括放大层、解构层和学习层,所述判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征的步骤之前,包括:The computer device according to claim 9, wherein the deep learning neural network model includes an amplification layer, a deconstruction layer, and a learning layer, and the step of judging whether the current data to be analyzed contains a mark feature corresponding to a quality parameter of a defective workpiece Before, including:
    选择构建元素分别构建所述深度学习神经网络模型的放大层、解构层和学习层,其中,所述放大层包括多层依次累积的隐藏层,所述解构层包括多层依次连接的卷积层,所述学习层包括多层依次累积的隐藏层;The construction elements are selected to construct the amplification layer, the deconstruction layer, and the learning layer of the deep learning neural network model, wherein the amplification layer includes multiple layers of hidden layers accumulated in sequence, and the deconstruction layer includes multiple layers of convolutional layers connected in sequence , The learning layer includes multiple hidden layers accumulated in sequence;
    依次连接所述放大层、解构层和学习层形成所述深度学习神经网络模型;Sequentially connecting the amplification layer, the deconstruction layer, and the learning layer to form the deep learning neural network model;
    将预处理后的样本数据输入到所述深度学习神经网络模型,以确定所述深度学习神经网络模型的模型参数。The preprocessed sample data is input to the deep learning neural network model to determine the model parameters of the deep learning neural network model.
  12. 根据权利要求10或11所述的计算机设备,其中,所述将预处理后的样本数据输入到进行梯度优化的所述Xgboost模型中进行训练,以确定优化后的所述Xgboost模型的学习参数的步骤之前,或将预处理后的样本数据输入到所述深度学习神经网络模型,以确 定所述深度学习神经网络模型的模型参数的步骤之前,包括:The computer device according to claim 10 or 11, wherein the pre-processed sample data is input into the Xgboost model for gradient optimization for training, so as to determine the learning parameters of the optimized Xgboost model Before the step, or before the step of inputting preprocessed sample data into the deep learning neural network model to determine the model parameters of the deep learning neural network model, the method includes:
    判断是否接收到对所述预处理后的样本数据的修正指令;Judging whether a correction instruction to the preprocessed sample data is received;
    若是,则将所述预处理后的样本数据按照区域选择指令进行局部放大显示,其中,所述区域选择指令根据用户点击屏幕时的映射区域发出并显示在所述映射区域的样本数据处,所述区域选择指令至少包括添加和删除;If yes, the preprocessed sample data is partially enlarged and displayed according to the area selection instruction, where the area selection instruction is issued according to the mapping area when the user clicks on the screen and displayed at the sample data of the mapping area, so The area selection instructions include at least add and delete;
    根据接收到的所述区域选择指令的类型,对所述映射区域对应的样本数据进行修正;Correcting the sample data corresponding to the mapping area according to the type of the received area selection instruction;
    将修正后的样本数据的显示状态恢复至局部放大显示前的状态。Restore the display state of the corrected sample data to the state before the partial enlarged display.
  13. 根据权利要求9所述的计算机设备,其中,所述按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果的步骤,包括:8. The computer device according to claim 9, wherein the step of performing summary processing on the analysis results of at least two of the Xgboost model, RandomForest model, and deep learning neural network model according to a preset method, to obtain the summary result ,include:
    将所述当前待分析数据分别输入至所述Xgboost模型、RandomForest模型和深度学习神经网络模型后,分别得到的分析结果;The analysis results obtained after inputting the current data to be analyzed into the Xgboost model, RandomForest model and deep learning neural network model respectively;
    将各所述分析结果对应的矩阵数据,按照所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重,进行加权平均得到所述汇总结果。The matrix data corresponding to each analysis result is weighted and averaged according to the weights corresponding to the Xgboost model, the RandomForest model, and the deep learning neural network model to obtain the summary result.
  14. 根据权利要求13所述的计算机设备,其中,将各所述分析结果对应的矩阵数据,按照所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重,进行加权平均得到所述汇总结果的步骤之前,包括:The computer device according to claim 13, wherein the matrix data corresponding to each of the analysis results are weighted and averaged to obtain the summary result according to the weights corresponding to the Xgboost model, the RandomForest model, and the deep learning neural network model. Before the steps, include:
    将携带标签的样本数据,分别输入到所述Xgboost模型、RandomForest模型和深度学习神经网络模型中进行训练;Input the sample data carrying labels into the Xgboost model, RandomForest model and deep learning neural network model for training;
    获取所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对所述携带标签的样本数据的反馈结果;Acquiring the feedback results of the Xgboost model, the RandomForest model, and the deep learning neural network model on the labeled sample data;
    根据各所述反馈结果以及携带标签的赋值,通过线性回归模型,计算所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重。According to each of the feedback results and the assignment of the tags, the weights corresponding to the Xgboost model, the RandomForest model and the deep learning neural network model are calculated through a linear regression model.
  15. 一种计算机可读存储介质,其上存储有计算机程序,其中,所述计算机程序被处理器执行时实现预测加工件质量的方法,包括:A computer-readable storage medium having a computer program stored thereon, wherein the method for predicting the quality of a processed part when the computer program is executed by a processor includes:
    判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征;Determine whether the current data to be analyzed contains the mark characteristics corresponding to the quality parameters of the defective workpiece;
    若是,则将所述当前待分析数据输入预测模型中进行预测分析,其中,所述预测模型至少包括Xgboost模型、RandomForest模型和深度学习神经网络模型中的两种,所述Xgboost模型通过线性回归模型进行了修正调整;If yes, input the current data to be analyzed into a prediction model for prediction analysis, where the prediction model includes at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model, and the Xgboost model adopts a linear regression model Made corrections and adjustments;
    按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果;Summarizing the analysis results of at least two of the Xgboost model, the RandomForest model, and the deep learning neural network model according to a preset method, to obtain a summary result;
    根据所述汇总结果判断所述当前待分析数据对应的加工件质量不达标的几率。The probability that the quality of the processed part corresponding to the current data to be analyzed does not meet the standard is determined according to the summary result.
  16. 根据权利要求15所述的计算机可读存储介质,其中,所述Xgboost模型的损失函数依据对数最大相似性构建得到,所述判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征的步骤之前,包括:15. The computer-readable storage medium according to claim 15, wherein the loss function of the Xgboost model is constructed based on the logarithmic maximum similarity, and the judgment whether the current data to be analyzed contains the mark feature corresponding to the quality parameter of the defective workpiece Before the steps, include:
    以损失函数梯度矩阵的二维范数
    Figure PCTCN2020099475-appb-100013
    为基准,运用线性回归模型构成所述Xgboost模型的目标函数
    Figure PCTCN2020099475-appb-100014
    其中,所述损失函数为
    Figure PCTCN2020099475-appb-100015
    y是指真实的结果,x是指输入的样本数据,θ是指Xgboost模型中各函数权重,P(|)是条件概率,ω为线性回归模型中每个变量的权重,J(ω;X,y)是Xgboost模型通过输入样本数据得到的预测结果和实际结果之间的差值,α为权重向量配置比例;
    Take the two-dimensional norm of the gradient matrix of the loss function
    Figure PCTCN2020099475-appb-100013
    As a benchmark, a linear regression model is used to form the objective function of the Xgboost model
    Figure PCTCN2020099475-appb-100014
    Wherein, the loss function is
    Figure PCTCN2020099475-appb-100015
    y refers to the real result, x refers to the input sample data, θ refers to the weight of each function in the Xgboost model, P(|) is the conditional probability, ω is the weight of each variable in the linear regression model, J(ω; X , Y) is the difference between the predicted result and the actual result obtained by the Xgboost model by inputting sample data, and α is the weight vector configuration ratio;
    根据所述目标函数对所述Xgboost模型的梯度进行梯度优化,其中,梯度优化的优化方向为
    Figure PCTCN2020099475-appb-100016
    X是指输入的样本数据,ω为线性回归模型中每个变量的权重,J(ω;X,y)是Xgboost模型通过输入样本数据得到的预测结果和实际结果之间的差值,α为权重向量配置比例;
    Perform gradient optimization on the gradient of the Xgboost model according to the objective function, where the optimization direction of the gradient optimization is
    Figure PCTCN2020099475-appb-100016
    X refers to the input sample data, ω is the weight of each variable in the linear regression model, J(ω; X, y) is the difference between the predicted result and the actual result obtained by the Xgboost model through the input sample data, α is Weight vector configuration ratio;
    将预处理后的样本数据输入到进行梯度优化的所述Xgboost模型中进行训练,以确定优化后的所述Xgboost模型的学习参数。The preprocessed sample data is input into the Xgboost model for gradient optimization for training, so as to determine the learning parameters of the optimized Xgboost model.
  17. 根据权利要求15所述的计算机可读存储介质,其中,所述深度学习神经网络模型包括放大层、解构层和学习层,所述判断当前待分析数据是否含有瑕疵加工件的质量参数对应的标志特征的步骤之前,包括:The computer-readable storage medium according to claim 15, wherein the deep learning neural network model includes an amplification layer, a deconstruction layer, and a learning layer, and the judgment whether the current data to be analyzed contains a mark corresponding to the quality parameter of the defective processed part Before the characteristic steps, include:
    选择构建元素分别构建所述深度学习神经网络模型的放大层、解构层和学习层,其中,所述放大层包括多层依次累积的隐藏层,所述解构层包括多层依次连接的卷积层,所述学习层包括多层依次累积的隐藏层;The construction elements are selected to construct the amplification layer, the deconstruction layer, and the learning layer of the deep learning neural network model, wherein the amplification layer includes multiple layers of hidden layers accumulated in sequence, and the deconstruction layer includes multiple layers of convolutional layers connected in sequence , The learning layer includes multiple hidden layers accumulated in sequence;
    依次连接所述放大层、解构层和学习层形成所述深度学习神经网络模型;Sequentially connecting the amplification layer, the deconstruction layer, and the learning layer to form the deep learning neural network model;
    将预处理后的样本数据输入到所述深度学习神经网络模型,以确定所述深度学习神经网络模型的模型参数。The preprocessed sample data is input to the deep learning neural network model to determine the model parameters of the deep learning neural network model.
  18. 根据权利要求16或17所述的计算机可读存储介质,其中,所述将预处理后的样本数据输入到进行梯度优化的所述Xgboost模型中进行训练,以确定优化后的所述Xgboost模型的学习参数的步骤之前,或将预处理后的样本数据输入到所述深度学习神经网络模型,以确定所述深度学习神经网络模型的模型参数的步骤之前,包括:The computer-readable storage medium according to claim 16 or 17, wherein the pre-processed sample data is input into the Xgboost model for gradient optimization for training, so as to determine the optimized Xgboost model Before the step of learning parameters, or before the step of inputting pre-processed sample data into the deep learning neural network model to determine the model parameters of the deep learning neural network model, it includes:
    判断是否接收到对所述预处理后的样本数据的修正指令;Judging whether a correction instruction to the preprocessed sample data is received;
    若是,则将所述预处理后的样本数据按照区域选择指令进行局部放大显示,其中,所述区域选择指令根据用户点击屏幕时的映射区域发出并显示在所述映射区域的样本数据处,所述区域选择指令至少包括添加和删除;If yes, the preprocessed sample data is partially enlarged and displayed according to the area selection instruction, where the area selection instruction is issued according to the mapping area when the user clicks on the screen and displayed at the sample data of the mapping area, so The area selection instructions include at least add and delete;
    根据接收到的所述区域选择指令的类型,对所述映射区域对应的样本数据进行修正;Correcting the sample data corresponding to the mapping area according to the type of the received area selection instruction;
    将修正后的样本数据的显示状态恢复至局部放大显示前的状态。Restore the display state of the corrected sample data to the state before the partial enlarged display.
  19. 根据权利要求15所述的计算机可读存储介质,其中,所述按照预设方法对所述Xgboost模型、RandomForest模型和深度学习神经网络模型中的至少两种模型的分析结果进行汇总处理,得到汇总结果的步骤,包括:The computer-readable storage medium according to claim 15, wherein the analysis results of at least two of the Xgboost model, RandomForest model, and deep learning neural network model are summarized according to a preset method to obtain a summary The results of the steps include:
    将所述当前待分析数据分别输入至所述Xgboost模型、RandomForest模型和深度学习神经网络模型后,分别得到的分析结果;The analysis results obtained after inputting the current data to be analyzed into the Xgboost model, RandomForest model and deep learning neural network model respectively;
    将各所述分析结果对应的矩阵数据,按照所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重,进行加权平均得到所述汇总结果。The matrix data corresponding to each analysis result is weighted and averaged according to the weights corresponding to the Xgboost model, the RandomForest model, and the deep learning neural network model to obtain the summary result.
  20. 根据权利要求19所述的计算机可读存储介质,其中,将各所述分析结果对应的矩阵数据,按照所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重,进行加权平均得到所述汇总结果的步骤之前,包括:The computer-readable storage medium according to claim 19, wherein the matrix data corresponding to each of the analysis results is weighted and averaged according to the weights corresponding to the Xgboost model, RandomForest model, and deep learning neural network model. Before describing the steps to summarize the results, include:
    将携带标签的样本数据,分别输入到所述Xgboost模型、RandomForest模型和深度学习神经网络模型中进行训练;Input the sample data carrying labels into the Xgboost model, RandomForest model and deep learning neural network model for training;
    获取所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对所述携带标签的样本数据的反馈结果;Acquiring the feedback results of the Xgboost model, the RandomForest model, and the deep learning neural network model on the labeled sample data;
    根据各所述反馈结果以及携带标签的赋值,通过线性回归模型,计算所述Xgboost模型、RandomForest模型和深度学习神经网络模型分别对应的权重。According to each of the feedback results and the assignment of the tags, the weights corresponding to the Xgboost model, the RandomForest model, and the deep learning neural network model are calculated through a linear regression model.
PCT/CN2020/099475 2020-05-28 2020-06-30 Method and apparatus for predicting workpiece quality, and computer device WO2021143067A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010469626.2A CN111814385B (en) 2020-05-28 2020-05-28 Method, device and computer equipment for predicting quality of machined part
CN202010469626.2 2020-05-28

Publications (1)

Publication Number Publication Date
WO2021143067A1 true WO2021143067A1 (en) 2021-07-22

Family

ID=72847830

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/099475 WO2021143067A1 (en) 2020-05-28 2020-06-30 Method and apparatus for predicting workpiece quality, and computer device

Country Status (2)

Country Link
CN (1) CN111814385B (en)
WO (1) WO2021143067A1 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113642251A (en) * 2021-08-31 2021-11-12 佛山众陶联供应链服务有限公司 Data analysis and prediction method and system for powder making quality of architectural ceramic spray
CN114733640A (en) * 2022-03-03 2022-07-12 江苏丰尚智能科技有限公司 Method and device for adjusting processing parameters of pulverizer and computer equipment
CN115310357A (en) * 2022-08-09 2022-11-08 大庆正方软件科技股份有限公司 Fracturing analysis method based on data-driven decision
CN115512166A (en) * 2022-10-18 2022-12-23 湖北华鑫光电有限公司 Intelligent preparation method and system of lens
CN116362599A (en) * 2022-12-12 2023-06-30 武汉同捷信息技术有限公司 Quality data acquisition method and device based on MES system
CN116976208A (en) * 2023-07-28 2023-10-31 沈阳飞机工业(集团)有限公司 Aviation fastener dividing method based on machine learning
CN117549219A (en) * 2024-01-12 2024-02-13 东方电气集团东方汽轮机有限公司 Blade sand blasting quality prediction method, system, equipment and storage medium
CN117649906A (en) * 2024-01-30 2024-03-05 浙江大学 Casting quality prediction method for integrated aluminum alloy structural part, electronic equipment and medium
CN113642251B (en) * 2021-08-31 2024-05-28 佛山众陶联供应链服务有限公司 Data analysis and prediction method and system for building ceramic spray powder preparation quality

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112966827B (en) * 2021-02-26 2022-02-11 普赛微科技(杭州)有限公司 Method for predicting yield in memory development process
CN112884081B (en) * 2021-03-31 2023-07-14 福州大学 On-line intelligent predicting method for quality of workpieces produced in small batches of multiple varieties
CN113516285B (en) * 2021-05-12 2024-02-13 中船重工鹏力(南京)智能装备系统有限公司 Product quality analysis and prediction method of automatic assembly detection production line in production
CN113987619B (en) * 2021-12-29 2022-04-12 广州赛意信息科技股份有限公司 Industrial design system and method for improving interactive experience
CN115237054B (en) * 2022-09-21 2022-12-16 中科航迈数控软件(深圳)有限公司 Main shaft driving motor control method, device, terminal and storage medium
CN115630847B (en) * 2022-12-07 2023-03-28 四川省华盾防务科技股份有限公司 Transceiving assembly detection method and system based on data prediction and storage medium
CN116214808B (en) * 2023-03-29 2023-10-03 连云港春旭滚塑科技有限公司 High-temperature steam rotational molding equipment with full-automatic quantitative feeding function
CN117647367B (en) * 2024-01-29 2024-04-16 四川航空股份有限公司 Machine learning-based method and system for positioning leakage points of aircraft fuel tank

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190147350A1 (en) * 2016-04-27 2019-05-16 The Fourth Paradigm (Beijing) Tech Co Ltd Method and device for presenting prediction model, and method and device for adjusting prediction model
CN110288199A (en) * 2019-05-29 2019-09-27 北京航空航天大学 The method of product quality forecast
CN110766660A (en) * 2019-09-25 2020-02-07 上海众壹云计算科技有限公司 Integrated circuit defect image recognition and classification system based on fusion depth learning model
CN111079865A (en) * 2019-12-31 2020-04-28 联想(北京)有限公司 Detection method and apparatus, electronic device, and medium

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107492043A (en) * 2017-09-04 2017-12-19 国网冀北电力有限公司电力科学研究院 stealing analysis method and device
CN108764597A (en) * 2018-04-02 2018-11-06 华南理工大学 A kind of product quality control method based on integrated study
CN108805259A (en) * 2018-05-23 2018-11-13 北京达佳互联信息技术有限公司 neural network model training method, device, storage medium and terminal device
CN108877905B (en) * 2018-06-12 2020-11-10 中南大学 Hospital outpatient quantity prediction method based on Xgboost framework
WO2020020088A1 (en) * 2018-07-23 2020-01-30 第四范式(北京)技术有限公司 Neural network model training method and system, and prediction method and system
CN110990135B (en) * 2019-11-28 2023-05-12 中国人民解放军国防科技大学 Spark job time prediction method and device based on deep migration learning

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190147350A1 (en) * 2016-04-27 2019-05-16 The Fourth Paradigm (Beijing) Tech Co Ltd Method and device for presenting prediction model, and method and device for adjusting prediction model
CN110288199A (en) * 2019-05-29 2019-09-27 北京航空航天大学 The method of product quality forecast
CN110766660A (en) * 2019-09-25 2020-02-07 上海众壹云计算科技有限公司 Integrated circuit defect image recognition and classification system based on fusion depth learning model
CN111079865A (en) * 2019-12-31 2020-04-28 联想(北京)有限公司 Detection method and apparatus, electronic device, and medium

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
IBTISSAM EL HASSANI; CHOUMICHA EL MAZGUALDI; TAWFIK MASROUR: "Artificial Intelligence and Machine Learning to Predict and Improve Efficiency in Manufacturing Industry", ARXIV.ORG, CORNELL UNIVERSITY LIBRARY, 201 OLIN LIBRARY CORNELL UNIVERSITY ITHACA, NY 14853, 8 January 2019 (2019-01-08), 201 Olin Library Cornell University Ithaca, NY 14853, XP081012464 *
YUE PENG, HOU LINGYAN;YANG DALI;TONG QIANG: "XLC-Stacking Method for Disease Diagnosis Based on XGBoost Feature Selection", COMPUTER ENGINEERING AND APPLICATIONS, HUABEI JISUAN JISHU YANJIUSUO, CN, vol. 56, no. 17, 1 January 2020 (2020-01-01), CN, pages 136 - 141, XP055828936, ISSN: 1002-8331, DOI: 10.3778/j.issn.1002-8331.1908-0337 *

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113642251B (en) * 2021-08-31 2024-05-28 佛山众陶联供应链服务有限公司 Data analysis and prediction method and system for building ceramic spray powder preparation quality
CN113642251A (en) * 2021-08-31 2021-11-12 佛山众陶联供应链服务有限公司 Data analysis and prediction method and system for powder making quality of architectural ceramic spray
CN114733640A (en) * 2022-03-03 2022-07-12 江苏丰尚智能科技有限公司 Method and device for adjusting processing parameters of pulverizer and computer equipment
CN114733640B (en) * 2022-03-03 2023-09-12 江苏丰尚智能科技有限公司 Method and device for adjusting processing parameters of pulverizer and computer equipment
CN115310357A (en) * 2022-08-09 2022-11-08 大庆正方软件科技股份有限公司 Fracturing analysis method based on data-driven decision
CN115512166A (en) * 2022-10-18 2022-12-23 湖北华鑫光电有限公司 Intelligent preparation method and system of lens
CN116362599A (en) * 2022-12-12 2023-06-30 武汉同捷信息技术有限公司 Quality data acquisition method and device based on MES system
CN116362599B (en) * 2022-12-12 2023-11-10 武汉同捷信息技术有限公司 Quality data acquisition method and device based on MES system
CN116976208A (en) * 2023-07-28 2023-10-31 沈阳飞机工业(集团)有限公司 Aviation fastener dividing method based on machine learning
CN117549219B (en) * 2024-01-12 2024-05-10 东方电气集团东方汽轮机有限公司 Blade sand blasting quality prediction method, system, equipment and storage medium
CN117549219A (en) * 2024-01-12 2024-02-13 东方电气集团东方汽轮机有限公司 Blade sand blasting quality prediction method, system, equipment and storage medium
CN117649906A (en) * 2024-01-30 2024-03-05 浙江大学 Casting quality prediction method for integrated aluminum alloy structural part, electronic equipment and medium
CN117649906B (en) * 2024-01-30 2024-04-26 浙江大学 Casting quality prediction method for integrated aluminum alloy structural part, electronic equipment and medium

Also Published As

Publication number Publication date
CN111814385B (en) 2023-11-17
CN111814385A (en) 2020-10-23

Similar Documents

Publication Publication Date Title
WO2021143067A1 (en) Method and apparatus for predicting workpiece quality, and computer device
Malhotra An empirical framework for defect prediction using machine learning techniques with Android software
CN109800885A (en) It is determined for the rule of black box machine learning model
US20230084638A1 (en) Method and apparatus for classification model training and classification, computer device, and storage medium
US11640539B2 (en) Techniques for visualizing the operation of neural networks using samples of training data
US7558803B1 (en) Computer-implemented systems and methods for bottom-up induction of decision trees
CN110162681A (en) Text identification, text handling method, device, computer equipment and storage medium
CN112907026A (en) Comprehensive evaluation method based on editable mesh index system
CN114357170A (en) Model training method, analysis method, device, equipment and medium
Ye et al. Development of the ELECTRE method under Pythagorean fuzzy sets based on existing correlation coefficients for cotton fabric selection
CN106096635A (en) The warning sorting technique of cost-sensitive neutral net based on threshold operation
CN114387089A (en) Client credit risk assessment method, device, equipment and storage medium
CN109300030A (en) Realize the method and apparatus that equity investment is recommended
Tsukahara et al. Validation of default probability models: A stress testing approach
Sheikhkhoshkar et al. An advanced exploration of functionalities as the underlying principles of construction control metrics
CN113792187A (en) Crowd-sourcing software development contribution quality assessment method, device, equipment and medium
Malhotra et al. Analysis of evolutionary algorithms to improve software defect prediction
WO2021159101A1 (en) Fine-grained stochastic neural architecture search
Chen et al. Surface defect detection of industrial components based on vision
Khan et al. Applying Machine Learning Analysis for Software Quality Test
Azadeh et al. Optimization of human resources and industrial banks with ambiguous inputs using intelligent fuzzy mathematical programming approach
Rafatirad et al. Machine learning for computer scientists and data analysts
CN117112741A (en) Competitive product information acquisition method, device, computer equipment and storage medium
CN114820618B (en) Defect detection model training method, device, equipment and storage medium
US11822564B1 (en) Graphical user interface enabling interactive visualizations using a meta-database constructed from autonomously scanned disparate and heterogeneous sources

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20913293

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 22/03/2023)

122 Ep: pct application non-entry in european phase

Ref document number: 20913293

Country of ref document: EP

Kind code of ref document: A1