WO2022267509A1 - Method for training smt printing parameter optimization model, device, and storage medium - Google Patents

Method for training smt printing parameter optimization model, device, and storage medium Download PDF

Info

Publication number
WO2022267509A1
WO2022267509A1 PCT/CN2022/077781 CN2022077781W WO2022267509A1 WO 2022267509 A1 WO2022267509 A1 WO 2022267509A1 CN 2022077781 W CN2022077781 W CN 2022077781W WO 2022267509 A1 WO2022267509 A1 WO 2022267509A1
Authority
WO
WIPO (PCT)
Prior art keywords
trained
optimization model
data
training
model
Prior art date
Application number
PCT/CN2022/077781
Other languages
French (fr)
Chinese (zh)
Inventor
李席
常建涛
李晨阳
孔宪光
武彦斌
王凤伟
Original Assignee
中兴通讯股份有限公司
西安电子科技大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司, 西安电子科技大学 filed Critical 中兴通讯股份有限公司
Publication of WO2022267509A1 publication Critical patent/WO2022267509A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • G06F30/27Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/12Computing arrangements based on biological models using genetic models
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/12Computing arrangements based on biological models using genetic models
    • G06N3/126Evolutionary algorithms, e.g. genetic algorithms or genetic programming
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2119/00Details relating to the type or aim of the analysis or the optimisation
    • G06F2119/14Force analysis or force optimisation, e.g. static or dynamic forces

Definitions

  • This application belongs to the field of intelligent manufacturing technology, and involves surface mount technology (Surface Mounted Technology, referred to as: SMT) production line printing and solder paste inspection (SolderPaste Inspection, referred to as: SPI) stage, especially relates to a SMT printing parameter optimization model training , equipment and storage media.
  • SMT Surface Mounted Technology
  • SPI solder paste Inspection
  • An embodiment of the present application provides a training method for an SMT printing parameter optimization model, the method comprising the following steps: receiving initial production data; reconstructing influencing factors according to the initial production data to obtain influencing factor data packets; loading to be trained Optimizing the model: training the optimization model to be trained according to the influencing factor data package to obtain a trained optimization model.
  • the embodiment of the present application also provides a computer device, the device includes a memory and a processor; the memory is used to store a computer program; the processor is used to execute the computer program and when executing the computer program Steps for realizing the training method of the SMT printing parameter optimization model as described above.
  • the embodiment of the present application also provides a storage medium for computer-readable storage, the storage medium stores one or more programs, and the one or more programs can be executed by one or more processors to realize The steps of the training method of the aforementioned SMT printing parameter optimization model.
  • Fig. 1 is the schematic flow chart of the training method of a kind of SMT printing parameter optimization model that an embodiment of the present application provides;
  • Fig. 2 is a schematic flow chart of the steps of obtaining the influencing factor data package provided by an embodiment of the present application
  • FIG. 3 is a schematic flowchart of the steps of obtaining the influencing factor data package provided by another embodiment of the present application.
  • Fig. 4 is a schematic flowchart of the steps of obtaining the influencing factor data package provided by another embodiment of the present application.
  • FIG. 5 is a schematic flowchart of the steps of training the optimization model to be trained provided by an embodiment of the present application
  • FIG. 6 is a schematic flowchart of the steps of iterative training provided by an embodiment of the present application.
  • FIG. 7 is a schematic flow diagram of the steps of obtaining a trained optimization model provided by an embodiment of the present application.
  • Fig. 8 is a schematic flow chart of the process parameter prediction and recommended steps provided by an embodiment of the present application.
  • Fig. 9 is a schematic structural block diagram of a computer device provided by an embodiment of the present application.
  • the main purpose of the embodiments of the present application is to propose a training, equipment and storage medium for an SMT printing parameter optimization model, aiming at improving the prediction result of printing quality in the SMT production line and improving the accuracy of process parameter recommendation.
  • Figure 1 is a schematic flow diagram of a training method for an SMT printing parameter optimization model provided by an embodiment of the present application, the method comprising the following steps:
  • Step S101 receiving input initial production data.
  • SMT Surface Mount Technology
  • Surface Mount Technology which is called Surface Mount Technology in English. It is the most popular technology and process in the electronic assembly industry.
  • SMC/SMD for short, chip components in Chinese are installed on the surface of a printed circuit board (Printed Circuit Board, referred to as: PCB) or on the surface of other substrates, and are soldered and assembled by reflow soldering or dip soldering. technology.
  • Step S102 reconstruct the influencing factors according to the initial production data, and obtain the reconstructed influencing factor data package.
  • the initial production data After the input initial production data is received, the initial production data needs to be processed, specifically, the received initial production data is reconstructed to obtain a reconstructed influencing factor data package.
  • the optimization model when training the SMT printing parameter optimization model (hereinafter referred to as the optimization model), firstly, the training samples for training are determined, and then the optimization model to be trained is trained according to the obtained training samples. Specifically, firstly, the input initial production data is received, and then the impact factors are reconstructed on the received initial production data, so as to obtain the impact factor data package after the reconstruction of the influence factors is completed.
  • the initial production data input for model training includes a lot, specifically including manpower, machine, material, method, environment, measurement and collection of all collectable data in the printing stage of the SMT production line and SPI detection, and then the The received data is further processed.
  • the obtained original production data includes the following six types of data, namely: time batch data, PCB board attribute data, Process parameter data, small elements, printing process data and SPI detection data.
  • PCB board attribute data includes PCB board length, board width and board height
  • process parameter data includes scraper pressure, scraper speed, demoulding speed, demoulding distance, automatic cleaning count and cleaning speed
  • small elements include scraper separation distance, Squeegee separation speed, manual cleaning count, cleaning options, table printing height compensation, automatic cleaning and manual cleaning
  • printing process data include printing time, production count, scraper count, Mask count, Mark_1X axis coordinates, Mark_1Y axis coordinates, Mark_2X axis coordinates , Mark_2 Y-axis coordinates, work separation delay, start offset, end offset, down offset, cleaning supply time, wiping paper waiting distance, machine waiting time, solder paste usage count, average pressure, minimum pressure, maximum pressure and Post-squeegee counting
  • SPI detection data includes solder paste volume and area.
  • the obtained initial production data is not directly used for training, but the influencing factors are reconstructed for the initial production data.
  • the initial production data records many different factors that have an impact on the actual SMT printing. At the same time, there are many of these factors. If all the data are directly used for model training at this time, it will increase the cost of model training. At the same time, it will also make the efficiency of model training extremely low, and because all the data is used for model training, the model will be too single, and the model will not be accurate when recommending process parameters. Therefore, after receiving the initial production data, the influencing factors of the initial production data will be reconstructed to fully consider the impact of feature hidden information on the model, so that the trained model can have higher prediction accuracy and recommendation accuracy.
  • FIG. 2 is a schematic flowchart of the steps of obtaining the influencing factor data package provided by an embodiment of the present application.
  • the initial production data after receiving the inputted initial production data, the initial production data will be reconstructed for influencing factors, so as shown in FIG. 2 , the specific steps for reconstructing influencing factors include step S201 to step S202.
  • Step S201 Filter and screen the initial production data to obtain an original data set that does not contain fixed attributes.
  • the obtained initial production data includes manpower, machine, material, method, environment, measurement, collection of SMT production line printing stage and SPI detection all collectable data
  • the original data obtained after data filtering The set includes PCB board attribute parameters, printing process parameters, printing process parameter data and SPI detection data, etc. For data or factors that are not these attributes, it may be data that needs to be filtered out.
  • the solder paste printing process of the SMT production line as an example, after obtaining the relevant initial production data, the corresponding data filtering and screening will be carried out to remove some characteristic factors that have no influence on the SMT process, and then obtain the The original data set, and the original data set obtained at this time can be shown in Table 1 below:
  • the data that needs to be filtered can be manually screened, or can be automatically screened. Screening can be achieved by setting corresponding thresholds, and then it is determined which factor or factors can be filtered and screened out through the set thresholds.
  • the filtering of initial production data includes:
  • the features contained in the initial production data where the feature is a data category, and then summarize and analyze all the values corresponding to the feature to determine the feature change value corresponding to each feature.
  • the pre-set change value is obtained, and then the initial production data is filtered by comparing the characteristic change value with the change preset change value, and at the same time, it is determined which or which features/factors need to be filtered out, wherein, The first feature whose feature change value is greater than or equal to the preset change value is deleted, and the second feature whose feature change value is greater than or equal to the preset change value is used as the original data set, so as to use the obtained original data set to train the optimization model.
  • the feature change value corresponding to a certain feature it can be determined by calculating the variance corresponding to this feature, and then using a set threshold to determine whether it is the data that needs to be filtered and screened out. Under normal circumstances, in order to ensure the accuracy of model training, when filtering the initial production data, the same type of data is filtered out. Then when setting the preset change value, the preset change value to zero.
  • Step S202 using feature crossover and principal component analysis to reconstruct the influencing factors of the original data set, and obtain a reconstructed influencing factor data package.
  • the principal component analysis FI-PCA of feature intersection is used to reconstruct the features of the data, and at the same time, after the feature reconstruction is completed, the dimensionality reduction processing is performed on the relevant data of the reconstructed features.
  • the specific process of reconstruction of influencing factors includes: first, data specification is performed on the original data set, and the influence of data dimension on the reconstruction of influencing factors is eliminated; Crossover, realize the characteristic crossover of all influencing factors, complete the crossover between influencing factors, and then conduct principal component analysis on the obtained new influencing factors to determine the principal components, and then realize dimensionality reduction to obtain reconstructed influencing factors, and finally combine the obtained new influencing factors
  • the obtained reconstructed influencing factors are combined with the process parameters to be optimized to obtain the data package of influencing factors in the printing stage of the SMT production line.
  • FIG. 3 is a schematic flowchart of the steps of obtaining the influencing factor data package provided by another embodiment of the present application.
  • the process includes step S301 to step S304.
  • Step S301 Perform data reduction on the original data set to obtain a reduced original data set.
  • the data is actually normalized to reduce the data between (0, 1), so as to eliminate the influence of dimension on data mining speed and model convergence speed. influences.
  • x * is the value obtained by normalization
  • x is the actual value corresponding to a feature before normalization
  • x max is the maximum value of this type of feature
  • x min is the minimum value of this type of feature value.
  • Step S302. Determine the process parameters to be optimized corresponding to the optimization model to be trained, and remove the process parameters to be optimized contained in the reduced original data set to obtain a first data set.
  • the initial production data received during model training contains process parameters that need to be optimized, and it is necessary to determine the impact of other parameters on the optimized process parameters in addition to the optimized process parameters during model training, Therefore, after data reduction, for the original data set obtained after reduction, it is necessary to extract the process parameters to be optimized contained in it, and then process the characteristic data of other parts that do not contain the process parameters to be optimized.
  • the process parameters to be optimized corresponding to the optimization model currently being trained are determined, and then the process parameters corresponding to the process parameters to be optimized contained in the original data set after data reduction are completed The data are eliminated to obtain the first data set.
  • the process parameters include: scraper pressure, scraper speed, demoulding speed, demoulding distance, automatic cleaning count and cleaning speed, then the original data after the protocol
  • the data set is eliminated, the data corresponding to the six features are eliminated, and the data set obtained after data elimination is the first data set, which is used for training and optimizing the optimization model to be trained.
  • Step S303 performing feature cross multiplication on the influencing factors included in the first data set to obtain a second data set.
  • the data contained in the obtained first data set will be processed, specifically, when the first data set is obtained, the first data set will be processed.
  • the influencing factors contained in a data set are subjected to feature cross multiplication, and then the corresponding second data set is obtained after the processing is completed.
  • the data contained in the first data set is processed by feature intersection, wherein the feature intersection is mined by performing functional operations on the original features of the sample data.
  • x i represents the i-th data of the feature
  • x j is the j-th feature of the feature
  • x i, j is the product interaction result of x i and x j .
  • Step S304 using principal component analysis to analyze each influencing factor included in the second data set to obtain an influencing factor data package.
  • Principal Component Analysis is a linear dimensionality reduction method, which can effectively eliminate the correlation between features and improve the expressive ability of features. After the original features are crossed, although more hidden features are extracted, the number of crossed features generated is huge, and it is necessary to reduce the dimensionality and replace the original features with a small number of unrelated features to improve the speed of model training.
  • principal component analysis is used to analyze the obtained second data set, so as to obtain the corresponding influencing factor data package after the analysis process is completed. Specifically, when analyzing each influencing factor contained in the second data set, determine the degree of influence of each influencing factor on the actual SMT process, so as to determine a number of influencing factors in the second data set as having a greater impact on the SMT process.
  • the reconstructed influencing factors with great influence, and after obtaining the reconstructed influencing factors combine the reconstructed influencing factors with the process parameters to be optimized and specific parameters to obtain the influencing factor data package.
  • the specific parameters include solder paste volume and area, so the characteristics or parameters contained in the obtained influencing factor data package at this time include: reconstructing influencing factors, process parameters to be optimized (squeegee pressure, scraper speed, demoulding speed, ejection distance, automatic cleaning count, and cleaning speed), and solder paste volume and area.
  • FIG. 4 is a schematic flowchart of the steps of obtaining the influencing factor data package provided by another embodiment of the present application.
  • the principal component analysis when used to analyze and process the second data set to obtain the influencing factor data package, it includes:
  • Step S401 using principal component analysis to calculate the eigenvalues corresponding to each influencing factor in the second data set, and sorting the eigenvalues;
  • Step S402 according to the selection method from large to small, select several eigenvalues from the sorted eigenvalues to add, and when the added value is greater than the set preset threshold for the first time, determine the number of eigenvalues to be added.
  • the features corresponding to the eigenvalues are reconstruction influencing factors;
  • Step S403 combining the reconstructed influencing factors and the process parameters to be optimized to obtain an influencing factor data package.
  • Feature crossover is to improve the accuracy of model training, and to ensure model training The efficiency makes it impossible to use all the data obtained by feature crossover as the input of model training, so it is necessary to further judge all the influencing factors contained in the second data set to select more appropriate influencing factors as model training input of.
  • the eigenvalues corresponding to each influencing factor in the second data set it is to determine the influence degree of each influencing factor on the actual process in the SMT process, and the larger the eigenvalue, the greater the influence, so the calculated
  • the eigenvalues corresponding to each influencing factor are sorted and then selected to obtain the reconstructed influencing factors.
  • X is the data set after the centralization of the second data set, is the average of the data contained in the dataset.
  • the covariance matrix of the data set X obtained by centering is calculated, and the calculation formula of the covariance matrix is as follows:
  • the eigenvalue decomposition is performed on the covariance matrix, and all the eigenvalues are arranged in descending order from large to small, which is recorded as ⁇ 1 , ⁇ 2 ,..., ⁇ p ⁇ , and the corresponding eigenvector is ⁇ 1 , ⁇ 2 ,..., ⁇ p ⁇ , where ⁇ is the eigenvalue, ⁇ is the eigenvector, and then select the eigenvector corresponding to the first d largest eigenvalue for mapping, and map p-dimensional data to d-dimensional data, specifically
  • the mapping method is as follows:
  • the parameter d is determined by the proportion of information, and the calculation method of the proportion of information is as follows:
  • is the proportion of information.
  • Step S103 loading the optimization model to be trained.
  • the constructed optimized model to be trained is a deep neural network model with multiple outputs, and when the optimized model to be trained is trained, it is the hidden layer of the deep neural network with multiple outputs The number of nodes and the activation function are optimized, and finally multiple groups of optimized multi-output deep neural network prediction models are obtained.
  • the deep neural network is a fully connected neural network, mainly composed of input layer, hidden layer and output layer, which has better fitting ability and prediction effect on massive data than traditional machine learning models.
  • the hidden layer nodes and activation functions of deep neural networks are often set according to empirical formulas, and different combinations of hidden layer nodes and activation functions will affect the accuracy of deep neural networks. It should be noted that, for the loaded optimized model to be trained, in addition to the multi-output deep neural network model constructed based on the deep neural network, it may also be constructed based on network models of other structures.
  • the optimized model to be trained is a model used to predict and recommend relevant process parameters that need to be used in the SMT printing process, and can also judge the quality of the combination of different input process parameters.
  • the optimized model to be trained is obtained based on the deep neural network. After the construction is completed, the model is trained and optimized, and several better nodes are selected in the multi-node optimized model to be trained for integration to obtain training. Good optimization model.
  • Step S104 Train the optimization model to be trained according to the influencing factor data package to obtain a trained optimization model.
  • the optimized model to be trained is loaded, the optimized model to be trained is trained and optimized using the influencing factor data package obtained in advance, and finally the converged optimized model is recorded and stored when the optimized model to be trained is determined to converge , that is, the trained optimization model is obtained when the convergence is determined.
  • an elite-retaining genetic algorithm is used to realize the training and optimization of the model.
  • Genetic algorithm is a random global search and optimization method developed by imitating the biological evolution mechanism in nature, drawing on Darwin's theory of evolution and Mendel's genetic theory.
  • the elite-retaining genetic algorithm improves the traditional genetic algorithm. The best individual that has appeared so far in the evolution process of the group, that is, the elite individual, is directly copied to the next generation without pairing and crossover, and the fitness value of the new generation group is The smallest individual is eliminated, so as to speed up the global convergence to find the best solution.
  • FIG. 5 is a schematic flowchart of the steps of training the optimization model to be trained according to an embodiment of the present application, wherein the steps include steps S501 to S502.
  • Step S501 initialize the optimization model to be trained, obtain corresponding initialization parameters, and encode the influencing factor data packet according to a preset encoding method to obtain encoded data;
  • Step S502 Perform several iterations of training on the initialized optimization model to be trained according to the encoded data, so as to adjust the initialization parameters, and obtain a trained optimization model when convergence is determined.
  • the model parameters or parameter information in the model to be trained and optimized are optimized by using the elite-reserving genetic algorithm, and then the training and optimization of the model are completed at the time of convergence to obtain the training optimization The final optimized model is then available for users to use.
  • the optimized model to be trained When optimizing and training the model, first initialize the optimized model to be trained, that is, initialize the relevant parameters and parameter information of the model, and at the same time record the corresponding initialization parameters during initialization, where the initialization parameters are also related to the model.
  • Parameter information and when the model is initialized, the data contained in the obtained influencing factor data package will be encoded according to the preset encoding method to obtain the corresponding encoded data, and then the obtained encoded data will be used
  • the optimization model to be trained is trained and optimized to adjust the initialization parameters, and finally the trained optimization model is obtained and stored and recorded when the convergence is determined.
  • the optimization model to be trained when the optimization model to be trained is trained, it is implemented by using the elite-retaining genetic algorithm, so several iterations of training are performed during the actual training, and each training will update the relevant parameters of the model Adjust and optimize.
  • FIG. 6 is a schematic flowchart of the steps of iterative training provided by an embodiment of the present application. Wherein this step includes step S601 to step S603.
  • Step S601 input the encoded data into the initialized optimization model to be trained, so as to perform the first training on the initialized optimization model to be trained, and obtain the first intermediate parameters after the first training;
  • Step S602 adjusting the optimization model to be trained based on the first intermediate parameters to obtain a first intermediate optimization model to be trained
  • Step S603 input the encoded data into the first intermediate optimization model to be trained, so as to perform a second training on the optimization model to be trained, and obtain second intermediate parameters after the second training, so as to By analogy, the optimization model to be trained is trained several times.
  • the initial parameters of the model to be trained and optimized are adjusted and optimized according to the influencing factor data package. Specifically, after obtaining the encoded data corresponding to the influencing factor data package, the encoded data is input into the initialized model to be trained In the optimization model, the optimized model after initialization is trained for the first time, and the first intermediate parameter after the first training is obtained when the first training is completed, and the optimized model to be trained is adjusted according to the first intermediate parameter, Obtain the first intermediate optimized model to be trained, and then input the coded data into the first intermediate optimized model to be trained, so that the optimized model to be trained is trained for the second time, and so on to complete several trainings of the optimized model to be trained.
  • the optimized model obtained when the training is completed will be recorded.
  • the optimized model obtained and stored when the optimized model obtained and stored, it includes:
  • Step S701 when determining the convergence, sort the objective function values obtained in each training, so as to select N objective function values from the sorted objective function values according to the selection method from large to small, wherein the encoded data input into the optimization model to be trained to obtain the objective function value, and output an objective function value for each training;
  • Step S702 determining N sets of parameter information corresponding to the N objective function values, wherein the parameter information includes the number of hidden layer network nodes and corresponding activation functions;
  • Step S703 input the N sets of parameter information into the N sub-models in the optimization model to be trained, and integrate the N sub-models to obtain a trained optimization model, so as to carry out the trained optimization model storage.
  • the obtained optimization model is a neural network model with multiple outputs, so when the convergence is determined, the objective function values obtained during each training are sorted, and then according to The method selects N objective function values from the sorted objective function values, and then determines the parameter information corresponding to the selected N objective function values, where the parameter information includes the number of hidden layer network nodes and the activation function corresponding to the hidden layer , and finally determine N sub-models in the optimization model according to the obtained N sets of parameter information, and integrate the determined N sub-models to obtain a trained optimization model.
  • encoding method of the elite reserved genetic algorithm to RI (real number encoding)
  • binary encoding and gray encoding which are determined according to actual usage requirements
  • set the population size to 200 at the same time to optimize training
  • the maximum number of iterations is 300
  • the mutation probability is 0.05
  • the crossover probability is 0.95.
  • the number of nodes in hidden layers 1 and 2 is optimized in the range [10,128], and the activation function optimization space is [relu, sigmoid, tanh].
  • the objective function when acquiring the above-mentioned objective function, it is first necessary to determine the calculation method of the objective function, and then calculate the corresponding objective function value for each training according to relevant data information.
  • the objective function can be as follows:
  • y area, i is the actual value of the solder paste volume
  • y vol, i is the actual value of the area of the solder paste
  • sample predicted value of the volume is the sample prediction value of the area
  • m is the sample size.
  • the activation function represented by 0 in the activation function is relu
  • 1 represents sigmoid
  • 2 represents tanh
  • the prediction error is the average relative percentage error.
  • FIG. 8 is a schematic flowchart of the steps of predicting and recommending process parameters provided by an embodiment of the present application. Wherein this step includes: step S801 to step S804.
  • Step S801 when receiving the parameter optimization instruction, load the trained optimization model, and receive the input process parameters to be analyzed;
  • Step S802 input the process parameters to be analyzed into the optimization model, and calculate the function value based on the process parameters to be analyzed;
  • Step S803 when the function value is greater than a preset threshold, output the predicted process parameters according to the optimization model.
  • the stored optimized model can be used for quality prediction of parameter quality and recommendation of process parameters.
  • a new objective function is set in the model, where the new objective function is:
  • vol means volume
  • area means area
  • 100 means perfect volume and area
  • means absolute value
  • the volume and area predicted by the model After receiving the input process parameters to be analyzed, the volume and area predicted by the model will be output, and then brought into the objective function to determine whether the currently received process parameters to be analyzed are qualified. When it is determined to be qualified, it will directly output qualified instruction information, and when it is determined to be unqualified, it will predict the process parameters.
  • predicting and recommending process parameters set the optimization interval of the process parameters to be optimized, optimize the objective function through the elite-retaining genetic algorithm, obtain the optimal solution, and obtain the recommended process parameter combination for recommendation and feedback.
  • Blade pressure Squeegee speed Release speed Release distance Automatic cleaning count cleaning speed objective function 5.34 63.21 2.40 1.13 12 43.04 0.193 11.85 50.70 2.93 0.13 30 36.68 2.594 8.48 47.64 2.95 0.13 27 33.55 3.002
  • the training sample data for training is firstly reconstructed to the influencing factors, specifically, the relationship between the influencing factors is established through feature crossover. Then, principal component analysis is performed on the constructed features to complete the reconstruction of influencing factors, and the influencing factors are deeply excavated, and then different deep neural network models are integrated to obtain the final optimization model. It solves the problem of poor prediction effect of a single fixed model, greatly improves the accuracy of the model, and finally searches out the combination of process parameters for the best printing quality through the elite-retaining genetic algorithm, which has better applicability.
  • FIG. 9 is a schematic structural block diagram of a computer device provided by an embodiment of the present application.
  • the device may be a tablet computer, a notebook, or a desktop computer.
  • the device also includes a processor, memory for storing a computer program.
  • the processor is configured to execute the computer program and implement any one of the SMT printing parameter optimization model training methods provided in the embodiments of the present application when executing the computer program.
  • the processor may be a central processing unit (Central Processing Unit, CPU), and the processor may also be other general-purpose processors, digital signal processors (Digital Signal Processor, DSP for short), application-specific integrated circuits (Application Specific Integrated Circuit, referred to as: ASIC), Field-Programmable Gate Array (Field-Programmable Gate Array, referred to as: FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.
  • the general-purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
  • Embodiments of the present application also provide a computer-readable storage medium, where the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the processor implements the The training method of the SMT printing parameter optimization model described in any one.
  • the functional modules/units in the system, and the device can be implemented as software, firmware, hardware, and an appropriate combination thereof.
  • the division between functional modules/units mentioned in the above description does not necessarily correspond to the division of physical components; for example, one physical component may have multiple functions, or one function or step may be composed of several physical components. Components cooperate to execute.
  • Some or all of the physical components may be implemented as software executed by a processor, such as a central processing unit, digital signal processor, or microprocessor, or as hardware, or as an integrated circuit, such as an application-specific integrated circuit .
  • Such software may be distributed on computer-readable storage media, which may include computer-readable storage media (or non-transitory media) and communication media (or transitory media).
  • computer-readable storage medium includes both volatile and non-volatile media implemented in any method or technology for storage of information, such as computer-readable instructions, data structures, program modules, or other data. Volatile, removable and non-removable media.
  • Computer-readable storage media include, but are not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disk (DVD) or other optical disk storage, magnetic cartridges, tape, magnetic disk storage or other magnetic storage devices, Or any other medium that can be used to store desired information and that can be accessed by a computer.
  • communication media typically embodies computer readable instructions, data structures, program modules, or other data in a modulated data signal such as a carrier wave or other transport mechanism, and may include any information delivery media .
  • the computer-readable storage medium may be an internal storage unit of the electronic device described in the foregoing embodiments, such as a hard disk or a memory of the electronic device.
  • the computer-readable storage medium can also be an external storage device of the electronic device, such as a plug-in hard disk equipped on the electronic device, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD ) card, flash memory card (Flash Card), etc.
  • the electronic device and the computer-readable storage medium provided by the foregoing embodiments display at least two virtual keyboards in different display areas on the display screen when the user inputs information, so that information can be input through at least two virtual keyboards;
  • the software infers the difficulty of input information by monitoring the state of the sensor, which enhances the security of information input.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Genetics & Genomics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Computer Hardware Design (AREA)
  • Geometry (AREA)
  • Quality & Reliability (AREA)
  • Physiology (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A method for training an SMT printing parameter optimization model, a device, and a storage medium. The method comprises: receiving initial production data; performing influencing factor reconstruction according to the initial production data to obtain an influencing factor data packet; loading an optimization model to be trained; and training said optimization model according to the influencing factor data packet to obtain a trained optimization model.

Description

SMT印刷参数优化模型的训练方法、设备和存储介质Training method, equipment and storage medium for SMT printing parameter optimization model
相关申请的交叉引用Cross References to Related Applications
本申请基于申请号为“202110715871.1”、申请日为2021年06月24日的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的全部内容在此以引入方式并入本申请。This application is based on the Chinese patent application with the application number "202110715871.1" and the filing date is June 24, 2021, and claims the priority of the Chinese patent application. The entire content of the Chinese patent application is hereby incorporated by reference. Application.
技术领域technical field
本申请属于智能制造技术领域,涉及表面贴装技术(Surface Mounted Technology,简称:SMT)产线印刷及锡膏检测(SolderPaste Inspection,简称:SPI)阶段,尤其涉及一种SMT印刷参数优化模型的训练、设备和存储介质。This application belongs to the field of intelligent manufacturing technology, and involves surface mount technology (Surface Mounted Technology, referred to as: SMT) production line printing and solder paste inspection (SolderPaste Inspection, referred to as: SPI) stage, especially relates to a SMT printing parameter optimization model training , equipment and storage media.
背景技术Background technique
在表面贴装技术中,大部分质量问题的出现是在锡膏印刷阶段,因此如何设置更加合理和正确的印刷工艺参数是尤为重要。In surface mount technology, most quality problems occur in the solder paste printing stage, so how to set more reasonable and correct printing process parameters is particularly important.
而在进行工艺参数的设置时,主要方式有两种,一种是依靠人为经验利用试印刷的方式来进行设置,还有一种则是利用数据挖掘模型实现对SMT产线印刷工艺参数的优化,但是这些方式存在有各种问题和不足,包括:对于生产过程涉及到的各类工艺参数分析不足,漏掉了一些影响锡膏印刷质量的重要参数、单一模型对印刷参数与质量之间的分析不足以及未充分分析印刷质量指标。进而导致相关质量预测模型及工艺参数优化准确性仍然不高。When setting process parameters, there are two main ways, one is to rely on human experience and use trial printing to set, and the other is to use data mining models to optimize the printing process parameters of SMT production lines. However, there are various problems and deficiencies in these methods, including: insufficient analysis of various process parameters involved in the production process, missing some important parameters that affect the quality of solder paste printing, and the analysis of the relationship between printing parameters and quality by a single model Insufficient and insufficient analysis of print quality indicators. As a result, the accuracy of relevant quality prediction models and process parameter optimization is still not high.
因此,现在亟需一种能够更好的提高工艺参数优化和预测准确性的方式。Therefore, there is an urgent need for a method that can better improve the accuracy of process parameter optimization and prediction.
发明内容Contents of the invention
本申请实施例提供了一种SMT印刷参数优化模型的训练方法,所述方法包括以下步骤:接收初始生产数据;根据所述初始生产数据进行影响因素重构,得到影响因素数据包;加载待训练优化模型;根据所述影响因素数据包对所述待训练优化模型进行训练,得到训练好的优化模型。An embodiment of the present application provides a training method for an SMT printing parameter optimization model, the method comprising the following steps: receiving initial production data; reconstructing influencing factors according to the initial production data to obtain influencing factor data packets; loading to be trained Optimizing the model: training the optimization model to be trained according to the influencing factor data package to obtain a trained optimization model.
本申请实施例还提供了一种计算机设备,所述设备包括存储器以及处理器;所述存储器用于存储计算机程序;所述处理器,用于执行所述计算机程序并在执行所述计算机程序时实现如前述所述的SMT印刷参数优化模型的训练方法的步骤。The embodiment of the present application also provides a computer device, the device includes a memory and a processor; the memory is used to store a computer program; the processor is used to execute the computer program and when executing the computer program Steps for realizing the training method of the SMT printing parameter optimization model as described above.
本申请实施例还提供了一种存储介质,用于计算机可读存储,所述存储介质存储有一个或者多个程序,所述一个或者多个程序可被一个或者多个处理器执行,以实现前述所述的 SMT印刷参数优化模型的训练方法的步骤。The embodiment of the present application also provides a storage medium for computer-readable storage, the storage medium stores one or more programs, and the one or more programs can be executed by one or more processors to realize The steps of the training method of the aforementioned SMT printing parameter optimization model.
附图说明Description of drawings
为了更清楚地说明本申请实施例技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions of the embodiments of the present application more clearly, the drawings that need to be used in the description of the embodiments will be briefly introduced below. Obviously, the drawings in the following description are some embodiments of the present application. Ordinary technicians can also obtain other drawings based on these drawings on the premise of not paying creative work.
图1为本申请一实施例提供的一种SMT印刷参数优化模型的训练方法的流程示意图;Fig. 1 is the schematic flow chart of the training method of a kind of SMT printing parameter optimization model that an embodiment of the present application provides;
图2为本申请一实施例提供的得到影响因素数据包的步骤的流程示意图;Fig. 2 is a schematic flow chart of the steps of obtaining the influencing factor data package provided by an embodiment of the present application;
图3为本申请另一实施例提供的得到影响因素数据包的步骤的流程示意图;FIG. 3 is a schematic flowchart of the steps of obtaining the influencing factor data package provided by another embodiment of the present application;
图4为本申请又一实施例提供的得到影响因素数据包的步骤的流程示意图;Fig. 4 is a schematic flowchart of the steps of obtaining the influencing factor data package provided by another embodiment of the present application;
图5为本申请一实施例提供的对待训练优化模型进行训练的步骤的流程示意图;FIG. 5 is a schematic flowchart of the steps of training the optimization model to be trained provided by an embodiment of the present application;
图6为本申请一实施例提供的进行迭代训练的步骤的流程示意图;FIG. 6 is a schematic flowchart of the steps of iterative training provided by an embodiment of the present application;
图7为本申请一实施例提供的得到训练好的优化模型的步骤的流程框图示意图;FIG. 7 is a schematic flow diagram of the steps of obtaining a trained optimization model provided by an embodiment of the present application;
图8为本申请一实施例提供的工艺参数预测和推荐的步骤的流程示意图;Fig. 8 is a schematic flow chart of the process parameter prediction and recommended steps provided by an embodiment of the present application;
图9为本申请一实施例提供的一种计算机设备的结构示意性框图。Fig. 9 is a schematic structural block diagram of a computer device provided by an embodiment of the present application.
具体实施方式detailed description
本申请实施例的主要目的在于提出一种SMT印刷参数优化模型的训练、设备和存储介质,旨在提高SMT产线印刷质量预测结果,以及提高工艺参数推荐准确性。The main purpose of the embodiments of the present application is to propose a training, equipment and storage medium for an SMT printing parameter optimization model, aiming at improving the prediction result of printing quality in the SMT production line and improving the accuracy of process parameter recommendation.
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The following will clearly and completely describe the technical solutions in the embodiments of the present application with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.
附图中所示的流程图仅是示例说明,不是必须包括所有的内容和操作/步骤,也不是必须按所描述的顺序执行。例如,有的操作/步骤还可以分解、组合或分合并,因此实际执行的顺序有可能根据实际情况改变。The flow charts shown in the drawings are just illustrations, and do not necessarily include all contents and operations/steps, nor must they be performed in the order described. For example, some operations/steps can be decomposed, combined or merged, so the actual order of execution may be changed according to the actual situation.
如图1所示,图1为本申请一实施例提供的一种SMT印刷参数优化模型的训练方法的流程示意图,该方法包括以下步骤:As shown in Figure 1, Figure 1 is a schematic flow diagram of a training method for an SMT printing parameter optimization model provided by an embodiment of the present application, the method comprising the following steps:
步骤S101、接收输入的初始生产数据。Step S101, receiving input initial production data.
SMT是表面组装技术(表面贴装技术),英文全称为Surface Mount Technology,是目前电子组装行业里最流行的一种技术和工艺,它是一种将无引脚或短引线表面组装元器件(简称SMC/SMD,中文称片状元器件)安装在印制电路板(Printed Circuit Board,简称:PCB) 的表面或其它基板的表面上,通过回流焊或浸焊等方法加以焊接组装的电路装连技术。SMT is Surface Mount Technology (Surface Mount Technology), which is called Surface Mount Technology in English. It is the most popular technology and process in the electronic assembly industry. SMC/SMD for short, chip components in Chinese) are installed on the surface of a printed circuit board (Printed Circuit Board, referred to as: PCB) or on the surface of other substrates, and are soldered and assembled by reflow soldering or dip soldering. technology.
而在进行表面组装时,并不是随意的进行的,需要根据实际的情况,包括产品的实际信息,来确定实际的组装或贴装方式,同时在进行贴装时如何设定贴装的参数也是非常有必要的,因此需要合理且正确的选择相关的工艺参数以实现对器件的表面贴装。When performing surface assembly, it is not done randomly. It is necessary to determine the actual assembly or placement method according to the actual situation, including the actual information of the product. At the same time, how to set the placement parameters during placement is also It is very necessary, so it is necessary to select the relevant process parameters reasonably and correctly to realize the surface mount of the device.
在一实施例中,为了更好的实现对元器件的贴装,需要更加正确的工艺参数,而为了在进行贴装时有更好的工艺参数,此时会预先构建和训练相关的模型,以使得根据实际的情况推荐出更加合适的工艺参数的组合。而在训练SMT印刷参数优化模型时,首先接收所输入的初始生产数据,进而根据所得到的初始生产数据对需要进行训练的参数优化模型进行训练。In one embodiment, in order to better realize the placement of components, more correct process parameters are required, and in order to have better process parameters during placement, relevant models will be pre-built and trained at this time, In order to recommend a more suitable combination of process parameters according to the actual situation. When training the SMT printing parameter optimization model, the input initial production data is firstly received, and then the parameter optimization model to be trained is trained according to the obtained initial production data.
步骤S102、根据所述初始生产数据进行影响因素重构,得到重构后的影响因素数据包。Step S102 , reconstruct the influencing factors according to the initial production data, and obtain the reconstructed influencing factor data package.
在接收到所输入的初始生产数据之后,需要对初始生产数据进行处理,具体地,对所接收到的初始生成数据进行影响因素的重构,以得到重构的影响因素数据包。After the input initial production data is received, the initial production data needs to be processed, specifically, the received initial production data is reconstructed to obtain a reconstructed influencing factor data package.
实际上,在对SMT印刷参数优化模型(以下简称为:优化模型)进行训练时,首先确定进行训练的训练样本,然后根据所得到的训练样本对需要进行训练的优化模型进行训练。具体地,首先会接收所输入的初始生产数据,然后对所接收到的初始生产数据进行影响因素的重构,以得到完成影响因素重构后的影响因素数据包。In fact, when training the SMT printing parameter optimization model (hereinafter referred to as the optimization model), firstly, the training samples for training are determined, and then the optimization model to be trained is trained according to the obtained training samples. Specifically, firstly, the input initial production data is received, and then the impact factors are reconstructed on the received initial production data, so as to obtain the impact factor data package after the reconstruction of the influence factors is completed.
示例性的,对于进行模型训练的而输入的初始生产数据包含有很多,具体包括人、机、料、法、环、测采集SMT产线印刷阶段及SPI检测所有可采集数据,然后再对所接收到的数据进行进一步的处理。Exemplarily, the initial production data input for model training includes a lot, specifically including manpower, machine, material, method, environment, measurement and collection of all collectable data in the printing stage of the SMT production line and SPI detection, and then the The received data is further processed.
比如,以SMT产线锡膏印刷工艺为例,在获取SMT产线印刷阶段原始生产数据时,所得到的原始生产数据包括以下六类数据,分别为:时间批次数据、PCB板属性数据、工艺参数数据、小要素、印刷过程数据以及SPI检测数据。其中,PCB板属性数据包含PCB板长、板宽以及板高;工艺参数数据指包括刮刀压力、刮刀速度、脱模速度、脱模距离、自动清洗计数以及清洗速度;小要素包括刮刀分离距离、刮刀分离速度、人工清洗计数、清洗选项、工作台印刷高度补偿、自动清洗以及手工清洗;印刷过程数据包括印刷时间、生产计数、刮刀计数、Mask计数、Mark_1X轴坐标、Mark_1Y轴坐标、Mark_2X轴坐标、Mark_2Y轴坐标、工作分离延迟、起始偏移、结束偏移、向下偏移、清洗供给时间、擦纸等待距离、机器等待时间、锡膏使用计数、平均压力、最小压力、最大压力以及后刮刀计数;SPI检测数据包括锡膏体积、面积。For example, taking the solder paste printing process of the SMT production line as an example, when obtaining the original production data of the printing stage of the SMT production line, the obtained original production data includes the following six types of data, namely: time batch data, PCB board attribute data, Process parameter data, small elements, printing process data and SPI detection data. Among them, PCB board attribute data includes PCB board length, board width and board height; process parameter data includes scraper pressure, scraper speed, demoulding speed, demoulding distance, automatic cleaning count and cleaning speed; small elements include scraper separation distance, Squeegee separation speed, manual cleaning count, cleaning options, table printing height compensation, automatic cleaning and manual cleaning; printing process data include printing time, production count, scraper count, Mask count, Mark_1X axis coordinates, Mark_1Y axis coordinates, Mark_2X axis coordinates , Mark_2 Y-axis coordinates, work separation delay, start offset, end offset, down offset, cleaning supply time, wiping paper waiting distance, machine waiting time, solder paste usage count, average pressure, minimum pressure, maximum pressure and Post-squeegee counting; SPI detection data includes solder paste volume and area.
需要说明的是,在接收到对优化模型进行训练的初始生产数据时,并不是直接使用所得到的初始生产数据进行训练,而是对初始生产数据进行影响因素重构。具体地,初始生产数据中记录着很多不同的对实际的SMT印刷有影响的因素,同时这些因素的数量也是有很多的, 若此时直接使用所有的数据进行模型的训练,会增加模型训练的负担,同时也会使得模型训练的效率极低,并且由于是单一的使用所有数据进行模型的训练,会使得模型过于单一,进而使得模型在进行工艺参数的推荐时,推荐准确性不高。因此在接收到初始生产数据之后会对初始生产数据进行影响因素重构,以充分考虑特征隐藏信息对模型的影响,使得训练之后的模型可以具有更高的预测准确率和推荐准确率。It should be noted that when the initial production data for training the optimization model is received, the obtained initial production data is not directly used for training, but the influencing factors are reconstructed for the initial production data. Specifically, the initial production data records many different factors that have an impact on the actual SMT printing. At the same time, there are many of these factors. If all the data are directly used for model training at this time, it will increase the cost of model training. At the same time, it will also make the efficiency of model training extremely low, and because all the data is used for model training, the model will be too single, and the model will not be accurate when recommending process parameters. Therefore, after receiving the initial production data, the influencing factors of the initial production data will be reconstructed to fully consider the impact of feature hidden information on the model, so that the trained model can have higher prediction accuracy and recommendation accuracy.
参照图2,图2为本申请一实施例提供的得到影响因素数据包的步骤的流程示意图。Referring to FIG. 2 , FIG. 2 is a schematic flowchart of the steps of obtaining the influencing factor data package provided by an embodiment of the present application.
在一实施例中,在接收到所输入的初始生产数据之后,将会对初始生产数据进行影响因素的重构,因此如图2所示,具体的影响因素重构的步骤包括步骤S201至步骤S202。In one embodiment, after receiving the inputted initial production data, the initial production data will be reconstructed for influencing factors, so as shown in FIG. 2 , the specific steps for reconstructing influencing factors include step S201 to step S202.
步骤S201、对所述初始生产数据进行过滤筛选,得到不包含固定属性的原始数据集。Step S201. Filter and screen the initial production data to obtain an original data set that does not contain fixed attributes.
在得到初始生产数据之后,由于初始生产数据中所包含的所有数据并不一定是对SMT印刷具有影响的数据,因此在接收到初始生产数据之后,将会对初始生产数据中所包含的所有特征/数据进行过滤筛选,以得到用来进行模型训练的原始数据集。After the initial production data is obtained, since all the data contained in the initial production data are not necessarily the data that have an impact on SMT printing, after receiving the initial production data, all the features contained in the initial production data will be /Data is filtered to obtain the original data set used for model training.
在一实施例中,在获取初始生产数据时,会存在部分并不会对实际的工艺产生影响的数据,比如某一类数据在所有的数据中的数值均为1(一固定的数值),那显然这一类数据对于整体的SMT印刷不会产生影响,因此需要将这一类数据过滤筛选掉。In one embodiment, when obtaining the initial production data, there will be some data that will not affect the actual process, such as a certain type of data in which the value of all data is 1 (a fixed value), Obviously, this type of data will not affect the overall SMT printing, so this type of data needs to be filtered out.
示例性的,所得到的初始生产数据包括人、机、料、法、环、测采集SMT产线印刷阶段及SPI检测所有可采集数据,而在通过进行数据的过滤筛选之后所得到的原始数据集包含PCB板属性参数、印刷工艺参数、印刷过程参数数据以及SPI检测数据等。而对于不是这些属性的数据或者说因素则可能是需要进行过滤筛选掉的数据。以SMT产线锡膏印刷工艺为例,在获取了相关的初始生产数据之后,会进行相应的数据过滤和筛选,以将部分对SMT工艺没有影响的特征因素剔除,进而得到包含有所有影响因素的原始数据集,而此时所得到的原始数据集可以如下表1所示:Exemplarily, the obtained initial production data includes manpower, machine, material, method, environment, measurement, collection of SMT production line printing stage and SPI detection all collectable data, and the original data obtained after data filtering The set includes PCB board attribute parameters, printing process parameters, printing process parameter data and SPI detection data, etc. For data or factors that are not these attributes, it may be data that needs to be filtered out. Taking the solder paste printing process of the SMT production line as an example, after obtaining the relevant initial production data, the corresponding data filtering and screening will be carried out to remove some characteristic factors that have no influence on the SMT process, and then obtain the The original data set, and the original data set obtained at this time can be shown in Table 1 below:
表1Table 1
Figure PCTCN2022077781-appb-000001
Figure PCTCN2022077781-appb-000001
需要说明的是,在进行数据的筛选时,对于需要进行过滤掉的数据可以是人为进行筛选的,还可以是自动实现筛选的,比如,由于SMT印刷时所有的数据都是数字化的,因此在进行筛选时可以通过设定相应的阈值来实现,进而通过所设定的阈值确定哪一或者哪些因素是可以被过滤和筛选掉的。具体地,在对初始生产数据进行过滤筛选时包括:It should be noted that when screening data, the data that needs to be filtered can be manually screened, or can be automatically screened. Screening can be achieved by setting corresponding thresholds, and then it is determined which factor or factors can be filtered and screened out through the set thresholds. Specifically, the filtering of initial production data includes:
确定初始生产数据所包含的特征,并确定特征的特征变化值;获取预设变化值,并根据特征变化值以及预设变化值对初始生产数据所包含的数据进行筛选;其中,将特征变化值小于预设变化值的第一特征删除;将特征变化值大于或者等于预设变化值的第二特征作为原始数据集。Determine the characteristics contained in the initial production data, and determine the characteristic change value of the characteristic; obtain the preset change value, and filter the data contained in the initial production data according to the characteristic change value and the preset change value; wherein, the characteristic change value The first feature that is less than the preset change value is deleted; the second feature whose feature change value is greater than or equal to the preset change value is used as the original data set.
因此,在对初始生产数据进行过滤时,确定初始生产数据中所包含的特征,其中特征为一数据类别,然后对特征所对应的所有数值汇总分析,以确定各特征所对应的特征变化值,同时获取预先所设定的变化值,进而通过特征变化值与变化预设变化值的对比,以对初始生产数据进行过滤,同时确定哪一或者哪些特征/因素是需要被过滤掉的,其中,将特征变化值大小预设变化值的第一特征删除,而将特征变化值大于或者等于预设变化值的第二特征作为原始数据集,以利用所得到的原始数据集对优化模型进行训练。Therefore, when filtering the initial production data, determine the features contained in the initial production data, where the feature is a data category, and then summarize and analyze all the values corresponding to the feature to determine the feature change value corresponding to each feature. At the same time, the pre-set change value is obtained, and then the initial production data is filtered by comparing the characteristic change value with the change preset change value, and at the same time, it is determined which or which features/factors need to be filtered out, wherein, The first feature whose feature change value is greater than or equal to the preset change value is deleted, and the second feature whose feature change value is greater than or equal to the preset change value is used as the original data set, so as to use the obtained original data set to train the optimization model.
在确定某一特征所对应的特征变化值时,可以通过计算这一特征所对应的方差来确定,然后利用所设定的一个阈值确定是否为需要进行过滤和筛选掉的数据。而在通常情况下,为了保证模型训练的准确性,在对初始生产数据进行过滤时,是将完全一样的一类数据过滤掉,那么此时在设定预设变化值时,预设变化值为零。When determining the feature change value corresponding to a certain feature, it can be determined by calculating the variance corresponding to this feature, and then using a set threshold to determine whether it is the data that needs to be filtered and screened out. Under normal circumstances, in order to ensure the accuracy of model training, when filtering the initial production data, the same type of data is filtered out. Then when setting the preset change value, the preset change value to zero.
步骤S202、利用特征交叉以及主成分分析对所述原始数据集进行影响因素重构,得到重构的影响因素数据包。Step S202, using feature crossover and principal component analysis to reconstruct the influencing factors of the original data set, and obtain a reconstructed influencing factor data package.
其中,特征交叉的主成分分析FI-PCA用来对数据进行特征重构,同时在完成特征重构之后对重构后的特征的相关数据进行降维处理。Among them, the principal component analysis FI-PCA of feature intersection is used to reconstruct the features of the data, and at the same time, after the feature reconstruction is completed, the dimensionality reduction processing is performed on the relevant data of the reconstructed features.
在得到进行过滤筛选之后的原始数据集之后,利用特征交叉以及主成分分析实现对原始数据集进行影响因素的重构,具体地,首先进行特征交叉,然后在进行主成分分析以得到用于进行模型训练的影响因素数据包。After obtaining the original data set after filtering and screening, use feature crossover and principal component analysis to realize the reconstruction of the influencing factors of the original data set, specifically, first perform feature crossover, and then perform principal component analysis to obtain Influencing factor data package for model training.
在实际应用中,影响因素的重构的具体过程包括:首先对原始数据集进行数据规约,消除数据维度对影响因素重构的影响,然后将待优化工艺参数之外的影响因素与其他影响因素交叉,实现所有影响因素的特征交叉,完成影响因素之间交叉作用,接着对得到的新的影响因素进行主成分分析,以确定主成分,进而实现进行降维得到重构影响因素,最后将所得到的重构影响因素与待优化工艺参数合并,以得到SMT产线印刷阶段影响因素数据包。In practical applications, the specific process of reconstruction of influencing factors includes: first, data specification is performed on the original data set, and the influence of data dimension on the reconstruction of influencing factors is eliminated; Crossover, realize the characteristic crossover of all influencing factors, complete the crossover between influencing factors, and then conduct principal component analysis on the obtained new influencing factors to determine the principal components, and then realize dimensionality reduction to obtain reconstructed influencing factors, and finally combine the obtained new influencing factors The obtained reconstructed influencing factors are combined with the process parameters to be optimized to obtain the data package of influencing factors in the printing stage of the SMT production line.
参照图3,图3为本申请另一实施例提供的得到影响因素数据包的步骤的流程示意图。Referring to FIG. 3 , FIG. 3 is a schematic flowchart of the steps of obtaining the influencing factor data package provided by another embodiment of the present application.
其中,该过程包括步骤S301至步骤S304。Wherein, the process includes step S301 to step S304.
步骤S301、对所述原始数据集进行数据规约,得到规约后的原始数据集。Step S301. Perform data reduction on the original data set to obtain a reduced original data set.
在得到原始数据集时,由于原始数据集中所记录的数据是按照实际的数据所记录的,因此会存在数据值之间差异巨大的情况,这样除了会对影响因素的重构存在影响,还会对后续的模型训练有所影响,比如降低模型训练速度,因此合理的对数据进行处理,可以降低数据差异所带来的影响,同时还可以提升模型后续训练的效率。When the original data set is obtained, since the data recorded in the original data set is recorded according to the actual data, there will be a huge difference between the data values, which will not only affect the reconstruction of the influencing factors, but also It has an impact on subsequent model training, such as reducing the speed of model training. Therefore, reasonable processing of data can reduce the impact of data differences and improve the efficiency of subsequent model training.
在对原始数据集中记录的数据进行规约时,实际上是对数据进行归一化处理,以将数据规约在区间(0,1)之间,以消除量纲对数据挖掘速度、模型收敛速度的影响。When reducing the data recorded in the original data set, the data is actually normalized to reduce the data between (0, 1), so as to eliminate the influence of dimension on data mining speed and model convergence speed. influences.
而进行归一化规约的计算公式具体如下:The calculation formula for normalization reduction is as follows:
Figure PCTCN2022077781-appb-000002
Figure PCTCN2022077781-appb-000002
其中,x *为进行归一化所得到的数值,x为一特征在进行归一化之前所对应的实际数值,x max为该类特征中的最大值,x min为该类特征中的最小值。 Among them, x * is the value obtained by normalization, x is the actual value corresponding to a feature before normalization, x max is the maximum value of this type of feature, and x min is the minimum value of this type of feature value.
步骤S302、确定所述待训练优化模型所对应的待优化工艺参数,并将规约后的所述原始数据集中所包含的待优化工艺参数剔除,得到第一数据集。Step S302. Determine the process parameters to be optimized corresponding to the optimization model to be trained, and remove the process parameters to be optimized contained in the reduced original data set to obtain a first data set.
在进行模型训练时所接收到的初始生产数据中包含有需要进行优化的工艺参数,而在进行模型训练时需要确定除了进行优化的工艺参数之外的其他参数对进行优化的工艺参数的影响,因此在进行数据规约之后,对于所得到的规约后的原始数据集,需要将其中所包含的待优化工艺参数提取出来,然后对不包含有待优化工艺参数的其他部分的特征数据进行处理。The initial production data received during model training contains process parameters that need to be optimized, and it is necessary to determine the impact of other parameters on the optimized process parameters in addition to the optimized process parameters during model training, Therefore, after data reduction, for the original data set obtained after reduction, it is necessary to extract the process parameters to be optimized contained in it, and then process the characteristic data of other parts that do not contain the process parameters to be optimized.
因此,在得到进行规约后的原始数据集的时候,确定当前进行训练的优化模型所对应的待优化工艺参数,然后将完成数据规约之后的原始数据集中所包含的与待优化工艺参数所对应的数据进行剔除,以得到第一数据集。Therefore, when the original data set after reduction is obtained, the process parameters to be optimized corresponding to the optimization model currently being trained are determined, and then the process parameters corresponding to the process parameters to be optimized contained in the original data set after data reduction are completed The data are eliminated to obtain the first data set.
示例性的,比如该优化模型在进行工艺参数优化和推荐时,工艺参数包括:刮刀压力、刮刀速度、脱模速度、脱模距离、自动清洗计数以及清洗速度,那么在对规约后的原始数据集进行数据的剔除时,将此六部分特征所对应的数据进行剔除,而完成数据剔除之后所得到的数据集为第一数据集,以用于对待训练优化模型进行训练和优化。Exemplary, for example, when the optimization model optimizes and recommends process parameters, the process parameters include: scraper pressure, scraper speed, demoulding speed, demoulding distance, automatic cleaning count and cleaning speed, then the original data after the protocol When the data set is eliminated, the data corresponding to the six features are eliminated, and the data set obtained after data elimination is the first data set, which is used for training and optimizing the optimization model to be trained.
此时,在基于表1所得到的原始数据集进行归一化的数据规约处理以及将待优化工艺参数剔除之后,此时所得到的原始数据集可以如下表2所示:At this time, after performing normalized data reduction processing and removing process parameters to be optimized based on the original data set obtained in Table 1, the original data set obtained at this time can be shown in Table 2 below:
表2Table 2
00 11 22 ……... 2626 2727 2828
-0.200753-0.200753 0.4233550.423355 0.5832490.583249 ……... 0.7313650.731365 0.2119440.211944 -0.267736-0.267736
-0.047214-0.047214 0.4259750.425975 0.5857250.585725 ……... 0.7563900.756390 0.2732570.273257 -0.267736-0.267736
-0.091925-0.091925 0.4312160.431216 0.5906780.590678 ……... 0.7568290.756829 0.2978460.297846 -0.267736-0.267736
……... ……... ……... ……... ……... ……... ……...
0.0524190.052419 0.2318090.231809 -1.241435-1.241435 ……... 0.7605610.760561 0.2248690.224869 -0.267736-0.267736
步骤S303、对所述第一数据集所包含的影响因素进行特征交叉相乘,以得到第二数据集。Step S303, performing feature cross multiplication on the influencing factors included in the first data set to obtain a second data set.
在完成对规约后的原始数据集进行数据剔除以得到第一数据集之后,将会对所得到的第一数据集中所包含的数据进行处理,具体地,在得到第一数据集时,对第一数据集中所包含的影响因素进行特征交叉相乘,进而在完成处理之后得到对应的第二数据集。After completing the data elimination of the reduced original data set to obtain the first data set, the data contained in the obtained first data set will be processed, specifically, when the first data set is obtained, the first data set will be processed. The influencing factors contained in a data set are subjected to feature cross multiplication, and then the corresponding second data set is obtained after the processing is completed.
在一实施例中,在得到第一数据集之后,采用特征交叉(Feature intersection)对第一数据集中所包含的数据进行处理,其中特征交叉是通过对样本数据原有特征进行函数运算从而挖掘出原有特征之间的隐藏信息特征,而对于SMT产线印刷阶段除工艺参数之外还有小要素、印刷过程参数等,这些参数特征之间物理逻辑关系不明,且数量巨大,通过特征交叉挖掘隐藏特征,让模型学习到更复杂的特征,从而提高算法精度。In one embodiment, after the first data set is obtained, the data contained in the first data set is processed by feature intersection, wherein the feature intersection is mined by performing functional operations on the original features of the sample data. The hidden information features between the original features, and for the printing stage of the SMT production line, in addition to the process parameters, there are small elements, printing process parameters, etc., the physical and logical relationship between these parameter features is unclear, and the number is huge. Hiding features allows the model to learn more complex features, thereby improving the accuracy of the algorithm.
示例性的,在进行特征交叉时,是将特征与特征进行交叉以得到一个新的特征,其中特征交叉的实际计算公式如下:Exemplarily, when performing feature crossing, a new feature is obtained by crossing features with features, where the actual calculation formula of feature crossing is as follows:
x i,j=x i×x j x i,j = x i × x j
其中,x i表示特征的第i个数据,x j为特征的第j个特征,x i,j为x i与x j进行积交互的结果。 Among them, x i represents the i-th data of the feature, x j is the j-th feature of the feature, and x i, j is the product interaction result of x i and x j .
按照该方式对规约和数据剔除后的原始数据集进行特征交叉之后,得到对应的第二数据集,而此时所得到的第二数据集可以如下表3所示:After performing feature crossover on the original data set after reduction and data elimination in this way, the corresponding second data set is obtained, and the second data set obtained at this time can be shown in Table 3 below:
表3table 3
00 11 22 ……... 2626 2727 2828
0.0403020.040302 -0.084990-0.084990 -0.117089-0.117089 ……... 0.0449200.044920 -0.056745-0.056745 0.0716830.071683
0.0022290.002229 -0.020112-0.020112 -0.027655-0.027655 ……... 0.0746700.074670 -0.073161-0.073161 0.0716830.071683
0.0084500.008450 -0.039640-0.039640 -0.054298-0.054298 ……... 0.0887120.088712 -0.079744-0.079744 0.0716830.071683
……... ……... ……... ……... ……... ……... ……...
0.0084140.008414 -0.040275-0.040275 -0.054863-0.054863 ……... 0.0449200.044920 -0.056745-0.056745 0.0716830.071683
步骤S304、利用主成分分析对所述第二数据集中所包含的每一影响因素进行分析,得到影响因素数据包。Step S304 , using principal component analysis to analyze each influencing factor included in the second data set to obtain an influencing factor data package.
主成分分析(Principal Component Analysis,简称:PCA)是一种线性降维方法,该方法能够有效消除特征之间的相关性并提升特征的表达能力。而在原有特征交叉后,虽然提取出了更多隐藏特征,但产生的交叉特征数量巨大,需要进行降维以少量互不相关特征替代原有特征,提高模型训练速度。Principal Component Analysis (PCA for short) is a linear dimensionality reduction method, which can effectively eliminate the correlation between features and improve the expressive ability of features. After the original features are crossed, although more hidden features are extracted, the number of crossed features generated is huge, and it is necessary to reduce the dimensionality and replace the original features with a small number of unrelated features to improve the speed of model training.
因此,在完成对第一数据集的特征交叉之后,利用主成分分析对所得到的第二数据集进行分析处理,以在完成分析处理之后得到相应的影响因素数据包。具体地,在对第二数据集中所包含的每一影响因素进行分析时,确定每一影响因素对实际的SMT工艺的影响程度,以在第二数据集中确定若干影响因素作为对SMT工艺具有较大影响的重构影响因素,而在得到重构影响因素之后,将重构影响因素与待优化工艺参数以及特定参数组合以得到影响因素数据包。Therefore, after the feature intersection of the first data set is completed, principal component analysis is used to analyze the obtained second data set, so as to obtain the corresponding influencing factor data package after the analysis process is completed. Specifically, when analyzing each influencing factor contained in the second data set, determine the degree of influence of each influencing factor on the actual SMT process, so as to determine a number of influencing factors in the second data set as having a greater impact on the SMT process. The reconstructed influencing factors with great influence, and after obtaining the reconstructed influencing factors, combine the reconstructed influencing factors with the process parameters to be optimized and specific parameters to obtain the influencing factor data package.
示例性的,特定参数包括锡膏体积和面积,因此此时所得到的影响因素数据包中所包含的特征或者参数包括:重构影响因素、待优化工艺参数(刮刀压力、刮刀速度、脱模速度、脱模距离、自动清洗计数以及清洗速度)以及锡膏体积和面积。Exemplarily, the specific parameters include solder paste volume and area, so the characteristics or parameters contained in the obtained influencing factor data package at this time include: reconstructing influencing factors, process parameters to be optimized (squeegee pressure, scraper speed, demoulding speed, ejection distance, automatic cleaning count, and cleaning speed), and solder paste volume and area.
参照图4,图4为本申请又一实施例提供的得到影响因素数据包的步骤的流程示意图。其中,在利用主成分分析对第二数据集进行分析处理以得到影响因素数据包时,包括:Referring to FIG. 4 , FIG. 4 is a schematic flowchart of the steps of obtaining the influencing factor data package provided by another embodiment of the present application. Wherein, when the principal component analysis is used to analyze and process the second data set to obtain the influencing factor data package, it includes:
步骤S401、利用主成分分析计算得到所述第二数据集中每一影响因素所对应的特征值,并对所述特征值进行排序;Step S401, using principal component analysis to calculate the eigenvalues corresponding to each influencing factor in the second data set, and sorting the eigenvalues;
步骤S402、根据从大到小的选择方式在排序后的所述特征值中选择若干特征值相加,并在相加得到的数值首次大于所设定的预设阈值时,确定进行相加的特征值所对应的特征为重构影响因素;Step S402, according to the selection method from large to small, select several eigenvalues from the sorted eigenvalues to add, and when the added value is greater than the set preset threshold for the first time, determine the number of eigenvalues to be added. The features corresponding to the eigenvalues are reconstruction influencing factors;
步骤S403、将所述重构影响因素与所述待优化工艺参数进行合并,得到影响因素数据包。Step S403, combining the reconstructed influencing factors and the process parameters to be optimized to obtain an influencing factor data package.
在利用主成分分析对第二数据集进行分析处理时,首先根据主成分分析计算得到第二数据集中每一影响因素所对应的特征值,其中第二数据集中的每一影响因素均未重构后的影响因素,并对所得到的特征值进行排序,如从大到小或者从小到大的排序,而在完成排序之后,按照从大到小的选择方式对特征值进行选择以及相加,同时将相加所得到的数值与预设阈值进行对比,以在第二数据集中得到相对应的重构影响因素,最后在得到重构影响因素之后,将重构影响因数与待优化工艺参数进行合并组合,以得到影响因素数据包。When using principal component analysis to analyze and process the second data set, first calculate the eigenvalues corresponding to each influencing factor in the second data set according to principal component analysis, wherein each influencing factor in the second data set is not reconstructed The final influencing factors, and sort the obtained eigenvalues, such as sorting from large to small or small to large, and after the sorting is completed, select and add the eigenvalues according to the selection method from large to small, At the same time, compare the value obtained by the addition with the preset threshold to obtain the corresponding reconstruction influencing factors in the second data set. Finally, after obtaining the reconstruction influencing factors, compare the reconstruction influencing factors with the process parameters to be optimized. Merge the combination to get the impact factor data package.
在实际应用中,不同的影响因素对实际的SMT工艺的影响程度不同,而特征交叉是对影响因素进行更深层次的提取,通过对隐藏特征的挖掘提高模型训练更加准确。而在完成特征交叉之后,由于交叉之后数据量的庞大,使得需要进行相应的处理,比如进行降维处理,以提取得到更加具有特征的部分数据作为模型训练的输入数据。In practical applications, different influencing factors have different influences on the actual SMT process, and feature crossover is to extract the influencing factors at a deeper level, and to improve model training by mining hidden features is more accurate. After the feature crossover is completed, due to the huge amount of data after the crossover, corresponding processing is required, such as dimensionality reduction processing, to extract more characteristic part of the data as input data for model training.
对于所得到第二数据集中的每一影响因素,都是对模型具有一定的影响的,但是影响程度的高低是有所不同的,特征交叉是为了提高模型训练的准确性,而为了保证模型训练的效率,使得并不能将所有的进行特征交叉所得到的数据作为模型训练的输入,因此需要对第二数据集中所包含的所有影响因素进行进一步的判断,以选择更加合适的影响因素作为模型训练的输入。For each influencing factor in the obtained second data set, it has a certain influence on the model, but the degree of influence is different. Feature crossover is to improve the accuracy of model training, and to ensure model training The efficiency makes it impossible to use all the data obtained by feature crossover as the input of model training, so it is necessary to further judge all the influencing factors contained in the second data set to select more appropriate influencing factors as model training input of.
在计算第二数据集中每一影响因素所对应的特征值时,也就是确定每一影响因素对在SMT工艺中对实工艺的影响程度,而特征值越大说明影响越大,因此在计算得到每一影响因素所对应的特征值时,通过对特征值进行排序,然后进行选择以得到重构影响因素。在对影响因素进行选择时,根据实际的特征值的大小从大到小进行选择,同时将所得到的特征值求和得到求和结果,进而根据所得到的求和值选择相应的影响因素作为重构影响因素。When calculating the eigenvalue corresponding to each influencing factor in the second data set, it is to determine the influence degree of each influencing factor on the actual process in the SMT process, and the larger the eigenvalue, the greater the influence, so the calculated For the eigenvalues corresponding to each influencing factor, the eigenvalues are sorted and then selected to obtain the reconstructed influencing factors. When selecting the influencing factors, select according to the size of the actual eigenvalues from large to small, and at the same time sum the obtained eigenvalues to obtain the summation result, and then select the corresponding influencing factors according to the obtained summation value as Refactoring factors.
示例性的,在利用主成分分析对第二数据进行处理时,具体地处理过程如下:Exemplarily, when using principal component analysis to process the second data, the specific processing process is as follows:
在得到第二数据集Z={z 1,z 2,…,z p}时,对第二数据集进行中心化处理,以消除特征之间量纲不同带来的影响,而中心化公式如下所示: When the second data set Z={z 1 ,z 2 ,…,z p } is obtained, the second data set is centralized to eliminate the impact of the dimension difference between features, and the centralization formula is as follows Shown:
Figure PCTCN2022077781-appb-000003
Figure PCTCN2022077781-appb-000003
Figure PCTCN2022077781-appb-000004
Figure PCTCN2022077781-appb-000004
其中,X为进行第二数据集中心化之后的数据集,
Figure PCTCN2022077781-appb-000005
为数据集中所包含数据的平均值。
Among them, X is the data set after the centralization of the second data set,
Figure PCTCN2022077781-appb-000005
is the average of the data contained in the dataset.
在完成对第二数据集的中心化之后,计算中心化所得到的数据集X的协方差矩阵,而协方差矩阵的计算公式如下所示:After the centering of the second data set is completed, the covariance matrix of the data set X obtained by centering is calculated, and the calculation formula of the covariance matrix is as follows:
而在得到协方差矩阵之后对协方差矩阵进行特征值分解,并将所有特征值从大到小降序排列,记为{λ 12,…,λ p},对应的特征向量为{ω 12,…,ω p},其中λ为特征值,ω为特征向量,然后选择前d大的特征值所对应的特征向量进行映射,将p维数据映射为d维数据,具体地映射方式如式下所示: After the covariance matrix is obtained, the eigenvalue decomposition is performed on the covariance matrix, and all the eigenvalues are arranged in descending order from large to small, which is recorded as {λ 12 ,…,λ p }, and the corresponding eigenvector is {ω 12 ,…,ω p }, where λ is the eigenvalue, ω is the eigenvector, and then select the eigenvector corresponding to the first d largest eigenvalue for mapping, and map p-dimensional data to d-dimensional data, specifically The mapping method is as follows:
Figure PCTCN2022077781-appb-000006
Figure PCTCN2022077781-appb-000006
式中,
Figure PCTCN2022077781-appb-000007
表示映射之后的数据。
In the formula,
Figure PCTCN2022077781-appb-000007
Represents the data after mapping.
且参数d是通过信息占比确定的,且信息占比的计算方式如式下所示:And the parameter d is determined by the proportion of information, and the calculation method of the proportion of information is as follows:
Figure PCTCN2022077781-appb-000008
Figure PCTCN2022077781-appb-000008
其中,η为信息占比。Among them, η is the proportion of information.
步骤S103、加载待训练优化模型。Step S103, loading the optimization model to be trained.
在一实施例中,在构建待训练优化模型时,所构建的待训练优化模型为具有多输出的深度神经网络模型,而在待训练优化模型训练完成时,是对多输出深度神经网络隐藏层节点数及激活函数进行优化,最终得到多组优化后的多输出深度神经网络预测模型。In one embodiment, when constructing the optimized model to be trained, the constructed optimized model to be trained is a deep neural network model with multiple outputs, and when the optimized model to be trained is trained, it is the hidden layer of the deep neural network with multiple outputs The number of nodes and the activation function are optimized, and finally multiple groups of optimized multi-output deep neural network prediction models are obtained.
其中,深度神经网络是一种全连接神经网络,主要由输入层、隐藏层以及输出层构成,比传统的机器学习模型对海量数据有更好的拟合能力及预测效果。深度神经网络的隐藏层节点及激活函数往往是根据经验公式设定,而隐藏层节点以及激活函数的不同组合会影响深度神经网络的准确性。需要说明的是,在对于所加载的待训练优化模型,除了可以是基于深度神经网络所构建的多输出深度神经网络模型,还可以是基于其他结构的网络模型所构建。Among them, the deep neural network is a fully connected neural network, mainly composed of input layer, hidden layer and output layer, which has better fitting ability and prediction effect on massive data than traditional machine learning models. The hidden layer nodes and activation functions of deep neural networks are often set according to empirical formulas, and different combinations of hidden layer nodes and activation functions will affect the accuracy of deep neural networks. It should be noted that, for the loaded optimized model to be trained, in addition to the multi-output deep neural network model constructed based on the deep neural network, it may also be constructed based on network models of other structures.
实际上,待训练优化模型是用于对SMT印刷工艺中需要使用的相关的工艺参数进行预测和推荐的模型,同时还可以对所输入的不同的工艺参数的组合进行质量好坏的判断。对于所构建的待训练的优化模型是基于深度神经网络所得到的,在构建完成之后对模型的训练和优化,在多节点的待训练优化模型中选择若干较优的节点进行集成,以得到训练好的优化模型。In fact, the optimized model to be trained is a model used to predict and recommend relevant process parameters that need to be used in the SMT printing process, and can also judge the quality of the combination of different input process parameters. The optimized model to be trained is obtained based on the deep neural network. After the construction is completed, the model is trained and optimized, and several better nodes are selected in the multi-node optimized model to be trained for integration to obtain training. Good optimization model.
步骤S104、根据所述影响因素数据包对所述待训练优化模型进行训练,得到训练好的优化模型。Step S104: Train the optimization model to be trained according to the influencing factor data package to obtain a trained optimization model.
在加载了待训练优化模型之后,利用预先进行处理所得到的影响因素数据包对待训练优化模型进行训练和优化,最后在确定训练后的待训练优化模型收敛时对收敛的优化模型进行记录和存储,也就是在确定收敛时得到训练好的优化模型。After the optimized model to be trained is loaded, the optimized model to be trained is trained and optimized using the influencing factor data package obtained in advance, and finally the converged optimized model is recorded and stored when the optimized model to be trained is determined to converge , that is, the trained optimization model is obtained when the convergence is determined.
在一实施例中,在对待训练优化模型进行训练时是采用精英保留遗传算法来实现模型的训练和优化的。遗传算法是模仿自然界生物进化机制发展起来的随机全局搜索和优化方法,借鉴了达尔文的进化论和孟德尔的遗传学说。精英保留遗传算法对传统遗传算法进行了改进,把群体在进化过程中迄今出现的最好个体,即精英个体,不进行配对交叉而直接复制到下一代中,并将新一代群体中适应度值最小的个体淘汰掉,从而加快全局收敛以求得最佳解。In one embodiment, when the model to be trained and optimized is trained, an elite-retaining genetic algorithm is used to realize the training and optimization of the model. Genetic algorithm is a random global search and optimization method developed by imitating the biological evolution mechanism in nature, drawing on Darwin's theory of evolution and Mendel's genetic theory. The elite-retaining genetic algorithm improves the traditional genetic algorithm. The best individual that has appeared so far in the evolution process of the group, that is, the elite individual, is directly copied to the next generation without pairing and crossover, and the fitness value of the new generation group is The smallest individual is eliminated, so as to speed up the global convergence to find the best solution.
参照图5,图5为本申请一实施例提供的对待训练优化模型进行训练的步骤的流程示意图,其中该步骤包括步骤S501至步骤S502。Referring to FIG. 5, FIG. 5 is a schematic flowchart of the steps of training the optimization model to be trained according to an embodiment of the present application, wherein the steps include steps S501 to S502.
步骤S501、初始化所述待训练优化模型,并得到对应的初始化参数,以及根据预设的编码方式对所述影响因素数据包进行编码,得到编码数据;Step S501, initialize the optimization model to be trained, obtain corresponding initialization parameters, and encode the influencing factor data packet according to a preset encoding method to obtain encoded data;
步骤S502、根据所述编码数据对所述初始化后的待训练优化模型进行若干次迭代训练,以对所述初始化参数进行调节,并在确定收敛时得到训练好的优化模型。Step S502: Perform several iterations of training on the initialized optimization model to be trained according to the encoded data, so as to adjust the initialization parameters, and obtain a trained optimization model when convergence is determined.
在对待训练优化模型进行训练和优化时,是使用精英保留遗传算法的方式对待训练优化模型中的模型参数或者参数信息进行优化处理,进而在收敛时完成对模型的训练和优化,以 得到训练优化后的优化模型,进而以供使用者进行使用。When training and optimizing the model to be trained and optimized, the model parameters or parameter information in the model to be trained and optimized are optimized by using the elite-reserving genetic algorithm, and then the training and optimization of the model are completed at the time of convergence to obtain the training optimization The final optimized model is then available for users to use.
而在进行模型的优化和训练时,首先对待训练优化模型进行初始化,也就是对模型的相关参数和参数信息进行初始化,同时在初始化的时候记录相应的初始化参数,其中初始化参数也就是模型的相关参数信息,而在进行模型初始化的时候,还会根据预设的编码方式对所得到的影响因素数据包中所包含的数据进行编码处理,以得到对应的编码数据,接着利用所得到的编码数据对待训练的优化模型进行训练和优化,以实现对初始化参数进行调节,最后在确定收敛时得到训练好的优化模型并进行存储和记录。When optimizing and training the model, first initialize the optimized model to be trained, that is, initialize the relevant parameters and parameter information of the model, and at the same time record the corresponding initialization parameters during initialization, where the initialization parameters are also related to the model. Parameter information, and when the model is initialized, the data contained in the obtained influencing factor data package will be encoded according to the preset encoding method to obtain the corresponding encoded data, and then the obtained encoded data will be used The optimization model to be trained is trained and optimized to adjust the initialization parameters, and finally the trained optimization model is obtained and stored and recorded when the convergence is determined.
在一实施例中,在对待训练的优化模型进行训练时是利用精英保留遗传算法来实现的,因此在进行实际的训练时进行了若干次的迭代训练,而每一次的训练都会模型的相关参数进行调节和优化。In one embodiment, when the optimization model to be trained is trained, it is implemented by using the elite-retaining genetic algorithm, so several iterations of training are performed during the actual training, and each training will update the relevant parameters of the model Adjust and optimize.
参照图6,图6为本申请一实施例提供的进行迭代训练的步骤的流程示意图。其中该步骤包括步骤S601至步骤S603。Referring to FIG. 6 , FIG. 6 is a schematic flowchart of the steps of iterative training provided by an embodiment of the present application. Wherein this step includes step S601 to step S603.
步骤S601、将所述编码数据输入至所述初始化后的待训练优化模型中,以对初始化后的待训练优化模型进行第一次训练,并得到第一次训练后的第一中间参数;Step S601, input the encoded data into the initialized optimization model to be trained, so as to perform the first training on the initialized optimization model to be trained, and obtain the first intermediate parameters after the first training;
步骤S602、基于所述第一中间参数对所述待训练优化模型进行调节,得到第一中间待训练优化模型;Step S602, adjusting the optimization model to be trained based on the first intermediate parameters to obtain a first intermediate optimization model to be trained;
步骤S603、将所述编码数据输入至所述第一中间待训练优化模型中,以对所述待训练优化模型进行第二次训练,并得到第二次训练后的第二中间参数,以此类推以对所述待训练优化模型进行若干次训练。Step S603, input the encoded data into the first intermediate optimization model to be trained, so as to perform a second training on the optimization model to be trained, and obtain second intermediate parameters after the second training, so as to By analogy, the optimization model to be trained is trained several times.
在进行迭代训练时,是根据影响因素数据包对待训练优化模型的初始参数进行调节和优化,具体地,在得到影响因素数据包所对应的编码数据之后,将编码数据输入至初始化后的待训练优化模型中,以对初始化后的优化模型进行第一次训练,同时在完成第一次训练时得到第一次训练后的第一中间参数,并且根据第一中间参数对待训练优化模型进行调节,得到第一中间待训练优化模型,接着再将编码数据输入至第一中间待训练优化模型中,以对待训练优化模型进行第二次训练,以此类推以完成对待训练优化模型进行若干次训练。When performing iterative training, the initial parameters of the model to be trained and optimized are adjusted and optimized according to the influencing factor data package. Specifically, after obtaining the encoded data corresponding to the influencing factor data package, the encoded data is input into the initialized model to be trained In the optimization model, the optimized model after initialization is trained for the first time, and the first intermediate parameter after the first training is obtained when the first training is completed, and the optimized model to be trained is adjusted according to the first intermediate parameter, Obtain the first intermediate optimized model to be trained, and then input the coded data into the first intermediate optimized model to be trained, so that the optimized model to be trained is trained for the second time, and so on to complete several trainings of the optimized model to be trained.
在对待训练优化模型进行训练时,在收敛时确定训练完成,而收敛的判断条件可以有多种,比如设定若干次训练次数,再比如模型中的某一或者某些参数满足预设条件。When training the optimized model to be trained, it is determined that the training is completed when it converges, and there are many conditions for judging the convergence, such as setting a number of training times, and for example, one or some parameters in the model meet the preset conditions.
而在完成对模型的训练时,将会记录所得到的训练完成时的优化模型,具体地,参照图7,在得到并存储训练好的优化模型时包括:When the training of the model is completed, the optimized model obtained when the training is completed will be recorded. Specifically, referring to FIG. 7, when the optimized model obtained and stored is obtained and stored, it includes:
步骤S701、在确定收敛时对每次训练所得到的目标函数值进行排序,以根据从大到小的选择方式在排序后的目标函数值中选择N个目标函数值,其中将所述编码数据输入至所述待 训练优化模型中得到所述目标函数值,且每一次训练均输出一目标函数值;Step S701, when determining the convergence, sort the objective function values obtained in each training, so as to select N objective function values from the sorted objective function values according to the selection method from large to small, wherein the encoded data input into the optimization model to be trained to obtain the objective function value, and output an objective function value for each training;
步骤S702、确定所述N个目标函数值所对应的N组参数信息,其中参数信息包括隐藏层网络节点数以及对应的激活函数;Step S702, determining N sets of parameter information corresponding to the N objective function values, wherein the parameter information includes the number of hidden layer network nodes and corresponding activation functions;
步骤S703、将所述N组参数信息输入至所述待训练优化模型中的N个子模型中,并将所述N个子模型进行集成,得到训练好的优化模型,以将训练好的优化模型进行存储。Step S703, input the N sets of parameter information into the N sub-models in the optimization model to be trained, and integrate the N sub-models to obtain a trained optimization model, so as to carry out the trained optimization model storage.
在得到训练好的优化模型时,所得到的优化模型是一个具有多输出的神经网络模型,因此在确定收敛时,对每次训练时所得到的目标函数值进行排序,然后根据从大到小的方式在排序后的目标函数值中选择N个目标函数值,进而确定所选择的N个目标函数值所对应的参数信息,其中参数信息包括隐藏层网络节点数以及隐藏层所对应的激活函数,最后根据所得到的N组参数信息确定优化模型中的N个子模型,并对所确定的N个子模型进行集成以得到训练好的优化模型。When the trained optimization model is obtained, the obtained optimization model is a neural network model with multiple outputs, so when the convergence is determined, the objective function values obtained during each training are sorted, and then according to The method selects N objective function values from the sorted objective function values, and then determines the parameter information corresponding to the selected N objective function values, where the parameter information includes the number of hidden layer network nodes and the activation function corresponding to the hidden layer , and finally determine N sub-models in the optimization model according to the obtained N sets of parameter information, and integrate the determined N sub-models to obtain a trained optimization model.
示例性的,在进行初始化时,设置精英保留遗传算法编码方式为RI(实数编码),同时也可以使用二进制编码以及格雷编码,根据实际的使用需求所确定,同时设置种群规模为200,优化训练的最大迭代次数为300,变异概率为0.05,交叉概率为0.95,隐藏层1和2的节点数目寻优区间为[10,128],激活函数寻优空间为[relu,sigmoid,tanh]。Exemplarily, when initializing, set the encoding method of the elite reserved genetic algorithm to RI (real number encoding), and also use binary encoding and gray encoding, which are determined according to actual usage requirements, and set the population size to 200 at the same time to optimize training The maximum number of iterations is 300, the mutation probability is 0.05, and the crossover probability is 0.95. The number of nodes in hidden layers 1 and 2 is optimized in the range [10,128], and the activation function optimization space is [relu, sigmoid, tanh].
在一实施例中,在上述的目标函数的获取时,首先需要确定目标函数的计算方式,进而根据相关数据信息计算得到每次训练时所对应的目标函数值。其中目标函数可以如下:In an embodiment, when acquiring the above-mentioned objective function, it is first necessary to determine the calculation method of the objective function, and then calculate the corresponding objective function value for each training according to relevant data information. The objective function can be as follows:
Figure PCTCN2022077781-appb-000009
Figure PCTCN2022077781-appb-000009
其中,y area,i为锡膏体积的真实值、y vol,i为锡膏的面积真实值,
Figure PCTCN2022077781-appb-000010
为体积的样本预测值,
Figure PCTCN2022077781-appb-000011
为面积的样本预测值,m为样本数量。
Among them, y area, i is the actual value of the solder paste volume, y vol, i is the actual value of the area of the solder paste,
Figure PCTCN2022077781-appb-000010
is the sample predicted value of the volume,
Figure PCTCN2022077781-appb-000011
is the sample prediction value of the area, and m is the sample size.
例如,在完成300此的迭代训练时,会得到300个目标函数值,同时每一个目标函数值会对应一个参数信息,然后对所得到的300个目标函数值按照大小进行排序,进而在排序完成之后从大到小选择5个目标函数值,以及确定该5个目标函数值所对应的隐藏层网络几点书以及对应的激活函数,其中目标函数值的数量选择根据实际的需求所确定,在此设定N为5。而在完成排序和选择时,所得到的深度神经网络结构以及目标函数值可以如下表4所示:For example, when 300 iterative training is completed, 300 objective function values will be obtained, and each objective function value will correspond to a parameter information, and then the obtained 300 objective function values will be sorted according to the size, and then the sorting is completed Then select 5 objective function values from large to small, and determine the hidden layer network points corresponding to the 5 objective function values and the corresponding activation functions. The number of objective function values is determined according to actual needs. This sets N to be 5. When the sorting and selection are completed, the obtained deep neural network structure and objective function value can be shown in Table 4 below:
表4Table 4
隐藏层1节点数量Number of hidden layer 1 nodes 隐藏层2节点数量Number of hidden layer 2 nodes 激活函数1Activation function 1 激活函数2Activation function 2 预测误差forecast error
3636 112112 22 00 0.13080.1308
117117 6868 22 00 0.13250.1325
3535 126126 22 00 0.14020.1402
6565 110110 22 00 0.14420.1442
3636 109109 22 00 0.14650.1465
其中,激活函数中0代表的激活函数为relu,1代表sigmoid,2代表tanh,预测误差为平均相对百分比误差。在得到如上表所示的参数信息之后,将会根据该参数信息对模型进行调节,以得到5个深度神经网络模型,然后对所得到的5个深度神经网络模型进行集成,以得到最终的优化模型。而在进行模型集成时,对所得到的5个神经网络模型进行stacking堆栈集成,进而得到集成深度神经网络的优化模型。Among them, the activation function represented by 0 in the activation function is relu, 1 represents sigmoid, 2 represents tanh, and the prediction error is the average relative percentage error. After obtaining the parameter information shown in the above table, the model will be adjusted according to the parameter information to obtain 5 deep neural network models, and then the obtained 5 deep neural network models will be integrated to obtain the final optimization Model. When performing model integration, stacking stack integration is performed on the obtained five neural network models, and then an optimized model of the integrated deep neural network is obtained.
在一实施例中,参照图8,图8为本申请一实施例提供的工艺参数预测和推荐的步骤的流程示意图。其中该步骤包括:步骤S801至步骤S804。In an embodiment, refer to FIG. 8 . FIG. 8 is a schematic flowchart of the steps of predicting and recommending process parameters provided by an embodiment of the present application. Wherein this step includes: step S801 to step S804.
步骤S801、当接收到参数优化指令时,加载训练好的优化模型,并接收输入的待分析工艺参数;Step S801, when receiving the parameter optimization instruction, load the trained optimization model, and receive the input process parameters to be analyzed;
步骤S802、将所述待分析工艺参数输入至所述优化模型中,计算得到基于所述待分析工艺参数所得到的函数值;Step S802, input the process parameters to be analyzed into the optimization model, and calculate the function value based on the process parameters to be analyzed;
步骤S803、当所述函数值大于预设阈值时,根据所述优化模型输出预测工艺参数。Step S803, when the function value is greater than a preset threshold, output the predicted process parameters according to the optimization model.
在模型训练和优化完成之后,所存储的优化模型可以用来对参数质量进行质量预测以及工艺参数的推荐。After the model training and optimization are completed, the stored optimized model can be used for quality prediction of parameter quality and recommendation of process parameters.
在接收到参数优化指令时,加载训练好的优化模型,同时接收所输入的待分析工艺参数,然后将所接收到的待分析工艺参数输入到所加载的优化模型中,以计算得到该待分析工艺参数所对应的函数值,进而将所得到的函数值与预设的阈值进行对比,以确定该待分析工艺参数是否合格,其中在确定所得到的函数值大于所设定的预设阈值时,说明此时所接收到的待分析工艺参数不是合格的工艺参数组合,此时将会根据所加载的优化模型输出预测工艺参数,以得到更加合理的工艺参数组合,而在确定所得到的函数值小于或者等于预设阈值时确定工艺参数的设定合格,此时则会输出合格的提示信息。When receiving a parameter optimization instruction, load the trained optimization model, and receive the input process parameters to be analyzed, and then input the received process parameters to be analyzed into the loaded optimization model to calculate the to-be-analyzed The function value corresponding to the process parameter, and then compare the obtained function value with the preset threshold value to determine whether the process parameter to be analyzed is qualified, wherein when it is determined that the obtained function value is greater than the set preset threshold value , indicating that the process parameters to be analyzed received at this time are not qualified process parameter combinations. At this time, the predicted process parameters will be output according to the loaded optimization model to obtain a more reasonable process parameter combination. After determining the obtained function When the value is less than or equal to the preset threshold, it is determined that the setting of the process parameter is qualified, and at this time, a qualified prompt message will be output.
在一实施例中,在模型完成训练之后被使用的时候,在模型中设定一个新的目标函数,其中新的目标函数为:In one embodiment, when the model is used after training, a new objective function is set in the model, where the new objective function is:
g(·)=|vol-100|+|area-100|g(·)=|vol-100|+|area-100|
其中,vol表示体积,area代表面积,100为完美体积及面积,|·|代表绝对值。Among them, vol means volume, area means area, 100 means perfect volume and area, and |·| means absolute value.
在接收到输入的待分析工艺参数之后,将会输出模型预测的体积和面积,然后带入到该 目标函数中,以确定当前所接收到的待分析工艺参数是否合格。在确定合格时直接输出合格的指令信息,而在确定不合格时将会进行工艺参数的预测。而在进行工艺参数的预测和推荐时,设置待优化工艺参数寻优区间,通过精英保留遗传算法对目标函数进行优化,求取最优解,得到推荐工艺参数组合进行推荐和反馈。After receiving the input process parameters to be analyzed, the volume and area predicted by the model will be output, and then brought into the objective function to determine whether the currently received process parameters to be analyzed are qualified. When it is determined to be qualified, it will directly output qualified instruction information, and when it is determined to be unqualified, it will predict the process parameters. When predicting and recommending process parameters, set the optimization interval of the process parameters to be optimized, optimize the objective function through the elite-retaining genetic algorithm, obtain the optimal solution, and obtain the recommended process parameter combination for recommendation and feedback.
示例性的,在进行工艺参数组合的推荐时,进行推荐的若干组推荐工艺参数组合如下表5所示:Exemplarily, when recommending a combination of process parameters, several sets of recommended process parameter combinations for recommendation are shown in Table 5 below:
表5table 5
刮刀压力Blade pressure 刮刀速度Squeegee speed 脱模速度Release speed 脱模距离Release distance 自动清洗计数Automatic cleaning count 清洗速度cleaning speed 目标函数objective function
5.345.34 63.2163.21 2.402.40 1.131.13 1212 43.0443.04 0.1930.193
11.8511.85 50.7050.70 2.932.93 0.130.13 3030 36.6836.68 2.5942.594
8.488.48 47.6447.64 2.952.95 0.130.13 2727 33.5533.55 3.0023.002
由上表可见,根据目标函数值可以确定,在工艺参数刮刀压力为5.34、刮刀速度为63.21、脱模速度为2.40、自动清洗速度为12以及清洗速度为43.04时,具有更好的效果,也就是该组合所对应的工艺参数为此时的最优工艺参数组合。It can be seen from the above table that according to the value of the objective function, it can be determined that when the process parameters scraper pressure is 5.34, scraper speed is 63.21, demolding speed is 2.40, automatic cleaning speed is 12 and cleaning speed is 43.04, it has a better effect and also That is, the process parameter corresponding to the combination is the optimal process parameter combination at this time.
在上述描述的SMT印刷参数优化模型的训练方法、设备以及存储介质中,在进行训练时,首先对进行训练的训练样本数据进行影响因素的重构,具体地,通过特征交叉建立影响因素之间的关系,然后对构造的特征进行主成分分析,完成影响因素重构,以深度挖掘了影响因素,然后对不同的深度神经网络模型进行集成,以得到最终的优化模型。解决了单一固定模型预测效果不佳的问题,大幅提高了模型的精度,最后通过精英保留遗传算法搜索出使得印刷质量最佳的工艺参数组合,具有更好的适用性。In the training method, equipment and storage medium of the SMT printing parameter optimization model described above, when training, the training sample data for training is firstly reconstructed to the influencing factors, specifically, the relationship between the influencing factors is established through feature crossover. Then, principal component analysis is performed on the constructed features to complete the reconstruction of influencing factors, and the influencing factors are deeply excavated, and then different deep neural network models are integrated to obtain the final optimization model. It solves the problem of poor prediction effect of a single fixed model, greatly improves the accuracy of the model, and finally searches out the combination of process parameters for the best printing quality through the elite-retaining genetic algorithm, which has better applicability.
参照图9,图9为本申请一实施例提供的一种计算机设备的结构示意性框图。Referring to FIG. 9 , FIG. 9 is a schematic structural block diagram of a computer device provided by an embodiment of the present application.
示例性的,该设备可以为平板电脑、笔记本或者台式机等。Exemplarily, the device may be a tablet computer, a notebook, or a desktop computer.
该设备还包括处理器、存储器,所述存储器用于存储计算机程序。The device also includes a processor, memory for storing a computer program.
所述处理器,用于执行所述计算机程序并在执行所述计算机程序时实现本申请实施例提供的任一项所述的SMT印刷参数优化模型的训练方法。The processor is configured to execute the computer program and implement any one of the SMT printing parameter optimization model training methods provided in the embodiments of the present application when executing the computer program.
应当理解的是,处理器可以是中央处理单元(Central Processing Unit,CPU),该处理器还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,简称:DSP)、专用集成电路(Application Specific Integrated Circuit,简称:ASIC)、现场可编程门阵列(Field-Programmable Gate Array,简称:FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。其中,通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。It should be understood that the processor may be a central processing unit (Central Processing Unit, CPU), and the processor may also be other general-purpose processors, digital signal processors (Digital Signal Processor, DSP for short), application-specific integrated circuits (Application Specific Integrated Circuit, referred to as: ASIC), Field-Programmable Gate Array (Field-Programmable Gate Array, referred to as: FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. Wherein, the general-purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
本申请的实施例中还提供一种计算机可读存储介质,所述计算机可读存储介质存储有计 算机程序,所述计算机程序被处理器执行时,使所述处理器实现本申请实施例提供的任一项所述的SMT印刷参数优化模型的训练方法。Embodiments of the present application also provide a computer-readable storage medium, where the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the processor implements the The training method of the SMT printing parameter optimization model described in any one.
本领域普通技术人员可以理解,上文中所公开方法中的全部或某些步骤、系统、装置中的功能模块/单元可以被实施为软件、固件、硬件及其适当的组合。在硬件实施例中,在以上描述中提及的功能模块/单元之间的划分不一定对应于物理组件的划分;例如,一个物理组件可以具有多个功能,或者一个功能或步骤可以由若干物理组件合作执行。某些物理组件或所有物理组件可以被实施为由处理器,如中央处理器、数字信号处理器或微处理器执行的软件,或者被实施为硬件,或者被实施为集成电路,如专用集成电路。这样的软件可以分布在计算机可读存储介质上,计算机可读存储介质可以包括计算机可读存储介质(或非暂时性介质)和通信介质(或暂时性介质)。Those of ordinary skill in the art can understand that all or some of the steps in the methods disclosed above, the functional modules/units in the system, and the device can be implemented as software, firmware, hardware, and an appropriate combination thereof. In hardware embodiments, the division between functional modules/units mentioned in the above description does not necessarily correspond to the division of physical components; for example, one physical component may have multiple functions, or one function or step may be composed of several physical components. Components cooperate to execute. Some or all of the physical components may be implemented as software executed by a processor, such as a central processing unit, digital signal processor, or microprocessor, or as hardware, or as an integrated circuit, such as an application-specific integrated circuit . Such software may be distributed on computer-readable storage media, which may include computer-readable storage media (or non-transitory media) and communication media (or transitory media).
如本领域普通技术人员公知的,术语计算机可读存储介质包括在用于存储信息(诸如计算机可读指令、数据结构、程序模块或其他数据)的任何方法或技术中实施的易失性和非易失性、可移除和不可移除介质。计算机可读存储介质包括但不限于RAM、ROM、EEPROM、闪存或其他存储器技术、CD-ROM、数字多功能盘(DVD)或其他光盘存储、磁盒、磁带、磁盘存储或其他磁存储装置、或者可以用于存储期望的信息并且可以被计算机访问的任何其他的介质。此外,本领域普通技术人员公知的是,通信介质通常包含计算机可读指令、数据结构、程序模块或者诸如载波或其他传输机制之类的调制数据信号中的其他数据,并且可包括任何信息递送介质。As known to those of ordinary skill in the art, the term computer-readable storage medium includes both volatile and non-volatile media implemented in any method or technology for storage of information, such as computer-readable instructions, data structures, program modules, or other data. Volatile, removable and non-removable media. Computer-readable storage media include, but are not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disk (DVD) or other optical disk storage, magnetic cartridges, tape, magnetic disk storage or other magnetic storage devices, Or any other medium that can be used to store desired information and that can be accessed by a computer. In addition, as is well known to those of ordinary skill in the art, communication media typically embodies computer readable instructions, data structures, program modules, or other data in a modulated data signal such as a carrier wave or other transport mechanism, and may include any information delivery media .
示例性的,所述计算机可读存储介质可以是前述实施例所述的电子设备的内部存储单元,例如所述电子设备的硬盘或内存。所述计算机可读存储介质也可以是所述电子设备的外部存储设备,例如所述电子设备上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。Exemplarily, the computer-readable storage medium may be an internal storage unit of the electronic device described in the foregoing embodiments, such as a hard disk or a memory of the electronic device. The computer-readable storage medium can also be an external storage device of the electronic device, such as a plug-in hard disk equipped on the electronic device, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD ) card, flash memory card (Flash Card), etc.
前述各实施例提供的电子设备和计算机可读存储介质,通过在用户输入信息时,在显示屏上不同的显示区域显示至少两个虚拟键盘,使得可以通过至少两个虚拟键盘输入信息;提高恶意软件通过监听传感器的状态推测输入信息的难度,增强了信息输入的安全性。The electronic device and the computer-readable storage medium provided by the foregoing embodiments display at least two virtual keyboards in different display areas on the display screen when the user inputs information, so that information can be input through at least two virtual keyboards; The software infers the difficulty of input information by monitoring the state of the sensor, which enhances the security of information input.
以上所述,仅为本申请的具体实施例,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到各种等效的修改或替换,这些修改或替换都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以权利要求的保护范围为准。The above is only a specific embodiment of the application, but the scope of protection of the application is not limited thereto. Any person familiar with the technical field can easily think of various equivalents within the scope of the technology disclosed in the application. Modifications or replacements, these modifications or replacements shall be covered within the scope of protection of this application. Therefore, the protection scope of the present application should be based on the protection scope of the claims.

Claims (10)

  1. 一种SMT印刷参数优化模型的训练方法,所述方法包括以下步骤:A kind of training method of SMT printing parameter optimization model, described method comprises the following steps:
    接收初始生产数据;receive initial production data;
    根据所述初始生产数据进行影响因素重构,得到影响因素数据包;Reconstructing the influencing factors according to the initial production data to obtain the influencing factor data package;
    加载待训练优化模型;Load the optimized model to be trained;
    根据所述影响因素数据包对所述待训练优化模型进行训练,得到训练好的优化模型。The optimization model to be trained is trained according to the influencing factor data package to obtain a trained optimization model.
  2. 根据权利要求1所述的方法,其中,所述根据所述初始生产数据进行影响因素重构,得到影响因素数据包,包括:The method according to claim 1, wherein said reconstruction of influencing factors according to said initial production data to obtain an influencing factor data package includes:
    对所述初始生产数据进行过滤筛选,得到不包含于固定属性的原始数据集,其中所述固定属性为影响因素所对应的数据为固定值;Filtering the initial production data to obtain an original data set not included in the fixed attribute, wherein the data corresponding to the fixed attribute being an influencing factor is a fixed value;
    利用特征交叉以及主成分分析对所述原始数据集进行影响因素重构,得到重构后的影响因素数据包。Reconstructing the influencing factors of the original data set by using feature crossover and principal component analysis to obtain a reconstructed influencing factor data package.
  3. 根据权利要求2所述的方法,其中,所述利用特征交叉以及主成分分析对所述原始数据集进行影响因素重构,得到重构后的影响因素数据包,包括:The method according to claim 2, wherein said utilizing feature crossover and principal component analysis to reconstruct the influencing factors of said original data set to obtain a reconstructed influencing factor data package, comprising:
    对所述原始数据集进行数据规约,得到规约后的原始数据集;performing data reduction on the original data set to obtain a reduced original data set;
    确定所述参数待训练优化模型所对应的待优化工艺参数,并将所述原始数据集中所包含的待优化工艺参数剔除,得到第一数据集;Determining the process parameters to be optimized corresponding to the parameter to be trained optimization model, and removing the process parameters to be optimized contained in the original data set to obtain the first data set;
    对所述第一数据集所包含的影响因素进行特征交叉相乘,得到第二数据集;performing feature cross multiplication on the influencing factors contained in the first data set to obtain a second data set;
    利用主成分分析对所述第二数据集中所包含的每一影响因素进行分析,得到影响因素数据包。Each influencing factor included in the second data set is analyzed by principal component analysis to obtain an influencing factor data package.
  4. 根据权利要求3所述的方法,其中,所述利用主成分分析对所述第二数据集中所包含的每一影响因素进行分析,得到影响因素数据包,包括:The method according to claim 3, wherein said analysis of each influencing factor contained in said second data set by using principal component analysis to obtain an influencing factor data package includes:
    利用主成分分析计算得到所述第二数据集中每一影响因素所对应的特征值,并对所述特征值进行排序;Using principal component analysis to calculate the eigenvalues corresponding to each influencing factor in the second data set, and sorting the eigenvalues;
    根据从大到小的选择方式在排序后的所述特征值中选择若干特征值相加,并在相加得到的数值首次大于所设定的预设阈值时,确定进行相加的特征值所对应的特征为重构影响因素;According to the selection method from large to small, select a number of eigenvalues from the sorted eigenvalues to add, and when the added value is greater than the set preset threshold for the first time, determine the eigenvalues to be added. The corresponding features are reconstruction influencing factors;
    将所述重构影响因素与所述待优化工艺参数进行合并,得到影响因素数据包。Merge the reconstructed influencing factors with the process parameters to be optimized to obtain an influencing factor data package.
  5. 根据权利要求1至4中任一项所述的方法,其中,所述根据所述影响因素数据包对所述待训练优化模型进行训练,得到训练好的优化模型,包括:The method according to any one of claims 1 to 4, wherein the training of the optimization model to be trained according to the influencing factor data package to obtain a trained optimization model includes:
    初始化所述待训练优化模型,并得到对应的初始化参数,以及根据预设的编码方式对所述影响因素数据包进行编码,得到编码数据;Initializing the optimization model to be trained, obtaining corresponding initialization parameters, and encoding the influencing factor data packet according to a preset encoding method to obtain encoded data;
    根据所述编码数据对所述初始化后的待训练优化模型进行若干次迭代训练,以对所述初始化参数进行调节,并在确定收敛时得到训练好的优化模型。Perform several iterations of training on the initialized optimization model to be trained according to the encoded data, so as to adjust the initialization parameters, and obtain a trained optimization model when convergence is determined.
  6. 根据权利要求5所述的方法,其中,所述根据所述编码数据对所述初始化后的待训练优化模型进行若干次迭代训练,包括:The method according to claim 5, wherein said performing several iterations of training on said initialized optimization model to be trained according to said coded data comprises:
    将所述编码数据输入至所述初始化后的待训练优化模型中,以对初始化后的待训练优化模型进行第一次训练,并得到第一次训练后的第一中间参数;Inputting the encoded data into the initialized optimization model to be trained, so as to perform the first training on the initialized optimization model to be trained, and obtain the first intermediate parameters after the first training;
    基于所述第一中间参数对所述待训练优化模型进行调节,得到第一中间待训练优化模型;adjusting the optimization model to be trained based on the first intermediate parameters to obtain a first intermediate optimization model to be trained;
    将所述编码数据输入至所述第一中间待训练优化模型中,以对所述第一中间待训练优化模型进行第二次训练,并得到第二次训练后的第二中间参数,以此类推以对所述待训练优化模型进行若干次训练。inputting the encoded data into the first intermediate optimization model to be trained, so as to perform a second training on the first intermediate optimization model to be trained, and obtain second intermediate parameters after the second training, thereby By analogy, the optimization model to be trained is trained several times.
  7. 根据权利要求6所述的方法,其中,所述在确定收敛时得到训练好的优化模型,包括:The method according to claim 6, wherein said obtaining a trained optimization model when determining convergence comprises:
    在确定收敛时对每次训练所得到的目标函数值进行排序,以根据从大到小的选择方式在排序后的目标函数值中选择N个目标函数值,其中将所述编码数据输入至所述待训练优化模型中得到所述目标函数值,且每一次训练均输出一目标函数值;When the convergence is determined, the objective function values obtained by each training are sorted, so as to select N objective function values from the sorted objective function values according to the selection mode from large to small, wherein the encoded data is input to the The objective function value is obtained in the optimized model to be trained, and an objective function value is output for each training;
    确定所述N个目标函数值所对应的N组参数信息,其中参数信息包括隐藏层网络节点数以及对应的激活函数;Determining N sets of parameter information corresponding to the N objective function values, wherein the parameter information includes the number of hidden layer network nodes and the corresponding activation function;
    将所述N组参数信息输入至所述待训练优化模型中的N个子模型中,并将所述N个子模型进行集成,得到训练好的优化模型,其中所述优化模型为基于深度神经网络所构建的多输出深度神经网络模型。The N sets of parameter information are input into the N sub-models in the optimization model to be trained, and the N sub-models are integrated to obtain a trained optimization model, wherein the optimization model is based on a deep neural network. Constructed multi-output deep neural network model.
  8. 根据权利要求1至7中任一项所述的方法,其中,在所述根据所述影响因素数据包对所述待训练优化模型进行训练,得到训练好的优化模型之后,还包括:The method according to any one of claims 1 to 7, wherein, after said training the optimization model to be trained according to the influencing factor data package, and obtaining the trained optimization model, further comprising:
    当接收到参数优化指令时,加载训练好的优化模型,并接收输入的待分析工艺参数;When the parameter optimization instruction is received, the trained optimization model is loaded, and the input process parameters to be analyzed are received;
    将所述待分析工艺参数输入至所述优化模型中,计算得到基于所述待分析工艺参数所得到的函数值;inputting the process parameter to be analyzed into the optimization model, and calculating a function value based on the process parameter to be analyzed;
    当所述函数值大于预设阈值时,根据所述优化模型输出预测工艺参数。When the function value is greater than a preset threshold, the predicted process parameters are output according to the optimization model.
  9. 一种计算机设备,包括存储器以及处理器;A computer device comprising a memory and a processor;
    所述存储器用于存储计算机程序;The memory is used to store computer programs;
    所述处理器,用于执行所述计算机程序并在执行所述计算机程序时实现如权利要求1至8中任一项所述的SMT印刷参数优化模型的训练方法的步骤。The processor is configured to execute the computer program and realize the steps of the method for training the SMT printing parameter optimization model according to any one of claims 1 to 8 when the computer program is executed.
  10. 一种存储介质,用于计算机可读存储,所述存储介质存储有一个或者多个程序,所述一个或者多个程序可被一个或者多个处理器执行,以实现权利要求1至8中任一项所述的SMT印刷参数优化模型的训练方法的步骤。A storage medium for computer-readable storage, the storage medium stores one or more programs, and the one or more programs can be executed by one or more processors to implement any one of claims 1 to 8 A step in the training method of the SMT printing parameter optimization model.
PCT/CN2022/077781 2021-06-24 2022-02-24 Method for training smt printing parameter optimization model, device, and storage medium WO2022267509A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110715871.1 2021-06-24
CN202110715871.1A CN115526093A (en) 2021-06-24 2021-06-24 Training method, equipment and storage medium for SMT printing parameter optimization model

Publications (1)

Publication Number Publication Date
WO2022267509A1 true WO2022267509A1 (en) 2022-12-29

Family

ID=84545060

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/077781 WO2022267509A1 (en) 2021-06-24 2022-02-24 Method for training smt printing parameter optimization model, device, and storage medium

Country Status (2)

Country Link
CN (1) CN115526093A (en)
WO (1) WO2022267509A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116029356A (en) * 2023-03-24 2023-04-28 杭州景业智能科技股份有限公司 Tool monitoring model training method, tool state monitoring method and related devices
CN116713709A (en) * 2023-05-29 2023-09-08 苏州索力伊智能科技有限公司 Control system and method for automatic connector assembly equipment
CN117557082A (en) * 2023-11-16 2024-02-13 深圳市航盛电子股份有限公司 Process processing method, device, equipment and storage medium for electronic components

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117312802B (en) * 2023-11-29 2024-03-22 安徽大学 SMT production line key process parameter optimization method based on man-machine fusion and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109597968A (en) * 2018-12-29 2019-04-09 西安电子科技大学 Paste solder printing Performance Influence Factor analysis method based on SMT big data
CN110427593A (en) * 2018-12-19 2019-11-08 西安电子科技大学 SMT printing parameter optimization method based on industrial big data
CN110543616A (en) * 2018-12-26 2019-12-06 西安电子科技大学 SMT solder paste printing volume prediction method based on industrial big data
CN110781174A (en) * 2019-10-15 2020-02-11 支付宝(杭州)信息技术有限公司 Feature engineering modeling method and system using pca and feature intersection
US20210149363A1 (en) * 2019-11-14 2021-05-20 Hongfujin Precision Electronics(Tianjin)Co.,Ltd. Parameters suggestion system of solder paste screen printer including method, device employing method, and non-transitory storage

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110427593A (en) * 2018-12-19 2019-11-08 西安电子科技大学 SMT printing parameter optimization method based on industrial big data
CN110543616A (en) * 2018-12-26 2019-12-06 西安电子科技大学 SMT solder paste printing volume prediction method based on industrial big data
CN109597968A (en) * 2018-12-29 2019-04-09 西安电子科技大学 Paste solder printing Performance Influence Factor analysis method based on SMT big data
CN110781174A (en) * 2019-10-15 2020-02-11 支付宝(杭州)信息技术有限公司 Feature engineering modeling method and system using pca and feature intersection
US20210149363A1 (en) * 2019-11-14 2021-05-20 Hongfujin Precision Electronics(Tianjin)Co.,Ltd. Parameters suggestion system of solder paste screen printer including method, device employing method, and non-transitory storage

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116029356A (en) * 2023-03-24 2023-04-28 杭州景业智能科技股份有限公司 Tool monitoring model training method, tool state monitoring method and related devices
CN116029356B (en) * 2023-03-24 2023-06-13 杭州景业智能科技股份有限公司 Tool monitoring model training method, tool state monitoring method and related devices
CN116713709A (en) * 2023-05-29 2023-09-08 苏州索力伊智能科技有限公司 Control system and method for automatic connector assembly equipment
CN116713709B (en) * 2023-05-29 2023-12-19 苏州索力伊智能科技有限公司 Control system and method for automatic connector assembly equipment
CN117557082A (en) * 2023-11-16 2024-02-13 深圳市航盛电子股份有限公司 Process processing method, device, equipment and storage medium for electronic components
CN117557082B (en) * 2023-11-16 2024-04-19 深圳市航盛电子股份有限公司 Process processing method, device, equipment and storage medium for electronic components

Also Published As

Publication number Publication date
CN115526093A (en) 2022-12-27

Similar Documents

Publication Publication Date Title
WO2022267509A1 (en) Method for training smt printing parameter optimization model, device, and storage medium
WO2022247955A1 (en) Abnormal account identification method, apparatus and device, and storage medium
CN108052505A (en) Text emotion analysis method and device, storage medium, terminal
CN110263009A (en) Generation method, device, equipment and the readable storage medium storing program for executing of log classifying rules
US20230259739A1 (en) Image detection method and apparatus, computer-readable storage medium, and computer device
CN111797320B (en) Data processing method, device, equipment and storage medium
CN110689368B (en) Method for designing advertisement click rate prediction system in mobile application
WO2021012894A1 (en) Method and apparatus for obtaining neural network test report, device, and storage medium
CN110909868A (en) Node representation method and device based on graph neural network model
CN110458600A (en) Portrait model training method, device, computer equipment and storage medium
CN107122786A (en) A kind of mass-rent learning method and device
CN111815169A (en) Business approval parameter configuration method and device
CN109783459A (en) The method, apparatus and computer readable storage medium of data are extracted from log
CN110929028A (en) Log classification method and device
CN112365007A (en) Model parameter determination method, device, equipment and storage medium
CN108647714A (en) Acquisition methods, terminal device and the medium of negative label weight
CN113742069A (en) Capacity prediction method and device based on artificial intelligence and storage medium
CN113761193A (en) Log classification method and device, computer equipment and storage medium
CN113704389A (en) Data evaluation method and device, computer equipment and storage medium
CN110264311B (en) Business promotion information accurate recommendation method and system based on deep learning
CN101877866B (en) Method and device for detecting mobile terminal model capability
CN109726824A (en) The transfer learning method and terminal device of training pattern
CN110263803A (en) Processing method, device, server and the storage medium of sample data
CN110196797A (en) Automatic optimization method and system suitable for credit scoring card system
CN113641823B (en) Text classification model training, text classification method, device, equipment and medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22827016

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE