CN116502959B

CN116502959B - Product manufacturing quality prediction method based on meta learning

Info

Publication number: CN116502959B
Application number: CN202310738959.4A
Authority: CN
Inventors: 单忠德; 张朔山; 汪俊; 单鹏飞; 李大伟
Original assignee: Nanjing University of Aeronautics and Astronautics
Current assignee: Nanjing University of Aeronautics and Astronautics
Priority date: 2023-06-21
Filing date: 2023-06-21
Publication date: 2023-09-08
Anticipated expiration: 2043-06-21
Also published as: CN116502959A

Abstract

The application relates to the technical field of product manufacturing quality prediction, solves the technical problem that small sample size data cannot support quality prediction of complex industrial structural parts with multiple production procedures, and particularly relates to a product manufacturing quality prediction method based on meta-learning, which comprises the following steps: s1, acquiring sample data of each manufacturing procedure in the process of manufacturing a complex structural member, wherein the sample data comprises process parameters, related sensor data and quality indexes; s2, preprocessing sample data to obtain a plurality of sample data sets, dividing the sample data sets into a training set, a verification set and a test set, and dividing each sample data set into a support set and a query set. The method can realize the prediction of the processing state of the complex structural member in the aerospace field, further realize the process control, obviously improve the processing efficiency and quality of the key structural member of the aero-engine and ensure the safety of the processing process.

Description

Product manufacturing quality prediction method based on meta learning

Technical Field

The application relates to the technical field of product manufacturing quality prediction, in particular to a product manufacturing quality prediction method based on meta learning.

Background

In recent years, the manufacturing industry at home and abroad gradually develops to organically combine the traditional processing technology with digital twin and artificial intelligence, so as to intelligently regulate and control the processing process through a virtual twin system. In the aerospace field, complex structural parts exist, the manufacturing process of the complex structural parts is a nonlinear multi-field coupling machining process, and along with material removal and surface creation, various complex physical processes are highly coupled, so that the shape precision and the surface state of the target structural parts are comprehensively influenced.

For the problem of data-driven processing state prediction, conventional machine learning algorithms such as support vector machines and the like have limitations of generalizing performance in the face of complex industrial scenarios of multiple process coupling. The deep neural network has good prediction performance, such as a recurrent neural network, a long-short-term memory network and the like, can capture time dynamic behaviors in data, but is dependent on data quantity, the prediction performance of small sample quantity data is general, data samples cannot be fully utilized, and the accuracy of the small sample quantity prediction under a complex technological process is difficult to realize.

Disclosure of Invention

Aiming at the defects of the prior art, the application provides a product manufacturing quality prediction method based on meta-learning, which solves the technical problem that small sample size data cannot support quality prediction of complex industrial structural parts with multiple production procedures.

In order to solve the technical problems, the application provides the following technical scheme: a product manufacturing quality prediction method based on meta learning, the prediction method comprising the steps of:

s1, acquiring sample data of each manufacturing procedure in the process of manufacturing a complex structural member, wherein the sample data comprises process parameters, related sensor data and quality indexes;

s2, preprocessing sample data to obtain a plurality of sample data sets, dividing the sample data sets into a training set, a verification set and a test set, and dividing each sample data set into a support set and a query set;

s3, mapping sample features of the sample data set corresponding to different procedures to the same dimension, and performing time sequence coding on the features according to the sequence of each procedure to serve as supplementary features;

s4, mining expected characterization reflecting sample commonalities related to measurement indexes through sample data in the support set to serve as newly generated supplementary features;

s5, constructing a decoder, and predicting the quality index of the sample in the query set by combining the input and the supplementary features of the support set to obtain a quality prediction result;

s6, respectively constructing a loss function for the newly generated supplementary features and the quality prediction result, and updating parameters through a back propagation algorithm to obtain a new digital twin model;

s7, inputting a sample of the complex structural member manufacturing process into a new digital twin model to finish the manufacturing quality prediction of the industrial manufacturing process.

Further, in step S2, the specific process includes the following steps:

s201, performing outlier processing and normalization processing on sample data of technological parameters of each manufacturing procedure and quality indexes of complex structural members to obtain a sample data set;

s202, dividing a sample data set into a training set, a verification set and a test set, taking out the first K samples in each data set on the basis of the training set, the verification set and the test set as supporting sets, and taking the rest samples as query sets.

Further, in step S3, the specific process includes the following steps:

s301, mapping sample characteristics of different dimensions corresponding to different working procedures to the same dimension through a fully connected neural network;

further, mapping sample features of different dimensions to the same dimension is accomplished using the following formula:

；

wherein ,sample characteristics are shown,/->Representing the feature representation mapped to the same dimension;

s302, constructing a two-way long-short-term memory network BiLSTM according to the sequence of each procedure, then encoding sample characteristics, and splicing the two-way hidden states of the two-way long-term memory network BiLSTM as output to serve as time sequence characteristics of each procedure by taking the mapped sample characteristics of each procedure in a sample as input;

s303, splicing the sample features with the dimensions mapped to the same dimension with the time sequence features to serve as initial feature codes of the samples.

Further, in step S4, the specific process includes the following steps:

s401, splicing initial feature codes and quality indexes of samples to obtain new sample representations;

s402, inputting a new sample representation into a multi-head attention mechanism, and outputting an updated representation as each sample, wherein the updated representation comprises correlations among the samples;

s403, the updated representation of all the support samples is averaged and scaled through a layer of fully connected network to be used as newly generated supplementary features.

Further, the newly generated supplemental features are accomplished using the following formula:

；

wherein ,representing new sample representations after splicing the support set feature representation with the quality index, K representing the number of support set samples,/for the support set feature representation>Representation->Dimension of->Reflecting the importance of the jth sample to the ith sample,/>Is an updated representation of sample i.

；

wherein ,for the temporary representation of the supplementary features, K represents the number of support set samples, +.>For supplementing features, a->Representing a weight matrix, +.>Representing the bias term.

Further, in step S5, the specific process includes the steps of:

s501, combining the sample representation of the support set with newly generated supplementary features to serve as an input feature representation of the support set, and taking the feature representation of the query set after encoding as the input of the query set;

s502, calculating an update representation of the support samples as part of the input of the decoder;

s503, calculating correlation between the support set and the sample of the query set through an attention mechanism, wherein the importance of each support sample is regarded as a weight parameter generated in a decoder;

s504, multiplying the generation parameters of the support samples with the characteristic representation of the query sample to add the bias, and obtaining the quality prediction result of the query sample.

Further, in step S6, the specific process includes the steps of:

s601, respectively constructing a multiple loss function for the newly generated supplementary features, the prediction result and the generation parameters of the decoder;

s602, weighting and summing the multiple loss functions to be used as a total loss function of the digital twin model;

and S603, carrying out iterative updating on parameters of the digital twin model through a back propagation algorithm to obtain a new digital twin model.

By means of the technical scheme, the application provides a product manufacturing quality prediction method based on meta learning, which has at least the following beneficial effects:

1. according to the application, the time sequence characteristics corresponding to each process in the product process flow can be mined through the bidirectional long-short-term memory network, and the influence relationship of mutual coupling among different processes is reserved in the characteristics;

based on the idea of meta-learning, the priori knowledge reflecting sample commonality corresponding to each quality index in the data is mined through a multi-head attention mechanism to be used as the supplement of the features, and limited data can be fully utilized, so that the prediction has better interpretability; the decoding prediction process is based on an attention mechanism, and the prediction accuracy of the small sample under the two supports can be effectively improved by combining the relation between the known result sample and the unknown result sample.

2. The method can realize the prediction of the processing state of the complex structural member in the aerospace field, further realize the process control, obviously improve the processing efficiency and quality of the key structural member of the aero-engine and ensure the safety of the processing process.

3. The application adopts the idea of meta learning, and combines the attention mechanism to mine the priori knowledge in the sample examples, fully utilizes the data samples, and is beneficial to improving the accuracy of small sample quantity prediction in the complex process.

Drawings

The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this specification, illustrate embodiments of the application and together with the description serve to explain the application and do not constitute a limitation on the application. In the drawings:

FIG. 1 is a flow chart of a method for predicting manufacturing quality of a product according to the present application;

FIG. 2 is a flow chart of the critical fabrication of a complex structure of the present application;

FIG. 3 is a schematic diagram of a network architecture of the new digital twin model of the present application;

FIG. 4 is a graph of the predictive test results of the novel digital twin model of the present application.

Detailed Description

In order that the above-recited objects, features and advantages of the present application will become more readily apparent, a more particular description of the application will be rendered by reference to the appended drawings and appended detailed description. Therefore, the realization process of how to apply the technical means to solve the technical problems and achieve the technical effects can be fully understood and implemented.

Those of ordinary skill in the art will appreciate that all or a portion of the steps in a method of implementing an embodiment described above may be implemented by a program to instruct related hardware, and thus, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.

Referring to fig. 1-4, a specific implementation manner of the present embodiment is shown, in which the present embodiment adopts the idea of meta-learning, and the attention mechanism is combined to mine priori knowledge in the sample instance, so that the data sample is fully utilized, and the accuracy of small sample size prediction in the complex process is improved.

Referring to fig. 1 and 2, taking a key manufacturing process of a complex structural member shown in fig. 2 as an example, a specific embodiment of the product manufacturing quality prediction method includes the following steps:

the sample data of each manufacturing procedure is used for determining the technological parameters of each procedure and the quality indexes obtained by measurement according to the specific technological process of industrial products in a manufacturing workshop, wherein the number of the technological parameters and the quality indexes is multiple, and each manufacturing procedure corresponds to at least one sample data, so that the technological parameters, the related sensor data and the quality indexes in each procedure are determined according to the sample data.

as a preferred mode of the present embodiment, the method adopted for realizing step S2 is as follows:

abnormal value processing adopts abnormal value processing of data processing in the prior art, the abnormal value processing is realized by adopting a deletion, deletion value treatment, average value correction or capping method, wherein the abnormal value processing is realized by adopting a 3 sigma principle to establish that the interference or noise of singular data is difficult to meet the normal distribution on the basis of equal-precision repeated measurement of the normal distribution, so that abnormal value detection is completed. The normalization process adopts a normalization process method known in the prior art, and is not described in detail here.

S202, dividing a sample data set into a training set, a verification set and a test set, taking the first K samples in each data set as a supporting set and taking the rest samples as a query set, wherein the value of K is usually 5-20% of the number of the samples of the whole data set, and the ratio can provide enough training data to train the digital twin model, and meanwhile, excessive fitting of the digital twin model caused by using too much data is avoided.

In the step, each manufacturing procedure corresponds to at least one sample data, the sample data corresponds to at least one sample data set after pretreatment, and the digital twin model can be fully trained by selecting more sample data sets, and meanwhile, the accurate prediction of the manufacturing quality of the digital twin model for the manufacturing process of the complex structural member is improved.

as a preferred mode of the present embodiment, the method adopted for realizing step S3 is as follows:

in step S301, mapping sample features of different dimensions to the same dimension is accomplished using the following formula:

；

wherein ,representing sample characteristics->Representation of the feature after mapping to the same dimension, < +.>Representing a weight matrix, each row of which corresponds to an embedded vector,>representing the bias term.

in step S302, the extraction of the time-series features is completed using the following formula:

；

wherein The hidden state, which is expressed as the output of the bi-directional long-short term memory network BiLSTM, is characterized as a time series.

The specific calculation method of the two-way long-short-term memory network BiLSTM comprises the following steps:

for a forward LSTM there are:

；

wherein Indicating forgetfulness door, < >>Representing the input door,/->Representing the output door, & lt + & gt>Representing the feature vector of the input +.>Indicating the temporary status of the cell,/->Indicates the hidden state of the last moment, +.>Representing a weight matrix, +.>Representing the bias term.

；

wherein Indicates the cell state at time t,/->The hidden state at time t is indicated.

；

wherein and />Respectively representing hidden states of the forward and reverse outputs of the two-way long-short-term memory network BiLSTM, and obtaining output +.>。

as a preferred mode of the present embodiment, the method adopted for realizing step S4 is as follows:

In step S4, the newly generated supplemental features are completed using the following formula:

；

wherein ,representing new sample representations after splicing the support set feature representation with the quality index, K representing the number of support set samples,/for the support set feature representation>Representation->Dimension of->Reflecting the importance of the jth sample to the ith sample,/>For an updated representation of sample i, +.>Are weight matrices.

；

as a preferred mode of the present embodiment, the method adopted for realizing step S5 is as follows:

s502, repeating the process in the step S402, and calculating an update representation of the support samples as part of the input of the decoder;

the weight parameter generation is expressed as:

；

wherein , and />Representing the feature representation of the query set and the support set encoded, respectively, < >>Representation->Is a dimension of (2); />Sample for support set->The updated representation obtained after repeating the procedure of step S402, is->Representing a weight matrix, +.>Representing temporary generation parameters that are not projected via the fully connected network, K represents the number of support set samples.

；

wherein ,is +.>，/>Front middle>Item->Representing the generation weight, last item +.>For bias item->An activation function representing a fully connected network, typically a softmax function,/for example>Representing a weight matrix, +.>Representing the bias term.

The quality prediction result is calculated according to the following formula:

；

as a preferred mode of the present embodiment, the method adopted for realizing step S6 is as follows:

multiple loss function、/> and />The method comprises the following steps of:

；

wherein ,for supporting the sample size of the set corresponding to all quality indicators, < >>Representation divide sample->The remaining sample sets, except ∈ ->For optimizing a priori feature generation such that the quality index characterizes a similar sample with a feature representation closer,/i>、Representing the newly generated supplemental features.

；

wherein ,for balancing weights in the decoder, preventing the overfitting phenomenon caused by the existence of excessive partial weights,is +.>，/>Front middle>Item->Representing the generation weight, last item +.>Is a bias term.

；

Wherein, the mean square error is taken as the loss of the product quality index prediction，/>Indicating the procedure->Is>Prediction of individual quality indicators, < >>Representing the true value +_>Represents the number of procedures in the current training batch, +.>Indicating procedure->The number of training samples included.

；

wherein the weighted sum of the losses of each part is used as a multi-element loss function of the whole digital twin model iteration，Representing the weight parameters, respectively.

The product manufacturing quality prediction digital twin model based on meta learning provided by the embodiment can dig out time sequence characteristics corresponding to each process in the product process flow through a two-way long-short-term memory network, and the influence relationship of mutual coupling among different processes is reserved in the characteristics;

Specifically, please refer toFig. 3 is a schematic diagram of a network structure of a new digital twin model, in fig. 3,representing support set sample features, +.>Representing sample characteristics of a query set,/->Sample tags representing support sets, i.e. quality index,/->Representing learned supplemental features->The prediction result of the quality index is shown, in this embodiment, the new digital twin model is verified and quantitatively analyzed by using the sample of the product manufacturing process shown in fig. 2, the average absolute error of the prediction is 0.36, the root mean square error is 0.48, and the prediction result is shown in fig. 4. Compared with other traditional algorithms, the method has better performance and lower error, can effectively solve the problem of product manufacturing quality prediction under the support of small sample size, and has the results shown in table 1.

The prediction method provided by the embodiment can be used for predicting the processing state of the complex structural member in the aerospace field, further realizing process regulation, remarkably improving the processing efficiency and quality of the key structural member of the aero-engine and ensuring the safety of the processing process.

And meanwhile, the idea of meta learning is adopted, priori knowledge in sample examples is mined by combining an attention mechanism, data samples are fully utilized, and the accuracy of small sample quantity prediction in a complex process is improved.

The foregoing embodiments have been presented in a detail description of the application, and are presented herein with a particular application to the understanding of the principles and embodiments of the application, the foregoing embodiments being merely intended to facilitate an understanding of the method of the application and its core concepts; meanwhile, as those skilled in the art will have variations in the specific embodiments and application scope in accordance with the ideas of the present application, the present description should not be construed as limiting the present application in view of the above.

Claims

1. A product manufacturing quality prediction method based on meta learning, characterized in that the prediction method comprises the following steps:

s3, mapping sample features of the sample data set corresponding to different procedures to the same dimension, and performing time sequence coding on the features according to the sequence of each procedure to serve as supplementary features; in step S3, the specific process includes the following steps:

s303, splicing the sample features with the dimensions mapped to the same dimension with the time sequence features to serve as initial feature codes of the samples;

s4, mining expected characterization reflecting sample commonalities related to measurement indexes through sample data in the support set to serve as newly generated supplementary features; in step S4, the specific process includes the following steps:

s403, taking average and scaling of updated representations of all support samples through a layer of fully connected network to serve as newly generated supplementary features;

s5, constructing a decoder, and predicting the quality index of the sample in the query set by combining the input and the supplementary features of the support set to obtain a quality prediction result; in step S5, the specific process includes the following steps:

s504, multiplying the generation parameters of the support samples with the characteristic representation of the query sample to add a bias, so as to obtain a quality prediction result of the query sample;

s6, respectively constructing a loss function for the newly generated supplementary features and the quality prediction result, and updating parameters through a back propagation algorithm to obtain a new digital twin model; in step S6, the specific process includes the following steps:

s603, carrying out iterative updating on parameters of the digital twin model through a back propagation algorithm to obtain a new digital twin model;

2. The product manufacturing quality prediction method according to claim 1, wherein in step S2, the specific process includes the steps of:

3. The method according to claim 1, wherein in step S301, mapping sample features of different dimensions to the same dimension is accomplished using the following formula:

；

4. The method of claim 1, wherein in step S4, the newly generated supplemental features are accomplished using the following formula:

；

wherein ,representing new sample representations after splicing the support set feature representation with the quality index, K representing the number of support set samples,/for the support set feature representation>Representation->Dimension of->Reflecting the importance of the jth sample to the ith sample,/>For an updated representation of sample i, +.>Are weight matrixes;

；

5. The product manufacturing quality prediction method according to claim 1, wherein in step S61, a multiple loss function is used、/> and />The method comprises the following steps of:

；

wherein ,for supporting the sample size of the set corresponding to all quality indicators, < >>Representation divide sample->The remaining sample sets, except ∈ ->For optimizing a priori feature generation such that the quality index characterizes a similar sample with a feature representation closer,/i>、/>Representing the newly generated supplemental features;

；

wherein ,for balancing weights in a decoder, preventing the existence of partial weights that are too large resulting in an overfitting phenomenon, < >>Is +.>，/>Front middle>Item->Representing the generation weight, last item +.>Is a bias term;

；