CN113076583A - Cantilever beam structure design method based on self-attention mechanism neural network - Google Patents

Cantilever beam structure design method based on self-attention mechanism neural network

Info

Publication number
CN113076583A
CN113076583A
Authority
CN
China
Prior art keywords
model
cantilever beam
beam structure
data
neural network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110385261.XA
Other languages
Chinese (zh)
Other versions
CN113076583B (en)
Inventor
郑帅
栗阳阳
范浩杰
洪军
李宝童
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xian Jiaotong University
Original Assignee
Xian Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xian Jiaotong University filed Critical Xian Jiaotong University
Priority to CN202110385261.XA priority Critical patent/CN113076583B/en
Publication of CN113076583A publication Critical patent/CN113076583A/en
Application granted granted Critical
Publication of CN113076583B publication Critical patent/CN113076583B/en
Legal status: Active

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F 30/00 - Computer-aided design [CAD]
    • G06F 30/10 - Geometric CAD
    • G06F 30/13 - Architectural design, e.g. computer-aided architectural design [CAAD] related to design of buildings, bridges, landscapes, production plants or roads
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F 30/00 - Computer-aided design [CAD]
    • G06F 30/20 - Design optimisation, verification or simulation
    • G06F 30/23 - Design optimisation, verification or simulation using finite element methods [FEM] or finite difference methods [FDM]
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F 30/00 - Computer-aided design [CAD]
    • G06F 30/20 - Design optimisation, verification or simulation
    • G06F 30/27 - Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 - Computing arrangements based on biological models
    • G06N 3/02 - Neural networks
    • G06N 3/04 - Architecture, e.g. interconnection topology
    • G06N 3/045 - Combinations of networks
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 - Computing arrangements based on biological models
    • G06N 3/02 - Neural networks
    • G06N 3/08 - Learning methods
    • G06N 3/084 - Backpropagation, e.g. using gradient descent

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Geometry (AREA)
  • Computer Hardware Design (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Architecture (AREA)
  • Civil Engineering (AREA)
  • Structural Engineering (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Feedback Control In General (AREA)

Abstract

The invention discloses a cantilever beam structure design method based on a self-attention mechanism neural network. The method addresses the long computation and iteration times of generating a cantilever beam structure with the traditional MMC algorithm and comprises two parts, model acquisition and model use. The main flow of model acquisition is: 1. generate a training set and a test set in advance using the MMC method; 2. clean the generated data; 3. construct a word-vector-like embedding model using a multilayer perceptron; 4. generate the final output using the improved Transformer model; 5. train the model; 6. obtain the final result with the trained model. When the final model is used for structure optimization design, the boundary condition vector is input into the model to obtain the model output, which is then fed into the MMC drawing function, realizing fast computation of the final optimized structure.

Description

Cantilever beam structure design method based on self-attention mechanism neural network
Technical Field
The invention belongs to the field of artificial intelligence and structural design optimization, and particularly relates to a cantilever beam structure design method based on a self-attention mechanism neural network.
Background
In order to design an optimal cantilever beam structure under given load conditions, constraints and performance indexes, researchers at home and abroad apply topology optimization. The basic idea of topology optimization is to convert the problem of finding a structure's optimal topology into the problem of finding the optimal material distribution within a given design domain. Current topology optimization methods mainly include the SIMP algorithm, the ESO algorithm, the level set method and the MMC (Moving Morphable Components) algorithm. The computational cost of these methods depends on the size of the design domain and multiplies as the domain grows, so obtaining the optimal design result takes a long time.
With the continuous development of deep learning, its impact across fields keeps growing. Deep learning is a young branch of machine learning research whose motivation is to build neural networks that emulate how the human brain analyses, learns and interprets data. Based on a neural network model from deep learning, a constraint condition matrix can be defined and fed into the network, and a well-converged model can be trained, solving the problems of excessive computation and long generation times when producing a cantilever beam structure.
Disclosure of Invention
The invention provides a cantilever beam structure design method based on a self-attention mechanism neural network, which greatly reduces the computational complexity and overhead of generating a cantilever beam structure.
The invention is realized by adopting the following technical scheme:
a cantilever beam structure design method based on a self-attention mechanism neural network comprises the following steps:
1) preparing cantilever beam structure data using the MMC algorithm;
2) cleaning the generated cantilever beam structure data;
3) constructing a word-vector-like embedding model using a multilayer perceptron;
4) generating the final output using an improved Transformer model;
5) training the model using K-fold cross validation;
6) outputting the final result using the trained model.
In a further improvement of the invention, step 1) is implemented as follows:
101) generating input vectors and output vectors by using an MMC algorithm to construct a cantilever beam structure data set;
102) dividing the generated cantilever beam structure data set into a training set, a test set and a validation set in the proportion 8:1:1.
In a further improvement of the invention, step 2) is implemented as follows:
The cantilever beam structure data of the data set from step 1) are screened, and samples with unsatisfactory generated results are selected and deleted from the cantilever beam structure data set.
In a further improvement of the invention, step 3) is implemented as follows:
The multilayer perceptron reshapes the raw data, converting the one-dimensional boundary condition input vector into a two-dimensional vector conforming to the MMC algorithm's data format, which is passed to the next layer of the model as its input.
In a further improvement of the invention, step 4) is implemented as follows:
401) taking the two-dimensional vector generated in step 3) as the input of the Transformer model, and combining it with position information using the positionEncoding method;
402) using a multi-layer network with a self-attention mechanism for feature extraction and combination, generating a two-dimensional vector recognizable by the MMC algorithm.
In a further improvement of the invention, step 5) is implemented as follows:
501) taking the 1st part of the training data as the test fold and the rest as the training data;
502) training the model and computing its loss on the held-out part;
503) repeating the above with a different part held out each time until the neural network converges;
504) taking the average accuracy as the final loss of the model; a minimal sketch of this K-fold procedure follows.
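For illustration only, the following is a minimal PyTorch sketch of the K-fold procedure of steps 501)-504); the fold count, epoch budget and the small stand-in regression model are assumptions, since the patent specifies none of them.

```python
import torch
import torch.nn as nn

def k_fold_loss(x, y, k=10, epochs=50):
    """Hold out one of k parts in turn, train on the rest, average the held-out MAE."""
    n = x.shape[0]
    fold = n // k
    losses = []
    for i in range(k):
        test = slice(i * fold, (i + 1) * fold)
        train_idx = torch.cat([torch.arange(0, test.start), torch.arange(test.stop, n)])
        # Stand-in model; the patent's actual network is the MLP + Transformer pipeline.
        model = nn.Sequential(nn.Linear(x.shape[1], 64), nn.ReLU(), nn.Linear(64, y.shape[1]))
        opt = torch.optim.Adam(model.parameters())
        mae = nn.L1Loss()
        for _ in range(epochs):
            opt.zero_grad()
            loss = mae(model(x[train_idx]), y[train_idx])
            loss.backward()
            opt.step()
        with torch.no_grad():
            losses.append(mae(model(x[test]), y[test]).item())  # loss on the held-out part
    return sum(losses) / k  # averaged over folds, taken as the model's final loss

# Example: 640 training samples of a 7-dimensional boundary-condition vector.
print(k_fold_loss(torch.randn(640, 7), torch.randn(640, 112)))
```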
In a further improvement of the invention, step 6) is implemented as follows:
601) training the improved Transformer model improves its feature extraction and generation capabilities; when the test-set loss converges, the model has reached its capacity;
602) the model is used for verification: a portion of data is randomly selected from the validation set as input and fed into the trained model to obtain an output;
603) the output is fed into the MMC drawing function for processing, finally obtaining the generated cantilever beam structure diagram.
The invention has at least the following beneficial technical effects:
In the cantilever beam structure design method based on the self-attention mechanism neural network, the improved Transformer model can be trained on a data set generated by the MMC algorithm; the trained model maps an input boundary condition vector to a two-dimensional vector recognizable by the MMC drawing function, and the cantilever beam structure diagram is then obtained through that function. Because the data set is generated with the MMC method, the neural network can be trained independently of the MMC algorithm while keeping its output format consistent with the MMC algorithm's, which reduces the model's computational complexity and improves its applicability.
Furthermore, the invention uses the positionEncoding method to combine vectors, adding position information to the computed vectors; this ensures that the model assigns different weights to vectors at different positions during prediction, improving its learning ability and prediction accuracy. A sketch of this encoding is given below.
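The patent names the positionEncoding method without giving its formula; the sketch below assumes the standard sinusoidal encoding from the original Transformer ("Attention Is All You Need").

```python
import torch

def position_encoding(seq_len, d_model):
    """Sine on even dimensions, cosine on odd ones; added to the input so that
    otherwise identical vectors at different positions get different representations."""
    pos = torch.arange(seq_len, dtype=torch.float32).unsqueeze(1)   # [seq_len, 1]
    i = torch.arange(0, d_model, 2, dtype=torch.float32)            # even dimension indices
    angle = pos / torch.pow(10000.0, i / d_model)                   # [seq_len, ceil(d/2)]
    pe = torch.zeros(seq_len, d_model)
    pe[:, 0::2] = torch.sin(angle)
    pe[:, 1::2] = torch.cos(angle)[:, : d_model // 2]
    return pe

# Combined with a [batch, 16, 7] input by addition:
# x = x + position_encoding(16, 7)
```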
Drawings
FIG. 1 is a schematic flow chart of the present invention.
FIG. 2 is a schematic diagram of the structure of the multilayer perceptron in the embodiment.
FIG. 3 is a schematic diagram of an improved Transformer model constructed in the example.
Detailed Description
The invention is explained in detail below with reference to the drawings and the embodiments.
As shown in fig. 1, the cantilever beam structure design method based on a self-attention mechanism neural network provided by the invention comprises the following steps. Step one: prepare cantilever beam structure data using the MMC algorithm. Step two: clean the generated cantilever beam structure data. Step three: construct a word-vector-like embedding model using a multilayer perceptron. Step four: generate the final output using the improved Transformer model. Step five: train the model. Step six: use the final model. The method accurately generates an optimized structure while greatly reducing computational complexity and overhead.
The self-attention mechanism deep neural network accelerated structural optimization design method comprises the following steps:
First, cantilever beam structure data are prepared using the MMC algorithm: among the available topology optimization methods, the traditional MMC algorithm is selected to generate 1000 groups of data; each group comprises an input vector, an output vector and a result figure, stored in txt, txt and jpg format respectively. The data set is generated as follows: in a 2 x 1 design domain, a 200 x 100 grid is used for finite element analysis; the Poisson's ratio ranges over [0.1, 0.8], the load point lies at position [0, 100] along the rightmost edge of the grid, the volume fraction is 0.4 and the Young's modulus is 1. The 800 groups retained after data cleaning (the second step below) are divided into a training set (640 groups), a test set (80 groups) and a validation set (80 groups) in the proportion 8:1:1; a sketch of this split follows.
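A minimal sketch of the 8:1:1 split described here; the shuffling and the in-memory representation of the groups are assumptions, as the patent only states the proportions and counts.

```python
import random

def split_dataset(samples, ratios=(8, 1, 1)):
    random.shuffle(samples)
    total = sum(ratios)
    n_train = len(samples) * ratios[0] // total   # 800 -> 640
    n_test = len(samples) * ratios[1] // total    # 800 -> 80
    train = samples[:n_train]
    test = samples[n_train:n_train + n_test]
    valid = samples[n_train + n_test:]            # remaining 80
    return train, test, valid

train, test, valid = split_dataset(list(range(800)))
print(len(train), len(test), len(valid))  # 640 80 80
```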
Second, the generated cantilever beam structure data are cleaned: the data set obtained in the first step is screened, because randomly generated cases may differ too much from data used in normal engineering, causing the final result of the MMC iteration to lose physical meaning. Samples with unsatisfactory generated results are selected and deleted from the data set.
Third, a word-vector-like embedding model is constructed using a multilayer perceptron:
As shown in FIG. 2, the second step yields the data set usable as input, and the data are fed in batches of 64 groups. The input size is [batch, 7], and the multilayer perceptron maps the data to the higher dimension [batch, 112]. A residual module is added to each layer of the perceptron; each residual block consists of two parts, a direct mapping and a residual mapping. In this model a two-layer fully connected network serves as the residual mapping, which increases the degrees of freedom of the network, lets it fit functions better in high-dimensional space, and alleviates the gradient explosion problem of deep networks. Batch normalization is applied after each layer's fully connected output, re-centering every layer's output distribution into the range where the gradient is largest and accelerating convergence. Finally the data are reshaped from [batch, 112] to [batch, 16, 7] as input to the next model; a sketch of this embedding model follows.
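A minimal PyTorch sketch of this embedding model; the block count and the ReLU activations are assumptions, while the [batch, 7] -> [batch, 112] -> [batch, 16, 7] shapes follow the text.

```python
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    """Two fully connected layers as the residual mapping, batch norm after each
    linear output, plus the direct (identity) mapping."""
    def __init__(self, dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim, dim), nn.BatchNorm1d(dim), nn.ReLU(),
            nn.Linear(dim, dim), nn.BatchNorm1d(dim),
        )

    def forward(self, x):
        return torch.relu(self.net(x) + x)  # residual part + direct mapping

class WordVectorModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.lift = nn.Linear(7, 112)  # map [batch, 7] up to [batch, 112]
        self.blocks = nn.Sequential(ResidualBlock(112), ResidualBlock(112))

    def forward(self, x):
        return self.blocks(self.lift(x)).view(-1, 16, 7)  # reshape for the Transformer

print(WordVectorModel()(torch.randn(64, 7)).shape)  # torch.Size([64, 16, 7])
```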
Fourth, the final output is generated using the improved Transformer model:
As shown in fig. 3, the data output by step three are fed into the Transformer model to compute the self-attention matrix. In the Transformer model all vectors of a group enter the network simultaneously, so position information must be added to distinguish the components: before the data enter the network, sine and cosine position codes are added, attaching position information to the input matrix. The position-encoded data then enter the self-attention layer. There, each 16 x 7 input is treated as an input sequence $x_1, \ldots, x_{16}$; each element is multiplied by a transformation matrix $W$,

$$a_i = W x_i,$$

giving $a_1, \ldots, a_{16}$, and each $a_i$ is multiplied by the transformation matrices $W^q$, $W^k$ and $W^v$ to obtain $q_i$, $k_i$ and $v_i$. The attention scores are then computed from $q$ and $k$:

$$\alpha_{i,j} = \frac{q_i \cdot k_j}{\sqrt{d}},$$

where $d$ is the dimension of $q$ and $k$. The resulting $\alpha_{i,j}$ are passed through a softmax layer to obtain $\hat{\alpha}_{i,j}$, computed as

$$\hat{\alpha}_{i,j} = \frac{\exp(\alpha_{i,j})}{\sum_{j'} \exp(\alpha_{i,j'})}.$$

Each $\hat{\alpha}_{i,j}$ is multiplied by the corresponding $v_j$ and summed,

$$b_i = \sum_{j} \hat{\alpha}_{i,j} v_j,$$

finally yielding the output sequence $b_1, \ldots, b_{16}$.
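These formulas transcribe directly into code; a single-head sketch follows, with randomly initialized matrices standing in for the learned W, Wq, Wk and Wv.

```python
import torch

def self_attention(x, W, Wq, Wk, Wv):
    a = x @ W.T                                 # a_i = W x_i
    q, k, v = a @ Wq.T, a @ Wk.T, a @ Wv.T      # q_i, k_i, v_i
    d = q.shape[-1]
    alpha = q @ k.transpose(-2, -1) / d ** 0.5  # alpha_ij = q_i . k_j / sqrt(d)
    alpha_hat = torch.softmax(alpha, dim=-1)    # normalised over j
    return alpha_hat @ v                        # b_i = sum_j alpha_hat_ij v_j

x = torch.randn(16, 7)                          # one input sequence x_1 ... x_16
W, Wq, Wk, Wv = (torch.randn(7, 7) for _ in range(4))
print(self_attention(x, W, Wq, Wk, Wv).shape)   # torch.Size([16, 7])
```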
Fifth, the model is trained:
As shown in fig. 3, once the word-vector training network and the improved Transformer network have each been defined, the two modules are connected for training. An input vector from the data set is fed into the word-vector model to obtain a two-dimensional vector, which is passed to the next network; in this process it is combined with the vector produced by the positionEncoding layer so that it carries position information. The combined vector is fed into the improved Transformer model for training to obtain an output value. The loss value is computed with the MAE method as the difference between the model's output and the output in the data set, and the model parameters are updated by the backpropagation algorithm; a sketch of this training step follows.
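A minimal sketch of this joint training step; the Adam optimizer and the single-layer stand-ins for the two modules are assumptions, with nn.L1Loss providing the MAE loss named in the text.

```python
import torch
import torch.nn as nn

embedding = nn.Sequential(nn.Linear(7, 112))      # stand-in for the word-vector model
transformer = nn.Sequential(nn.Linear(112, 112))  # stand-in for the improved Transformer
opt = torch.optim.Adam(list(embedding.parameters()) + list(transformer.parameters()))
mae = nn.L1Loss()                                 # MAE between model output and dataset output

def train_step(x, y):
    opt.zero_grad()
    pred = transformer(embedding(x))              # the two modules connected end to end
    loss = mae(pred, y)
    loss.backward()                               # backpropagation updates both modules
    opt.step()
    return loss.item()

print(train_step(torch.randn(64, 7), torch.randn(64, 112)))
```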
Sixth, using the final model:
As the improved Transformer model is trained, its feature extraction and generation capabilities improve; once the test-set loss converges, the model has reached its capacity. The model is then used for verification: a portion of data is randomly selected from the test set as input and fed into the trained model to obtain an output. The output is passed to the MMC drawing function for processing, finally yielding the generated result figure and realizing fast computation of the final optimized structure; a sketch of this inference path follows.
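A sketch of the inference path just described; `mmc_draw` is a hypothetical stand-in for the MMC drawing function, which the patent does not show, and the single-layer model is a placeholder for the trained network.

```python
import torch

def mmc_draw(components):
    """Hypothetical stand-in for the MMC drawing function: here it only reports
    the component parameters it would render."""
    print(f"drawing {components.shape[0]} morphable components")

model = torch.nn.Sequential(torch.nn.Linear(7, 112))  # trained-model stand-in
with torch.no_grad():
    boundary = torch.randn(1, 7)        # one boundary-condition vector
    out = model(boundary).view(16, 7)   # two-dimensional vector in the MMC format
mmc_draw(out)
```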

Claims (7)

1. A cantilever beam structure design method based on a self-attention mechanism neural network is characterized by comprising the following steps:
1) preparing cantilever beam structure data using the MMC algorithm;
2) cleaning the generated cantilever beam structure data;
3) constructing a word-vector-like embedding model using a multilayer perceptron;
4) generating the final output using an improved Transformer model;
5) training the model using K-fold cross validation;
6) outputting the final result using the trained model.
2. The cantilever beam structure design method based on the self-attention mechanism neural network as claimed in claim 1, wherein the specific implementation method of step 1) is as follows:
101) generating input vectors and output vectors by using an MMC algorithm to construct a cantilever beam structure data set;
102) dividing the generated cantilever beam structure data set into a training set, a test set and a validation set in the proportion 8:1:1.
3. The cantilever beam structure design method based on the self-attention mechanism neural network as claimed in claim 2, wherein the specific implementation method of step 2) is as follows:
The cantilever beam structure data of the data set from step 1) are screened, and samples with unsatisfactory generated results are selected and deleted from the cantilever beam structure data set.
4. The cantilever beam structure design method based on the self-attention mechanism neural network as claimed in claim 3, wherein the specific implementation method of step 3) is as follows:
The multilayer perceptron reshapes the raw data, converting the one-dimensional boundary condition input vector into a two-dimensional vector conforming to the MMC algorithm's data format, which is passed to the next layer of the model as its input.
5. The cantilever beam structure design method based on the self-attention mechanism neural network as claimed in claim 4, wherein the specific implementation method of step 4) is as follows:
401) taking the two-dimensional vector generated in step 3) as the input of the Transformer model, and combining it with position information using the positionEncoding method;
402) using a multi-layer network with a self-attention mechanism for feature extraction and combination, generating a two-dimensional vector recognizable by the MMC algorithm.
6. The cantilever beam structure design method based on the self-attention mechanism neural network as claimed in claim 5, wherein the specific implementation method of step 5) is as follows:
501) taking the 1st part of the training data as the test fold and the rest as the training data;
502) training the model and computing its loss on the held-out part;
503) repeating the above with a different part held out each time until the neural network converges;
504) taking the average accuracy as the final loss of the model.
7. The cantilever beam structure design method based on the self-attention mechanism neural network as claimed in claim 6, wherein the specific implementation method of step 6) is as follows:
601) training the improved Transformer model improves its feature extraction and generation capabilities; when the test-set loss converges, the model has reached its capacity;
602) the model is used for verification: a portion of data is randomly selected from the validation set as input and fed into the trained model to obtain an output;
603) the output is fed into the MMC drawing function for processing, finally obtaining the generated cantilever beam structure diagram.
CN202110385261.XA 2021-04-09 2021-04-09 Cantilever beam structure design method based on self-attention mechanism neural network Active CN113076583B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110385261.XA CN113076583B (en) 2021-04-09 2021-04-09 Cantilever beam structure design method based on self-attention mechanism neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110385261.XA CN113076583B (en) 2021-04-09 2021-04-09 Cantilever beam structure design method based on self-attention mechanism neural network

Publications (2)

Publication Number Publication Date
CN113076583A 2021-07-06
CN113076583B CN113076583B (en) 2024-04-02

Family

ID=76617561

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110385261.XA Active CN113076583B (en) 2021-04-09 2021-04-09 Cantilever beam structure design method based on self-attention mechanism neural network

Country Status (1)

Country Link
CN (1) CN113076583B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050051515A1 (en) * 2003-09-08 2005-03-10 Lg Electronics Inc. Cantilever microstructure and fabrication method thereof
CN109783910A (en) * 2018-12-29 2019-05-21 西安交通大学 It is a kind of to utilize the optimum structure design method for generating confrontation network acceleration
CN110069800A (en) * 2018-11-17 2019-07-30 华中科技大学 Three-dimensional structure method of topological optimization design and equipment with smooth boundary expression

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ZHANG Miao; MENG Qingfeng; WANG Hongjin: "Optimal design method for piezoelectric cantilever beam structures considering fatigue life", Piezoelectrics & Acoustooptics, no. 01

Also Published As

Publication number Publication date
CN113076583B (en) 2024-04-02

Similar Documents

Publication Publication Date Title
CN109871995B (en) Quantum optimization parameter adjusting method for distributed deep learning under Spark framework
Liu et al. A fault diagnosis intelligent algorithm based on improved BP neural network
CN114781435B (en) Power electronic circuit fault diagnosis method based on improved Harris eagle optimization algorithm optimization variation modal decomposition
CN104200096A (en) Lightning arrester grading ring optimization method based on differential evolutionary algorithm and BP neural network
CN112560603B (en) Underwater sound data set expansion method based on wavelet image
CN112414715B (en) Bearing fault diagnosis method based on mixed feature and improved gray level symbiosis algorithm
CN107566383A (en) A kind of Higher Dimensional Space Time field data live transmission method under limited network bandwidth constraint
Chen et al. An efficient proximal-gradient method for single and multi-task regression with structured sparsity
CN111898316A (en) Construction method and application of super-surface structure design model
CN110619364B (en) Wavelet neural network three-dimensional model classification method based on cloud model
CN114282646B (en) Optical power prediction method and system based on two-stage feature extraction and BiLSTM improvement
CN115470889A (en) Network-on-chip autonomous optimal mapping exploration system and method based on reinforcement learning
CN116451584A (en) Thermal stress prediction method and system based on neural network
Dehuri et al. Genetic feature selection for optimal functional link artificial neural network in classification
Zhang et al. A method based on Lorenz disturbance and variational mode decomposition for wind speed prediction
CN116151135B (en) Electromagnetic simulation method and system for electric large-size target
CN113076583A (en) Cantilever beam structure design method based on self-attention mechanism neural network
CN111598236A (en) Width learning system network model compression method
CN109459609B (en) Distributed power supply frequency detection method based on artificial neural network
Lim et al. Evolutionary optimization with dynamic fidelity computational models
CN115169235A (en) Super surface unit structure inverse design method based on improved generation of countermeasure network
CN111639463B (en) XGboost algorithm-based frequency characteristic prediction method for power system after disturbance
CN113066172A (en) Three-dimensional model classification method based on cloud model and neural network
CN113987924A (en) Complex electromagnetic signal simulation generation method based on target feature self-learning
CN111797564A (en) Method and system for obtaining correlation sample of high-dimensional distributed photovoltaic output

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant