CN117150680A

CN117150680A - Airfoil profile optimization design method based on deep learning and reinforcement learning

Info

Publication number: CN117150680A
Application number: CN202311196497.4A
Authority: CN
Inventors: 刘迎圆; 沈剑雄; 安康
Original assignee: Shanghai Normal University
Current assignee: Shanghai Normal University
Priority date: 2023-09-15
Filing date: 2023-09-15
Publication date: 2023-12-01

Abstract

The invention belongs to the technical field of mechanical optimization design, and discloses an airfoil optimization design method based on deep learning and reinforcement learning, which comprises the steps of constructing an airfoil parameterized design platform based on the deep learning, parameterizing and dimension-reducing a basic airfoil to be optimized into at least four fitting parameters based on the airfoil parameterized design platform, and completing geometric shape modification of the basic airfoil by adjusting the fitting parameters to obtain geometric coordinates of a first airfoil; acquiring an airfoil lift coefficient and a drag coefficient based on an airfoil performance prediction model; based on the wing section optimal design model, interaction with the wing section performance prediction model and the wing section parameterized design platform is realized, a termination condition for reinforcement learning training is set for the wing section optimal design model until a preset termination condition is met, and the optimized geometric coordinates and wing section lift-drag ratio of the first wing section are output.

Description

Airfoil profile optimization design method based on deep learning and reinforcement learning

Technical Field

The invention relates to the technical field of mechanical optimization design, in particular to an airfoil optimization design method based on deep learning and reinforcement learning.

Background

Fluid machinery and equipment such as aircrafts, underwater thrusters, nuclear power station main pumps and the like face high-parameter design requirements. The design of the blade as a core component of such high-end fluid machinery and equipment is related to the performance of the equipment. The airfoil is the basis of blade design, and research on the optimization design of the airfoil has great significance for improving the performance of fluid machinery and equipment.

The optimal design method of the wing profile relates to the parameterized modeling technology of the wing profile and the optimal design method of the wing profile. At present, in the parametric modeling method of the airfoil, design parameters describing the first target are too many, the airfoil optimization design faces the problem of large design space, so that not only can the computational fluid dynamics prediction cost be increased, but also the searching difficulty of an optimization algorithm can be increased. Regarding the optimization design method of the airfoil, the application of some evolutionary algorithms, such as genetic algorithms, to the optimization design research of the airfoil becomes a current research hotspot.

With the development of artificial intelligence, deep learning and reinforcement learning methods are increasingly applied to the field of optimal design of mechanical equipment. Particularly, alpha-go has good performance in the field of go or games, and the application of reinforcement learning in the field of control is one of the most popular research fields in the current machine learning. Therefore, the application research of the deep learning and the reinforcement learning on the aspect of the optimal design of mechanical equipment is imperative, and the problems of too many airfoil parameters and difficult searching of an optimization algorithm can be relieved.

In view of the above, the invention provides an airfoil optimization design method based on deep learning and reinforcement learning.

Disclosure of Invention

In order to overcome the problems in the prior art, the invention provides an airfoil optimal design method based on deep learning and reinforcement learning, which solves the problems of overlarge design space, overlarge calculation cost and large algorithm search space caused by overlarge design parameter dimension in the existing airfoil design. Meanwhile, the problems that the application process is unknown, and the reward value and the training termination condition are difficult to determine due to the fact that the current reinforcement learning technology is applied less in the field of fluid mechanical optimization design are solved, so that the application potential of reinforcement learning is found to a greater extent.

According to one aspect of the present invention, there is provided an airfoil optimization design method based on deep learning and reinforcement learning, comprising the steps of:

selecting a basic airfoil profile to be optimized, and obtaining fitting parameters of the basic airfoil profile through an airfoil profile parameterized design platform based on deep learning: the fitting parameters of the basic wing profile are adjusted to obtain a large number of geometric coordinates of the first wing profile;

constructing an airfoil performance prediction model based on hydrodynamic software Xfoil; based on the geometric coordinates of the first airfoil and preset flow working condition parameters, a lift coefficient and a drag coefficient corresponding to the first airfoil are obtained, and the ratio of the lift coefficient to the drag coefficient is marked as an airfoil lift-drag ratio;

establishing an airfoil optimization design model by adopting an DQN reinforcement learning frame, marking the geometric coordinates of a first airfoil and the airfoil lift-drag ratio as an airfoil optimization state, and setting the rewarding value based on the change of the airfoil lift-drag ratio; the wing profile fitting parameters are used for continuously predicting action strategies based on the first adjustment interval, so that the wing profile fitting parameters reciprocate, and a large number of state-action-rewarding value samples can be obtained; the airfoil optimization design model based on the DQN is learned according to a large number of acquired samples, so that the maximization of the rewarding value is realized;

the network training process is carried out in 2 rounds, each round is preset with a fixed step number, after the fixed step number is reached, the basic wing profile is updated to the optimal wing profile geometry and the fitting parameters thereof obtained in the earlier training process, and the next round of optimization and network training are carried out continuously by taking the optimal wing profile geometry and the fitting parameters as starting points;

and setting a termination condition of reinforcement learning training on the airfoil optimization design model until a preset termination condition is met, and outputting the optimized geometric coordinates of the first airfoil and the airfoil lift-drag ratio.

Preferably, the specific application logic of the airfoil parameterized design platform is:

taking an airfoil image dataset in a public airfoil database as an input value of an airfoil parameterized design platform;

convolutional neural network for extracting airfoil image dataClassifying and compressing the airfoil image data features through a multi-layer perceptron, and describing the dimensionality reduction of the airfoil image data features into N fitting parameters; the fitting parameter is marked as P _n ，P _n ＝P ₀ 、P ₁ 、......、P _N-1 N is more than or equal to 4;

selecting a basic airfoil profile to be optimized, and recovering the geometric coordinates of the basic airfoil profile corresponding to the N fitting parameters to obtain the geometric coordinates of the initialized basic airfoil profile;

and adjusting N fitting parameters of the dimension reduction description based on a first adjustment interval, recovering the geometric coordinates of the adjusted basic airfoil, setting constraint conditions for preventing airfoil deformity, marking the airfoil meeting the constraint conditions as a first airfoil, and outputting the geometric coordinates of the first airfoil. .

Preferably, the fitting parameters of the base airfoil are at least four fitting parameters.

Preferably, the specific logic of the constraint condition for preventing airfoil deformity is:

y _up ＞y _down ；

wherein y is _up Is the ordinate value of the coordinate point of the upper surface of the airfoil; y is _down Is the ordinate value of the airfoil lower surface coordinate point.

Preferably, the specific application logic of the airfoil performance prediction model is:

taking a first target and flow working condition parameters as input values of an airfoil performance prediction model, wherein the flow working condition parameters comprise airfoil attack angles and Reynolds numbers;

and taking the lift-drag ratio of the airfoil as an output value of the airfoil performance prediction model.

Preferably, the airfoil optimization design model specifically includes the following logic:

the optimization state of the wing profile corresponding to each round is Z= [ P, CL, CD ], wherein the fitting parameters of the first wing profile are P, and CL and CD respectively represent the lift coefficient and the drag coefficient of the first wing profile under the corresponding fitting parameters;

determining the next action by the airfoil optimization design model according to the airfoil optimization state corresponding to Z, so as to obtain a fitting parameter set P_next of the next first airfoil;

wherein: the fitting parameter set P of the first airfoil is an N-dimensional vector, namely: p= [ P ] ₀ 、P ₁ 、......、P _N-1 ]N is the number of parameters after the dimension reduction of the airfoil parameters; the action is a vector with the same shape as the fitting parameter set P of the first airfoil, and represents the influence exerted by the action on each parameter, namely: action= [ a ] ₀ 、a ₁ 、......、a _N-1 ]；a ₀ -a _N-1 Different values can be taken as required, and the fixed step number is uniformly sampled with step=0.001 in the design, namely a ₀ -a _N-1 The first adjustment interval of each variable is + -0.001; after determining the action, the next round of airfoil optimization state Z _m+1 The method comprises the following steps:

Z _m+1 ＝[P _next ，CL _next ，CD _next ]

P _next ＝P+action

wherein CL is _next ，CD _next From new parameters P _next Calculated by computational fluid dynamics software.

Preferably, the prize value is set according to the optimization objective and various constraints, and the weight of each prize in the whole Reward is controlled by the parameter alpha, and the specific formula is as follows:

wherein, the optimization objective is: maximizing the lift-drag ratio; the constraint conditions are as follows: cl/Cd > Cl0/Cd0; the lift-drag ratio of the airfoil corresponding to the initial state is Cl0/Cd0; the lift-drag ratio of the wing profile corresponding to the current state is Cl (i)/Cd (i); the lift-drag ratio of the airfoil corresponding to the former state is Cl (i-1)/Cd (i-1); alpha ₁ And alpha is ₂ Is a weight coefficient. .

Preferably, the termination condition of the reinforcement learning training process includes:

if the first wing profile generated in each round has the wing profile malformation, ending the track and returning, and rewarding the value of reward=a;

if the calculation result of the first airfoil profile generated by each round by using the computational fluid dynamics software cannot be converged, ending the track and returning, wherein the reward value is found=b;

if the first wing profile generated in each round does not meet the additional design condition, ending the current track and returning, wherein the reward value is reorder=c;

if the number of steps of the reinforcement learning training exceeds the preset fixed number of steps, terminating the reinforcement learning training.

The wing section optimization design method based on deep learning and reinforcement learning has the technical effects and advantages that:

the invention provides a new thought and solving method for the optimal design of the wing section and even the fluid machinery, and the dimension reduction of the first target design parameter is realized based on a deep learning method, so that the calculation workload and the algorithm search space in the optimal design can be reduced; the real-time optimization design of the wing profile is further realized through the reinforcement learning technology, the real-time interaction between simulation software and actions can be realized, and the total sample size is reduced. Meanwhile, the application of the strengthening technology in the aspect of the optimal design of the fluid machinery belongs to the newer field, and the application process and how to set the rewarding value and the termination condition during the training are all the current difficulties.

Drawings

FIG. 1 is a flow chart of an airfoil optimization design method of the present invention;

FIG. 2 is a network training process of the airfoil optimization design method of the present invention;

FIG. 3 is a graph of network training effects of the airfoil optimization design method of the present invention;

fig. 4 is a window diagram of wing profile parameterized design software based on deep learning construction.

Detailed Description

The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.

Example 1

Referring to fig. 1-2, the method for optimizing design of an airfoil based on deep learning and reinforcement learning according to the present embodiment includes the following steps:

what needs to be explained here is: after fitting parameters of the basic airfoil are adjusted, design optimization of basic airfoil geometry is realized, and an optimized airfoil geometry image is marked as a first airfoil as shown in fig. 4; in practical application, the fitting parameters of the adjusting basic airfoil have an applicable interval, and the applicable interval is marked as a first adjusting interval.

The specific application logic of the airfoil parameterized design platform is as follows:

the method comprises the steps of extracting airfoil image data features by a convolutional neural network, classifying and compressing the airfoil image data features through a multi-layer perceptron, and describing the airfoil image data features into N fitting parameters through parameterization dimension reduction; the fitting parameter is marked as P _n ，P _n ＝P ₀ 、P ₁ 、......、P _N-1 N is more than or equal to 4;

and adjusting N fitting parameters of the dimension reduction description based on a first adjustment interval, recovering the geometric coordinates of the adjusted basic airfoil, setting constraint conditions for preventing airfoil deformity, marking the airfoil meeting the constraint conditions as a first airfoil, and outputting the geometric coordinates of the first airfoil.

Further, the number of the fitting parameters is preferably 6.

What needs to be explained here is: the airfoil parameterized design comprises 6 fitting parameters for describing the geometrical characteristics of the airfoil, wherein the 6 parameters do not belong to the traditional geometrical characteristics of the airfoil such as chord length, thickness, half span, front edge sweepback angle, thickness distribution and the like, but can accurately express the airfoil geometry.

The six fitting parameters can carry out enough flexibility adjustment on the wing profile in the design process, and meanwhile, the design space is not excessively complicated, so that the efficiency and stability of an optimization algorithm can be maintained; the number of parameters may of course also be adjusted according to the requirements and complexity of the particular problem.

In particular exemplary, the present example uses NACA0010 as the base airfoil to be optimized, scaled down to 6 fitting parameters [0.1625, -0.0610,0.1478,0.0179, -0.2006, -0.1182] by a deep-learning based airfoil parametric design platform.

Further, the airfoil parameterized design platform comprises a deep learning model of a combination of a convolutional neural network and a multi-layer perceptron;

furthermore, the specific application logic of the airfoil parameterized design platform is as follows:

what needs to be explained here is: the disclosed airfoil image dataset in the airfoil database is airfoil two-dimensional image data obtained based on conversion of an airfoil coordinate point database disclosed by the university of Ill Baner-Champagne division.

Extracting data features of the airfoil image by using a convolution layer, classifying and compressing the image features by using a multi-layer perceptron, and compressing a first target into 6 fitting parameters;

and 6 fitting parameters described by dimension reduction are adjusted based on a first adjustment interval, the geometric coordinates of the adjusted basic wing profile are recovered, constraint conditions for preventing deformation of the wing profile are set, the wing profile meeting the constraint conditions is marked as a first wing profile, and the geometric coordinates of the first wing profile are output.

What needs to be explained here is: the wing profile parameterized design platform can realize two functions of obtaining 6 dimension reduction parameters of a basic wing profile and adjusting 6 fitting parameters to obtain a large number of geometric coordinates of a first wing profile.

The specific logic of the constraint condition for preventing airfoil profile deformity is as follows:

y _up ＞y _down ；

further, the specific application logic of the airfoil performance prediction model is:

taking the geometric coordinates of the first airfoil and preset flow working condition parameters as input values of an airfoil performance prediction model, wherein the flow working condition parameters comprise attack angles and Reynolds numbers; illustratively, the airfoil angle of attack is selected here to be 3 ° and the reynolds number is 30000; the optimum design is iterated for about 600 rounds in total, and the effect is shown in fig. 3. In fig. 3, where the blue curve represents the airfoil with the greatest lift-to-drag ratio in each pass, the orange curve is the average of all the greatest lift-to-drag ratios for the past 100 passes, and passes less than 100 are not calculated. Through the optimal design, the lift-drag ratio 93 of the final NACA0010 airfoil under the working conditions of 30000 Reynolds number and 3 DEG attack angle is improved to 102, and the improvement ratio is as follows: 9.68%.

further, the airfoil optimization design model specifically includes the following logic:

determining a next action by the airfoil optimization design model according to the airfoil optimization state corresponding to Z, so as to obtain a fitting parameter set P_next of a next first airfoil, as shown in FIG. 3;

Z _m+1 ＝[P _next ，CL _next ，CD _next ]

P _next ＝P+action

Specific examples: the specific application logic of the airfoil optimization design model is as follows:

the wing section optimal design model is a reinforced learning wing section optimal design framework based on DQN, the DQN is an agent built by a convolutional neural network, and the actions of adjusting 6 fitting parameters and the interactions with computational fluid dynamics software xfoil are realized based on the agent;

when training the neural network, the intelligent agent selects the next action according to the corresponding rewarding value of the wing-shaped lift-drag ratio output by the wing-shaped performance prediction model, and interacts with the wing-shaped performance prediction model again to give the next wing-shaped lift-drag ratio after the action and rewards corresponding to the wing-shaped lift-drag ratio;

in this way, a large number of airfoil fitting parameter-action-rewarding value samples are generated in the training process, the neural network is trained by using the samples, network super-parameters are updated continuously, estimation accuracy of the network to rewards is improved, and training is finished when the training meets the termination condition.

What needs to be explained here is: the convolutional neural network is used as an agent and is responsible for selecting actions for adjusting six fitting parameters of the airfoil.

The training of the agent is based on rewards, and the rewards function is generally based on the airfoil lift-drag ratio output by the airfoil performance prediction model. In each training step, the intelligent agent interacts with the wing profile performance prediction model according to the current wing profile parameter configuration, obtains the lift-drag ratio and takes the lift-drag ratio as a rewarding value. This prize value is used to instruct the agent to select the next action.

The prize value is set according to the optimization target and various constraint conditions, and the weight of each prize in the whole Reward is controlled by the parameter alpha, wherein the specific formula is as follows:

wherein, the optimization objective is: maximizing the lift-drag ratio; the constraint conditions are as follows: cl/Cd is more than Cl0/CdO; the lift-drag ratio of the wing profile corresponding to the initial state is ClO/Cd0; the lift-drag ratio of the wing profile corresponding to the current state is Cl (i)/Cd (i); the lift-drag ratio of the airfoil corresponding to the former state is Cl (i-1)/Cd (i-1); alpha ₁ And alpha is ₂ As a weight coefficient, specifically exemplified, alpha ₁ And alpha is ₂ Preferably 0.5, respectively, this value and the specific prizeThe excitation value sub-term can be changed according to the application scene, the optimization target and the constraint condition of the wing profile to be optimized.

And obtaining a large number of airfoil geometric parameter-action-rewarding value samples through an airfoil optimization design model, and continuously adjusting airfoil parameters to obtain new airfoil lift-drag ratio and rewarding value. These experience samples are accumulated and used to train the reinforcement learning network to evaluate the difference between the actual rewards and the estimated rewards. The network weights are updated continuously during the training process to improve the accuracy of estimating the rewards.

The network training process is carried out in 2 rounds, each round is preset with a fixed step number, after the fixed step number is reached, the basic wing profile is updated to the optimal wing profile geometry and the fitting parameters thereof obtained in the earlier training process, and the next round of optimization and network training are carried out continuously by taking the optimal wing profile geometry and the fitting parameters as starting points; .

Specific exemplary, the termination conditions for the reinforcement learning training process include:

if the first wing profile generated in each round has the wing profile malformation condition, ending the current track and returning, and rewarding the value of reward= -100;

if the calculation result of the first airfoil profile generated by each round can not be converged by using the computational fluid dynamics software, ending the current track and returning, and then rewarding the value of the reward= -100;

if the first wing profile generated in each round does not meet the additional design conditions, such as the maximum thickness of the wing profile, the moment coefficient of the wing profile and the like, ending the track and returning, and then rewarding the value of reward= -1;

What needs to be explained here is: such termination conditions may avoid wasting time on improper airfoil configuration and learn an effective optimization strategy faster. Meanwhile, the device can also process the situation that the pneumatic parameters cannot be calculated so as to exit and enter the next round of training in time when a problem occurs. These measures help to improve the efficiency and stability of the model.

The above formulas are all formulas with dimensionality removed and numerical calculation, the formulas are formulas with the latest real situation obtained by software simulation through collecting a large amount of data, and preset parameters and threshold selection in the formulas are set by those skilled in the art according to the actual situation.

The foregoing is merely illustrative of the present invention, and the present invention is not limited thereto, and any person skilled in the art will readily recognize that variations or substitutions are within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Finally: the foregoing description of the preferred embodiments of the invention is not intended to limit the invention to the precise form disclosed, and any such modifications, equivalents, and alternatives falling within the spirit and principles of the invention are intended to be included within the scope of the invention.

Claims

1. The wing section optimal design method based on deep learning and reinforcement learning is characterized by comprising the following steps of:

establishing an airfoil optimization design model by adopting a DQN reinforcement learning frame, and marking the geometric coordinates of a first airfoil and the airfoil lift-drag ratio as an airfoil optimization state; and setting the prize value based on a change in airfoil lift-drag ratio; the wing profile fitting parameters are used for continuously predicting action strategies based on the first adjustment interval, so that the wing profile fitting parameters reciprocate, and a large number of state-action-rewarding value samples can be obtained; the airfoil optimization design model based on the DQN is learned according to a large number of acquired samples, so that the maximization of the rewarding value is realized;

2. The method for optimizing design of an airfoil based on deep learning and reinforcement learning according to claim 1, wherein the specific application logic of the airfoil parameterized design platform is as follows:

the method comprises the steps of extracting airfoil image data features by a convolutional neural network, classifying and compressing the airfoil image data features through a multi-layer perceptron, and describing the airfoil image data feature dimension reduction as N fitting parameters; the fitting parameter is marked as P _n ，P _n ＝P ₀ 、P ₁ 、……、P _N-1 N is more than or equal to 4;

3. The method for optimizing design of an airfoil based on deep learning and reinforcement learning according to claim 2, wherein the fitting parameters of the basic airfoil are at least four fitting parameters.

4. The method for optimizing design of airfoil based on deep learning and reinforcement learning according to claim 2, wherein the specific logic of the constraint condition for preventing airfoil deformity is:

y _up ＞y _down ；

5. The method for optimizing design of an airfoil based on deep learning and reinforcement learning according to claim 4, wherein the specific application logic of the airfoil performance prediction model is as follows:

6. The method for optimizing design of an airfoil based on deep learning and reinforcement learning according to claim 5, wherein the airfoil optimizing design model specifically comprises the following logic:

the optimization state of the wing profile corresponding to each round is Z= [ P, CL, CD ], wherein the fitting parameters of the first wing profile are P, CL and CD respectively represent the lift coefficient and the drag coefficient of the first wing profile under the corresponding fitting parameters;

Z _m+1 ＝[P _next ，CL _next ，CD _next ]

P _next ＝P+action

7. The method for optimizing design of airfoil based on deep learning and reinforcement learning according to claim 6, wherein the prize value is set according to the optimization objective and various constraints, and the weight of each prize in the whole report is controlled by the parameter α, and the specific formula is:

wherein, the optimization objective is: maximizing the lift-drag ratio; the constraint conditions are as follows: CI/Cd>CI0/Cd0; the lift-drag ratio of the airfoil corresponding to the initial state is Cl0/Cd0; the lift-drag ratio of the wing profile corresponding to the current state is Cl (i)/Cd (i); the lift-drag ratio of the airfoil corresponding to the former state is CI (i-1)/Cd (i-1); alpha ₁ And alpha is ₂ Is a weight coefficient.

8. The method for optimizing design of an airfoil based on deep learning and reinforcement learning according to claim 7, wherein the termination condition of the reinforcement learning training process comprises:

if the first wing profile generated in each round has wing profile malformation, ending the track and returning, and rewarding the value of reward=a;