CN111597760B - A method for obtaining the deviation value of gas path parameters under the condition of small sample - Google Patents
A method for obtaining the deviation value of gas path parameters under the condition of small sample Download PDFInfo
- Publication number
- CN111597760B CN111597760B CN202010418279.0A CN202010418279A CN111597760B CN 111597760 B CN111597760 B CN 111597760B CN 202010418279 A CN202010418279 A CN 202010418279A CN 111597760 B CN111597760 B CN 111597760B
- Authority
- CN
- China
- Prior art keywords
- domain
- deviation value
- gas circuit
- parameter deviation
- engine
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 60
- 238000012549 training Methods 0.000 claims abstract description 32
- 230000000694 effects Effects 0.000 claims abstract description 26
- 238000012360 testing method Methods 0.000 claims abstract description 17
- 238000010606 normalization Methods 0.000 claims abstract description 4
- 230000006978 adaptation Effects 0.000 claims description 56
- 238000009826 distribution Methods 0.000 claims description 49
- 238000000605 extraction Methods 0.000 claims description 27
- 238000005457 optimization Methods 0.000 claims description 20
- 230000006870 function Effects 0.000 claims description 17
- 238000013507 mapping Methods 0.000 claims description 12
- 238000005065 mining Methods 0.000 claims description 10
- 235000004257 Cordia myxa Nutrition 0.000 claims description 5
- 244000157795 Cordia myxa Species 0.000 claims description 5
- 230000004913 activation Effects 0.000 claims description 5
- 210000002569 neuron Anatomy 0.000 claims description 4
- 238000005259 measurement Methods 0.000 claims description 3
- 238000002759 z-score normalization Methods 0.000 claims description 3
- 238000012544 monitoring process Methods 0.000 abstract description 11
- 230000036541 health Effects 0.000 abstract description 5
- 230000003044 adaptive effect Effects 0.000 description 30
- 238000013508 migration Methods 0.000 description 17
- 230000005012 migration Effects 0.000 description 17
- 238000012546 transfer Methods 0.000 description 16
- 238000004422 calculation algorithm Methods 0.000 description 15
- 238000013526 transfer learning Methods 0.000 description 9
- 230000008569 process Effects 0.000 description 8
- 238000002474 experimental method Methods 0.000 description 6
- 238000013528 artificial neural network Methods 0.000 description 5
- 230000007246 mechanism Effects 0.000 description 5
- 238000007781 pre-processing Methods 0.000 description 5
- 238000010586 diagram Methods 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 3
- 239000000306 component Substances 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000002485 combustion reaction Methods 0.000 description 1
- 239000008358 core component Substances 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000009827 uniform distribution Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F30/00—Computer-aided design [CAD]
- G06F30/20—Design optimisation, verification or simulation
- G06F30/27—Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/18—Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Computational Mathematics (AREA)
- Mathematical Analysis (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Pure & Applied Mathematics (AREA)
- Mathematical Optimization (AREA)
- Biophysics (AREA)
- Computing Systems (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Operations Research (AREA)
- Probability & Statistics with Applications (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Algebra (AREA)
- Databases & Information Systems (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Medical Informatics (AREA)
- Computer Hardware Design (AREA)
- Geometry (AREA)
- Feedback Control In General (AREA)
Abstract
Description
技术领域technical field
本申请属于航空发动机监控以及健康管理技术领域,涉及一种获取气路参数偏差值的方法,尤其涉及一种实现小样本条件下获取气路参数偏差值的方法。The present application belongs to the technical field of aero-engine monitoring and health management, and relates to a method for obtaining the deviation value of gas path parameters, in particular to a method for obtaining the deviation value of gas path parameters under the condition of small samples.
背景技术Background technique
气路参数监控是航空发动机特别是民航发动机健康管理的重要技术手段。航空发动机作为热机的一种,其核心部件是气路系统部件,例如压气机、燃烧室、涡轮等。气路部件的热力参数反映了发动机的性能状态,常用的气路参数有:EGT、FF、N1、N2等。这些参数以机载设备采集,以ACARS(Aircraft CommunicationAddressing and Reporting System)报文的形式传输到飞机监控基地。航空发动机气路分析法是在飞机在起飞阶段(Take off)和巡航阶段(Cruise)分别由机载监控系统采集状态参数以ACARS报文形式发送给监控基地,利用报文格式对不同的报文进行解析得到原始状态参数,然后利用特定模型计算气路参数的偏差值,接着对该偏差值进行平滑处理,以方便观察其偏差变化趋势,从而了解发动机的性能。最后为了对发动机进行“预诊断”,还需对发动机气路参数偏差值进行预测。可见,建立准确的发动机气路参数偏差值模型是发动机状态监控和故障诊断的前提。Air circuit parameter monitoring is an important technical means for the health management of aero-engines, especially civil aviation engines. As a kind of heat engine, the core components of aero-engines are air system components, such as compressors, combustion chambers, turbines, etc. The thermal parameters of the air path components reflect the performance state of the engine. The commonly used air path parameters are: EGT, FF, N1, N2, etc. These parameters are collected by airborne equipment and transmitted to the aircraft monitoring base in the form of ACARS (Aircraft CommunicationAddressing and Reporting System) messages. The aero-engine gas path analysis method is that the state parameters are collected by the airborne monitoring system in the take-off phase (Take off) and the cruise phase (Cruise) of the aircraft, respectively, and sent to the monitoring base in the form of ACARS messages. The original state parameters are obtained by analysis, and then the deviation value of the gas path parameters is calculated by a specific model, and then the deviation value is smoothed to facilitate the observation of the deviation change trend, so as to understand the performance of the engine. Finally, in order to "pre-diagnose" the engine, it is also necessary to predict the deviation value of the engine gas path parameters. It can be seen that establishing an accurate model of engine gas path parameter deviation value is the premise of engine state monitoring and fault diagnosis.
然而由于民航发动机工况环境复杂、型号多样,导致气路参数偏差值模型缺乏普适性且新机型存在可使用信息量匮乏的问题。即现有技术中缺少实现跨工况、跨机型下建立民航发动机气路参数偏差值回归模型以及相应的实现知识迁移与复用的发动机气路参数偏差值挖掘方法。However, due to the complex working environment and diverse models of civil aviation engines, the model of gas path parameter deviation value lacks universality and the new model has the problem of lack of usable information. That is, the prior art lacks a regression model for the deviation value of civil aviation engine gas path parameters under cross-working conditions and cross-models, and a corresponding method for mining the deviation value of engine gas path parameters for realizing knowledge transfer and reuse.
发明内容SUMMARY OF THE INVENTION
本申请解决的技术问题是航空发动机健康管理领域中的领域自适应问题。本发明的目的在于提供一种实现小样本条件下获取气路参数偏差值的方法。通过将不同机型下已学习到的飞行参数间的关联知识进行相互迁移,实现跨工况、跨机型下建立气路参数偏差值模型,进而获取发动机监控自主性。The technical problem solved by this application is a domain adaptation problem in the field of aero-engine health management. The purpose of the present invention is to provide a method for obtaining the deviation value of gas path parameters under the condition of small sample. By transferring the associated knowledge of flight parameters learned under different aircraft types to each other, a model of gas path parameter deviation value can be established under different operating conditions and aircraft types, and then the autonomy of engine monitoring can be obtained.
为实现上述发明目的,本发明采取以下技术方案。In order to achieve the above purpose of the invention, the present invention adopts the following technical solutions.
一种实现小样本条件下获取气路参数偏差值的方法,包括:A method for obtaining the deviation value of gas path parameters under the condition of small sample, comprising:
步骤1收集源域及目标域航空发动机ACARS数据,构建发动机样本数据集,并将所述发动机样本数据集划分为训练集和测试集;Step 1: Collect source domain and target domain aero-engine ACARS data, construct an engine sample data set, and divide the engine sample data set into a training set and a test set;
步骤2对所述源域与目标域发动机的训练集与测试集的数据进行归一化预处理;Step 2 normalizes the data of the training set and the test set of the source domain and target domain engine;
步骤3构建深度领域自适应气路参数偏差值回归模型,深度领域自适应气路参数偏差值回归模型由特征提取模块、领域自适应模块以及回归模块三部分组成;Step 3: constructing a regression model of the deviation value of the depth domain adaptive gas path parameter, and the depth domain adaptive gas path parameter deviation value regression model is composed of three parts: a feature extraction module, a domain adaptive module and a regression module;
具体包括:Specifically include:
步骤3.1通过堆叠多层Res-BP残差学习模块,构建气路高阶特征提取模块,模型架构及超参数参考大样本条件下建立的气路参数偏差值回归模型;Step 3.1 By stacking multi-layer Res-BP residual learning modules, build a gas path high-order feature extraction module, model architecture and hyperparameters refer to the gas path parameter deviation value regression model established under the condition of a large sample;
步骤3.2构建深度领域自适应模块,堆叠多层多核均值差异适配层度量源域与目标域发动机之间分布差异性;Step 3.2 Build a deep domain adaptation module, stacking multi-layer multi-core mean difference adaptation layers to measure the distribution difference between the source domain and target domain engines;
步骤3.3构建气路参数偏差值回归模块,在深度领域自适应模块后连接回归模块,实现气路参数偏差值回归;Step 3.3 build a regression module of the gas path parameter deviation value, and connect the regression module after the depth domain adaptive module to realize the gas path parameter deviation value regression;
步骤4利用目标域发动机及源域发动机训练集训练上述深度领域自适应气路参数偏差值回归模型;Step 4 uses the target domain engine and the source domain engine training set to train the above-mentioned deep domain adaptive gas path parameter deviation value regression model;
步骤5利用训练好的所述深度领域自适应气路参数偏差值回归模型对所述目标域发动机测试集提取到的发动机样本进行测试;Step 5: Test the engine samples extracted from the target domain engine test set by using the trained deep domain adaptive air path parameter deviation value regression model;
步骤6分析所述深度领域自适应气路参数偏差值回归模型的回归效果,并进行评价,若精度满足要求,则保存当前的深度领域自适应气路参数偏差值回归模型,若不合格则返回步骤1;Step 6: Analyze the regression effect of the regression model of the self-adaptive air path parameter deviation value in the depth field, and evaluate it. If the accuracy meets the requirements, save the current depth field self-adaptive air path parameter deviation value regression model, and return if it is unqualified. step 1;
步骤7获取小样本新机型航空发动机ACARS数据,利用保存的气路参数偏差值回归模型获取气路参数偏差值,得到回归结果。Step 7: Acquire the ACARS data of a small sample of the aero-engine of the new model, and obtain the deviation value of the gas path parameter by using the saved regression model of the deviation value of the gas path parameter, and obtain the regression result.
可选地,步骤2中采用Z-score归一化预处理。Optionally, Z-score normalization preprocessing is used in step 2.
可选地,步骤3.2中,为充分挖掘域不变特征,达到最优深度领域自适应效果,引入领域对抗机制,通过在多层多核均值差异适配层后连接领域判别器以及梯度反转层以实现域混淆最大化。Optionally, in step 3.2, in order to fully exploit the domain invariant features and achieve the optimal depth domain adaptation effect, a domain confrontation mechanism is introduced, and the domain discriminator and the gradient inversion layer are connected after the multi-layer multi-core mean difference adaptation layer. to maximize domain confusion.
可选地,所述特征提取模块通过堆叠三个Res-BP残差学习块实现气路状态特征提取,特征提取模块共十层,包括一个输入层;每个Res-BP残差学习块主要由三个全连接层和一个残差恒等捷径组成;各全连接层的输出首先采用批量正则化,然后通过SELU激活函数对每个神经元进行激活。Optionally, the feature extraction module realizes the feature extraction of the air path state by stacking three Res-BP residual learning blocks, and the feature extraction module has ten layers in total, including one input layer; each Res-BP residual learning block is mainly composed of It consists of three fully-connected layers and a residual identity shortcut; the output of each fully-connected layer first adopts batch regularization, and then activates each neuron through the SELU activation function.
可选地,所述深度领域自适应模块由一个域混淆层和三个MK-MMD域适配层组成;将三层MK-MMD域适配层与特征提取模块连接,利用MK-MMD度量源域与目标域中已提取的高阶特征分布的差异;域混淆层与MK-MMD域适配层连接,包括一个二元领域分类器,其输出即领域标签值。Optionally, the depth domain adaptation module is composed of a domain confusion layer and three MK-MMD domain adaptation layers; the three MK-MMD domain adaptation layers are connected with the feature extraction module, and the MK-MMD metric source is used. The difference in the distribution of the extracted higher-order features in the domain and the target domain; the domain confusion layer is connected with the MK-MMD domain adaptation layer and includes a binary domain classifier whose output is the domain label value.
可选地,所述回归模块与域混淆层连接,主要实现提取的域不变特征与其对应气路参数偏差值之间的映射,最终实现偏差值挖掘。Optionally, the regression module is connected to a domain confusion layer, and mainly realizes the mapping between the extracted domain invariant features and their corresponding gas path parameter deviation values, and finally realizes deviation value mining.
所述深度领域自适应气路参数偏差值回归模型有以下最优化目标:The deep domain adaptive air path parameter deviation value regression model has the following optimization objectives:
最小化源域数据集及目标域数据集上的气路参数偏差值回归误差;Minimize the regression error of the gas path parameter deviation value on the source domain data set and the target domain data set;
最小化源域与目标域之间的分布差异;Minimize the distribution difference between the source and target domains;
以及最大化源域与目标域上的域混淆损失。and maximizing the domain confusion loss on the source and target domains.
相对于现有技术,本发明取得以下有益效果。Compared with the prior art, the present invention achieves the following beneficial effects.
本发明采用Res-BPNN模型深层挖掘航空发动机气路状态高阶特征,堆叠多层多核最大均值差异MK-MMD适配层,进而将提取到的高阶特征映射到RKHS中进行差异度量。为进一步降低各机型气路状态特征概率分布差异,引入基于对抗机制的最大化领域混淆方法,通过在多层领域特征适配层后连接域混淆层以实现最大化域混淆损失,使得不同领域间学习到的特征分布尽可能相近,从而混淆目标发动机当前状态下已学习到的特征分布来源,进而深度挖掘域不变特征,达到最优深度领域自适应效果。本发明成功解决了民航发动机工况环境复杂、型号多样导致气路参数偏差值模型缺乏普适性且新机型存在可使用信息量匮乏的问题,通过将不同机型下已学习到的飞行参数间的关联知识进行相互迁移,实现跨工况、跨机型下建立气路参数偏差值模型,进而获取发动机监控自主性。The invention uses the Res-BPNN model to deeply mine the high-order features of the air path state of the aero-engine, stacks multi-layer multi-core maximum mean difference MK-MMD adaptation layers, and then maps the extracted high-order features to the RKHS for difference measurement. In order to further reduce the difference in the probability distribution of air path state features of various models, a maximizing domain confusion method based on adversarial mechanism is introduced. The learned feature distributions are as close as possible, so as to confuse the source of the learned feature distribution in the current state of the target engine, and then deeply mine the domain invariant features to achieve the optimal depth domain self-adaptation effect. The invention successfully solves the problems that the air path parameter deviation value model lacks universality due to the complex working conditions and diverse models of civil aviation engines and the lack of usable information for new models. The related knowledge between them can be transferred to each other to realize the establishment of a gas path parameter deviation value model under cross working conditions and cross models, and then obtain the autonomy of engine monitoring.
附图说明Description of drawings
图1为本申请实施例一的整体流程图;Fig. 1 is the overall flow chart of the first embodiment of the application;
图2为本申请实施例一最大化领域混淆示意图;2 is a schematic diagram of maximizing field confusion in Embodiment 1 of the present application;
图3为本申请实施例一基于Res-BPNN的深度领域自适应回归模型总体框架;Fig. 3 is the overall framework of the deep domain adaptive regression model based on Res-BPNN in the first embodiment of the application;
图4为本申请实施例二网络训练过程中的MK-MMD及损失值随迭代次数的变化图;FIG. 4 is a graph showing the variation of MK-MMD and loss value with the number of iterations in the network training process of Embodiment 2 of the present application;
图5为本申请实施例二迁移任务A的DEGT回归效果图(无监督方法);Fig. 5 is the DEGT regression effect diagram (unsupervised method) of the transfer task A of the second embodiment of the application;
图6为本申请实施例二迁移任务A的DEGT回归效果图(有监督方法)。FIG. 6 is a DEGT regression effect diagram (supervised method) of the transfer task A in the second embodiment of the present application.
具体实施方式Detailed ways
本申请实施例提供的方案中,所描述的实施例仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其它实施例,都属于本申请保护的范围。In the solutions provided in the embodiments of the present application, the described embodiments are only a part of the embodiments of the present application, rather than all the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without creative work fall within the protection scope of the present application.
为了更好的理解上述技术方案,下面通过附图以及具体实施例对本申请技术方案做详细的说明,应当理解本申请实施例以及实施例中的具体特征是对本申请技术方案的详细的说明,而不是对本申请技术方案的限定,在不冲突的情况下,本申请实施例以及实施例中的技术特征可以相互组合。In order to better understand the above technical solutions, the technical solutions of the present application will be described in detail below through the accompanying drawings and specific embodiments. It is not a limitation on the technical solutions of the present application, and the embodiments of the present application and the technical features in the embodiments may be combined with each other under the condition of no conflict.
实施例一Example 1
本实施例提供的一种获取气路参数偏差值的方法流程图如图1所示,A flow chart of a method for obtaining a deviation value of a gas path parameter provided by this embodiment is shown in FIG. 1 ,
该方法包括:The method includes:
步骤1收集源域及目标域航空发动机ACARS数据,构建发动机样本数据集,并将所述发动机样本数据集划分为训练集和测试集;通常按80%训练集与20%测试集进行划分;Step 1: Collect the source domain and target domain aero-engine ACARS data, construct an engine sample data set, and divide the engine sample data set into a training set and a test set; usually 80% of the training set and 20% of the test set are divided;
步骤2对所述源域与目标域发动机的训练集与测试集的数据进行归一化预处理;本实施例中,可以采用从民航公司获取的ACARS数据直接进行样本归一化;In step 2, normalization preprocessing is performed on the data of the training set and the test set of the engine in the source domain and the target domain; in this embodiment, the ACARS data obtained from the civil aviation company can be used to directly perform sample normalization;
步骤3构建深度领域自适应气路参数偏差值回归模型,深度领域自适应气路参数偏差值回归模型由特征提取模块、领域自适应模以及回归模块三部分组成;具体包括:Step 3 constructs a regression model of the deviation value of the air path parameter of the depth field adaptation, and the regression model of the deviation value of the air path parameter of the depth field adaptation is composed of three parts: a feature extraction module, a field adaptation module and a regression module; it specifically includes:
步骤3.1通过堆叠多层Res-BP残差学习模块,构建气路高阶特征提取模块,模型架构及超参数参考大样本条件下建立的气路参数偏差值回归模型;Step 3.1 By stacking multi-layer Res-BP residual learning modules, build a gas path high-order feature extraction module, model architecture and hyperparameters refer to the gas path parameter deviation value regression model established under the condition of a large sample;
步骤3.2构建深度领域自适应模块,堆叠多层多核均值差异适配层度量源域与目标域发动机之间分布差异性;Step 3.2 Build a deep domain adaptation module, stacking multi-layer multi-core mean difference adaptation layers to measure the distribution difference between the source domain and target domain engines;
步骤3.3构建气路参数偏差值回归模块,在深度领域自适应模块后连接回归模块,实现气路参数偏差值回归;Step 3.3 build a regression module of the gas path parameter deviation value, and connect the regression module after the depth domain adaptive module to realize the gas path parameter deviation value regression;
步骤4利用目标域发动机及源域发动机训练集训练上述深度领域自适应气路参数偏差值回归模型;Step 4 uses the target domain engine and the source domain engine training set to train the above-mentioned deep domain adaptive gas path parameter deviation value regression model;
步骤5利用训练好的所述深度领域自适应气路参数偏差值回归模型对所述目标域发动机测试集提取到的发动机样本进行测试;Step 5: Test the engine samples extracted from the target domain engine test set by using the trained deep domain adaptive air path parameter deviation value regression model;
步骤6分析所述深度领域自适应气路参数偏差值回归模型的回归效果,并进行评价,若精度满足要求,则保存当前的深度领域自适应气路参数偏差值回归模型,若不合格则返回步骤1;Step 6: Analyze the regression effect of the regression model of the self-adaptive air path parameter deviation value in the depth field, and evaluate it. If the accuracy meets the requirements, save the current depth field self-adaptive air path parameter deviation value regression model, and return if it is unqualified. step 1;
步骤7获取小样本新机型航空发动机ACARS数据,利用保存的气路参数偏差值回归模型获取气路参数偏差值,得到回归结果。Step 7: Acquire the ACARS data of a small sample of the aero-engine of the new model, and obtain the deviation value of the gas path parameter by using the saved regression model of the deviation value of the gas path parameter, and obtain the regression result.
步骤2中采用Z-score归一化预处理。In step 2, Z-score normalization preprocessing is used.
步骤3.2中,为充分挖掘域不变特征,达到最优深度领域自适应效果,引入领域对抗值,通过在多层多核均值差异适配层后连接领域判别器以及梯度反转层以实现域混淆最大化。In step 3.2, in order to fully exploit the domain invariant features and achieve the optimal depth domain adaptation effect, domain adversarial values are introduced, and domain confusion is achieved by connecting the domain discriminator and gradient inversion layer after the multi-layer multi-core mean difference adaptation layer. maximize.
特征提取模块通过堆叠三个Res-BP残差学习块实现气路状态特征提取,特征提取模块共十层,包括一个输入层;每个Res-BP残差学习块主要由三个全连接层和一个残差恒等捷径组成;各全连接层的输出首先采用批量正则化,然后通过SELU激活函数对每个神经元进行激活。The feature extraction module realizes the feature extraction of the air path state by stacking three Res-BP residual learning blocks. The feature extraction module has ten layers in total, including one input layer; each Res-BP residual learning block is mainly composed of three fully connected layers and It consists of a residual identity shortcut; the output of each fully connected layer is first batch regularized, and then each neuron is activated through the SELU activation function.
深度领域自适应模块由一个域混淆层和三个MK-MMD域适配层组成;将三层MK-MMD域适配层与特征提取模块连接,利用MK-MMD度量源域与目标域中已提取的高阶特征分布的差异;域混淆层与MK-MMD域适配层连接,包括一个二元领域分类器,其输出即领域标签值。The deep domain adaptation module consists of a domain confusion layer and three MK-MMD domain adaptation layers; the three-layer MK-MMD domain adaptation layer is connected with the feature extraction module, and the MK-MMD is used to measure the range of the source domain and the target domain. Differences in the distribution of extracted higher-order features; the domain confusion layer is concatenated with the MK-MMD domain adaptation layer, including a binary domain classifier whose output is the domain label value.
回归模块与域混淆层连接,主要实现提取的域不变特征与其对应气路参数偏差值之间的映射,最终实现偏差值挖掘。The regression module is connected with the domain confusion layer, which mainly realizes the mapping between the extracted domain invariant features and their corresponding gas path parameter deviation values, and finally realizes deviation value mining.
深度领域自适应方法的思想在于匹配源域与目标域之间的特征分布进而学习域不变特征。所谓学习域不变特征是指无论从源域数据或是目标域数据中学习到的特征都应该服从相同或几乎相同的特征分布。如果特征具有域不变特性,则可利用该特征有效地实现目标域数据的气路参数偏差值挖掘任务,因此学习域不变特征是实现知识迁移与复用的关键所在。以下分别从领域间差异性度量方法与域混淆方法对深度自适应问题展开进一步描述。The idea of deep domain adaptation methods is to match the feature distribution between the source domain and the target domain to learn domain-invariant features. The so-called learning domain invariant features means that the features learned from the source domain data or the target domain data should obey the same or almost the same feature distribution. If the feature has domain invariant characteristics, the feature can be used to effectively realize the task of mining the deviation value of air path parameters in the target domain data. Therefore, learning domain invariant features is the key to realize knowledge transfer and reuse. The following is a further description of the deep adaptation problem from the method of measuring the dissimilarity between domains and the method of domain confusion.
最大均值差异(Maximum Mean Discrepancy,MMD)主要通过将两个不同分布映射到再生核希尔伯特空间(Reproducing Kernel Hilbert Space,RKHS)中,并度量两个分布之间差异性。The Maximum Mean Discrepancy (MMD) mainly measures the difference between the two distributions by mapping two different distributions into the Reproducing Kernel Hilbert Space (RKHS).
首先,定义属于s分布的样本集合以及属于t分布的样本集合定义样本空间的连续函数集F,其连续映射f:χ→R,则两个分布之间的MMD,如公式(3-1)所示。First, define the set of samples belonging to the s-distribution and the set of samples belonging to the t-distribution Define the continuous function set F of the sample space, and its continuous mapping f:χ→R, then the MMD between the two distributions is shown in formula (3-1).
式中:Es[·]——代表分布s的数学期望;In the formula: E s [ ]——represents the mathematical expectation of the distribution s;
Et[·]——代表分布t的数学期望。E t [ ] - represents the mathematical expectation of the distribution t.
假设样本集合XS和XT分别是从分布s和t通过独立同分布采样获得的两个样本集合,样本容量分别为m及n。基于XS和XT可以得到MMD的经验估计,如公式(3-2)所示。Assume that the sample sets X S and X T are two sample sets obtained by independent and identical distribution sampling from the distributions s and t, respectively, and the sample sizes are m and n, respectively. An empirical estimate of MMD can be obtained based on X S and X T , as shown in formula (3-2).
其次,定义H表示RHKS空间,并F约束为RHKS空间内的单位球。连续映射f:χ→R,对任意x∈χ,函数f(x)如公式(3-3)所示。Second, define H to represent the RHKS space, and F to be constrained to be a unit sphere within the RHKS space. Continuous mapping f:χ→R, for any x∈χ, the function f(x) is shown in formula (3-3).
式中:——代表映射:χ→H,并且f与的内积定义为核函数,如高斯核函数: where: ——represents the mapping: χ→H, and f is equal to The inner product of is defined as a kernel function, such as a Gaussian kernel function:
因此,通过上述两个定义,可知s分布以及t分布在再生核希尔伯特空间下的MMD如公式(3-4)所示。Therefore, through the above two definitions, it can be known that the MMD of the s distribution and the t distribution in the regenerated kernel Hilbert space is shown in formula (3-4).
式中:μs=Es[f(XS)],μt=Et[f(XT)]。In the formula: μ s =E s [f(X S )], μ t =E t [f(X T )].
最后,为了便于计算,通常采用MMD的平方形式,如公式(3-5)。Finally, in order to facilitate the calculation, the square form of MMD is usually adopted, such as formula (3-5).
式中:m、n——分别代表源域数据集XS及目标域数据集XT的样本规模。当XS与XS的分布差异性越小,则MMD距离越小,当且仅当s与t服从统一分布时,MMD=0。In the formula: m, n——represent the sample size of the source domain dataset X S and the target domain dataset X T , respectively. When the distribution difference between X S and X S is smaller, the MMD distance is smaller, if and only if s and t obey the uniform distribution, MMD=0.
深度适配网络(Deep Adaptation Networks)可采用MMD变体算法,即多核最大均值差异(Multi-Kernel Maximum Mean Discrepancy,MK-MMD),能够更好的实现领域适配,增强神经网络的特征表示能力。Deep Adaptation Networks can use the MMD variant algorithm, namely Multi-Kernel Maximum Mean Discrepancy (MK-MMD), which can better achieve domain adaptation and enhance the feature representation capability of neural networks. .
首先,给出MK-MMD的定义,分布s以及分布t之间MK-MMD距离的经验估计如公式(3-6)所示。First, the definition of MK-MMD is given, and the empirical estimation of the MK-MMD distance between distribution s and distribution t is shown in formula (3-6).
式中:Hk——代表具有特征核k的RKHS;where: H k ——represents the RKHS with characteristic kernel k;
f(·)——代表连续映射函数;f( )——represents a continuous mapping function;
Es[·]——代表分布s的数学期望;E s [ ]——represents the mathematical expectation of the distribution s;
Et[·]——代表分布t的数学期望。E t [ ] - represents the mathematical expectation of the distribution t.
MK-MMD中的特征内核f(·),k(x,y)=<f(x),f(y)>,多核具体定义为具有m个半正定内核ku的凸组合,如公式(3-7)所示。The feature kernels f( ), k (x, y)=<f(x), f(y)> in MK-MMD, the multi-kernel is specifically defined as a convex combination with m positive semi-definite kernels ku, such as formula ( 3-7).
式中:——为约束系数,以保证不同分布之间进行领域适配时存在多内核k独有特性。where: - is a constraint coefficient to ensure that there are unique characteristics of multi-kernel k when performing domain adaptation between different distributions.
由于不同特征分布在深度神经网络进行学习时会发生变化,因此无法确定何种核函数能够表现出更强的映射能力,而基于MK-MMD的多内核k可以通过不同的核函数增强特征分布间的适配性,达到最优,最合理的核函数选择。Since different feature distributions will change when the deep neural network is learning, it is impossible to determine which kernel function can show stronger mapping ability, and the multi-kernel k based on MK-MMD can enhance the feature distribution through different kernel functions. The adaptability to achieve the optimal and most reasonable kernel function selection.
本实施例为进一步减少从不同领域学习到的特征之间的边缘分布差异,引入了基于对抗机制的领域混淆方法,其原理图如图2所示。In this embodiment, in order to further reduce the marginal distribution difference between features learned from different domains, a domain confusion method based on an adversarial mechanism is introduced, the schematic diagram of which is shown in FIG. 2 .
本实施例为实现最大化领域混淆,在提取的深度特征层之后添加一层领域混淆层,即判断通过训练后的某一样本特征分布是来源于源域还是目标域。提取的特征越是能够体现领域间共性,领域混淆效果则越好。如果采用Res-BP神经网络提取的深度特征无法被已训练领域分类器判别是来源于源域还是目标域的样本时,则称该深度特征是具有域不变特性的。最优效果的领域分类器D可通过优化公式(3-8)。In this embodiment, in order to maximize the domain confusion, a domain confusion layer is added after the extracted depth feature layer, that is, it is determined whether the feature distribution of a certain sample after training comes from the source domain or the target domain. The more the extracted features can reflect the commonality between domains, the better the domain confusion effect will be. If the deep features extracted by the Res-BP neural network cannot be discriminated by the trained domain classifier whether it is a sample from the source domain or the target domain, the deep feature is said to have domain-invariant characteristics. The optimal effect of the domain classifier D can be optimized by formula (3-8).
式中:where:
m——代表训练样本批量大小;m——represents the batch size of training samples;
gi——代表第i个样本的真实域标签,gi=0代表样本xi来源于源域,gi=1代表样本xi来源于目标域;gi ——represents the true domain label of the ith sample, gi = 0 represents that the sample xi comes from the source domain, and gi =1 represents that the sample xi comes from the target domain;
D(xi)——代表样本xi经过域分类器后输出的标签值。D( xi )——represents the label value output by the sample xi after passing through the domain classifier.
本实施例提供的基于Res-BPNN的深度领域自适应回归模型由特征提取模块、领域自适应模块以及回归模块三部分组成。高阶特征提取模块采用Res-BPNN网络结构,通过堆叠N个Res-BP残差学习块进行气路状态特征提取。将深度领域自适应模块与特征提取模块相连接,进而学习域不变特征。最后添加回归模块,构成基于Res-BPNN的深度领域自适应回归模型框架,如图3所示。The deep domain adaptive regression model based on Res-BPNN provided in this embodiment is composed of three parts: a feature extraction module, a domain adaptation module and a regression module. The high-order feature extraction module adopts the Res-BPNN network structure to extract the air path state features by stacking N Res-BP residual learning blocks. The deep domain adaptation module is connected with the feature extraction module to learn domain invariant features. Finally, a regression module is added to form a deep domain adaptive regression model framework based on Res-BPNN, as shown in Figure 3.
(1)特征提取模块:通过堆叠三个Res-BP残差学习块实现气路状态特征提取,特征提取模块共十层,包括一个输入层。每个Res-BP残差学习块主要由三个全连接层和一个残差恒等捷径组成。各全连接层的输出首先采用批量正则化,然后通过SELU激活函数对每个神经元进行激活。(1) Feature extraction module: The feature extraction of air path state is realized by stacking three Res-BP residual learning blocks. The feature extraction module has ten layers in total, including one input layer. Each Res-BP residual learning block mainly consists of three fully connected layers and a residual identity shortcut. The output of each fully connected layer is first batch regularized, and then each neuron is activated through the SELU activation function.
(2)深度领域自适应模块:深度领域自适应模块由一个域混淆层和三个MK-MMD域适配层组成。将三层MK-MMD域适配层与特征提取模块连接,利用MK-MMD度量源域与目标域中已提取的高阶特征分布的差异。域混淆层与MK-MMD域适配层连接,包括一个二元领域分类器,其输出即领域标签值。(2) Deep domain adaptation module: The deep domain adaptation module consists of one domain confusion layer and three MK-MMD domain adaptation layers. The three-layer MK-MMD domain adaptation layer is connected with the feature extraction module, and the MK-MMD is used to measure the difference of the extracted higher-order feature distributions in the source and target domains. The domain confusion layer is connected with the MK-MMD domain adaptation layer, including a binary domain classifier whose output is the domain label value.
(3)回归模块:回归模块与域混淆层连接,主要实现提取的域不变特征与其对应气路参数偏差值之间的映射,最终实现偏差值挖掘。(3) Regression module: The regression module is connected with the domain confusion layer, and mainly realizes the mapping between the extracted domain invariant features and their corresponding gas path parameter deviation values, and finally realizes deviation value mining.
目前,航空公司引入新型号民航发动机,且仅拥有该机型小批量有标签数据,为了在短期内建立该型号民航发动机气路参数偏差值模型,因此需要充分利用有标签数据资源,本实施例将在传统无监督领域自适应迁移学习的最终优化目标中引入小批量目标域有标签数据的回归误差损失。为了便于与其他迁移学习算法进行对比,按照目标域有标签数据的回归损失是否被引入最终优化目标中,将提出方法分为提出方法(有监督)和提出方法(无监督)。At present, the airline has introduced a new model of civil aviation engine, and only has the labelled data of this model in small batches. In order to establish a model of the gas path parameter deviation value of this model of civil aviation engine in a short period of time, it is necessary to make full use of the labelled data resources. This embodiment The regression error loss of the labeled data in the mini-batch target domain will be introduced into the final optimization objective of adaptive transfer learning in the traditional unsupervised domain. In order to facilitate comparison with other transfer learning algorithms, the proposed methods are divided into proposed methods (supervised) and proposed methods (unsupervised) according to whether the regression loss of labeled data in the target domain is introduced into the final optimization objective.
因此,本实施例所提出的基于Res-BPNN的深度领域自适应回归模型最终有如下三个最优化目标:Therefore, the deep domain adaptive regression model based on Res-BPNN proposed in this embodiment finally has the following three optimization objectives:
(1)为了实现航空发动机气路参数偏差值挖掘,基于Res-BPNN的深度领域自适应回归模型必须实现气路状态特征提取以及偏差值挖掘。因此,深度领域自适应回归模型的第一个优化目标就是最小化源域数据集及目标域数据集上的气路参数偏差值回归误差,回归误差的损失函数定义为标准MSE损失函数,如公式(3-9)所示。(1) In order to realize the deviation value mining of aero-engine gas path parameters, the deep domain adaptive regression model based on Res-BPNN must realize the feature extraction of the gas path and the deviation value mining. Therefore, the first optimization goal of the deep domain adaptive regression model is to minimize the regression error of the deviation value of the gas path parameters on the source domain data set and the target domain data set. The loss function of the regression error is defined as the standard MSE loss function, such as the formula (3-9).
式中:where:
nS、nT——代表一个批次训练集中的源域数据及目标域数据数量;n S , n T ——represent the number of source domain data and target domain data in a batch training set;
yS、yT——代表源域样本与目标域样本气路参数偏差值真实值;y S , y T ——represent the true value of the gas path parameter deviation between the source domain sample and the target domain sample;
——代表源域样本与目标域样本气路参数偏差值的回归值; ——Regression value representing the deviation of the gas path parameters between the source domain sample and the target domain sample;
β——代表目标域有标签数据训练权重。当且仅当β=0时,最终优化目标中不引入目标域有标签数据集。β—represents the training weights for labeled data in the target domain. If and only if β = 0, no labeled datasets in the target domain are introduced into the final optimization objective.
(2)深度领域自适应模块用来学习域不变特征,如图3-2所示,主要包括域混淆层以及多层MK-MMD域适配层。MK-MMD域适配层主要是为了度量不同领域学习到的特征之间的分布差异。因此,深度领域自适应回归模型的第二个优化目标是最小化源域与目标域之间的分布差异。采用MK-MMD度量源域与目标域之间高阶特征的分布差异,MK-MMD损失函数如公式(3-10)所示。(2) The deep domain adaptation module is used to learn domain invariant features, as shown in Figure 3-2, mainly including domain confusion layer and multi-layer MK-MMD domain adaptation layer. The MK-MMD domain adaptation layer is mainly to measure the distribution difference between the learned features in different domains. Therefore, the second optimization goal of the deep domain adaptive regression model is to minimize the distribution difference between the source and target domains. MK-MMD is used to measure the distribution difference of high-order features between the source domain and the target domain. The MK-MMD loss function is shown in formula (3-10).
式中:where:
——分别代表第i层适配层的源域与目标域特征表示; ——represent the source domain and target domain feature representation of the i-th adaptation layer;
XS、XT——分别代表源域数据集及目标域数据集;X S and X T - represent the source domain dataset and the target domain dataset, respectively;
li——代表第i层适配层;l i ——represents the i-th adaptation layer;
Hk——代表具有特征核k的RKHS。H k - represents the RKHS with feature kernel k.
(3)为进一步降低各机型气路状态特征概率分布差异,采用对抗机制的最大化领域混淆方法,使得不同领域间学习到的特征分布尽可能相近,从而混淆目标发动机当前状态下已学习到的特征分布来源,进而深度挖掘域不变特征。因此,深度领域自适应回归模型的第三个优化目标是最大化源域与目标域上的域混淆损失,域分类损失函数如公式(3-11)所示。(3) In order to further reduce the difference in the probability distribution of the state characteristics of the gas path of each model, the maximum domain confusion method of the confrontation mechanism is adopted to make the learned feature distributions between different domains as close as possible, so as to confuse the current state of the target engine. source of feature distribution, and then deeply mine domain-invariant features. Therefore, the third optimization objective of the deep domain adaptive regression model is to maximize the domain confusion loss on the source and target domains, and the domain classification loss function is shown in Equation (3-11).
式中:——代表域混淆层上已学习的特征表示。where: — represents the learned feature representation on the domain confusion layer.
通过结合MSE回归损失、MK-MMD损失以及域混淆损失,可得到基于Res-BPNN的深度领域自适应回归模型的最终优化目标如公式(3-12)所示。By combining the MSE regression loss, the MK-MMD loss and the domain confusion loss, the final optimization objective of the deep domain adaptive regression model based on Res-BPNN can be obtained as shown in formula (3-12).
L=Lreg+λLMK-MMD+μLdomain (3-12)L=L reg + λL MK-MMD + μL domain (3-12)
式中:where:
LMK-MMD——代表域适配损失;L MK-MMD - represents the domain adaptation loss;
Lreg——代表回归损失;L reg - represents regression loss;
Ldomain——代表域混淆损失;L domain — represents the domain confusion loss;
λ、μ——代表网络超参数,用于控制深度领域自适应性的强弱;λ, μ——represent network hyperparameters, which are used to control the strength of adaptability in the depth domain;
当模型训练完成时,如果学习到的高阶特征具有模糊的领域类别和较小的领域间差异性,气路参数偏差值回归模块就能够准确地对目标域样本进行偏差值挖掘。When the model training is completed, if the learned high-order features have ambiguous domain categories and small differences between domains, the airway parameter deviation value regression module can accurately mine the deviation value of the target domain samples.
实施例二Embodiment 2
本实施例二利用民航发动机历史巡航数据对实施例一的技术方案进行实验验证。以下从数据选样与预处理、超参数设置、模型性能对比方面进行详细描述。In the second embodiment, the technical solution of the first embodiment is experimentally verified by using the historical cruise data of the civil aviation engine. The following is a detailed description of data sampling and preprocessing, hyperparameter settings, and model performance comparison.
为充分验证实施例一提出的基于Res-BPNN的深度领域自适应回归模型在民航发动机气路参数偏差值挖掘领域的实例应用以及模型的普遍适用性,本实施例从GE公司生产的CFM56-5B2/3以及CFM56-7B26两台不同型号的民航发动机中分别获取两组数据集分别应用于两次气路参数偏差值挖掘实验。两组迁移学习任务分别是迁移任务A:CFM56-5B2/3→CFM56-7B26以及迁移任务B:CFM56-7B26→CFM56-5B2/3。In order to fully verify the application of the deep domain adaptive regression model based on Res-BPNN proposed in the first embodiment in the field of civil aviation engine gas path parameter deviation value mining and the general applicability of the model, this embodiment is from CFM56-5B2 produced by GE. /3 and CFM56-7B26 two different types of civil aviation engines, respectively, two sets of data sets were obtained and applied to two gas path parameter deviation value mining experiments. The two sets of transfer learning tasks are transfer task A: CFM56-5B2/3→CFM56-7B26 and transfer task B: CFM56-7B26→CFM56-5B2/3.
迁移任务A(CFM56-5B2/3→CFM56-7B26)表示:首先通过从已收集到的CFM56-5B2型号民航发动机的有标签数据进行有监督训练,并同时辅助CFM56-7B26型号民航发动机上获取的小批量数据进行领域自适应的知识迁移。迁移任务B(CFM56-7B26→CFM56-5B2/3)则是将迁移任务CFM56-5B2/3→CFM56-7B26中的源域数据集以及目标域数据集互换并重新进行迁移实验,进而验证模型的有效性。两组迁移学习实验的部分数据及数据集分配比例如表3-1及表3-2所示。Migration task A (CFM56-5B2/3→CFM56-7B26) means: firstly carry out supervised training from the collected labeled data of the CFM56-5B2 civil aviation engine, and assist the CFM56-7B26 civil aviation engine at the same time. Domain-adaptive knowledge transfer with small batches of data. Migration task B (CFM56-7B26→CFM56-5B2/3) is to exchange the source domain dataset and target domain dataset in the migration task CFM56-5B2/3→CFM56-7B26 and re-run the migration experiment to verify the model effectiveness. Table 3-1 and Table 3-2 show some data and data set distribution ratios of the two groups of transfer learning experiments.
如表3-1所示,本实施例所用CFM56-5B2/3以及CFM56-7B26两台不同型号的民航发动机历史巡航数据同样需要消除参数之间的量纲影响,因此需要对其进行标准化预处理,采用Z-score标准化预处理方法,以便样本参数之间具有可比性,并且能够提高网络训练速率,其转换公式如式(2-9)所示。As shown in Table 3-1, the historical cruise data of two different types of civil aviation engines, CFM56-5B2/3 and CFM56-7B26 used in this example also need to eliminate the dimensional influence between parameters, so it needs to be standardized and preprocessed , using the Z-score standardized preprocessing method, so that the sample parameters are comparable, and can improve the network training rate, the conversion formula is shown in formula (2-9).
表3-1两台不同型号发动机部分数据Table 3-1 Partial data of two different types of engines
表3-2两组不同迁移学习任务的数据集分配Table 3-2 Dataset assignments for two groups of different transfer learning tasks
气路参数偏差值回归模型的输入与输出的对应关系,如公式(3-13)所示。The corresponding relationship between the input and output of the regression model of the gas path parameter deviation value is shown in formula (3-13).
式中:where:
ID-[·]——代表各气路参数偏差值对应的输入;I D- [ ]——represents the input corresponding to the deviation value of each gas path parameter;
OD-——代表各气路参数偏差值的输出;O D- ——represents the output of the deviation value of each gas path parameter;
xi——代表第i个飞行参数的测量值。x i — represents the measured value of the i-th flight parameter.
优化方法采用随机梯度下降算法SGD,定义θf,θd,θreg分别是特征提取器,域分类器以及最终回归器的优化参数,可将公式(3-12)改写成公式(3-14)。The optimization method adopts the stochastic gradient descent algorithm SGD, and defines θ f , θ d , θ reg as the optimization parameters of the feature extractor, the domain classifier and the final regressor, respectively. The formula (3-12) can be rewritten as formula (3-14) ).
基于随机梯度下降算法SGD及公式(3-14),参数θf,θd,θreg更新过程可写成公式(3-15)。Based on the stochastic gradient descent algorithm SGD and formula (3-14), the updating process of parameters θ f , θ d , and θ reg can be written as formula (3-15).
式中:α——代表神经网络学习率,可通过公式(3-16)进行自调整。In the formula: α——represents the learning rate of the neural network, which can be self-adjusted by formula (3-16).
式中:where:
epoch、epochs——分别代表网络已完成训练次数以及设定需完成的训练次数;epoch, epochs - represent the number of times the network has completed training and set the number of training times to be completed;
α0、β、δ——分别代表常数,此处取0.01,0.75以及10;α 0 , β, δ——represent constants, here are 0.01, 0.75 and 10;
t——代表网络训练进度,从0至1线性变化。t——represents the network training progress, which varies linearly from 0 to 1.
公式(3-13)中的超参数λ及μ的自调节过程与学习率的自调整过程相似,该方法使得域混淆层及域适配层在训练过程的早期阶段对噪声信号的敏感度较低,其自调整公式如公式(3-17)所示。The self-adjustment process of the hyperparameters λ and μ in Eq. (3-13) is similar to the self-adjustment process of the learning rate. This method makes the domain confusion layer and the domain adaptation layer more sensitive to noise signals in the early stage of the training process. low, its self-adjusting formula is shown in formula (3-17).
式中:γ——代表常数,此处取为10。In the formula: γ——represents a constant, which is taken as 10 here.
本模型优化器采用SGD,模型训练批量大小(Batch-size)设置为100,迭代次数(Epochs)设置为1000,动量比(Momentum)设置为0.9,激活函数采用SELU,初始权重和偏置均为默认值。本实施例中,特征提取模块采用三个Res-BP残差学习模块堆叠构成,深度领域自适应模块采用三层MK-MMD领域适配层以及一层域混淆层构成,进而充分挖掘源域与目标域之间域不变特征。The model optimizer uses SGD, the model training batch size (Batch-size) is set to 100, the number of iterations (Epochs) is set to 1000, the momentum ratio (Momentum) is set to 0.9, the activation function is SELU, and the initial weight and bias are Defaults. In this embodiment, the feature extraction module is composed of three Res-BP residual learning modules stacked, and the depth domain adaptation module is composed of three MK-MMD domain adaptation layers and one domain confusion layer, so as to fully mine the source domain and the Domain-invariant features between target domains.
为了进一步证明实施例一所提出的基于Res-BPNN的深度领域自适应回归模型的有效性,采用九种不同迁移学习算法分别对迁移任务A与迁移任务B建立三个关键气路参数偏差值回归模型并进行实验对比。所用对比方法如表3-3所示,迁移成分分析(TransferComponent Analysis,TCA)、联合分布自适应(Joint Distribution Adaptation,JDA)、平衡分布自适应(Balanced Distribution Adaptation,BDA)、深度领域混淆网络(DeepDomain Confusion,DDC)、深度适配网络(Deep Adaptation Networks,DAN)、域对抗神经网络(Domain-Adversarial Neural Network,DANN)以及本申请实施例一所提出方法(无监督)均为无监督迁移学习算法。考虑到目前民航公司引入新型号民航发动机的实际情况,为充分利用现有小批量有标签目标域数据集资源,可将其回归误差融入最终优化目标中,并与Res-BPNN模型、层迁移微调(Fine-tuning)等有监督迁移学习进行实验对比。In order to further prove the effectiveness of the deep domain adaptive regression model based on Res-BPNN proposed in Example 1, nine different transfer learning algorithms are used to establish three key gas path parameter deviation value regressions for transfer task A and transfer task B respectively. model and conduct experimental comparisons. The comparison methods used are shown in Table 3-3. Transfer Component Analysis (TCA), Joint Distribution Adaptation (JDA), Balanced Distribution Adaptation (BDA), Deep Domain Confusion Network ( Deep Domain Confusion (DDC), Deep Adaptation Networks (DAN), Domain-Adversarial Neural Network (DANN) and the method (unsupervised) proposed in the first embodiment of the present application are all unsupervised transfer learning algorithm. Considering the actual situation of the introduction of new models of civil aviation engines by civil aviation companies, in order to make full use of the existing small batch labeled target domain dataset resources, the regression error can be integrated into the final optimization goal, and fine-tuned with the Res-BPNN model and layer transfer. (Fine-tuning) and other supervised transfer learning for experimental comparison.
表3-3迁移学习方法Table 3-3 Transfer learning methods
a.最终优化目标中未引入目标域数据,b.最终优化目标中引入了目标域数据. a. The target domain data is not introduced into the final optimization objective, b. The target domain data is introduced into the final optimization objective.
分别采用上述几种有监督及无监督学习方法分别建立DEGT、DFF以及DN2三个关键气路参数偏差值回归模型进行实验验证,并与OEM提供的气路参数偏差值进行比较,迁移任务A与迁移任务B的性能对比结果分别见表3-4~表3-6及表3-7~表3-9所示。The above-mentioned supervised and unsupervised learning methods were used to establish three key gas path parameter deviation value regression models of DEGT, DFF and DN2 for experimental verification, and compared with the gas path parameter deviation values provided by OEM. The performance comparison results of migration task B are shown in Table 3-4 to Table 3-6 and Table 3-7 to Table 3-9 respectively.
表3-4迁移任务A的DEGT回归效果对比Table 3-4 Comparison of DEGT regression effects of migration task A
表3-5迁移任务A的DFF回归效果对比Table 3-5 Comparison of DFF regression effects of migration task A
表3-6迁移任务A的DN2回归效果对比Table 3-6 Comparison of DN2 regression effects of migration task A
表3-7迁移任务B的DEGT回归效果对比Table 3-7 Comparison of DEGT regression effects of migration task B
表3-8迁移任务B的DFF回归效果对比Table 3-8 Comparison of DFF regression effects of migration task B
表3-9迁移任务B的DN2回归效果对比Table 3-9 Comparison of DN2 regression effects of migration task B
如表3-4~表3-9所示,本申请所提出的基于Res-BPNN的深度领域自适应回归模型在迁移任务A与迁移任务B中预测三个关键性能偏差值DEGT、DFF和DN2时得到的拟合结果在所有方法中最为突出。As shown in Table 3-4 to Table 3-9, the deep domain adaptive regression model based on Res-BPNN proposed in this application predicts three key performance deviation values DEGT, DFF and DN2 in transfer task A and transfer task B The fitting results obtained when , are the most prominent among all methods.
(1)通过分析比较TCA、JDA和BDA以及DDC、DAN、DANN和提出方法(无监督)两组无监督算法的实验结果来看,可以发现第二组实验结果相比于第一组的迁移效果更优。第一组的三种迁移算法由于在特征降维过程中需要选取前m个特征值,中间过程中会丢失飞行参数之间有用信息,导致迁移效果变差。而第二组的四种迁移算法均采用深层网络提取深度特征,因此能够更好的表征气路参数与其偏差值之间的精确映射关系,充分减少领域之间差异性。(1) By analyzing and comparing the experimental results of the two groups of unsupervised algorithms of TCA, JDA and BDA, as well as DDC, DAN, DANN and the proposed method (unsupervised), it can be found that the experimental results of the second group are compared with the migration of the first group. The effect is better. The three migration algorithms in the first group need to select the first m eigenvalues in the process of feature dimensionality reduction, and the useful information between flight parameters will be lost in the intermediate process, resulting in poor migration effect. The four migration algorithms in the second group all use deep networks to extract deep features, so they can better characterize the precise mapping relationship between air path parameters and their deviations, and fully reduce the differences between fields.
(2)通过分析比较四种无监督算法DDC、DAN、DANN以及提出算法(无监督),可以发现本申请所提出的基于Res-BPNN的深度自适应回归模型的回归效果最优。DDC通过在Res-BPNN回归模型架构中引入了一层MMD域适配层,进而学习域不变特征。DAN在DDC的基础之上,通过引入三层MK-MMD域适配层,在不增加网络额外训练时间上,能够更好地学习域不变特征。DANN通过在Res-BPNN回归模型架构中引入了对抗思想,能够深度提取源域与目标域之间的共性特征。而本申请所提出的方法,通过结合DANN以及DAN的优势,在对抗机制的基础之上,额外引入三层MK-MMD域适配层,能更有效地减少领域之间特征分布的差异性。(2) By analyzing and comparing the four unsupervised algorithms DDC, DAN, DANN and the proposed algorithm (unsupervised), it can be found that the regression effect of the deep adaptive regression model based on Res-BPNN proposed in this application is the best. DDC learns domain-invariant features by introducing a layer of MMD domain adaptation layer into the Res-BPNN regression model architecture. On the basis of DDC, DAN can better learn domain-invariant features without increasing the additional training time of the network by introducing three-layer MK-MMD domain adaptation layers. DANN can deeply extract the common features between the source domain and the target domain by introducing the adversarial idea into the Res-BPNN regression model architecture. The method proposed in this application, by combining the advantages of DANN and DAN, and on the basis of the confrontation mechanism, additionally introduces three MK-MMD domain adaptation layers, which can more effectively reduce the difference in feature distribution between domains.
(3)通过分析比较三种有监督学习算法Res-BPNN回归模型、层迁移微调和提出方法(有监督),发现本申请所提出的基于Res-BPNN的深度自适应回归模型的回归效果同样最优。由于新型号航空发动机有标签样本少,直接采用Res-BPNN回归模型进行训练,必然会导致深层网络训练效果差,出现模型过拟合、回归精度差的问题。采用层迁移微调方法,通过预训练源域回归模型,并冻结前n层网络参数权值,采用小批量有标签目标域数据微调后几层参数,学习高阶特征,实现偏差值回归模型建立。而本申请基于Res-BPNN的深度领域自适应回归模型框架,通过在最终优化目标中引入并调节目标域小批量有标签数据集回归损失,将无监督学习问题转化为有监督学习问题,充分挖掘源域与目标域之间的域不变特征,进一步缩小领域特征分布差异,迁移效果得到显著提升。(3) By analyzing and comparing the three supervised learning algorithms Res-BPNN regression model, layer transfer fine-tuning and proposed method (supervised), it is found that the regression effect of the deep adaptive regression model based on Res-BPNN proposed in this application is also the best excellent. Since the new model of aero-engine has few labeled samples, the Res-BPNN regression model is directly used for training, which will inevitably lead to poor training effect of the deep network, and problems of model overfitting and poor regression accuracy. The layer migration fine-tuning method is used to pre-train the source domain regression model, and freeze the weights of the first n layers of network parameters, and use a small batch of labeled target domain data to fine-tune the parameters of the latter layers, learn high-order features, and realize the establishment of the bias value regression model. In this application, the deep domain adaptive regression model framework based on Res-BPNN, by introducing and adjusting the regression loss of the target domain mini-batch labeled dataset in the final optimization goal, transforms the unsupervised learning problem into a supervised learning problem, and fully exploits the The domain-invariant features between the source domain and the target domain further reduce the difference in domain feature distribution, and the transfer effect is significantly improved.
为直观体现本申请所提出的基于Res-BPNN的深度领域自适应回归模型的优越性,以下通过选取迁移任务A中DEGT的回归实验结果进行说明,MK-MMD以及训练损失随迭代次数的变化如图4所示。由于无监督对比实验方法较多,为清晰体现本申请所提出方法(无监督)的拟合结果,此处仅绘制50个测试点,如图5所示。有监督对比实验结果绘制100个测试点,如图6所示。In order to directly reflect the superiority of the deep domain adaptive regression model based on Res-BPNN proposed in this application, the following is an explanation by selecting the regression experimental results of DEGT in the migration task A. The changes of MK-MMD and training loss with the number of iterations are as follows: shown in Figure 4. Since there are many unsupervised comparison experimental methods, in order to clearly reflect the fitting results of the method proposed in this application (unsupervised), only 50 test points are drawn here, as shown in FIG. 5 . The supervised comparison experiment results are plotted with 100 test points, as shown in Figure 6.
从图4可以看出,随着网络迭代次数的不断增大,MK-MMD分布差异度量值及网络训练损失值逐渐降低,并在一定程度上趋于收敛。从图5与图6也可以明显看出,本申请所提出的基于Res-BPNN的深度领域自适应回归模型在无监督学习算法或者有监督学习算法中,都能够获得较好的回归效果,进而证明了本申请所提方法的有效性。It can be seen from Figure 4 that with the continuous increase of the number of network iterations, the MK-MMD distribution difference measurement value and the network training loss value gradually decrease, and tend to converge to a certain extent. It can also be clearly seen from Figure 5 and Figure 6 that the deep domain adaptive regression model based on Res-BPNN proposed in this application can achieve better regression effects in both unsupervised learning algorithms or supervised learning algorithms, and further The effectiveness of the method proposed in this application is proved.
综上,本发明提供了跨工况、跨机型下建立民航发动机气路参数偏差值回归模型。针对民航发动机气路参数与其偏差值之间的实际映射关系,通过N个Res-BP残差学习模块进行气路状态特征提取,采用多个MK-MMD域适配层以及域混淆层深度挖掘源域与目标域之间的共性特征,最后实现偏差值回归,构成基于Res-BPNN的深度领域自适应回归模型。考虑到民航公司引入新机型的实际情况,为充分利用目标域小批量有标签数据资源,在回归模型的最终优化目标中引入目标域的回归损失。为了验证本模型的有效性,将回归模型是否引入有标签目标域数据,划分为有监督及无监督两种方法,并与其他算法进行实验对比。结果表明,采用域适配与域混淆联合优化的方法,能够更充分挖掘源域与目标域之间的域不变特征,进一步缩小领域特征分布差异,提高回归效果。To sum up, the present invention provides a regression model of the deviation value of the air path parameters of a civil aviation engine under cross-working conditions and cross-models. According to the actual mapping relationship between the air path parameters of civil aviation engine and their deviation values, N Res-BP residual learning modules are used to extract air path state features, and multiple MK-MMD domain adaptation layers and domain confusion layers are used to deeply mine the source. The common features between the domain and the target domain are finally realized, and finally the deviation value regression is realized to form a deep domain adaptive regression model based on Res-BPNN. Taking into account the actual situation of the introduction of new aircraft models by civil aviation companies, in order to make full use of the small batch labeled data resources in the target domain, the regression loss of the target domain is introduced into the final optimization goal of the regression model. In order to verify the effectiveness of this model, whether the regression model is introduced with labeled target domain data, it is divided into two methods: supervised and unsupervised, and compared with other algorithms in experiments. The results show that the joint optimization method of domain adaptation and domain confusion can fully mine the domain invariant features between the source domain and the target domain, further reduce the difference in domain feature distribution, and improve the regression effect.
本发明所采用的方法步骤中,明确了处理的数据均为航空发动机ACARS数据以及各步骤如何处理ACARS数据,体现出深度领域自适应回归模型训练方法与航空发动机ACARS数据处理密切相关。本发明提出的解决方案所解决的是航空发动机健康管理领域中的领域自适应问题,即如何将不同机型下已学习到的飞行参数间的关联知识进行相互迁移,实现跨工况、跨机型下建立气路参数偏差值模型,进而获取发动机监控自主性的技术问题,采用了通过N个Res-BP残差学习模块进行气路状态特征提取、采用多个MK-MMD域适配层以及域混淆层深度挖掘源域与目标域之间的共性特征,最后实现偏差值回归的技术手段,利用的是遵循自然规律的技术手段,能够更充分挖掘源域与目标域民航发动机之间的域不变特征,进一步缩小领域特征分布差异,实现跨工况、跨机型下气路参数偏差值的获取方法,相比传统建模方法,回归效果得到显著提高。In the method steps adopted in the present invention, it is clarified that the processed data are all aero-engine ACARS data and how to process the ACARS data in each step, which shows that the deep domain adaptive regression model training method is closely related to aero-engine ACARS data processing. The solution proposed by the present invention solves the domain self-adaptation problem in the field of aero-engine health management, that is, how to transfer the associated knowledge between flight parameters learned under different aircraft types to each other, so as to realize cross-working conditions and cross-aircraft A model of the gas path parameter deviation value is established under the model, and then the technical problem of engine monitoring autonomy is obtained. The N Res-BP residual learning modules are used to extract the gas path state features, and multiple MK-MMD domain adaptation layers are used. The domain confusion layer deeply mines the common features between the source domain and the target domain, and finally realizes the technical means of deviation value regression. Invariant features, further reduce the difference in field feature distribution, and realize the method of obtaining the deviation value of gas path parameters under cross-working conditions and cross-models. Compared with the traditional modeling method, the regression effect is significantly improved.
本领域内的技术人员应明白,本申请的实施例可提供为方法、系统、或计算机程序产品。因此,本申请可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本申请可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器和光学存储器等)上实施的计算机程序产品的形式。As will be appreciated by those skilled in the art, the embodiments of the present application may be provided as a method, a system, or a computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media having computer-usable program code embodied therein, including but not limited to disk storage, optical storage, and the like.
显然,本领域的技术人员可以对本申请进行各种改动和变型而不脱离本申请的精神和范围。这样,倘若本申请的这些修改和变型属于本申请权利要求及其等同技术的范围之内,则本申请也意图包含这些改动和变型在内。Obviously, those skilled in the art can make various changes and modifications to the present application without departing from the spirit and scope of the present application. Thus, if these modifications and variations of the present application fall within the scope of the claims of the present application and their equivalents, the present application is also intended to include these modifications and variations.
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010418279.0A CN111597760B (en) | 2020-05-18 | 2020-05-18 | A method for obtaining the deviation value of gas path parameters under the condition of small sample |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010418279.0A CN111597760B (en) | 2020-05-18 | 2020-05-18 | A method for obtaining the deviation value of gas path parameters under the condition of small sample |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111597760A CN111597760A (en) | 2020-08-28 |
CN111597760B true CN111597760B (en) | 2022-07-22 |
Family
ID=72191345
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010418279.0A Active CN111597760B (en) | 2020-05-18 | 2020-05-18 | A method for obtaining the deviation value of gas path parameters under the condition of small sample |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111597760B (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112183626A (en) * | 2020-09-28 | 2021-01-05 | 上海明略人工智能(集团)有限公司 | Power grid stability judging method and system and computer equipment |
CN113420775B (en) * | 2021-03-31 | 2024-03-29 | 中国矿业大学 | Picture classification method based on nonlinearity self-adaptive subdomain field adaptation under extremely small quantity of training samples |
CN112989712B (en) * | 2021-04-27 | 2021-08-03 | 浙大城市学院 | An aero-engine fault diagnosis method based on 5G edge computing and deep learning |
CN113506253B (en) * | 2021-06-30 | 2023-04-11 | 东北石油大学 | Sandstone pore detection method based on multilayer multi-core learning and region combination |
CN116128049B (en) * | 2023-04-04 | 2023-06-30 | 厦门大学 | XGBoost model-based migration condition selection method for water quality prediction model |
CN117034732B (en) * | 2023-04-14 | 2025-01-17 | 北京百度网讯科技有限公司 | Automatic driving model training method based on true and simulated countermeasure learning |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109766921A (en) * | 2018-12-19 | 2019-05-17 | 合肥工业大学 | A fault classification method for vibration data based on deep domain adaptation |
CN110909783A (en) * | 2019-11-15 | 2020-03-24 | 重庆邮电大学 | Blind domain image classification and reconstruction method based on enhanced reconstruction classification network |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109710636B (en) * | 2018-11-13 | 2022-10-21 | 广东工业大学 | Unsupervised industrial system anomaly detection method based on deep transfer learning |
-
2020
- 2020-05-18 CN CN202010418279.0A patent/CN111597760B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109766921A (en) * | 2018-12-19 | 2019-05-17 | 合肥工业大学 | A fault classification method for vibration data based on deep domain adaptation |
CN110909783A (en) * | 2019-11-15 | 2020-03-24 | 重庆邮电大学 | Blind domain image classification and reconstruction method based on enhanced reconstruction classification network |
Non-Patent Citations (2)
Title |
---|
【深度域适配】一、DANN与梯度反转层(GRL)详解;AI那点小事;《腾讯云社区》;20200415;全文 * |
Regression Model for Civil Aero-engine Gas Path Parameter Deviations Based on Res-BP Neural Network;Xingjie Zhou .etc;《2019 International Conference on Sensing, Diagnostics, Prognostics, and Control (SDPC)》;20191231;P188-196 * |
Also Published As
Publication number | Publication date |
---|---|
CN111597760A (en) | 2020-08-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111597760B (en) | A method for obtaining the deviation value of gas path parameters under the condition of small sample | |
Li et al. | A directed acyclic graph network combined with CNN and LSTM for remaining useful life prediction | |
Chen et al. | Aero-engine remaining useful life prediction method with self-adaptive multimodal data fusion and cluster-ensemble transfer regression | |
Fan et al. | A novel machine learning method based approach for Li-ion battery prognostic and health management | |
CN108446711B (en) | A software defect prediction method based on transfer learning | |
CN102445902B (en) | For the system and method for the condition multi-output regression that machine condition monitors | |
Liu et al. | Membership inference attacks against machine learning models via prediction sensitivity | |
Mo et al. | Few-shot RUL estimation based on model-agnostic meta-learning | |
CN110187990A (en) | A Dynamic Reliability Evaluation Method for Virtual Machine Hybrid Standby Based on Mode Transfer | |
CN113722985A (en) | Method and system for evaluating health state and predicting residual life of aircraft engine | |
Mao et al. | Construction of health indicators for rotating machinery using deep transfer learning with multiscale feature representation | |
CN110082738B (en) | Radar target identification method based on Gaussian mixture and tensor recurrent neural network | |
CN111144552A (en) | Multi-index grain quality prediction method and device | |
Yan et al. | Case-based reasoning classifier based on learning pseudo metric retrieval | |
Zheng et al. | Fault diagnosis method based on supervised particle swarm optimization classification algorithm | |
CN109993232A (en) | Vibration fault diagnosis method of steam turbine based on deep learning artificial neural network | |
Wang et al. | A remaining useful life prediction model based on hybrid long-short sequences for engines | |
CN116306289A (en) | Multi-source domain self-adaption-based electromechanical device cross-domain residual life prediction method | |
Voronov et al. | Predictive maintenance of lead-acid batteries with sparse vehicle operational data | |
Yao et al. | Fault Diagnosis of Rolling Bearing based on VMD Feature Energy Reconstruction and ADE-ELM | |
CN115935187B (en) | Mechanical Fault Diagnosis Method Under Variable Conditions Based on Kernel Sensitivity Alignment Network | |
US20230267302A1 (en) | Large-Scale Architecture Search in Graph Neural Networks via Synthetic Data | |
Zhang et al. | Learning long-range relationships for temporal aircraft anomaly detection | |
Xiang et al. | Spacecraft Intelligent Fault Diagnosis under Variable Working Conditions via Wasserstein Distance‐Based Deep Adversarial Transfer Learning | |
CN114818116B (en) | Aircraft engine failure mode identification and life prediction method based on joint learning |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |