CN111598435B - A Quality Trend Prediction Method Based on Adaptive Feature Selection and Improved Thinking Evolution Algorithm - Google Patents

A Quality Trend Prediction Method Based on Adaptive Feature Selection and Improved Thinking Evolution Algorithm Download PDF

Info

Publication number
CN111598435B
CN111598435B CN202010405648.2A CN202010405648A CN111598435B CN 111598435 B CN111598435 B CN 111598435B CN 202010405648 A CN202010405648 A CN 202010405648A CN 111598435 B CN111598435 B CN 111598435B
Authority
CN
China
Prior art keywords
data
mode
quality
subgroup
particles
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010405648.2A
Other languages
Chinese (zh)
Other versions
CN111598435A (en
Inventor
初红艳
赵凯林
程强
刘宸菲
李�瑞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Technology
Original Assignee
Beijing University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Technology filed Critical Beijing University of Technology
Priority to CN202010405648.2A priority Critical patent/CN111598435B/en
Publication of CN111598435A publication Critical patent/CN111598435A/en
Priority to PCT/CN2020/127970 priority patent/WO2021227406A1/en
Application granted granted Critical
Publication of CN111598435B publication Critical patent/CN111598435B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • G06Q10/06395Quality analysis or management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/004Artificial life, i.e. computing arrangements simulating life
    • G06N3/006Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/04Manufacturing
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Economics (AREA)
  • Artificial Intelligence (AREA)
  • Development Economics (AREA)
  • Tourism & Hospitality (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Educational Administration (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Software Systems (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Mathematical Physics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Molecular Biology (AREA)
  • Evolutionary Computation (AREA)
  • Operations Research (AREA)
  • Primary Health Care (AREA)
  • Quality & Reliability (AREA)
  • Game Theory and Decision Science (AREA)
  • Manufacturing & Machinery (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

本发明公开了一种基于自适应特征选择及改进思维进化算法的质量趋势预测方法,该方法主要包括三个模块:特征自适应处理模块、数据融合模块、质量趋势预测模块。该方法的实现主要包括以下几个步骤:(1)设计相应参数生成建立该模型的数据;(2)应用误差影响程度算法建立特征自适应选择模块;(3)应用KPCA数据融合方法建立数据融合模块;(4)应用改进的思维进化算法优化多层感知器(MLPNN)网络建立质量趋势预测模块。通过建立该方法,本发明能够实施在质量趋势预测领域,能够自适应根据不同类型的数据选择不同的特征进行预测,并且应用数据融合、算法改进提高产品质量趋势预测的精度,及时采取适当的方式进行修正。

The invention discloses a quality trend prediction method based on self-adaptive feature selection and improved thinking evolution algorithm. The method mainly includes three modules: a feature self-adaptive processing module, a data fusion module, and a quality trend prediction module. The implementation of this method mainly includes the following steps: (1) Designing the corresponding parameters to generate the data for building the model; (2) Applying the error influence degree algorithm to establish the feature adaptive selection module; (3) Applying the KPCA data fusion method to establish the data fusion (4) Apply the improved thinking evolution algorithm to optimize the multi-layer perceptron (MLPNN) network to establish the quality trend prediction module. By establishing this method, the present invention can be implemented in the field of quality trend prediction, and can adaptively select different features according to different types of data for prediction, and apply data fusion and algorithm improvement to improve the accuracy of product quality trend prediction, and take appropriate methods in time Make corrections.

Description

一种基于自适应特征选择及改进思维进化算法的质量趋势预 测方法A Quality Trend Prediction Based on Adaptive Feature Selection and Improved Thinking Evolution Algorithm test method

技术领域technical field

本发明属于智能制造及智能质量监控领域,特别涉及一种基于自适应特征选择及改进思维进化算法的质量趋势预测方法。The invention belongs to the field of intelligent manufacturing and intelligent quality monitoring, in particular to a quality trend prediction method based on adaptive feature selection and improved thinking evolution algorithm.

背景技术Background technique

在航天、航空、船舶、汽车等领域应用广泛,由于在这些领域的产品大部分需要的质量数据精度相对较高,在生产过程中产品会时常会受到人、机、料、法、环、测等多种因素的影响,同时质量数据常常具有时变性、非线性、相关性和动态等特性常常会导致产品产生较大的质量问题无法及时预测趋势提出修改措施,严重影响产品的使用性能和质量。It is widely used in the fields of aerospace, aviation, ships, automobiles, etc. Since most of the products in these fields require relatively high quality data accuracy, the products will often be subject to human, machine, material, method, environment, and measurement during the production process. At the same time, quality data often have time-varying, non-linear, correlation and dynamic characteristics, which often lead to large quality problems in products. It is impossible to predict the trend in time and propose modification measures, which seriously affects the performance and quality of products. .

一直以来控制图作为质量控制及预测的辅助手段广泛用于生产过程中。控制图控制质量数据的上下限捕捉产品质量的波动与异常,随着技术的发展与生产节奏的加快只通过控制图的上下限来判断质量波动经常会出现较大的质量问题,无法适应现代加工数据采集等现代化手段进行质量控制。目前有很多学者也开始研究控制图的模式以追求质量的精准控制,但是大多数研究的模式较少、不对混合模式进行研究或是无法适应数据的动态变化不能对数据进行自适应精准控制、识别、预测并且大多采用线下识别,智能化程度不够。Control charts have been widely used in the production process as an auxiliary means of quality control and forecasting. The control chart controls the upper and lower limits of quality data to capture product quality fluctuations and abnormalities. With the development of technology and the acceleration of production rhythm, only the upper and lower limits of the control chart are used to judge quality fluctuations. Large quality problems often occur, which cannot adapt to modern processing. Data collection and other modern means of quality control. At present, many scholars have also begun to study the mode of the control chart to pursue the precise control of quality, but most of the research modes are less, do not study the mixed mode, or cannot adapt to the dynamic changes of the data, and cannot perform adaptive and precise control on the data. , Prediction, and most of them use offline recognition, and the degree of intelligence is not enough.

为实现产品智能化质量控制、提高产品质量提出一种基于自适应特征选择及改进思维进化算法的质量趋势预测方法。本发明可实现智能化质量预测与控制,及时提出措施进行修正并提高产品质量。In order to realize product intelligent quality control and improve product quality, a quality trend prediction method based on adaptive feature selection and improved thinking evolution algorithm is proposed. The invention can realize intelligent quality prediction and control, timely propose measures to correct and improve product quality.

发明内容Contents of the invention

本发明的目的:针对质量数据的动态性、时变性等特性,基于控制图的多种模式提出一种基于自适应特征选择及改进思维进化算法的质量趋势预测方法实现多种模式识别并根据质量数据的时变性自适应选择特征预测动态生产过程中的线上产品质量趋势,同时还应用改进思维进化算法优化MLPNN网络提高质量趋势的识别精度。解决现有质量趋势智能化识别模式少、精度低、不能自适应变化、质量控制力度不够的问题,提高良品率达到智能化生产及智能化趋势预测。Purpose of the present invention: Aiming at the dynamic and time-varying characteristics of quality data, a quality trend prediction method based on adaptive feature selection and improved thinking evolution algorithm is proposed based on multiple modes of control charts to realize multiple mode recognition and according to quality The time-varying adaptive selection feature of the data predicts the quality trend of online products in the dynamic production process, and at the same time, the improved thinking evolution algorithm is applied to optimize the MLPNN network to improve the recognition accuracy of quality trends. Solve the existing problems of few intelligent identification modes of quality trends, low precision, inability to adapt to changes, and insufficient quality control, and improve the yield rate to achieve intelligent production and intelligent trend prediction.

本发明提出来一种基于自适应特征选择及改进思维进化算法的质量趋势预测方法解决上述问题,构建趋势预测模型以便能够预测异常状态。为实现上述目的,本发明采用以下技术方案:The present invention proposes a quality trend prediction method based on adaptive feature selection and improved thinking evolution algorithm to solve the above problems, and builds a trend prediction model to predict abnormal states. To achieve the above object, the present invention adopts the following technical solutions:

一种基于自适应特征选择及改进思维进化算法的质量趋势预测方法包括以下步骤:A quality trend prediction method based on adaptive feature selection and improved thinking evolution algorithm includes the following steps:

步骤1:生成模型建立所需数据。Step 1: Generate the data required for model building.

生成9种模式的数据分别为正常模式(NOR)、周期模式(CYC)、系统模式(SYS)、分层模式(STR)、上升趋势模式(IT)、下降趋势模式(DT)、向上阶跃模式(US)、向下阶跃模式(DS)、混合模式(MIX)。The data generated in 9 modes are normal mode (NOR), cyclical mode (CYC), system mode (SYS), stratified mode (STR), uptrend mode (IT), downtrend mode (DT), upward step mode (US), step down mode (DS), mixed mode (MIX).

步骤2:特征自适应处理模块建立。Step 2: The feature adaptive processing module is established.

在质量趋势预测的过程中,一般的常用方法直接应用数据进行趋势的预测精度不高,该步骤的建立可将特征合理选择并进一步提高智能化程度。该模块建立分为两步:In the process of quality trend prediction, the general method of directly applying data to predict the trend is not very accurate. The establishment of this step can select features reasonably and further improve the degree of intelligence. The module is built in two steps:

第一步:建立特征提取模型,根据研究表明提取的数据特征包括以下统计特征和形状特征更有说服力;其中,质量数据的统计特征包括:MEAN、VS、STD、SKEW、KURT、A;质量数据的形状特征包括:SL、NC1、NC2、APML、APLS、AASL、ACLPI、SRANGE、SB、PSMLSC、REAE、ABDPE。Step 1: Establish a feature extraction model. According to research, the extracted data features include the following statistical features and shape features are more convincing; among them, the statistical features of quality data include: MEAN, VS, STD, SKEW, KURT, A; quality The shape features of the data include: SL, NC1, NC2, APML, APLS, AASL, ACLPI, SRANGE, SB, PSMLSC, REAE, ABDPE.

下列特征为质量数据在观测窗口所提取的,各个符号的含义如下:The following features are extracted from the quality data in the observation window, and the meanings of each symbol are as follows:

MEAE:质量数据的均值;MEAE: mean value of quality data;

VS:质量数据的均方值;VS: mean square value of quality data;

STD:质量数据的标准差;STD: standard deviation of quality data;

SKEW:质量数据的偏态系数;SKEW: Skewness coefficient of quality data;

KURT:质量数据的峰态系数;KURT: kurtosis coefficient of quality data;

A:质量数据的自相关系数;A: autocorrelation coefficient of quality data;

SL:质量数据拟合的最小二乘回归线斜率;SL: the slope of the least squares regression line fitted to the quality data;

NC1:质量数据所成曲线与平均值所成线的交叉点数;NC1: the number of intersections between the curve formed by the quality data and the line formed by the average value;

NC2:质量数据所成曲线与最小二乘回归线的交叉点数;NC2: the number of intersections between the curve formed by the quality data and the least squares regression line;

APML:质量数据所成曲线与平均线之间的面积;APML: the area between the curve formed by the quality data and the average line;

APLS:质量数据所成曲线与最小二乘回归线之间的面积;APLS: the area between the curve formed by the quality data and the least squares regression line;

AASL:质量数据所成曲线分为四个区域,各区域中点组合连线所形成曲线斜率的平均值AASL: The curve formed by the quality data is divided into four areas, and the average value of the slope of the curve formed by the combination of the midpoints in each area

ACLPI:APML与质量数据的标准差之比;ACLPI: ratio of APML to standard deviation of quality data;

SRANGE:质量数据所成曲线分为四个区域,各区域中点连线所成斜率的最大值与最小值之差;SRANGE: The curve formed by the quality data is divided into four areas, and the difference between the maximum value and the minimum value of the slope formed by the line connecting the midpoints of each area;

SB:质量数据最小二乘回归线的斜率标识符;SB: Slope identifier for the least squares regression line of quality data;

PSMLSC:质量数据与中心线交点、最小二乘回归线交点和的平均值;PSMLSC: the average value of the intersection point of the quality data and the center line, and the intersection point of the least squares regression line;

REAE:质量数据的MSE与分成四个区域数据MSE平均值的误差之比;REAE: The ratio of the MSE of the quality data to the error of the average MSE of the data divided into four regions;

ABDPE:将质量数据所成曲线分为两个区域时,整体质量数据所成曲线最小二乘回归线斜率与两个区域最小二乘回归线斜率平均值之差的绝对值。ABDPE: When the curve formed by the quality data is divided into two regions, the absolute value of the difference between the slope of the least squares regression line of the curve formed by the overall quality data and the average slope of the least squares regression line of the two regions.

第二步:建立自适应特征选取模型,该模型建立引入误差影响计算的方法。若有新的数据可根据数据特性应用该模块选择合适的特征。基于不同种类的质量数据,根据特征重要的程度选取不同的特征。The second step: establish an adaptive feature selection model, which establishes the method of introducing errors to affect the calculation. If there is new data, the module can be applied to select appropriate features according to the data characteristics. Based on different types of quality data, different features are selected according to the importance of the features.

建立该模型的方法如下:The way to build this model is as follows:

a)设质量数据特征有N个,将质量数据特征进行预处理;a) Assuming that there are N quality data features, the quality data features are preprocessed;

b)建立初始MLPNN神经网络,并应用处理好的质量数据特征进行训练;b) Establish an initial MLPNN neural network, and apply the processed quality data features for training;

c)若有新数据输入,预处理新质量数据特征并将每种质量数据特征数值分别增加10%、减少10%共生成2N组数据,将每组数据分别带入MLPNN中进行识别得到2N个误差值,然后将每种质量数据特征对应的增加10%、减少10%所得误差取平均值得到N个误差值;c) If there is new data input, preprocess the new quality data features and increase the value of each quality data feature by 10% and decrease it by 10% to generate 2N sets of data, and bring each set of data into MLPNN for identification to obtain 2N Error value, and then average the error obtained by increasing 10% and decreasing 10% corresponding to each quality data feature to obtain N error values;

d)将误差由大到小排序选取前85%的特征规定为影响该质量数据程度较高的特征。d) The first 85% of the features selected in order of error from large to small are defined as the features that affect the quality data to a higher degree.

e)完成自适应质量数据特征选择模型。e) Complete the adaptive quality data feature selection model.

步骤3:数据特征融合模块。Step 3: Data feature fusion module.

为了能更加精准的进行趋势的预测,将原始数据与自适应选择的数据进行融合。若将原始数据直接与选择的特征进行质量趋势的预测,输入数据太过庞大、大大增加了模型的计算复杂性,在数据降维方法中分为线性数据降维、非线性数据降维,该模块采用KPCA数据降维方法将原始数据与特征数据进行融合,建立该模块的方法步骤如下:In order to predict trends more accurately, the original data is fused with adaptively selected data. If the original data is directly combined with the selected features to predict the quality trend, the input data is too large, which greatly increases the computational complexity of the model. In the data dimension reduction method, it is divided into linear data dimension reduction and nonlinear data dimension reduction. The module adopts the KPCA data dimension reduction method to fuse the original data and the characteristic data, and the method steps to establish the module are as follows:

a)将组合数据进行标准化、中心化;a) Standardize and centralize the combined data;

b)构造组合数据的核函数,将数据映射到高维度中并计算核矩阵;b) Construct a kernel function for combining data, map the data into high dimensions and calculate the kernel matrix;

c)计算的特征值、选择特征向量;c) Calculated eigenvalues, selected eigenvectors;

d)进行数据降维、融合;d) Perform data dimension reduction and fusion;

步骤4:建立质量趋势预测模块。Step 4: Establish quality trend prediction module.

基于自适应特征融合后的数据建立3层感知器MLPNN神经网络模型,使用改进的思维进化算法对MLPNN神经网络权值、阈值进行优化。Based on the data after adaptive feature fusion, a 3-layer perceptron MLPNN neural network model is established, and an improved thinking evolution algorithm is used to optimize the weights and thresholds of the MLPNN neural network.

一般的思维进化算法主要是通过迭代优化的学习方式,进化过程中所有的个体叫做群体,一个群体分为若干个子群。子群包括优胜子群和临时子群。在思维进化时优胜子群和临时子群在飞行进化过程中是以最优的粒子为中心随机产生的子群没有任何的限制,子群内粒子之间包含的信息程度也无法判断,从而产生对进化没有意义的粒子。在这里引入互信息理论来判定子群进化的优劣程度。当子群内产生的粒子与中心粒子包含的信息程度大于85%则认为此粒子为无效粒子,同时若某粒子的得分大于中心粒子则保留,否则就重新生成该粒子。The general thinking evolution algorithm is mainly through the learning method of iterative optimization. All individuals in the evolution process are called groups, and a group is divided into several subgroups. Subgroups include superior subgroups and temporary subgroups. In the evolution of thinking, the superior subgroup and the temporary subgroup are randomly generated subgroups centered on the optimal particle in the process of flight evolution without any restrictions, and the degree of information contained between the particles in the subgroup cannot be judged, resulting in Particles that have no meaning for evolution. The mutual information theory is introduced here to determine the degree of subgroup evolution. When the information contained in the particles generated in the subgroup and the central particle is greater than 85%, the particle is considered to be an invalid particle. At the same time, if the score of a particle is greater than the central particle, it will be retained, otherwise the particle will be regenerated.

建立改进思维进化算法方法步骤如下:The steps of establishing an improved thinking evolution algorithm are as follows:

a)初始化种群产生,在空间内生成种群;a) Initialize population generation and generate populations in the space;

b)在初始化种群中选择得分较高的粒子分别作为优胜子群中心、临时子群中心并产生子群。b) Select particles with higher scores in the initialization population as the center of the winning subgroup and the center of the temporary subgroup to generate subgroups.

c)引入信息判断算子:分别计算各优胜子群、临时子群的粒子与本身子群的中心粒子之间的互信息程度,若个体粒子与中心粒子的互信息程度大于85%则认为是相似粒子,同时若个体粒子比中心粒子得分高则保留,否则释放;c) Introduce an information judgment operator: Calculate the degree of mutual information between the particles of each winning subgroup and temporary subgroup and the central particle of its own subgroup respectively. If the mutual information degree between individual particles and the central particle is greater than 85%, it is considered to be Similar particles, at the same time, if the individual particle has a higher score than the central particle, it will be retained, otherwise it will be released;

d)趋同操作算子:计算所有子群中的个体粒子得分,选出优胜者作为中心重新生成子群。同时进行步骤c操作,若子群中个体粒子与中心粒子包含信息程度合适则继续下一步操作,否则重新生成粒子并进行步骤c操作。d) Convergence operator: Calculate the individual particle scores in all subgroups, and select the winner as the center to regenerate subgroups. At the same time, perform the operation of step c, if the information contained in the individual particles and the central particle in the subgroup is appropriate, proceed to the next step, otherwise regenerate the particles and proceed to the operation of step c.

e)判断是否各子群是否成熟,若子群成熟继续下一步操作,否则继续步骤d操作。e) Judging whether each subgroup is mature, if the subgroup is mature, proceed to the next step, otherwise proceed to step d.

f)异化操作算子:将所有的成熟优胜子群和临时子群进行信息的交流,得分较高的临时子群将代替得分较低的临时子群。f) Alienation operator: exchange information between all mature winning subgroups and temporary subgroups, and the temporary subgroups with higher scores will replace the temporary subgroups with lower scores.

g)判断若成熟临时子群中没有得分超过成熟优胜子群则跳出循环,否则重复进行步骤c-步骤g操作。g) Judging that if there is no score in the mature temporary subgroup that exceeds the mature winning subgroup, then jump out of the loop, otherwise repeat step c-step g.

同时上述操作中在步骤d中以新的中心粒子生成新的子群时引入熵变理论增加粒子生成的混沌程度,增加熵变惯性系数。在搜索前期要求范围较大且与中心粒子的信息保持一定关系,后期则要求收敛较快,而引入惯性系数可提高对优秀粒子的搜索能力。At the same time, in the above operation, when a new subgroup is generated with a new central particle in step d, the entropy change theory is introduced to increase the degree of chaos generated by the particles, and the entropy change inertia coefficient is increased. In the early stage of the search, it is required to have a large range and maintain a certain relationship with the information of the central particle, and in the later stage, it is required to converge faster, and the introduction of the inertia coefficient can improve the search ability for excellent particles.

通过分析生产过程中的异常情况,根据质量数据的特点制定自适应选择合适特征的方法,并将自适应选择的特征与原始数据融合降低维度的同时增加识别精度,引入信息判断算子和熵增理论改进的思维进化算法提高算法的精度及搜索能力,本发明的有益效果是:能够根据质量数据的特性自适应选择合适的特征保证识别的精度;数据融合方法的使用可增强质量趋势预测的性能,保证识别器具有良好的训练效率;改进的算法能够保证分类时具有良好的容错性、分类能力更强。By analyzing the abnormal situation in the production process, according to the characteristics of the quality data, a method for adaptively selecting appropriate features is developed, and the adaptively selected features are fused with the original data to reduce the dimension while increasing the recognition accuracy, and introduce information judgment operators and entropy increase The theoretically improved thinking evolution algorithm improves the accuracy and search ability of the algorithm. The beneficial effects of the present invention are: the ability to adaptively select appropriate features according to the characteristics of quality data to ensure the accuracy of recognition; the use of data fusion methods can enhance the performance of quality trend prediction , to ensure that the recognizer has good training efficiency; the improved algorithm can ensure good fault tolerance and stronger classification ability during classification.

附图说明Description of drawings

下面结合附图和实施例对本发明进行进一步说明The present invention will be further described below in conjunction with accompanying drawing and embodiment

图1是本方法的流程图;Fig. 1 is the flowchart of this method;

图2是9种模式的图像;(a)正常模式;(b)混合模式;(c)周期模式;(e)系统模式;(f)分层模式;(g)上升趋势;(h)下降趋势;(i)上阶跃趋势;(j)下阶跃模式;Figure 2 is the image of 9 modes; (a) normal mode; (b) mixed mode; (c) periodic mode; (e) systemic mode; (f) hierarchical mode; (g) rising trend; (h) decreasing trend; (i) upward step trend; (j) downward step pattern;

图3是9种模式对应特征的分布图;Figure 3 is a distribution diagram of the corresponding features of the nine modes;

图4是特征融合的示意图Figure 4 is a schematic diagram of feature fusion

图5是改进思维进化算法的流程图Figure 5 is a flowchart of the improved thinking evolution algorithm

图6是思维进化算法粒子简化的示意图Figure 6 is a schematic diagram of the particle simplification of the thinking evolution algorithm

具体实施方式Detailed ways

本发明考虑生产时质量数据的动态性、时变特性引入自适应特征选择方法,并且为准确识别应用KPCA数据融合方法将动态数据与自适应特征进行融合,最终应用MLPNN网络结合改进的思维进化算法,建立了精准的识别模型。下面结合所示附图以及具体的方法建立的实施方式,对本发明的建立做进一步的描述:The present invention considers the dynamics and time-varying characteristics of quality data during production and introduces an adaptive feature selection method, and applies KPCA data fusion method to fuse dynamic data and adaptive features for accurate identification, and finally applies MLPNN network combined with improved thinking evolution algorithm , and established an accurate recognition model. The establishment of the present invention will be further described below in conjunction with the accompanying drawings and the implementation of the specific method establishment:

如图1所示为该方法实现的总体流程图,下面是针对每一个模块进行具体的展开。As shown in Figure 1, it is the overall flow chart of the implementation of the method, and the following is a specific expansion for each module.

步骤1:生成模型建立所需数据。Step 1: Generate the data required for model building.

分析在生产过程中会出现的模式。受实际生产的影响,产品质量的波动会呈一定的规律出现并影响接下来的生产。设计不同的参数来生成不同类型的趋势图如图2所示为9种模式的图形:正常模式、混合模式、周期模式、分层模式、系统模式、上趋势模式、下趋势模式、上阶跃模式、下阶跃模式的数据,具体方法如下:Analyze patterns that will arise during production. Affected by actual production, fluctuations in product quality will appear in a certain order and affect the next production. Different parameters are designed to generate different types of trend graphs, as shown in Figure 2. There are 9 modes of graphics: normal mode, mixed mode, cycle mode, layered mode, system mode, up trend mode, down trend mode, and up step mode, the data of the next step mode, the specific method is as follows:

仿真公式:y=μ+R(t)+d(t)Simulation formula: y=μ+R(t)+d(t)

其中:μ为质量数据的均值,R(t)为t时刻的正态分布随机偏差,d(t)为生产过程出现异常因素导致的偏差,下面是生产时各模式公式以及参数说明:Among them: μ is the mean value of the quality data, R(t) is the normal distribution random deviation at time t, d(t) is the deviation caused by abnormal factors in the production process, the following are the formulas and parameter descriptions of each mode during production:

(1)生产过程的正常模式(NOR):(1) Normal mode (NOR) of the production process:

y=μ+r(t)×σ+d(t)y=μ+r(t)×σ+d(t)

其中:μ为质量数据的均值,r(t)为标准正态随机的分布函数,d(t)为异常因素引起的波动,正常模式下d(t)=鲀,建议σ取值最小为0.05,最大为0.5;Among them: μ is the mean value of the quality data, r(t) is the standard normal random distribution function, d(t) is the fluctuation caused by abnormal factors, in the normal mode d(t)=fish, the minimum value of σ is recommended to be 0.05 , the maximum is 0.5;

(2)生产过程的混合模式(MIX):(2) Mixed mode (MIX) of the production process:

y=μ+r(t)×σ+(-1)W×m×σy=μ+r(t)×σ+(-1) W ×m×σ

其中:d(t)为异常因素引起的波动,建议σ取值最小为0.05,最大为0.5;W为生产过程的扰动因子0或1,m为生产过程的扰动幅值建议取[1.5,2.5]。Among them: d(t) is the fluctuation caused by abnormal factors. It is recommended that the minimum value of σ be 0.05 and the maximum value be 0.5; W is the disturbance factor 0 or 1 of the production process, and m is the disturbance amplitude of the production process. It is recommended to take [1.5, 2.5 ].

(3)生产过程的周期模式(CYC):(3) Cycle mode (CYC) of the production process:

y=μ+r(t)×σ+a×sin(2πt/T)×σy=μ+r(t)×σ+a×sin(2πt/T)×σ

其中:d(t)为异常因素引起的波动,周期模式下d(t)=a×sin(2πt/T)×σ,建议σ取值最小为0.05,最大为0.5;建议周期性生产过程扰动的幅值a最小取值范围[1,1.5],a最大取值范围为[2.5,3];建议生产过程扰动的周期T的最小取值范围为[4,8],T最大取值范围为[10,16]。Among them: d(t) is the fluctuation caused by abnormal factors. In the periodic mode, d(t)=a×sin(2πt/T)×σ, the minimum value of σ is recommended to be 0.05, and the maximum value is 0.5; periodic production process disturbance is recommended The minimum value range of the amplitude a is [1,1.5], the maximum value range of a is [2.5, 3]; the minimum value range of the cycle T of the suggested production process disturbance is [4,8], and the maximum value range of T is [10,16].

(4)生产过程的分层模式(STR):(4) Hierarchical mode (STR) of the production process:

y=μ+r(t)×σ×Ky=μ+r(t)×σ×K

其中:d(t)为异常因素引起的波动,建议σ取值最小为0.05,最大为0.5;建议生产过程数据的扰动比例关系K的最小取值范围为[0.1,0.3],K最大取值范围为[0.4,0.6]。Among them: d(t) is the fluctuation caused by abnormal factors, the minimum value of σ is recommended to be 0.05, and the maximum value is 0.5; the minimum value range of the disturbance proportional relationship K of the production process data is suggested to be [0.1,0.3], and the maximum value of K The range is [0.4,0.6].

(5)生产过程的系统模式(SYS):(5) System mode (SYS) of the production process:

y=μ+r(t)×σ+d×(-1)t×σy=μ+r(t)×σ+d×(-1) t ×σ

其中:d(t)为异常因素引起的波动,系统模式下d(t)=d×(-1)t×σ,建议σ取值最小为0.05,最大为0.5;建议质量数据的偏离程度d最小取值为1;d最大取值为3。Among them: d(t) is the fluctuation caused by abnormal factors. In the system mode, d(t)=d×(-1) t ×σ. It is recommended that the minimum value of σ be 0.05 and the maximum value be 0.5; the deviation degree of the recommended quality data is d The minimum value is 1; the maximum value of d is 3.

(6)生产过程的上下趋势模式(IT/DT):(6) The up and down trend mode (IT/DT) of the production process:

y=μ+r(t)×σ±g×t×σy=μ+r(t)×σ±g×t×σ

其中:d(t)为异常因素引起的波动,趋势模式下d(t)=±g×t×σ,建议σ取值最小为0.05,最大为0.5;建议质量数据的斜率g取值范围为[0.05,0.1];Among them: d(t) is the fluctuation caused by abnormal factors. In the trend mode, d(t)=±g×t×σ, the minimum value of σ is recommended to be 0.05, and the maximum value is 0.5; the value range of the slope g of quality data is recommended to be [0.05,0.1];

(7)生产过程的上下阶跃模式(US/DS):(7) Up and down step mode (US/DS) of the production process:

y=μ+r(t)×σ±b×s×σy=μ+r(t)×σ±b×s×σ

其中:d(t)为异常因素引起的波动,阶跃模式下d(t)=±b×s×σ,建议σ取值最小为0.05,最大为0.5;当t<P时b=0,t>P时b=1;建议质量数据的随机阶跃位置P最小取值为[4,9],P最大取值范围为[13,19];建议阶跃幅值s最小取值范围为[0.5,1.5],s最大取值范围为[2.5,3.5]。Among them: d(t) is the fluctuation caused by abnormal factors. In the step mode, d(t)=±b×s×σ. It is recommended that the minimum value of σ be 0.05 and the maximum value be 0.5; when t<P, b=0, When t>P, b=1; the recommended minimum value of the random step position P of quality data is [4,9], and the maximum value range of P is [13,19]; the minimum value range of the recommended step amplitude s is [0.5,1.5], the maximum value range of s is [2.5,3.5].

步骤2:特征自适应处理模块建立Step 2: Establish feature adaptive processing module

在质量趋势预测的过程中,一般的常用方法直接应用数据进行趋势的预测,研究表明合理的使用质量数据特征将大大提高预测精准。提取质量数据特征,具体特征分布如图3所示。建立特征自适应处理模块分为两步:In the process of quality trend prediction, the general common method directly applies data to predict the trend. Research shows that reasonable use of quality data features will greatly improve the prediction accuracy. The quality data features are extracted, and the specific feature distribution is shown in Figure 3. The establishment of the feature adaptive processing module is divided into two steps:

第一步:建立特征提取模型,提取质量数据的统计特征包括:MEAN、VS、STD、SKEW、KURT、A;质量数据的形状特征包括:SL、NC1、NC2、APML、APLS、AASL、ACLPI、SRANGE、SB、PSMLSC、REAE、ABDPE。设质量数据为Y=(y1,y2,y3....yP)(i=1,2,3...P),特征提取如下方法:Step 1: Establish a feature extraction model to extract statistical features of quality data including: MEAN, VS, STD, SKEW, KURT, A; shape features of quality data include: SL, NC1, NC2, APML, APLS, AASL, ACLPI, SRANGE, SB, PSMLSC, REAE, ABDPE. Let the quality data be Y=(y 1 ,y 2 ,y 3 ....y P )(i=1,2,3...P), feature extraction is as follows:

质量数据的均值公式: Mean formula for quality data:

质量数据的均方值公式: The mean square value formula for quality data:

质量数据的标准差 Standard Deviation of Quality Data

质量数据的偏态系数公式: The coefficient of skewness formula for quality data:

质量数据的峰态系数公式: The kurtosis formula for mass data:

质量数据的自相关系数: Autocorrelation coefficient for quality data:

质量数据所成曲线的最小二乘回归线斜率: The slope of the least squares regression line of the curve formed by the quality data:

ti为第i次时间的检测质量数据点到原点的距离,为P次检测质量数据点距离原点的平均值。yi为第i次质量数据。t i is the distance from the detection quality data point to the origin at the i-th time, is the average value of the distance from the origin of the quality data points of P tests. y i is the i-th quality data.

质量数据所成曲线与平均值所成曲线的交叉点数:如果(yi-MEAN)(yi+1-MEAN)<0,则NC1加1;Number of intersections between the curve formed by the quality data and the curve formed by the average value: if (y i -MEAN)(y i+1 -MEAN)<0, then add 1 to NC1;

质量数据所成曲线与最小二乘回归线的交叉点数:如果(yi-L(yi))(yi+1-L(yi+1))<0,则NC2加1;Number of intersections between the curve formed by the quality data and the least squares regression line: if (y i -L(y i ))(y i+1 -L(y i+1 ))<0, then NC2 plus 1;

质量数据所成曲线与平均线之间所成的面积:APML;The area between the curve formed by the quality data and the average line: APML;

质量数据所成曲线与最小二乘回归线所成的面积:APLS;The area formed by the curve formed by the quality data and the least squares regression line: APLS;

将质量数据分成四段区域,两两区域中点连线得到斜率平均值: 其中,Sjk为第j个中点和第k个中点组成的质量数据区域斜率。各区域中点坐标为/> Divide the quality data into four regions, and connect the midpoints of the two regions to obtain the average value of the slope: Among them, S jk is the slope of the quality data area composed of the jth midpoint and the kth midpoint. The coordinates of the midpoint of each area are />

质量数据与中心线组成的面积与质量数据的标准差之比: The ratio of the area formed by the quality data and the center line to the standard deviation of the quality data:

质量数据分为四个区域,各区域中点连线所成斜率的最大值与最小值之差:SRANGE=max(Sjk)-min(Sjk);(j=123;k=234;J<k)The quality data is divided into four areas, the difference between the maximum value and the minimum value of the slope formed by the line connecting the points in each area: SRANGE=max(S jk )-min(S jk ); (j=123; k=234; J <k)

质量数据所成曲线的最小二乘回归线斜率标识符:如果质量数据所成曲线的最小二乘斜率大于0则SB为1,否则SB为0;Slope identifier of the least squares regression line of the curve formed by the quality data: if the least squares slope of the curve formed by the quality data is greater than 0, then SB is 1, otherwise SB is 0;

质量数据所成曲线与中心线交点、最小二乘回归线交点和的平均值: The average value of the intersection point of the curve formed by the quality data and the center line, and the intersection point of the least squares regression line:

质量数据的MSE与分成四个区域数据的MSE平均值的误差之比: The ratio of the error of the MSE of the quality data to the mean of the MSE of the data divided into four regions:

质量数据分为两个区域时,整体的最小二乘回归线斜率与两个区域最小二乘回归线斜率的平均值之差的绝对值:其中B为质量数据所成曲线整体的最小二乘回归线斜率,/>为两个区域的最小二乘回归线斜率平均值。When the quality data is divided into two regions, the absolute value of the difference between the slope of the overall least squares regression line and the average of the slopes of the least squares regression line of the two regions: Where B is the slope of the least squares regression line of the overall curve formed by the quality data, /> is the average of the slopes of the least squares regression line for the two regions.

第二步:建立自适应特征选取模型,该模型建立引入误差影响计算的方法。在生产过程中不同种类的质量数据隐含的重要的特征不同,基于不同的数据,选取重要的特征。如下为该模型建立的具体方法步骤:The second step: establish an adaptive feature selection model, which establishes the method of introducing errors to affect the calculation. Different types of quality data imply different important features in the production process, and important features are selected based on different data. The specific method steps for the establishment of the model are as follows:

a)将质量数据特征进行预处理;a) Preprocessing the quality data features;

设特征矩阵F=(fab)(a=1,2,3...H;b=1,2,3...N),其中有H个样本,N类特征,对质量数据特征进行标准化处理:其中:/>为第b类质量数据特征的均值,sb为第b类质量数据特征的标准差/> Let the feature matrix F=(f ab )(a=1,2,3...H; b=1,2,3...N), in which there are H samples and N types of features, and the quality data features are Normalization: where: /> is the mean value of the quality data characteristics of the b category, s b is the standard deviation of the quality data characteristics of the b category />

b)建立3层感知器神经网络,隐含层为个节点,m为输入节点数,n为输出节点数并用在上述质量数据特征中选取训练数据进行MLPNN神经网络训练;b) Establish a 3-layer perceptron neural network, the hidden layer is nodes, m is the number of input nodes, n is the number of output nodes and is used to select training data in the above-mentioned quality data characteristics to carry out MLPNN neural network training;

c)若新输入质量数据特征[f1,f2,f3....fN],将质量数据特征预处理得到F′=(fb′)(b=1,2,3...N),并使每种质量数据特征数值分别增加10%、减少10%共生成2N组数据[f1 *,f2 *,f3 *....f2N *],其中f1 *-fN *为每种质量数据特征数值增加10%数据,fN+1 *-f2N *为将每种质量数据特征数值减少10%数据,将两组质量数据特征分别带入MLPNN中进行识别得到2N个趋势预测误差分别为[E1 U,E2 U,E3 U....EN U]、[E1 D,E2 D,E3 D....EN D],然后将每种质量数据特征对应增加10%、减少10%所得趋势预测误差取平均值Ei=(Ei U+Ei D)/2(i=123....N)得到N个趋势预测误差值[E1,E2,E3....EN];c) If the new input quality data features [f 1 , f 2 , f 3 ...f N ], preprocess the quality data features to get F′=(f b ′)(b=1,2,3.. .N), and increase the feature value of each quality data by 10% and decrease by 10% respectively to generate 2N sets of data [f 1 * , f 2 * , f 3 * .... f 2N * ], where f 1 * -f N * is to add 10% data for each quality data feature value, f N+1 * -f 2N * is to reduce each quality data feature value by 10% data, and bring two sets of quality data features into MLPNN respectively The identified 2N trend prediction errors are [E 1 U , E 2 U , E 3 U .... E N U ], [E 1 D , E 2 D , E 3 D .... E N D ] , and then each quality data feature is increased by 10% and decreased by 10% to obtain the average value of the trend prediction error E i =(E i U +E i D )/2(i=123....N) to obtain N Trend prediction error value [E 1 , E 2 , E 3 .... E N ];

d)将[E1,E2,E3....EN]误差由大到小排序选取前85%的特征规定为影响该质量数据程度较高的特征。d) The [E 1 , E 2 , E 3 .... E N ] errors are sorted from large to small and the top 85% of the features are selected as the features that affect the quality data to a higher degree.

e)完成自适应质量数据特征选择得到L(L<N)个特征,完成该模型的建立。为后面特征融合做准备。e) Complete adaptive quality data feature selection to obtain L (L<N) features, and complete the establishment of the model. Prepare for later feature fusion.

步骤3:数据特征融合模块Step 3: Data feature fusion module

建立数据特征融合模块,为了能更加精准的进行趋势的预测,将原始数据与自适应选择的数据进行融合。该模块采用KPCA数据降维方法将原始数据与特征数据进行融合如图4所示,具体实现的步骤方法如下:A data feature fusion module is established to fuse the original data with the adaptively selected data in order to predict trends more accurately. This module uses the KPCA data dimensionality reduction method to fuse the original data and feature data, as shown in Figure 4. The specific implementation steps are as follows:

a)将组合数据进行标准化、中心化;a) Standardize and centralize the combined data;

设数据矩阵F=(fab)(a=1,2,3...H;b=1,2,3...(L+P)),其中有H个样本,每个样本有L+P个数据,对组合数据进行标准化处理:其中:/>组合数据中第b维数据的均值,sb为质量数据与质量数据特征组合后第b维数据的标准差/> 得到Fa=[fa1″,fa2″,fa3″....fa(L+P)″],F″=[F1,F2....FH]。Let the data matrix F=(f ab )(a=1,2,3...H; b=1,2,3...(L+P)), there are H samples, and each sample has L +P data, standardize the combined data: where: /> The mean value of the b-th dimension data in the combined data, s b is the standard deviation of the b-th dimension data after the combination of quality data and quality data features/> It is obtained that F a =[f a1 ", f a2 ", f a3 "...f a(L+P) "], F"=[F 1 , F 2 ....F H ].

b)构造组合数据核函数,将处理的数据映射到高维度中,并计算组合数据的核矩阵;b) Construct a combined data kernel function, map the processed data into high dimensions, and calculate the kernel matrix of the combined data;

根据组合数据的特征值λ及特征向量W选择组合数据的载荷因子并生成融合数据,这里引非线性映射Φ,可以将问题转换成Φ(F″)Φ(F″)TW=λW。According to the eigenvalue λ and eigenvector W of the combined data, the load factor of the combined data is selected and the fusion data is generated. Here, the nonlinear mapping Φ is used to convert the problem into Φ(F″)Φ(F″) T W=λW.

引入组合数据的映射函数得到组合数据的向量矩阵W=(w1,w2,w3....wH),wa为组合数据的基向量,映射数据的关系可表示为 The vector matrix W=(w 1 ,w 2 ,w 3 ....w H ) of the combined data is obtained by introducing the mapping function of the combined data, w a is the base vector of the combined data, and the relationship of the mapped data can be expressed as

可得 Available

得α=[α12....αH]make Get α=[α 12 ....α H ]

最终,得到Φ(F″)Φ(F″)TΦ(F″)α=λΦ(F″)αFinally, get Φ(F″)Φ(F″) T Φ(F″)α=λΦ(F″)α

这里引入核技巧方程两边同时乘Φ(F″)T得到:Here we introduce the kernel skill equation and multiply Φ(F″) T on both sides to get:

c)计算组合数据的特征值、选择特征向量;c) Calculate the eigenvalues and select eigenvectors of the combined data;

引入组合数据的核函数并进行核技巧转换K(Fa1,Fa2)=Φ(Fa1)Φ(Fa2)T其中(a1∈(1,H),a2∈(1,H));此时可以定义当特征值累计贡献率大于85%时的前g个主成分可表示原数据矩阵,此时的K为H×H的矩阵选择后的载荷因子矩阵为A=[α12....αg]。Introduce the kernel function of combined data and perform kernel skill transformation K(F a1 ,F a2 )=Φ(F a1 )Φ(F a2 ) T where (a1∈(1,H),a2∈(1,H)); At this time, it can be defined that when the cumulative contribution rate of eigenvalues is greater than 85%, the first g principal components can represent the original data matrix. At this time, K is the matrix of H×H. The load factor matrix after selection is A=[α 12 .... αg ].

d)进行数据降维;d) Perform data dimensionality reduction;

将组合数据进行核变换后数据和选择的载荷向量相乘得到融合后的数据X*=K×A即为: Multiply the combined data after nuclear transformation with the selected load vector to obtain the fused data X * = K × A is:

步骤4:建立质量趋势预测模块Step 4: Build Quality Trend Prediction Module

自适应特征融合后的数据建立3层感知器MLPNN神经网络模型,使用改进的思维进化算法对MLPNN神经网络权值、阈值进行优化,训练数据采用第3步融合的数据。The 3-layer perceptron MLPNN neural network model is established from the data after adaptive feature fusion, and the weights and thresholds of the MLPNN neural network are optimized using the improved thinking evolution algorithm. The training data uses the data fused in the third step.

一般的思维进化算法主要是通过迭代优化的学习方式,进化过程中所有的个体叫做群体,一个群体分为若干个子群。子群包括优胜群体和临时群体。在思维进化时优胜粒子群和临时粒子群在飞行进化过程中是以最优的粒子为中心随机产生的子群没有任何的限制,子群内粒子之间包含的信息程度也无法判断,从而很可能产生很多对进化没有任何意义的粒子,对进化没有意义。在这里引入互信息理论来判定子群进化的有优劣程度。当子群内产生的粒子与领头粒子包含的信息程度大于85%则认为此粒子为无效粒子,同时若某粒子的得分大于领头粒子则保留,否则就以领头粒子为中心生成粒子,如图5所示为具体操作的步骤,图6为粒子进化的过程示意图。The general thinking evolution algorithm is mainly through the learning method of iterative optimization. All individuals in the evolution process are called groups, and a group is divided into several subgroups. Subgroups include winner groups and temporary groups. During the evolution of thinking, the superior particle swarm and the temporary particle swarm are randomly generated subgroups centered on the optimal particle in the process of flight evolution without any restrictions, and the degree of information contained between the particles in the subgroup cannot be judged, so it is very easy It may produce a lot of particles which are meaningless to evolution, and are meaningless to evolution. The mutual information theory is introduced here to determine the degree of subgroup evolution. When the information contained in the particle generated in the subgroup and the leading particle is greater than 85%, the particle is considered to be an invalid particle. At the same time, if the score of a certain particle is greater than the leading particle, it will be kept; otherwise, the particle will be generated centering on the leading particle, as shown in Figure 5. Shown are the steps of the specific operation, and Fig. 6 is a schematic diagram of the particle evolution process.

建立改进思维进化算法方法步骤如下:The steps of establishing an improved thinking evolution algorithm are as follows:

a)初始化种群产生,在空间内生成具有T个粒子的种群,并根据适应函数计算各个粒子的得分;a) Initialize population generation, generate a population with T particles in the space, and calculate the score of each particle according to the fitness function;

b)在初始化种群中选择A个得分较高的粒子作为优胜子群中心,选择B个得分较高的粒子作为临时子群中心。设定各子群大小为T*,其中T*=T/(A+B)。b) In the initialization population, select A particles with higher scores as the winning subgroup centers, and select B particles with higher scores as temporary subgroup centers. Set the size of each subgroup as T * , where T * =T/(A+B).

c)引入信息判断算子:设粒子长度为L′分别计算优胜子群各粒子中心与相对应 临时子群各粒子中心与/>之间的互信息/>及/>其中(i=1,2,3....T*,l=1 2 3.....L′ t1=1,2,3,...A,t2=1,2,3,...B),若两个子群的个体粒子与中心粒子互信息程度大于85%则认为粒子为相似粒子,同时制定规则若粒子得分大于中心粒子保留,否则释放;c) Introduce an information judgment operator: set the length of the particle as L′ to calculate the center of each particle of the winning subgroup and the corresponding Each particle center of the temporary subgroup and /> mutual information between and /> where (i=1,2,3....T * , l=1 2 3.....L′ t 1 =1,2,3,...A, t 2 =1,2,3 ,...B), if the degree of mutual information between the individual particles of the two subgroups and the central particle is greater than 85%, the particles are considered to be similar particles, and at the same time, a rule is formulated that if the particle score is greater than the central particle, it is retained, otherwise it is released;

以优胜子群的互信息程度计算为例:Take the calculation of the mutual information degree of the winning subgroup as an example:

首先,计算优胜子群的信息熵公式如下:First, the formula for calculating the information entropy of the winning subgroup is as follows:

其中表示个体粒子/>的信息量,这里通常对数所用的底为2,e或10。in represent individual particles /> The amount of information, where the base used for logarithms is usually 2, e or 10.

对于中心粒子和个体粒子互信息程度可以表达为:The degree of mutual information between the central particle and individual particles can be expressed as:

同理对于临时子群G也应用上述公式进行计算。Similarly, the above formula is also applied to the temporary subgroup G for calculation.

d)趋同操作算子:分别以选定的粒子为优胜中心、临时中心服从正态分布产生A个优胜子群与B个临时子群,计算所有子群中的个体粒子得分,选出优胜者作为中心重新生成子群。同时进行步骤c操作,若子群中个体粒子与中心粒子包含信息程度合适则继续下一步操作,否则重新生成粒子并进行步骤c操作。d) Convergence operator: use the selected particles as the winning center and the temporary center to obey the normal distribution to generate A winning subgroups and B temporary subgroups, calculate the individual particle scores in all subgroups, and select the winner Regenerate subgroups as centers. At the same time, perform the operation of step c, if the information contained in the individual particles and the central particle in the subgroup is appropriate, proceed to the next step, otherwise regenerate the particles and proceed to the operation of step c.

e)判断是否各子群是否成熟,若子群成熟继续下一步操作,否则继续步骤d操作。e) Judging whether each subgroup is mature, if the subgroup is mature, proceed to the next step, otherwise proceed to step d.

f)异化操作算子:将所有的成熟优胜子群和临时子群进行信息的交流,优胜子群M1,M2...MA,临时子群G1,G2,G3...GB进行竞争,若存在成熟的临时子群得分大于成熟的优胜子群得分,则优胜粒子的释放被临时子群替代并且重新补充临时子群。f) Alienation operation operator: exchange information between all mature superior subgroups and temporary subgroups, superior subgroups M 1 , M 2 ... M A , temporary subgroups G 1 , G 2 , G 3 .. .GB competes, if there is a mature temporary subgroup whose score is greater than that of the mature winning subgroup, the release of the winning particle is replaced by the temporary subgroup and the temporary subgroup is replenished.

g)判断若成熟临时子群中得分没有超过成熟优胜子群得分则跳出循环,否则重复进行步骤c-步骤g操作。g) Judging that if the score in the mature temporary subgroup does not exceed the score of the mature winning subgroup, jump out of the loop, otherwise repeat step c-step g.

同时上述操作中在步骤d中以最优粒子为中心生成子群,则需要新个体粒子的生成。新个体粒子生成引入熵变理论增加粒子生成的混沌程度,将熵变惯性系数加入粒子生成公式中。在搜索前期要求范围较大且与中心粒子的信息保持一定关系,后期则要求收敛较快,而引入惯性系数可提高对优秀粒子的搜索能力。At the same time, in the above operation, generating a subgroup centered on the optimal particle in step d requires the generation of new individual particles. The new individual particle generation introduces the theory of entropy change to increase the degree of chaos of particle generation, and adds the entropy change inertia coefficient to the particle generation formula. In the early stage of the search, it is required to have a large range and maintain a certain relationship with the information of the central particle, and in the later stage, it is required to converge faster, and the introduction of the inertia coefficient can improve the search ability for excellent particles.

对于生成新的个体粒子的方法给出以下方式进行:For the method of generating new individual particles, the following methods are given:

a)为扩大范围搜索,根据公式生成初始粒子矩阵RD×L′,其中粒子有D个、每个粒子长度为L′。粒子生成的公式为:Rd=mc+×θ×(2×Z1×L′-1),其中θ取0.5-1,Z1×L′为1×L′的正态分布0-1的数值矩阵,mc为中心粒子。a) In order to expand the range of search, the initial particle matrix R D×L' is generated according to the formula, in which there are D particles and the length of each particle is L'. The formula for particle generation is: R d =m c +×θ×(2×Z 1×L′ -1), where θ is 0.5-1, Z 1×L′ is the normal distribution of 1×L′ 0- 1 numerical matrix, m c is the central particle.

b)对RD×L′矩阵进行粒子的信息熵计算并进行粒子的混沌迭代。设置迭代次数为Kmax次,k为当次迭代次数的值,最终选择最佳的粒子。将粒子矩阵转换成S1×(D×L′)的矩阵,整体粒子的信息熵计算公式为:应用公式得到变化的权值: b) Calculate the information entropy of the particles on the RD×L' matrix and perform the chaotic iteration of the particles. Set the number of iterations to K max times, k is the value of the number of iterations at this time, and finally select the best particle. Convert the particle matrix into a matrix of S 1×(D×L′) , and the calculation formula of the information entropy of the whole particle is: Apply the formula to get the changed weights:

c)改变公式为Rd+1=wRd+(1-w)×θ×(2×Z1×L′-1),循环Kmax次后得到新的RD×L′,同时在循环过程中对于每个粒子得分高的会保留,循环结束后最终选择矩阵中得分最高的粒子为新粒子。c) Change the formula to R d+1 = wR d + (1-w) × θ × (2 × Z 1 × L' -1), get the new R D × L' after looping K max times, and at the same time in the loop During the process, the one with the highest score for each particle will be kept, and the particle with the highest score in the matrix will be finally selected as the new particle after the loop ends.

此过程为在某一粒子为中心在此范围内混沌搜索的过程,防止粒子搜索过拟合搜索,开始扩大范围最后逐渐向最优粒子靠近,提高在中心粒子周围的搜索精度。This process is a process of chaotic search within this range centered on a certain particle, preventing particle search from overfitting search, starting to expand the range and finally gradually approaching the optimal particle, and improving the search accuracy around the central particle.

Claims (5)

1. A quality prediction and control method is characterized in that: modeling of the method comprises the following four steps:
step 1: generating product quality model building required data, and simulating quality data of the product; the modes of the quality data of the simulation product comprise a normal mode, a periodic mode, a mixed mode, a system mode, an ascending trend mode, a descending trend mode, an upward step mode and a downward step mode;
step 2: and (3) establishing a characteristic self-adaptive processing module, preprocessing according to the quality data of the product in the step (1), wherein the characteristic self-adaptive processing module is established by two steps: extracting product quality data characteristics and establishing a self-adaptive characteristic selection model by applying an initialization MLPNN network;
step 3: establishing a data feature fusion module, and realizing feature fusion and data dimension reduction by applying a KPCA method, thereby simplifying a subsequent product quality trend prediction module;
step 4: establishing a product quality trend prediction module, optimizing an MLPNN neural network by using an improved thinking evolution algorithm, wherein the optimization target of the model is prediction accuracy, and introducing an entropy change theory by adding a mutual information judgment operator to make the algorithm obtain a prediction model;
the method for establishing the improved thinking evolutionary algorithm comprises the following steps:
a) Initializing population generation, and generating a population in a space;
b) Selecting particles with higher scores in the initialized population as a winning subgroup center and a temporary subgroup center respectively and generating subgroups;
c) Introducing an information judgment operator: calculating mutual information degrees between particles of each winning subgroup, temporary subgroup and central particles of the own subgroup respectively, if the mutual information degree of individual particles and the central particles is more than 85%, the particles are considered to be similar particles, and meanwhile, if the individual particles are higher than the central particles, the particles are reserved, otherwise, the particles are released;
d) Convergence operator: calculating individual particle scores in all subgroups, and selecting winners as centers to regenerate the subgroups; c, if the information degree of the individual particles and the central particles in the subgroup is proper, continuing the next operation, otherwise, regenerating the particles and performing the operation of the step c;
e) Judging whether each subgroup is mature or not, if so, continuing the next operation, otherwise, continuing the operation of the step d;
f) Dissimilating operation operators: carrying out information communication on all mature winning subgroups and temporary subgroups, wherein the temporary subgroup with higher score replaces the temporary subgroup with lower score;
g) And c, judging that if no score exceeds the mature winning subgroup in the mature temporary subgroup, jumping out of the circulation, otherwise, repeating the operations of the steps c-g.
2. The quality prediction and control method of claim 1, wherein: the data of the 9 modes generated in the step 1 are respectively: normal mode NOR, periodic mode CYC, system mode SYS, hierarchical mode STR, upward trend mode IT, downward trend mode DT, upward step mode US, downward step mode DS, mixed mode MIX.
3. The quality prediction and control method of claim 1, wherein: the statistical features of the quality data adopted in the first step in the step 2 include: MEAN, VS, STD, SKEW, KURT, A; the shape characteristics of the quality data include: SL, NC1, NC2, APML, APLS, AASL, ACLPI, SRANGE, SB, PSMLSC, REAE, ABDPE; and secondly, adopting an initialized MLPNN network to apply error influence degree algorithm to adaptively select the characteristics.
4. The quality prediction and control method of claim 1, wherein: and 3, fusing the self-adaptively selected characteristics with the original product quality data by using a KPCA method.
5. The quality prediction and control method of claim 1, wherein: and 4, designing a thinking evolutionary algorithm improved by the mutual information judgment operator, and improving a particle generation mode of the thinking evolutionary algorithm by applying an entropy change theory.
CN202010405648.2A 2020-05-14 2020-05-14 A Quality Trend Prediction Method Based on Adaptive Feature Selection and Improved Thinking Evolution Algorithm Active CN111598435B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202010405648.2A CN111598435B (en) 2020-05-14 2020-05-14 A Quality Trend Prediction Method Based on Adaptive Feature Selection and Improved Thinking Evolution Algorithm
PCT/CN2020/127970 WO2021227406A1 (en) 2020-05-14 2020-11-11 Quality trend prediction method based on adaptive feature selection and improved mind evolutionary algorithim

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010405648.2A CN111598435B (en) 2020-05-14 2020-05-14 A Quality Trend Prediction Method Based on Adaptive Feature Selection and Improved Thinking Evolution Algorithm

Publications (2)

Publication Number Publication Date
CN111598435A CN111598435A (en) 2020-08-28
CN111598435B true CN111598435B (en) 2023-08-04

Family

ID=72190810

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010405648.2A Active CN111598435B (en) 2020-05-14 2020-05-14 A Quality Trend Prediction Method Based on Adaptive Feature Selection and Improved Thinking Evolution Algorithm

Country Status (2)

Country Link
CN (1) CN111598435B (en)
WO (1) WO2021227406A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111598435B (en) * 2020-05-14 2023-08-04 北京工业大学 A Quality Trend Prediction Method Based on Adaptive Feature Selection and Improved Thinking Evolution Algorithm
CN114496209B (en) * 2022-02-18 2022-09-27 青岛市中心血站 Intelligent decision-making method and system for blood donation
CN114997070B (en) * 2022-07-15 2022-11-11 合肥中科迪宏自动化有限公司 Training method of control chart pattern recognition model and control chart pattern recognition method
CN117310118B (en) * 2023-11-28 2024-03-08 济南中安数码科技有限公司 A visual monitoring method for groundwater pollution

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105139093A (en) * 2015-09-07 2015-12-09 河海大学 Method for forecasting flood based on Boosting algorithm and support vector machine
CN106769030A (en) * 2016-11-10 2017-05-31 浙江工业大学 A kind of bearing state tracking and Forecasting Methodology based on MEA BP neural network algorithms
CN110047015A (en) * 2019-04-22 2019-07-23 水利部信息中心 A kind of water total amount prediction technique merging KPCA and thinking Optimized BP Neural Network
CN111027745A (en) * 2019-11-08 2020-04-17 广东财经大学 Stock index prediction method based on self-adaptive feature extraction

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111598435B (en) * 2020-05-14 2023-08-04 北京工业大学 A Quality Trend Prediction Method Based on Adaptive Feature Selection and Improved Thinking Evolution Algorithm

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105139093A (en) * 2015-09-07 2015-12-09 河海大学 Method for forecasting flood based on Boosting algorithm and support vector machine
CN106769030A (en) * 2016-11-10 2017-05-31 浙江工业大学 A kind of bearing state tracking and Forecasting Methodology based on MEA BP neural network algorithms
CN110047015A (en) * 2019-04-22 2019-07-23 水利部信息中心 A kind of water total amount prediction technique merging KPCA and thinking Optimized BP Neural Network
CN111027745A (en) * 2019-11-08 2020-04-17 广东财经大学 Stock index prediction method based on self-adaptive feature extraction

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
初红艳 ; 李鹏 ; 蔡力钢 ; 李风光 ; .基于MC方法和BP网络的印刷质量控制图模式识别研究.北京工业大学学报.(第06期),第816页数第1节-第820页第4节. *

Also Published As

Publication number Publication date
WO2021227406A1 (en) 2021-11-18
CN111598435A (en) 2020-08-28

Similar Documents

Publication Publication Date Title
CN111598435B (en) A Quality Trend Prediction Method Based on Adaptive Feature Selection and Improved Thinking Evolution Algorithm
CN106599797B (en) A kind of infrared face recognition method based on local parallel neural network
CN107688825B (en) An Improved Integrated Weighted Extreme Learning Machine Sewage Treatment Fault Diagnosis Method
CN102707256B (en) Fault diagnosis method based on BP-Ada Boost nerve network for electric energy meter
CN109829541A (en) Deep neural network incremental training method and system based on learning automaton
Chen et al. Adaptive feature selection-based AdaBoost-KNN with direct optimization for dynamic emotion recognition in human–robot interaction
CN113420508B (en) Unit combination calculation method based on LSTM
CN109492748A (en) A kind of Mid-long term load forecasting method for establishing model of the electric system based on convolutional neural networks
CN110009108A (en) A brand new quantum extreme learning machine
CN113807040B (en) An Optimal Design Method for Microwave Circuits
CN106650933A (en) Deep neural network optimizing method based on coevolution and back propagation
CN111311702A (en) Image generation and identification module and method based on BlockGAN
CN112149796A (en) Driving style identification method for optimizing BP neural network based on improved genetic algorithm
CN107832789A (en) Characteristic weighing k nearest neighbor method for diagnosing faults based on the conversion of average influence Value Data
CN108647772A (en) A method of it is rejected for slope monitoring data error
CN117010570A (en) Short-term power load prediction model based on CSWOA-TPA-BiGRU
CN116738339A (en) Multi-classification deep learning recognition detection method for small-sample electric signals
CN109214500A (en) A kind of transformer fault recognition methods based on integrated intelligent algorithm
CN116468075A (en) Big data processing method and system based on quantum fuzzy neural network model
CN107590538A (en) A kind of dangerous source discrimination based on online Sequence Learning machine
Khosla et al. A comparison of computational efforts between particle swarm optimization and genetic algorithm for identification of fuzzy models
CN108304856A (en) Image classification method based on cortex thalamus computation model
CN111498148A (en) FDNN-based intelligent spacecraft controller and control method
CN118035843A (en) EEG data classification method based on classical quantum hybrid convolutional neural network
CN102707661A (en) Multi-axle numerical servo-control system model identification method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant