CN111814342A - Complex equipment reliability hybrid model and construction method thereof - Google Patents

Complex equipment reliability hybrid model and construction method thereof Download PDF

Info

Publication number
CN111814342A
CN111814342A CN202010688009.1A CN202010688009A CN111814342A CN 111814342 A CN111814342 A CN 111814342A CN 202010688009 A CN202010688009 A CN 202010688009A CN 111814342 A CN111814342 A CN 111814342A
Authority
CN
China
Prior art keywords
distribution
parameter
data
psi
algorithm
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010688009.1A
Other languages
Chinese (zh)
Other versions
CN111814342B (en
Inventor
张琳
张保山
刘捷
李波
汪文峰
张搏
魏桥
魏圣军
高娜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Air Force Engineering University of PLA
Original Assignee
Air Force Engineering University of PLA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Air Force Engineering University of PLA filed Critical Air Force Engineering University of PLA
Priority to CN202010688009.1A priority Critical patent/CN111814342B/en
Publication of CN111814342A publication Critical patent/CN111814342A/en
Application granted granted Critical
Publication of CN111814342B publication Critical patent/CN111814342B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2119/00Details relating to the type or aim of the analysis or the optimisation
    • G06F2119/02Reliability analysis or reliability optimisation; Failure analysis, e.g. worst case scenario performance, failure mode and effects analysis [FMEA]

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Evolutionary Computation (AREA)
  • Geometry (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Complex Calculations (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a complex equipment reliability mixed model and a construction method thereof, wherein the method comprises the following steps: (S100) constructing a mixed distribution model by normal distribution, Weibull distribution, exponential distribution and lognormal distribution, and setting an observed equipment life data set to obtain a mixed density function of observed data; (S200) estimating unknown parameters in the mixed density function by adopting an EM algorithm, and converting the estimation of the unknown parameters into an optimization problem by means of maximum likelihood estimation; (S300) optimizing the complex equipment reliability modeling module by adopting Bayesian random classification and a K-means algorithm to obtain a complex equipment reliability mixed model based on Bayesian random classification. According to the method, the distribution parameters are optimized through the EM algorithm, and the Bayesian random classification and the K-means algorithm are combined to optimize the reliability modeling module of the complex equipment, so that the accuracy of parameter estimation is greatly improved, and the iteration speed is improved.

Description

Complex equipment reliability hybrid model and construction method thereof
Technical Field
The invention relates to a complex equipment reliability hybrid model, in particular to a complex equipment reliability hybrid model and a construction method thereof.
Background
The faults or failures of the equipment are random and difficult to avoid, so the time of the faults or failures of the equipment also occurs randomly, the fault rules of different equipment or the same equipment under different conditions are presented in different distribution types, and the general application range of the common fault distribution types is shown in table 1.
TABLE 1 Fault distribution function types and their application ranges
Figure BDA0002588290290000011
As can be seen from table 1, the failure rates of the same type of devices obey a certain specific probability distribution function, and if a parameter value of the failure probability distribution function of the certain type of devices can be obtained, the failure rates can be accurately characterized. As shown in fig. 1 (a), it is a fault distribution whose fault probability obeys a normal function, and it can be seen that if an equipment fault obeys a single fault distribution function, the fault rate at each moment is relatively easy to obtain, but when the equipment fault distribution is as shown in fig. 1 (b) and (c), that is, the equipment fault is simultaneously influenced by the combination of multiple fault distribution functions, the complexity of the equipment fault distribution is greatly increased, and because the conditions of the working environment, the use intensity, the processing technique and the like of the equipment are different, the difference of the fault distribution functions of the same type of equipment is relatively large, the fault characteristics of the equipment fault distribution functions often exhibit the characteristic of ambiguity [, so that it is very difficult to find a method for accurately characterizing the fault probability distribution function of the equipment fault.
For the reliability prediction problem in fig. 1 (b), (c), many scholars have proposed a series of models, such as black box theory based on fault interval time data modeling, and random process theory based on system state. However, both of the two reliability modeling theories have certain defects, only simple faults can be modeled according to the black box theory, and when system faults are caused by multiple failure modes, the accuracy rate is not high; the stochastic process theory requires that the service life distribution, the repair time distribution after the fault and other distributions of each part of the system are exponential distributions, and if the system does not meet the assumption, the stochastic process modeling method is very difficult.
Disclosure of Invention
The invention aims to provide a complex equipment reliability hybrid model and a construction method thereof, which solve the problem that the existing method can only carry out modeling on simple faults according to the black box theory, can optimize distribution parameters by an EM (effective electromagnetic radiation) algorithm aiming at various faults in hybrid distribution, and optimizes a complex equipment reliability modeling module by combining Bayesian random classification and a K-means algorithm, greatly improves the accuracy of parameter estimation, and improves the iteration speed.
In order to achieve the above object, the present invention provides a method for constructing a complex device reliability hybrid model, the method comprising:
(S100): constructing a mixed distribution model by normal distribution, Weibull distribution, exponential distribution and lognormal distribution, and setting an observed equipment life data set y (y ═ y)1,y2,…,yn)T,yjThe sample value representing the j-th collected point, j is 1,2, … …, n, n is the total number of the collected data, the observed device life data is from a mixed distribution formed by combining a normal distribution, an exponential distribution, a Weibull distribution and a lognormal distribution, and the distribution quantity of the normal distribution, the exponential distribution, the Weibull distribution and the lognormal distribution in the mixed distribution is weighted as {π1,π2,π3,π4}, the sum of which is 1, observed data yjThe mixing density function of (a) is as shown in formula (1):
Figure BDA0002588290290000021
wherein the content of the first and second substances,
Figure BDA0002588290290000031
Figure BDA0002588290290000032
Figure BDA0002588290290000033
Figure BDA0002588290290000034
in the formula, all unknown parameters are psi ═ (pi)12341122,λ,θ,β),π1、π2、π3、π4Weights, σ, representing the total distribution of normal, exponential, Weibull and lognormal distributions, respectively1、μ1Is a normally distributed parameter, theta and beta are parameters of Weibull distribution, lambda is a parameter of exponential distribution, sigma2、μ2A parameter that is lognormal distributed; pi represents a circumferential ratio; f. of1(yj) Is composed of
Figure BDA0002588290290000035
f2(yj) Is f2(yj;λ),f3(yj) Is f3(yj;θ,β),f4(yj) Is composed of
Figure BDA0002588290290000036
(S200) estimating unknown parameters psi in the mixed density function by adopting an EM algorithm, converting the estimation of the unknown parameters psi into an optimization problem by means of maximum likelihood estimation, wherein the optimized objective function is a likelihood function L (psi) or an equivalent log likelihood function lnL (psi), and the definition domain is the whole parameter value space;
the likelihood function for the unknown parameter Ψ is:
Figure BDA0002588290290000037
the equivalent log likelihood function lnL (Ψ) for the unknown parameter Ψ is:
Figure BDA0002588290290000038
Figure BDA0002588290290000041
(S300) optimizing the complex equipment reliability modeling module by adopting Bayesian random classification and K-means algorithm to obtain a complex equipment reliability mixed model based on Bayesian random classification, which comprises the following steps:
(S310) classifying the equipment life data set by using Kmeans cluster analysis, and performing maximum likelihood estimation on independent distribution parameters of each type of data to serve as prior distribution parameters of each independent distribution, wherein the distribution parameter corresponding to the ith type of life data is psiiData amount g of each type of life dataiThe ratio of the data quantity G occupying the service life data is the weight w of each independent distributioniIs wi=gi/G;
(S320) calculating the density function value f of the j point in the i type independent distributionij(yj) Density function values f of four distributionsij(yj) Respectively calculated by formulas (2) to (5), and normalized according to formula (14) to obtain pij,pijThe posterior probability of the j point belonging to the i type independent distribution is represented as follows:
Figure BDA0002588290290000042
(S330) assigning the jth point to the posterior probability interval of the ith type independent distribution
Figure BDA0002588290290000043
Wherein p isj00, each point generates a random probability r according to a uniform distributionjAccording to rjJudging the classification of the j point according to whether the i-th independent distribution posterior probability interval belongs to the i-th independent distribution posterior probability interval or not, and updating the corresponding parameters of each independent distribution according to the current sample classification;
(S340) calculating the maximum likelihood function value of the sample classification after Kmeans clustering according to the formula (15), and selecting the sample with the maximum likelihood function value as the current optimal classification:
Figure BDA0002588290290000044
wherein N represents the number of independent distributions;
(S350) repeating (S320) - (S340) until the maximum likelihood function converges or the number of iterations ends.
The method of constructing a hybrid model of complex plant reliability as claimed in claim 1, wherein in step (S200), under EM framework, each y isjOne component, considered to be from a finite mixture model, is given by z ═ z (z)1,z2,…,zn)TAn indicator vector representing the non-observable component, wherein
Figure BDA0002588290290000045
In formula (8), i is 1,2,3,4, j is 1,2, …, n;
the observation data vector y is equal to (y)1,y2,…,yn)TAnd missing data vector z ═ z (z)1,z2,…,zn)TCombined together to get the complete data vector x ═ (y)T,zT)TThen, in the mixed distribution model, the log likelihood obtained based on the complete data of the parameter ΨFunction lnLc(ψ) is:
Figure BDA0002588290290000051
preferably, in step (S200), the parameter estimation method based on the EM algorithm includes:
e, step E: computing conditional probability expectation Q (psi; psi) of a joint distributionk) In the k +1 th iteration of the EM algorithm, there are:
Figure BDA0002588290290000052
at a given y and current ΨkConditional expectation of log-likelihood function of complete data
Figure BDA0002588290290000053
Comprises the following steps:
Figure BDA0002588290290000054
wherein, taui(yj;Ψk) Is the jth observable data yjA posterior probability of an ith component belonging to the finite mixture model; superscript k represents the kth iteration;
then the log-likelihood function lnL derived from the complete data based on the parameter Ψ is combinedc(psi) conditional expectation of log-likelihood function of complete data
Figure BDA0002588290290000055
From formulae (9) and (11), it is possible:
Figure BDA0002588290290000061
wherein, piiWeights for the respective distributions;
and M: updating the estimated value Ψ of Ψk+1Such that the entire parameter space of Ψ is Q (Ψ; Ψ)k) The function takes the maximum value, and the parameter iteration formula is as follows:
Figure BDA0002588290290000062
determination of parameter Ψk+1If the convergence is not achieved, continuing to repeat the steps E and M until the convergence is achieved finally, and outputting psik+1
It is another object of the present invention to provide a complex plant reliability hybrid model constructed by the method.
The complex equipment reliability hybrid model and the construction method thereof solve the problem that the existing model can only be established for simple faults according to the black box theory, and have the following advantages:
according to the model constructed by the method, the EM algorithm is adopted to optimize the distribution parameters, and although the EM algorithm has the advantages of easiness in implementation, high reliability, easiness in understanding and the like, the whole data set needs to be traversed during iteration, when the data volume is excessive, the iteration speed of the algorithm is low, and the EM algorithm has global search capability unlike a genetic algorithm, so that the problems that whether the EM algorithm can obtain a global optimal solution or not and whether the selected initial parameters are appropriate or not are excessively depended on exist. The invention combines a Bayesian random classification method to randomly distribute the data sets to each independent distribution so as to make up the defects.
The model constructed by the method overcomes the defect that a mixed model constructed by the traditional EM algorithm is easy to be locally optimal, parameters of each independent distribution are adaptively adjusted according to data characteristics, the weights of each independent distribution can be used for accepting or rejecting whether each independent distribution is reserved, the performance of the model is far higher than that of the mixed model constructed by the traditional EM algorithm when the parameters are fitted, the model is iterated for 7 times, the precision of parameter estimation is up to 99.97%, however, the iteration frequency of the mixed model constructed by the traditional EM algorithm is about 35 times, and the precision is only 82.17%.
Drawings
Fig. 1 is a three-dimensional diagram of a distribution function of a fault of a conventional device.
FIG. 2 is a flow chart of the method of the present invention.
Fig. 3 is a conventional EM algorithm computation framework diagram.
FIG. 4 shows normal distribution parameters
Figure BDA0002588290290000071
And (5) a variation graph.
FIG. 5 shows a normal distribution parameter μ1And (5) a variation graph.
Fig. 6 is a graph of the variation of the exponential distribution parameter λ.
Fig. 7 is a diagram showing the variation of the weibull distribution parameter β.
Fig. 8 is a diagram showing a variation of the weibull distribution parameter θ.
FIG. 9 is a diagram of a lognormal distribution parameter μ2And (5) a variation graph.
FIG. 10 is a diagram of lognormal distribution parameters
Figure BDA0002588290290000072
And (5) a variation graph.
FIG. 11 shows distribution weights π1、π2、π3、π4A variation diagram of (2).
FIG. 12 is a Bayesian stochastic classification reliability hybrid model error graph.
FIG. 13 is a hybrid model error map of a conventional EM algorithm.
FIG. 14 is a graph of a Bayesian stochastic classification reliability mixture model cumulative distribution function.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, are within the scope of the present invention.
Example 1
A method for constructing a complex equipment reliability mixed model is shown in a flow chart of fig. 2, the complex equipment reliability mixed model based on Bayesian random classification is constructed according to a black box theory, the defect that a traditional EM algorithm is easy to be trapped in local optimization is optimized by means of the Bayesian random classification, the faults aimed at by the mixed model are the result of comprehensive action of various fault distributions (table 1), sudden failures are random, and degeneration failures are predictable.
The EM algorithm is an optimization algorithm capable of performing Maximum Likelihood Estimation (MLE) through iteration, and is generally used for performing parameter estimation on a probability model containing hidden variables (latent variables) or missing data (incomplete-data), and the basic process of the EM algorithm is as follows: firstly, estimating the value of a model parameter according to the given observation data; then, the value of the missing data is estimated according to the parameter value estimated in the previous step, the parameter value is estimated again according to the estimated missing data and the previously observed data, iteration is repeated until the last convergence is reached, and the iteration is finished, as shown in fig. 3, a frame diagram is calculated for the EM algorithm.
The specific steps of the traditional EM algorithm are as follows:
(S1) inputting: observed data x ═ x1,x2,…,xn)TJoint distribution p (x, z; theta), conditional distribution p (z | x; theta), maximum number of iterations J, where z ═ z (z ═ x; theta)1,z2,…,zn)TIf the implicit data is not observed, theta is a sample model parameter, and T represents transposition;
(S2) initializing the initial value theta of the model parameter theta at random0I.e. the shape and position of the initialization curve (1);
(S3) starting EM algorithm iteration:
(S31) step E: computing a conditional probability expectation L (θ, θ) of the joint distributionj) I.e. the increment of curve (1) to curve (2):
Qi(zi)=P(zi|xij) (1’)
Figure BDA0002588290290000091
wherein Q isi(zi) For implicit data ziDistribution of (2).
(S32) M step: maximization of L (theta )j) To obtain thetaj+1Namely, calculating a parameter theta when the curve (2) intersects the curve (4):
Figure BDA0002588290290000092
(S33) if thetaj+1And after convergence, the algorithm is ended, otherwise, the algorithm returns to the step (S31) to perform E-step iteration operation, namely when the curve (3) is intersected with the curve (4), another parameter of the curve (3) is changed, and the iteration operation is continued.
(S4) the model parameter θ is output.
The method for constructing the complex equipment reliability hybrid model specifically comprises the following steps:
(S100): constructing a mixed distribution model by normal distribution, Weibull distribution, exponential distribution and lognormal distribution, and setting an observed equipment life data set y (y ═ y)1,y2,…,yn)T,yjA sample value indicating a j-th collected point, j being 1,2, … …, n, n being the total number of collected data, and the observed data being from a mixed distribution in which a normal distribution, an exponential distribution, a weibull distribution, and a lognormal distribution are combined, wherein weights of distribution amounts of the normal distribution, the exponential distribution, the weibull distribution, and the lognormal distribution are expressed as {π1,π2,π3,π4} in the mixed distribution, the sum thereof is 1; then observe data yjIs shown in formula (1):
Figure BDA0002588290290000093
wherein the content of the first and second substances,
Figure BDA0002588290290000094
Figure BDA0002588290290000095
Figure BDA0002588290290000096
Figure BDA0002588290290000101
in the formula, all unknown parameters are psi ═ (pi)12341122,λ,θ,β),π1、π2、π3、π4Weights, σ, representing the total distribution of normal, exponential, Weibull and lognormal distributions, respectively1、μ1Is a normally distributed parameter, theta and beta are parameters of Weibull distribution, lambda is a parameter of exponential distribution, sigma2、μ2A parameter that is lognormal distributed; pi represents the weight of each distribution in the total distribution; f. of1(yj) Is composed of
Figure BDA0002588290290000102
f2(yj) Is f2(yj;λ),f3(yj) Is f3(yj;θ,β),f4(yj) Is composed of
Figure BDA0002588290290000103
For mixed distributions, it is difficult to resolve each sample value y from the data itself alonejFrom which distribution, the observation does not contain all the information of the data in this sense, and is "incomplete data". The estimates of the finite mixture model of the four distributions finally end up as an estimate of the parameter vector Ψ.
(S200) estimating the unknown parameter Ψ in the mixed density function by using an EM algorithm, and transforming the estimation of the unknown parameter Ψ into an optimization problem by means of Maximum Likelihood Estimation (MLE), wherein an optimized objective function is a likelihood function L (Ψ) or an equivalent log likelihood function lnL (Ψ), and a domain of the optimization problem is the whole parameter value space, specifically as follows:
the likelihood function for the unknown parameter Ψ is:
Figure BDA0002588290290000104
Figure BDA0002588290290000105
under EM framework, each yjOne component, considered to be from a finite mixture model, is given by z ═ z (z)1,z2,…,zn)TAn indicator vector representing the non-observable component, wherein
Figure BDA0002588290290000106
Wherein, i is 1,2,3,4, j is 1,2, …, n.
The observation data vector y is equal to (y)1,y2,…,yn)TAnd missing data vector z ═ z (z)1,z2,…,zn)TCombined together, the complete data vector x ═ y is obtainedT,zT)T. Then the log-likelihood function lnL based on the complete data for the parameter Ψ in the mixed distribution modelc(ψ) is:
Figure BDA0002588290290000111
the parameter estimation steps based on the EM algorithm are as follows:
e, step E: calculating the conditional probability expectation Q (psi; psi) of the joint distribution using the E step in the conventional EM algorithm calculationk) In the k +1 th iteration of the EM algorithm, there are:
Figure BDA0002588290290000112
at a given y and current ΨkConditional expectation of log-likelihood function of complete data
Figure BDA0002588290290000113
Comprises the following steps:
Figure BDA0002588290290000114
wherein, taui(yj;Ψk) Is the jth observable data yjA posterior probability of an ith component belonging to the finite mixture model; the superscript k represents the kth iteration.
Then the log-likelihood function lnL derived from the complete data based on the parameter Ψ is combinedc(psi) conditional expectation of log-likelihood function of complete data
Figure BDA0002588290290000115
From formulae (9) and (11), it is possible:
Figure BDA0002588290290000121
wherein, piiThe weights of the respective distributions.
And M: updating the estimated value psi of psi by using M steps in the traditional EM algorithm calculationk+1Such that the entire parameter space of Ψ is Q (Ψ; Ψ)k) The function takes the maximum value, and the parameter iteration formula is as follows:
Figure BDA0002588290290000122
determination of parameter Ψk+1If the convergence is not achieved, continuing to repeat the steps E and M until the convergence is achieved finally, and outputting psik+1
(S300) modeling of complex equipment reliability hybrid model based on Bayesian stochastic classification
And optimizing the reliability modeling module of the complex equipment by adopting Bayesian random classification and a K-means algorithm to obtain a complex equipment reliability mixed model based on Bayesian random classification.
Although the EM algorithm has the advantages of easiness in implementation, high reliability, easiness in understanding and the like, the whole data set needs to be traversed when the EM algorithm is iterated, when the data volume is excessive, the iteration speed of the EM algorithm is low, and the EM algorithm does not have the global search capability like a genetic algorithm, whether the EM algorithm can obtain a global optimal solution or not depends excessively on whether the selected initial parameters are proper or not. The traditional EM algorithm adopts a random selection or experience selection method to initialize the parameters to be estimated, so that the iterative algorithm is easy to fall into local optimum. The mixed die is generally obtained by superposing a plurality of independent distributions, a certain intersection exists between the independent distributions, and if absolute classification initialization parameters are adopted, the parameters are inconsistent with the theoretical situation.
Aiming at the problems existing in the optimization of the EM initial parameters, a Bayesian random classification method is adopted to randomly distribute the data sets to each independent distribution so as to make up the defects. The method comprises the following specific steps:
(S310) classifying the equipment life data set by using Kmeans cluster analysis, and performing maximum likelihood estimation on independent distribution parameters of each type of data to serve as prior distribution parameters of each independent distribution, wherein the distribution parameter corresponding to the ith type of life data is psiiData amount g of each type of life dataiThe ratio of the data quantity G occupying the service life data is the weight w of each independent distributioniIs wi=gi/G。
(S320) calculating the density function value f of the j point in the i type independent distributionij(yj) Density function values f of four distributionsij(yj) Respectively calculated by formulas (2) to (5), and normalized according to formula (14) to obtain pij,pijThe posterior probability of the j point belonging to the i type independent distribution is represented as follows:
Figure BDA0002588290290000131
(S330) assigning the jth point to the posterior probability interval of the ith type independent distribution
Figure BDA0002588290290000132
Wherein p isj00, each point generates a random probability r according to a uniform distributionjAccording to rjWhether the classification belongs to the posterior probability interval of the ith independent distribution or not is judged, and the corresponding parameter of each independent distribution is updated according to the classification of the current sample.
(S340) calculating the maximum likelihood function value of the sample classification after Kmeans clustering according to the formula (15), and selecting the sample with the maximum likelihood function value as the current optimal classification:
Figure BDA0002588290290000133
where N represents the number of independent distributions.
(S350) repeating (S320) - (S340) until the maximum likelihood function converges or the number of iterations ends.
Specific application of the model of example 1
In order to verify the accuracy of the model constructed by the invention, the mixed distribution of the reliability of the complex equipment is constructed by normal distribution, exponential distribution, Weibull distribution and logarithmic normal distribution, the data of the mixed distribution is generated by randomly selecting mixed distribution parameters, the specific parameter setting is shown as the mixed distribution 1 in table 2, the capacity of the generated reliability data is 10000, the cluster number of the Bayesian random classification reliability mixed model is 4, the maximum iteration time is set to be 1000 times, the absolute value of the difference value between the random mixed distribution of the generated data and the complex equipment reliability mixed model based on the Bayesian random classification is defined as an error, and the setting error is less than 1 multiplied by 10-3For the convergence condition, a reliability mixed model constructed by the traditional EM algorithm is selected for the comparison test, the set parameters of the reliability mixed model are the same as those of a complex equipment reliability mixed model based on Bayesian random classification, and the effectiveness of the model provided by the invention is verified by comparing the iteration errors of the two algorithms.
TABLE 2 respective mixing distribution parameters
Figure BDA0002588290290000141
As can be seen from fig. 4-11 and fig. 14, the bayesian stochastic classification reliability hybrid model constructed by the present invention can overcome the defect that the hybrid model constructed by the conventional EM algorithm is prone to be locally optimal, adaptively adjust the parameters of each independent distribution according to the data characteristics, and use the weight of each independent distribution to accept or reject whether each independent distribution is reserved. As can be seen from fig. 12-13, the performance of the bayesian stochastic classification reliability hybrid model when the bayesian stochastic classification reliability hybrid model is fitted to the parameters is much higher than that of the hybrid model constructed by the conventional EM algorithm, which only iterates 7 times, and the accuracy of the parameter estimation is as high as 99.97%, whereas the hybrid model constructed by the conventional EM algorithm starts to converge about 35 times and only reaches 82.17%.
While the present invention has been described in detail with reference to the preferred embodiments thereof, it should be understood that the above description should not be taken as limiting the invention. Various modifications and alterations to this invention will become apparent to those skilled in the art upon reading the foregoing description. Accordingly, the scope of the invention should be determined from the following claims.

Claims (4)

1. A method for constructing a complex equipment reliability hybrid model is characterized by comprising the following steps:
(S100): constructing a mixed distribution model by normal distribution, Weibull distribution, exponential distribution and lognormal distribution, and setting an observed equipment life data set y (y ═ y)1,y2,…,yn)T,yjThe sample value representing the j-th collected point, j being 1,2, … …, n, n being the total number of collected data, the observed device lifetime data being from a mixed distribution in which a normal distribution, an exponential distribution, a weibull distribution, and a lognormal distribution are combined, and the weight of the distribution quantity of the normal distribution, the exponential distribution, the weibull distribution, and the lognormal distribution in the mixed distribution being {π 1,π2,π3,π4}, the sum of which is 1, observed data yjIs represented by the formula (1):
Figure FDA0002588290280000011
wherein the content of the first and second substances,
Figure FDA0002588290280000012
Figure FDA0002588290280000013
Figure FDA0002588290280000014
Figure FDA0002588290280000015
in the formula, all unknown parameters are psi ═ (pi)12341122,λ,θ,β),π1、π2、π3、π4Weights, σ, representing the total distribution of normal, exponential, Weibull and lognormal distributions, respectively1、μ1Is a normally distributed parameter, theta and beta are parameters of Weibull distribution, lambda is a parameter of exponential distribution, sigma2、μ2A parameter that is lognormal distributed; pi represents a circumferential ratio; f. of1(yj) Is composed of
Figure FDA0002588290280000016
f2(yj) Is f2(yj;λ),f3(yj) Is f3(yj;θ,β),f4(yj) Is composed of
Figure FDA0002588290280000017
(S200) estimating unknown parameters psi in the mixed density function by adopting an EM algorithm, converting the estimation of the unknown parameters psi into an optimization problem by means of maximum likelihood estimation, wherein the optimized objective function is a likelihood function L (psi) or an equivalent log likelihood function lnL (psi), and the definition domain is the whole parameter value space;
the likelihood function for the unknown parameter Ψ is:
Figure FDA0002588290280000021
the equivalent log likelihood function lnL (Ψ) for the unknown parameter Ψ is:
Figure FDA0002588290280000022
(S300) optimizing the reliability modeling module of the complex equipment by adopting Bayesian random classification and K-means algorithm to obtain a complex equipment reliability mixed model based on Bayesian random classification, which comprises the following steps:
(S310) classifying the equipment life data set by using Kmeans cluster analysis, and carrying out maximum likelihood estimation on independent distribution parameters of each type of data to serve as prior distribution parameters of each independent distribution, wherein the distribution parameter corresponding to the ith type of life data is psiiData amount g of each type of life dataiThe ratio of the data quantity G occupying the service life data is the weight w of each independent distributioniIs wi=gi/G;
(S320) calculating the density function value f of the j point in the i type independent distributionij(yj) Density function values f of four distributionsij(yj) Respectively calculated by formulas (2) to (5), and normalized according to formula (14) to obtain pij,pijThe posterior probability of the j point belonging to the i type independent distribution is represented as follows:
Figure FDA0002588290280000023
(S330) assigning the jth point to the posterior probability interval of the ith type independent distribution
Figure FDA0002588290280000024
Wherein p isj00, each point generates a random probability r according to a uniform distributionjAccording to rjJudging the classification of the j point according to whether the i-th independent distribution posterior probability interval belongs to the i-th independent distribution posterior probability interval or not, and updating the corresponding parameters of each independent distribution according to the current sample classification;
(S340) calculating the maximum likelihood function value of the sample classification after Kmeans clustering according to the formula (15), and selecting the sample classification with the maximum likelihood function value as the current optimal classification:
Figure FDA0002588290280000031
wherein N represents the number of independent distributions;
(S350) repeating (S320) - (S340) until the maximum likelihood function converges or the number of iterations ends.
2. The method of constructing a hybrid model of complex plant reliability as claimed in claim 1, wherein in step (S200), under EM framework, each y isjOne component, considered to be from a finite mixture model, is given by z ═ z (z)1,z2,…,zn)TAn indicator vector representing the non-observable component, wherein
Figure FDA0002588290280000032
In formula (8), i is 1,2,3,4, j is 1,2, …, n;
the observation data vector y is equal to (y)1,y2,…,yn)TAnd missing data vector z ═ z (z)1,z2,…,zn)TCombined together, the complete data vector x ═ y is obtainedT,zT)TThen, in the mixed distribution model, the log-likelihood function lnL based on the complete data of the parameter Ψc(ψ) is:
Figure FDA0002588290280000033
3. the method for constructing a hybrid model of complex plant reliability according to claim 2, wherein in the step (S200), the parameter estimation method based on the EM algorithm comprises:
e, step E: computing conditional probability expectation Q (psi; psi) of a joint distributionk) In the k +1 th iteration of the EM algorithm, there are:
Figure FDA0002588290280000034
at a given y and current ΨkConditional expectation of log-likelihood function of complete data
Figure FDA0002588290280000035
Comprises the following steps:
Figure FDA0002588290280000041
wherein, taui(yj;Ψk) Is the jth observable data yjA posterior probability of an ith component belonging to the finite mixture model; superscript k represents the kth iteration;
then the log-likelihood function lnL derived from the complete data based on the parameter Ψ is combinedc(psi) conditional expectation of log-likelihood function of complete data
Figure FDA0002588290280000042
From formulae (9) and (11), it is possible:
Figure FDA0002588290280000043
wherein, piiWeights for the respective distributions;
and M: updating the estimated value Ψ of Ψk+1Such that the entire parameter space of Ψ is Q (Ψ; Ψ)k) The function takes the maximum value, and the parameter iteration formula is as follows:
Figure FDA0002588290280000051
determination of parameter Ψk+1If the convergence is not achieved, continuing to repeat the steps E and M until the convergence is achieved finally, and outputting psik+1
4. A complex device reliability hybrid model constructed by the method of any one of claims 1-3.
CN202010688009.1A 2020-07-16 2020-07-16 Complex equipment reliability hybrid model and construction method thereof Active CN111814342B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010688009.1A CN111814342B (en) 2020-07-16 2020-07-16 Complex equipment reliability hybrid model and construction method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010688009.1A CN111814342B (en) 2020-07-16 2020-07-16 Complex equipment reliability hybrid model and construction method thereof

Publications (2)

Publication Number Publication Date
CN111814342A true CN111814342A (en) 2020-10-23
CN111814342B CN111814342B (en) 2022-10-11

Family

ID=72865693

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010688009.1A Active CN111814342B (en) 2020-07-16 2020-07-16 Complex equipment reliability hybrid model and construction method thereof

Country Status (1)

Country Link
CN (1) CN111814342B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112529037A (en) * 2020-11-06 2021-03-19 重庆恢恢信息技术有限公司 Construction site construction optimization method based on big data analysis
CN113420425A (en) * 2021-06-03 2021-09-21 杭州电子科技大学 ARMA model parameter estimation method and system based on low-quality data
CN114972341A (en) * 2022-07-28 2022-08-30 华南理工大学 WSI image classification method, system and medium based on Bayesian assisted learning
CN115329895A (en) * 2022-09-06 2022-11-11 南昌大学 Multi-source heterogeneous data noise reduction analysis processing method
CN117371876A (en) * 2023-12-07 2024-01-09 深圳品阔信息技术有限公司 Index data analysis method and system based on keywords

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106683122A (en) * 2016-12-16 2017-05-17 华南理工大学 Particle filtering method based on Gaussian mixture model and variational Bayes
CN111369002A (en) * 2020-02-20 2020-07-03 杭州电子科技大学 Gibbs parameter sampling method applied to random point mode finite hybrid model

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106683122A (en) * 2016-12-16 2017-05-17 华南理工大学 Particle filtering method based on Gaussian mixture model and variational Bayes
CN111369002A (en) * 2020-02-20 2020-07-03 杭州电子科技大学 Gibbs parameter sampling method applied to random point mode finite hybrid model

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
王小英等: "正态分布和瑞利分布混合情形下的参数估计及分类问题", 《数学建模及其应用》 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112529037A (en) * 2020-11-06 2021-03-19 重庆恢恢信息技术有限公司 Construction site construction optimization method based on big data analysis
CN113420425A (en) * 2021-06-03 2021-09-21 杭州电子科技大学 ARMA model parameter estimation method and system based on low-quality data
CN114972341A (en) * 2022-07-28 2022-08-30 华南理工大学 WSI image classification method, system and medium based on Bayesian assisted learning
CN114972341B (en) * 2022-07-28 2022-12-16 华南理工大学 WSI image classification method, system and medium based on Bayesian assisted learning
CN115329895A (en) * 2022-09-06 2022-11-11 南昌大学 Multi-source heterogeneous data noise reduction analysis processing method
CN117371876A (en) * 2023-12-07 2024-01-09 深圳品阔信息技术有限公司 Index data analysis method and system based on keywords
CN117371876B (en) * 2023-12-07 2024-04-02 深圳品阔信息技术有限公司 Index data analysis method and system based on keywords

Also Published As

Publication number Publication date
CN111814342B (en) 2022-10-11

Similar Documents

Publication Publication Date Title
CN111814342B (en) Complex equipment reliability hybrid model and construction method thereof
Malinsky et al. Causal structure learning from multivariate time series in settings with unmeasured confounding
CN106874581B (en) Building air conditioner energy consumption prediction method based on BP neural network model
CN111860982A (en) Wind power plant short-term wind power prediction method based on VMD-FCM-GRU
US20020138457A1 (en) Approximate fitness functions
CN110659722A (en) AdaBoost-CBP neural network-based electric vehicle lithium ion battery health state estimation method
CN110910004A (en) Reservoir dispatching rule extraction method and system with multiple uncertainties
CN113962145A (en) Parameter uncertainty quantitative modeling method under interval data sample condition
CN111027229B (en) Wind power curve fitting method based on sparse heteroscedastic multi-spline regression
CN109540089B (en) Bridge deck elevation fitting method based on Bayes-Kriging model
CN107204616B (en) Power system random state estimation method based on self-adaptive sparse pseudo-spectral method
Zhang et al. A hypothesis testing framework for modularity based network community detection
CN115408949B (en) Load model parameter identification method System, apparatus and medium
CN113222263A (en) Photovoltaic power generation power prediction method based on long-term and short-term memory neural network
Ao et al. Entropy estimation via normalizing flow
CN112287605A (en) Flow check method based on graph convolution network acceleration
CN116303786B (en) Block chain financial big data management system based on multidimensional data fusion algorithm
Chaudhuri et al. On a variational approximation based empirical likelihood ABC method
Ridder Asymptotic optimality of the cross-entropy method for Markov chain problems
CN115712977A (en) Gear reducer robust optimization design method based on assistance of Kriging surrogate model
Cho et al. An adaptive reduced basis collocation method based on PCM ANOVA decomposition for anisotropic stochastic PDEs
CN113807019A (en) MCMC wind power simulation method based on improved scene classification and coarse grain removal
CN112464173A (en) MLE and BIC based random point mode parameter estimation method
Sandmann Relative error and asymptotic optimality in estimating rare event probabilities by importance sampling
CN116757098B (en) Automatic verification method based on SWAT model multi-objective optimization

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant