CN109460862A - The method that meta-heuristic algorithms based on MAB solve multi-objective optimization question - Google Patents
The method that meta-heuristic algorithms based on MAB solve multi-objective optimization question Download PDFInfo
- Publication number
- CN109460862A CN109460862A CN201811230929.8A CN201811230929A CN109460862A CN 109460862 A CN109460862 A CN 109460862A CN 201811230929 A CN201811230929 A CN 201811230929A CN 109460862 A CN109460862 A CN 109460862A
- Authority
- CN
- China
- Prior art keywords
- value
- heuristic
- low layer
- operator
- population
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/004—Artificial life, i.e. computing arrangements simulating life
- G06N3/006—Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
- G06Q10/063—Operations research, analysis or management
- G06Q10/0639—Performance analysis of employees; Performance analysis of enterprise or organisation operations
- G06Q10/06393—Score-carding, benchmarking or key performance indicator [KPI] analysis
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Human Resources & Organizations (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Economics (AREA)
- Strategic Management (AREA)
- Entrepreneurship & Innovation (AREA)
- Development Economics (AREA)
- General Physics & Mathematics (AREA)
- Game Theory and Decision Science (AREA)
- Educational Administration (AREA)
- Tourism & Hospitality (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- Marketing (AREA)
- General Business, Economics & Management (AREA)
- Biophysics (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The present invention relates to meta-heuristic algorithms fields, a kind of more particularly to method that the meta-heuristic algorithms based on MAB solve multi-objective optimization question, this method is using MAB strategy as learning strategy, and the performance of the heuristic operator of each low layer is assessed using four kinds of Performance Evaluation mechanism, the advantages of by the mechanism that learns and select preferably to combine each low layer heuristic operator;The algorithm is tested on continuous multi-objective optimization question collection WFG, and achieves good experimental result.Versatility is poor when it is an object of the invention to overcome the design of traditional heuritic approach and single heuritic approach may in some problem-instances the poor problem of effect, propose a kind of method that the meta-heuristic algorithms based on MAB solve multi-objective optimization question.
Description
Technical field
The present invention relates to meta-heuristic algorithms fields more particularly to a kind of meta-heuristic algorithms based on MAB to solve more mesh
The method for marking optimization problem.
Background technique
Multi-objective optimization question is used as one kind NP double linear problems of difficulty for solving, and big quantity algorithm is for solving such problem.Researcher's warp
The heuritic approach for often taking problem to customize, to obtain acceptable solution within reasonable time.Heuritic approach
The advantages of domain knowledge is incorporated according to problem with being to can be convenient, disadvantage is that versatility is bad, it is often necessary to be directed to problem
Design specialization algorithm.Since the design of heuritic approach has stronger problem correlation, which increases the works of algorithm design
It measures.In addition to this, single heuritic approach may obtain more excellent effect in some problem-instances, and in other examples
Upper effect is poor, i.e., it cannot be guaranteed that single heuritic approach can obtain high quality results in all problems example.
Meta-heuristic algorithms (Hyper-heuristic, HH) are as a kind of heuristic operator selection or heuristic operator
The searching method of building can solve problem above.A given search problem and a heuristic operator relevant to problem
Gather (the referred to as heuristic operator set of low layer).The neighborhood space of meta-heuristic algorithms not instead of direct search problem, conduct
Search is risen a level by high-rise heuristic strategies, to search for the neighborhood space of the heuristic operator of low layer.In search process
In, meta-heuristic algorithms select from the heuristic operator set of low layer according to different solving states and apply suitable low layer
Heuristic operator, the heuristic operator of the low layer finally selected constitute the heuristic sequence of operators of low layer.This method can incite somebody to action
The advantages of each low layer heuristic operator, combines, and the shortcomings that avoid the heuristic operator of low layer to a certain extent.Meta-heuristic
Algorithm solves on single-object problem and has been achieved for preferable effect, however, meta-heuristic algorithms are excellent in multiple target
In change problem using less.
Through the retrieval discovery to prior art document, Burke et al. is on " Springer " (2005, pp:129-158)
Publish "Metaheuristics:Progress as Real Problem Solvers" propose using reinforce study and
TABU search alternatively tactful meta-heuristic algorithms for solving the distribution of multiple target space and time planning problem.
Vazquez-And Petrovic " Journal of Heuristics " (2010, Vol.16, No.6, pp:
771-793) article " the A new dispatching rule based genetic algorithm for the delivered on
Multi-objective job shop problem ", proposes the meta-heuristic algorithms based on scheduling rule and genetic algorithm
For solving multi-objective Job Shop problem.However, meta-heuristic algorithms set forth above are only in noncontinuity multiple-objection optimization
It is solved in problem.Mashael Maashi et al. " Expert Systems with Applications " (2014,
Vol.41, pp:4475-4493) on article " the A multi-objective hyper-heuristic based on that delivers
Choice function " proposes a kind of meta-heuristic algorithms HH_CF based on selection function for solving the more mesh of continuity
Optimization problem is marked, and achieves preferable effect.But the algorithm uses the interval of the heuristic operator of low layer (when CPU
Between) parameter of alternatively mechanism, so that operational effect less stable on the different machine of algorithm.Giovani
Article " the A Hyper- that Guizzo et al. is delivered on " GECCO " (2015, Madrid, Spain, pp:1343-1350)
Heuristic for the Multi-Objective Integration and Test Order Problem " proposes two
Kind meta-heuristic algorithms HITO-CF and HITO-MAB is for solving the multiple target software engineering method problem based on search.
HITO-CF and HITO-MAB uses CF and MAB using a variety of crossover operators and mutation operator as the heuristic operator of low layer respectively
It is alternatively tactful.In the selection process, cumulative summation is used according to the dominance relation between parent individuality and offspring individual
The return value of the method calculating heuristic operator of low layer.However this return value calculating method focuses on the convergence of disaggregation, and neglect
The distributivity of disaggregation is omited.
Summary of the invention
Versatility is poor when it is an object of the invention to overcome the design of traditional heuritic approach and single heuritic approach
May in some problem-instances the poor problem of effect, it is excellent to propose that a kind of meta-heuristic algorithms based on MAB solve multiple target
The method of change problem, this method are assessed each low layer using four kinds of Performance Evaluation mechanism and are inspired using MAB strategy as learning strategy
The performance of formula operator, the advantages of by the mechanism that learns and select preferably to combine each low layer heuristic operator;The algorithm exists
It is tested on continuous multi-objective optimization question collection WFG, and achieves good experimental result.
To achieve the goals above, the present invention adopts the following technical scheme that: a kind of meta-heuristic algorithms based on MAB
The method for solving multi-objective optimization question, this method comprises:
(1) the set H={ h of the heuristic operator of low layer is provided1, h2...hnum, wherein num is the heuristic operator of low layer
Number, h1, h2...hnumFor multi-objective optimization algorithm or operator;
(2) the random initial population P that construction size is N0, in decision space X, N number of solution is generated at random;
(3) performance number of each heuristic operator of low layer in four existing Performance Evaluation indexs is calculated.Each low layer opens
Hairdo operator h1, h2...hnum(f=1,2...num) independently runs once same initial population P0, and each run is held
Row hiTotal M generation, and calculate separately each heuristic operator hiFour performance evaluation indexes AE, RNI, SSC and UD;M is primary changes
The number of the heuristic operator of low layer is executed in generation, M takes the integer in (0, ∞) range;
1) AE: assessment algorithm obtains the calculation amount of approximate disaggregation, and value range is (0, ∞), the smaller representative of AE value
It can be better;
2) RNI: assessment algorithm obtain approximate solution concentrate non-domination solution ratio, value range be (0,1], RNI value
It is bigger to indicate that approximate solution concentrates the ratio of non-domination solution higher;If RNI=1, all individuals for showing that approximate solution is concentrated all are
Non-dominant;
3) SSC: assessment algorithm obtain approximate disaggregation institute covered object space size, value range for [0,
∞);SSC value is higher, and the covering space for indicating seemingly nearly disaggregation is bigger, also indicates that performance is better on convergence and distributivity;
4) UD: assessing approximate disaggregation in the distribution situation of object space, value range be (0,1];In non-dominant skill
It is better to measure the higher distributivity for indicating approximate disaggregation on object space of UD value in identical situation;
(4) the return value r of the heuristic operator of each low layer is calculated using operator Performance Evaluation mechanismi, each heuristic calculation of low layer
Sub- evaluation mechanism includes the following steps:
1) performance data obtained to the heuristic operator of each low layer is normalized, so that each Performance Evaluation index
It is mapped within the same value range [0,1], the heuristic operator h of low layeriAfter the normalizing on four performance evaluation indexes
Value is denoted as AE respectivelyi_nor、RNIi_nor、SSCi_norAnd UDi_nor;
2) according to the Performance Evaluation index value obtained after normalization, the heuristic operator of each low layer is 1. calculated using formula
Return value ri:
ri=(1-AEi_nor)+2RNIi_nor+SSCi_nor+UDi_nor ①
Formula 1. in: because of AEi_norValue it is the smaller the better, and other three indexs are the bigger the better, therefore in return value
riCalculating in used the heuristic operator h of low layeriAEi_norReverse value, i.e. 1-AEi_norValue;SSCi_norIt can reflect out
The distributivity and convergence for the population that algorithm obtains after executing, UDi_norIt can reflect out the distribution that population is obtained after algorithm executes
Property;Under Population Size unanimous circumstances, SSCi_norAnd UDi_norValue be the bigger the better;In order to avoid the variation of Population Size
To SSCi_norAnd UDi_norThe influence of value, therefore in riCalculating in give SSCi_norAnd UDi_norArranged in pairs or groups respectively one it is non-dominant
The ratio RNI of solutioni_nor;
(5) according to return value ri, the MAB value of the heuristic operator of each low layer is calculated, steps are as follows:
1) for there is the set H={ h of the heuristic operator of num low layer1, h2...hnum, it is each hi (i∈
{ 1...num }) one performance tuple being made of three variables of settingWherein ri be return value,It is average
Return value, kiFor hiThe number selected, in the algorithm initial stage, all values are initialized as 0;
2) h is 2. updated using formulaiPerformance tuple
3) according to the heuristic operator h of low layeriPerformance tuple ViCalculate the MAB of the heuristic operator of current generation low layeriValue,
Following formula is 3.:
Wherein C is control parameter, controls the heuristic operator h of each low layeriAverage behaviorIt is held with operator in frame operation
A kind of balance between capable number;
(6) the heuristic operator h of the maximum low layer of MAB value is selectedtoP, i.e. top=argmax { MABi, i=1,2 ...,
num;If the algorithm for possessing maximum MAB value has multiple, therefrom random selection one;
(7) h is usedtopAdvanced group species Pt, execute htopIn total M generation, new population P ' is obtained, and calculated heuristic using low layer
Operator htopPerformance number of the obtained population P ' on four performance evaluation indexes;
(8) using improvement reception strategy Population Regeneration Pt+1, judge to execute h using SSCtopWhether population has improvement afterwards, if
Without improvement, then the population P of next iteration (t+1 generation)t+1It is replaced by the population in t generation, i.e. Pt+1=Pt;If there is improvement, the
The population P in t+1 generationt+1Population P ' the replacement just obtained, i.e. Pt+1=P ';
(9) judge the number of iterations whether be more than setting maximum value W, if it does, HH-MAB algorithm terminates;If not yet
It has more than, into next round iteration, returns (4);W is the maximum number of iterations of algorithm, takes the integer in (0, ∞) range.
What technical solution of the present invention generated has the beneficial effect that:
1, a kind of novel meta-heuristic algorithms HH-MAB solution multi-objective optimization question based on MAB is proposed, at present
There has been no meta-heuristic algorithms using MAB strategy in calculation amount, non-domination solution accounting, disaggregation distributivity and disaggregation convergence side
Face learns low layer heuritic approach.
2, new return value calculating method is devised, is normalized using four performance evaluation indexes (AE, RNI, SSC and UD)
The method of summation, comprehensively consider from many aspects the heuristic operator of low layer calculation amount, non-domination solution accounting, disaggregation distributivity and
Performance in disaggregation convergence.
3, each evaluation index value is normalized so that the Performance Evaluation index of different value intervals is normalized to together there are four gathering around
In one section so that the calculating of return value to four performance evaluation indexes without preference, thus preferably each decision point select close
The heuristic operator of suitable bottom, with the performance of boosting algorithm.
4, relative to the existing meta-heuristic method (HH-RAND and HH- CF) for solving multi-objective optimization question, HH-MAB
The heuristic operator of low layer is selected according to return value using MAB strategy, by selecting suitable low layer to inspire in the different solution stages
Formula operator can preferably combine the advantages of operator that each low layer is heuristic, improve algorithm performance, obtain preferably close
Like disaggregation.
5, HH-MAB not only increases the performance of algorithm, is also promoted using reception strategy Population Regeneration, this method is improved
The stability of algorithm.
Detailed description of the invention
Fig. 1 is HH-MAB algorithm frame flow chart of the present invention.Fig. 2 is that HH-RAND, HH-CF and HH-MAB exist in embodiment
Independent operating 30 times is to the flat of three heuristic operators of low layer (NSGA-II, SPEA2, MOGA) on WFG1-WFG9 test problem
The schematic diagram of equal utilization rate.
Fig. 3 is that HH-MAB is being assessed for independent operating 30 times on WFG1-WFG9 test problem with comparison algorithm in embodiment
The schematic diagram of box figure on index RNI.
Fig. 4 is that HH-MAB is being assessed for independent operating 30 times on WFG1-WFG9 test problem with comparison algorithm in embodiment
The schematic diagram of box figure on index S SC.
Fig. 5 is that HH-MAB is being assessed for independent operating 30 times on WFG1-WFG9 test problem with comparison algorithm in embodiment
The schematic diagram of box figure on index UD.
Fig. 6 is HH-RAND, HH-CF and HH-MAB independent operating 30 times, 50% on WFG1 test problem in embodiment
Approximate Pareto leading surface schematic diagram.
Fig. 7 is HH-RAND, HH-CF and HH-MAB independent operating 30 times, 50% on WFG2 test problem in embodiment
Approximate Pareto leading surface schematic diagram.
Fig. 8 is HH-RAND, HH-CF and HH-MAB independent operating 30 times, 50% on WFG3 test problem in embodiment
Approximate Pareto leading surface schematic diagram.
Fig. 9 is HH-RAND, HH-CF and HH-MAB independent operating 30 times, 50% on WFG4 test problem in embodiment
Approximate Pareto leading surface schematic diagram.
Figure 10 is HH-RAND, HH-CF and HH-MAB independent operating 30 times, 50% on WFG5 test problem in embodiment
Approximate Pareto leading surface schematic diagram.
Figure 11 is HH-RAND, HH-CF and HH-MAB independent operating 30 times, 50% on WFG6 test problem in embodiment
Approximate Pareto leading surface schematic diagram.
Figure 12 is HH-RAND, HH-CF and HH-MAB independent operating 30 times, 50% on WFG7 test problem in embodiment
Approximate Pareto leading surface schematic diagram.
Figure 13 is HH-RAND, HH-CF and HH-MAB independent operating 30 times, 50% on WFG8 test problem in embodiment
Approximate Pareto leading surface schematic diagram.
Figure 14 is HH-RAND, HH-CF and HH-MAB independent operating 30 times, 50% on WFG9 test problem in embodiment
Approximate Pareto leading surface schematic diagram.
Figure 15 be step 4 1) in the heuristic operator of each low layer obtain performance data original value and normalization after numerical value
Schematic diagram.
Specific embodiment
With reference to the accompanying drawing and specific embodiment the invention will be further elaborated.
A kind of meta-heuristic algorithms based on MAB solve the method HH-MAB of multi-objective optimization question, for be solved
Multi-objective optimization question provides the set H={ h of the existing heuristic operator of low layer for problem to be solved first1,
h2...hnum}.The heuristic operator of these low layers has stronger problem correlation.HH-MAB is as high-rise heuristic strategies, no
With to problem direct solution, and the space of the heuristic operator of low layer is scanned for.HH-MAB is designed and operator is used to assess machine
It makes with MAB selection mechanism and learns the performance of the heuristic operator of each low layer, and decide whether to receive using acceptance mechanism is improved
The new population of generation.
Without loss of generality, it is assumed that multi-objective optimization question has a n decision variable, m objective function, then more mesh
Mark optimization problem is defined as min F (x)=(f1(x), f2(x) ..., fm(x))T, wherein x=(x1.., xn) ∈ X be n dimension
Decision vector (referred to as solves), and X is the decision space of n dimension, xiIndicate i-th of decision component of x, i ∈ { 1,2 ..., n };F(x)
=(f1(x), f2(x) ..., fm(x))T∈ Y is the object vector of m dimension, and Y is the object space of m dimension.f1(x) ..., fm(x)
For m conflicting objective functions;For multi-objective optimization question, if there are two solve xAAnd xB, when to all i=1,
2 ..., m, meets fi(xA)≤fi(xB), and there are f at least one targetj(xA) < fj(xB), then claim xADominate xB;
Other solutions dominate solution x if it does not exist, then x is referred to as Pareto optimal solution;The optimal collection being deconstructed into of all Pareto is collectively referred to as
Pareto optimal solution set is known as Pareto leading surface in the mapping of object space;For multi-objective optimization question, algorithm is set
The target of meter is to obtain being uniformly distributed on object space and to the convergent approximate disaggregation of Pareto leading surface.
As represented in figures 1 through 14, the method for the present embodiment includes the following steps:
Step 1: providing the set H={ h of the heuristic operator of low layer1, h2...hnum, wherein num is the heuristic calculation of low layer
The number of son selects 3 heuristic operators of low layer, i.e. num=3 in the present invention;h1, h2, h3For the already present more mesh of classics
Mark optimization algorithm;
h1: NSGAII (Deb, K.;Pratap, A.;Agarwal, S.;And Meyarivan, T.A fast and
Elitist multi-objective genetic algorithm:NSGA-II.IEEE Trans.on Evolutionary
Computation, 2002,6 (2): 182-197)
h2: SPEA2 (Zitzler, E.;Laumanns, M.;And Thiele, L.SPEA2:Improving the
Strength Pareto Evolutionary Algorithm.In:Proceedings of Evolutionary Methods
For Design, Optimization and Control with Applications to Industrial Problems,
Berlin, Germany:Springer-Verlag, 2001.95-100)
h3: MOGA (Fonseca CM, Fleming PJ.Genetic algorithm for multiobjective
Optimization:Formulation.discussion and generation.In:Proceedings of the 5th
Int ' l Conf.on Genetic Algorithms, San Mateo:Morgan Kauffman Publishers,
1993.416- 423)。
These three algorithms respectively have advantage and disadvantage, more suitable for verifying HH-MAB on combining the calculation heuristic operator advantage of low layer
Ability, intersection and mutation probability and classic algorithm NSGA-II and most of multiple targets in each heuristic operator of low layer into
It is identical to change algorithm, is respectively set as 0.9 and 1/24, intersects and mutation profile exponent is respectively 10 and 20.
Step 2: construction contains the random initial population P of N number of solution0(due to being initial population, i.e. the 0th generation population, therefore
Initial population is denoted as P0;In this experiment, N=100);In decision space X, for problem to be solved, if each decision component xi
The value range set is [xil, xiu], i ∈ { 1,2 ..., n }, xilAnd xiuIt is illustrated respectively in i-th that x is solved in problem definition
Decision component xiValue lower bound and the upper bound, then generated at random in decision space X and possess the random initial population P of N number of solution0=
{x1, x2..., xNMethod are as follows: for each RANDOM SOLUTION xj(j ∈ { 1,2 ..., N }), i-th of decision component xj i's
Value is its value range [xil, xiu] random value in (i ∈ { 1,2 ..., n }).
Step 3: calculate initial performance values of the heuristic operator of each low layer on four performance evaluation indexes, with from multi-party
Planar survey algorithm obtains the performance of disaggregation;Each heuristic operator h of low layer1,h2,h3Respectively to same initial population P0Independent fortune
Row is primary, and each run executes hiIn total M generation, (M is user's custom parameter to i ∈ { 1,2,3 }, to execute low layer in an iteration
The number of heuristic operator, M take the integer in (0, ∞) range, M=250 are arranged in the present embodiment), it then calculates separately each
Heuristic operator hiFour performance evaluation indexes (AE, RNI, SSC and UD), i ∈ { 1,2,3 };
h1Four performance evaluation indexes be denoted as AE respectively1、RNI1、SSC1And UD1;
h2Four performance evaluation indexes be denoted as AE respectively2、RNI2、SSC2And UD2;
h3Four performance evaluation indexes be denoted as AE respectively3、RNI3、SSC3And UD3;
1) Algorithm effort (AE): assessment algorithm obtains the calculation amount of approximate disaggregation, and value is set by user
For set time step-length divided by the function evaluation quantity executed in the step-length, value range is (0, ∞), and lesser AE value indicates more
Good performance (Tan, K.C., Lee, T.H., &Khor, E.F. (2002) Evolutionary algorithms for
multi-objective optimization:Performance assessments and
comparisons.Artificial Intelligence Review,17,253–290)
2) Ratio of non-dominated individuals (RNI): the approximate solution that assessment algorithm obtains is concentrated non-
Dominate the ratio of solution, value is the number of non-domination solution in current population divided by initial population size N, value range be (0,
1], RNI value is bigger indicates that approximate solution concentrates the ratio of non-domination solution higher, performance better (Tan, K.C., Lee, T.H., &
Khor,E.F.(2002).Evolutionary algorithms for multi- objective optimization:
Performance assessments and comparisons.Artificial Intelligence Review,17,
253–290.)
3) Size of space covered or so-called S_metric Hypervolume (SSC): assessment is calculated
The approximate disaggregation and reference point ref=(ref that method obtains1,ref2,...,refm) between covered object space it is big
It is small, value range be [0, ∞);Reference point ref=(ref1,ref2,...,refm) be set as in object space by approximate disaggregation
In the solution that all dominates of all solutions, for minimizing multi-objective optimization question, usual refiIt indicates on the i-th objective function most
Big value;For two target WFG problems, common reference point is set as ref=(4,4);SSC value is higher to indicate covering like nearly disaggregation
Lid space is bigger, also indicates that performance better (Zitzler, E. , &Thiele, L. (1999) on convergence and distributivity
.Multiobjective evolutionary algorithms:A comparative case study and the
strength)
4) Uniform distribution of a non-dominated population (UD): assessment is non-dominant
Disaggregation object space distribution situation, value range be (0,1];In the identical situation of non-domination solution number, UD value is got over
Height indicate distributivity of the approximate disaggregation on object space it is better (Srinivas, N., K. (1994)
.Multiobjective optimization using nondominated sorting in genetic
algorithms.Evolutionary Computation,2,221–248.)
Step 4: calculating the return value r of the heuristic operator of each low layer using operator Performance Evaluation mechanismi;Each low layer inspires
Formula operator evaluation mechanism includes the following steps:
1) performance data obtained to the heuristic operator of each low layer is normalized;By four performance evaluation indexes
AE, RNI, SSC and UD are normalized within the same value range [0,1] from different value intervals;Four performances are obtained respectively
Minimum value and maximum value of the evaluation index in three heuristic operators of low layer of current generation, are denoted as AE respectivelymin、AEmax、
RNImin、RNImax、SSCmin、SSCmax、UDminAnd UDmax, wherein AEmin=argmin { AE1,AE2,AE3, AEmax=argmax
{AE1,AE2,AE3, RNImin=argmin { RNI1,RNI2,RNI3, RNImax=argmax { RNI1,RNI2,RNI3, SSCmin
=argmin { SSC1,SSC2,SSC3, SSCmax=argmax { SSC1,SSC2,SSC3, UDmin=argmin { UD1,UD2,
UD3, UDmax=argmax { UD1,UD2,UD3};If hiThe normalization of (i ∈ { 1,2,3 }) on four performance evaluation indexes it
Preceding original value is denoted as AE respectivelyi、RNIi、SSCiAnd UDi, then hiNormalized value AE on four performance evaluation indexesi_nor,
RNIi_nor, SSCi_nor,UD i_norIt calculates as follows:
By taking Figure 15 as an example, (a) is original of the heuristic operator of first three low layer of normalization on four performance evaluation indexes
Initial value (b) is normalized value of three heuristic operators of low layer on four performance evaluation indexes;It, can from (a) by taking AE as an example
Know AEmin=0.000108, AEmax=0.000430;Then h1The normalized value AE on AE1_nor=(0.000259-
0.000108)/(0.000430-0.000108)=0.468944.
2) according to the Performance Evaluation index value after the normalization 1) obtained, the heuristic calculation of each low layer is 1. calculated using formula
Sub- hiReturn value ri, i ∈ { 1,2,3 }:
ri=(1-AEi_nor)+2RNIi_nor+SSCi_nor+UDi_nor ①
Wherein AEi_norIt represents and executes the heuristic operator h of low layeriCalculation amount, SSCi_norIt is heuristic to represent execution low layer
Operator hiThe distributivity and convergence for the population obtained afterwards, UDi_norIt represents and executes the heuristic operator h of low layeriPoint of population is obtained afterwards
Cloth, RNIi_norThe heuristic operator h of low layer is held in expressioniAfterwards in population non-domination solution ratio;Due to RNIi_nor、SSCi_norWith
UDi_norIt is that the bigger performance of value is better, and AEi_norIt is that the smaller performance of value is better.Herein by AEi_norTurn from minimization problem
Turn to maximization problems, i.e., formula 1. in use 1-AEi_norValue;But formula 1. in, UDi_norValue and SSCi_norWhat is be worth is big
The small influence that will receive non-domination solution number in population;For example, when population at individual is less, even if the distributivity of current population
It is poor, UDi_norThe more population of non-domination solution number in still possible larger or even preferable than the distributivity but population of value
UDi_norValue wants high.Algorithm in the process of implementation, it is contemplated that Population Size may change, return value riCalculating process
In we be SSCi_norAnd UDi_norThe RNI for an equal proportion of having arranged in pairs or groups respectivelyi_nor。
1. formula mixes four performance evaluation indexes, after normalization, due to the value of each Performance Evaluation index
Range is identical, so the heuristic operator h of each low layeriReturn value riIt is four performances to four performance evaluation indexes without preference
The comprehensive of evaluation index embodies;1. according to formula, each heuristic operator h of low layer after normalizationiReturn value riAs shown in table 2;
With h1For, r1=(1-0.468944)+2 × 1.000000+ 1.000000+1.000000=4.531056.
Table 2- return value
h1 | h2 | h3 | |
r | 4.531056 | 4.406907 | 0.000000 |
Step 5: according to return value ri, the MAB value of the heuristic operator of each low layer is calculated, steps are as follows:
1) the set H={ h of operator heuristic for low layer1, h2, h3, it is the heuristic operator h of each low layeriSetting one
The performance tuple being made of three variablesWherein riIt is hiStep 4 obtain return value,It is from the beginning of
Up to the present hiAverage value, the k of obtained return valueiFor hiUp to the present the number selected, i ∈ { 1,2,3 };In algorithm
Initial stage, three variables were initialized to ri=0,ki=0;
2) the return value r obtained according to step 4i, the heuristic operator h of low layer is 2. updated using formulaiCorresponding performance tupleIn latter two variableAnd ki,
3) according to the heuristic operator h of low layeriPerformance tuple Vi, calculate the heuristic operator h of current generation low layeriMABi
It is worth (MABi> 0), as formula 3. shown in:
Wherein C is control parameter (2 are set as in this experiment), for controlling the heuristic operator h of each low layeriAverage return
ValueWith the heuristic operator h of low layer in algorithm operationiExecution number kiBetween a kind of balance;
Step 6: the selection heuristic operator h of the maximum low layer of MAB valuetop, i.e. top=argmax { MAB1, MAB2,
MAB3};If have the heuristic operator of multiple low layers while possessing maximum MAB value, i.e. MABi=MABj> MABk, wherein i, j, k
∈ { 1,2,3 }, then from hiAnd hjMiddle random selection one is used as htop;
Step 7: using the heuristic operator h of low layertopEvolve current population Pt, execute htopIn total M=250 generation, obtains new
Population P ';It is calculated according to Performance Evaluation index AE, RNI, SSC and UD of step 3 introduction using the heuristic operator h of low layertop
Performance index value AE of the obtained population P ' on four performance evaluation indexestop、RNItop、SSCtopAnd UDtop。
Step 8: using reception strategy Population Regeneration P is improvedt+1;Due to the four performance evaluation indexes used at us
In, SSC is that only one can assess convergence in population but also assess the index of Species structure, therefore be held using SSC judgement
Row htopWhether population has improvement afterwards (i.e. whether SSC value becomes larger);If the more last round of iterative value of SSC evaluation index value does not become
Greatly, then the population P of next iteration (t+1 generation)t+1By the population P in t generationtReplacement, i.e. Pt+1=Pt;If value becomes larger, t+1
The population P in generationt+1Population P ' the replacement just obtained, i.e. Pt+1=P '.
Step 9: judging whether the number of iterations is more than the iterations max W that sets (W as user's custom parameter, to calculate
The maximum number of iterations of method takes the integer in (0, ∞) range, W=25 is arranged in the present embodiment);If it does, HH-MAB is calculated
Method terminates;If be not above, into next round iteration, return step four.
Fig. 6-14 is HH-RAND, HH-CF and HH-MAB independent operating 30 times, 50% on WFG1-WFG9 test problem
Approximate Pareto leading surface schematic diagram.Approximate Pareto leading surface is before true Pareto acquired in three kinds of algorithms
Along face, (PF) is closer, illustrates that convergence is better.From Fig. 6-14 as can be seen that the constringency performance of HH-MAB is compared with HH-RAND
It is more preferable with HH-CF effect.
The above is only embodiments of the present invention.It should be pointed out that the present embodiment is enterprising in two target WFG test sets
Row experiment, which shares 9 test cases, cover Pareto leading surface it is continuous discontinuous, Pareto leading surface convex surface
Concave surface it is linear, multi-modal many aspects such as single mode, fraud problems, be to be widely used for verifying multi-objective optimization algorithm
The test set of energy.The characteristics of according to meta-heuristic algorithms, only needs change or replacement low layer to inspire if the problem that replacement is to be solved
The number and type of formula operator, without changing high-rise heuristic strategies.Therefore, the heuristic operator number of change low layer and
Type is still the protection scope of this patent.
Claims (1)
1. a kind of method that the meta-heuristic algorithms based on MAB solve multi-objective optimization question, it is characterised in that: this method packet
It includes:
(1) the set H={ h of the heuristic operator of low layer is provided1,h2…hnum, wherein num is the number of the heuristic operator of low layer,
h1,h2…hnumFor multi-objective optimization algorithm or operator;
(2) the random initial population P that construction size is N0, in decision space X, N number of solution is generated at random;
(3) performance number of each heuristic operator of low layer in four existing Performance Evaluation indexs is calculated.Each low layer is heuristic
Operator h1,h2…hnum(i=1,2 ... num) is to same initial population P0Independently operation is primary, and each run executes hiTotal M
Generation, and calculate separately each heuristic operator hiFour performance evaluation indexes AE, RNI, SSC and UD;M is to execute in an iteration
The number of the heuristic operator of low layer, M take the integer in (0, ∞) range;
1) AE: assessment algorithm obtains the calculation amount of approximate disaggregation, and value range is (0, ∞), and the smaller expression performance of AE value is got over
It is good;
2) RNI: assessment algorithm obtain approximate solution concentrate non-domination solution ratio, value range be (0,1], RNI value is bigger
Indicate that approximate solution concentrates the ratio of non-domination solution higher;If RNI=1, all individuals for showing that approximate solution is concentrated all are non-branch
Match;
3) SSC: assessment algorithm obtain approximate disaggregation institute covered object space size, value range for [0, ∞);
SSC value is higher, and the covering space for indicating seemingly nearly disaggregation is bigger, also indicates that performance is better on convergence and distributivity;
4) UD: assessing approximate disaggregation in the distribution situation of object space, value range be (0,1];In non-domination solution quantity phase
The higher distributivity for indicating approximate disaggregation on object space of UD value is better in the case where;
(4) the return value r of the heuristic operator of each low layer is calculated using operator Performance Evaluation mechanismi, each heuristic operator assessment of low layer
Mechanism includes the following steps:
1) performance data obtained to the heuristic operator of each low layer is normalized, so that each Performance Evaluation index is mapped to
Within the same value range [0,1], the heuristic operator h of low layeriValue difference after the normalizing on four performance evaluation indexes
It is denoted as AEi_nor、RNIi_nor、SSCi_norAnd UDi_nor;
2) according to the Performance Evaluation index value obtained after normalization, the return value of the heuristic operator of each low layer is 1. calculated using formula
ri:
ri=(1-AEi_nor)+2RNIi_nor+SSCi_nor+UDi_nor ①
Formula 1. in: because of AEi_norValue it is the smaller the better, and other three indexs are the bigger the better, therefore in return value ri's
The heuristic operator h of low layer has been used in calculatingiAEi_norReverse value, i.e. 1-AEi_norValue;SSCi_norIt can reflect out algorithm
The distributivity and convergence of the population obtained after execution, UDi_norIt can reflect out the distributivity that population is obtained after algorithm executes;?
Under Population Size unanimous circumstances, SSCi_norAnd UDi_norValue be the bigger the better;In order to avoid the variation pair of Population Size
SSCi_norAnd UDi_norThe influence of value, therefore in riCalculating in give SSCi_norAnd UDi_norIt has arranged in pairs or groups respectively a non-domination solution
Ratio RNIi_nor;
(5) according to return value ri, the MAB value of the heuristic operator of each low layer is calculated, steps are as follows:
1) for there is the set H={ h of the heuristic operator of num low layer1,h2…hnum, it is each hi(i ∈ { 1 ... num }) setting
One performance tuple being made of three variablesWherein riBe return value,It is average return value, kiFor hiIt is selected
The number selected, in the algorithm initial stage, all values are initialized as 0;
2) h is 2. updated using formulaiPerformance tuple
3) according to the heuristic operator h of low layeriPerformance tuple ViCalculate the MAB of the heuristic operator of current generation low layeriValue is as follows
Formula is 3.:
Wherein C is control parameter, controls the heuristic operator h of each low layeriAverage behaviorWith time that operator executes in frame operation
A kind of balance between number;
(6) the heuristic operator h of the maximum low layer of MAB value is selectedtop, i.e. top=argmax { MABi, i=1,2 ..., num;If
The algorithm for possessing maximum MAB value has multiple, then therefrom random selection one;
(7) h is usedtopAdvanced group species Pt, execute htopIn total M generation, new population P ' is obtained, and calculated using the heuristic operator h of low layertop
Performance number of the obtained population P ' on four performance evaluation indexes;
(8) using improvement reception strategy Population Regeneration Pt+1, judge to execute h using SSCtopWhether population has improvement afterwards, if without changing
Into the then population P of next iteration (t+1 generation)t+1It is replaced by the population in t generation, i.e. Pt+1=Pt;If there is improvement, t+1 generation
Population Pt+1Population P ' the replacement just obtained, i.e. Pt+1=P ';
(9) judge the number of iterations whether be more than setting maximum value W, if it does, HH-MAB algorithm terminates;If do not surpassed
It crosses, into next round iteration, returns (4);W is the maximum number of iterations of algorithm, takes the integer in (0, ∞) range.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811230929.8A CN109460862B (en) | 2018-10-22 | 2018-10-22 | Method for solving multi-objective optimization problem based on MAB (multi-object-based) hyperheuristic algorithm |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811230929.8A CN109460862B (en) | 2018-10-22 | 2018-10-22 | Method for solving multi-objective optimization problem based on MAB (multi-object-based) hyperheuristic algorithm |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109460862A true CN109460862A (en) | 2019-03-12 |
CN109460862B CN109460862B (en) | 2021-04-27 |
Family
ID=65608137
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811230929.8A Active CN109460862B (en) | 2018-10-22 | 2018-10-22 | Method for solving multi-objective optimization problem based on MAB (multi-object-based) hyperheuristic algorithm |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109460862B (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110826145A (en) * | 2019-09-09 | 2020-02-21 | 西安工业大学 | Automobile multi-parameter operation condition design method based on heuristic Markov chain evolution |
CN111917529A (en) * | 2020-07-15 | 2020-11-10 | 燕山大学 | Underwater sound OFDM resource allocation method based on improved EXP3 algorithm |
CN112631907A (en) * | 2020-12-19 | 2021-04-09 | 北京化工大学 | Super-heuristic framework upper-layer scheduling method based on utilization and exploration |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100241654A1 (en) * | 2009-03-23 | 2010-09-23 | Riverbed Technology, Inc. | Virtualized data storage system optimizations |
US20160021671A1 (en) * | 2011-09-08 | 2016-01-21 | Drexel University | Method For Selecting State Of A Reconfigurable Antenna In A Communication System Via Machine Learning |
CN106156891A (en) * | 2016-07-08 | 2016-11-23 | 华南师范大学 | A kind of Scheduling method based on Pareto multiple target ant colony optimization algorithm |
CN106384169A (en) * | 2016-09-22 | 2017-02-08 | 合肥工业大学 | Hyper-heuristic algorithm-based satellite task planning method |
CN107705157A (en) * | 2017-10-19 | 2018-02-16 | 大连理工大学 | Automotive supplies Method for Sales Forecast method and system based on unified dynamic integrity model and meta-heuristic algorithm |
-
2018
- 2018-10-22 CN CN201811230929.8A patent/CN109460862B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100241654A1 (en) * | 2009-03-23 | 2010-09-23 | Riverbed Technology, Inc. | Virtualized data storage system optimizations |
US20160021671A1 (en) * | 2011-09-08 | 2016-01-21 | Drexel University | Method For Selecting State Of A Reconfigurable Antenna In A Communication System Via Machine Learning |
CN106156891A (en) * | 2016-07-08 | 2016-11-23 | 华南师范大学 | A kind of Scheduling method based on Pareto multiple target ant colony optimization algorithm |
CN106384169A (en) * | 2016-09-22 | 2017-02-08 | 合肥工业大学 | Hyper-heuristic algorithm-based satellite task planning method |
CN107705157A (en) * | 2017-10-19 | 2018-02-16 | 大连理工大学 | Automotive supplies Method for Sales Forecast method and system based on unified dynamic integrity model and meta-heuristic algorithm |
Non-Patent Citations (4)
Title |
---|
ALEXANDRE SILVESTRE FERREIRA;RICHARD ADERBAL GONÇALVES;AURORA PO: "A Multi-Armed Bandit selection strategy for Hyper-heuristics", 《2017 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC)》 * |
GIOVANI GUIZZO;SILVIA R. VERGILIO;AURORA T.R. POZO: "Evaluating a Multi-objective Hyper-Heuristic for the Integration and Test Order Problem", 《2015 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS)》 * |
RICHARD ADERBAL GONÇALVES ETAL.: "A New Hyper-Heuristic Based on a Restless Multi-armed Bandit for Multi-objective Optimization", 《2017 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS)》 * |
张艳梅等: "基于粒子群优化算法的类集成测试序列确定方法", 《计算机学报》 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110826145A (en) * | 2019-09-09 | 2020-02-21 | 西安工业大学 | Automobile multi-parameter operation condition design method based on heuristic Markov chain evolution |
CN111917529A (en) * | 2020-07-15 | 2020-11-10 | 燕山大学 | Underwater sound OFDM resource allocation method based on improved EXP3 algorithm |
CN111917529B (en) * | 2020-07-15 | 2021-06-15 | 燕山大学 | Underwater sound OFDM resource allocation method based on improved EXP3 algorithm |
CN112631907A (en) * | 2020-12-19 | 2021-04-09 | 北京化工大学 | Super-heuristic framework upper-layer scheduling method based on utilization and exploration |
CN112631907B (en) * | 2020-12-19 | 2024-04-02 | 北京化工大学 | Super heuristic frame upper layer scheduling method based on utilization and exploration |
Also Published As
Publication number | Publication date |
---|---|
CN109460862B (en) | 2021-04-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Wang et al. | Multi-objective self-adaptive differential evolution with elitist archive and crowding entropy-based diversity measure | |
Paquete et al. | Stochastic local search algorithms for multiobjective combinatorial optimization: A review | |
CN109460862A (en) | The method that meta-heuristic algorithms based on MAB solve multi-objective optimization question | |
Antonio et al. | Use of cooperative coevolution for solving large scale multiobjective optimization problems | |
Ferreira et al. | Methodology to select solutions from the pareto-optimal set: a comparative study | |
Shang et al. | A multi-population cooperative coevolutionary algorithm for multi-objective capacitated arc routing problem | |
Maciel et al. | Multi-objective evolutionary particle swarm optimization in the assessment of the impact of distributed generation | |
CN106845642B (en) | A kind of adaptive multi-target evolution method of belt restraining cloud workflow schedule | |
Sahoo et al. | Simple heuristics-based selection of guides for multi-objective PSO with an application to electrical distribution system planning | |
CN109066710A (en) | A kind of multi-objective reactive optimization method, apparatus, computer equipment and storage medium | |
US20200292340A1 (en) | Robot running path, generation method, computing device and storage medium | |
Colby et al. | Shaping fitness functions for coevolving cooperative multiagent systems. | |
Shi et al. | Multimodal multi-objective optimization using a density-based one-by-one update strategy | |
Jaszkiewicz et al. | Multiobjective memetic algorithms | |
CN116914751B (en) | Intelligent power distribution control system | |
Wang et al. | A memetic genetic programming with decision tree-based local search for classification problems | |
Zhao et al. | A decomposition-based many-objective ant colony optimization algorithm with adaptive solution construction and selection approaches | |
CN111191343A (en) | Multi-mode multi-target differential evolution algorithm based on random sequencing learning | |
Liao et al. | Multi-objective optimization by learning automata | |
CN111008790A (en) | Hydropower station group power generation electric scheduling rule extraction method | |
CN115860170A (en) | Power quality optimization method of power distribution system considering power electronic load | |
CN108960486A (en) | Interactive set evolvement method based on grey support vector regression prediction adaptive value | |
CN104778368A (en) | Pareto set individual ranking method aiming at high-dimensional multi-objective optimization problem | |
CN110321995A (en) | A kind of difference Particle Swarm Mixed Algorithm based on Stochastic inertia weight | |
Zhou et al. | Multi-objective parameter calibration and multi-attribute decision-making: an application to conceptual hydrological model calibration |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |